
*Computational tools and methods for corpus compilation and analysis (by Paul Rayson) [#pccb11cc]

**1 Introduction [#hbe0d9bc]

-[[Roberto Busa>http://stephenramsay.us/2011/08/11/father-roberto-busa/]]
-[[COBUILD>http://www.collins.co.uk/page/The+Collins+Corpus]]

**2 Survey of tools and methods [#y28770ea]

***2.1 Compilation

-Spoken corpora
--[[Voicewalker (Santa Barbara corpus)>http://www.linguistics.ucsb.edu/projects/transcription/tools.html]]
--[[SoundScriber (MICASE)>http://www-personal.umich.edu/~ebreck/code/sscriber/]]
--[[SCOTS>http://www.scottishcorpus.ac.uk/]]
--[[SACODEYL>http://sacodeyl.inf.um.es/sacodeyl-search2/]]
--[[EXMARaLDA>http://www.exmaralda.org/en]]
--[[NITE XML Toolkit>http://groups.inf.ed.ac.uk/nxt/]]

-Written corpora
--[[BE06 corpus>http://www.helsinki.fi/varieng/CoRD/corpora/BE06/]]
--[[Project Gutenberg>https://www.gutenberg.org/]]
--[[WaC>https://www.sigwac.org.uk/wiki/WAC-X]]
---[[BootCat>http://bootcat.sslmit.unibo.it/]]
--[[WebGetter (in WordSmith Tools)>http://www.lexically.net/downloads/version4/html/index.html?webgetter_proc.htm]]

***2.2 Annotation

-Intelligent editors
--[[Dexter software>http://www.dexster.net/]] : audio editor
--eMargin software
--[[Xanadu editor>https://en.wikipedia.org/wiki/Project_Xanadu]]

-Automatic taggers
--BLARKs
--[[CLAWS>http://ucrel.lancs.ac.uk/claws/]]

***2.3 Retrieval

-Corpus retrieval software
--Older tools
---[[Longman Mini-Concordancer>https://www.jstor.org/stable/30204444?seq=1#page_scan_tab_contents]] (introductory article)
---[[Micro-Concord>http://www.sciencedirect.com/science/article/pii/0346251X86900047]] (introductory article)
---[[Wordcruncher>http://www.wordcruncher.com/]]
---[[Xaira>http://xaira.sourceforge.net/]]
---[[OCP>http://users.ox.ac.uk/~ctitext2/resguide/resources/o125.html]]
--Currently in use
---WordSmith
---MonoConc
---AntConc
---Xaira
---COCA
---COHA
---BNCweb
---Sketch Engine
---CQPweb
---IntelliText
---[[Netspeak>http://www.netspeak.org/]]
---[[ANNIS>http://corpus-tools.org/annis/]]
---Wmatrix

**3 Case study

-N-grams
--kfNgram
--Wmatrix's "c-grams"

**4 Conclusion

-Computer Assisted Qualitative Data Analysis (CAQDAS)
--ATLAS.ti
--NVivo
--QDA Miner
--WordStat

-LIWC (Linguistic Inquiry and Word Count)

-Geographical Information System (GIS) or Social Network Analysis (SNA)
--Voyant
--MONK

-[[GATE>https://gate.ac.uk/]]
