[[FrontPage]] *Computational tools and methods for corpus compilation and analysis (by Paul Rayson) [#pccb11cc] **1 Introduction [#hbe0d9bc] -[[Roberto Busa>http://stephenramsay.us/2011/08/11/father-roberto-busa/]] -[[COBUILD>http://www.collins.co.uk/page/The+Collins+Corpus]] **2 Survey of tools and methods [#y28770ea] -2.1 Compilation -Spoken corpora --[[Voicewalker (Santa Barbara corpus)>http://www.linguistics.ucsb.edu/projects/transcription/tools.html]] --[[SoundScriber (MICASE)>http://www-personal.umich.edu/~ebreck/code/sscriber/]] --[[SCOTS>http://www.scottishcorpus.ac.uk/]] --[[SACODEYL>http://sacodeyl.inf.um.es/sacodeyl-search2/]] --[[EXMARaLDA>http://www.exmaralda.org/en]] --[[NITE XML Toolkit>http://groups.inf.ed.ac.uk/nxt/]] -Written corpora --[[BE06 corpus>http://www.helsinki.fi/varieng/CoRD/corpora/BE06/]] --[[Project Gutenberg>https://www.gutenberg.org/]] --[[WaC>https://www.sigwac.org.uk/wiki/WAC-X]] ---[[BootCat>http://bootcat.sslmit.unibo.it/]] --[[WebGetter (in WordSmith Tools)>http://www.lexically.net/downloads/version4/html/index.html?webgetter_proc.htm]] -2.2 Annotation -Intelligent editors --[[Dexter software>http://www.dexster.net/]] : audio editor --eMargin software --[[Xanadu editor>https://en.wikipedia.org/wiki/Project_Xanadu]] -Automatic taggers --BLARKs --CLAWS --[[CLAWS>http://ucrel.lancs.ac.uk/claws/]] -2.3 Retrieval -Corpus retrieval software --Longman Mini-Concordancer --Micro-Concord --Wordcruntcher --OCP --少し古いもの ---[[Longman Mini-Concordancer>https://www.jstor.org/stable/30204444?seq=1#page_scan_tab_contents]] 紹介論文 ---[[Micro-Concord>http://www.sciencedirect.com/science/article/pii/0346251X86900047]] 紹介論文 ---[[Wordcruncher>http://www.wordcruncher.com/]] ---[[Xaira>http://xaira.sourceforge.net/]] ---[[OCP>http://users.ox.ac.uk/~ctitext2/resguide/resources/o125.html]] --現役 --WordSmith --MonoConc --AntConc --Xaira --COCA --COHA --BNCWeb --Sketch Engine --CQPweb --Intellitext --Netspeak --ANNIS --[[Netspeak>http://www.netspeak.org/]] --[[ANNIS>http://corpus-tools.org/annis/]] --Wmatrix -3. Case study -N-grams --kfNgram --Wmatrix's "c-grams" -4. Conclusion -Computer Assisted Qualitative Data Analysis (CAQDAS) --ATLAS.ti --NVivo --QDA Minor --Wordstat -LIWC (Linguistic Inquiry and Word Count) -Geographical Information System (GIS) or Social Network Analysis (SNA) --Voyant --MONK -[[GATE>https://gate.ac.uk/]]