[[FrontPage]]

Download
------------

1. Download TreeTagger from [[here>ftp://ftp.ims.uni-stuttgart.de/pub/corpora/tree-tagger-windows-3.2.zip]]

2. Download the English parameter file from [[here>ftp://ftp.ims.uni-stuttgart.de/pub/corpora/english-par-linux-3.2.bin.gz]]

3. Uncompress 1. and 2. and read INSTALL.txt (below)

Installation
------------

1. Install a Perl interpreter (if you have not already installed one).
   You can download a Perl interpreter for Windows for free at
   http://www.activestate.com/activeperl/

2. Move the TreeTagger directory to the root directory of drive C:.

3. Download the PC parameter files for the languages you need, decompress
   them (e.g. using Winzip or 7zip) and move them to the subdirectory lib.
   Rename the parameter files to <language>.par
   Example: Rename french-par-linux-3.1.bin to french.par

4. Add the path C:\TreeTagger\bin to the PATH environment variable.

5. Open a shell and type the command
   set PATH=C:\TreeTagger\bin;%PATH%

6. Change to the directory C:\TreeTagger

7. Now you can test the tagger, e.g. by analyzing this file with the command
   tag-english INSTALL.txt


Examples

------------
 c:\TreeTagger>set PATH=C:\TreeTagger\bin;%PATH%
 
 c:\TreeTagger>tag-english INSTALL.txt > INSTALL.tag.txt
        reading parameters ...
        tagging ...
        finished.

Notes
------------
For Linux and Mac, Penn TreeBank tagset is default. If you want VV for lexical verbs, VB for be-verbs, and VH for have, you should edit the file 'tree-tagger-english' as follows:

 #last line
 perl -pe 's/\tV[BDHV]/\VB/;s/IN\/that/\tIN/;'
 
 --> perl -pe 's/IN\/that/\tIN/;'


トップ   新規 一覧 単語検索 最終更新   ヘルプ   最終更新のRSS