ºÇ¶á¤ÎR¤Î¥á¥â
Shell script †
- ¡Ø±Ñ¸ì³Ø½¬¼Ô¥³¡¼¥Ñ¥¹³èÍѥϥó¥É¥Ö¥Ã¥¯¡Ù ¼ø¶ÈÍѤÎÉÕ°¥á¥â¡Ê2020/11¡Ë
- Linux, Mac ¤Î terminal ɸ½à
- Windows 10 ¤Çư¤«¤¹¤Ë¤Ï Windows Subsystem for Linux¡ÊWSL¡Ë¤ò»È¤ª¤¦¡§
NICE¤Î¥Ç¡¼¥¿½èÍý¡ÊÂ裵¾Ï¡Ë †
³Ø½¬¼Ô¥Ç¡¼¥¿¤Î¤ß¤Î¥Õ¥¡¥¤¥ë¤ò¼«Æ°ºîÀ® †
#!/bin/sh # ¤³¤ì¤Ï bash ¤Î¤ß¡£zsh ¤Ê¤É¤Ç¤ÏÉÔÍ×
cd `dirname $0`¡¡¡¡¡¡# ¤É¤³¤Ë°Ü¤·¤Æ¤â¼Â¹Ô²Äǽ
for file_name in `ls *.txt` #¥Ç¥£¥ì¥¯¥È¥êÆâ¤Î¤¹¤Ù¤Æ¤Î text ¥Õ¥¡¥¤¥ë¤ò file_name¤Ë³ÊǼ
do
# *JPN¡ÊÆüËܿͳؽ¬¼Ô¡Ë¤Î¹Ô¤ò¼è¤ê½Ð¤·¤Æ¡¢*JPN¤òºï½ü¤·¤¿¹Ô¤À¤±¤ò .out ¥Õ¥¡¥¤¥ë¤Ë½ÐÎÏ
grep \*JPN $file_name | perl -pe 's/^\*JPN[0-9]+:\t//g;' > $file_name.out
done
killall Terminal¡¡¡¡¡¡¡¡¡¡# ½ªÎ»¤·¤¿¤é¥¿¡¼¥ß¥Ê¥ë¤òÊĤ¸¤ë
³Ø½¬¼Ô¤Î³Æ¥Æ¥¥¹¥È¤ÎȯÏÃʸ¿ô¤Èñ¸ì¿ô¤ò°ì³ç½¸·× †
for file_name in `ls *.out`
do
wc -lw $file_name >> count.list.text
done
- ³Ø½¬¼Ô¥Ç¡¼¥¿¤À¤±¤ò *.out¥Õ¥¡¥¤¥ë¤ÇÈ´¤½Ð¤·¤¿¥Ç¥£¥ì¥¯¥È¥ê¤Ç¼Â¹Ô¤¹¤ë
- wc ¥³¥Þ¥ó¥É¤Ç¥Õ¥¡¥¤¥ë¤Î¹Ô¿ô¤Èñ¸ì¿ô¤ò¥«¥¦¥ó¥È¤·¤¿¤é¡¢count.list.txt ¤Ë append ¤¹¤ë
30 319 JPN501.txt.out
29 365 JPN502.txt.out
13 201 JPN503.txt.out
27 260 JPN504.txt.out
25 418 JPN505.txt.out
20 260 JPN506.txt.out
26 355 JPN507.txt.out
20 195 JPN508.txt.out
19 260 JPN509.txt.out
14 183 JPN510.txt.out
- Â裱¥³¥é¥à¤¬¹Ô¿ô¡¢Â裲¥³¥é¥à¤¬Ã±¸ì¿ô¤Ê¤Î¤Ç¡¢¤³¤Á¤é¤ò Excel ¤Ë¥¤¥ó¥Ý¡¼¥È¤·¤Æ¡¢Ê¿¶ÑʸĹ¤Ê¤É¤ò·×»»¤Ç¤¤ë¡£
Lexical diversity measure ¤ò°ì³ç¤Ç·×»»¤¹¤ë R ¥Ñ¥Ã¥±¡¼¥¸ †
- ¤¤¤í¤¤¤í¤Ê¤ä¤êÊý¤¬¤¢¤ë¤¬¡¢R ¤Î package "koRpus" (Meik Michalke »áºî¡Ë¤Î»È¤¤Êý¤ò¾Ò²ð¤·¤Æ¤ª¤¯¡£
- ¶ñÂÎŪ¤Ê»ÈÍÑÊýË¡¤Ï¤³¤Á¤é¤ò»²¾È¡§
- »ä¤¬ NICE3.3 ¤Î¥Ç¡¼¥¿¤ÇÎý½¬¤·¤¿ R markdown ¥Õ¥¡¥¤¥ë¤ÎPDF ɽ¼¨
- TreeTagger ¤Î¥¤¥ó¥¹¥È¡¼¥ë¤¬Á°Äó
- multiple files ¤Î°·¤¤¤Ï tm ¤È¤¤¤¦Ê̥⥸¥å¡¼¥ë¤ò»È¤¤¤³¤Ê¤µ¤Ê¤¤¤È¤¤¤±¤Ê¤¤¤Î¤Ç¡¢´ðËÜŪ¤Ë¤Ï£±¥Õ¥¡¥¤¥ë¤º¤ÄʬÀϤ¹¤ë¥Ä¡¼¥ë¤À¤È»×¤Ã¤¿Êý¤¬¤è¤¤¡£
½ôÃí°Õ †