CEFR-J Reference Level Descriptions (RLDs)
What is RLD?
Major RLD projects for English
British Council/EAQUALS Core Inventory for General English
English Profile:
Global Scale of English by Pearson
Before CEFR
- Threshold Level Series ("T-series")
- You can access the original T-series books from here
CEFR-J Wordlist
About:
- CEFR-J Wordlist Version 1.6
- A list of 7,801 items classified by CEFR level (A1 to B2).
- Each item has the following information:
  - headword (lemma form)
  - part of speech
  - CEFR level
  - thematic categories defined by the British Council/EAQUALS Core Inventory for General English and Threshold Levels 1990 (Council of Europe)
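The four fields listed above map naturally onto a small record type. The following is a minimal sketch, assuming the wordlist has been loaded into memory; the class name `WordlistItem`, the field names, the helper functions, and the sample rows are all hypothetical placeholders and are not taken from the actual wordlist file.

```python
from collections import Counter
from dataclasses import dataclass

# CEFR levels covered by the wordlist, in ascending order.
LEVELS = ("A1", "A2", "B1", "B2")


@dataclass(frozen=True)
class WordlistItem:
    """One entry; field names are assumptions, not the file's real headers."""
    headword: str   # lemma form
    pos: str        # part of speech
    cefr: str       # one of LEVELS
    category: str   # thematic category


# Placeholder rows for illustration only; NOT real wordlist entries.
ITEMS = [
    WordlistItem("example_a", "noun", "A1", "placeholder topic"),
    WordlistItem("example_b", "verb", "A2", "placeholder topic"),
    WordlistItem("example_c", "noun", "A1", "placeholder topic"),
    WordlistItem("example_d", "adjective", "B1", "placeholder topic"),
]


def count_by_level(items):
    """Return the number of items at each CEFR level (zero if none)."""
    counts = Counter(item.cefr for item in items)
    return {level: counts.get(level, 0) for level in LEVELS}


def up_to_level(items, max_level):
    """Return the items whose CEFR level is at or below max_level."""
    cutoff = LEVELS.index(max_level)
    return [item for item in items if LEVELS.index(item.cefr) <= cutoff]


if __name__ == "__main__":
    print(count_by_level(ITEMS))
    print([item.headword for item in up_to_level(ITEMS, "A2")])
```

A cumulative filter such as `up_to_level` is one way to build the vocabulary a learner at a given level might be expected to know, since lower-level items remain relevant at higher levels.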
Download
How to cite:
- Tono, Y. (2017). The CEFR-J and its Impact on English Language Teaching in Japan. JACET International Convention Selected Papers, Volume 4, pp. 31-52. JACET.
CEFR-J Collocation Dataset
About
- First release (September, 2022)
- Collocation list based on the CEFR-J Wordlist Ver. 1.6
- Syntactic frame-based collocation pairs extracted from the BNC (dependency-parsed with stanza)
Dataset information:
- Each collocation pair has the following information:
  - w1: collocate
  - w2: node
  - w1_CEFR: CEFR level of w1
  - w2_CEFR: CEFR level of w2
  - relation: dependency relation
  - cooccurrence: collocation frequency
  - freq_w1: independent frequency of w1 in the entire BNC
  - freq_w2: independent frequency of w2 in the entire BNC
  - w1_in_rel: frequency of w1 in the given dependency relation
  - w2_in_rel: frequency of w2 in the given dependency relation
  - DP: dispersion measure DP (Gries)
  - expected_freq: expected frequency
  - Association measures for the given collocation pair:
    - MI, MI2, MI3, t_score, z_score, logDice, log_likelihood, chi_squared
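To make the relationship between the frequency fields and the association measures concrete, here is a minimal sketch using the standard textbook formulas for several of them. It is an illustration, not the procedure used to build the dataset: the function name `association_measures` and the `corpus_size` parameter are assumptions, and the expected frequency is computed here from the whole-corpus frequencies as freq_w1 × freq_w2 / N. The dataset itself may instead rely on the relation-specific frequencies (w1_in_rel, w2_in_rel) or a different N, so the output need not match the published columns. log_likelihood and chi_squared are omitted because they require the full contingency table.

```python
import math


def association_measures(cooccurrence, freq_w1, freq_w2, corpus_size):
    """Compute common association scores from raw frequencies.

    Assumption: expected frequency E = freq_w1 * freq_w2 / corpus_size.
    The dataset's own expected_freq column may be defined differently.
    """
    if min(cooccurrence, freq_w1, freq_w2, corpus_size) <= 0:
        raise ValueError("all frequencies must be positive")

    observed = float(cooccurrence)
    expected = freq_w1 * freq_w2 / corpus_size

    return {
        "expected_freq": expected,
        # Mutual information and its frequency-weighted variants.
        "MI": math.log2(observed / expected),
        "MI2": math.log2(observed ** 2 / expected),
        "MI3": math.log2(observed ** 3 / expected),
        # t-score and z-score differ only in the denominator.
        "t_score": (observed - expected) / math.sqrt(observed),
        "z_score": (observed - expected) / math.sqrt(expected),
        # logDice is independent of corpus size.
        "logDice": 14 + math.log2(2 * observed / (freq_w1 + freq_w2)),
    }


if __name__ == "__main__":
    # Made-up frequencies for illustration; not taken from the dataset.
    scores = association_measures(
        cooccurrence=100, freq_w1=1000, freq_w2=2000, corpus_size=1_000_000
    )
    for name, value in scores.items():
        print(f"{name}: {value:.3f}")
```

MI tends to favour rare pairs, while t_score favours frequent ones; logDice does not depend on corpus size, which makes it convenient when comparing scores across differently sized corpora.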
Download
- ADJ+NOUN (amod): 135,939 pairs [ download ]
- VERB+NOUN (obj): 114,582 pairs [ download ]
- NOUN+NOUN (nounmod): 72,340 pairs [ download ]
- ADVERB+VERB (advmod verb): 43,992 pairs [ download ]
- ADVERB+ADJ (advmod adj): 16,180 pairs [ download ]
Acknowledgement
- This dataset was created by Kohei Fukuda, a postgraduate student in my lab.
How to cite:
- Fukuda, K. & Tono, Y. (2022). The CEFR-J Collocation Dataset Version 1.0. Tono Lab, TUFS. (this URL)
CEFR-J Grammar Profile
About
- An inventory of grammar items classified by CEFR levels
- Profiling was based on INPUT (ELT Course Book Corpus) as well as OUTPUT (Spoken and Written Learner Corpus)
A list of grammar items and their REGEX queries
- The following Excel file lists the 263 grammar items investigated and their REGEX queries
Grammar Profile for Teachers and Learners
- A user-friendly version of the Grammar Profile
- Visualization of how grammar points are learned across CEFR levels
Original dataset
- CEFR-based ELT Course Books
- CEFR-based ELT Course Books (Position-based)
- CEFR-based ELT Course Books (Skill-based)
- Written Learner Corpus: JEFLL Corpus (Original vs. Corrected)
- Spoken Learner Corpus: NICT JLE Corpus
How to cite:
- Ishii, Y. & Tono, Y. (2018). Investigating Japanese EFL learners' overuse/underuse of English grammar categories and their relevance to CEFR levels. Proceedings of the 4th Asia Pacific Corpus Linguistics Conference, (Edited by Y. Tono and H. Isahara), pp. 160-165.
CEFR-J Text Profile
CEFR-J Error Profile