CEFR-J Reference Level Descriptions (RLDs) †
What is RLD? †
Major RLD projects for English †
- British Council/EAQUALS Core Inventory for General English
- Global Scale of English by Pearson
Before CEFR †
- Threshold Level Series ("T-series")
- You can access the original T-series books from here
CEFR-J Wordlist †
- CEFR-J Wordlist Version 1.6
- A list of 7,801 items classified by CEFR level (A1 to B2).
- Each item has the following information:
- headword (lemma form)
- part of speech
- CEFR level
- thematic categories defined by the British Council/EAQUALS Core Inventory for General English and Threshold Levels 1990 (Council of Europe)
- Version 1.6
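As a minimal sketch of how a wordlist with these fields could be used, the example below maps (headword, part of speech) pairs to CEFR levels and counts how many lemmas of a text fall at each level. The CSV layout, the column names `headword`, `pos`, and `CEFR`, and the sample rows are all assumptions made for illustration; the actual file may be organized differently.

```python
import csv
import io
from collections import Counter

# Invented sample rows for illustration only; the real wordlist's
# column names, POS labels, and level assignments may differ.
SAMPLE_CSV = """headword,pos,CEFR
apple,noun,A1
run,verb,A1
achieve,verb,B1
"""

def load_wordlist(fileobj):
    """Return a dict mapping (headword, pos) to a CEFR level string."""
    reader = csv.DictReader(fileobj)
    return {(row["headword"], row["pos"]): row["CEFR"] for row in reader}

def level_profile(lemmas, wordlist):
    """Count how many (lemma, pos) pairs fall at each CEFR level.

    Pairs missing from the wordlist are counted under "unlisted".
    """
    counts = Counter()
    for key in lemmas:
        counts[wordlist.get(key, "unlisted")] += 1
    return counts

wordlist = load_wordlist(io.StringIO(SAMPLE_CSV))
profile = level_profile(
    [("apple", "noun"), ("achieve", "verb"), ("zeitgeist", "noun")],
    wordlist,
)
```

Keying on the (headword, part of speech) pair rather than the headword alone keeps homographs with different parts of speech apart, which matters if they are assigned different levels.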
CEFR-J Collocation List †
- First release (September, 2022)
- Collocation list based on the CEFR-J Wordlist Ver. 1.6
- Syntactic frame-based collocation pairs extracted from the BNC (dependency-parsed with Stanza)
- Each collocation pair has the following information:
- w1: collocate
- w2: node
- w1_CEFR: CEFR level of w1
- w2_CEFR: CEFR level of w2
- relation: dependency relation
- cooccurrence: collocation frequency
- freq_w1: independent frequency of w1 in the entire BNC
- freq_w2: independent frequency of w2 in the entire BNC
- w1_in_rel: frequency of w1 in the given dependency relation
- w2_in_rel: frequency of w2 in the given dependency relation
- DP: dispersion measure DP (Gries)
- expected_freq: expected co-occurrence frequency of the pair
- Association measures for this given collocation pair:
- MI, MI2, MI3, t_score, z_score, logDice, log_likelihood, chi_squared
- ADJ+NOUN (amod): 135,939 pairs [ download ]
- VERB+NOUN (obj): 114,582 pairs [ download ]
- NOUN+NOUN (nounmod): 72,340 pairs [ download ]
- ADVERB+VERB (advmod verb): 43,992 pairs [ download ]
- ADVERB+ADJ (advmod adj): 16,180 pairs [ download ]
- Acknowledgement: This dataset was created by Kohei Fukuda, a postgraduate student in my lab.
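To make the relationship between the count fields and some of the association measures concrete, here is a hedged sketch using the textbook definitions of expected frequency, MI, MI2, MI3, t-score, z-score, and logDice. It is not necessarily how the released values were computed: the function name, the `corpus_size` parameter (which is not one of the listed fields), and the choice of whole-corpus frequencies rather than the in-relation frequencies (`w1_in_rel`, `w2_in_rel`) are assumptions. Log-likelihood and chi-squared are omitted for brevity.

```python
import math

def association_measures(cooccurrence, freq_w1, freq_w2, corpus_size):
    """Compute common association measures from raw counts.

    corpus_size is an assumed parameter (total number of tokens or
    pairs used as N); it is not one of the dataset's listed fields.
    """
    observed = cooccurrence
    # Expected co-occurrence frequency under independence.
    expected = freq_w1 * freq_w2 / corpus_size
    return {
        "expected_freq": expected,
        "MI": math.log2(observed / expected),
        "MI2": math.log2(observed ** 2 / expected),
        "MI3": math.log2(observed ** 3 / expected),
        "t_score": (observed - expected) / math.sqrt(observed),
        "z_score": (observed - expected) / math.sqrt(expected),
        # logDice does not depend on corpus size.
        "logDice": 14 + math.log2(2 * observed / (freq_w1 + freq_w2)),
    }

# Made-up counts chosen so the arithmetic is easy to follow.
scores = association_measures(
    cooccurrence=8, freq_w1=100, freq_w2=200, corpus_size=10_000
)
```

With these made-up counts the expected frequency is 100 × 200 / 10,000 = 2, so MI is log2(8 / 2) = 2. MI2 and MI3 raise the observed count to the second and third power, which reduces the bias of MI toward rare pairs.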
CEFR-J Grammar Profile †
- An inventory of grammar items classified by CEFR levels
- Profiling was based on INPUT (ELT Course Book Corpus) as well as OUTPUT (Spoken and Written Learner Corpus)
A list of grammar items and their REGEX queries †
- The following Excel file lists the 263 grammar items investigated and the REGEX query for each
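For readers unfamiliar with regex-based grammar queries, the sketch below shows the general idea on POS-tagged text. The `word_TAG` input format, the Penn Treebank-style tags, and the pattern itself are hypothetical illustrations; they are not taken from the Excel file, whose queries may use a different tagset and syntax.

```python
import re

# Hypothetical query for a present perfect construction:
# "have" or "has", an optional adverb (tag RB), then a past participle (tag VBN).
PRESENT_PERFECT = re.compile(
    r"\b(?:have|has)_\w+\s+(?:\w+_RB\s+)?\w+_VBN\b"
)

# Invented example sentences in an assumed word_TAG format.
tagged_text = (
    "I_PRP have_VBP already_RB finished_VBN my_PRP$ homework_NN ._. "
    "She_PRP has_VBZ a_DT cat_NN ._."
)

# Each hit is one occurrence of the grammar item in the text.
hits = PRESENT_PERFECT.findall(tagged_text)
```

Counting such hits per text, grouped by the CEFR level of the source, is one way a regex query can feed a level-based grammar profile. The second sentence is not matched because "has" is followed by a noun phrase rather than a past participle.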
Grammar Profile for Teachers and Learners †
- A user-friendly version of the Grammar Profile
- A visualization of how grammar points are learned across CEFR levels
Original dataset †
- CEFR-based ELT Course Books
- CEFR-based ELT Course Books (Position-based)
- CEFR-based ELT Course Books (Skill-based)
- Written Learner Corpus: JEFLL Corpus (Original vs. Corrected)
- Spoken Learner Corpus: NICT JLE Corpus
- English Textbooks published in Japan
CEFR-J Text Profile †
CEFR-J Error Profile †