Text Readability Assessment for Second Language Learners

Ekaterina Kochmar; Menglin Xia; Ted Briscoe

arxiv: 1906.07580 · v1 · pith:5FDKZXGWnew · submitted 2019-06-18 · 💻 cs.CL

classification 💻 cs.CL

keywords: learners, readability, assessment, data, native, text, texts, language

This paper addresses the task of readability assessment for texts aimed at second language (L2) learners. One of the major challenges in this task is the lack of sizeable level-annotated data. For the present work, we collected a dataset of CEFR-graded texts tailored for learners of English as an L2 and investigated text readability assessment for both native readers and L2 learners. We applied a generalization method to adapt models trained on larger native corpora to estimate text readability for learners, and explored domain adaptation and self-learning techniques to make use of the native data to improve system performance on the limited L2 data. In our experiments, the best-performing model for readability on learner texts achieves an accuracy of 0.797 and a Pearson correlation coefficient (PCC) of 0.938.
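The two metrics reported in the abstract can be illustrated with a minimal sketch. It assumes that readability levels are mapped to ordinal integers so that a correlation between gold and predicted levels is meaningful. The level set, the `CEFR_TO_INT` mapping, the function names, and the toy labels are hypothetical and are not taken from the paper.

```python
from math import sqrt

# Hypothetical ordinal encoding of CEFR levels (an assumption for illustration,
# not the paper's label scheme).
CEFR_TO_INT = {"A2": 0, "B1": 1, "B2": 2, "C1": 3, "C2": 4}


def accuracy(gold, pred):
    """Fraction of texts whose predicted level exactly matches the gold level."""
    if len(gold) != len(pred) or not gold:
        raise ValueError("gold and pred must be non-empty and of equal length")
    return sum(g == p for g, p in zip(gold, pred)) / len(gold)


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length numeric sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return cov / sqrt(var_x * var_y)


def evaluate(gold_levels, pred_levels):
    """Return (accuracy, PCC) for lists of CEFR level strings."""
    gold_ints = [CEFR_TO_INT[level] for level in gold_levels]
    pred_ints = [CEFR_TO_INT[level] for level in pred_levels]
    return accuracy(gold_levels, pred_levels), pearson(gold_ints, pred_ints)


if __name__ == "__main__":
    # Toy labels, invented for this example only.
    gold = ["A2", "B1", "B2", "C1", "C2"]
    pred = ["A2", "B1", "B1", "C1", "C2"]
    acc, pcc = evaluate(gold, pred)
    print(f"accuracy={acc:.3f} PCC={pcc:.3f}")
```

Accuracy rewards only exact level matches, while PCC also credits predictions that land near the gold level. This is one plausible reason for reporting both on an ordinal task.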

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

The Crutch or the Ceiling? How Different Generations of LLMs Shape EFL Student Writings
cs.HC 2026-04 unverdicted novelty 4.0

Advanced LLMs improve EFL writing scores and diversity for lower-proficiency students but correlate with lower expert ratings on deep coherence, acting more as crutches than scaffolds.