The Relative Effectiveness of Human Tutoring, Intelligent Tutoring Systems, and Other Tutoring Systems
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CY (1)
Years: 2026 (1)
Verdicts: UNVERDICTED (1)
Representative citing papers
Assessing the Pedagogical Readiness of Large Language Models as AI Tutors in Low-Resource Contexts: A Case Study of Nepal's K-10 Curriculum
Frontier LLMs reach ~97% aggregate reliability on Nepal's K-10 curriculum but show major shortfalls in pedagogical clarity and cultural contextualization, indicating they are not ready for autonomous tutoring.