MMTutorBench is the first multimodal benchmark for AI math tutoring with 685 problems, problem-specific rubrics across six dimensions, and evaluations of 12 MLLMs revealing performance gaps versus humans.
Our annotation team meticulously reviews the automatically generated sequence of representative frames
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2025 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
MMTutorBench: The First Multimodal Benchmark for AI Math Tutoring
MMTutorBench is the first multimodal benchmark for AI math tutoring with 685 problems, problem-specific rubrics across six dimensions, and evaluations of 12 MLLMs revealing performance gaps versus humans.