MMTutorBench is the first multimodal benchmark for AI math tutoring with 685 problems, problem-specific rubrics across six dimensions, and evaluations of 12 MLLMs revealing performance gaps versus humans.
We first employ Gemini-2.0-Flash to process the subtitles and isolate the core mathematical problem-solving steps relevant to each key- step frame
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2025 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
MMTutorBench: The First Multimodal Benchmark for AI Math Tutoring
MMTutorBench is the first multimodal benchmark for AI math tutoring with 685 problems, problem-specific rubrics across six dimensions, and evaluations of 12 MLLMs revealing performance gaps versus humans.