Text2DistBench is a new scalable benchmark showing LLMs outperform random baselines on distributional reading comprehension from YouTube comments but vary widely by question type and distribution characteristics.
In Proceedings of the 62nd Annual Meeting of the As- sociation for Computational Linguistics (V olume 1: Long Papers), pages 6349–6384, Bangkok, Thailand
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
CONDITIONAL 1representative citing papers
citing papers explorer
-
Beyond Facts: Benchmarking Distributional Reading Comprehension in Large Language Models
Text2DistBench is a new scalable benchmark showing LLMs outperform random baselines on distributional reading comprehension from YouTube comments but vary widely by question type and distribution characteristics.