Text2DistBench is a new scalable benchmark showing LLMs outperform random baselines on distributional reading comprehension from YouTube comments but vary widely by question type and distribution characteristics.
InProceedings of the 62nd Annual Meeting of the Association for Compu- tational Linguistics (V olume 1: Long Papers), pages 16366–16393, Bangkok, Thailand
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
CONDITIONAL 1representative citing papers
citing papers explorer
-
Beyond Facts: Benchmarking Distributional Reading Comprehension in Large Language Models
Text2DistBench is a new scalable benchmark showing LLMs outperform random baselines on distributional reading comprehension from YouTube comments but vary widely by question type and distribution characteristics.