pith. sign in

arxiv: 2605.26620 · v1 · pith:EA563ULFnew · submitted 2026-05-26 · 💻 cs.CL · cs.HC

Granuscore: A Reference-Free Measure of Granularity for Text Analysis and Question Answering

classification 💻 cs.CL cs.HC
keywords granularitygranuscoreacrosssentenceanalysisdifferenceshierarchicalmeasure
0
0 comments X
read the original abstract

Natural language conveys information at varying levels of granularity, from fine-grained references to broad descriptions. While granularity is fundamental to human communication, existing measures mostly capture surface detail or sentence specificity. We introduce Granuscore, a reference-free measure of granularity that leverages structural properties of a hierarchical embedding space. Granuscore reliably recovers hierarchical orderings on the Granola-EQ dataset and captures expected differences in granularity across discourse contexts. Across domains, we further show that Granuscore explains non-linear variation in sentence specificity beyond sentence length. Finally, we apply Granuscore to four question-answering benchmarks and analyze how granularity differs for questions, gold answers, and model outputs across response outcomes. The analysis reveals consistent differences in model behavior and provides a principled lens for characterizing the difficulty of QA datasets. Together, the results position Granuscore as a scalable, broadly applicable tool for analyzing granularity in text.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.