Clarity generates benchmarks showing that leading NL2SQL systems degrade significantly under multi-faceted ambiguity and struggle to localize or resolve schema-level issues despite detecting ambiguity.
For example: 'Customer Count', 'customer count', 'CUSTOMER COUNT', 'customer_count' and 'customer counts' represent the same entity.
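One way to picture how such surface variants could be folded into a single entity is a small normalization step. The sketch below is only an illustration under stated assumptions, not the method used by the paper: the helper name `canonical_key` is hypothetical, and the plural handling is a deliberately naive rule that a real system would likely replace with a lemmatizer or a curated synonym list.

```python
import re


def canonical_key(name: str) -> str:
    """Map surface variants of a name to one key (hypothetical helper)."""
    # Lowercase, then treat runs of whitespace, underscores and hyphens as one separator.
    tokens = [t for t in re.split(r"[\s_\-]+", name.strip().lower()) if t]
    # Naive plural folding on the last token only, e.g. "counts" -> "count".
    # Assumption: good enough for this example, not for general English.
    if tokens:
        last = tokens[-1]
        if len(last) > 3 and last.endswith("s") and not last.endswith("ss"):
            tokens[-1] = last[:-1]
    return "_".join(tokens)


variants = [
    "Customer Count",
    "customer count",
    "CUSTOMER COUNT",
    "customer_count",
    "customer counts",
]

# Every variant from the example collapses to the same key.
keys = {canonical_key(v) for v in variants}
print(keys)
```

Folding only the final token keeps the rule from mangling earlier words, at the cost of missing variants such as 'customers count'.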
1 Pith paper cites this work. Polarity classification is still indexing.
Citing paper (field: cs.CL; year: 2026; verdict: unverdicted):
CLARITY: A Framework and Benchmark for Conversational Language Ambiguity and Unanswerability in Interactive NL2SQL Systems