CaseFacts benchmark of 6,294 claims shows LLMs struggle to verify colloquial legal statements against Supreme Court precedents, with unrestricted web search degrading performance due to noisy precedents.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.CL 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
CaseFacts: A Benchmark for Legal Fact-Checking and Precedent Retrieval
CaseFacts benchmark of 6,294 claims shows LLMs struggle to verify colloquial legal statements against Supreme Court precedents, with unrestricted web search degrading performance due to noisy precedents.