SAILER: Structure-Aware Pre-trained Language Model for Legal Case Retrieval
3 Pith papers cite this work.
Years: 2026 (3)
Citing papers explorer
- Charge as a Construct-Validity Factor in Chinese Legal Case Retrieval: A Cross-Benchmark Audit
  Matching primary charge explains 99.2% of the NDCG@10 gap between BM25 and the best systems on LeCaRDv2 because benchmark relevance is defined by charge-encoding elements.
- When Rules Learn: A Self-Evolving Agent for Legal Case Retrieval
  An LLM agent self-evolves a set of query-rewriting rules that raise BM25 performance on the LeCaRD-v2 legal retrieval benchmark above human-designed and greedy baselines.
- Beyond Probabilistic Similarity: Structural, Temporal, and Causal Limitations of Retrieval-Augmented Generation in the Legal Domain
  The paper identifies three pathologies of probabilistic RAG in legal retrieval (mereological blindness, diachronic blindness, causal opacity) and derives four deterministic architectural commitments to address the hierarchical, temporal, and institutional structure of legal knowledge.