LegalWorld is a life-cycle interactive environment modeling Chinese civil litigation as five causally connected stages grounded in 75,309 judgments, paired with LongJud-Bench for cross-stage agent evaluation.
arXiv preprint arXiv:2507.04037
3 Pith papers cite this work. Polarity classification is still indexing.
Years: 2026 (3)
Verdicts: UNVERDICTED (3)
Representative citing papers
Introduces KAPRO framework and KAware dataset to benchmark LLM agents' self-awareness in distinguishing internal knowledge from external tool needs.
PHF applies Bourdieu's Theory of Practice to create hierarchical user models for LLM personalization and reports consistent gains on the LaMP benchmark.
citing papers explorer
- From Knowing to Acting: Benchmarking Self-Awareness Capability of LLM Agents
Introduces KAPRO framework and KAware dataset to benchmark LLM agents' self-awareness in distinguishing internal knowledge from external tool needs.