pith. sign in

Judging llm-as-a-judge with mt-bench and chatbot arena

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

citation-role summary

background 1

citation-polarity summary

fields

cs.AI 1 cs.IR 1

years

2026 2

verdicts

UNVERDICTED 2

roles

background 1

polarities

background 1

clear filters

representative citing papers

Verifiable Process Rewards for Agentic Reasoning

cs.AI · 2026-05-11 · unverdicted · novelty 6.0

VPR converts symbolic, constraint, or posterior oracles into dense turn-level rewards for RL, improving credit assignment in agentic reasoning and transferring to general benchmarks.

Agentic GraphRAG: Navigating Unstructured Financial Data with Collaborative AI

cs.IR · 2026-04-15 · unverdicted · novelty 6.0

Agentic GraphRAG constructs a Neo4j graph via deterministic structured ingestion plus LLM extraction from notices, then deploys modular agents with tool access and reflection to outperform vector-RAG baselines on Swiss commercial gazette data across entity resolution, answer quality, and multi-turn

citing papers explorer

Showing 2 of 2 citing papers after filters.

  • Verifiable Process Rewards for Agentic Reasoning cs.AI · 2026-05-11 · unverdicted · none · ref 38

    VPR converts symbolic, constraint, or posterior oracles into dense turn-level rewards for RL, improving credit assignment in agentic reasoning and transferring to general benchmarks.

  • Agentic GraphRAG: Navigating Unstructured Financial Data with Collaborative AI cs.IR · 2026-04-15 · unverdicted · none · ref 8

    Agentic GraphRAG constructs a Neo4j graph via deterministic structured ingestion plus LLM extraction from notices, then deploys modular agents with tool access and reflection to outperform vector-RAG baselines on Swiss commercial gazette data across entity resolution, answer quality, and multi-turn