Title resolution pending

Alexandre Lacoste, Nicolas Gontier, Oleh Shliazhko, Aman Jaiswal, Kusha Sareen, Shailesh Nanisetty, Joan Cabezas, Manuel Del Verme, Omar G · 2026 · arXiv 2603.15798

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Every Eval Ever: A Unifying Schema and Community Repository for AI Evaluation Results

cs.AI · 2026-06-12 · unverdicted · novelty 7.0

Introduces the first community-governed unified JSON schema and crowdsourced repository for AI evaluation results, with converters and a database spanning 22,235 models and 2,273 benchmarks.

AgentBeats: Agentifying Agent Assessment for Openness, Standardization, and Reproducibility

cs.AI · 2026-06-11 · unverdicted · novelty 7.0

AgentBeats implements agentified evaluation of diverse AI agents through standardized interfaces, validated at scale in a five-month competition with 298 judges and 467 subjects plus a coding case study.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Every Eval Ever: A Unifying Schema and Community Repository for AI Evaluation Results cs.AI · 2026-06-12 · unverdicted · none · ref 52
Introduces the first community-governed unified JSON schema and crowdsourced repository for AI evaluation results, with converters and a database spanning 22,235 models and 2,273 benchmarks.
AgentBeats: Agentifying Agent Assessment for Openness, Standardization, and Reproducibility cs.AI · 2026-06-11 · unverdicted · none · ref 21
AgentBeats implements agentified evaluation of diverse AI agents through standardized interfaces, validated at scale in a five-month competition with 298 judges and 467 subjects plus a coding case study.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer