pith. sign in

Integrity report for Automated Benchmark Auditing for AI Agents and Large Language Models

A machine-verified record of the checks Pith has run against this paper: detector runs, findings, signed bundle events, and canonical identifiers.

arXiv:2605.26079 · pith:2026:DR3KKIIQOT375DTXKWCILJUQHT

0Critical
0Advisory
7Detectors run
2026-05-28Last checked

Paper page arXiv integrity.json bundle.json

Detector runs

claim_evidence completed v1.0.0 · findings 0 · 2026-05-28 18:04:55.664479+00:00
external_links completed v1.0.0 · findings 0 · 2026-05-26 17:31:40.719055+00:00
shingle_duplication skipped v0.1.0 · findings 0 · 2026-05-26 05:49:56.591447+00:00
citation_quote_validity skipped v0.1.0 · findings 0 · 2026-05-26 03:50:14.047363+00:00
ai_meta_artifact skipped v1.0.0 · findings 0 · 2026-05-26 02:34:03.978332+00:00
claim_evidence completed v1.0.0 · findings 0 · 2026-05-26 02:23:58.441639+00:00
cited_work_retraction completed v1.0.0 · findings 0 · 2026-05-26 02:23:24.457538+00:00

Findings

No public integrity findings for this paper.

Signed record

The machine-readable record for this paper lives at /pith/DR3KKIIQ/integrity.json. Pith Number bundles also include signed pith.integrity.v1 events where a Pith Number exists.