pith. sign in

Integrity report for Latent-space Attacks for Refusal Evasion in Language Models

A machine-verified record of the checks Pith has run against this paper: detector runs, findings, signed bundle events, and canonical identifiers.

arXiv:2605.21706 · pith:2026:CQ445C746WIGW6BDTQXYHMWICT

0Critical
0Advisory
5Detectors run
2026-05-27Last checked

Paper page arXiv integrity.json bundle.json

Detector runs

doi_compliance completed v1.0.0 · findings 0 · 2026-05-27 09:24:28.586365+00:00
doi_title_agreement completed v1.0.0 · findings 0 · 2026-05-27 09:03:13.861218+00:00
claim_evidence completed v1.0.0 · findings 0 · 2026-05-26 08:44:05.433808+00:00
cited_work_retraction completed v1.0.0 · findings 0 · 2026-05-22 19:52:34.371225+00:00
ai_meta_artifact skipped v1.0.0 · findings 0 · 2026-05-22 03:33:37.544223+00:00

Findings

No public integrity findings for this paper.

Signed record

The machine-readable record for this paper lives at /pith/CQ445C74/integrity.json. Pith Number bundles also include signed pith.integrity.v1 events where a Pith Number exists.