Integrity report for Self-Supervised On-Policy Reinforcement Learning via Contrastive Proximal Policy Optimisation

A machine-verified record of the checks Pith has run against this paper: detector runs, findings, signed bundle events, and canonical identifiers.

arXiv:2605.13554 · pith:2026:U6MVV3FTONPQYZXFVZ4EWWSDO5

0Critical

0Advisory

4Detectors run

2026-05-21Last checked

Paper page arXiv integrity.json bundle.json

Detector runs

doi_title_agreement completed v1.0.0 · findings 0 · 2026-05-21 13:31:42.523101+00:00

doi_compliance completed v1.0.0 · findings 0 · 2026-05-21 10:43:06.985529+00:00

claim_evidence completed v1.0.0 · findings 0 · 2026-05-20 23:02:16.166221+00:00

ai_meta_artifact completed v1.0.0 · findings 0 · 2026-05-19 08:36:03.406843+00:00

Findings

No public integrity findings for this paper.

Signed record

The machine-readable record for this paper lives at /pith/U6MVV3FT/integrity.json. Pith Number bundles also include signed pith.integrity.v1 events where a Pith Number exists.