Paper Integrity Record · LEDGER

Directional Alignment Mitigates Reward Hacking in Reinforcement Learning for Language Models

As of 2 June 2026, Pith completed 2 of 3 listed checks against arXiv:2605.25189. The completed checks produced no public findings within their stated scope. 1 check was skipped and produced no result.

This is a record of named checks, not a clean-status claim or a paper verdict.

pith.integrity.v1
2605.25189 v1
pith:2026:TCWOBLITA25TFI24X2VKMAI4PF

Coverage vector

completed 2 of 3 listed checks

Counted as completed checks in the claim sentence numerator.

skipped 1 of 3 listed checks

Skipped checks produced no result and say nothing about absence.

Listed checks

ai_meta_artifact

skipped

v1.0.0, observed 2026-06-02 07:35:16.697173+00:00

Scope: Paper body text scanned for literal AI-assistant artifacts.
Reason: Paper body text was unavailable.

claim_evidence

completed

v1.0.0, observed 2026-05-31 18:06:50.968576+00:00

Scope: Recorded paper claims checked for the evidence artifacts they name.
Completed surface: Completed within the stated scope.

cited_work_retraction

completed

v1.0.0, observed 2026-05-26 17:23:31.494855+00:00

Scope: Cited references checked for source-reported retractions or expressions of concern.
Completed surface: Completed within the stated scope.

Observations

The completed checks produced no public findings within their stated scope.

Methods and limits

Each listed check names the surface it examined. A skipped, failed, partial, unavailable, not collected, not requested, or withheld check says nothing about what a completed check would have returned.

Only checks with status completed enter the claim sentence numerator.
Findings appear only from completed checks and only within the scope printed for that check.
Severity labels describe one observation row. They do not roll up into a paper judgment.