Recoverable Identifier

arXiv:2604.24334 · detector doi_compliance · incontrovertible · 2026-05-19 22:14:57.080790+00:00

advisory doi_compliance recoverable_identifier

DOI in the printed bibliography is fragmented by whitespace or line breaks. A longer candidate (10.18653/v1/2022.acl-long.577.url:https://aclanthology.org/) was visible in the surrounding text but could not be confirmed against doi.org as printed.

Evidence text

Katherine Lee et al. “Deduplicating Training Data Makes Language Models Better”. In:Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Ed. by Smaranda Muresan, Preslav Nakov, and Aline Villavicencio. Dublin, Ireland: Association for Computational Linguistics, May 2022, pp. 8424–8445.DOI: 10.18653/v1/2022.acl-long.577.URL: https://aclanthology.org/ 2022.acl-long.577/
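
Recovery sketch

The printed entry fuses the DOI with the following "URL:" field label, which is why the longer candidate does not resolve. The following is a minimal sketch of how a shorter candidate could be recovered from the printed text. It is not the detector's actual implementation: the helper name recover_doi, both regular expressions, and the assumption that a fused "URL:" label marks the end of the DOI are all illustrative. Any recovered candidate would still need to be confirmed against doi.org.

```python
import re
from typing import Optional

# Hypothetical pattern: DOI prefix "10.", a 4-9 digit registrant code, "/", then a non-space suffix.
DOI_PATTERN = re.compile(r"10\.\d{4,9}/\S+")

# Assumption: a field label such as ".URL:" fused onto the DOI marks where the DOI ends.
FUSED_LABEL = re.compile(r"\.?url\s*:.*$", re.IGNORECASE)


def recover_doi(printed: str) -> Optional[str]:
    """Return a DOI candidate with any fused 'URL:' label stripped, or None."""
    match = DOI_PATTERN.search(printed)
    if match is None:
        return None
    candidate = FUSED_LABEL.sub("", match.group(0))
    # Drop sentence punctuation that bibliography styles often append.
    return candidate.rstrip(".,;")


printed = (
    "pp. 8424–8445.DOI: 10.18653/v1/2022.acl-long.577.URL: "
    "https://aclanthology.org/ 2022.acl-long.577/"
)
print(recover_doi(printed))  # 10.18653/v1/2022.acl-long.577
```

The sketch stops at the first whitespace and then trims the fused label, rather than joining neighbouring tokens, so the fragment of the URL that follows the DOI is never absorbed into the candidate.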

Evidence payload

{
  "printed_excerpt": "Katherine Lee et al. \u201cDeduplicating Training Data Makes Language Models Better\u201d. In:Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers). Ed. by Smaranda Muresan, Preslav Nakov, and",
  "reconstructed_doi": "10.18653/v1/2022.acl-long.577.url:https://aclanthology.org/",
  "ref_index": 21,
  "resolved_title": null,
  "verdict_class": "incontrovertible"
}