pith. sign in

Recoverable Identifier

arXiv:2604.27263 · detector doi_compliance · incontrovertible · 2026-05-19 19:25:52.347849+00:00

advisory doi_compliance recoverable_identifier

DOI in the printed bibliography is fragmented by whitespace or line breaks. A longer candidate (10.1162/tacl_a_00448.url:https://doi.org/10.1162/tacl_a_00448(visitedon) was visible in the surrounding text but could not be confirmed against doi.org as printed.

Paper page Integrity report arXiv Try DOI

Evidence text

Jonathan H. Clark, Dan Garrette, Iulia Turc, and John Wieting. “Canine: Pre-training an Effi- cient Tokenization-Free Encoder for Language Representation”. In:Transactions of the Asso- ciation for Computational Linguistics10 (Jan. 31, 2022), pp. 73–91.ISSN: 2307-387X.DOI: 10.1162/tacl_a_00448.URL:https://doi.org/10.1162/tacl_a_00448(visited on 01/22/2026)

Evidence payload

{
  "printed_excerpt": "Jonathan H. Clark, Dan Garrette, Iulia Turc, and John Wieting. \u201cCanine: Pre-training an Effi- cient Tokenization-Free Encoder for Language Representation\u201d. In:Transactions of the Asso- ciation for Computational Linguistics10 (Jan. 31, 2022)",
  "reconstructed_doi": "10.1162/tacl_a_00448.url:https://doi.org/10.1162/tacl_a_00448(visitedon",
  "ref_index": 6,
  "resolved_title": null,
  "verdict_class": "incontrovertible"
}