Recoverable Identifier
advisory
doi_compliance
recoverable_identifier
The DOI in the printed bibliography is fragmented by whitespace or line breaks. A longer candidate (10.18653/v1/2025.acl-long.485.URLhttps://aclanthology.org/2025.acl-long.485/.Han) was visible in the surrounding text, but it runs past the DOI into the following URL and the first word of the next entry, and it could not be confirmed against doi.org as printed.
Evidence text
Association for Computational Linguistics. doi: 10.18653/v1/2025. acl-long.485. URLhttps://aclanthology.org/2025.acl-long.485/. Han Zhong, Zikang Shan, Guhao Feng, Wei Xiong, Xinle Cheng, Li Zhao, Di He, Jiang Bian, and Liwei Wang. DPO meets PPO: Reinforced token optimization for RLHF. InProceedings of the 42nd International Conference on Machine Learning, volume 267 ofProceedings of Machine Learning Research, pages 78498–78521. PMLR,
Evidence payload
{
"printed_excerpt": "Association for Computational Linguistics. doi: 10.18653/v1/2025. acl-long.485. URLhttps://aclanthology.org/2025.acl-long.485/. Han Zhong, Zikang Shan, Guhao Feng, Wei Xiong, Xinle Cheng, Li Zhao, Di He, Jiang Bian, and Liwei Wang. DPO meet",
"reconstructed_doi": "10.18653/v1/2025.acl-long.485.URLhttps://aclanthology.org/2025.acl-long.485/.Han",
"ref_index": 9,
"resolved_title": null,
"verdict_class": "incontrovertible"
}
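The over-long candidate in the payload suggests the whitespace was removed without first finding where the DOI ends. A minimal sketch of a more conservative recovery step follows. It assumes the bibliography style marks the link with a `URL` label or a bare `http(s)://` scheme. The function name `recover_doi` and both regular expressions are hypothetical illustrations, not the logic of the tool that produced this report.

```python
import re
from typing import Optional

# Hypothetical pattern: a DOI starts with "10.", a 4-9 digit registrant code, then "/".
DOI_START = re.compile(r"10\.\d{4,9}/")
# Assumed terminators: the bibliography's "URL" label or a bare URL scheme.
DOI_END = re.compile(r"\s*(?:URL|https?://)")


def recover_doi(printed: str) -> Optional[str]:
    """Rejoin a DOI fragmented by whitespace, stopping before a trailing URL."""
    start = DOI_START.search(printed)
    if start is None:
        return None
    tail = printed[start.start():]

    end = DOI_END.search(tail)
    if end is None:
        # No clear terminator: stay conservative and keep only the first token.
        tail = tail.split()[0]
    else:
        tail = tail[:end.start()]

    # Drop whitespace introduced by line breaks, then trailing sentence punctuation.
    return re.sub(r"\s+", "", tail).rstrip(".,;")


excerpt = (
    "Association for Computational Linguistics. doi: 10.18653/v1/2025. "
    "acl-long.485. URLhttps://aclanthology.org/2025.acl-long.485/. Han Zhong"
)
print(recover_doi(excerpt))
```

On the printed excerpt above this yields the shorter candidate 10.18653/v1/2025.acl-long.485, which would still need to be confirmed against doi.org before being treated as resolved. The `URL` cut is a heuristic: a DOI suffix that itself contains the uppercase string `URL` would be truncated.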