pith. sign in

Recoverable Identifier

arXiv:2604.21523 · detector doi_compliance · incontrovertible · 2026-05-20 00:55:14.985916+00:00

advisory doi_compliance recoverable_identifier

DOI in the printed bibliography is fragmented by whitespace or line breaks. A longer candidate (10.48550/arXiv.2308) was visible in the surrounding text but could not be confirmed against doi.org as printed.

Paper page Integrity report arXiv Try DOI

Evidence text

doi: 10.48550/ARXIV .2305.17926. URL https://doi.org/10.48550/arXiv. 2305.17926. Zengbin Wang, Xuecai Hu, Yong Wang, Feng Xiong, Man Zhang, and Xiangxiang Chu. Everything in its place: Benchmarking spatial intelligence of text-to-image models.arXiv preprint arXiv: 2601.20354, 2026. Ishaan Watts, Varun Gumma, Aditya Yadavalli, Vivek Seshadri, Swami Manohar, and Sunayana Sitaram. Pariksha: A scalable, democratic, transparent evaluation platform for assessing indic large language models. May 2024. Song Wen, Guian Fang, Renrui Zhang, Peng Gao, Hao Dong, and Dimitris N. Metaxas. Improving compositional text-to-image generation with large vision-language mod- els.ArXiv, abs/2310.06311, 2023. URL https://api.semanticscholar.org/ CorpusID:263830080. Hongji Yang, Yucheng Zhou, Wencheng Han, and Jianbing Shen. Self-rewarding large vision-language models for optimizing prompts in text-to-image generation.ArXiv, abs/2505.16763, 2025a. URL https://api.semanticscholar.org/CorpusID: 278788692. Yan Yang, Dongxu Li, Haoning Wu, Bei Chen, Liu Liu, Liyuan Pan, and Junnan Li. Probench: Judging multimodal foundation models on open-ended multi-domain expert tasks.arXiv preprint arXiv: 2503.06885, 2025b. Michihiro Yasunaga, Luke Zettlemoyer, and Marjan Ghazvininejad. Multimodal reward- bench: Holistic evaluation of reward models for vision language models.arXiv preprint arXiv: 2502.14191, 2025. Weihao Yu, Zhengyuan Yang, Linjie Li, Jianfeng Wang, Kevin Lin, Zicheng Liu, Xinchao Wang, and Lijuan Wan

Evidence payload

{
  "printed_excerpt": "doi: 10.48550/ARXIV .2305.17926. URL https://doi.org/10.48550/arXiv. 2305.17926. Zengbin Wang, Xuecai Hu, Yong Wang, Feng Xiong, Man Zhang, and Xiangxiang Chu. Everything in its place: Benchmarking spatial intelligence of text-to-image mode",
  "reconstructed_doi": "10.48550/arXiv.2308",
  "ref_index": 2,
  "resolved_title": null,
  "verdict_class": "incontrovertible"
}