pith. sign in

Recoverable Identifier

arXiv:2604.21814 · detector doi_compliance · incontrovertible · 2026-05-20 00:37:58.287148+00:00

advisory doi_compliance recoverable_identifier

DOI in the printed bibliography is fragmented by whitespace or line breaks. A longer candidate (10.1038/s41597-022-01726-3.https://doi.org/10.1038/s41597-022-01726-3.Junying) was visible in the surrounding text but could not be confirmed against doi.org as printed.

Paper page Integrity report arXiv Try DOI

Evidence text

doi: 10.1038/s41597-022-01726-3.https://doi.org/10.1038/s41597-022-01726-3. Junying Chen et al. Huatuogpt-vision, towards injecting medical visual knowledge into multimodal llms at scale, 2024a.https://arxiv.org/abs/2406.19280. Zhe Chen et al. Internvl: Scaling up vision foundation models and aligning for generic visual-linguistic tasks. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 24185–24198, 2024b. Chuanqi Cheng, Jian Guan, Wei Wu, and Rui Yan. Scaling video-language models to 10k frames via hierarchical differential distillation.arXiv preprint arXiv:2504.02438,

Evidence payload

{
  "printed_excerpt": "doi: 10.1038/s41597-022-01726-3.https://doi.org/10.1038/s41597-022-01726-3. Junying Chen et al. Huatuogpt-vision, towards injecting medical visual knowledge into multimodal llms at scale, 2024a.https://arxiv.org/abs/2406.19280. Zhe Chen et ",
  "reconstructed_doi": "10.1038/s41597-022-01726-3.https://doi.org/10.1038/s41597-022-01726-3.Junying",
  "ref_index": 4,
  "resolved_title": null,
  "verdict_class": "incontrovertible"
}