CIRThan is a new sketch+text composed image retrieval dataset for Thangka imagery with 2,287 images, sketches, and multi-level hierarchical texts.
arXiv preprint arXiv:2312.08924 (2023) 4
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2representative citing papers
AlbumFill retrieves identity-consistent references from personal albums via VLM-inferred semantic cues to support personalized image completion.
citing papers explorer
-
A Sketch+Text Composed Image Retrieval Dataset for Thangka
CIRThan is a new sketch+text composed image retrieval dataset for Thangka imagery with 2,287 images, sketches, and multi-level hierarchical texts.
-
AlbumFill: Album-Guided Reasoning and Retrieval for Personalized Image Completion
AlbumFill retrieves identity-consistent references from personal albums via VLM-inferred semantic cues to support personalized image completion.