Zero-shot referring image segmentation with global-local context features
1 Pith paper cites this work. Polarity classification is still indexing.
fields: cs.CV (1)
years: 2026 (1)
verdicts: UNVERDICTED (1)
representative citing papers
- Zero-Shot Captioning for Cultural Heritage: Automated Image Analysis of Traditional Indonesian Clothing
Custom ZeroCLIP uses retrieval from seen provinces to caption traditional Indonesian clothing images from 8 unseen provinces, achieving CLIPScore 0.8536, BLEU-4 0.3342, and METEOR 0.4859 while outperforming baselines.