Ro- bust visual representation learning with multi-modal prior knowledge for image classification under distribution shift

Hongkuan Zhou, Lavdim Halilaj, Sebastian Monka, Stefan Schmid, Yuqicheng Zhu, Bo Xiong, Steffen Staab · 2024 · arXiv 2410.15981

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

read on arXiv browse 1 citing papers

representative citing papers

GenAU: Language-Grounded Industrial Anomaly Understanding with Vision-Language Models

cs.CV · 2026-07-01 · unverdicted · novelty 6.0

GenAU augments a vision-language model with segmentation tokens to unify image-level anomaly detection, pixel-level segmentation, multi-type classification, and language-based defect analysis in a single instruction-following architecture.

citing papers explorer

Showing 1 of 1 citing paper after filters.

GenAU: Language-Grounded Industrial Anomaly Understanding with Vision-Language Models cs.CV · 2026-07-01 · unverdicted · none · ref 31
GenAU augments a vision-language model with segmentation tokens to unify image-level anomaly detection, pixel-level segmentation, multi-type classification, and language-based defect analysis in a single instruction-following architecture.

Ro- bust visual representation learning with multi-modal prior knowledge for image classification under distribution shift

fields

years

verdicts

representative citing papers

citing papers explorer