Composing text and image for image retrieval - an empirical odyssey

Nam Vo, Lu Jiang, Chen Sun, Kevin Murphy, Li-Jia Li, Li Fei-Fei, James Hays · 2019

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

cs.CV · 2026-05-14 · unverdicted · novelty 5.0

CIR benchmarks contain many unimodal shortcuts and noisy queries, leading to overestimation of models' multimodal composition capabilities.


Do Composed Image Retrieval Benchmarks Require Multimodal Composition? cs.CV · 2026-05-14 · unverdicted · none · ref 17