BLIP: bootstrapping language-image pre-training for unified vision-language understanding and generation,

· 2022

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

FashionLens: Toward Versatile Fashion Image Retrieval via Task-Adaptive Learning

cs.CV · 2026-05-21 · unverdicted · novelty 7.0

FashionLens is a task-adaptive MLLM framework that achieves SOTA performance on diverse fashion image retrieval scenarios via spherical query calibration and gradient-guided sampling.

citing papers explorer

Showing 1 of 1 citing paper.

FashionLens: Toward Versatile Fashion Image Retrieval via Task-Adaptive Learning cs.CV · 2026-05-21 · unverdicted · none · ref 12
FashionLens is a task-adaptive MLLM framework that achieves SOTA performance on diverse fashion image retrieval scenarios via spherical query calibration and gradient-guided sampling.

BLIP: bootstrapping language-image pre-training for unified vision-language understanding and generation,

fields

years

verdicts

representative citing papers

citing papers explorer