Clip in mirror: Disentangling text from visual images through reflection.Advances in Neural In- formation Processing Systems, 37:24523–24546, 2024b

Wang, T · 2012

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Probing CLIP's Comprehension of 360-Degree Textual and Visual Semantics

cs.CV · 2026-04-27 · conditional · novelty 6.0

CLIP models understand 360-degree textual semantics via explicit identifiers but show limited comprehension of visual semantics under horizontal circular shifts, which a LoRA fine-tuning approach improves with a noted trade-off in original task performance.

citing papers explorer

Showing 1 of 1 citing paper.

Probing CLIP's Comprehension of 360-Degree Textual and Visual Semantics cs.CV · 2026-04-27 · conditional · none · ref 17
CLIP models understand 360-degree textual semantics via explicit identifiers but show limited comprehension of visual semantics under horizontal circular shifts, which a LoRA fine-tuning approach improves with a noted trade-off in original task performance.

Clip in mirror: Disentangling text from visual images through reflection.Advances in Neural In- formation Processing Systems, 37:24523–24546, 2024b

fields

years

verdicts

representative citing papers

citing papers explorer