arXiv preprint arXiv:2507.22052 (2025)

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

fields

cs.CV 2

years

2026 2

verdicts

UNVERDICTED 2

representative citing papers

Cross-Attentive Multiview Fusion of Vision-Language Embeddings

cs.CV · 2026-04-14 · unverdicted · novelty 6.0

CAMFusion fuses multiview 2D vision-language embeddings via cross-attention and multiview consistency self-supervision to produce better 3D semantic and instance representations, outperforming averaging and reaching SOTA on benchmarks including zero-shot out-of-domain cases.

citing papers explorer

  • Bidirectional Cross-Modal Prompting for Event-Frame Asymmetric Stereo · cs.CV · 2026-04-16 · unverdicted · none · ref 24

    Bi-CMPStereo uses bidirectional prompting to project event and frame data into each other's domains, creating aligned representations for improved cross-modal stereo matching.

  • Cross-Attentive Multiview Fusion of Vision-Language Embeddings · cs.CV · 2026-04-14 · unverdicted · none · ref 10

    CAMFusion fuses multiview 2D vision-language embeddings via cross-attention and multiview consistency self-supervision to produce better 3D semantic and instance representations, outperforming averaging and reaching SOTA on benchmarks including zero-shot out-of-domain cases.