pith. sign in

Blip-2: Bootstrapping language-image pre-training with frozen image encoders and large language models

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

citation-role summary

background 2

citation-polarity summary

fields

cs.CV 4 cs.RO 2

roles

background 2

polarities

background 2

representative citing papers

VGR: Visual Grounded Reasoning

cs.CV · 2025-06-13 · unverdicted · novelty 7.0

VGR introduces a visual-grounded reasoning MLLM that detects and replays image regions during inference, achieving gains on visual benchmarks with 30% fewer image tokens than the LLaVA-NeXT-7B baseline.

Test-Time Distillation for Continual Model Adaptation

cs.CV · 2025-06-03 · conditional · novelty 7.0

CoDiRe blends VLM and target model predictions via MSP-based weighting and Optimal Transport rectification to enable stable continual test-time adaptation, outperforming CoTTA by 10.55% on ImageNet-C at 48% of the compute cost.

citing papers explorer

Showing 6 of 6 citing papers.