BLIP-2: Bootstrapping language-image pre-training with frozen image encoders and large language models

Junnan Li, Dongxu Li, Silvio Savarese, Steven Hoi

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

PRADA: Probability-Ratio-Based Attribution and Detection of Autoregressive-Generated Images

cs.CV · 2025-11-25 · unverdicted · novelty 7.0

PRADA uses probability ratios of autoregressive token sequences to detect and attribute images to specific generative models.

A More Word-like Image Tokenization for MLLMs

cs.CV · 2026-05-18 · unverdicted · novelty 6.0

DiVT clusters patch embeddings into coherent semantic units and adapts token count to image complexity, matching or exceeding baselines with fewer visual tokens on multimodal benchmarks.

citing papers explorer

PRADA: Probability-Ratio-Based Attribution and Detection of Autoregressive-Generated Images

cs.CV · 2025-11-25 · unverdicted · none · ref 44

PRADA uses probability ratios of autoregressive token sequences to detect and attribute images to specific generative models.

A More Word-like Image Tokenization for MLLMs

cs.CV · 2026-05-18 · unverdicted · none · ref 26

DiVT clusters patch embeddings into coherent semantic units and adapts token count to image complexity, matching or exceeding baselines with fewer visual tokens on multimodal benchmarks.
