pith. sign in

Eyes wide shut? exploring the visual shortcomings of multimodal llms

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

citation-role summary

background 1 dataset 1 method 1

citation-polarity summary

years

2026 3 2025 3

representative citing papers

Seed1.5-VL Technical Report

cs.CV · 2025-05-11 · unverdicted · novelty 4.0

Seed1.5-VL is a compact multimodal model that sets new records on dozens of vision-language benchmarks and outperforms prior systems on agent-style tasks.

citing papers explorer

Showing 6 of 6 citing papers.