pith. sign in

hub

Mirage the illusion of visual understanding

13 Pith papers cite this work. Polarity classification is still indexing.

13 Pith papers citing it

hub tools

citation-role summary

background 4

citation-polarity summary

years

2026 13

roles

background 4

polarities

background 4

representative citing papers

Do multimodal models imagine electric sheep?

cs.CV · 2026-05-10 · conditional · novelty 6.0

Fine-tuning VLMs to output action sequences for puzzles causes emergent internal visual representations that improve performance when integrated into reasoning.

Towards Conversational Medical AI with Eyes, Ears and a Voice

cs.AI · 2026-05-10 · conditional · novelty 6.0

AI co-clinician is a multimodal conversational AI that uses live audio-visual data for real-time medical reasoning in simulated telemedicine, approaching primary care physicians in management plans and differentials but lagging in physical exam and disease-specific tasks.

citing papers explorer

Showing 13 of 13 citing papers.