A model-agnostic Geometric Risk Controller reduces extreme errors in VLM-based OCR by requiring cross-view consensus before accepting outputs.
Semantic image synthesis with spatially-adaptive normalization,
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CV 2years
2026 2representative citing papers
SPADE-LDM conditional synthesis from composite semantic masks produces realistic 3D LGE MRI that raises LA cavity Dice from 0.908 to 0.936.
citing papers explorer
-
From Plausibility to Verifiability: Risk-Controlled Generative OCR with Vision-Language Models
A model-agnostic Geometric Risk Controller reduces extreme errors in VLM-based OCR by requiring cross-view consensus before accepting outputs.
-
3D Conditional Image Synthesis of Left Atrial LGE MRI from Composite Semantic Masks
SPADE-LDM conditional synthesis from composite semantic masks produces realistic 3D LGE MRI that raises LA cavity Dice from 0.908 to 0.936.