Visual Description Grounding Reduces Hallucinations and Boosts Reasoning in LVLMs

Sreyan Ghosh, Chandra Kiran Reddy Evuru, Sonal Kumar, Utkarsh Tyagi, Oriol Nieto, Zeyu Jin, Dinesh Manocha · 2024 · arXiv 2405.15683

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Visual-Advantage On-Policy Distillation for Vision-Language Models

cs.CV · 2026-05-21 · unverdicted · novelty 7.0

VA-OPD improves VLM performance on math and visual benchmarks over standard on-policy distillation by reweighting rollouts and separating KL terms according to token-level visual advantage.

Revisiting Greedy Decoding for Visual Question Answering: A Calibration Perspective

cs.CL · 2026-04-25 · unverdicted · novelty 6.0

Greedy decoding is optimal for VQA under derived calibration conditions and outperforms stochastic sampling on benchmarks.

HypEHR: Hyperbolic Modeling of Electronic Health Records for Efficient Question Answering

cs.AI · 2026-04-22 · unverdicted · novelty 6.0

HypEHR is a hyperbolic embedding model for EHR data that uses Lorentzian geometry and hierarchy-aware pretraining to answer clinical questions nearly as well as large language models at a much smaller model size.

citing papers explorer

Showing 3 of 3 citing papers.

Visual-Advantage On-Policy Distillation for Vision-Language Models · cs.CV · 2026-05-21 · unverdicted · none · ref 21
Revisiting Greedy Decoding for Visual Question Answering: A Calibration Perspective · cs.CL · 2026-04-25 · unverdicted · none · ref 1
HypEHR: Hyperbolic Modeling of Electronic Health Records for Efficient Question Answering · cs.AI · 2026-04-22 · unverdicted · none · ref 281
