YARD is a training-free method using Y-shaped decoder architecture and register tokens to improve contrastive decoding for hallucination reduction in LVLMs with lower latency.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
fields
cs.CV 2years
2026 2verdicts
UNVERDICTED 2representative citing papers
Presents GranFact benchmark with expert annotations and a reliability-prioritized DPO method to improve fine-grained yet reliable generation in MLLMs.
citing papers explorer
-
YARD: Y-Architecture Register Decoding for Efficient Hallucination Mitigation in Large Vision-Language Models
YARD is a training-free method using Y-shaped decoder architecture and register tokens to improve contrastive decoding for hallucination reduction in LVLMs with lower latency.
-
Reliability-Prioritized Fine-Grained Generation in Multimodal Large
Presents GranFact benchmark with expert annotations and a reliability-prioritized DPO method to improve fine-grained yet reliable generation in MLLMs.