Challenges in understanding modality conflict in vision-language models. arXiv preprint arXiv:2509.02805, 2025c

Nguyen et al. · 2025 · arXiv 2509.02805

2 Pith papers cite this work.

representative citing papers

MLLMs Get It Right, Then Get It Wrong: Tracing and Correcting Late-Layer Textual Bias

cs.CV · 2026-06-16 · unverdicted · novelty 6.0

MLLMs show late-layer textual override of correct visual predictions, with a directional signature enabling a simple inference-time recovery method that improves conflict benchmarks by up to 9.4%.

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

cs.CL · 2026-01-20 · unverdicted · novelty 5.0

The survey organizes mechanistic interpretability techniques into a Locate-Steer-Improve framework to enable actionable improvements in LLM alignment, capability, and efficiency.

citing papers explorer

MLLMs Get It Right, Then Get It Wrong: Tracing and Correcting Late-Layer Textual Bias
cs.CV · 2026-06-16 · unverdicted · none · ref 24
Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models
cs.CL · 2026-01-20 · unverdicted · none · ref 222
