Mechanistic Understanding and Mitigation of Language Model Non-Factual Hallucinations

Lei Yu, Meng Cao, Jackie CK Cheung, Yue Dong · 2024 · DOI 10.18653/v1/2024.findings-emnlp.466

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

LMs as Task-Specific Knowledge Bases: An Interpretability Analysis

cs.CL · 2026-06-25 · unverdicted · novelty 6.0

LMs store facts in task-specific parameter subsets, shown by inconsistent emergence across tasks during training and distinct localized parameters for the same fact.

Hallucinations as Orthogonal Noise: Inference-Time Manifold Alignment via Dynamic Contextual Orthogonalization

cs.CL · 2026-06-02 · unverdicted · novelty 5.0

DCO is an inference-time intervention that decomposes attention head outputs orthogonally to a dynamic context anchor and suppresses outlier components via Z-score to improve contextual faithfulness in Llama models.

citing papers explorer

Showing 2 of 2 citing papers.

LMs as Task-Specific Knowledge Bases: An Interpretability Analysis cs.CL · 2026-06-25 · unverdicted · none · ref 64
LMs store facts in task-specific parameter subsets, shown by inconsistent emergence across tasks during training and distinct localized parameters for the same fact.
Hallucinations as Orthogonal Noise: Inference-Time Manifold Alignment via Dynamic Contextual Orthogonalization cs.CL · 2026-06-02 · unverdicted · none · ref 79
DCO is an inference-time intervention that decomposes attention head outputs orthogonally to a dynamic context anchor and suppresses outlier components via Z-score to improve contextual faithfulness in Llama models.

Mechanistic Understanding and Mitigation of Language Model Non-Factual Hallucinations

fields

years

verdicts

representative citing papers

citing papers explorer