Llama See, Llama Do: A Mechanistic Perspective on Contextual Entrainment and Distraction in LLM s

Jingcheng Niu, Xingdi Yuan, Tong Wang, Hamidreza Saghir, Amir H · 2025 · DOI 10.18653/v1/2025.acl-long.791

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

representative citing papers

Sentence-Level Contextual Entrainment in Large Language Models

cs.CL · 2026-06-23 · unverdicted · novelty 6.0

Sentence-level contextual entrainment exists across LLMs, weakens with scale, and is localized to 2-4% of attention heads whose deactivation removes the effect without performance loss.

When Context Misleads: Surprisal, Energy and Attention Entropy as Metrics of Coherence Illusions in LLMs

cs.CL · 2026-06-19 · unverdicted · novelty 5.0

Dutch LLMs display coherence illusions tracked by surprisal, with attention entropy identifying affected heads and a new energy metric quantifying discourse coherence.

Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models

cs.CL · 2026-01-20 · unverdicted · novelty 5.0

The survey organizes mechanistic interpretability techniques into a Locate-Steer-Improve framework to enable actionable improvements in LLM alignment, capability, and efficiency.

citing papers explorer

Showing 3 of 3 citing papers.

Sentence-Level Contextual Entrainment in Large Language Models cs.CL · 2026-06-23 · unverdicted · none · ref 6
Sentence-level contextual entrainment exists across LLMs, weakens with scale, and is localized to 2-4% of attention heads whose deactivation removes the effect without performance loss.
When Context Misleads: Surprisal, Energy and Attention Entropy as Metrics of Coherence Illusions in LLMs cs.CL · 2026-06-19 · unverdicted · none · ref 29
Dutch LLMs display coherence illusions tracked by surprisal, with attention entropy identifying affected heads and a new energy metric quantifying discourse coherence.
Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models cs.CL · 2026-01-20 · unverdicted · none · ref 226
The survey organizes mechanistic interpretability techniques into a Locate-Steer-Improve framework to enable actionable improvements in LLM alignment, capability, and efficiency.

Llama See, Llama Do: A Mechanistic Perspective on Contextual Entrainment and Distraction in LLM s

fields

years

verdicts

representative citing papers

citing papers explorer