In-Context Learning Operates as Concept Subspace Learning

2026 · cs.LG · arXiv 2605.18830

abstract

Regression and Bayesian accounts of in-context learning (ICL) explain how demonstrations can induce predictors, while mechanistic analyses often identify compact activation directions that steer prompted behavior. However, it remains unclear whether structured demonstrations induce low-dimensional concept inference. We study this question through a concept-subspace view of ICL, in which tasks vary only along intrinsic concept coordinates, although inputs are observed in a high-dimensional ambient space. For ridge and least-squares ICL proxies, prediction decomposes exactly into concept-coordinate regression and off-subspace leakage. Under block-diagonal or near-block-diagonal covariance assumptions, the leading estimation and nuisance-sensitivity terms scale with the dimension of the concept subspace, while residual effects are controlled by cross-subspace coupling. This separation gives a mechanistic prediction: recoverable task information should concentrate in a low-dimensional, task-aligned activation subspace. On CounterFact-derived multi-relation prompts with Llama-3-8B, a 68–73-dimensional subspace of the 4096-dimensional residual stream restores 78.8% of the clean–corrupted accuracy gap, whereas patching the complementary subspace restores 0%. Concept swaps redirect predictions toward injected relations, while random and cross-task matched-rank controls are largely ineffective. Additional experiments on Qwen2.5-7B and a controlled cross-lingual rule task show the same qualitative pattern. These results support concept subspaces as compact, task-aligned mediators of recoverable ICL behavior in structured task families, without implying full-circuit recovery.
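
The two ideas in the abstract, splitting a ridge prediction into an in-subspace term and an off-subspace term, and patching only a low-rank subspace of an activation, can be illustrated with a toy NumPy sketch. This is not the authors' code. The function names, the random orthonormal basis `U` standing in for a learned concept subspace, and the toy sizes (`d = 64`, `k = 4` rather than 4096 and 68–73) are all assumptions made for illustration, and the orthogonal-projection split shown here is one plausible reading of the decomposition, not necessarily the paper's exact form.

```python
import numpy as np


def orthonormal_basis(d, k, rng):
    """Return a d x k matrix with orthonormal columns.

    Hypothetical stand-in for a concept subspace estimated from activations.
    """
    q, _ = np.linalg.qr(rng.standard_normal((d, k)))
    return q


def ridge_weights(X, y, lam):
    """Closed-form ridge solution: (X^T X + lam * I)^{-1} X^T y."""
    d = X.shape[1]
    return np.linalg.solve(X.T @ X + lam * np.eye(d), X.T @ y)


def patch_subspace(h_corrupt, h_clean, U):
    """Overwrite the component of h_corrupt inside span(U) with h_clean's."""
    return h_corrupt + U @ (U.T @ (h_clean - h_corrupt))


def patch_complement(h_corrupt, h_clean, U):
    """Overwrite the component of h_corrupt orthogonal to span(U) with h_clean's."""
    delta = h_clean - h_corrupt
    return h_corrupt + delta - U @ (U.T @ delta)


rng = np.random.default_rng(0)
d, k, n = 64, 4, 32  # toy sizes, assumed for illustration only
U = orthonormal_basis(d, k, rng)

# Ridge proxy: targets depend on the inputs only through concept coordinates U^T x.
X = rng.standard_normal((n, d))
y = X @ (U @ rng.standard_normal(k)) + 0.01 * rng.standard_normal(n)
w = ridge_weights(X, y, lam=0.1)

# Split the prediction for a query into concept and leakage terms.
x_query = rng.standard_normal(d)
prediction = x_query @ w
concept_term = (U.T @ x_query) @ (U.T @ w)  # regression in concept coordinates
leakage_term = prediction - concept_term    # equals x^T (I - U U^T) w

# Subspace patching on toy "activations".
h_clean = rng.standard_normal(d)
h_corrupt = rng.standard_normal(d)
h_sub = patch_subspace(h_corrupt, h_clean, U)
h_comp = patch_complement(h_corrupt, h_clean, U)
```

In this sketch, `h_sub` agrees with the clean activation inside `span(U)` and with the corrupted activation elsewhere, while `h_comp` does the opposite. In an actual experiment the patched vector would be written back into the model's residual stream and the downstream accuracy compared, which the toy example does not attempt.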

representative citing papers

Causal Interventions on Continuous Variables: A Case Study on Verb Bias in Steering Vectors for In-Context Learning

cs.CL · 2026-05-28 · unverdicted · novelty 7.0

A method for causal intervention on continuous variables shows verb bias is causally encoded in LLM steering vectors and affects syntactic preferences, though links to in-context learning error signals are not causal.

