Language models implement simple Word2Vec-style vector arithmetic

Merullo, Jack, Eickhoff, Carsten, Pavlick, Ellie · 2024 · DOI 10.18653/v1/2024.naacl-long.281

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Where Pretraining writes and Alignment reads: the asymmetry of Transformer weight space

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

Pretraining and alignment induce asymmetric geometric traces in transformer weights because alignment updates concentrate in read pathways due to activation covariance while write pathways inherit less structure from alignment losses.

Task Vector Geometry Underlies Dual Modes of Task Inference in Transformers

cs.LG · 2026-05-05 · unverdicted · novelty 7.0

In a controlled synthetic setting, transformers implement in-distribution task inference via convex combinations of task vectors and out-of-distribution inference via nearly orthogonal extrapolative representations.

Linear Representations of Hierarchical Concepts in Language Models

cs.CL · 2026-04-09 · unverdicted · novelty 6.0

Language models encode concept hierarchies as linear transformations that are domain-specific yet structurally similar across domains.

citing papers explorer

Showing 3 of 3 citing papers.

Where Pretraining writes and Alignment reads: the asymmetry of Transformer weight space cs.LG · 2026-05-15 · unverdicted · none · ref 53
Task Vector Geometry Underlies Dual Modes of Task Inference in Transformers cs.LG · 2026-05-05 · unverdicted · none · ref 23
Linear Representations of Hierarchical Concepts in Language Models cs.CL · 2026-04-09 · unverdicted · none · ref 20
