Introduces the Manifold Probe to discover representation manifolds in superposition and demonstrates causal steering on time concepts in Llama 2-7b.
Journal of the American Statistical Association , volume=
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2representative citing papers
REX-SUB combines a randomized exchange algorithm with Vecchia approximation to choose subsamples that minimize mean squared prediction error and interval scores in large-scale spatial GPs.
citing papers explorer
-
Probing for Representation Manifolds in Superposition
Introduces the Manifold Probe to discover representation manifolds in superposition and demonstrates causal steering on time concepts in Llama 2-7b.
-
REX-SUB: A Scalable Subsampling Strategy for Modeling Large Spatial Datasets
REX-SUB combines a randomized exchange algorithm with Vecchia approximation to choose subsamples that minimize mean squared prediction error and interval scores in large-scale spatial GPs.