Representation Collapse in Sequential Post-Training of Large Language Models

Chenxi Lin; Hao Wang; Jiarui Wu; Mingyu Chen; Rui Zhang; Wei Sun; Xiaoran Xu; Yichen Liu; Yutong Zhou; Yuxin Yang

arxiv: 2605.30524 · v1 · pith:IAMCMCRSnew · submitted 2026-05-28 · 💻 cs.LG


classification 💻 cs.LG

keywords post-training, representation, feature, language, large, lora, models, sequential



Large language models are now adapted through chains of post-training stages rather than through a single instruction-tuning pass. This paper studies whether such sequential post-training gradually compresses internal representations into low-rank, anisotropic, and homogeneous feature spaces. We define a measurement suite for hidden states, logits, token trajectories, and LoRA updates, and we use it to analyze supervised fine-tuning, preference optimization, safety/refusal tuning, math and code specialization, and long chain-of-thought tuning under controlled stage orderings. The central hypothesis is that excessive representation concentration is not merely a geometric curiosity: it predicts reduced plasticity during later adaptation, weaker out-of-domain generalization, and poorer calibration. We further evaluate lightweight interventions, including mixed-domain replay, feature refresh, representation diversity regularization, and LoRA update decorrelation, as ways to preserve future learnability without giving up the behavioral gains of post-training.
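The abstract does not name the specific metrics in its measurement suite. As a minimal sketch of what "low-rank" and "anisotropic" could mean for a matrix of hidden states, the Python below computes two commonly used quantities: the participation ratio of the covariance spectrum (an effective dimensionality) and the mean pairwise cosine similarity (an anisotropy proxy). Both choices, and the function names `participation_ratio` and `mean_cosine_similarity`, are assumptions for illustration and are not taken from the paper.

```python
import numpy as np


def participation_ratio(hidden: np.ndarray) -> float:
    """Effective dimensionality of hidden states with shape (n_tokens, d_model).

    Equals (sum of covariance eigenvalues)^2 / (sum of squared eigenvalues),
    computed as trace(C)^2 / ||C||_F^2 so no eigendecomposition is needed.
    Ranges from 1 (all variance on one direction) up to d_model (isotropic).
    Illustrative metric; an assumption, not the paper's definition.
    """
    centered = hidden - hidden.mean(axis=0, keepdims=True)
    cov = centered.T @ centered / max(hidden.shape[0] - 1, 1)
    trace = np.trace(cov)
    frobenius_sq = np.sum(cov * cov)
    if frobenius_sq == 0.0:
        return 0.0  # constant input: no variance to measure
    return float(trace ** 2 / frobenius_sq)


def mean_cosine_similarity(hidden: np.ndarray) -> float:
    """Average cosine similarity over all distinct pairs of rows.

    Values near 0 suggest directions are spread out; values near 1 suggest
    the representations point the same way (high anisotropy).
    Illustrative metric; an assumption, not the paper's definition.
    """
    norms = np.linalg.norm(hidden, axis=1, keepdims=True)
    unit = hidden / np.clip(norms, 1e-12, None)
    sims = unit @ unit.T
    n = hidden.shape[0]
    off_diagonal_sum = sims.sum() - np.trace(sims)
    return float(off_diagonal_sum / (n * (n - 1)))
```

Under these assumptions, one would compute both quantities on the same probe inputs after each post-training stage and compare them: a falling participation ratio together with a rising mean cosine similarity would be consistent with the kind of concentration the abstract describes.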
