Towards specialized generalists: A multi-task moe-lora framework for domain-specific llm adaptation.arXiv preprint arXiv:2601.07935, 2026a

Yuxin Yang, Aoxiong Zeng, Xiangquan Yang · 2026 · arXiv 2601.07935

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

The Long-Term Effects of Data Selection in LLM Fine-Tuning

cs.LG · 2026-05-28 · unverdicted · novelty 6.0

Short-term data selectors in multi-stage LLM fine-tuning can slow future learning and increase forgetting, formalized as myopic selection with a proposed LHAS objective to address it.

Representation Collapse in Sequential Post-Training of Large Language Models

cs.LG · 2026-05-28 · unverdicted · novelty 5.0

Sequential post-training of LLMs induces representation collapse that correlates with reduced plasticity, weaker generalization, and poorer calibration, with lightweight interventions tested to mitigate it.

citing papers explorer

Showing 2 of 2 citing papers.

The Long-Term Effects of Data Selection in LLM Fine-Tuning cs.LG · 2026-05-28 · unverdicted · none · ref 11
Short-term data selectors in multi-stage LLM fine-tuning can slow future learning and increase forgetting, formalized as myopic selection with a proposed LHAS objective to address it.
Representation Collapse in Sequential Post-Training of Large Language Models cs.LG · 2026-05-28 · unverdicted · none · ref 46
Sequential post-training of LLMs induces representation collapse that correlates with reduced plasticity, weaker generalization, and poorer calibration, with lightweight interventions tested to mitigate it.

Towards specialized generalists: A multi-task moe-lora framework for domain-specific llm adaptation.arXiv preprint arXiv:2601.07935, 2026a

fields

years

verdicts

representative citing papers

citing papers explorer