Yuchen Xie
Identifiers
- name variant Yuchen Xie 0.60 · backfill
Papers (10)
- MONA: Muon Optimizer with Nesterov Acceleration for Scalable Language Model Training cs.LG · 2026 · author #7
- OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond cs.LG · 2026 · author #11
- FG$^2$-GDN: Enhancing Long-Context Gated Delta Networks with Doubly Fine-Grained Control cs.LG · 2026 · author #8
- SparseBalance: Load-Balanced Long Context Training with Dynamic Sparse Attention cs.LG · 2026 · author #8
- Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation cs.LG · 2026 · author #19
- AsyncTLS: Efficient Generative LLM Inference with Asynchronous Two-level Sparse Attention cs.CL · 2026 · author #8
- SnapMLA: Efficient Long-Context MLA Decoding via Hardware-Aware FP8 Quantized Pipelining cs.LG · 2026 · author #7
- WISCA: A Lightweight Model Transition Method to Improve LLM Training via Weight Scaling cs.LG · 2025 · author #10
- Analysis of the BFGS Method with Errors math.OC · 2019 · author #1
- On the convergence of BFGS on a class of piecewise linear non-smooth functions math.OC · 2017 · author #1
Mentions
- 2605.26842 #7 · arxiv_oai · confidence 0.70 Yuchen Xie
- 2605.19660 #11 · arxiv_oai · confidence 0.70 Yuchen Xie
Frequent Coauthors
- Xunliang Cai 6 shared papers
- Jianchao Tan 5 shared papers
- Yerui Sun 5 shared papers
- Pingwei Sun 4 shared papers
- Jiaqi Zhang 3 shared papers
- Rui Yang 3 shared papers
- Wei Wu 3 shared papers
- Yifan Lu 3 shared papers
- Yifan Zhang 3 shared papers
- Yulei Qian 3 shared papers
- Yuxuan Hu 3 shared papers
- Zunhai Su 3 shared papers
- Chao Zhang 2 shared papers
- Hongtao Xu 2 shared papers
- Jiacheng Li 2 shared papers
- Jing Xiong 2 shared papers
- Ngai Wong 2 shared papers
- Weile Jia 2 shared papers
- Yaxiu Liu 2 shared papers
- Andreas Waechter 1 shared paper