Shangtong Zhang
Identifiers
- name variant Shangtong Zhang 0.60 · backfill
Papers (20)
- Convergence of Two-Timescale Markovian Stochastic Approximations with Applications in Reinforcement Learning cs.LG · 2026 · author #4
- Latent Q-Barrier Shielding for Safe In-Context Reinforcement Learning cs.LG · 2026 · author #3
- Predicting Plasticity in Deep Continual Learning: A Theoretical Perspective cs.LG · 2026 · author #6
- Beyond Linear Attention: Softmax Transformers Implement In-Context Reinforcement Learning cs.LG · 2026 · author #6
- MathlibPR: Pull Request Merge-Readiness Benchmark for Formal Mathematical Libraries cs.LO · 2026 · author #3
- Convergence and Emergence of In-Context Reinforcement Learning with Chain of Thought cs.LG · 2026 · author #4
- Almost Sure Convergence Rates of Stochastic Approximation and Reinforcement Learning via a Poisson-Moreau Drift cs.LG · 2026 · author #3
- On the Divergence of Differential Temporal Difference Learning without Local Clocks cs.LG · 2026 · author #2
- Adaptive Policy Selection and Fine-Tuning under Interaction Budgets for Offline-to-Online Reinforcement Learning cs.LG · 2026 · author #3
- MathlibLemma: Folklore Lemma Generation and Benchmark for Formal Mathematics cs.LO · 2026 · author #8
- Extensions of Robbins-Siegmund Theorem with Applications in Reinforcement Learning cs.LG · 2025 · author #3
- Safe In-Context Reinforcement Learning cs.LG · 2025 · author #7
- Reward Is Enough: LLMs Are In-Context Reinforcement Learners cs.LG · 2025 · author #6
- GameChat: Multi-LLM Dialogue for Safe, Agile, and Socially Optimal Multi-Agent Navigation in Constrained Environments cs.RO · 2025 · author #2
- Distributional Reinforcement Learning for Efficient Exploration cs.LG · 2019 · author #2
- ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search cs.LG · 2018 · author #1
- QUOTA: The Quantile Option Architecture for Reinforcement Learning cs.LG · 2018 · author #1
- A Deeper Look at Experience Replay cs.LG · 2017 · author #1
- Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control cs.LG · 2017 · author #1
- Learning Representations by Stochastic Meta-Gradient Descent in Neural Networks cs.LG · 2016 · author #2
Mentions
- 2605.31172 #4 · arxiv_oai · confidence 0.70 Shangtong Zhang
- 2509.26442 #3 · arxiv_oai · confidence 0.70 Shangtong Zhang
- 2509.25582 #7 · arxiv_oai · confidence 0.70 Shangtong Zhang
- 2602.02561 #8 · arxiv_oai · confidence 0.70 Shangtong Zhang
- 2605.25267 #3 · arxiv_oai · confidence 0.70 Shangtong Zhang
- 2605.07333 #6 · arxiv_oai · confidence 0.70 Shangtong Zhang
Frequent Coauthors
- Xinyu Liu 6 shared papers
- Zixuan Xie 6 shared papers
- Rohan Chandra 5 shared papers
- Amir Moeini 4 shared papers
- Claire Chen 4 shared papers
- Shuze Daniel Liu 4 shared papers
- Hengshuai Yao 3 shared papers
- Alper Kamil Bozkurt 2 shared papers
- Borislav Mavrin 2 shared papers
- Linglong Kong 2 shared papers
- Lu Feng 2 shared papers
- Minjae Kwon 2 shared papers
- Richard S. Sutton 2 shared papers
- Vagul Mahadevan 2 shared papers
- Yuichi Motai 2 shared papers
- Aidong Zhang 1 shared paper
- Ali Payani 1 shared paper
- Bo Liu 1 shared paper
- David Antrobius 1 shared paper
- Hao Chen 1 shared paper