Shangtong Zhang — Pith Author Registry

Identifiers

name variant Shangtong Zhang 0.60 · backfill

Papers (20)

Convergence of Two-Timescale Markovian Stochastic Approximations with Applications in Reinforcement Learning cs.LG · 2026 · author #4
Latent Q-Barrier Shielding for Safe In-Context Reinforcement Learning cs.LG · 2026 · author #3
Predicting Plasticity in Deep Continual Learning: A Theoretical Perspective cs.LG · 2026 · author #6
Beyond Linear Attention: Softmax Transformers Implement In-Context Reinforcement Learning cs.LG · 2026 · author #6
MathlibPR: Pull Request Merge-Readiness Benchmark for Formal Mathematical Libraries cs.LO · 2026 · author #3
Convergence and Emergence of In-Context Reinforcement Learning with Chain of Thought cs.LG · 2026 · author #4
Almost Sure Convergence Rates of Stochastic Approximation and Reinforcement Learning via a Poisson-Moreau Drift cs.LG · 2026 · author #3
On the Divergence of Differential Temporal Difference Learning without Local Clocks cs.LG · 2026 · author #2
Adaptive Policy Selection and Fine-Tuning under Interaction Budgets for Offline-to-Online Reinforcement Learning cs.LG · 2026 · author #3
MathlibLemma: Folklore Lemma Generation and Benchmark for Formal Mathematics cs.LO · 2026 · author #8
Extensions of Robbins-Siegmund Theorem with Applications in Reinforcement Learning cs.LG · 2025 · author #3
Safe In-Context Reinforcement Learning cs.LG · 2025 · author #7
Reward Is Enough: LLMs Are In-Context Reinforcement Learners cs.LG · 2025 · author #6
GameChat: Multi-LLM Dialogue for Safe, Agile, and Socially Optimal Multi-Agent Navigation in Constrained Environments cs.RO · 2025 · author #2
Distributional Reinforcement Learning for Efficient Exploration cs.LG · 2019 · author #2
ACE: An Actor Ensemble Algorithm for Continuous Control with Tree Search cs.LG · 2018 · author #1
QUOTA: The Quantile Option Architecture for Reinforcement Learning cs.LG · 2018 · author #1
A Deeper Look at Experience Replay cs.LG · 2017 · author #1
Comparing Deep Reinforcement Learning and Evolutionary Methods in Continuous Control cs.LG · 2017 · author #1
Learning Representations by Stochastic Meta-Gradient Descent in Neural Networks cs.LG · 2016 · author #2

Mentions

2605.31172 #4 · arxiv_oai · confidence 0.70 Shangtong Zhang
2509.26442 #3 · arxiv_oai · confidence 0.70 Shangtong Zhang
2509.25582 #7 · arxiv_oai · confidence 0.70 Shangtong Zhang
2602.02561 #8 · arxiv_oai · confidence 0.70 Shangtong Zhang
2605.25267 #3 · arxiv_oai · confidence 0.70 Shangtong Zhang
2605.07333 #6 · arxiv_oai · confidence 0.70 Shangtong Zhang

Frequent Coauthors

Xinyu Liu 6 shared papers
Zixuan Xie 6 shared papers
Rohan Chandra 5 shared papers
Amir Moeini 4 shared papers
Claire Chen 4 shared papers
Shuze Daniel Liu 4 shared papers
Hengshuai Yao 3 shared papers
Alper Kamil Bozkurt 2 shared papers
Borislav Mavrin 2 shared papers
Linglong Kong 2 shared papers
Lu Feng 2 shared papers
Minjae Kwon 2 shared papers
Richard S. Sutton 2 shared papers
Vagul Mahadevan 2 shared papers
Yuichi Motai 2 shared papers
Aidong Zhang 1 shared papers
Ali Payani 1 shared papers
Bo Liu 1 shared papers
David Antrobius 1 shared papers
Hao Chen 1 shared papers