Jian Qian
Identifiers
No identifiers captured yet.
Papers (5)
- What should post-training optimize? A test-time scaling law perspective cs.LG · 2026 · author #2
- Self-Normalized Martingales and Uniform Regret Bounds for Linear Regression stat.ML · 2026 · author #2
- Model-Based Reinforcement Learning with Double Oracle Efficiency in Policy Optimization and Offline Estimation cs.LG · 2026 · author #2
- $(\alpha,\beta)$-Stability for Boosting Vector-Valued Prediction cs.LG · 2026 · author #1
- Exploration Bonus for Regret Minimization in Undiscounted Discrete and Continuous Markov Decision Processes cs.LG · 2018 · author #1
Mentions
No mention provenance yet.
Frequent Coauthors
- Alessandro Lazaric 1 shared papers
- Alexander Rakhlin 1 shared papers
- David Simchi-Levi 1 shared papers
- Fan Chen 1 shared papers
- Haichen Hu 1 shared papers
- Matteo Pirotta 1 shared papers
- Muheng Li 1 shared papers
- Nikita Zhivotovskiy 1 shared papers
- Ronan Fruit 1 shared papers
- Shu Ge 1 shared papers
- Wenlong Mou 1 shared papers