pith. machine review for the scientific record. sign in

Jian Qian

Identifiers

No identifiers captured yet.

Papers (5)

  1. What should post-training optimize? A test-time scaling law perspective cs.LG · 2026 · author #2
  2. Self-Normalized Martingales and Uniform Regret Bounds for Linear Regression stat.ML · 2026 · author #2
  3. Model-Based Reinforcement Learning with Double Oracle Efficiency in Policy Optimization and Offline Estimation cs.LG · 2026 · author #2
  4. $(\alpha,\beta)$-Stability for Boosting Vector-Valued Prediction cs.LG · 2026 · author #1
  5. Exploration Bonus for Regret Minimization in Undiscounted Discrete and Continuous Markov Decision Processes cs.LG · 2018 · author #1

Mentions

No mention provenance yet.

Frequent Coauthors