Phalguni Nanda
Identifiers
No identifiers captured yet.
Papers (3)
- Natural Policy Gradient as Doubly Smoothed Policy Iteration: A Bellman-Operator Framework cs.LG · 2026 · author #1
- A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies cs.LG · 2025 · author #1
- From Set Convergence to Pointwise Convergence: Finite-Time Guarantees for Average-Reward Q-Learning with Adaptive Stepsizes cs.LG · 2025 · author #2
Mentions
No mention provenance yet.
Frequent Coauthors
- Zaiwei Chen 3 shared papers