pith. sign in

Phalguni Nanda

Identifiers

No identifiers captured yet.

Papers (3)

  1. Natural Policy Gradient as Doubly Smoothed Policy Iteration: A Bellman-Operator Framework cs.LG · 2026 · author #1
  2. A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies cs.LG · 2025 · author #1
  3. From Set Convergence to Pointwise Convergence: Finite-Time Guarantees for Average-Reward Q-Learning with Adaptive Stepsizes cs.LG · 2025 · author #2

Mentions

No mention provenance yet.

Frequent Coauthors