Phalguni Nanda

Identifiers

No identifiers captured yet.

Papers (3)

Natural Policy Gradient as Doubly Smoothed Policy Iteration: A Bellman-Operator Framework · cs.LG · 2026 · author #1
A Minimal-Assumption Analysis of Q-Learning with Time-Varying Policies · cs.LG · 2025 · author #1
From Set Convergence to Pointwise Convergence: Finite-Time Guarantees for Average-Reward Q-Learning with Adaptive Stepsizes · cs.LG · 2025 · author #2

Mentions

No mention provenance yet.

Frequent Coauthors

Zaiwei Chen · 3 shared papers