pith. the verified trust layer for science.

Tadashi Kozuno

Identifiers

No identifiers captured yet.

Papers (5)

  1. The Harder Path: Last Iterate Convergence for Uncoupled Learning in Zero-Sum Games with Bandit Feedback (cs.LG · 2026 · author #3)
  2. Optimal last-iterate convergence in matrix games with bandit feedback using the log-barrier (cs.LG · 2026 · author #3)
  3. Near-Optimal Policy Identification in Robust Constrained Markov Decision Processes via Epigraph Form (cs.LG · 2024 · author #2)
  4. Gap-Increasing Policy Evaluation for Efficient and Noise-Tolerant Reinforcement Learning (cs.LG · 2019 · author #1)
  5. Unifying Value Iteration, Advantage Learning, and Dynamic Policy Programming (stat.ML · 2017 · author #1)

Mentions

No mention provenance yet.

Frequent Coauthors