University of London, University College London (United Kingdom), 2003

Sham Machandranath Kakade · 2003

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

On the Sample Complexity of Discounted Reinforcement Learning with Optimized Certainty Equivalents

cs.LG · 2026-05-20 · unverdicted · novelty 7.0

Characterizes utility functions making recursive OCE objectives PAC-learnable and derives matching upper and lower PAC sample complexity bounds for value and policy learning, with improved tau dependence for CVaR.

citing papers explorer

Showing 1 of 1 citing paper.

On the Sample Complexity of Discounted Reinforcement Learning with Optimized Certainty Equivalents cs.LG · 2026-05-20 · unverdicted · none · ref 34
Characterizes utility functions making recursive OCE objectives PAC-learnable and derives matching upper and lower PAC sample complexity bounds for value and policy learning, with improved tau dependence for CVaR.

University of London, University College London (United Kingdom), 2003

fields

years

verdicts

representative citing papers

citing papers explorer