Truncated variance reduced value iteration

Yujia Jin et al · 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

On the Sample Complexity of Discounted Reinforcement Learning with Optimized Certainty Equivalents

cs.LG · 2026-05-20 · unverdicted · novelty 7.0

Characterizes utility functions making recursive OCE objectives PAC-learnable and derives matching upper and lower PAC sample complexity bounds for value and policy learning, with improved tau dependence for CVaR.

citing papers explorer

Showing 1 of 1 citing paper.

On the Sample Complexity of Discounted Reinforcement Learning with Optimized Certainty Equivalents cs.LG · 2026-05-20 · unverdicted · none · ref 33
Characterizes utility functions making recursive OCE objectives PAC-learnable and derives matching upper and lower PAC sample complexity bounds for value and policy learning, with improved tau dependence for CVaR.

Truncated variance reduced value iteration

fields

years

verdicts

representative citing papers

citing papers explorer