Optimizing the CVaR via Sampling

Aviv Tamar, Yonatan Glassner, Shie Mannor · 2014 · stat.ML · arXiv 1404.3862

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open full Pith review browse 3 citing papers arXiv PDF

abstract

Conditional Value at Risk (CVaR) is a prominent risk measure that is being used extensively in various domains. We develop a new formula for the gradient of the CVaR in the form of a conditional expectation. Based on this formula, we propose a novel sampling-based estimator for the CVaR gradient, in the spirit of the likelihood-ratio method. We analyze the bias of the estimator, and prove the convergence of a corresponding stochastic gradient descent algorithm to a local CVaR optimum. Our method allows to consider CVaR optimization in new domains. As an example, we consider a reinforcement learning application, and learn a risk-sensitive controller for the game of Tetris.

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

Robust H2/H-infinity control under stochastic requirements: minimizing conditional value-at-risk instead of worst-case performance

eess.SY · 2025-12-20 · unverdicted · novelty 7.0

Minimizing conditional value-at-risk instead of worst-case performance reduces conservatism in robust H2/H-infinity control by tolerating rare degradations for better average behavior.

Towards Affordable Energy: A Gymnasium Environment for Electric Utility Demand-Response Programs

cs.AI · 2026-05-12 · unverdicted · novelty 7.0

DR-Gym is a new Gymnasium-compatible simulator for training utility demand-response policies with regime-switching wholesale prices and physics-based building demand.

Concrete Problems in AI Safety

cs.AI · 2016-06-21 · accept · novelty 7.0

The paper categorizes five concrete AI safety problems arising from flawed objectives, costly evaluation, and learning dynamics.

citing papers explorer

Showing 3 of 3 citing papers.

Robust H2/H-infinity control under stochastic requirements: minimizing conditional value-at-risk instead of worst-case performance eess.SY · 2025-12-20 · unverdicted · none · ref 24 · internal anchor
Minimizing conditional value-at-risk instead of worst-case performance reduces conservatism in robust H2/H-infinity control by tolerating rare degradations for better average behavior.
Towards Affordable Energy: A Gymnasium Environment for Electric Utility Demand-Response Programs cs.AI · 2026-05-12 · unverdicted · none · ref 27
DR-Gym is a new Gymnasium-compatible simulator for training utility demand-response policies with regime-switching wholesale prices and physics-based building demand.
Concrete Problems in AI Safety cs.AI · 2016-06-21 · accept · none · ref 154
The paper categorizes five concrete AI safety problems arising from flawed objectives, costly evaluation, and learning dynamics.

Optimizing the CVaR via Sampling

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer