Risk-sensitive reinforcement learning: Near-optimal risk-sample tradeoff in regret

Fei Yingjie, Yang Zhuoran, Chen Yudong, Wang Zhaoran, Xie Qiaomin, “Risk-sensitive reinforcement learning: Near-optimal risk-sample tradeoff in regret,” inAdvances in Neural Information Processing Systems (NeurIPS), vol · 2020

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

BAPR: Bayesian amnesic piecewise-robust reinforcement learning for non-stationary continuous control

cs.LG · 2026-05-15 · unverdicted · novelty 7.0

BAPR combines Bayesian change detection with robust RL, proves the core operator is a contraction via Lean 4, and adapts conservatism after detected regime shifts in continuous control.

RE-SAC: Disentangling aleatoric and epistemic risks in bus fleet control: A stable and robust ensemble DRL approach

cs.LG · 2026-03-19 · unverdicted · novelty 5.0

RE-SAC disentangles aleatoric and epistemic risks via IPM regularization on the critic and a diversified Q-ensemble, yielding higher rewards and lower estimation error than vanilla SAC in simulated bus corridor control.

citing papers explorer

Showing 2 of 2 citing papers.

BAPR: Bayesian amnesic piecewise-robust reinforcement learning for non-stationary continuous control cs.LG · 2026-05-15 · unverdicted · full · ref 28
BAPR combines Bayesian change detection with robust RL, proves the core operator is a contraction via Lean 4, and adapts conservatism after detected regime shifts in continuous control.
RE-SAC: Disentangling aleatoric and epistemic risks in bus fleet control: A stable and robust ensemble DRL approach cs.LG · 2026-03-19 · unverdicted · none · ref 30
RE-SAC disentangles aleatoric and epistemic risks via IPM regularization on the critic and a diversified Q-ensemble, yielding higher rewards and lower estimation error than vanilla SAC in simulated bus corridor control.

Risk-sensitive reinforcement learning: Near-optimal risk-sample tradeoff in regret

fields

years

verdicts

representative citing papers

citing papers explorer