Why Should I Trust You, Bellman? The Bellman Error is a Poor Replacement for Value Error

Scott Fujimoto, David Meger, Doina Precup, Ofir Nachum, Shixiang Shane Gu · 2022 · arXiv 2201.12417

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Contraction-Aligned Analysis of Soft Bellman Residual Minimization with Weighted Lp-Norm for Markov Decision Problem

cs.LG · 2026-04-08 · unverdicted · novelty 6.0

Soft Bellman residual minimization with weighted Lp-norm aligns the objective with Bellman contraction as p increases and yields performance error bounds.

Koopman-Assisted Reinforcement Learning

cs.AI · 2024-03-04 · unverdicted · novelty 6.0

Koopman-assisted RL reformulates max-entropy algorithms using controlled Koopman tensors and reports SOTA performance versus neural SAC on Lorenz, fluid flow, and other systems.

citing papers explorer

Showing 2 of 2 citing papers.

Contraction-Aligned Analysis of Soft Bellman Residual Minimization with Weighted Lp-Norm for Markov Decision Problem cs.LG · 2026-04-08 · unverdicted · none · ref 13
Koopman-Assisted Reinforcement Learning cs.AI · 2024-03-04 · unverdicted · none · ref 94
