Learning near-optimal policies with bellman-residual minimization based fitted policy iteration and a single sample path.Machine Learning, 71(1):89–129, 2008

András Antos, Csaba Szepesvári, Rémi Munos · 2008

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

TabQL: In-Context Q-Learning with Tabular Foundation Models

cs.LG · 2026-05-18 · unverdicted · novelty 7.0

TabQL is a reinforcement learning framework that substitutes a tabular foundation model with in-context capabilities for the parametric Q-network in DQN, with a warm-up phase and theoretical analysis claiming improved sample efficiency.

citing papers explorer

Showing 1 of 1 citing paper.

TabQL: In-Context Q-Learning with Tabular Foundation Models cs.LG · 2026-05-18 · unverdicted · none · ref 47
TabQL is a reinforcement learning framework that substitutes a tabular foundation model with in-context capabilities for the parametric Q-network in DQN, with a warm-up phase and theoretical analysis claiming improved sample efficiency.

Learning near-optimal policies with bellman-residual minimization based fitted policy iteration and a single sample path.Machine Learning, 71(1):89–129, 2008

fields

years

verdicts

representative citing papers

citing papers explorer