IEEE Transactions on Neural Networks and Learning Systems

A survey on offline reinforcement learning: Taxonomy, review, open problems · 2023

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Offline Reinforcement Learning with Universal Horizon Models

cs.LG · 2026-05-15 · unverdicted · novelty 6.0

Universal horizon models extend geometric horizon models to arbitrary horizons and apply winsorized distributions for stable offline RL value learning, outperforming baselines on 100 OGBench tasks.

When Life Gives You BC, Make Q-functions: Extracting Q-values from Behavior Cloning for On-Robot Reinforcement Learning

cs.RO · 2026-05-06 · unverdicted · novelty 6.0

Q2RL extracts Q-functions from BC policies via minimal interactions and applies Q-gating to enable stable offline-to-online RL, outperforming baselines on manipulation benchmarks and achieving up to 100% success on-robot.

citing papers explorer

Showing 2 of 2 citing papers.

Offline Reinforcement Learning with Universal Horizon Models

cs.LG · 2026-05-15 · unverdicted · none · ref 65

When Life Gives You BC, Make Q-functions: Extracting Q-values from Behavior Cloning for On-Robot Reinforcement Learning

cs.RO · 2026-05-06 · unverdicted · none · ref 67
