Tucker, and Sergey Levine

Aviral Kumar, Aurick Zhou, G · 2020

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Preserve Support, Not Correspondence: Dynamic Routing for Offline Reinforcement Learning

cs.LG · 2026-04-24 · unverdicted · novelty 7.0

DROL trains one-step offline RL actors via top-1 dynamic routing of dataset actions to latent candidates, enabling local improvements while preserving data support and retaining cheap inference.

citing papers explorer

Showing 1 of 1 citing paper.

Preserve Support, Not Correspondence: Dynamic Routing for Offline Reinforcement Learning cs.LG · 2026-04-24 · unverdicted · none · ref 9
DROL trains one-step offline RL actors via top-1 dynamic routing of dataset actions to latent candidates, enabling local improvements while preserving data support and retaining cheap inference.

Tucker, and Sergey Levine

fields

years

verdicts

representative citing papers

citing papers explorer