Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation

Nicklas Hansen, Hao Su, Xiaolong Wang · 2021

1 Pith paper cites this work. Polarity classification is still being indexed.

representative citing papers

TD-MPC2: Scalable, Robust World Models for Continuous Control

cs.LG · 2023-10-25 · conditional · novelty 6.0

TD-MPC2 scales an implicit world-model RL method to a 317M-parameter agent that masters 80 tasks across four domains with a single hyperparameter configuration.

citing papers explorer

Showing 1 of 1 citing paper.

TD-MPC2: Scalable, Robust World Models for Continuous Control

cs.LG · 2023-10-25 · conditional · none · ref 23
