Q-transformer: Scalable offline reinforcement learning via autoregressive Q-functions

Yevgen Chebotar, Quan Ho Vuong, Karol Hausman, Fei Xia, Yao Lu, Alex Irpan, Aviral Kumar, Tianhe Yu, Alexander Herzog, Karl Pertsch, Keerthana Gopalakrishnan, Julian Ibarz, Sergey Levine, Adrian Salazar, Chelsea Finn · 2023

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

ACSAC: Adaptive Chunk Size Actor-Critic with Causal Transformer Q-Network

cs.LG · 2026-05-10 · unverdicted · novelty 6.0

ACSAC adaptively selects action chunk sizes via a causal Transformer Q-network in actor-critic RL, proves the Bellman operator is a contraction, and reports state-of-the-art results on long-horizon manipulation tasks.

citing papers explorer

Showing 1 of 1 citing paper.

ACSAC: Adaptive Chunk Size Actor-Critic with Causal Transformer Q-Network cs.LG · 2026-05-10 · unverdicted · none · ref 3
ACSAC adaptively selects action chunk sizes via a causal Transformer Q-network in actor-critic RL, proves the Bellman operator is a contraction, and reports state-of-the-art results on long-horizon manipulation tasks.

Q-transformer: Scalable offline reinforcement learning via autoregressive Q-functions

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer