pith. sign in

Soft actor-critic: Off- policy maximum entropy deep reinforcement learning with a stochastic actor

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

citation-role summary

background 2

citation-polarity summary

roles

background 2

polarities

background 2

representative citing papers

Reinforcement Learning with Action Chunking

cs.LG · 2025-07-10 · unverdicted · novelty 6.0

Q-chunking improves offline-to-online RL sample efficiency on long-horizon sparse-reward manipulation tasks by applying action chunking to TD learning.

Koopman-Assisted Reinforcement Learning

cs.AI · 2024-03-04 · unverdicted · novelty 6.0

Koopman-assisted RL reformulates max-entropy algorithms using controlled Koopman tensors and reports SOTA performance versus neural SAC on Lorenz, fluid flow, and other systems.

citing papers explorer

Showing 4 of 4 citing papers.