Mag- netic control of tokamak plasmas through deep reinforcement learning

Jonas Degrave, Federico Felici, Jonas Buchli, Michael Neunert, Brendan Tracey, Francesco Carpanese, Timo Ewalds, Roland Hafner, Abbas Abdolmaleki, Diego de Las Casas, et al · 2022

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

Koopman-Assisted Reinforcement Learning

cs.AI · 2024-03-04 · unverdicted · novelty 6.0

Koopman-assisted RL reformulates max-entropy algorithms using controlled Koopman tensors and reports SOTA performance versus neural SAC on Lorenz, fluid flow, and other systems.

TreeDQN: Sample-Efficient Off-Policy Reinforcement Learning for Combinatorial Optimization

cs.LG · 2023-06-09 · unverdicted · novelty 6.0

TreeDQN is a sample-efficient off-policy RL method for combinatorial optimization that uses tree MDPs, requires up to 10 times less training data than on-policy methods, and outperforms state-of-the-art on ML4CO tasks.

citing papers explorer

Showing 2 of 2 citing papers.

Koopman-Assisted Reinforcement Learning cs.AI · 2024-03-04 · unverdicted · none · ref 26
Koopman-assisted RL reformulates max-entropy algorithms using controlled Koopman tensors and reports SOTA performance versus neural SAC on Lorenz, fluid flow, and other systems.
TreeDQN: Sample-Efficient Off-Policy Reinforcement Learning for Combinatorial Optimization cs.LG · 2023-06-09 · unverdicted · none · ref 11
TreeDQN is a sample-efficient off-policy RL method for combinatorial optimization that uses tree MDPs, requires up to 10 times less training data than on-policy methods, and outperforms state-of-the-art on ML4CO tasks.

Mag- netic control of tokamak plasmas through deep reinforcement learning

fields

years

verdicts

representative citing papers

citing papers explorer