Staggered Environment Resets Improve Massively Parallel On-Policy Reinforcement Learning

2025 · arXiv 2511.21011

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Graph Transformers and Stabilized Reinforcement Learning for Large-Scale Dynamic Routing Modulation and Spectrum Allocation in Elastic Optical Networks

cs.NI · 2026-05-03 · unverdicted · novelty 7.0 · 2 refs

A graph transformer with RL stabilizations is the first to exceed benchmarks for dynamic RMSA, supporting up to 13% more traffic load on networks up to 143 nodes.

Do We Really Need Immediate Resets? Rethinking Collision Handling for Efficient Robot Navigation

cs.RO · 2026-05-04

citing papers explorer

Showing 2 of 2 citing papers.

Graph Transformers and Stabilized Reinforcement Learning for Large-Scale Dynamic Routing Modulation and Spectrum Allocation in Elastic Optical Networks

cs.NI · 2026-05-03 · unverdicted · none · ref 43 · 2 links

A graph transformer with RL stabilizations is the first to exceed benchmarks for dynamic RMSA, supporting up to 13% more traffic load on networks up to 143 nodes.

Do We Really Need Immediate Resets? Rethinking Collision Handling for Efficient Robot Navigation

cs.RO · 2026-05-04 · unreviewed · ref 23
