The option-critic architecture

· 2017

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

ORION: Option-Regularized Deep Reinforcement Learning for Cooperative Multi-Agent Online Navigation

cs.RO · 2026-01-03 · unverdicted · novelty 6.0

ORION introduces an option-critic RL method with shared graph encoding and dual-stage cooperation for decentralized multi-agent navigation and exploration in partially known environments, scaling to 10 robots with real-world validation.

UAV Trajectory and Bandwidth Allocation for Efficient Data Collection in Low-Altitude Intelligent IoT: A Hierarchical DRL Approach

cs.CE · 2026-04-25 · unverdicted · novelty 4.0 · 2 refs

A hierarchical DRL method (TBH-DDPG) optimizes UAV trajectories at coarse granularity and bandwidth allocation at fine granularity, reporting 44.44% faster convergence and 58.05% lower computational cost than a non-hierarchical baseline in simulations.

citing papers explorer

Showing 2 of 2 citing papers.

ORION: Option-Regularized Deep Reinforcement Learning for Cooperative Multi-Agent Online Navigation cs.RO · 2026-01-03 · unverdicted · none · ref 26
ORION introduces an option-critic RL method with shared graph encoding and dual-stage cooperation for decentralized multi-agent navigation and exploration in partially known environments, scaling to 10 robots with real-world validation.
UAV Trajectory and Bandwidth Allocation for Efficient Data Collection in Low-Altitude Intelligent IoT: A Hierarchical DRL Approach cs.CE · 2026-04-25 · unverdicted · none · ref 33 · 2 links
A hierarchical DRL method (TBH-DDPG) optimizes UAV trajectories at coarse granularity and bandwidth allocation at fine granularity, reporting 44.44% faster convergence and 58.05% lower computational cost than a non-hierarchical baseline in simulations.

The option-critic architecture

fields

years

verdicts

representative citing papers

citing papers explorer