Title resolution pending

doi: 10 · 2022 · arXiv 2022.320734

2 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Learning to Plan, Planning to Learn: Adaptive Hierarchical RL-MPC for Sample-Efficient Decision Making

cs.LG · 2025-12-18 · unverdicted · novelty 6.0

An adaptive RL-MPC framework uses RL to inform MPPI sampling and aggregates MPPI samples for value estimation, delivering up to 72% higher success rates and 2.1x faster convergence on tasks like race driving and Lunar Lander with obstacles.

An Overlay Multicast Routing Method Based on Network Situational Awareness and Hierarchical Multi-Agent Reinforcement Learning

cs.NI · 2026-01-17 · unverdicted · novelty 5.0

MA-DHRL-OM decomposes overlay multicast routing into hierarchical stages with multi-agent RL to improve delay, bandwidth use, and stability over prior methods.

citing papers explorer

Learning to Plan, Planning to Learn: Adaptive Hierarchical RL-MPC for Sample-Efficient Decision Making · cs.LG · 2025-12-18 · unverdicted · none · ref 13
An Overlay Multicast Routing Method Based on Network Situational Awareness and Hierarchical Multi-Agent Reinforcement Learning · cs.NI · 2026-01-17 · unverdicted · none · ref 24
