Multi-Agent Reinforcement Learning for Autonomous Driving: A Survey

2024 · arXiv 2408.09675

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Overcoming Environmental Meta-Stationarity in MARL via Adaptive Curriculum and Counterfactual Group Advantage

cs.AI · 2025-06-09 · unverdicted · novelty 6.0

CL-MARL uses an adaptive curriculum scheduler called FlexDiff and Counterfactual Group Relative Policy Advantage to break static-difficulty training in MARL and achieve higher win rates on hard StarCraft maps.

Multi-Agent Reinforcement Learning for Safe Autonomous Driving Under Pedestrian Behavioral Uncertainty

cs.LG · 2026-05-18 · unverdicted · novelty 5.0

Co-training an SDC and 12 pedestrians with MAPPO in a MARL setup yields 78% goal success and 14% collisions, versus 35% goal success and 33% collisions for the best rule-based baseline. Jaywalking is linked to 62% of collisions despite accounting for only 13% of events.

citing papers explorer

Showing 2 of 2 citing papers.

Overcoming Environmental Meta-Stationarity in MARL via Adaptive Curriculum and Counterfactual Group Advantage

cs.AI · 2025-06-09 · unverdicted · none · ref 5

Multi-Agent Reinforcement Learning for Safe Autonomous Driving Under Pedestrian Behavioral Uncertainty

cs.LG · 2026-05-18 · unverdicted · none · ref 8