Safe multi-agent reinforcement learning via shielding,

· 2021 · arXiv 2101.11196

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

read on arXiv browse 2 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Generating Local Shields for Decentralised Partially Observable Markov Decision Processes

cs.MA · 2026-04-08 · unverdicted · novelty 7.0

A process algebra with guarded choice and recursion is compiled to global and then projected local Mealy machines that filter safe joint actions for each agent in Dec-POMDPs using belief-style state subsets.

A Review On Safe Reinforcement Learning Using Lyapunov and Barrier Functions

eess.SY · 2025-08-12 · unverdicted · novelty 2.0

A literature review of safe RL using Lyapunov and barrier functions that identifies a shift to model-free methods since 2017, well-defined open problems per approach class, and high-dimensional scalability as the main barrier.

citing papers explorer

Showing 2 of 2 citing papers.

Generating Local Shields for Decentralised Partially Observable Markov Decision Processes cs.MA · 2026-04-08 · unverdicted · none · ref 3
A process algebra with guarded choice and recursion is compiled to global and then projected local Mealy machines that filter safe joint actions for each agent in Dec-POMDPs using belief-style state subsets.
A Review On Safe Reinforcement Learning Using Lyapunov and Barrier Functions eess.SY · 2025-08-12 · unverdicted · none · ref 32
A literature review of safe RL using Lyapunov and barrier functions that identifies a shift to model-free methods since 2017, well-defined open problems per approach class, and high-dimensional scalability as the main barrier.

Safe multi-agent reinforcement learning via shielding,

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer