Safe reinforcement learning via probabilistic shields

· 2018 · arXiv 1807.06096

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

Certified Speculative Execution for Untrusted AI Agents

cs.CR · 2026-06-30 · unverdicted · novelty 7.0

CGPA enables certified speculative execution of untrusted AI proposals in constrained sequential decisions via verifier rejection, conformal boundary gating, and solver deferral, yielding zero violations and regret within noise of the oracle.

Neuro-Symbolic Injection of LTLf Constraints in Autoregressive Reinforcement Learning Policies

cs.AI · 2026-06-06 · unverdicted · novelty 6.0

A neuro-symbolic framework compiles LTLf formulas to DFAs, derives differentiable satisfaction signals from DFA progression, and uses them as a logic-based regularization loss to enforce temporal constraints in autoregressive transformer RL policies while preserving competitive returns.

What if Pinocchio Were a Reinforcement Learning Agent: A Normative End-to-End Pipeline

cs.AI · 2026-03-17 · unverdicted · novelty 5.0

The thesis presents Pino, an end-to-end pipeline that supervises reinforcement learning agents with argumentation-based normative advisors, introduces an algorithm for automatic argument extraction, and defines a mitigation strategy for norm avoidance.

citing papers explorer

Showing 3 of 3 citing papers after filters.

Certified Speculative Execution for Untrusted AI Agents cs.CR · 2026-06-30 · unverdicted · none · ref 15
CGPA enables certified speculative execution of untrusted AI proposals in constrained sequential decisions via verifier rejection, conformal boundary gating, and solver deferral, yielding zero violations and regret within noise of the oracle.
Neuro-Symbolic Injection of LTLf Constraints in Autoregressive Reinforcement Learning Policies cs.AI · 2026-06-06 · unverdicted · none · ref 35
A neuro-symbolic framework compiles LTLf formulas to DFAs, derives differentiable satisfaction signals from DFA progression, and uses them as a logic-based regularization loss to enforce temporal constraints in autoregressive transformer RL policies while preserving competitive returns.
What if Pinocchio Were a Reinforcement Learning Agent: A Normative End-to-End Pipeline cs.AI · 2026-03-17 · unverdicted · none · ref 98
The thesis presents Pino, an end-to-end pipeline that supervises reinforcement learning agents with argumentation-based normative advisors, introduces an algorithm for automatic argument extraction, and defines a mitigation strategy for norm avoidance.

Safe reinforcement learning via probabilistic shields

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer