pith. sign in

Safe reinforcement learning via probabilistic shields

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

citation-role summary

background 1

citation-polarity summary

fields

cs.AI 2 cs.CR 1

years

2026 3

verdicts

UNVERDICTED 3

roles

background 1

polarities

background 1

clear filters

representative citing papers

Certified Speculative Execution for Untrusted AI Agents

cs.CR · 2026-06-30 · unverdicted · novelty 7.0

CGPA enables certified speculative execution of untrusted AI proposals in constrained sequential decisions via verifier rejection, conformal boundary gating, and solver deferral, yielding zero violations and regret within noise of the oracle.

citing papers explorer

Showing 3 of 3 citing papers after filters.

  • Certified Speculative Execution for Untrusted AI Agents cs.CR · 2026-06-30 · unverdicted · none · ref 15

    CGPA enables certified speculative execution of untrusted AI proposals in constrained sequential decisions via verifier rejection, conformal boundary gating, and solver deferral, yielding zero violations and regret within noise of the oracle.

  • Neuro-Symbolic Injection of LTLf Constraints in Autoregressive Reinforcement Learning Policies cs.AI · 2026-06-06 · unverdicted · none · ref 35

    A neuro-symbolic framework compiles LTLf formulas to DFAs, derives differentiable satisfaction signals from DFA progression, and uses them as a logic-based regularization loss to enforce temporal constraints in autoregressive transformer RL policies while preserving competitive returns.

  • What if Pinocchio Were a Reinforcement Learning Agent: A Normative End-to-End Pipeline cs.AI · 2026-03-17 · unverdicted · none · ref 98

    The thesis presents Pino, an end-to-end pipeline that supervises reinforcement learning agents with argumentation-based normative advisors, introduces an algorithm for automatic argument extraction, and defines a mitigation strategy for norm avoidance.