pith. sign in

Pérez, and Marnix Suilen

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

citation-role summary

background 1

citation-polarity summary

fields

cs.LG 2

years

2026 2

verdicts

UNVERDICTED 2

roles

background 1

polarities

background 1

representative citing papers

Robust Parameter Learning for Uncertain MDPs

cs.LG · 2026-05-02 · unverdicted · novelty 7.0

Parametric MDPs enable PAC uncertainty models for MDPs by projecting empirical frequencies onto parameter space with polytopic outer approximations, yielding tighter estimates than independent interval methods.

citing papers explorer

Showing 2 of 2 citing papers.

  • Robust Parameter Learning for Uncertain MDPs cs.LG · 2026-05-02 · unverdicted · none · ref 21

    Parametric MDPs enable PAC uncertainty models for MDPs by projecting empirical frequencies onto parameter space with polytopic outer approximations, yielding tighter estimates than independent interval methods.

  • Robust Probabilistic Shielding for Safe Offline Reinforcement Learning cs.LG · 2026-05-11 · unverdicted · none · ref 16

    Shielding the policy improvement process in offline RL yields policies that are safe with high probability while outperforming unshielded baselines in both average and worst-case performance, especially under limited data.