A pac learning algorithm for ltl and omega-regular objectives in mdps

Perez, M · arXiv 2310.12248

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Reinforcement Learning for Reachability: Guaranteeing Asymptotic Optimality

cs.LG · 2026-05-23 · unverdicted · novelty 4.0

Iterative refinement of unknown MDP parameters allows repeated satisfaction of PAC conditions, yielding asymptotic optimality for reachability specifications in RL.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Reinforcement Learning for Reachability: Guaranteeing Asymptotic Optimality cs.LG · 2026-05-23 · unverdicted · none · ref 4
Iterative refinement of unknown MDP parameters allows repeated satisfaction of PAC conditions, yielding asymptotic optimality for reachability specifications in RL.

A pac learning algorithm for ltl and omega-regular objectives in mdps

fields

years

verdicts

representative citing papers

citing papers explorer