Title resolution pending

URL: https://arxiv · arXiv 2305.14154

2 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Stochastic Minimum-Cost Reach-Avoid Reinforcement Learning

cs.LG · 2026-05-12 · unverdicted · novelty 6.0 · 2 refs

Introduces RAPCs and a contraction Bellman operator for cost-optimal policies that satisfy probabilistic reach-avoid specifications in stochastic MDPs, with almost-sure convergence to local optima.

How Does the Lagrangian Guide Safe Reinforcement Learning through Diffusion Models?

cs.LG · 2026-02-02 · unverdicted · novelty 6.0

ALGD augments the Lagrangian to locally convexify the energy landscape in diffusion models, stabilizing safe RL training and generation without changing optimal policies.

citing papers explorer

Showing 2 of 2 citing papers.

Stochastic Minimum-Cost Reach-Avoid Reinforcement Learning

cs.LG · 2026-05-12 · unverdicted · none · ref 10 · 2 links

Introduces RAPCs and a contraction Bellman operator for cost-optimal policies that satisfy probabilistic reach-avoid specifications in stochastic MDPs, with almost-sure convergence to local optima.

How Does the Lagrangian Guide Safe Reinforcement Learning through Diffusion Models?

cs.LG · 2026-02-02 · unverdicted · none · ref 15

ALGD augments the Lagrangian to locally convexify the energy landscape in diffusion models, stabilizing safe RL training and generation without changing optimal policies.
