Journal of Statistical Mechanics: Theory and Experiment

Path integrals, symmetry breaking for optimal control theory

3 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

Imperfect World Models are Exploitable

cs.AI · 2026-05-15 · unverdicted · novelty 8.0

A formal theory proves model exploitation is essentially unavoidable on large policy sets in RL, generalizes reward hacking results, and derives a safe horizon for a relaxed version of exploitation.

Onsager-Machlup Posterior Transport for Deep Gaussian Processes

cs.LG · 2026-05-22 · unverdicted · novelty 7.0

OM-Path uses Onsager-Machlup-regularized posterior transport on Doob-bridged paths for DGP inference and reports statistical wins over DBVI on the two largest UCI regression benchmarks.

QuantFPFlow: Quantum Amplitude Estimation for Fokker-Planck Policy Optimisation in Continuous Reinforcement Learning

cs.LG · 2026-05-14 · unverdicted · novelty 5.0

QuantFPFlow uses quantum amplitude estimation in a Fokker-Planck RL framework to achieve O(1/ε) partition function estimation and reports improved global optimum discovery plus better scaling in continuous control tasks.

citing papers explorer

Showing 3 of 3 citing papers.

Imperfect World Models are Exploitable · cs.AI · 2026-05-15 · unverdicted · none · ref 4
Onsager-Machlup Posterior Transport for Deep Gaussian Processes · cs.LG · 2026-05-22 · unverdicted · none · ref 29
QuantFPFlow: Quantum Amplitude Estimation for Fokker-Planck Policy Optimisation in Continuous Reinforcement Learning · cs.LG · 2026-05-14 · unverdicted · none · ref 6
