FP-IRL recovers MDP reward, transition, and policy from trajectories alone by using variational system identification on a Fokker-Planck potential that corresponds to reward maximization.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
verdicts
UNVERDICTED 2roles
background 1polarities
background 1representative citing papers
FragileFlow formalizes margin-aware error flow and applies spectral control through a calibrated margin buffer and class-wise risk matrix, supported by a PAC-Bayes bound, to enhance worst-class robustness in foundation model adaptation while preserving clean accuracy.
citing papers explorer
-
FP-IRL: Fokker--Planck Inverse Reinforcement Learning -- A Physics-Constrained Approach to Markov Decision Processes
FP-IRL recovers MDP reward, transition, and policy from trajectories alone by using variational system identification on a Fokker-Planck potential that corresponds to reward maximization.
-
FragileFlow: Spectral Control of Correct-but-Fragile Predictions for Foundation Model Robustness
FragileFlow formalizes margin-aware error flow and applies spectral control through a calibrated margin buffer and class-wise risk matrix, supported by a PAC-Bayes bound, to enhance worst-class robustness in foundation model adaptation while preserving clean accuracy.