KBSE learns policies and barrier functions iteratively via conditional mean embeddings to bound unsafe state reachability probabilities during exploration in deep RL.
Proceedings of the 36th International Conference on Neural Information Processing Systems , articleno =
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
eess.SY 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Kernel-Based Safe Exploration in Deep Reinforcement Learning
KBSE learns policies and barrier functions iteratively via conditional mean embeddings to bound unsafe state reachability probabilities during exploration in deep RL.