Tail-Risk-Safe Monte Carlo Tree Search under PAC-Level Guarantees

Zuyuan Zhang, Arnob Ghosh, Tian Lan · arXiv 2508.05441

1 Pith paper cites this work. Polarity classification is still being indexed.


representative citing papers

Operator-Guided Invariance Learning for Continuous Reinforcement Learning

cs.LG · 2026-05-07 · unverdicted · novelty 7.0

VPSD-RL discovers exact and approximate value-preserving Lie-group operators in continuous RL to stabilize learning via transition augmentation and consistency regularization.

citing papers explorer

Showing 1 of 1 citing paper.

Operator-Guided Invariance Learning for Continuous Reinforcement Learning

cs.LG · 2026-05-07 · unverdicted · none · ref 13

VPSD-RL discovers exact and approximate value-preserving Lie-group operators in continuous RL to stabilize learning via transition augmentation and consistency regularization.
