Policy evaluation and temporal-difference learning in continuous time and space: A martingale approach.Journal of Machine Learning Research, 23(154):1–55, 2022
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it