[Figure (b): Number of vases broken. Mean number of vases broken versus iterations, without and with the h-potential.]

0 · 2012

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

cs.LG · 2019-07-02 · unverdicted · novelty 7.0

Introduces a learned arrow of time in MDPs that aligns with the Jordan-Kinderlehrer-Otto notion for stochastic processes and enables practical RL utilities like reachability and side-effect detection.

citing papers explorer

Learning the Arrow of Time · cs.LG · 2019-07-02 · unverdicted · none · ref 22
