Title resolution pending

Christopher JCH Watkins, Peter Dayan · 1992

2 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Leveraging Experience in Lazy Search

cs.RO · 2019-07-16 · unverdicted · novelty 6.0

Uses imitation learning from oracles to train an edge-evaluation policy for lazy graph search, outperforming heuristics on 2D and 7D motion planning problems when test instances are similar to training.

Generalizing from a few environments in safety-critical reinforcement learning

cs.LG · 2019-07-02 · unverdicted · novelty 6.0

RL agents fail dangerously on unseen environments; ensembles reduce catastrophes in gridworld but not CoinRun, with uncertainty enabling intervention prediction.

citing papers explorer

Showing 2 of 2 citing papers.

Leveraging Experience in Lazy Search · cs.RO · 2019-07-16 · unverdicted · none · ref 11
Generalizing from a few environments in safety-critical reinforcement learning · cs.LG · 2019-07-02 · unverdicted · none · ref 38
