Title resolution pending

2018

2 Pith papers cite this work. Polarity classification is still indexing.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Identifiable Latent Bandits: Leveraging observational data for personalized decision-making

cs.LG · 2024-07-23 · unverdicted · novelty 6.0

Identifiable latent bandits apply nonlinear ICA to observational data to recover representations sufficient for inferring optimal actions in new instances, shortening exploration time.

Learning World Graphs to Accelerate Hierarchical Reinforcement Learning

cs.LG · 2019-07-01 · unverdicted · novelty 6.0

A two-stage framework learns a world graph of pivotal states task-agnostically via joint training of a latent model and curiosity-driven policy, then uses the graph to accelerate hierarchical RL on maze tasks.

citing papers explorer

Showing 2 of 2 citing papers.

Identifiable Latent Bandits: Leveraging observational data for personalized decision-making

cs.LG · 2024-07-23 · unverdicted · none · ref 44

Identifiable latent bandits apply nonlinear ICA to observational data to recover representations sufficient for inferring optimal actions in new instances, shortening exploration time.

Learning World Graphs to Accelerate Hierarchical Reinforcement Learning

cs.LG · 2019-07-01 · unverdicted · none · ref 82

A two-stage framework learns a world graph of pivotal states task-agnostically via joint training of a latent model and curiosity-driven policy, then uses the graph to accelerate hierarchical RL on maze tasks.
