Mher: Model-based hindsight experience replay

Rui Yang, Meng Fang, Lei Han, Yali Du, Feng Luo, Xiu Li · 2021 · arXiv 2107.00306

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Preemptive Solving of Future Problems: Multitask Preplay in Humans and Machines

cs.LG · 2025-07-08 · unverdicted · novelty 7.0

Multitask Preplay replays experience from pursued tasks as starting points for counterfactual simulation of unpursued tasks to learn predictive representations that support fast generalization in humans and machines.

QHyer: Q-conditioned Hybrid Attention-mamba Transformer for Offline Goal-conditioned RL

cs.LG · 2026-05-03 · unverdicted · novelty 6.0

QHyer replaces return-to-go with a state-conditioned Q-estimator and adds a gated hybrid attention-mamba backbone to achieve state-of-the-art performance in offline goal-conditioned RL on both Markovian and non-Markovian datasets.

citing papers explorer

Showing 2 of 2 citing papers.

Preemptive Solving of Future Problems: Multitask Preplay in Humans and Machines cs.LG · 2025-07-08 · unverdicted · none · ref 24
Multitask Preplay replays experience from pursued tasks as starting points for counterfactual simulation of unpursued tasks to learn predictive representations that support fast generalization in humans and machines.
QHyer: Q-conditioned Hybrid Attention-mamba Transformer for Offline Goal-conditioned RL cs.LG · 2026-05-03 · unverdicted · none · ref 219
QHyer replaces return-to-go with a state-conditioned Q-estimator and adds a gated hybrid attention-mamba backbone to achieve state-of-the-art performance in offline goal-conditioned RL on both Markovian and non-Markovian datasets.

Mher: Model-based hindsight experience replay

fields

years

verdicts

representative citing papers

citing papers explorer