Addressing function approxi- mation error in actor-critic methods,

· 2018

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

browse 2 citing papers

representative citing papers

ATRS: Adaptive Trajectory Re-splitting via a Shared Neural Policy for Parallel Optimization

cs.RO · 2026-04-24 · unverdicted · novelty 7.0

ATRS uses a shared neural policy in a multi-agent MDP to adaptively re-split trajectory segments during parallel ADMM optimization, cutting iterations by up to 26% and time by 19.1% with zero-shot generalization.

Can Tabular Foundation Models Guide Exploration in Robot Policy Learning?

cs.RO · 2026-04-30 · unverdicted · novelty 5.0

TFM-S3 uses a tabular foundation model to predict returns and guide intermittent global exploration within an SVD-derived policy subspace, yielding faster early convergence and better final performance than TD3 and population-based methods under fixed rollout budgets.

citing papers explorer

Showing 2 of 2 citing papers.

ATRS: Adaptive Trajectory Re-splitting via a Shared Neural Policy for Parallel Optimization cs.RO · 2026-04-24 · unverdicted · none · ref 18
ATRS uses a shared neural policy in a multi-agent MDP to adaptively re-split trajectory segments during parallel ADMM optimization, cutting iterations by up to 26% and time by 19.1% with zero-shot generalization.
Can Tabular Foundation Models Guide Exploration in Robot Policy Learning? cs.RO · 2026-04-30 · unverdicted · none · ref 3
TFM-S3 uses a tabular foundation model to predict returns and guide intermittent global exploration within an SVD-derived policy subspace, yielding faster early convergence and better final performance than TD3 and population-based methods under fixed rollout budgets.

Addressing function approxi- mation error in actor-critic methods,

fields

years

verdicts

representative citing papers

citing papers explorer