arXiv preprint arXiv:2409.08687 , year=

· 2024 · arXiv 2409.08687

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

SARM2: Multi-Task Stage Aware Reward Modeling for Self Improving Robotic Manipulation

cs.RO · 2026-06-09 · unverdicted · novelty 6.0

SARM2 presents RM, a multi-task stage-aware reward model achieving 80% lower value-estimation MSE, which when used in SPIRAL boosts manipulation task success from ~50% to near-perfect on several benchmarks.

Domain Adaptation with Adaptive Imagination for Visual Reinforcement Learning under Limited Target Data

cs.AI · 2026-06-29 · unverdicted · novelty 5.0

AIDA augments scarce target data for sim-to-real visual RL by adaptively truncating unreliable imagined rollouts via a distribution-shift-aware discriminator and applying self-consistency loss on reliable state reconstructions.

citing papers explorer

Showing 2 of 2 citing papers after filters.

SARM2: Multi-Task Stage Aware Reward Modeling for Self Improving Robotic Manipulation cs.RO · 2026-06-09 · unverdicted · none · ref 38
SARM2 presents RM, a multi-task stage-aware reward model achieving 80% lower value-estimation MSE, which when used in SPIRAL boosts manipulation task success from ~50% to near-perfect on several benchmarks.
Domain Adaptation with Adaptive Imagination for Visual Reinforcement Learning under Limited Target Data cs.AI · 2026-06-29 · unverdicted · none · ref 64
AIDA augments scarce target data for sim-to-real visual RL by adaptively truncating unreliable imagined rollouts via a distribution-shift-aware discriminator and applying self-consistency loss on reliable state reconstructions.

arXiv preprint arXiv:2409.08687 , year=

fields

years

verdicts

representative citing papers

citing papers explorer