Title resolution pending

· 2024 · arXiv 2407.16677

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

EgoEngine: From Egocentric Human Videos to High-Fidelity Dexterous Robot Demonstrations

cs.RO · 2026-06-10 · unverdicted · novelty 7.0

EgoEngine transforms egocentric human videos into high-fidelity robot data enabling zero-shot visuomotor dexterous policy learning without real-robot demonstrations.

Dynamic Execution Horizon Prediction for Chunk-based Robot Policies

cs.RO · 2026-06-09 · unverdicted · novelty 7.0

DEHP adds an online-RL horizon predictor to frozen chunk policies, yielding higher success on precise and long-horizon robot manipulation by adapting chunk length to task stage.

EXPO: Stable Reinforcement Learning with Expressive Policies

cs.LG · 2025-07-10 · conditional · novelty 7.0

EXPO stabilizes online RL for expressive policies by training a base policy with imitation and using a lightweight Gaussian edit policy to select higher-value actions on the fly for sampling and TD backups.

Steering Your Diffusion Policy with Latent Space Reinforcement Learning

cs.RO · 2025-06-18 · unverdicted · novelty 7.0

DSRL steers pretrained diffusion policies for robotics by applying RL to their latent noise inputs, achieving sample-efficient real-world adaptation with only black-box access.

RoHIL: Robust Human-in-the-Loop Robotic Reinforcement Learning Against Illumination Variations

cs.RO · 2026-05-19 · unverdicted · novelty 6.0

RoHIL adapts human-in-the-loop RL policies to new illumination conditions offline by combining world-model image relighting, illumination-retention replay, and anchored Bellman regularisation, improving shifted-light performance while preserving source performance on four real-robot tasks.

Diffusion Policy Policy Optimization

cs.RO · 2024-09-01 · unverdicted · novelty 6.0

DPPO fine-tunes diffusion policies via policy gradients and outperforms prior RL approaches for diffusion policies and PG-tuned alternatives on robot benchmarks while enabling stable training and hardware deployment.

HDFlow: Hierarchical Diffusion-Flow Planning for Long-horizon Tasks

cs.RO · 2026-05-06 · unverdicted · novelty 5.0 · 2 refs

HDFlow pairs a high-level diffusion planner for strategic subgoals with a low-level rectified flow planner for efficient trajectories, claiming superior performance on furniture assembly and other long-horizon robotic benchmarks.

Robot Self-Improvement via Human-Video Dynamics Models

cs.RO · 2026-06-19 · unverdicted · novelty 4.0

Human-video dynamics models enable cross-embodiment robot self-improvement via training-free Dynamics-Guided Action Correction, raising success rates from 40% to 81% on seven real-world tasks.

citing papers explorer

Showing 7 of 7 citing papers after filters.

EgoEngine: From Egocentric Human Videos to High-Fidelity Dexterous Robot Demonstrations cs.RO · 2026-06-10 · unverdicted · none · ref 53
EgoEngine transforms egocentric human videos into high-fidelity robot data enabling zero-shot visuomotor dexterous policy learning without real-robot demonstrations.
Dynamic Execution Horizon Prediction for Chunk-based Robot Policies cs.RO · 2026-06-09 · unverdicted · none · ref 1
DEHP adds an online-RL horizon predictor to frozen chunk policies, yielding higher success on precise and long-horizon robot manipulation by adapting chunk length to task stage.
Steering Your Diffusion Policy with Latent Space Reinforcement Learning cs.RO · 2025-06-18 · unverdicted · none · ref 64
DSRL steers pretrained diffusion policies for robotics by applying RL to their latent noise inputs, achieving sample-efficient real-world adaptation with only black-box access.
RoHIL: Robust Human-in-the-Loop Robotic Reinforcement Learning Against Illumination Variations cs.RO · 2026-05-19 · unverdicted · none · ref 2
RoHIL adapts human-in-the-loop RL policies to new illumination conditions offline by combining world-model image relighting, illumination-retention replay, and anchored Bellman regularisation, improving shifted-light performance while preserving source performance on four real-robot tasks.
Diffusion Policy Policy Optimization cs.RO · 2024-09-01 · unverdicted · none · ref 6
DPPO fine-tunes diffusion policies via policy gradients and outperforms prior RL approaches for diffusion policies and PG-tuned alternatives on robot benchmarks while enabling stable training and hardware deployment.
HDFlow: Hierarchical Diffusion-Flow Planning for Long-horizon Tasks cs.RO · 2026-05-06 · unverdicted · none · ref 1 · 2 links
HDFlow pairs a high-level diffusion planner for strategic subgoals with a low-level rectified flow planner for efficient trajectories, claiming superior performance on furniture assembly and other long-horizon robotic benchmarks.
Robot Self-Improvement via Human-Video Dynamics Models cs.RO · 2026-06-19 · unverdicted · none · ref 52
Human-video dynamics models enable cross-embodiment robot self-improvement via training-free Dynamics-Guided Action Correction, raising success rates from 40% to 81% on seven real-world tasks.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer