Streaming flow policy: Simplifying diffusion/flow- matching policies by treating action trajectories as flow trajectories

Siddharth Ancha · 2025 · arXiv 2505.21851

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

representative citing papers

TouchGuide: Inference-Time Steering of Visuomotor Policies via Touch Guidance

cs.RO · 2026-01-28 · unverdicted · novelty 7.0

TouchGuide improves contact-rich robot manipulation by steering diffusion or flow-matching visuomotor policies with tactile feasibility scores from a contrastively trained Contact Physical Model.

LLM-Guided Future Hypotheses for Horizon-Aware Exploration in Multi-Step Robot Manipulation

cs.RO · 2026-05-28 · unverdicted · novelty 6.0

FEC conditions policies on LLM-guided short-horizon future videos via a three-stage pipeline, yielding performance gains for BC+RL over no-future baselines on RoboCasa and CALVIN while mismatched futures degrade results.

When Life Gives You BC, Make Q-functions: Extracting Q-values from Behavior Cloning for On-Robot Reinforcement Learning

cs.RO · 2026-05-06 · unverdicted · novelty 6.0 · 2 refs

Q2RL extracts Q-values from a BC policy and applies Q-gating to enable efficient offline-to-online RL, outperforming baselines on D4RL/robomimic tasks and achieving up to 100% success on real-robot manipulation in 1-2 hours.

Behavior Uncloning: Distilling Mode Redirection into Policy Weights without Inference-Time Steering

cs.RO · 2026-06-28 · unverdicted · novelty 5.0

MoRE improves robot policy success rates by 44 percentage points by distilling mode redirection into weights, matching filtered retraining performance without inference overhead.

Diffusion-Based Optimization for Accelerated Convergence of Redundant Dual-Arm Minimum Time Problems

cs.RO · 2026-04-17 · unverdicted · novelty 5.0

A novel diffusion variant accelerates minimum-time planning for redundant dual-arm robots by replacing gradient-based solving of the nonconvex high-level problem with probabilistic sampling, yielding 35x faster runtime and 34% less path error.

citing papers explorer

Showing 5 of 5 citing papers after filters.

TouchGuide: Inference-Time Steering of Visuomotor Policies via Touch Guidance cs.RO · 2026-01-28 · unverdicted · none · ref 34
TouchGuide improves contact-rich robot manipulation by steering diffusion or flow-matching visuomotor policies with tactile feasibility scores from a contrastively trained Contact Physical Model.
LLM-Guided Future Hypotheses for Horizon-Aware Exploration in Multi-Step Robot Manipulation cs.RO · 2026-05-28 · unverdicted · none · ref 1
FEC conditions policies on LLM-guided short-horizon future videos via a three-stage pipeline, yielding performance gains for BC+RL over no-future baselines on RoboCasa and CALVIN while mismatched futures degrade results.
When Life Gives You BC, Make Q-functions: Extracting Q-values from Behavior Cloning for On-Robot Reinforcement Learning cs.RO · 2026-05-06 · unverdicted · none · ref 13 · 2 links
Q2RL extracts Q-values from a BC policy and applies Q-gating to enable efficient offline-to-online RL, outperforming baselines on D4RL/robomimic tasks and achieving up to 100% success on real-robot manipulation in 1-2 hours.
Behavior Uncloning: Distilling Mode Redirection into Policy Weights without Inference-Time Steering cs.RO · 2026-06-28 · unverdicted · none · ref 18
MoRE improves robot policy success rates by 44 percentage points by distilling mode redirection into weights, matching filtered retraining performance without inference overhead.
Diffusion-Based Optimization for Accelerated Convergence of Redundant Dual-Arm Minimum Time Problems cs.RO · 2026-04-17 · unverdicted · none · ref 45
A novel diffusion variant accelerates minimum-time planning for redundant dual-arm robots by replacing gradient-based solving of the nonconvex high-level problem with probabilistic sampling, yielding 35x faster runtime and 34% less path error.

Streaming flow policy: Simplifying diffusion/flow- matching policies by treating action trajectories as flow trajectories

fields

years

verdicts

representative citing papers

citing papers explorer