Seqvla: Sequential task execution for long-horizon manipulation with completion-aware vision- language-action model

· 2025 · arXiv 2509.14138

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

representative citing papers

FurnitureVLA: Learning Long-Horizon Bimanual Furniture Assembly with Vision-Language-Action Model

cs.RO · 2026-07-01 · unverdicted · novelty 6.0

Progress-enhanced VLA model raises simulated bimanual furniture assembly success from 48% to 80% across three furniture types and shows 16% drop on real Kinova robot.

SADP: Subgoal-Aware Diffusion Policy for Explainable Robots Learned from Foundation Model Generated Demonstrations

cs.RO · 2026-05-16 · unverdicted · novelty 5.0

SADP trains diffusion policies on foundation-model-generated subgoal-annotated demonstrations and adds a completion predictor to give robots built-in, subgoal-level explainability alongside improved task performance.

ROG-Grasp: Root-Oriented Geometry for Robotic Grasping and Placement

cs.RO · 2026-05-30 · unverdicted · novelty 3.0

ROG-Grasp estimates produce orientation from root surface geometry via YOLO detection and point cloud plane fitting to generate stable grasp poses and constrained motion plans, achieving higher reliability and speed than VLA policies in tomato and onion experiments.

VILAS: A VLA-Integrated Low-cost Architecture with Soft Grasping for Robotic Manipulation

cs.RO · 2026-05-03 · unverdicted · novelty 3.0 · 2 refs

VILAS integrates low-cost modular hardware with a kirigami soft gripper and evaluates fine-tuned pi_0, pi_0.5, and GR00T N1.6 models on grape grasping using a ZMQ-based teleoperation and deployment framework.

Threading Optimization for Vision-Language-Action Model Inference in Low-Cost Smart Agricultural Manipulation

cs.RO · 2026-05-31 · unverdicted · novelty 2.0

Threading optimization of RTAC for VLA models reduces end-to-end latency and improves stability on low-cost agricultural robotic arms without changing the policy.

citing papers explorer

Showing 5 of 5 citing papers after filters.

FurnitureVLA: Learning Long-Horizon Bimanual Furniture Assembly with Vision-Language-Action Model cs.RO · 2026-07-01 · unverdicted · none · ref 25
Progress-enhanced VLA model raises simulated bimanual furniture assembly success from 48% to 80% across three furniture types and shows 16% drop on real Kinova robot.
SADP: Subgoal-Aware Diffusion Policy for Explainable Robots Learned from Foundation Model Generated Demonstrations cs.RO · 2026-05-16 · unverdicted · none · ref 21
SADP trains diffusion policies on foundation-model-generated subgoal-annotated demonstrations and adds a completion predictor to give robots built-in, subgoal-level explainability alongside improved task performance.
ROG-Grasp: Root-Oriented Geometry for Robotic Grasping and Placement cs.RO · 2026-05-30 · unverdicted · none · ref 24
ROG-Grasp estimates produce orientation from root surface geometry via YOLO detection and point cloud plane fitting to generate stable grasp poses and constrained motion plans, achieving higher reliability and speed than VLA policies in tomato and onion experiments.
VILAS: A VLA-Integrated Low-cost Architecture with Soft Grasping for Robotic Manipulation cs.RO · 2026-05-03 · unverdicted · none · ref 29 · 2 links
VILAS integrates low-cost modular hardware with a kirigami soft gripper and evaluates fine-tuned pi_0, pi_0.5, and GR00T N1.6 models on grape grasping using a ZMQ-based teleoperation and deployment framework.
Threading Optimization for Vision-Language-Action Model Inference in Low-Cost Smart Agricultural Manipulation cs.RO · 2026-05-31 · unverdicted · none · ref 4
Threading optimization of RTAC for VLA models reduces end-to-end latency and improves stability on low-cost agricultural robotic arms without changing the policy.

Seqvla: Sequential task execution for long-horizon manipulation with completion-aware vision- language-action model

fields

years

verdicts

representative citing papers

citing papers explorer