hub

Omnivtla: Vision- tactile-language-action model with semantic-aligned tactile sensing

Zhengxue Cheng, Yiqian Zhang, Wenkang Zhang, Haoyu Li, Keyu Wang, Li Song, Hengdi Zhang · 2025 · arXiv 2508.08706

14 Pith papers cite this work. Polarity classification is still indexing.

14 Pith papers citing it

read on arXiv browse 14 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 4

citation-polarity summary

background 3 unclear 1

representative citing papers

AT-VLA: Adaptive Tactile Injection for Enhanced Feedback Reaction in Vision-Language-Action Models

cs.RO · 2026-05-08 · unverdicted · novelty 7.0 · 2 refs

AT-VLA proposes adaptive tactile injection and a dual-stream tactile reaction mechanism to enhance VLA models for contact-rich robotic manipulation with real-time responses.

TouchGuide: Inference-Time Steering of Visuomotor Policies via Touch Guidance

cs.RO · 2026-01-28 · unverdicted · novelty 7.0

TouchGuide improves contact-rich robot manipulation by steering diffusion or flow-matching visuomotor policies with tactile feasibility scores from a contrastively trained Contact Physical Model.

TAP-VLA: Tactile Annotation Prompting for Vision Language Action Models

cs.RO · 2026-06-27 · unverdicted · novelty 6.0

TAP-VLA improves VLA performance in contact-rich manipulation by visually annotating tactile shear fields onto input images, reaching 78% success versus under 50% for vision-only and other tactile methods.

Tabero: Learning Gentle Manipulation with Closed-Loop Force Feedback from Vision, Touch, and Language

cs.RO · 2026-05-27 · unverdicted · novelty 6.0

Tabero supplies a data pipeline that turns existing robot trajectories into vision-tactile-language tasks and a VTLA model that keeps task success high while cutting average grip force by over 70 percent under gentle instructions.

Force-Aware Residual DAgger via Trajectory Editing for Precision Insertion with Impedance Control

cs.RO · 2026-03-04 · conditional · novelty 6.0

TER-DAgger improves robotic precision insertion success rates by over 37% via residual policies from edited trajectories and force-aware intervention triggers.

Seeing Touch from Motion: A Unified Modality-Aware Visuo-Tactile Policy with Tactile Motion Correlation

cs.RO · 2026-06-29 · unverdicted · novelty 5.0

A visuo-tactile policy learning method that exploits tactile motion correlation for contact state distinction and Mixture-of-Transformers for cross-modal fusion.

Event-VLA: Action-Conditioned Event Fusion for Robust Vision-Language-Action Model

cs.CV · 2026-06-28 · unverdicted · novelty 5.0

Event-VLA integrates event streams into VLA models through action-conditioned gated cross-attention to maintain performance in normal light while improving success rates under low-light and near-dark conditions.

InvariantCloud: A Globally Invariant, Uniquely Indexed Point Cloud Framework for Robust 6-DoF Tactile Pose Tracking

cs.RO · 2026-05-24 · unverdicted · novelty 5.0

InvariantCloud registers marker-based point clouds in one shot via global invariance to deliver drift-free 6-DoF tactile pose tracking with improved yaw accuracy over prior methods.

ForceFlow: Learning to Feel and Act via Contact-Driven Flow Matching

cs.RO · 2026-05-11 · unverdicted · novelty 5.0

ForceFlow improves success rates by 37% on six real-world contact-rich tasks over ForceVLA by treating force as a global regulatory signal in a flow-matching policy with hierarchical vision-to-force decomposition.

Tactile-based Multimodal Fusion in Embodied Intelligence: A Survey of Vision, Language, and Contact-Driven Paradigms

cs.RO · 2026-05-17 · unverdicted · novelty 4.0

A survey proposing a hierarchical taxonomy for multimodal tactile fusion datasets and methods across perception, generation, and interaction in embodied intelligence.

Towards Robotic Dexterous Hand Intelligence: A Survey

cs.RO · 2026-05-13 · unverdicted · novelty 4.0

A structured survey of dexterous robotic hand research that reviews hardware, control methods, data resources, and benchmarks while identifying major limitations and future directions.

RLDX-1 Technical Report

cs.RO · 2026-05-05 · unverdicted · novelty 4.0 · 2 refs

RLDX-1 outperforms frontier VLAs such as π0.5 and GR00T N1.6 on dexterous manipulation benchmarks, reaching 86.8% success on ALLEX humanoid tasks versus around 40% for the baselines.

E-VLA: Event-Augmented Vision-Language-Action Model for Dark and Blurred Scenes

cs.CV · 2026-04-06

Learning to Feel the Future: DreamTacVLA for Contact-Rich Manipulation

cs.RO · 2025-12-29

citing papers explorer

Showing 11 of 11 citing papers after filters.

AT-VLA: Adaptive Tactile Injection for Enhanced Feedback Reaction in Vision-Language-Action Models cs.RO · 2026-05-08 · unverdicted · none · ref 14 · 2 links
AT-VLA proposes adaptive tactile injection and a dual-stream tactile reaction mechanism to enhance VLA models for contact-rich robotic manipulation with real-time responses.
TouchGuide: Inference-Time Steering of Visuomotor Policies via Touch Guidance cs.RO · 2026-01-28 · unverdicted · none · ref 13
TouchGuide improves contact-rich robot manipulation by steering diffusion or flow-matching visuomotor policies with tactile feasibility scores from a contrastively trained Contact Physical Model.
TAP-VLA: Tactile Annotation Prompting for Vision Language Action Models cs.RO · 2026-06-27 · unverdicted · none · ref 26
TAP-VLA improves VLA performance in contact-rich manipulation by visually annotating tactile shear fields onto input images, reaching 78% success versus under 50% for vision-only and other tactile methods.
Tabero: Learning Gentle Manipulation with Closed-Loop Force Feedback from Vision, Touch, and Language cs.RO · 2026-05-27 · unverdicted · none · ref 4
Tabero supplies a data pipeline that turns existing robot trajectories into vision-tactile-language tasks and a VTLA model that keeps task success high while cutting average grip force by over 70 percent under gentle instructions.
Seeing Touch from Motion: A Unified Modality-Aware Visuo-Tactile Policy with Tactile Motion Correlation cs.RO · 2026-06-29 · unverdicted · none · ref 13
A visuo-tactile policy learning method that exploits tactile motion correlation for contact state distinction and Mixture-of-Transformers for cross-modal fusion.
Event-VLA: Action-Conditioned Event Fusion for Robust Vision-Language-Action Model cs.CV · 2026-06-28 · unverdicted · none · ref 38
Event-VLA integrates event streams into VLA models through action-conditioned gated cross-attention to maintain performance in normal light while improving success rates under low-light and near-dark conditions.
InvariantCloud: A Globally Invariant, Uniquely Indexed Point Cloud Framework for Robust 6-DoF Tactile Pose Tracking cs.RO · 2026-05-24 · unverdicted · none · ref 7
InvariantCloud registers marker-based point clouds in one shot via global invariance to deliver drift-free 6-DoF tactile pose tracking with improved yaw accuracy over prior methods.
ForceFlow: Learning to Feel and Act via Contact-Driven Flow Matching cs.RO · 2026-05-11 · unverdicted · none · ref 38
ForceFlow improves success rates by 37% on six real-world contact-rich tasks over ForceVLA by treating force as a global regulatory signal in a flow-matching policy with hierarchical vision-to-force decomposition.
Tactile-based Multimodal Fusion in Embodied Intelligence: A Survey of Vision, Language, and Contact-Driven Paradigms cs.RO · 2026-05-17 · unverdicted · none · ref 55
A survey proposing a hierarchical taxonomy for multimodal tactile fusion datasets and methods across perception, generation, and interaction in embodied intelligence.
Towards Robotic Dexterous Hand Intelligence: A Survey cs.RO · 2026-05-13 · unverdicted · none · ref 116
A structured survey of dexterous robotic hand research that reviews hardware, control methods, data resources, and benchmarks while identifying major limitations and future directions.
RLDX-1 Technical Report cs.RO · 2026-05-05 · unverdicted · none · ref 26 · 2 links
RLDX-1 outperforms frontier VLAs such as π0.5 and GR00T N1.6 on dexterous manipulation benchmarks, reaching 86.8% success on ALLEX humanoid tasks versus around 40% for the baselines.

Omnivtla: Vision- tactile-language-action model with semantic-aligned tactile sensing

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer