MVTrans: Multi-View Perception of Transparent Objects

Alan Aspuru-Guzik; Animesh Garg; Florian Shkurti; Haoping Xu; Saggi Eppel; Yi Ru Wang; Yuchi Zhao

arxiv: 2302.11683 · v1 · pith:S3JC3BRGnew · submitted 2023-02-22 · 💻 cs.RO · cs.AI· cs.CV

MVTrans: Multi-View Perception of Transparent Objects

Yi Ru Wang , Yuchi Zhao , Haoping Xu , Saggi Eppel , Alan Aspuru-Guzik , Florian Shkurti , Animesh Garg This is my paper

classification 💻 cs.RO cs.AIcs.CV

keywords perceptiontransparentdepthestimationmulti-viewmvtransobjectrgb-d

0 comments

read the original abstract

Transparent object perception is a crucial skill for applications such as robot manipulation in household and laboratory settings. Existing methods utilize RGB-D or stereo inputs to handle a subset of perception tasks including depth and pose estimation. However, transparent object perception remains to be an open problem. In this paper, we forgo the unreliable depth map from RGB-D sensors and extend the stereo based method. Our proposed method, MVTrans, is an end-to-end multi-view architecture with multiple perception capabilities, including depth estimation, segmentation, and pose estimation. Additionally, we establish a novel procedural photo-realistic dataset generation pipeline and create a large-scale transparent object detection dataset, Syn-TODD, which is suitable for training networks with all three modalities, RGB-D, stereo and multi-view RGB. Project Site: https://ac-rad.github.io/MVTrans/

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Trans2Occ: Voxel Occupancy Estimation and Grasp for Transparent Objects from Simulation to Reality
cs.RO 2026-06 unverdicted novelty 4.0

A simulation-trained model predicts voxel occupancy from single RGB views for transparent object grasping and transfers to real robotic setups without fine-tuning.