Box Pose and Shape Estimation and Domain Adaptation for Large-Scale Warehouse Automation

Igor Gilitschenski; Jingnan Shi; Luca Carlone; Rajat Talak; Ulrich Viereck; Xihang Yu

arxiv: 2507.00984 · v1 · pith:HHSDABRQnew · submitted 2025-07-01 · 💻 cs.RO · cs.CV· cs.LG

Box Pose and Shape Estimation and Domain Adaptation for Large-Scale Warehouse Automation

Xihang Yu , Rajat Talak , Jingnan Shi , Ulrich Viereck , Igor Gilitschenski , Luca Carlone This is my paper

classification 💻 cs.RO cs.CVcs.LG

keywords adaptationestimationposeself-supervisedshapeautomationdatadomain

0 comments

read the original abstract

Modern warehouse automation systems rely on fleets of intelligent robots that generate vast amounts of data -- most of which remains unannotated. This paper develops a self-supervised domain adaptation pipeline that leverages real-world, unlabeled data to improve perception models without requiring manual annotations. Our work focuses specifically on estimating the pose and shape of boxes and presents a correct-and-certify pipeline for self-supervised box pose and shape estimation. We extensively evaluate our approach across a range of simulated and real industrial settings, including adaptation to a large-scale real-world dataset of 50,000 images. The self-supervised model significantly outperforms models trained solely in simulation and shows substantial improvements over a zero-shot 3D bounding box estimation baseline.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Picasso: Holistic Scene Reconstruction with Physics-Constrained Sampling
cs.CV 2026-02 unverdicted novelty 6.0

Picasso produces multi-object scene reconstructions that are both geometrically accurate and physically plausible by using physics-constrained rejection sampling over an inferred contact graph, outperforming prior met...