pith. machine review for the scientific record.
sign in

arxiv: 1903.03757 · v2 · pith:QQL5QKAHnew · submitted 2019-03-09 · 💻 cs.CV

Hierarchy Denoising Recursive Autoencoders for 3D Scene Layout Prediction

classification 💻 cs.CV
keywords objectscenehierarchicalvdraecloudcontextdatasetsdenoising
0
0 comments X
read the original abstract

Indoor scenes exhibit rich hierarchical structure in 3D object layouts. Many tasks in 3D scene understanding can benefit from reasoning jointly about the hierarchical context of a scene, and the identities of objects. We present a variational denoising recursive autoencoder (VDRAE) that generates and iteratively refines a hierarchical representation of 3D object layouts, interleaving bottom-up encoding for context aggregation and top-down decoding for propagation. We train our VDRAE on large-scale 3D scene datasets to predict both instance-level segmentations and a 3D object detections from an over-segmentation of an input point cloud. We show that our VDRAE improves object detection performance on real-world 3D point cloud datasets compared to baselines from prior work.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.