pith. sign in

arxiv: 1301.3572 · v2 · pith:PB4GFYSFnew · submitted 2013-01-16 · 💻 cs.CV

Indoor Semantic Segmentation using depth information

classification 💻 cs.CV
keywords depthindoorfeaturesinformationscenessegmentationaccuracyaddresses
0
0 comments X
read the original abstract

This work addresses multi-class segmentation of indoor scenes with RGB-D inputs. While this area of research has gained much attention recently, most works still rely on hand-crafted features. In contrast, we apply a multiscale convolutional network to learn features directly from the images and the depth information. We obtain state-of-the-art on the NYU-v2 depth dataset with an accuracy of 64.5%. We illustrate the labeling of indoor scenes in videos sequences that could be processed in real-time using appropriate hardware such as an FPGA.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. DEGround: An Effective Baseline for Ego-centric 3D Visual Grounding with a Homogeneous Framework

    cs.CV 2025-06 unverdicted novelty 6.0

    DEGround presents a unified homogeneous framework for 3D visual grounding with shared queries and two plug-in modules for better instruction alignment, reporting a 7.52% improvement on the EmbodiedScan benchmark.