DPSNet: End-to-end Deep Plane Sweep Stereo

arxiv: 1905.00538 · v1 · pith:LX3ZY4ZVnew · submitted 2019-05-02 · 💻 cs.CV · cs.RO

DPSNet: End-to-end Deep Plane Sweep Stereo

Sunghoon Im , Hae-Gon Jeon , Stephen Lin , In So Kweon This is my paper

classification 💻 cs.CV cs.RO

keywords deepcostdepthdpsnetplanesweepvolumelearning

0 comments p. Extension

pith:LX3ZY4ZV Add to your LaTeX paper

What is a Pith Number?

\usepackage{pith}
\pithnumber{LX3ZY4ZV}

Prints a linked pith:LX3ZY4ZV badge after your title and writes the identifier into PDF metadata. Compiles on arXiv with no extra files. Learn more

read the original abstract

Multiview stereo aims to reconstruct scene depth from images acquired by a camera under arbitrary motion. Recent methods address this problem through deep learning, which can utilize semantic cues to deal with challenges such as textureless and reflective regions. In this paper, we present a convolutional neural network called DPSNet (Deep Plane Sweep Network) whose design is inspired by best practices of traditional geometry-based approaches for dense depth reconstruction. Rather than directly estimating depth and/or optical flow correspondence from image pairs as done in many previous deep learning methods, DPSNet takes a plane sweep approach that involves building a cost volume from deep features using the plane sweep algorithm, regularizing the cost volume via a context-aware cost aggregation, and regressing the dense depth map from the cost volume. The cost volume is constructed using a differentiable warping process that allows for end-to-end training of the network. Through the effective incorporation of conventional multiview stereo concepts within a deep learning framework, DPSNet achieves state-of-the-art reconstruction results on a variety of challenging datasets.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

CylinderDepth: Cylindrical Spatial Attention for Multi-View Consistent Self-Supervised Surround Depth Estimation
cs.CV 2025-11 unverdicted novelty 6.0

CylinderDepth uses cylindrical spatial attention with non-learned weights to enforce cross-view consistency in self-supervised surround depth estimation.