MVSNet: Depth Inference for Unstructured Multi-view Stereo

· 2018 · cs.CV · arXiv 1804.02505

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open full Pith review browse 2 citing papers arXiv PDF

abstract

We present an end-to-end deep learning architecture for depth map inference from multi-view images. In the network, we first extract deep visual image features, and then build the 3D cost volume upon the reference camera frustum via the differentiable homography warping. Next, we apply 3D convolutions to regularize and regress the initial depth map, which is then refined with the reference image to generate the final output. Our framework flexibly adapts arbitrary N-view inputs using a variance-based cost metric that maps multiple features into one cost feature. The proposed MVSNet is demonstrated on the large-scale indoor DTU dataset. With simple post-processing, our method not only significantly outperforms previous state-of-the-arts, but also is several times faster in runtime. We also evaluate MVSNet on the complex outdoor Tanks and Temples dataset, where our method ranks first before April 18, 2018 without any fine-tuning, showing the strong generalization ability of MVSNet.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

SonarSweep: Fusing Sonar and Vision for Robust 3D Reconstruction via Plane Sweeping

cs.RO · 2025-11-01 · unverdicted · novelty 7.0

SonarSweep adapts plane sweeping into an end-to-end neural network for sonar-vision fusion to produce dense accurate depth maps that outperform prior methods in high-turbidity underwater conditions.

Efficient 3D Content Reconstruction and Generation

cs.CV · 2026-05-18 · unverdicted · novelty 5.0

Presents Instant3D for rapid text/image-to-3D generation via multi-view diffusion plus feed-forward reconstruction, and FastMap for 10x faster structure-from-motion with comparable accuracy.

citing papers explorer

Showing 2 of 2 citing papers.

SonarSweep: Fusing Sonar and Vision for Robust 3D Reconstruction via Plane Sweeping cs.RO · 2025-11-01 · unverdicted · none · ref 13 · internal anchor
SonarSweep adapts plane sweeping into an end-to-end neural network for sonar-vision fusion to produce dense accurate depth maps that outperform prior methods in high-turbidity underwater conditions.
Efficient 3D Content Reconstruction and Generation cs.CV · 2026-05-18 · unverdicted · none · ref 296 · internal anchor
Presents Instant3D for rapid text/image-to-3D generation via multi-view diffusion plus feed-forward reconstruction, and FastMap for 10x faster structure-from-motion with comparable accuracy.

MVSNet: Depth Inference for Unstructured Multi-view Stereo

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer