EdgeStereo: A Context Integrated Residual Pyramid Network for Stereo Matching
read the original abstract
Recent convolutional neural networks, especially end-to-end disparity estimation models, achieve remarkable performance on stereo matching task. However, existed methods, even with the complicated cascade structure, may fail in the regions of non-textures, boundaries and tiny details. Focus on these problems, we propose a multi-task network EdgeStereo that is composed of a backbone disparity network and an edge sub-network. Given a binocular image pair, our model enables end-to-end prediction of both disparity map and edge map. Basically, we design a context pyramid to encode multi-scale context information in disparity branch, followed by a compact residual pyramid for cascaded refinement. To further preserve subtle details, our EdgeStereo model integrates edge cues by feature embedding and edge-aware smoothness loss regularization. Comparative results demonstrates that stereo matching and edge detection can help each other in the unified model. Furthermore, our method achieves state-of-art performance on both KITTI Stereo and Scene Flow benchmarks, which proves the effectiveness of our design.
This paper has not been read by Pith yet.
Forward citations
Cited by 1 Pith paper
-
BMN: Boundary-Matching Network for Temporal Action Proposal Generation
BMN uses a boundary-matching mechanism to generate precise temporal action proposals with reliable confidence scores simultaneously in an end-to-end framework, improving results on THUMOS-14 and ActivityNet-1.3.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.