Joint Unsupervised Learning of Optical Flow and Depth by Watching Stereo Videos

Chenxu Luo; Peng Wang; Wei Xu; Yang Wang; Yi Yang; Zhenheng Yang

arxiv: 1810.03654 · v1 · pith:Q3GV7KY5new · submitted 2018-10-08 · 💻 cs.CV

Joint Unsupervised Learning of Optical Flow and Depth by Watching Stereo Videos

Yang Wang , Zhenheng Yang , Peng Wang , Yi Yang , Chenxu Luo , Wei Xu This is my paper

classification 💻 cs.CV

keywords flowopticaldepthego-motionrigidstereovideosestimated

0 comments

read the original abstract

Learning depth and optical flow via deep neural networks by watching videos has made significant progress recently. In this paper, we jointly solve the two tasks by exploiting the underlying geometric rules within stereo videos. Specifically, given two consecutive stereo image pairs from a video, we first estimate depth, camera ego-motion and optical flow from three neural networks. Then the whole scene is decomposed into moving foreground and static background by compar- ing the estimated optical flow and rigid flow derived from the depth and ego-motion. We propose a novel consistency loss to let the optical flow learn from the more accurate rigid flow in static regions. We also design a rigid alignment module which helps refine ego-motion estimation by using the estimated depth and optical flow. Experiments on the KITTI dataset show that our results significantly outperform other state-of- the-art algorithms. Source codes can be found at https: //github.com/baidu-research/UnDepthflow

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

SGANVO: Unsupervised Deep Visual Odometry and Depth Estimation with Stacked Generative Adversarial Networks
cs.RO 2019-06 unverdicted novelty 6.0

SGANVO uses stacked GAN layers with recurrent connections to estimate depth and ego-motion unsupervisedly from images, reporting better or comparable results on the KITTI dataset.