Aperture Supervision for Monocular Depth Estimation

arXiv: 1711.07933 · v2 · pith:XSOGD7TW · submitted 2017-11-21 · cs.CV

Pratul P. Srinivasan, Rahul Garg, Neal Wadhwa, Ren Ng, Jonathan T. Barron

classification: cs.CV

keywords: aperture, supervision, camera, depth, depths, image, images, scene

Abstract

We present a novel method to train machine learning algorithms to estimate scene depths from a single image, by using the information provided by a camera's aperture as supervision. Prior works use a depth sensor's outputs or images of the same scene from alternate viewpoints as supervision, while our method instead uses images from the same viewpoint taken with a varying camera aperture. To enable learning algorithms to use aperture effects as supervision, we introduce two differentiable aperture rendering functions that use the input image and predicted depths to simulate the depth-of-field effects caused by real camera apertures. We train a monocular depth estimation network end-to-end to predict the scene depths that best explain these finite aperture images as defocus-blurred renderings of the input all-in-focus image.
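To make the idea of rendering a finite-aperture image from an all-in-focus image and a predicted depth map more concrete, here is a minimal NumPy sketch. It is an illustration under stated assumptions, not either of the paper's two rendering functions: it assumes a grayscale image, a per-pixel disparity map, disk-shaped blur kernels whose radius grows linearly with distance from the focal plane, and a simple normalized blend over a few disparity planes that ignores occlusion ordering. The names `disk_kernel`, `blur`, `render_shallow_dof`, `focus_disparity`, `aperture`, and `num_planes` are all hypothetical.

```python
import numpy as np


def disk_kernel(radius):
    """Normalized disk-shaped blur kernel; radius 0 yields a 1x1 identity."""
    r = int(np.ceil(radius))
    y, x = np.mgrid[-r:r + 1, -r:r + 1]
    kernel = (x ** 2 + y ** 2 <= radius ** 2 + 1e-9).astype(np.float64)
    return kernel / kernel.sum()


def blur(img, kernel):
    """Same-size 2D filtering with edge padding (the kernel is symmetric)."""
    r = kernel.shape[0] // 2
    h, w = img.shape
    padded = np.pad(img, r, mode="edge")
    out = np.zeros((h, w), dtype=np.float64)
    for dy in range(kernel.shape[0]):
        for dx in range(kernel.shape[1]):
            out += kernel[dy, dx] * padded[dy:dy + h, dx:dx + w]
    return out


def render_shallow_dof(image, disparity, focus_disparity, aperture, num_planes=8):
    """Simulate defocus blur from an all-in-focus image and a disparity map.

    Assumptions (illustrative only): 2D grayscale image, num_planes >= 2,
    blur radius = aperture * |plane disparity - focus_disparity|,
    and no occlusion handling between planes.
    """
    image = np.asarray(image, dtype=np.float64)
    disparity = np.asarray(disparity, dtype=np.float64)
    planes = np.linspace(disparity.min(), disparity.max(), num_planes)
    spacing = max(planes[1] - planes[0], 1e-8)  # guard against a flat disparity map

    numerator = np.zeros_like(image)
    denominator = np.zeros_like(image)
    for plane in planes:
        # Soft (piecewise-linear) assignment of each pixel to this plane.
        weight = np.clip(1.0 - np.abs(disparity - plane) / spacing, 0.0, 1.0)
        kernel = disk_kernel(aperture * abs(plane - focus_disparity))
        numerator += blur(image * weight, kernel)
        denominator += blur(weight, kernel)
    # Normalize so that blurred plane weights sum to one at every pixel.
    return numerator / np.maximum(denominator, 1e-8)
```

In this sketch the output depends on the disparity map only through the soft plane weights, so if the same operations were ported to an automatic-differentiation framework, a loss such as the mean absolute difference between the rendering and a real shallow depth-of-field photograph could in principle send gradients back to a depth-prediction network. Pixels at the focus disparity stay sharp, while pixels far from it are blurred in proportion to `aperture`.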