pith. sign in

arxiv: 1703.07971 · v3 · pith:QBLWLOOOnew · submitted 2017-03-23 · 💻 cs.CV

Image-based Localization using Hourglass Networks

classification 💻 cs.CV
keywords architecturedatahourglasslayerssingleup-convolutionapproachblur
0
0 comments X
read the original abstract

In this paper, we propose an encoder-decoder convolutional neural network (CNN) architecture for estimating camera pose (orientation and location) from a single RGB-image. The architecture has a hourglass shape consisting of a chain of convolution and up-convolution layers followed by a regression part. The up-convolution layers are introduced to preserve the fine-grained information of the input image. Following the common practice, we train our model in end-to-end manner utilizing transfer learning from large scale classification data. The experiments demonstrate the performance of the approach on data exhibiting different lighting conditions, reflections, and motion blur. The results indicate a clear improvement over the previous state-of-the-art even when compared to methods that utilize sequence of test frames instead of a single frame.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. On the Role of Geometry in Geo-Localization

    cs.CV 2019-06 unverdicted novelty 5.0

    CNNs recover camera pose from lean geometric images of a city by learning its geometry rather than memorizing textures.