pith. machine review for the scientific record.

arxiv: 1606.03798 · v1 · submitted 2016-06-13 · 💻 cs.CV


Deep Image Homography Estimation

keywords: homography, deep, network, image, estimation, images, approach, convolutional

We present a deep convolutional neural network for estimating the relative homography between a pair of images. Our feed-forward network has 10 layers, takes two stacked grayscale images as input, and produces an 8 degree of freedom homography which can be used to map the pixels from the first image to the second. We present two convolutional neural network architectures for HomographyNet: a regression network which directly estimates the real-valued homography parameters, and a classification network which produces a distribution over quantized homographies. We use a 4-point homography parameterization which maps the four corners from one image into the second image. Our networks are trained in an end-to-end fashion using warped MS-COCO images. Our approach works without the need for separate local feature detection and transformation estimation stages. Our deep models are compared to a traditional homography estimator based on ORB features and we highlight the scenarios where HomographyNet outperforms the traditional technique. We also describe a variety of applications powered by deep homography estimation, thus showcasing the flexibility of a deep learning approach.
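The 4-point parameterization mentioned in the abstract can be related to the usual 3x3 homography matrix with a small linear solve. The following is a minimal sketch, not the authors' code: the function names, the 128x128 patch size, and the corner offsets are illustrative assumptions. It takes the four corners of the first image plus their per-corner displacements and recovers the 8 free entries of the matrix with the last entry fixed to 1.

```python
import numpy as np

def four_point_to_homography(corners, offsets):
    """Recover a 3x3 homography from 4 corners and their (dx, dy) offsets.

    Hypothetical helper for illustration: fixes H[2, 2] = 1 and solves the
    resulting 8x8 linear system for the remaining 8 entries.
    """
    corners = np.asarray(corners, dtype=float)
    targets = corners + np.asarray(offsets, dtype=float)
    A, b = [], []
    for (x, y), (u, v) in zip(corners, targets):
        # u = (h11*x + h12*y + h13) / (h31*x + h32*y + 1), rearranged to be linear in h
        A.append([x, y, 1, 0, 0, 0, -u * x, -u * y])
        # v = (h21*x + h22*y + h23) / (h31*x + h32*y + 1)
        A.append([0, 0, 0, x, y, 1, -v * x, -v * y])
        b.extend([u, v])
    h = np.linalg.solve(np.array(A), np.array(b))
    return np.append(h, 1.0).reshape(3, 3)

def apply_homography(H, points):
    """Map 2D points through H using homogeneous coordinates."""
    pts = np.asarray(points, dtype=float)
    homog = np.hstack([pts, np.ones((len(pts), 1))])
    mapped = homog @ H.T
    return mapped[:, :2] / mapped[:, 2:3]

# Illustrative values only: a 128x128 patch and arbitrary corner displacements.
corners = [(0, 0), (127, 0), (127, 127), (0, 127)]
offsets = [(5, -3), (-8, 4), (10, 10), (-2, -7)]
H = four_point_to_homography(corners, offsets)
print(apply_homography(H, corners))  # should reproduce corners + offsets
```

Under this sketch, a regression network in the style described above would output the eight offset values, and the matrix would be recovered afterwards; eight corner displacements are on a common pixel scale, whereas the entries of the matrix mix rotation, translation, and perspective terms of very different magnitudes.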

This paper has not been read by Pith yet.


Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Graph-based Semantic Calibration Network for Unaligned UAV RGBT Image Semantic Segmentation and A Large-scale Benchmark

    cs.CV · 2026-04 · unverdicted · novelty 7.0

    GSCNet with FDAM and SGCM modules plus the URTF benchmark improves fine-grained semantic segmentation on unaligned UAV RGBT images.

  2. Towards Seamless Lunar Mosaics: Deep Radiometric Normalization for Cross-Sensor Orbital Imagery Using Chandrayaan-2 TMC Data

    cs.CV · 2026-04 · unverdicted · novelty 4.0

    A cGAN with U-Net generator and PatchGAN discriminator learns radiometric normalization from Chandrayaan-2 TMC imagery to LROC WAC reference, yielding improved tonal uniformity and fewer seams than histogram matching.