D2-Net: A Trainable CNN for Joint Detection and Description of Local Features

Akihiko Torii; Ignacio Rocco; Josef Sivic; Marc Pollefeys; Mihai Dusmanu; Tomas Pajdla; Torsten Sattler

arxiv: 1905.03561 · v1 · pith:GGOUEZLDnew · submitted 2019-05-09 · 💻 cs.CV

Mihai Dusmanu, Ignacio Rocco, Tomas Pajdla, Marc Pollefeys, Josef Sivic, Akihiko Torii, Torsten Sattler

classification 💻 cs.CV

keywords detection · correspondences · difficult · feature · localization · performance · aachen · address

In this work we address the problem of finding reliable pixel-level correspondences under difficult imaging conditions. We propose an approach where a single convolutional neural network plays a dual role: It is simultaneously a dense feature descriptor and a feature detector. By postponing the detection to a later stage, the obtained keypoints are more stable than their traditional counterparts based on early detection of low-level structures. We show that this model can be trained using pixel correspondences extracted from readily available large-scale SfM reconstructions, without any further annotations. The proposed method obtains state-of-the-art performance on both the difficult Aachen Day-Night localization dataset and the InLoc indoor localization benchmark, as well as competitive performance on other benchmarks for image matching and 3D reconstruction.
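The abstract says one network acts as both dense descriptor and detector, with detection postponed to the feature-map stage, but it does not spell out the detection rule. The following is a minimal NumPy sketch of one plausible reading, not the authors' implementation: a pixel is kept as a keypoint when its strongest channel is also a 3x3 spatial maximum in that channel, and the descriptor is the normalised channel vector at that pixel. The function name `detect_and_describe`, the 3x3 window, and the positivity threshold are all assumptions made for illustration.

```python
import numpy as np


def detect_and_describe(feature_map):
    """Illustrative joint detection and description on a CNN feature map.

    feature_map: array of shape (C, H, W), e.g. activations of a conv layer
    (hypothetical input; any backbone could produce it).
    Returns (keypoints, descriptors): keypoints is (N, 2) as (row, col),
    descriptors is (N, C) with unit L2 norm.
    """
    fmap = np.asarray(feature_map, dtype=float)
    C, H, W = fmap.shape

    # Channel-wise test: strongest channel and its response at every pixel.
    best_channel = fmap.argmax(axis=0)  # (H, W)
    best_value = fmap.max(axis=0)       # (H, W)

    # Spatial test: per-channel maximum over each 3x3 neighbourhood.
    padded = np.pad(fmap, ((0, 0), (1, 1), (1, 1)),
                    mode="constant", constant_values=-np.inf)
    neighbourhood_max = np.full((C, H, W), -np.inf)
    for dy in range(3):
        for dx in range(3):
            window = padded[:, dy:dy + H, dx:dx + W]
            neighbourhood_max = np.maximum(neighbourhood_max, window)

    rows, cols = np.indices((H, W))
    is_spatial_max = best_value >= neighbourhood_max[best_channel, rows, cols]

    # Assumed threshold: ignore pixels with no positive response at all.
    keypoints = np.argwhere(is_spatial_max & (best_value > 0))

    # The descriptor of a keypoint is the full channel vector at that pixel.
    descriptors = fmap[:, keypoints[:, 0], keypoints[:, 1]].T
    norms = np.linalg.norm(descriptors, axis=1, keepdims=True)
    descriptors = descriptors / np.maximum(norms, 1e-12)
    return keypoints, descriptors
```

Because keypoints and descriptors are read from the same tensor, no separate low-level detector runs on the image, which is the sense in which detection is "postponed" here.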

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Efficient 3D Content Reconstruction and Generation
cs.CV · 2026-05 · unverdicted · novelty 5.0

Presents Instant3D for rapid text/image-to-3D generation via multi-view diffusion plus feed-forward reconstruction, and FastMap for 10x faster structure-from-motion with comparable accuracy.

Landscape-Awareness for Geometric View Diffusion Model
cs.CV · 2026-05 · unverdicted · novelty 4.0

A score-based method is introduced to guide optimization in geometric view diffusion models toward correct viewpoints, improving convergence and sample efficiency over naive multistart strategies.