Name	Name	Last commit message	Last commit date
parent directory ..
assets	assets
D2-Net.drawio	D2-Net.drawio
D2-Net.pdf	D2-Net.pdf
README.md	README.md

Name

Last commit message

Last commit date

Report for the Module 5

Paper Review: D2-Net

Title: D2-Net: A Trainable CNN for Joint Description and Detection of Local Features
Authors: Mihai Dusmanu, Ignacio Rocco, Tomas Pajdla, Marc Pollefeys, Josef Sivic, Akihiko Torii, Torsten Sattler
Link
Tags: Joint Feature Description and Detection, Correspondence, Convolutional Neural Network
Year: 2019

Summary

What:
- The authors proposed a CNN architecture for simultaneous dense feature description and detection in order to find reliable pixel-level correspondences under difficult imaging conditions.
- D2-Net obtains state-of-the-art performance on Aachen Day-Night (outdoor) and InLoc (indoor) localization datasets.
- The method can be integrated into image matching and 3D reconstruction pipelines.
How:
- It's a "single-shot" detect-and-describe (D2) approach. A VGG-16 (up to the conv4_3 layer) backbone is fine-tuned for extracting feature maps:
- Local descriptors (d_ij) are obtained by traversing n feature maps (l2-normalized across channels) at a spatial position (i,j)
- Detections (scores -- s_ij) are obtained by performing a soft versions of non-local-maximum suppression on a feature map (soft local-maximum score α) + non-maximum suppression across each descriptor (ratio-to-maximum score per descriptor β).
- Also, during the inference authors propose to create image pyramids for 3 scales: 0.5, 1, 2; then pass through the network and sum the feature maps (using bilinear interpolation for larger iamges and masking already detected regions to prevent re-detection)
- The objective corresponds to the repeatability of the detector and the distinctiveness of the descriptor. It is an extended triplet margin ranking loss.
Results:
- HPatches