Homography from two orientation- and scale-covariant features
Pith reviewed 2026-05-25 14:42 UTC · model grok-4.3
The pith
Two orientation- and scale-covariant features suffice to estimate a homography by deriving new constraints from their scales and rotations.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By providing a geometric interpretation of the angles and scales returned by orientation- and scale-covariant feature detectors, two new constraints on these quantities are obtained. When restricted to a homography, the constraints yield a minimal solver that recovers the homography from exactly two correspondences. Normalization of the correspondences is shown to keep the recovered rotation and scale parameters numerically stable.
What carries the argument
Two new constraints on scales and rotations derived from the geometric effect of a homography on covariant features.
If this is right
- Homography estimation requires only two feature pairs instead of four.
- RANSAC needs substantially fewer iterations because each hypothesis is drawn from two correspondences rather than four.
- The scale and rotation information already supplied by detectors such as SIFT is used at no extra cost.
- The same scale-rotation constraints can be inserted into any other geometric estimation task that involves a homography.
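The iteration saving claimed in the list above can be made concrete with the standard RANSAC stopping criterion, N = log(1 - p) / log(1 - w^m), where w is the inlier ratio, m the sample size and p the desired confidence. The sketch below is illustrative only: the function name `ransac_iterations` and the chosen inlier ratios are assumptions made here, not values from the paper.

```python
import math

def ransac_iterations(inlier_ratio: float, sample_size: int, confidence: float = 0.99) -> int:
    """Iterations needed so that, with probability `confidence`, at least one
    random sample of `sample_size` correspondences contains only inliers."""
    p_all_inliers = inlier_ratio ** sample_size
    if p_all_inliers <= 0.0:
        raise ValueError("inlier_ratio must be positive")
    if p_all_inliers >= 1.0:
        return 1
    return math.ceil(math.log(1.0 - confidence) / math.log(1.0 - p_all_inliers))

# Compare a two-correspondence solver against the four-point algorithm
# for a few hypothetical inlier ratios.
for w in (0.3, 0.5, 0.7):
    n_two = ransac_iterations(w, sample_size=2)
    n_four = ransac_iterations(w, sample_size=4)
    print(f"inlier ratio {w:.1f}: two-point {n_two} iterations, four-point {n_four} iterations")
```

The gap widens as the inlier ratio drops, because the probability of an all-inlier sample decays as w^m.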
Where Pith is reading between the lines
- The method could be extended to other planar transformations whose action on local scale and orientation can be written in closed form.
- In practice the two-point solver would be combined with the four-point solver inside a hybrid RANSAC loop to handle both minimal and over-determined cases.
- Numerical stability gains from normalization suggest that similar preprocessing steps may improve other minimal solvers that recover rotation or scale parameters.
Load-bearing premise
The angles and scales reported by the detectors correspond directly to the geometric quantities transformed by the homography.
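One way to see what this premise asks for is to compute how a homography acts locally at a point. The sketch below linearizes H at a point (its 2×2 Jacobian) and reads off a scale ratio and a transformed orientation. Treating the scale ratio as the square root of the absolute Jacobian determinant, and the orientation as the direction of the mapped unit vector, is an assumption made here for illustration; it is not necessarily the exact form of the constraints derived in the paper. All function names are hypothetical.

```python
import numpy as np

def apply_homography(H, pt):
    """Map a 2D point through H and dehomogenize."""
    u, v, w = H @ np.array([pt[0], pt[1], 1.0])
    return np.array([u / w, v / w])

def homography_jacobian(H, pt):
    """2x2 Jacobian of the map pt -> apply_homography(H, pt)."""
    u, v, w = H @ np.array([pt[0], pt[1], 1.0])
    projected = np.array([u / w, v / w])
    return (H[:2, :2] - np.outer(projected, H[2, :2])) / w

def predicted_scale_and_angle(H, pt, angle1):
    """Assumed reading of the premise: local scale ratio and mapped orientation."""
    A = homography_jacobian(H, pt)
    scale_ratio = np.sqrt(abs(np.linalg.det(A)))   # local area change, as a length ratio
    direction = A @ np.array([np.cos(angle1), np.sin(angle1)])
    angle2 = np.arctan2(direction[1], direction[0])
    return scale_ratio, angle2

# Sanity case: for a pure similarity the answer is known in closed form.
theta, s = 0.3, 2.0
H_sim = np.array([[s * np.cos(theta), -s * np.sin(theta), 5.0],
                  [s * np.sin(theta),  s * np.cos(theta), -1.0],
                  [0.0, 0.0, 1.0]])
print(predicted_scale_and_angle(H_sim, (1.0, 2.0), angle1=0.2))
```

If detector outputs deviate systematically from these predicted quantities on real images, the premise is weakened regardless of how the solver itself is derived.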
What would settle it
Run the two-point solver on synthetic homographies with known ground-truth scales and rotations; if the recovered homography matrix deviates substantially from the ground truth while the four-point DLT succeeds, the derived constraints do not hold.
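The baseline half of this experiment is easy to sketch. The code below builds a synthetic homography, maps four points through it, and recovers it with the standard four-point DLT; a two-point solver would be evaluated against the same error measure. The specific matrix `H_true`, the test points and the helper names are assumptions chosen for illustration, and the two-point solver itself is not reproduced here.

```python
import numpy as np

def project(H, pts):
    """Apply H to an (N, 2) array of points and dehomogenize."""
    ph = np.hstack([pts, np.ones((len(pts), 1))]) @ H.T
    return ph[:, :2] / ph[:, 2:3]

def dlt_homography(src, dst):
    """Direct linear transform: two linear equations per point pair."""
    rows = []
    for (x, y), (u, v) in zip(src, dst):
        rows.append([x, y, 1, 0, 0, 0, -u * x, -u * y, -u])
        rows.append([0, 0, 0, x, y, 1, -v * x, -v * y, -v])
    _, _, vt = np.linalg.svd(np.asarray(rows, dtype=float))
    return vt[-1].reshape(3, 3)   # null vector of the 8x9 system

def homography_error(H_est, H_true):
    """Frobenius distance after removing the arbitrary scale and sign."""
    a = H_est / np.linalg.norm(H_est)
    b = H_true / np.linalg.norm(H_true)
    return min(np.linalg.norm(a - b), np.linalg.norm(a + b))

# Hypothetical ground truth and a non-degenerate set of four points.
H_true = np.array([[1.10, 0.10, 0.20],
                   [-0.05, 0.90, 0.10],
                   [0.05, -0.02, 1.00]])
src = np.array([[0.0, 0.0], [1.0, 0.0], [1.0, 1.0], [0.0, 1.0]])
dst = project(H_true, src)

err = homography_error(dlt_homography(src, dst), H_true)
print("four-point DLT error:", err)
```

On noise-free data the DLT error should be at machine-precision level, so any substantial error from a two-point solver on the same scene would point at the constraints rather than at the data.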
Original abstract
This paper proposes a geometric interpretation of the angles and scales which the orientation- and scale-covariant feature detectors, e.g. SIFT, provide. Two new general constraints are derived on the scales and rotations which can be used in any geometric model estimation tasks. Using these formulas, two new constraints on homography estimation are introduced. Exploiting the derived equations, a solver for estimating the homography from the minimal number of two correspondences is proposed. Also, it is shown how the normalization of the point correspondences affects the rotation and scale parameters, thus achieving numerically stable results. Due to requiring merely two feature pairs, robust estimators, e.g. RANSAC, do significantly fewer iterations than by using the four-point algorithm. When using covariant features, e.g. SIFT, the information about the scale and orientation is given at no cost. The proposed homography estimation method is tested in a synthetic environment and on publicly available real-world datasets.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper derives two new general constraints on the scales and rotations induced by a homography acting on orientation- and scale-covariant features (e.g., SIFT). These constraints, together with the standard point correspondences, enable a minimal solver for the eight degrees of freedom of a homography from only two feature pairs. The manuscript also shows how point normalization affects the rotation and scale parameters to improve numerical stability and demonstrates the approach on synthetic data and public real-world datasets.
Significance. If the geometric constraints are independent and correctly derived, the method reduces the minimal sample size for homography estimation from four to two points when scale and orientation are already available from the detector. This directly lowers the number of RANSAC iterations required for robust estimation and exploits information that is obtained at no extra cost, which is a practical advantage for vision pipelines that rely on covariant features.
major comments (1)
- [Derivation of constraints] The derivation section must explicitly verify that the two new scale/orientation constraints per correspondence are linearly independent of the two point equations and of each other; otherwise the eight-equation system for two correspondences may be rank-deficient. The abstract implies that two correspondences supply enough independent equations for the eight degrees of freedom, but no rank argument or degeneracy analysis is referenced in the provided description.
minor comments (2)
- [Normalization] The normalization procedure for maintaining numerical stability should include a brief statement of the condition number improvement or a small synthetic example quantifying the effect on the estimated homography.
- [Experiments] A short discussion of degenerate configurations (e.g., when the two features are collinear or have identical scales) would help readers assess practical applicability.
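For the normalization comment above, a small sketch can show what is at stake. It assumes Hartley-style normalization (translate the centroid to the origin, scale isotropically so the mean distance from the origin is √2); the paper may use a different scheme. Under such a transform, which is a translation plus uniform scaling, feature orientations are unchanged and feature scales are multiplied by the same factor as the coordinates. The function names are hypothetical.

```python
import numpy as np

def normalizing_similarity(pts):
    """Hartley-style normalization: centroid to origin, mean distance sqrt(2)."""
    pts = np.asarray(pts, dtype=float)
    centroid = pts.mean(axis=0)
    mean_dist = np.linalg.norm(pts - centroid, axis=1).mean()
    s = np.sqrt(2.0) / mean_dist
    T = np.array([[s, 0.0, -s * centroid[0]],
                  [0.0, s, -s * centroid[1]],
                  [0.0, 0.0, 1.0]])
    return T, s

def normalize_features(pts, scales, angles):
    """Normalize keypoint positions and adjust their scales accordingly."""
    pts = np.asarray(pts, dtype=float)
    T, s = normalizing_similarity(pts)
    ph = np.hstack([pts, np.ones((len(pts), 1))]) @ T.T
    # Uniform scaling multiplies feature scales by s; angles are unaffected
    # because translation and isotropic scaling preserve directions.
    return ph[:, :2], s * np.asarray(scales, dtype=float), np.asarray(angles, dtype=float), T

# Two hypothetical features in one image (the minimal sample size).
pts = [[120.0, 80.0], [300.0, 210.0]]
pts_n, scales_n, angles_n, T = normalize_features(pts, scales=[2.5, 4.0], angles=[0.3, -1.1])
print(pts_n, scales_n, angles_n)
```

If both images are normalized with transforms T1 and T2, a homography estimated in normalized coordinates is mapped back as inv(T2) @ H_normalized @ T1, which is the usual denormalization step for the DLT.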
Simulated Author's Rebuttal
We thank the referee for the constructive review and positive assessment of the work. We address the single major comment below.
Referee: [Derivation of constraints] The derivation section must explicitly verify that the two new scale/orientation constraints per correspondence are linearly independent of the two point equations and of each other; otherwise the eight-equation system for two correspondences may be rank-deficient. The abstract implies that two correspondences supply enough independent equations for the eight degrees of freedom, but no rank argument or degeneracy analysis is referenced in the provided description.
Authors: We agree that an explicit verification strengthens the paper. The two point equations per correspondence arise solely from the positional mapping x' = Hx. The two new constraints are obtained by applying the homography to the local affine frame encoded by each covariant feature: one from the singular values (scale change) and one from the argument of the complex representation (orientation change). These act on distinct components of the 3×3 homography matrix and are therefore algebraically independent of the positional equations and of each other. Nevertheless, to satisfy the request we will add a short rank analysis (via the Jacobian of the eight-equation system evaluated at generic points) together with a brief degeneracy discussion in the revised derivation section.
Revision: yes
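The counting behind this exchange can be checked numerically for the positional equations alone. The sketch below stacks the standard DLT rows for two point correspondences and inspects their rank; it shows how many degrees of freedom the point equations leave open, and therefore how many independent extra constraints the scale and orientation terms must contribute. It does not test the paper's new constraints, which are not reproduced here. The sample coordinates and the helper name are assumptions.

```python
import numpy as np

def point_constraint_rows(src, dst):
    """Two linear DLT rows per correspondence, acting on the 9 entries of H."""
    rows = []
    for (x, y), (u, v) in zip(src, dst):
        rows.append([x, y, 1, 0, 0, 0, -u * x, -u * y, -u])
        rows.append([0, 0, 0, x, y, 1, -v * x, -v * y, -v])
    return np.asarray(rows, dtype=float)

# Two hypothetical correspondences in general position.
src = [(0.0, 0.0), (1.0, 0.0)]
dst = [(1.0, 2.0), (3.0, 1.0)]

A = point_constraint_rows(src, dst)
rank = np.linalg.matrix_rank(A)
null_dim = A.shape[1] - rank        # dimension of the solution space for vec(H)
unresolved_dof = null_dim - 1       # H is only defined up to scale

print("rank of point equations:", rank)
print("degrees of freedom still open:", unresolved_dof)
```

A full rank analysis of the kind the authors promise would append the linearized scale and orientation constraints to this system and confirm that the combined rank reaches eight at generic points.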
Circularity Check
No significant circularity; derivation is geometrically self-contained
The paper presents a first-principles geometric derivation of two new constraints relating local scale and orientation changes under a homography, obtained directly from the transformation properties of covariant features. These constraints are combined with the standard two-point equations per correspondence to produce an eight-equation system for the eight degrees of freedom of H, enabling a two-correspondence solver. No fitted parameters are renamed as predictions, no self-citation chain supplies the central equations, and the normalization discussion addresses numerical stability rather than redefining the result. The derivation therefore stands independently of its own outputs.
Axiom & Free-Parameter Ledger
axioms (1)
- Domain assumption: The scale and orientation reported by covariant feature detectors admit a geometric interpretation that yields usable constraints under a homography transformation.