pith. sign in

arxiv: 1907.00276 · v1 · pith:ZJMNIKJOnew · submitted 2019-06-29 · 💻 cs.CV · cs.RO

Stereo relative pose from line and point feature triplets

Pith reviewed 2026-05-25 12:27 UTC · model grok-4.3

classification 💻 cs.CV cs.RO
keywords stereo relative poseminimal solverspoint featuresline featuresvisual odometryvisual SLAMmotion estimation
0
0 comments X

The pith

Two minimal solvers compute stereo relative pose from every combination of three point or line features each with three projections.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops algebraic methods to solve for the relative motion between stereo camera pairs when the input is any triplet of points or lines, where each element supplies three known image projections. The authors first classify all geometrically distinct minimal configurations of such triplets and then supply two solvers that together address every case. This formulation matters for stereo visual odometry pipelines because it lets systems use whatever mix of points and lines is available without leaving some configurations unsolved. Experiments show that inserting the solvers into a full visual SLAM pipeline produces a measurable improvement in motion estimation accuracy.

Core claim

We present two minimal solvers for the stereo relative pose problem that together solve all minimal cases consisting of three point or line features with three projections each; a complete classification of these cases is given, and the solvers are validated inside a motion-estimation and visual-SLAM pipeline.

What carries the argument

Two algebraic minimal solvers that together cover the full classification of point/line triplet configurations for stereo relative pose.

If this is right

  • Any minimal set of three features can now be used for stereo pose without regard to whether they are points or lines.
  • Visual odometry systems gain the ability to process mixed-feature inputs without leaving some minimal sets unsolved.
  • Integration of the solvers into a visual SLAM pipeline yields a measurable gain in motion estimation performance.
  • The classification removes the need to maintain separate code paths for different point-line combinations.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The solvers could be dropped into existing RANSAC loops to increase the fraction of usable minimal samples when feature types vary across frames.
  • Because the method works from three projections per feature, it may extend naturally to other multi-camera rigs beyond standard stereo.
  • Real-time implementations could test whether the algebraic speed advantage reduces overall latency compared with iterative refinement methods.

Load-bearing premise

Every geometrically distinct combination of three point or line features with three projections each can be solved by exactly one of the two presented solvers without numerical instability or missed degeneracies.

What would settle it

A concrete triplet configuration of mixed points and lines for which neither solver returns a valid essential matrix or exhibits numerical failure.

Figures

Figures reproduced from arXiv: 1907.00276 by Alexander Vakhitov, Victor Lempitsky, Yinqiang Zheng.

Figure 1
Figure 1. Figure 1: Left:using three line or point features, each having exactly three projections, we seek to determine the relative pose of the two stereoviews. Right: the use of three￾view matches (bottom) by the proposed solvers results in higher number of inliers compared to the use of four-view matches (top). We show the projections of the inlier correspondences on one of the images of KITTI sequence 0 chosen by the met… view at source ↗
Figure 2
Figure 2. Figure 2: The effect of additive noise variation on median relative translation and absolute rotation for each feature/correspondence combination. The accuracy of the methods degrades when noise is added. When fewer points are available, the translation error also grows. The new solvers PPSEgo and EpiSEgo are more accurate than the base￾lines and show almost the same accuracy as bundle adjustment (BA) started from t… view at source ↗
Figure 3
Figure 3. Figure 3: The effects of rotation (left) and translation (right) magnitude variation on median relative translation and absolute rotation. The columns (left to right) corre￾spond to: easy cases, hard cases, point-only combinations (S3P, S2P-1P). The new solvers PPSEgo and EpiSEgo have the lowest errors approaching the reference method (bundle adjustment with ground truth initialization). The new solvers PPSEgo and E… view at source ↗
Figure 4
Figure 4. Figure 4: Results of the experiment on a KITTI sequence 6. Left: cumulative distribu￾tion for the rotation error in degrees. Right: cumulative distribution for the ratio of inliers. The proposed EpiSEgo using line and point feature triplets has higher accuracy compared to the baselines. It has higher inlier ratio than P3P and Pradeep. The use of all possible types of feature triplets rather than quadruplets (Pradeep… view at source ↗
Figure 1
Figure 1. Figure 1: Trajectories reconstructed by relative pose integration for the proposed solver EpiSEgo and the baselines. We use the KITTI sequences 6 (left) and 0 (right). The P3P method cannot produce a solution often so the trajectory for it is not shown. The Approx method gives low pose estimation errors compared to other baselines, but it results in large trajectory reconstruction errors due to the employed rotation… view at source ↗
Figure 2
Figure 2. Figure 2: Results of the experiment on a KITTI sequence 0. Left: cumulative distribution for the rotation error in degrees. Right: cumulative distribution for the ratio of inliers. The proposed method EpiSEgo has higher inlier ratio than all the baselines except the Approx and lower angular error than the baselines. The Approx method has better inlier ratios due to the fact that it uses only point features. experime… view at source ↗
Figure 3
Figure 3. Figure 3: Trajectories reconstructed by the original and modified visual SLAM pipeline [31] for the frame dropping experiment, see the paper for details. We present the trajectories for the KITTI sequences 6 (left) and 0 (right). The original pipeline loses track often. The modified pipeline tracks the full sequence. 1.3 Synthetic experiments We include here the results of two more synthetic experiments. In the firs… view at source ↗
Figure 4
Figure 4. Figure 4 [PITH_FULL_IMAGE:figures/full_fig_p021_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: The effect of noise magnitude variation on median relative translation and absolute rotation errors in planar case. Left graph: ’easy’ cases, central graph: ’harder’ cases, right graph: point-only cases [PITH_FULL_IMAGE:figures/full_fig_p022_5.png] view at source ↗
read the original abstract

Stereo relative pose problem lies at the core of stereo visual odometry systems that are used in many applications. In this work, we present two minimal solvers for the stereo relative pose. We specifically consider the case when a minimal set consists of three point or line features and each of them has three known projections on two stereo cameras. We validate the importance of this formulation for practical purposes in our experiments with motion estimation. We then present a complete classification of minimal cases with three point or line correspondences each having three projections, and present two new solvers that can handle all such cases. We demonstrate a considerable effect from the integration of the new solvers into a visual SLAM system.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper claims to present two minimal solvers for the stereo relative pose problem when using three point or line features each having three known projections across a stereo pair. It provides a complete classification of all geometrically distinct minimal cases with such triplets and asserts that the two solvers together handle every case without gaps. The work validates the formulation on motion estimation tasks and shows integration benefits in a visual SLAM pipeline.

Significance. If the classification is exhaustive and the solvers are algebraically correct and numerically stable, the contribution would be useful for stereo visual odometry, supplying efficient algebraic solutions for a practically relevant minimal configuration that mixes points and lines.

major comments (2)
  1. [Classification section] Classification section (referenced in the abstract): the central claim that the two solvers cover all cases rests on the asserted completeness of the enumeration of geometrically distinct 3-feature (point/line) combinations with three projections each. The manuscript must supply an explicit list or decision tree showing every combination, the assigned solver, and a verification that no combination falls outside the two solvers or triggers an unhandled degeneracy.
  2. [Solver derivation sections] Solver derivation sections: no derivation details, resultant degrees, or numerical stability analysis (e.g., condition numbers or failure rates on synthetic data near degeneracies) are visible for the polynomial systems solved by each of the two solvers, leaving open whether the claimed coverage is achieved without post-hoc fixes or missed singular cases.
minor comments (2)
  1. The abstract and introduction should state the number of solutions returned by each solver and the polynomial degree of the final univariate equation.
  2. Figure captions and table headings that report timing or accuracy should explicitly indicate whether the numbers include the classification step or only the solver execution.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. We address the two major comments point-by-point below and will incorporate revisions to improve clarity on the classification and solver details.

read point-by-point responses
  1. Referee: [Classification section] Classification section (referenced in the abstract): the central claim that the two solvers cover all cases rests on the asserted completeness of the enumeration of geometrically distinct 3-feature (point/line) combinations with three projections each. The manuscript must supply an explicit list or decision tree showing every combination, the assigned solver, and a verification that no combination falls outside the two solvers or triggers an unhandled degeneracy.

    Authors: We agree that an explicit list or decision tree would strengthen the presentation of completeness. The manuscript classifies the eight possible point/line combinations for three features (PPP, PPL, PLL, LLL and their permutations) into two geometrically distinct families handled by the respective solvers. In the revision we will add a table enumerating all combinations, the assigned solver for each, and explicit verification that none fall outside the two solvers or introduce unhandled degeneracies. revision: yes

  2. Referee: [Solver derivation sections] Solver derivation sections: no derivation details, resultant degrees, or numerical stability analysis (e.g., condition numbers or failure rates on synthetic data near degeneracies) are visible for the polynomial systems solved by each of the two solvers, leaving open whether the claimed coverage is achieved without post-hoc fixes or missed singular cases.

    Authors: The derivations follow standard algebraic elimination for the epipolar constraints on mixed point and line triplets, but the main text omits explicit resultant degrees and stability metrics for brevity. We will expand the solver sections in revision to include the polynomial degrees obtained for each solver and add synthetic experiments reporting condition numbers and failure rates near degeneracies. revision: yes

Circularity Check

0 steps flagged

No circularity: algebraic derivation of solvers is self-contained

full rationale

The paper derives two new minimal solvers for stereo relative pose from triplets of point/line features, each with three projections. It also presents a classification of minimal cases. No quoted equations or steps reduce a claimed prediction or result to a fitted parameter, self-definition, or load-bearing self-citation chain. The classification and solvers are presented as original contributions rather than renaming known results or smuggling ansatzes. The work is a standard geometric derivation in computer vision and remains independent of its own inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The work rests on standard projective geometry and camera models; no free parameters, ad-hoc constants, or new postulated entities are introduced in the abstract.

axioms (1)
  • standard math Standard assumptions of calibrated stereo cameras and projective geometry for points and lines
    Invoked to formulate the minimal pose problem from three-feature triplets.

pith-pipeline@v0.9.0 · 5638 in / 1239 out tokens · 22786 ms · 2026-05-25T12:27:00.118690+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

32 extracted references · 32 canonical work pages

  1. [1]

    In: TPAMI, IEEE (2004) 756–770

    Nister, D.: An efficient solution to the five-point relative pose problem. In: TPAMI, IEEE (2004) 756–770

  2. [2]

    In: Readings in computer vision

    Fischler, M.A., Bolles, R.C.: Random sample consensus: a paradigm for model fit- ting with applications to image analysis and automated cartography. In: Readings in computer vision. Elsevier (1987) 726–740

  3. [3]

    IEEE Transactions on Robotics 32(6) (2016) 1309–1332

    Cadena, C., Carlone, L., Carrillo, H., Latif, Y., Scaramuzza, D., Neira, J., Reid, I., Leonard, J.J.: Past, present, and future of simultaneous localization and mapping: Toward the robust-perception age. IEEE Transactions on Robotics 32(6) (2016) 1309–1332

  4. [4]

    Communications of the ACM 54(10) (2011) 105–112

    Agarwal, S., Furukawa, Y., Snavely, N., Simon, I., Curless, B., Seitz, S.M., Szeliski, R.: Building Rome in a day. Communications of the ACM 54(10) (2011) 105–112

  5. [5]

    International Journal of Computer Vision 124(1) (2017) 65–79

    Micusik, B., Wildenauer, H.: Structure from motion with line segments under relaxed endpoint constraints. International Journal of Computer Vision 124(1) (2017) 65–79

  6. [6]

    IEEE Transactions on Pattern Analysis and Machine Intelligence 39(6) (2017) 1209–1222

    Xu, C., Zhang, L., Cheng, L., Koch, R.: Pose estimation from line correspondences: A complete analysis and a series of solutions. IEEE Transactions on Pattern Analysis and Machine Intelligence 39(6) (2017) 1209–1222

  7. [7]

    In: Workshop on omnidirectional vision

    Stew´ enius, H., Nist´ er, D., Oskarsson, M.,˚Astr¨ om, K.: Solutions to minimal gener- alized relative pose problems. In: Workshop on omnidirectional vision. Volume 1. (2005) 3

  8. [8]

    International Journal of Computer Vision 98(2) (2012) 202–216

    Pradeep, V., Lim, J.: Egomotion estimation using assorted features. International Journal of Computer Vision 98(2) (2012) 202–216

  9. [9]

    In: ICCV, IEEE (2015) 747–755

    Ventura, J., Arth, C., Lepetit, V.: An efficient minimal solution for multi-camera motion. In: ICCV, IEEE (2015) 747–755

  10. [10]

    In: CVPR, IEEE (2014) 1–8

    Kneip, L., Li, H.: Efcient computation of relative pose for multi-camera systems. In: CVPR, IEEE (2014) 1–8

  11. [11]

    In: Conference on Computer Vision and Pattern Recognition (CVPR)

    Menze, M., Geiger, A.: Object scene flow for autonomous vehicles. In: Conference on Computer Vision and Pattern Recognition (CVPR). (2015)

  12. [12]

    In: 2011 International conference on computer vision, IEEE (2011) 2564–2571

    Rublee, E., Rabaud, V., Konolige, K., Bradski, G.: ORB: An efficient alternative to sift or surf. In: 2011 International conference on computer vision, IEEE (2011) 2564–2571

  13. [13]

    Journal of Visual Communication and Image Representation 24(7) (2013) 794–805

    Zhang, L., Koch, R.: An efficient and robust line segment matching approach based on lbd descriptor and pairwise geometric consistency. Journal of Visual Communication and Image Representation 24(7) (2013) 794–805

  14. [14]

    In: CVPR, IEEE (2004) 652–659

    Nister, D., Naroditsky, O., Bergen, J.: Visual odometry. In: CVPR, IEEE (2004) 652–659

  15. [15]

    In: International Conference on Computer Vision, IEEE (2009) 1741–1748

    Manmohan, C., Jongwoo, L., David, K.: Moving in stereo: Efficient structure and motion using lines. In: International Conference on Computer Vision, IEEE (2009) 1741–1748

  16. [16]

    In: International Conference on Computer Vision, IEEE (2009) 1725–1732

    Brian, C., Christopher, Z., Jan-Michael, F., Marc, P.: A new minimal solution to the relative pose of a calibrated stereo camera with small field of view overlap. In: International Conference on Computer Vision, IEEE (2009) 1725–1732

  17. [17]

    In: Computer Vision (ICCV), 2011 IEEE International Conference on, IEEE (2011) 1187–1194

    Dunn, E., Clipp, B., Frahm, J.M.: A geometric solver for calibrated stereo egomo- tion. In: Computer Vision (ICCV), 2011 IEEE International Conference on, IEEE (2011) 1187–1194

  18. [18]

    In: IEEE Computer Society Conference on Computer Vision and Pattern Recogni- tion

    Nister, D.: A minimal solution to the generalised 3-point pose problem. In: IEEE Computer Society Conference on Computer Vision and Pattern Recogni- tion. (2004) 16 A. Vakhitov, V. Lempitsky and Y. Zheng

  19. [19]

    Computer Vision and Image Understanding 103(3) (2006) 218–228

    Ramalingam, S., Lodha, S.K., Sturm, P.: A generic structure-from-motion frame- work. Computer Vision and Image Understanding 103(3) (2006) 218–228

  20. [20]

    Journal of Mathematical Imaging and Vision 27(1) (2007) 67–79

    Nist´ er, D., Stew´ enius, H.: A minimal solution to the generalised 3-point pose problem. Journal of Mathematical Imaging and Vision 27(1) (2007) 67–79

  21. [21]

    In: 3D Imaging (IC3D), 2014 International Conference on, IEEE (2014) 1–6

    Merzban, M.H., Abdellatif, M., Abouelsoud, A.: A simple solution for the non perspective three point pose problem. In: 3D Imaging (IC3D), 2014 International Conference on, IEEE (2014) 1–6

  22. [22]

    IEEE Transactions on Cybernetics 45(3) (2015) 404–415

    Miraldo, P., Araujo, H.: Direct solution to the minimal generalized pose. IEEE Transactions on Cybernetics 45(3) (2015) 404–415

  23. [23]

    In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition

    Kukelova, Z., Heller, J., Fitzgibbon, A.: Efficient intersection of three quadrics and applications in computer vision. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition. (2016) 1799–1808

  24. [24]

    Cambridge university press (2003)

    Hartley, R., Zisserman, A.: Multiple view geometry in computer vision. Cambridge university press (2003)

  25. [25]

    Springer Science & Business Media (2013)

    Char, B.W., Geddes, K.O., Gonnet, G.H., Leong, B.L., Monagan, M.B., Watt, S.: Maple V library reference manual. Springer Science & Business Media (2013)

  26. [26]

    Volume 185

    Cox, D.A., Little, J., O’shea, D.: Using algebraic geometry. Volume 185. Springer Science & Business Media (2006)

  27. [27]

    In: European Conference on Computer Vision, Springer (2016) 583–599

    Vakhitov, A., Funke, J., Moreno-Noguer, F.: Accurate and linear time pose estima- tion from points and lines. In: European Conference on Computer Vision, Springer (2016) 583–599

  28. [28]

    Pattern Recognition Letters 32(13) (2011) 1633–1642

    Akinlar, C., Topal, C.: Edlines: A real-time line segment detector with a false detection control. Pattern Recognition Letters 32(13) (2011) 1633–1642

  29. [29]

    IEEE transactions on pattern analysis and machine intelligence 25(8) (2003) 930–943

    Gao, X.S., Hou, X.R., Tang, J., Cheng, H.F.: Complete solution classification for the perspective-three-point problem. IEEE transactions on pattern analysis and machine intelligence 25(8) (2003) 930–943

  30. [30]

    In: Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, IEEE (2012) 3354–3361

    Geiger, A., Lenz, P., Urtasun, R.: Are we ready for autonomous driving? the KITTI vision benchmark suite. In: Computer Vision and Pattern Recognition (CVPR), 2012 IEEE Conference on, IEEE (2012) 3354–3361

  31. [31]

    Mur-Artal, R., Tard´ os, J.D.: ORB-SLAM2: An open-source slam system for monocular, stereo, and RGB-D cameras. IEEE Transactions on Robotics 33(5) (2017) 1255–1262 Stereo relative pose from line and point feature triplets 1 Supplemental Materials: Stereo relative pose from line and point feature triplets In these Supplemental Materials we report additiona...

  32. [32]

    We present the trajectories for the KITTI sequences 6 (left) and 0 (right)

    for the frame dropping experiment, see the paper for details. We present the trajectories for the KITTI sequences 6 (left) and 0 (right). The original pipeline loses track often. The modified pipeline tracks the full sequence. 1.3 Synthetic experiments We include here the results of two more synthetic experiments. In the first one we analyze a planar case, ...