Extending Deep Event Visual Odometry with Sparse Point-Cloud Export

Alireza Safdari; Sajad Ashraf

arxiv: 2605.22890 · v1 · pith:C6A2G6NHnew · submitted 2026-05-21 · 💻 cs.RO · cs.CV

Extending Deep Event Visual Odometry with Sparse Point-Cloud Export

Alireza Safdari , Sajad Ashraf This is my paper

Pith reviewed 2026-05-25 05:51 UTC · model grok-4.3

classification 💻 cs.RO cs.CV

keywords event camerasvisual odometrypoint cloud exportDEVOsparse reconstructionEMVSrobotics

0 comments

The pith

DEVO exports its internal 3D estimates as a sparse point cloud that matches EMVS at 5 cm precision without changing the odometry method.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper adds a point-cloud export step to Deep Event Visual Odometry that turns the system's existing 3D structure into an explicit sparse cloud. This is achieved by exposing internal estimates rather than altering the patch tracking, correspondence refinement, or bundle adjustment pipeline. On the BOARD SLOW sequence the exported points align closely with EMVS reconstructions at a 5 cm threshold. The work also documents the expected shortfalls in density and completeness that arise from relying on the original odometry output.

Core claim

Rather than modifying the core odometry formulation, the approach exposes the internal 3D structure already estimated by DEVO and converts it into an explicit point-cloud representation for visualization and further processing, preserving the original visual odometry pipeline while enabling sparse geometric scene output.

What carries the argument

Sparse point-cloud export pipeline that converts DEVO's internal 3D estimates into explicit points without changes to the odometry formulation.

If this is right

The exported sparse cloud is locally consistent with EMVS reconstructions.
High precision holds at a 5 cm threshold.
The output shows expected limits in density, completeness, and sensitivity to accumulated odometry noise.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The export step could let DEVO outputs feed directly into mapping or planning modules that expect explicit geometry.
Comparable export layers might be added to other deep event odometry systems to produce usable 3D data.
Sequences with larger accumulated drift would likely widen the gap between exported points and ground-truth reconstructions.

Load-bearing premise

The internal 3D structure already estimated inside DEVO can be directly converted into an explicit, usable point-cloud representation without requiring changes to the odometry formulation or additional reconstruction steps.

What would settle it

Direct comparison on the BOARD SLOW sequence in which the exported DEVO points deviate from EMVS reconstructions below the reported 5 cm precision threshold.

Figures

Figures reproduced from arXiv: 2605.22890 by Alireza Safdari, Sajad Ashraf.

**Figure 3.** Figure 3: Visual comparison between EMVS reconstructions using ground-truth [PITH_FULL_IMAGE:figures/full_fig_p008_3.png] view at source ↗

**Figure 4.** Figure 4: Projection of reconstructed 3D points onto a regular gray-camera frame. The points align with visible board shape boundaries such as the diamond, circle, square, and printed symbols, confirming that the exported point clouds correspond to real scene geometry. Both EMVS and DEVO SLAM recover the dominant planar board structure. The EMVS reconstruction is more compact and has lower plane residuals, while the… view at source ↗

read the original abstract

Event cameras are well suited for visual odometry under high-speed motion and challenging lighting conditions due to their low latency, high temporal resolution, and high dynamic range. Deep Event Visual Odometry (DEVO) demonstrated that monocular event-only odometry can achieve strong performance by combining sparse patch tracking, learned patch selection, recurrent correspondence refinement, and differentiable bundle adjustment. In this project, we extend DEVO with a sparse point-cloud export pipeline. Rather than modifying the core odometry formulation, our approach exposes the internal 3D structure already estimated by DEVO and converts it into an explicit point-cloud representation for visualization and further processing. In addition, we implement a practical workflow for data export, format conversion, and point-cloud cleanup. The resulting system preserves the original visual odometry pipeline while enabling sparse geometric scene output. Experiments on the BOARD SLOW sequence show that the exported sparse cloud is locally consistent with EMVS reconstructions, achieving high precision at a 5 cm threshold, while also highlighting the expected limitations in density, completeness, and sensitivity to accumulated odometry noise.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This is a straightforward engineering extension to DEVO that adds point cloud export without touching the core odometry.

read the letter

This paper is basically a practical add-on that lets you export sparse point clouds from the existing DEVO system without changing how the odometry works. The authors convert the internal 3D estimates into a usable format with some cleanup. It does a clean job of preserving the original pipeline and shows local consistency with EMVS on the BOARD SLOW sequence at 5 cm. That's useful if you're already running DEVO and want geometry output. They also provide a workflow for data export and format conversion, which could save time for others. The evaluation honestly notes the limits in density and sensitivity to drift. The main limitation is the narrow scope: only one sequence is mentioned, and there's little detail on the metrics or comparison method. It's not introducing new sensing or better accuracy, just exposing what's already there. No new math or derivations, so no circularity issues. It's for DEVO users who need point clouds for further processing or viz in robotics applications. Not for someone hunting for core advances in event vision. I wouldn't bring this to reading group or cite it myself. The thinking is clear enough. I'd recommend against sending it for peer review; it reads more like a solid implementation note than a paper that needs referee scrutiny.

Referee Report

1 major / 1 minor

Summary. The paper extends Deep Event Visual Odometry (DEVO) with a sparse point-cloud export pipeline. Rather than altering the core odometry formulation, the approach exposes DEVO's internal 3D structure and converts it into an explicit point cloud via data export, format conversion, and cleanup steps. Experiments on the BOARD SLOW sequence report that the exported cloud is locally consistent with EMVS reconstructions at a 5 cm threshold, while noting expected limitations in density, completeness, and sensitivity to accumulated odometry noise.

Significance. If the consistency result can be substantiated with quantitative details, the work provides a lightweight way to obtain usable sparse geometric output from an existing event-based odometry system. This could support visualization and downstream processing in robotics without requiring separate reconstruction pipelines, though the modest scope limits broader impact.

major comments (1)

[Abstract] Abstract: The claim that the exported sparse cloud achieves 'high precision at a 5 cm threshold' with EMVS is presented without any supporting quantitative details (e.g., precision value, number of points evaluated, error distribution, or exact comparison protocol). This omission is load-bearing for the central consistency claim.

minor comments (1)

The manuscript would benefit from a diagram or pseudocode illustrating the export and cleanup workflow to make the conversion steps reproducible.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for identifying the need for greater precision in the abstract. We address the single major comment below and will revise accordingly.

read point-by-point responses

Referee: [Abstract] Abstract: The claim that the exported sparse cloud achieves 'high precision at a 5 cm threshold' with EMVS is presented without any supporting quantitative details (e.g., precision value, number of points evaluated, error distribution, or exact comparison protocol). This omission is load-bearing for the central consistency claim.

Authors: We agree the abstract overstates the result by using 'high precision' without supporting metrics. The manuscript reports local consistency at the 5 cm threshold on the BOARD SLOW sequence but does not include the requested quantitative breakdown in the provided text. In revision we will replace the phrasing with 'local consistency with EMVS reconstructions at a 5 cm threshold' and add a brief reference to the experiments section. If space permits we will also include a short statement on the number of points compared and the comparison protocol. revision: yes

Circularity Check

0 steps flagged

No significant circularity; implementation extension only

full rationale

The paper presents an engineering extension that exposes DEVO's existing internal 3D structure for sparse point-cloud export without altering the odometry formulation or adding reconstruction steps. No equations, fitted parameters, predictions, or derivations appear in the provided text. The consistency claim with EMVS is an empirical observation on the BOARD SLOW sequence at a fixed 5 cm threshold, not a self-referential fit or renamed result. No self-citations are load-bearing for any central premise, and the work is self-contained as a format-conversion pipeline. This matches the default expectation of no circularity.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

No mathematical derivations, fitted parameters, or new entities are introduced; the work consists of software implementation steps to expose existing internal estimates.

pith-pipeline@v0.9.0 · 5717 in / 975 out tokens · 17239 ms · 2026-05-25T05:51:26.602656+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Rather than modifying the core odometry formulation, our approach exposes the internal 3D structure already estimated by DEVO and converts it into an explicit point-cloud representation... back-projection principle... pc = z K^{-1} [u v 1]^T
IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Experiments on the BOARD SLOW sequence show that the exported sparse cloud is locally consistent with EMVS reconstructions, achieving high precision at a 5 cm threshold

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

15 extracted references · 15 canonical work pages

[1]

Event-based vision: A survey,

G. Gallego, T. Delbr ¨uck, G. Orchard, C. Bartolozzi, B. Taba, A. Censi, S. Leutenegger, A. J. Davison, J. Conradt, K. Daniilidis, and D. Scara- muzza, “Event-based vision: A survey,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 44, no. 1, pp. 154–180, 2022

work page 2022
[2]

A 128×128 120 db 15 µs latency asynchronous temporal contrast vision sensor,

P. Lichtsteiner, C. Posch, and T. Delbr ¨uck, “A 128×128 120 db 15 µs latency asynchronous temporal contrast vision sensor,”IEEE Journal of Solid-State Circuits, vol. 43, no. 2, pp. 566–576, 2008

work page 2008
[3]

Evo: A geometric approach to event-based 6-dof parallel tracking and mapping in real time,

H. Rebecq, T. Horstschaefer, G. Gallego, and D. Scaramuzza, “Evo: A geometric approach to event-based 6-dof parallel tracking and mapping in real time,”IEEE Robotics and Automation Letters, vol. 2, no. 2, pp. 593–600, 2017

work page 2017
[4]

Real-time visual- inertial odometry for event cameras using keyframe-based nonlinear optimization,

H. Rebecq, T. Horstschaefer, and D. Scaramuzza, “Real-time visual- inertial odometry for event cameras using keyframe-based nonlinear optimization,” inBritish Machine Vision Conference (BMVC), 2017

work page 2017
[5]

Ultimate slam? combining events, images, and imu for robust visual slam in hdr and high speed scenarios,

A. R. Vidal, H. Rebecq, T. Horstschaefer, and D. Scaramuzza, “Ultimate slam? combining events, images, and imu for robust visual slam in hdr and high speed scenarios,”IEEE Robotics and Automation Letters, vol. 3, no. 2, pp. 994–1001, 2018

work page 2018
[6]

Event-based stereo visual odometry,

Y . Zhou, G. Gallego, and S. Shen, “Event-based stereo visual odometry,” IEEE Transactions on Robotics, vol. 37, no. 5, pp. 1433–1450, 2021

work page 2021
[7]

Event-aided direct sparse odometry,

J. Hidalgo-Carri ´o, G. Gallego, and D. Scaramuzza, “Event-aided direct sparse odometry,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 5781– 5790

work page 2022
[8]

Pl-evio: Robust monocular event-based visual inertial odometry with point and line features,

W. Guan, P. Chen, Y . Xie, and P. Lu, “Pl-evio: Robust monocular event-based visual inertial odometry with point and line features,”arXiv preprint arXiv:2209.12160, 2022

work page arXiv 2022
[9]

Esvio: Event-based stereo visual inertial odometry,

P. Chen, W. Guan, and P. Lu, “Esvio: Event-based stereo visual inertial odometry,”IEEE Robotics and Automation Letters, vol. 8, no. 6, pp. 3661–3668, 2023

work page 2023
[10]

Deep patch visual odometry,

Z. Teed, L. Lipson, and J. Deng, “Deep patch visual odometry,” in Advances in Neural Information Processing Systems, vol. 36, 2023

work page 2023
[11]

Deep event visual odometry,

S. Klenk, M. Motzet, L. Koestler, and D. Cremers, “Deep event visual odometry,” inProceedings of the International Conference on 3D Vision (3DV), 2024, pp. 739–749

work page 2024
[12]

Emvs: Event- based multi-view stereo—3d reconstruction with an event camera in real-time,

H. Rebecq, G. Gallego, E. Mueggler, and D. Scaramuzza, “Emvs: Event- based multi-view stereo—3d reconstruction with an event camera in real-time,”International Journal of Computer Vision, vol. 126, no. 12, pp. 1394–1414, 2018

work page 2018
[13]

The event-camera dataset and simulator: Event-based data for pose estimation, visual odometry, and slam,

E. Mueggler, H. Rebecq, G. Gallego, T. Delbr ¨uck, and D. Scaramuzza, “The event-camera dataset and simulator: Event-based data for pose estimation, visual odometry, and slam,”The International Journal of Robotics Research, vol. 36, no. 2, pp. 142–149, 2017

work page 2017
[14]

Devo-point-cloud-docker,

A. S. Khosroshahi and S. Ashraf, “Devo-point-cloud-docker,” https://github.com/Alireza-Safdari-Khosroshahi/DEVO-point-cloud- docker, 2026, gitHub repository, accessed March 28, 2026

work page 2026
[15]

BOARD SLOW Shape Validation Notebook,

A. Safdari and S. Ashraf, “BOARD SLOW Shape Validation Notebook,” Jupyter notebook, supplementary material, 2026, analysis notebook for scale isolation, EMVS comparison, point-cloud projection, ICP alignment, and board planarity validation

work page 2026

[1] [1]

Event-based vision: A survey,

G. Gallego, T. Delbr ¨uck, G. Orchard, C. Bartolozzi, B. Taba, A. Censi, S. Leutenegger, A. J. Davison, J. Conradt, K. Daniilidis, and D. Scara- muzza, “Event-based vision: A survey,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 44, no. 1, pp. 154–180, 2022

work page 2022

[2] [2]

A 128×128 120 db 15 µs latency asynchronous temporal contrast vision sensor,

P. Lichtsteiner, C. Posch, and T. Delbr ¨uck, “A 128×128 120 db 15 µs latency asynchronous temporal contrast vision sensor,”IEEE Journal of Solid-State Circuits, vol. 43, no. 2, pp. 566–576, 2008

work page 2008

[3] [3]

Evo: A geometric approach to event-based 6-dof parallel tracking and mapping in real time,

H. Rebecq, T. Horstschaefer, G. Gallego, and D. Scaramuzza, “Evo: A geometric approach to event-based 6-dof parallel tracking and mapping in real time,”IEEE Robotics and Automation Letters, vol. 2, no. 2, pp. 593–600, 2017

work page 2017

[4] [4]

Real-time visual- inertial odometry for event cameras using keyframe-based nonlinear optimization,

H. Rebecq, T. Horstschaefer, and D. Scaramuzza, “Real-time visual- inertial odometry for event cameras using keyframe-based nonlinear optimization,” inBritish Machine Vision Conference (BMVC), 2017

work page 2017

[5] [5]

Ultimate slam? combining events, images, and imu for robust visual slam in hdr and high speed scenarios,

A. R. Vidal, H. Rebecq, T. Horstschaefer, and D. Scaramuzza, “Ultimate slam? combining events, images, and imu for robust visual slam in hdr and high speed scenarios,”IEEE Robotics and Automation Letters, vol. 3, no. 2, pp. 994–1001, 2018

work page 2018

[6] [6]

Event-based stereo visual odometry,

Y . Zhou, G. Gallego, and S. Shen, “Event-based stereo visual odometry,” IEEE Transactions on Robotics, vol. 37, no. 5, pp. 1433–1450, 2021

work page 2021

[7] [7]

Event-aided direct sparse odometry,

J. Hidalgo-Carri ´o, G. Gallego, and D. Scaramuzza, “Event-aided direct sparse odometry,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 5781– 5790

work page 2022

[8] [8]

Pl-evio: Robust monocular event-based visual inertial odometry with point and line features,

W. Guan, P. Chen, Y . Xie, and P. Lu, “Pl-evio: Robust monocular event-based visual inertial odometry with point and line features,”arXiv preprint arXiv:2209.12160, 2022

work page arXiv 2022

[9] [9]

Esvio: Event-based stereo visual inertial odometry,

P. Chen, W. Guan, and P. Lu, “Esvio: Event-based stereo visual inertial odometry,”IEEE Robotics and Automation Letters, vol. 8, no. 6, pp. 3661–3668, 2023

work page 2023

[10] [10]

Deep patch visual odometry,

Z. Teed, L. Lipson, and J. Deng, “Deep patch visual odometry,” in Advances in Neural Information Processing Systems, vol. 36, 2023

work page 2023

[11] [11]

Deep event visual odometry,

S. Klenk, M. Motzet, L. Koestler, and D. Cremers, “Deep event visual odometry,” inProceedings of the International Conference on 3D Vision (3DV), 2024, pp. 739–749

work page 2024

[12] [12]

Emvs: Event- based multi-view stereo—3d reconstruction with an event camera in real-time,

H. Rebecq, G. Gallego, E. Mueggler, and D. Scaramuzza, “Emvs: Event- based multi-view stereo—3d reconstruction with an event camera in real-time,”International Journal of Computer Vision, vol. 126, no. 12, pp. 1394–1414, 2018

work page 2018

[13] [13]

The event-camera dataset and simulator: Event-based data for pose estimation, visual odometry, and slam,

E. Mueggler, H. Rebecq, G. Gallego, T. Delbr ¨uck, and D. Scaramuzza, “The event-camera dataset and simulator: Event-based data for pose estimation, visual odometry, and slam,”The International Journal of Robotics Research, vol. 36, no. 2, pp. 142–149, 2017

work page 2017

[14] [14]

Devo-point-cloud-docker,

A. S. Khosroshahi and S. Ashraf, “Devo-point-cloud-docker,” https://github.com/Alireza-Safdari-Khosroshahi/DEVO-point-cloud- docker, 2026, gitHub repository, accessed March 28, 2026

work page 2026

[15] [15]

BOARD SLOW Shape Validation Notebook,

A. Safdari and S. Ashraf, “BOARD SLOW Shape Validation Notebook,” Jupyter notebook, supplementary material, 2026, analysis notebook for scale isolation, EMVS comparison, point-cloud projection, ICP alignment, and board planarity validation

work page 2026