See Silhouettes in Motion with Neuromorphic Vision

Jinpeng Chen; Pei Zhang; Shijie Lin; Wei Pu; Zhou Ge

arxiv: 2605.17984 · v1 · pith:N3WSL6URnew · submitted 2026-05-18 · 📡 eess.IV · cs.CV· cs.RO

See Silhouettes in Motion with Neuromorphic Vision

Pei Zhang , Shijie Lin , Zhou Ge , Jinpeng Chen , Wei Pu This is my paper

Pith reviewed 2026-05-20 00:52 UTC · model grok-4.3

classification 📡 eess.IV cs.CVcs.RO

keywords neuromorphic visionevent camerasbinarizationsilhouette extractionmotion blurhigh frame ratedual-modal fusionedge computing

0 comments

The pith

Fusing frames with neuromorphic event data produces clear binarized silhouettes of fast-moving objects on standard CPUs.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper aims to show that a dual-modal fusion of conventional image frames and event streams from neuromorphic cameras can extract reliable binary silhouettes from quasi-bimodal objects such as text and signs. This matters because traditional frame-based imaging fails in dynamic scenes on mobile platforms due to motion blur and lighting extremes, while the proposed method runs in real time on CPU-only hardware. The asynchronous workflow sidesteps the event scarcity that undermines time-binning approaches, preserving sharp target shapes at kilohertz frame rates. The resulting binary representations then support efficient downstream vision tasks in resource-constrained settings.

Core claim

We introduce a simple yet effective dual-modal approach that harnesses the synergy between frames and events to achieve real-time, high-frame-rate binarization on CPU-only devices. Extensive evaluations show competitive performance against leading techniques in reducing motion blur, while delivering improvements under challenging illumination. The asynchronous workflow bypasses event scarcity that breaks traditional time-binning reconstruction, maintaining clear target shapes even at extreme kilohertz frame rates. Its binary results further serve as reliable representations that facilitate a range of downstream tasks.

What carries the argument

Dual-modal fusion of frames and events in an asynchronous workflow that produces binarized silhouettes.

If this is right

The method reduces motion blur in dynamic scenes while operating in real time on CPU hardware.
It improves binarization results under harsh or varying illumination compared to frame-only approaches.
Clear shapes are preserved even when traditional time-binning fails due to sparse events.
Binary outputs serve as compact inputs for downstream perception and interaction tasks.
The workflow supports lightweight perception on edge platforms such as drones and vehicles.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same fusion idea could be tested on other quasi-bimodal patterns such as moving QR codes or lane markings.
Performance under increased event noise levels beyond the paper's evaluations would clarify practical limits.
Integration into tracking or recognition pipelines on low-power robotics hardware offers a direct next step.

Load-bearing premise

A simple dual-modal fusion of frames and events is sufficient to produce reliable binarization without detailed specification of the fusion algorithm, training data, or failure modes under real-world event noise.

What would settle it

A side-by-side comparison at kilohertz rates in a high-noise, low-event-density scene where the output silhouettes lose clear target shapes relative to ground-truth labels.

Figures

Figures reproduced from arXiv: 2605.17984 by Jinpeng Chen, Pei Zhang, Shijie Lin, Wei Pu, Zhou Ge.

**Figure 1.** Figure 1: Minimal input, maximum insight. they are bottlenecked by the loss of raw visual information. Bridging this perception gap is our motivation. Quasi-bimodal objects can be found everywhere in our daily lives, like text, road signs, and barcodes. Checkerboards, fiducials are often used for camera calibration, localization, and tracking for robotics, AR/VR, and autonomous systems [1]– [3]. Binarization from i… view at source ↗

**Figure 2.** Figure 2: The dual-modal binarization framework. (a) A blurry “No overtaking” sign observed during high-speed driving is spatially decomposed into [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

**Figure 3.** Figure 3: Stronger motion brings heavier frame blur, yet the distribution in a [PITH_FULL_IMAGE:figures/full_fig_p004_3.png] view at source ↗

**Figure 5.** Figure 5: A clear bimodal shape is observed in the sharp scene, while motion [PITH_FULL_IMAGE:figures/full_fig_p005_5.png] view at source ↗

**Figure 6.** Figure 6: Binarization results on the RND, REBlur, and EBT datasets. Each binary reference is taken from a sharp frame captured near the time of the degraded [PITH_FULL_IMAGE:figures/full_fig_p007_6.png] view at source ↗

**Figure 7.** Figure 7: Binarization results on the RND and DIRD datasets. Each binary reference is taken from a sharp frame captured near the time of the degraded [PITH_FULL_IMAGE:figures/full_fig_p007_7.png] view at source ↗

**Figure 8.** Figure 8: Accurate binarization benefits a range of downstream tasks. Left to [PITH_FULL_IMAGE:figures/full_fig_p008_8.png] view at source ↗

**Figure 10.** Figure 10: Runtime performance across the datasets with varying event rates. [PITH_FULL_IMAGE:figures/full_fig_p009_10.png] view at source ↗

**Figure 11.** Figure 11: Adaptive parameters optimize binarization results. [PITH_FULL_IMAGE:figures/full_fig_p009_11.png] view at source ↗

**Figure 12.** Figure 12: Evaluation of the impact of event quality on binarization. The samples [PITH_FULL_IMAGE:figures/full_fig_p010_12.png] view at source ↗

read the original abstract

Quasi-bimodal objects, such as text, road signs, and barcodes, play a basic yet vital role in daily visual communication. By boiling these down to clear silhouettes, binarization uses a minimal language to convey essential vision cues for maximum downstream efficiency. The catch is that frame-based imaging often struggles on mobile platforms like drones, self-driving cars, and underwater vehicles. In these dynamic scenes, rapid motion and harsh lighting can make it blind, causing severe motion blur and erasing crucial details. To overcome the limits, neuromorphic vision via event cameras, featuring microsecond-level temporal resolution and high dynamic range, steps in as a natural solution. Building upon this event-driven sensing paradigm, we introduce a simple yet effective dual-modal approach that harnesses the synergy between frames and events to achieve real-time, high-frame-rate binarization on CPU-only devices. Extensive evaluations present that it earns competitive performance against leading techniques in reducing motion blur, while delivering impressive improvements under challenging illumination. Besides, our asynchronous workflow bypasses event scarcity that breaks traditional time-binning reconstruction, maintaining clear target shapes even at extreme kilohertz frame rates. Its binary results further serve as reliable representations that facilitate a range of downstream tasks. This work paves the way towards lightweight perception and interaction in embodied intelligence on resource-constrained edge platforms.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper applies event cameras to silhouette binarization via an asynchronous dual-modal workflow for high-speed mobile scenes, but supplies no fusion rules or implementation details to back the kHz claims.

read the letter

This paper's main contribution is a dual-modal approach using event cameras and standard frames for binarizing silhouettes in high-motion scenarios, with an asynchronous workflow that supposedly handles sparse events at kilohertz rates without traditional binning. It targets practical issues in mobile platforms like drones and cars where motion blur and lighting changes hurt regular cameras. The new part is applying this to the specific binarization problem for quasi-bimodal objects, aiming for real-time CPU performance and better results in challenging conditions. It builds on known event camera properties like microsecond resolution and high dynamic range, which is a solid extension for edge vision tasks. The evaluations mentioned sound like they compare against leading techniques for blur reduction and illumination improvements, which could be valuable if the numbers hold up. It does well in framing the problem clearly and suggesting that binary silhouettes can help downstream tasks in embodied intelligence. The soft spots are around the method itself. There's no information on the fusion algorithm, how asynchrony is implemented, or any rules for combining the data. This makes it tough to see exactly how it bypasses event scarcity or why it works at extreme frame rates. The weakest assumption seems to be that a simple dual-modal setup is enough without more specifics on training or noise handling. If the full paper has those details and reproducible experiments, that would strengthen it a lot. This is the kind of work for applied researchers in computer vision and robotics interested in neuromorphic sensors for resource-limited settings. Someone looking for new ideas in high-speed binarization might get value from the overall approach. I'd say send it to peer review. The idea is relevant and the claims are testable in principle, so referees could help clarify the implementation and verify the results.

Referee Report

2 major / 1 minor

Summary. The paper proposes a dual-modal approach fusing conventional image frames with neuromorphic event-camera data for real-time binarization of quasi-bimodal objects (text, signs, barcodes) under rapid motion and harsh illumination. It claims an asynchronous workflow that avoids traditional time-binning, thereby bypassing event scarcity to produce clear binary silhouettes at extreme kilohertz frame rates on CPU-only hardware, while delivering competitive motion-blur reduction and illumination robustness relative to prior methods, with downstream utility for embodied perception tasks.

Significance. If the empirical claims hold after proper controls, the work could contribute to lightweight, high-speed silhouette extraction on resource-constrained platforms such as drones and underwater vehicles by exploiting the complementary strengths of frame and event sensing. The emphasis on CPU-only real-time operation and avoidance of event-density limitations addresses practical bottlenecks in neuromorphic vision pipelines.

major comments (2)

[Method] Method section: the asynchronous dual-modal fusion is presented only at a conceptual level with no equations, update rules, interpolation scheme, threshold logic, or pseudocode for how frames and events are combined to compensate for sparse events. This directly undermines the central claim that the workflow 'bypasses event scarcity' and maintains clear shapes at kHz rates, as it is impossible to determine whether the method genuinely compensates for low event density or simply falls back on frame data.
[Experiments] Experiments / Results: no quantitative tables, ablation studies, error bars, or controls for dataset choice and parameter tuning are provided to substantiate the reported improvements in motion blur and illumination robustness. Without these, the performance claims cannot be verified as load-bearing for the paper's conclusions.

minor comments (1)

[Abstract] Abstract: the sentence 'Extensive evaluations present that it earns competitive performance' is grammatically awkward and should be rephrased for clarity (e.g., 'Extensive evaluations demonstrate competitive performance').

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their insightful comments, which have helped us identify areas for improvement in our manuscript. We address each major comment below and outline the revisions we plan to make.

read point-by-point responses

Referee: [Method] Method section: the asynchronous dual-modal fusion is presented only at a conceptual level with no equations, update rules, interpolation scheme, threshold logic, or pseudocode for how frames and events are combined to compensate for sparse events. This directly undermines the central claim that the workflow 'bypasses event scarcity' and maintains clear shapes at kHz rates, as it is impossible to determine whether the method genuinely compensates for low event density or simply falls back on frame data.

Authors: We agree that the method description in the current version is primarily conceptual and lacks the formal details requested. To strengthen the manuscript, we will revise the Method section to include the specific equations for the dual-modal fusion, update rules for asynchronous event integration with frames, the interpolation scheme used to achieve high frame rates, the threshold logic for generating binary silhouettes, and pseudocode for the overall workflow. These additions will clarify how the approach compensates for event sparsity rather than relying solely on frame data. revision: yes
Referee: [Experiments] Experiments / Results: no quantitative tables, ablation studies, error bars, or controls for dataset choice and parameter tuning are provided to substantiate the reported improvements in motion blur and illumination robustness. Without these, the performance claims cannot be verified as load-bearing for the paper's conclusions.

Authors: We acknowledge the absence of quantitative tables, ablation studies, error bars, and detailed controls in the presented results. In the revised manuscript, we will incorporate quantitative evaluation tables comparing performance metrics against state-of-the-art methods, ablation studies isolating the contributions of frame and event modalities, error bars from repeated experiments, and explicit discussions of dataset selection criteria and parameter tuning procedures. This will provide the necessary evidence to support the claims on motion blur reduction and illumination robustness. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical method description with no self-referential derivations or fitted predictions

full rationale

The paper introduces a dual-modal fusion approach for real-time binarization of quasi-bimodal objects using frames and events, emphasizing an asynchronous workflow to handle event scarcity at high frame rates. No equations, parameter-fitting procedures, or derivation steps are presented in the abstract or described claims. The central assertions rest on empirical performance comparisons and the practical benefits of the workflow rather than any quantity that reduces to its own inputs by construction. No self-citations, uniqueness theorems, or ansatzes are invoked to justify load-bearing premises. The work is therefore self-contained, with its validity depending on external experimental validation rather than internal definitional closure.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

The abstract supplies no explicit free parameters, axioms, or invented entities; the approach appears to rely on standard assumptions about event-camera data and frame-event complementarity without stating new postulates.

pith-pipeline@v0.9.0 · 5774 in / 1140 out tokens · 38465 ms · 2026-05-20T00:52:19.898840+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We propose a simple yet effective, lightweight, and dual-modal method that leverages the synergy between frames and events for unified, single-step binarization... asynchronous state propagation strategy that treats the initial binary frame as a seed and propagates it asynchronously
IndisputableMonolith/Foundation/ArithmeticFromLogic.lean LogicNat recovery and 8-tick orbit structure unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

our asynchronous workflow bypasses event scarcity that breaks traditional time-binning reconstruction, maintaining clear target shapes even at extreme kilohertz frame rates

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

55 extracted references · 55 canonical work pages

[1]

CylinderTag: an accurate and flexible marker for cylinder-shape objects pose estimation based on projective invariants,

S. Wang, M. Zhu, Y . Hu, D. Li, F. Yuan, and J. Yu, “CylinderTag: an accurate and flexible marker for cylinder-shape objects pose estimation based on projective invariants,”IEEE Transactions on Visualization and Computer Graphics, vol. 30, no. 12, pp. 7486–7499, 2024. 11

work page 2024
[2]

A shared autonomy system for precise and efficient remote underwater manipulation,

A. Phung, G. Billings, A. F. Daniele, M. R. Walter, and R. Camilli, “A shared autonomy system for precise and efficient remote underwater manipulation,”IEEE Transactions on Robotics, vol. 40, pp. 4147–4159, 2024

work page 2024
[3]

Event-based stereo depth estimation: a sur- vey,

S. Ghosh and G. Gallego, “Event-based stereo depth estimation: a sur- vey,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 47, no. 10, pp. 9130–9149, 2025

work page 2025
[4]

Event-based vision: a survey,

G. Gallego, T. Delbruck, G. M. Orchard, C. Bartolozzi, B. Taba, A. Censi, S. Leutenegger, A. Davison, J. Conradt, K. Daniilidis, and D. Scaramuzza, “Event-based vision: a survey,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 44, no. 1, pp. 154–180, 2022

work page 2022
[5]

Low-latency automotive vision with event cameras,

D. Gehrig and D. Scaramuzza, “Low-latency automotive vision with event cameras,”Nature, vol. 629, pp. 1034–1040, 2024

work page 2024
[6]

Learning from images: a distillation learning framework for event cameras,

Y . Deng, H. Chen, H. Chen, and Y . Li, “Learning from images: a distillation learning framework for event cameras,”IEEE Transactions on Image Processing, vol. 30, pp. 4919–4931, 2021

work page 2021
[7]

SpectroGen: a physically informed gener- ative artificial intelligence for accelerated cross-modality spectroscopic materials characterization,

Y . Zhu and L. F. Tadesse, “SpectroGen: a physically informed gener- ative artificial intelligence for accelerated cross-modality spectroscopic materials characterization,”Matter, vol. 9, no. 1, p. 102434, 2026

work page 2026
[8]

Deblurring neural radiance fields with event-driven bundle adjustment,

Y . Qi, L. Zhu, Y . Zhao, N. Bao, and J. Li, “Deblurring neural radiance fields with event-driven bundle adjustment,” inProceedings of the 32nd ACM International Conference on Multimedia (ACM MM), 2024, pp. 9262–9270

work page 2024
[9]

Event-based motion deblurring with modality-aware decomposition and recomposition,

W. Yang, J. Wu, L. Li, W. Dong, and G. Shi, “Event-based motion deblurring with modality-aware decomposition and recomposition,” in Proceedings of the 31st ACM International Conference on Multimedia (ACM MM), 2023, pp. 8327–8335

work page 2023
[10]

Bringing a blurry frame alive at high frame-rate with an event camera,

L. Pan, C. Scheerlinck, X. Yu, R. Hartley, M. Liu, and Y . Dai, “Bringing a blurry frame alive at high frame-rate with an event camera,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 6820–6829

work page 2019
[11]

Binarization of document image using optimum threshold modification,

W. A. Mustafa and M. M. M. Abdul Kader, “Binarization of document image using optimum threshold modification,” inJournal of Physics: Conference Series, vol. 1019, 2018, p. 012022

work page 2018
[12]

ISauvola: improved Sauvola’s algorithm for document image binarization,

Z. Hadjadj, A. Meziane, Y . Cherfa, M. Cheriet, and I. Setitra, “ISauvola: improved Sauvola’s algorithm for document image binarization,” in International Conference on Image Analysis and Recognition, 2016, pp. 737–745

work page 2016
[13]

A threshold selection method from gray-level histograms,

N. Otsu, “A threshold selection method from gray-level histograms,” IEEE Transactions on Systems, Man, and Cybernetics, vol. 9, no. 1, pp. 62–66, 1979

work page 1979
[14]

Robust document image binarization technique for degraded document images,

B. Su, S. Lu, and C. L. Tan, “Robust document image binarization technique for degraded document images,”IEEE Transactions on Image Processing, vol. 22, no. 4, pp. 1408–1417, 2013

work page 2013
[15]

Comparison of Niblack inspired binarization methods for ancient documents,

K. Khurshid, I. Siddiqi, C. Faure, and N. Vincent, “Comparison of Niblack inspired binarization methods for ancient documents,” inDoc- ument Recognition and Retrieval XVI, vol. 7247, 2009, pp. 267–275

work page 2009
[16]

DeepOtsu: document enhancement and binarization using iterative deep learning,

S. He and L. Schomaker, “DeepOtsu: document enhancement and binarization using iterative deep learning,”Pattern Recognition, vol. 91, pp. 379–390, 2019

work page 2019
[17]

A selectional auto-encoder ap- proach for document image binarization,

J. Calvo-Zaragoza and A.-J. Gallego, “A selectional auto-encoder ap- proach for document image binarization,”Pattern Recognition, vol. 86, pp. 37–47, 2019

work page 2019
[18]

Self-BSR: self- supervised image denoising and destriping based on blind-spot reg- ularization,

C. Qu, Z. Chen, J. Zhang, X. Chen, and J. Han, “Self-BSR: self- supervised image denoising and destriping based on blind-spot reg- ularization,”IEEE Transactions on Circuits and Systems for Video Technology, vol. 35, no. 9, pp. 8666–8678, 2025

work page 2025
[19]

Degraded historical document binarization: a review on issues, challenges, techniques, and future directions,

A. Sulaiman, K. Omar, and M. F. Nasrudin, “Degraded historical document binarization: a review on issues, challenges, techniques, and future directions,”Journal of Imaging, vol. 5, no. 4, p. 48, 2019

work page 2019
[20]

EventAid: benchmarking event-aided image/video enhancement algorithms with real-captured hybrid dataset,

P. Duan, B. Li, Y . Yang, H. Lou, M. Teng, X. Zhou, Y . Ma, and B. Shi, “EventAid: benchmarking event-aided image/video enhancement algorithms with real-captured hybrid dataset,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 47, no. 8, pp. 6959– 6973, 2025

work page 2025
[21]

Dynamic obstacle avoid- ance for quadrotors with event cameras,

D. Falanga, K. Kleber, and D. Scaramuzza, “Dynamic obstacle avoid- ance for quadrotors with event cameras,”Science Robotics, vol. 5, no. 40, p. eaaz9712, 2020

work page 2020
[22]

Drone detection with event cameras,

G. Magrini, L. Berlincioni, F. Becattini, L. Cultrera, and P. Pala, “Drone detection with event cameras,” inProceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2025, pp. 4703–4714

work page 2025
[23]

Ultrafast dynamic defect inspection with computational neuromorphic imaging,

S. Zhu, Q. Yin, C. Wang, J. Huang, and E. Y . Lam, “Ultrafast dynamic defect inspection with computational neuromorphic imaging,”Advanced Science, vol. 12, no. 44, p. e10338, 2025

work page 2025
[24]

Millisecond autofocusing microscopy using neuromorphic event sensing,

Z. Ge, H. Wei, F. Xu, Y . Gao, Z. Chu, H. K.-H. So, and E. Y . Lam, “Millisecond autofocusing microscopy using neuromorphic event sensing,”Optics and Lasers in Engineering, vol. 160, p. 107247, 2023

work page 2023
[25]

Angle- based neuromorphic wave normal sensing,

C. Wang, S. Zhu, P. Zhang, K. Wang, J. Huang, and E. Y . Lam, “Angle- based neuromorphic wave normal sensing,”Laser & Photonics Reviews, vol. 19, no. 4, p. 2400647, 2025

work page 2025
[26]

Widefield diamond quantum sensing with neuromorphic vision sensors,

Z. Du, M. Gupta, F. Xu, K. Zhang, J. Zhang, Y . Zhou, Y . Liu, Z. Wang, J. Wrachtrup, N. Wong, C. Li, and Z. Chu, “Widefield diamond quantum sensing with neuromorphic vision sensors,”Advanced Science, vol. 11, no. 2, p. 2304355, 2024

work page 2024
[27]

EventLFM: event camera integrated Fourier light field microscopy for ultrafast 3D imaging,

R. Guo, Q. Yang, A. S. Chang, G. Hu, J. Greene, C. V . Gabel, S. You, and L. Tian, “EventLFM: event camera integrated Fourier light field microscopy for ultrafast 3D imaging,”Light: Science & Applications, vol. 13, p. 144, 2024

work page 2024
[28]

Neuromorphic imaging with super-resolution,

P. Zhang, S. Zhu, C. Wang, Y . Zhao, and E. Y . Lam, “Neuromorphic imaging with super-resolution,”IEEE Transactions on Circuits and Systems for Video Technology, vol. 35, no. 2, pp. 1715–1727, 2025

work page 2025
[29]

Self-calibrated neuromorphic hyperspectral derivative imaging,

R. Chen, C. Wang, Y . Li, Y . Cao, S. Zhu, and E. Y . Lam, “Self-calibrated neuromorphic hyperspectral derivative imaging,”Optica, vol. 13, no. 4, pp. 587–590, 2026

work page 2026
[30]

Embodied neuromorphic synergy for lighting-robust machine vision to see in extreme bright,

S. Lin, G. Zheng, Z. Wang, R. Han, W. Xing, Z. Zhang, Y . Peng, and J. Pan, “Embodied neuromorphic synergy for lighting-robust machine vision to see in extreme bright,”Nature Communications, vol. 15, p. 10781, 2024

work page 2024
[31]

Event camera meets mobile embodied percep- tion: abstraction, algorithm, acceleration, application,

H. Wang, R. Guo, P. Ma, C. Ruan, X. Luo, W. Ding, T. Zhong, J. Xu, Y . Liu, and X. Chen, “Event camera meets mobile embodied percep- tion: abstraction, algorithm, acceleration, application,”ACM Computing Surveys, vol. 58, no. 8, pp. 1–41, 2026

work page 2026
[32]

Continuous-time intensity estimation using event cameras,

C. Scheerlinck, N. Barnes, and R. Mahony, “Continuous-time intensity estimation using event cameras,” inAsian Conference on Computer Vision (ACCV), 2018, pp. 308–324

work page 2018
[33]

Neuromorphic imag- ing with joint image deblurring and event denoising,

P. Zhang, H. Liu, Z. Ge, C. Wang, and E. Y . Lam, “Neuromorphic imag- ing with joint image deblurring and event denoising,”IEEE Transactions on Image Processing, vol. 33, pp. 2318–2333, 2024

work page 2024
[34]

High speed and high dynamic range video with an event camera,

H. Rebecq, R. Ranftl, V . Koltun, and D. Scaramuzza, “High speed and high dynamic range video with an event camera,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 43, no. 6, pp. 1964– 1980, 2021

work page 1964
[35]

Event-based fusion for motion deblurring with cross- modal attention,

L. Sun, C. Sakaridis, J. Liang, Q. Jiang, K. Yang, P. Sun, Y . Ye, K. Wang, and L. V . Gool, “Event-based fusion for motion deblurring with cross- modal attention,” inEuropean Conference on Computer Vision (ECCV), 2022, pp. 412–428

work page 2022
[36]

Dark-EvGS: event camera as an eye for radiance field in the dark,

J. Wu, P. Duan, Z. Wang, C. Wang, B. Shi, and E. Y . Lam, “Dark-EvGS: event camera as an eye for radiance field in the dark,”IEEE Transactions on Image Processing, vol. 35, pp. 3172–3185, 2026

work page 2026
[37]

Motion- adaptive Transformer for event-based image deblurring,

S. Xu, Z. Sun, M. Zhong, C. Cao, Y . Liu, X. Fu, and Y . Chen, “Motion- adaptive Transformer for event-based image deblurring,” inProceedings of the AAAI Conference on Artificial Intelligence (AAAI), vol. 39, no. 9, 2025, pp. 8942–8950

work page 2025
[38]

Event-based video super-resolution via state space models,

Z. Xiao and X. Wang, “Event-based video super-resolution via state space models,” inProceedings of the IEEE/CVF Conference on Com- puter Vision and Pattern Recognition (CVPR), 2025, pp. 12 564–12 574

work page 2025
[39]

Diffusion-based event generation for high-quality image deblurring,

X. Xie, Q. Zhang, and W.-S. Zheng, “Diffusion-based event generation for high-quality image deblurring,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025, pp. 2194–2203

work page 2025
[40]

Event-guided HDR reconstruction with diffusion priors,

Y . Yang, J. Zhang, Y . Zhang, Y . Wei, D. Zou, J. S. Ren, and B. Shi, “Event-guided HDR reconstruction with diffusion priors,” inProceed- ings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2025, pp. 11 787–11 796

work page 2025
[41]

Neuromorphic synergy for video binarization,

S. Lin, X. Zhang, L. Yang, L. Yu, B. Zhou, X. Luo, W. Wang, and J. Pan, “Neuromorphic synergy for video binarization,”IEEE Transactions on Image Processing, vol. 33, pp. 1403–1418, 2024

work page 2024
[42]

Reading in the dark with foveated event vision,

C. Brander, G. Cioffi, N. Messikommer, and D. Scaramuzza, “Reading in the dark with foveated event vision,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2025, pp. 5044–5052

work page 2025
[43]

Front and back illuminated dynamic and active pixel vision sensors comparison,

G. Taverni, D. P. Moeys, C. Li, C. Cavaco, V . Motsnyi, D. S. S. Bello, and T. Delbruck, “Front and back illuminated dynamic and active pixel vision sensors comparison,”IEEE Transactions on Circuits and Systems II: Express Briefs, vol. 65, no. 5, pp. 677–681, 2018

work page 2018
[44]

A unifying contrast maximization framework for event cameras, with applications to motion, depth, and optical flow estimation,

G. Gallego, H. Rebecq, and D. Scaramuzza, “A unifying contrast maximization framework for event cameras, with applications to motion, depth, and optical flow estimation,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018, pp. 3867–3876. 12

work page 2018
[45]

Event encryption: rethinking privacy exposure for neuromorphic imaging,

P. Zhang, S. Zhu, and E. Y . Lam, “Event encryption: rethinking privacy exposure for neuromorphic imaging,”Neuromorphic Computing and Engineering, vol. 4, no. 1, p. 014002, 2024

work page 2024
[46]

Event camera calibration of per-pixel biased contrast threshold,

Z. Wang, Y . Ng, P. van Goor, and R. Mahony, “Event camera calibration of per-pixel biased contrast threshold,” inAustralasian Conference of Robotics and Automation (ACRA), 2019

work page 2019
[47]

Graph-based blind image deblurring from a single photograph,

Y . Bai, G. Cheung, X. Liu, and W. Gao, “Graph-based blind image deblurring from a single photograph,”IEEE Transactions on Image Processing, vol. 28, no. 3, pp. 1404–1418, 2019

work page 2019
[48]

Blind image deblurring with local maximum gradient prior,

L. Chen, F. Fang, T. Wang, and G. Zhang, “Blind image deblurring with local maximum gradient prior,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 1742–1750

work page 2019
[49]

Reducing the sim-to-real gap for event cameras,

T. Stoffregen, C. Scheerlinck, D. Scaramuzza, T. Drummond, N. Barnes, L. Kleeman, and R. Mahony, “Reducing the sim-to-real gap for event cameras,” inEuropean Conference on Computer Vision (ECCV), 2020, pp. 534–549

work page 2020
[50]

The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation,

D. Chicco and G. Jurman, “The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation,”BMC Genomics, vol. 21, p. 6, 2020

work page 2020
[51]

Character region aware- ness for text detection,

Y . Baek, B. Lee, D. Han, S. Yun, and H. Lee, “Character region aware- ness for text detection,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 9365– 9374

work page 2019
[52]

An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition,

B. Shi, X. Bai, and C. Yao, “An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no. 11, pp. 2298–2304, 2017

work page 2017
[53]

DeepTag: a general framework for fiducial marker design and detection,

Z. Zhang, Y . Hu, G. Yu, and J. Dai, “DeepTag: a general framework for fiducial marker design and detection,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 45, no. 3, pp. 2931–2944, 2023

work page 2023
[54]

RAFT: recurrent all-pairs field transforms for optical flow,

Z. Teed and J. Deng, “RAFT: recurrent all-pairs field transforms for optical flow,” inEuropean Conference on Computer Vision (ECCV), 2020, pp. 402–419

work page 2020
[55]

Neuromorphic imaging with density-based spatiotemporal denoising,

P. Zhang, Z. Ge, L. Song, and E. Y . Lam, “Neuromorphic imaging with density-based spatiotemporal denoising,”IEEE Transactions on Computational Imaging, vol. 9, pp. 530–541, 2023

work page 2023

[1] [1]

CylinderTag: an accurate and flexible marker for cylinder-shape objects pose estimation based on projective invariants,

S. Wang, M. Zhu, Y . Hu, D. Li, F. Yuan, and J. Yu, “CylinderTag: an accurate and flexible marker for cylinder-shape objects pose estimation based on projective invariants,”IEEE Transactions on Visualization and Computer Graphics, vol. 30, no. 12, pp. 7486–7499, 2024. 11

work page 2024

[2] [2]

A shared autonomy system for precise and efficient remote underwater manipulation,

A. Phung, G. Billings, A. F. Daniele, M. R. Walter, and R. Camilli, “A shared autonomy system for precise and efficient remote underwater manipulation,”IEEE Transactions on Robotics, vol. 40, pp. 4147–4159, 2024

work page 2024

[3] [3]

Event-based stereo depth estimation: a sur- vey,

S. Ghosh and G. Gallego, “Event-based stereo depth estimation: a sur- vey,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 47, no. 10, pp. 9130–9149, 2025

work page 2025

[4] [4]

Event-based vision: a survey,

G. Gallego, T. Delbruck, G. M. Orchard, C. Bartolozzi, B. Taba, A. Censi, S. Leutenegger, A. Davison, J. Conradt, K. Daniilidis, and D. Scaramuzza, “Event-based vision: a survey,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 44, no. 1, pp. 154–180, 2022

work page 2022

[5] [5]

Low-latency automotive vision with event cameras,

D. Gehrig and D. Scaramuzza, “Low-latency automotive vision with event cameras,”Nature, vol. 629, pp. 1034–1040, 2024

work page 2024

[6] [6]

Learning from images: a distillation learning framework for event cameras,

Y . Deng, H. Chen, H. Chen, and Y . Li, “Learning from images: a distillation learning framework for event cameras,”IEEE Transactions on Image Processing, vol. 30, pp. 4919–4931, 2021

work page 2021

[7] [7]

SpectroGen: a physically informed gener- ative artificial intelligence for accelerated cross-modality spectroscopic materials characterization,

Y . Zhu and L. F. Tadesse, “SpectroGen: a physically informed gener- ative artificial intelligence for accelerated cross-modality spectroscopic materials characterization,”Matter, vol. 9, no. 1, p. 102434, 2026

work page 2026

[8] [8]

Deblurring neural radiance fields with event-driven bundle adjustment,

Y . Qi, L. Zhu, Y . Zhao, N. Bao, and J. Li, “Deblurring neural radiance fields with event-driven bundle adjustment,” inProceedings of the 32nd ACM International Conference on Multimedia (ACM MM), 2024, pp. 9262–9270

work page 2024

[9] [9]

Event-based motion deblurring with modality-aware decomposition and recomposition,

W. Yang, J. Wu, L. Li, W. Dong, and G. Shi, “Event-based motion deblurring with modality-aware decomposition and recomposition,” in Proceedings of the 31st ACM International Conference on Multimedia (ACM MM), 2023, pp. 8327–8335

work page 2023

[10] [10]

Bringing a blurry frame alive at high frame-rate with an event camera,

L. Pan, C. Scheerlinck, X. Yu, R. Hartley, M. Liu, and Y . Dai, “Bringing a blurry frame alive at high frame-rate with an event camera,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 6820–6829

work page 2019

[11] [11]

Binarization of document image using optimum threshold modification,

W. A. Mustafa and M. M. M. Abdul Kader, “Binarization of document image using optimum threshold modification,” inJournal of Physics: Conference Series, vol. 1019, 2018, p. 012022

work page 2018

[12] [12]

ISauvola: improved Sauvola’s algorithm for document image binarization,

Z. Hadjadj, A. Meziane, Y . Cherfa, M. Cheriet, and I. Setitra, “ISauvola: improved Sauvola’s algorithm for document image binarization,” in International Conference on Image Analysis and Recognition, 2016, pp. 737–745

work page 2016

[13] [13]

A threshold selection method from gray-level histograms,

N. Otsu, “A threshold selection method from gray-level histograms,” IEEE Transactions on Systems, Man, and Cybernetics, vol. 9, no. 1, pp. 62–66, 1979

work page 1979

[14] [14]

Robust document image binarization technique for degraded document images,

B. Su, S. Lu, and C. L. Tan, “Robust document image binarization technique for degraded document images,”IEEE Transactions on Image Processing, vol. 22, no. 4, pp. 1408–1417, 2013

work page 2013

[15] [15]

Comparison of Niblack inspired binarization methods for ancient documents,

K. Khurshid, I. Siddiqi, C. Faure, and N. Vincent, “Comparison of Niblack inspired binarization methods for ancient documents,” inDoc- ument Recognition and Retrieval XVI, vol. 7247, 2009, pp. 267–275

work page 2009

[16] [16]

DeepOtsu: document enhancement and binarization using iterative deep learning,

S. He and L. Schomaker, “DeepOtsu: document enhancement and binarization using iterative deep learning,”Pattern Recognition, vol. 91, pp. 379–390, 2019

work page 2019

[17] [17]

A selectional auto-encoder ap- proach for document image binarization,

J. Calvo-Zaragoza and A.-J. Gallego, “A selectional auto-encoder ap- proach for document image binarization,”Pattern Recognition, vol. 86, pp. 37–47, 2019

work page 2019

[18] [18]

Self-BSR: self- supervised image denoising and destriping based on blind-spot reg- ularization,

C. Qu, Z. Chen, J. Zhang, X. Chen, and J. Han, “Self-BSR: self- supervised image denoising and destriping based on blind-spot reg- ularization,”IEEE Transactions on Circuits and Systems for Video Technology, vol. 35, no. 9, pp. 8666–8678, 2025

work page 2025

[19] [19]

Degraded historical document binarization: a review on issues, challenges, techniques, and future directions,

A. Sulaiman, K. Omar, and M. F. Nasrudin, “Degraded historical document binarization: a review on issues, challenges, techniques, and future directions,”Journal of Imaging, vol. 5, no. 4, p. 48, 2019

work page 2019

[20] [20]

EventAid: benchmarking event-aided image/video enhancement algorithms with real-captured hybrid dataset,

P. Duan, B. Li, Y . Yang, H. Lou, M. Teng, X. Zhou, Y . Ma, and B. Shi, “EventAid: benchmarking event-aided image/video enhancement algorithms with real-captured hybrid dataset,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 47, no. 8, pp. 6959– 6973, 2025

work page 2025

[21] [21]

Dynamic obstacle avoid- ance for quadrotors with event cameras,

D. Falanga, K. Kleber, and D. Scaramuzza, “Dynamic obstacle avoid- ance for quadrotors with event cameras,”Science Robotics, vol. 5, no. 40, p. eaaz9712, 2020

work page 2020

[22] [22]

Drone detection with event cameras,

G. Magrini, L. Berlincioni, F. Becattini, L. Cultrera, and P. Pala, “Drone detection with event cameras,” inProceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2025, pp. 4703–4714

work page 2025

[23] [23]

Ultrafast dynamic defect inspection with computational neuromorphic imaging,

S. Zhu, Q. Yin, C. Wang, J. Huang, and E. Y . Lam, “Ultrafast dynamic defect inspection with computational neuromorphic imaging,”Advanced Science, vol. 12, no. 44, p. e10338, 2025

work page 2025

[24] [24]

Millisecond autofocusing microscopy using neuromorphic event sensing,

Z. Ge, H. Wei, F. Xu, Y . Gao, Z. Chu, H. K.-H. So, and E. Y . Lam, “Millisecond autofocusing microscopy using neuromorphic event sensing,”Optics and Lasers in Engineering, vol. 160, p. 107247, 2023

work page 2023

[25] [25]

Angle- based neuromorphic wave normal sensing,

C. Wang, S. Zhu, P. Zhang, K. Wang, J. Huang, and E. Y . Lam, “Angle- based neuromorphic wave normal sensing,”Laser & Photonics Reviews, vol. 19, no. 4, p. 2400647, 2025

work page 2025

[26] [26]

Widefield diamond quantum sensing with neuromorphic vision sensors,

Z. Du, M. Gupta, F. Xu, K. Zhang, J. Zhang, Y . Zhou, Y . Liu, Z. Wang, J. Wrachtrup, N. Wong, C. Li, and Z. Chu, “Widefield diamond quantum sensing with neuromorphic vision sensors,”Advanced Science, vol. 11, no. 2, p. 2304355, 2024

work page 2024

[27] [27]

EventLFM: event camera integrated Fourier light field microscopy for ultrafast 3D imaging,

R. Guo, Q. Yang, A. S. Chang, G. Hu, J. Greene, C. V . Gabel, S. You, and L. Tian, “EventLFM: event camera integrated Fourier light field microscopy for ultrafast 3D imaging,”Light: Science & Applications, vol. 13, p. 144, 2024

work page 2024

[28] [28]

Neuromorphic imaging with super-resolution,

P. Zhang, S. Zhu, C. Wang, Y . Zhao, and E. Y . Lam, “Neuromorphic imaging with super-resolution,”IEEE Transactions on Circuits and Systems for Video Technology, vol. 35, no. 2, pp. 1715–1727, 2025

work page 2025

[29] [29]

Self-calibrated neuromorphic hyperspectral derivative imaging,

R. Chen, C. Wang, Y . Li, Y . Cao, S. Zhu, and E. Y . Lam, “Self-calibrated neuromorphic hyperspectral derivative imaging,”Optica, vol. 13, no. 4, pp. 587–590, 2026

work page 2026

[30] [30]

Embodied neuromorphic synergy for lighting-robust machine vision to see in extreme bright,

S. Lin, G. Zheng, Z. Wang, R. Han, W. Xing, Z. Zhang, Y . Peng, and J. Pan, “Embodied neuromorphic synergy for lighting-robust machine vision to see in extreme bright,”Nature Communications, vol. 15, p. 10781, 2024

work page 2024

[31] [31]

Event camera meets mobile embodied percep- tion: abstraction, algorithm, acceleration, application,

H. Wang, R. Guo, P. Ma, C. Ruan, X. Luo, W. Ding, T. Zhong, J. Xu, Y . Liu, and X. Chen, “Event camera meets mobile embodied percep- tion: abstraction, algorithm, acceleration, application,”ACM Computing Surveys, vol. 58, no. 8, pp. 1–41, 2026

work page 2026

[32] [32]

Continuous-time intensity estimation using event cameras,

C. Scheerlinck, N. Barnes, and R. Mahony, “Continuous-time intensity estimation using event cameras,” inAsian Conference on Computer Vision (ACCV), 2018, pp. 308–324

work page 2018

[33] [33]

Neuromorphic imag- ing with joint image deblurring and event denoising,

P. Zhang, H. Liu, Z. Ge, C. Wang, and E. Y . Lam, “Neuromorphic imag- ing with joint image deblurring and event denoising,”IEEE Transactions on Image Processing, vol. 33, pp. 2318–2333, 2024

work page 2024

[34] [34]

High speed and high dynamic range video with an event camera,

H. Rebecq, R. Ranftl, V . Koltun, and D. Scaramuzza, “High speed and high dynamic range video with an event camera,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 43, no. 6, pp. 1964– 1980, 2021

work page 1964

[35] [35]

Event-based fusion for motion deblurring with cross- modal attention,

L. Sun, C. Sakaridis, J. Liang, Q. Jiang, K. Yang, P. Sun, Y . Ye, K. Wang, and L. V . Gool, “Event-based fusion for motion deblurring with cross- modal attention,” inEuropean Conference on Computer Vision (ECCV), 2022, pp. 412–428

work page 2022

[36] [36]

Dark-EvGS: event camera as an eye for radiance field in the dark,

J. Wu, P. Duan, Z. Wang, C. Wang, B. Shi, and E. Y . Lam, “Dark-EvGS: event camera as an eye for radiance field in the dark,”IEEE Transactions on Image Processing, vol. 35, pp. 3172–3185, 2026

work page 2026

[37] [37]

Motion- adaptive Transformer for event-based image deblurring,

S. Xu, Z. Sun, M. Zhong, C. Cao, Y . Liu, X. Fu, and Y . Chen, “Motion- adaptive Transformer for event-based image deblurring,” inProceedings of the AAAI Conference on Artificial Intelligence (AAAI), vol. 39, no. 9, 2025, pp. 8942–8950

work page 2025

[38] [38]

Event-based video super-resolution via state space models,

Z. Xiao and X. Wang, “Event-based video super-resolution via state space models,” inProceedings of the IEEE/CVF Conference on Com- puter Vision and Pattern Recognition (CVPR), 2025, pp. 12 564–12 574

work page 2025

[39] [39]

Diffusion-based event generation for high-quality image deblurring,

X. Xie, Q. Zhang, and W.-S. Zheng, “Diffusion-based event generation for high-quality image deblurring,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025, pp. 2194–2203

work page 2025

[40] [40]

Event-guided HDR reconstruction with diffusion priors,

Y . Yang, J. Zhang, Y . Zhang, Y . Wei, D. Zou, J. S. Ren, and B. Shi, “Event-guided HDR reconstruction with diffusion priors,” inProceed- ings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2025, pp. 11 787–11 796

work page 2025

[41] [41]

Neuromorphic synergy for video binarization,

S. Lin, X. Zhang, L. Yang, L. Yu, B. Zhou, X. Luo, W. Wang, and J. Pan, “Neuromorphic synergy for video binarization,”IEEE Transactions on Image Processing, vol. 33, pp. 1403–1418, 2024

work page 2024

[42] [42]

Reading in the dark with foveated event vision,

C. Brander, G. Cioffi, N. Messikommer, and D. Scaramuzza, “Reading in the dark with foveated event vision,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2025, pp. 5044–5052

work page 2025

[43] [43]

Front and back illuminated dynamic and active pixel vision sensors comparison,

G. Taverni, D. P. Moeys, C. Li, C. Cavaco, V . Motsnyi, D. S. S. Bello, and T. Delbruck, “Front and back illuminated dynamic and active pixel vision sensors comparison,”IEEE Transactions on Circuits and Systems II: Express Briefs, vol. 65, no. 5, pp. 677–681, 2018

work page 2018

[44] [44]

A unifying contrast maximization framework for event cameras, with applications to motion, depth, and optical flow estimation,

G. Gallego, H. Rebecq, and D. Scaramuzza, “A unifying contrast maximization framework for event cameras, with applications to motion, depth, and optical flow estimation,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018, pp. 3867–3876. 12

work page 2018

[45] [45]

Event encryption: rethinking privacy exposure for neuromorphic imaging,

P. Zhang, S. Zhu, and E. Y . Lam, “Event encryption: rethinking privacy exposure for neuromorphic imaging,”Neuromorphic Computing and Engineering, vol. 4, no. 1, p. 014002, 2024

work page 2024

[46] [46]

Event camera calibration of per-pixel biased contrast threshold,

Z. Wang, Y . Ng, P. van Goor, and R. Mahony, “Event camera calibration of per-pixel biased contrast threshold,” inAustralasian Conference of Robotics and Automation (ACRA), 2019

work page 2019

[47] [47]

Graph-based blind image deblurring from a single photograph,

Y . Bai, G. Cheung, X. Liu, and W. Gao, “Graph-based blind image deblurring from a single photograph,”IEEE Transactions on Image Processing, vol. 28, no. 3, pp. 1404–1418, 2019

work page 2019

[48] [48]

Blind image deblurring with local maximum gradient prior,

L. Chen, F. Fang, T. Wang, and G. Zhang, “Blind image deblurring with local maximum gradient prior,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 1742–1750

work page 2019

[49] [49]

Reducing the sim-to-real gap for event cameras,

T. Stoffregen, C. Scheerlinck, D. Scaramuzza, T. Drummond, N. Barnes, L. Kleeman, and R. Mahony, “Reducing the sim-to-real gap for event cameras,” inEuropean Conference on Computer Vision (ECCV), 2020, pp. 534–549

work page 2020

[50] [50]

The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation,

D. Chicco and G. Jurman, “The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation,”BMC Genomics, vol. 21, p. 6, 2020

work page 2020

[51] [51]

Character region aware- ness for text detection,

Y . Baek, B. Lee, D. Han, S. Yun, and H. Lee, “Character region aware- ness for text detection,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 9365– 9374

work page 2019

[52] [52]

An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition,

B. Shi, X. Bai, and C. Yao, “An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no. 11, pp. 2298–2304, 2017

work page 2017

[53] [53]

DeepTag: a general framework for fiducial marker design and detection,

Z. Zhang, Y . Hu, G. Yu, and J. Dai, “DeepTag: a general framework for fiducial marker design and detection,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 45, no. 3, pp. 2931–2944, 2023

work page 2023

[54] [54]

RAFT: recurrent all-pairs field transforms for optical flow,

Z. Teed and J. Deng, “RAFT: recurrent all-pairs field transforms for optical flow,” inEuropean Conference on Computer Vision (ECCV), 2020, pp. 402–419

work page 2020

[55] [55]

Neuromorphic imaging with density-based spatiotemporal denoising,

P. Zhang, Z. Ge, L. Song, and E. Y . Lam, “Neuromorphic imaging with density-based spatiotemporal denoising,”IEEE Transactions on Computational Imaging, vol. 9, pp. 530–541, 2023

work page 2023