See Silhouettes in Motion with Neuromorphic Vision
Pith reviewed 2026-05-20 00:52 UTC · model grok-4.3
The pith
Fusing frames with neuromorphic event data produces clear binarized silhouettes of fast-moving objects on standard CPUs.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We introduce a simple yet effective dual-modal approach that harnesses the synergy between frames and events to achieve real-time, high-frame-rate binarization on CPU-only devices. Extensive evaluations show competitive performance against leading techniques in reducing motion blur, while delivering improvements under challenging illumination. The asynchronous workflow bypasses event scarcity that breaks traditional time-binning reconstruction, maintaining clear target shapes even at extreme kilohertz frame rates. Its binary results further serve as reliable representations that facilitate a range of downstream tasks.
What carries the argument
Dual-modal fusion of frames and events in an asynchronous workflow that produces binarized silhouettes.
If this is right
- The method reduces motion blur in dynamic scenes while operating in real time on CPU hardware.
- It improves binarization results under harsh or varying illumination compared to frame-only approaches.
- Clear shapes are preserved even when traditional time-binning fails due to sparse events.
- Binary outputs serve as compact inputs for downstream perception and interaction tasks.
- The workflow supports lightweight perception on edge platforms such as drones and vehicles.
Where Pith is reading between the lines
- The same fusion idea could be tested on other quasi-bimodal patterns such as moving QR codes or lane markings.
- Performance under increased event noise levels beyond the paper's evaluations would clarify practical limits.
- Integration into tracking or recognition pipelines on low-power robotics hardware offers a direct next step.
Load-bearing premise
A simple dual-modal fusion of frames and events is sufficient to produce reliable binarization without detailed specification of the fusion algorithm, training data, or failure modes under real-world event noise.
What would settle it
A side-by-side comparison at kilohertz rates in a high-noise, low-event-density scene where the output silhouettes lose clear target shapes relative to ground-truth labels.
Figures
read the original abstract
Quasi-bimodal objects, such as text, road signs, and barcodes, play a basic yet vital role in daily visual communication. By boiling these down to clear silhouettes, binarization uses a minimal language to convey essential vision cues for maximum downstream efficiency. The catch is that frame-based imaging often struggles on mobile platforms like drones, self-driving cars, and underwater vehicles. In these dynamic scenes, rapid motion and harsh lighting can make it blind, causing severe motion blur and erasing crucial details. To overcome the limits, neuromorphic vision via event cameras, featuring microsecond-level temporal resolution and high dynamic range, steps in as a natural solution. Building upon this event-driven sensing paradigm, we introduce a simple yet effective dual-modal approach that harnesses the synergy between frames and events to achieve real-time, high-frame-rate binarization on CPU-only devices. Extensive evaluations present that it earns competitive performance against leading techniques in reducing motion blur, while delivering impressive improvements under challenging illumination. Besides, our asynchronous workflow bypasses event scarcity that breaks traditional time-binning reconstruction, maintaining clear target shapes even at extreme kilohertz frame rates. Its binary results further serve as reliable representations that facilitate a range of downstream tasks. This work paves the way towards lightweight perception and interaction in embodied intelligence on resource-constrained edge platforms.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a dual-modal approach fusing conventional image frames with neuromorphic event-camera data for real-time binarization of quasi-bimodal objects (text, signs, barcodes) under rapid motion and harsh illumination. It claims an asynchronous workflow that avoids traditional time-binning, thereby bypassing event scarcity to produce clear binary silhouettes at extreme kilohertz frame rates on CPU-only hardware, while delivering competitive motion-blur reduction and illumination robustness relative to prior methods, with downstream utility for embodied perception tasks.
Significance. If the empirical claims hold after proper controls, the work could contribute to lightweight, high-speed silhouette extraction on resource-constrained platforms such as drones and underwater vehicles by exploiting the complementary strengths of frame and event sensing. The emphasis on CPU-only real-time operation and avoidance of event-density limitations addresses practical bottlenecks in neuromorphic vision pipelines.
major comments (2)
- [Method] Method section: the asynchronous dual-modal fusion is presented only at a conceptual level with no equations, update rules, interpolation scheme, threshold logic, or pseudocode for how frames and events are combined to compensate for sparse events. This directly undermines the central claim that the workflow 'bypasses event scarcity' and maintains clear shapes at kHz rates, as it is impossible to determine whether the method genuinely compensates for low event density or simply falls back on frame data.
- [Experiments] Experiments / Results: no quantitative tables, ablation studies, error bars, or controls for dataset choice and parameter tuning are provided to substantiate the reported improvements in motion blur and illumination robustness. Without these, the performance claims cannot be verified as load-bearing for the paper's conclusions.
minor comments (1)
- [Abstract] Abstract: the sentence 'Extensive evaluations present that it earns competitive performance' is grammatically awkward and should be rephrased for clarity (e.g., 'Extensive evaluations demonstrate competitive performance').
Simulated Author's Rebuttal
We thank the referee for their insightful comments, which have helped us identify areas for improvement in our manuscript. We address each major comment below and outline the revisions we plan to make.
read point-by-point responses
-
Referee: [Method] Method section: the asynchronous dual-modal fusion is presented only at a conceptual level with no equations, update rules, interpolation scheme, threshold logic, or pseudocode for how frames and events are combined to compensate for sparse events. This directly undermines the central claim that the workflow 'bypasses event scarcity' and maintains clear shapes at kHz rates, as it is impossible to determine whether the method genuinely compensates for low event density or simply falls back on frame data.
Authors: We agree that the method description in the current version is primarily conceptual and lacks the formal details requested. To strengthen the manuscript, we will revise the Method section to include the specific equations for the dual-modal fusion, update rules for asynchronous event integration with frames, the interpolation scheme used to achieve high frame rates, the threshold logic for generating binary silhouettes, and pseudocode for the overall workflow. These additions will clarify how the approach compensates for event sparsity rather than relying solely on frame data. revision: yes
-
Referee: [Experiments] Experiments / Results: no quantitative tables, ablation studies, error bars, or controls for dataset choice and parameter tuning are provided to substantiate the reported improvements in motion blur and illumination robustness. Without these, the performance claims cannot be verified as load-bearing for the paper's conclusions.
Authors: We acknowledge the absence of quantitative tables, ablation studies, error bars, and detailed controls in the presented results. In the revised manuscript, we will incorporate quantitative evaluation tables comparing performance metrics against state-of-the-art methods, ablation studies isolating the contributions of frame and event modalities, error bars from repeated experiments, and explicit discussions of dataset selection criteria and parameter tuning procedures. This will provide the necessary evidence to support the claims on motion blur reduction and illumination robustness. revision: yes
Circularity Check
No circularity: empirical method description with no self-referential derivations or fitted predictions
full rationale
The paper introduces a dual-modal fusion approach for real-time binarization of quasi-bimodal objects using frames and events, emphasizing an asynchronous workflow to handle event scarcity at high frame rates. No equations, parameter-fitting procedures, or derivation steps are presented in the abstract or described claims. The central assertions rest on empirical performance comparisons and the practical benefits of the workflow rather than any quantity that reduces to its own inputs by construction. No self-citations, uniqueness theorems, or ansatzes are invoked to justify load-bearing premises. The work is therefore self-contained, with its validity depending on external experimental validation rather than internal definitional closure.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We propose a simple yet effective, lightweight, and dual-modal method that leverages the synergy between frames and events for unified, single-step binarization... asynchronous state propagation strategy that treats the initial binary frame as a seed and propagates it asynchronously
-
IndisputableMonolith/Foundation/ArithmeticFromLogic.leanLogicNat recovery and 8-tick orbit structure unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
our asynchronous workflow bypasses event scarcity that breaks traditional time-binning reconstruction, maintaining clear target shapes even at extreme kilohertz frame rates
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
S. Wang, M. Zhu, Y . Hu, D. Li, F. Yuan, and J. Yu, “CylinderTag: an accurate and flexible marker for cylinder-shape objects pose estimation based on projective invariants,”IEEE Transactions on Visualization and Computer Graphics, vol. 30, no. 12, pp. 7486–7499, 2024. 11
work page 2024
-
[2]
A shared autonomy system for precise and efficient remote underwater manipulation,
A. Phung, G. Billings, A. F. Daniele, M. R. Walter, and R. Camilli, “A shared autonomy system for precise and efficient remote underwater manipulation,”IEEE Transactions on Robotics, vol. 40, pp. 4147–4159, 2024
work page 2024
-
[3]
Event-based stereo depth estimation: a sur- vey,
S. Ghosh and G. Gallego, “Event-based stereo depth estimation: a sur- vey,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 47, no. 10, pp. 9130–9149, 2025
work page 2025
-
[4]
G. Gallego, T. Delbruck, G. M. Orchard, C. Bartolozzi, B. Taba, A. Censi, S. Leutenegger, A. Davison, J. Conradt, K. Daniilidis, and D. Scaramuzza, “Event-based vision: a survey,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 44, no. 1, pp. 154–180, 2022
work page 2022
-
[5]
Low-latency automotive vision with event cameras,
D. Gehrig and D. Scaramuzza, “Low-latency automotive vision with event cameras,”Nature, vol. 629, pp. 1034–1040, 2024
work page 2024
-
[6]
Learning from images: a distillation learning framework for event cameras,
Y . Deng, H. Chen, H. Chen, and Y . Li, “Learning from images: a distillation learning framework for event cameras,”IEEE Transactions on Image Processing, vol. 30, pp. 4919–4931, 2021
work page 2021
-
[7]
Y . Zhu and L. F. Tadesse, “SpectroGen: a physically informed gener- ative artificial intelligence for accelerated cross-modality spectroscopic materials characterization,”Matter, vol. 9, no. 1, p. 102434, 2026
work page 2026
-
[8]
Deblurring neural radiance fields with event-driven bundle adjustment,
Y . Qi, L. Zhu, Y . Zhao, N. Bao, and J. Li, “Deblurring neural radiance fields with event-driven bundle adjustment,” inProceedings of the 32nd ACM International Conference on Multimedia (ACM MM), 2024, pp. 9262–9270
work page 2024
-
[9]
Event-based motion deblurring with modality-aware decomposition and recomposition,
W. Yang, J. Wu, L. Li, W. Dong, and G. Shi, “Event-based motion deblurring with modality-aware decomposition and recomposition,” in Proceedings of the 31st ACM International Conference on Multimedia (ACM MM), 2023, pp. 8327–8335
work page 2023
-
[10]
Bringing a blurry frame alive at high frame-rate with an event camera,
L. Pan, C. Scheerlinck, X. Yu, R. Hartley, M. Liu, and Y . Dai, “Bringing a blurry frame alive at high frame-rate with an event camera,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 6820–6829
work page 2019
-
[11]
Binarization of document image using optimum threshold modification,
W. A. Mustafa and M. M. M. Abdul Kader, “Binarization of document image using optimum threshold modification,” inJournal of Physics: Conference Series, vol. 1019, 2018, p. 012022
work page 2018
-
[12]
ISauvola: improved Sauvola’s algorithm for document image binarization,
Z. Hadjadj, A. Meziane, Y . Cherfa, M. Cheriet, and I. Setitra, “ISauvola: improved Sauvola’s algorithm for document image binarization,” in International Conference on Image Analysis and Recognition, 2016, pp. 737–745
work page 2016
-
[13]
A threshold selection method from gray-level histograms,
N. Otsu, “A threshold selection method from gray-level histograms,” IEEE Transactions on Systems, Man, and Cybernetics, vol. 9, no. 1, pp. 62–66, 1979
work page 1979
-
[14]
Robust document image binarization technique for degraded document images,
B. Su, S. Lu, and C. L. Tan, “Robust document image binarization technique for degraded document images,”IEEE Transactions on Image Processing, vol. 22, no. 4, pp. 1408–1417, 2013
work page 2013
-
[15]
Comparison of Niblack inspired binarization methods for ancient documents,
K. Khurshid, I. Siddiqi, C. Faure, and N. Vincent, “Comparison of Niblack inspired binarization methods for ancient documents,” inDoc- ument Recognition and Retrieval XVI, vol. 7247, 2009, pp. 267–275
work page 2009
-
[16]
DeepOtsu: document enhancement and binarization using iterative deep learning,
S. He and L. Schomaker, “DeepOtsu: document enhancement and binarization using iterative deep learning,”Pattern Recognition, vol. 91, pp. 379–390, 2019
work page 2019
-
[17]
A selectional auto-encoder ap- proach for document image binarization,
J. Calvo-Zaragoza and A.-J. Gallego, “A selectional auto-encoder ap- proach for document image binarization,”Pattern Recognition, vol. 86, pp. 37–47, 2019
work page 2019
-
[18]
Self-BSR: self- supervised image denoising and destriping based on blind-spot reg- ularization,
C. Qu, Z. Chen, J. Zhang, X. Chen, and J. Han, “Self-BSR: self- supervised image denoising and destriping based on blind-spot reg- ularization,”IEEE Transactions on Circuits and Systems for Video Technology, vol. 35, no. 9, pp. 8666–8678, 2025
work page 2025
-
[19]
A. Sulaiman, K. Omar, and M. F. Nasrudin, “Degraded historical document binarization: a review on issues, challenges, techniques, and future directions,”Journal of Imaging, vol. 5, no. 4, p. 48, 2019
work page 2019
-
[20]
P. Duan, B. Li, Y . Yang, H. Lou, M. Teng, X. Zhou, Y . Ma, and B. Shi, “EventAid: benchmarking event-aided image/video enhancement algorithms with real-captured hybrid dataset,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 47, no. 8, pp. 6959– 6973, 2025
work page 2025
-
[21]
Dynamic obstacle avoid- ance for quadrotors with event cameras,
D. Falanga, K. Kleber, and D. Scaramuzza, “Dynamic obstacle avoid- ance for quadrotors with event cameras,”Science Robotics, vol. 5, no. 40, p. eaaz9712, 2020
work page 2020
-
[22]
Drone detection with event cameras,
G. Magrini, L. Berlincioni, F. Becattini, L. Cultrera, and P. Pala, “Drone detection with event cameras,” inProceedings of the IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), 2025, pp. 4703–4714
work page 2025
-
[23]
Ultrafast dynamic defect inspection with computational neuromorphic imaging,
S. Zhu, Q. Yin, C. Wang, J. Huang, and E. Y . Lam, “Ultrafast dynamic defect inspection with computational neuromorphic imaging,”Advanced Science, vol. 12, no. 44, p. e10338, 2025
work page 2025
-
[24]
Millisecond autofocusing microscopy using neuromorphic event sensing,
Z. Ge, H. Wei, F. Xu, Y . Gao, Z. Chu, H. K.-H. So, and E. Y . Lam, “Millisecond autofocusing microscopy using neuromorphic event sensing,”Optics and Lasers in Engineering, vol. 160, p. 107247, 2023
work page 2023
-
[25]
Angle- based neuromorphic wave normal sensing,
C. Wang, S. Zhu, P. Zhang, K. Wang, J. Huang, and E. Y . Lam, “Angle- based neuromorphic wave normal sensing,”Laser & Photonics Reviews, vol. 19, no. 4, p. 2400647, 2025
work page 2025
-
[26]
Widefield diamond quantum sensing with neuromorphic vision sensors,
Z. Du, M. Gupta, F. Xu, K. Zhang, J. Zhang, Y . Zhou, Y . Liu, Z. Wang, J. Wrachtrup, N. Wong, C. Li, and Z. Chu, “Widefield diamond quantum sensing with neuromorphic vision sensors,”Advanced Science, vol. 11, no. 2, p. 2304355, 2024
work page 2024
-
[27]
EventLFM: event camera integrated Fourier light field microscopy for ultrafast 3D imaging,
R. Guo, Q. Yang, A. S. Chang, G. Hu, J. Greene, C. V . Gabel, S. You, and L. Tian, “EventLFM: event camera integrated Fourier light field microscopy for ultrafast 3D imaging,”Light: Science & Applications, vol. 13, p. 144, 2024
work page 2024
-
[28]
Neuromorphic imaging with super-resolution,
P. Zhang, S. Zhu, C. Wang, Y . Zhao, and E. Y . Lam, “Neuromorphic imaging with super-resolution,”IEEE Transactions on Circuits and Systems for Video Technology, vol. 35, no. 2, pp. 1715–1727, 2025
work page 2025
-
[29]
Self-calibrated neuromorphic hyperspectral derivative imaging,
R. Chen, C. Wang, Y . Li, Y . Cao, S. Zhu, and E. Y . Lam, “Self-calibrated neuromorphic hyperspectral derivative imaging,”Optica, vol. 13, no. 4, pp. 587–590, 2026
work page 2026
-
[30]
Embodied neuromorphic synergy for lighting-robust machine vision to see in extreme bright,
S. Lin, G. Zheng, Z. Wang, R. Han, W. Xing, Z. Zhang, Y . Peng, and J. Pan, “Embodied neuromorphic synergy for lighting-robust machine vision to see in extreme bright,”Nature Communications, vol. 15, p. 10781, 2024
work page 2024
-
[31]
Event camera meets mobile embodied percep- tion: abstraction, algorithm, acceleration, application,
H. Wang, R. Guo, P. Ma, C. Ruan, X. Luo, W. Ding, T. Zhong, J. Xu, Y . Liu, and X. Chen, “Event camera meets mobile embodied percep- tion: abstraction, algorithm, acceleration, application,”ACM Computing Surveys, vol. 58, no. 8, pp. 1–41, 2026
work page 2026
-
[32]
Continuous-time intensity estimation using event cameras,
C. Scheerlinck, N. Barnes, and R. Mahony, “Continuous-time intensity estimation using event cameras,” inAsian Conference on Computer Vision (ACCV), 2018, pp. 308–324
work page 2018
-
[33]
Neuromorphic imag- ing with joint image deblurring and event denoising,
P. Zhang, H. Liu, Z. Ge, C. Wang, and E. Y . Lam, “Neuromorphic imag- ing with joint image deblurring and event denoising,”IEEE Transactions on Image Processing, vol. 33, pp. 2318–2333, 2024
work page 2024
-
[34]
High speed and high dynamic range video with an event camera,
H. Rebecq, R. Ranftl, V . Koltun, and D. Scaramuzza, “High speed and high dynamic range video with an event camera,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 43, no. 6, pp. 1964– 1980, 2021
work page 1964
-
[35]
Event-based fusion for motion deblurring with cross- modal attention,
L. Sun, C. Sakaridis, J. Liang, Q. Jiang, K. Yang, P. Sun, Y . Ye, K. Wang, and L. V . Gool, “Event-based fusion for motion deblurring with cross- modal attention,” inEuropean Conference on Computer Vision (ECCV), 2022, pp. 412–428
work page 2022
-
[36]
Dark-EvGS: event camera as an eye for radiance field in the dark,
J. Wu, P. Duan, Z. Wang, C. Wang, B. Shi, and E. Y . Lam, “Dark-EvGS: event camera as an eye for radiance field in the dark,”IEEE Transactions on Image Processing, vol. 35, pp. 3172–3185, 2026
work page 2026
-
[37]
Motion- adaptive Transformer for event-based image deblurring,
S. Xu, Z. Sun, M. Zhong, C. Cao, Y . Liu, X. Fu, and Y . Chen, “Motion- adaptive Transformer for event-based image deblurring,” inProceedings of the AAAI Conference on Artificial Intelligence (AAAI), vol. 39, no. 9, 2025, pp. 8942–8950
work page 2025
-
[38]
Event-based video super-resolution via state space models,
Z. Xiao and X. Wang, “Event-based video super-resolution via state space models,” inProceedings of the IEEE/CVF Conference on Com- puter Vision and Pattern Recognition (CVPR), 2025, pp. 12 564–12 574
work page 2025
-
[39]
Diffusion-based event generation for high-quality image deblurring,
X. Xie, Q. Zhang, and W.-S. Zheng, “Diffusion-based event generation for high-quality image deblurring,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025, pp. 2194–2203
work page 2025
-
[40]
Event-guided HDR reconstruction with diffusion priors,
Y . Yang, J. Zhang, Y . Zhang, Y . Wei, D. Zou, J. S. Ren, and B. Shi, “Event-guided HDR reconstruction with diffusion priors,” inProceed- ings of the IEEE/CVF International Conference on Computer Vision (ICCV), 2025, pp. 11 787–11 796
work page 2025
-
[41]
Neuromorphic synergy for video binarization,
S. Lin, X. Zhang, L. Yang, L. Yu, B. Zhou, X. Luo, W. Wang, and J. Pan, “Neuromorphic synergy for video binarization,”IEEE Transactions on Image Processing, vol. 33, pp. 1403–1418, 2024
work page 2024
-
[42]
Reading in the dark with foveated event vision,
C. Brander, G. Cioffi, N. Messikommer, and D. Scaramuzza, “Reading in the dark with foveated event vision,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), 2025, pp. 5044–5052
work page 2025
-
[43]
Front and back illuminated dynamic and active pixel vision sensors comparison,
G. Taverni, D. P. Moeys, C. Li, C. Cavaco, V . Motsnyi, D. S. S. Bello, and T. Delbruck, “Front and back illuminated dynamic and active pixel vision sensors comparison,”IEEE Transactions on Circuits and Systems II: Express Briefs, vol. 65, no. 5, pp. 677–681, 2018
work page 2018
-
[44]
G. Gallego, H. Rebecq, and D. Scaramuzza, “A unifying contrast maximization framework for event cameras, with applications to motion, depth, and optical flow estimation,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2018, pp. 3867–3876. 12
work page 2018
-
[45]
Event encryption: rethinking privacy exposure for neuromorphic imaging,
P. Zhang, S. Zhu, and E. Y . Lam, “Event encryption: rethinking privacy exposure for neuromorphic imaging,”Neuromorphic Computing and Engineering, vol. 4, no. 1, p. 014002, 2024
work page 2024
-
[46]
Event camera calibration of per-pixel biased contrast threshold,
Z. Wang, Y . Ng, P. van Goor, and R. Mahony, “Event camera calibration of per-pixel biased contrast threshold,” inAustralasian Conference of Robotics and Automation (ACRA), 2019
work page 2019
-
[47]
Graph-based blind image deblurring from a single photograph,
Y . Bai, G. Cheung, X. Liu, and W. Gao, “Graph-based blind image deblurring from a single photograph,”IEEE Transactions on Image Processing, vol. 28, no. 3, pp. 1404–1418, 2019
work page 2019
-
[48]
Blind image deblurring with local maximum gradient prior,
L. Chen, F. Fang, T. Wang, and G. Zhang, “Blind image deblurring with local maximum gradient prior,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 1742–1750
work page 2019
-
[49]
Reducing the sim-to-real gap for event cameras,
T. Stoffregen, C. Scheerlinck, D. Scaramuzza, T. Drummond, N. Barnes, L. Kleeman, and R. Mahony, “Reducing the sim-to-real gap for event cameras,” inEuropean Conference on Computer Vision (ECCV), 2020, pp. 534–549
work page 2020
-
[50]
D. Chicco and G. Jurman, “The advantages of the Matthews correlation coefficient (MCC) over F1 score and accuracy in binary classification evaluation,”BMC Genomics, vol. 21, p. 6, 2020
work page 2020
-
[51]
Character region aware- ness for text detection,
Y . Baek, B. Lee, D. Han, S. Yun, and H. Lee, “Character region aware- ness for text detection,” inProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2019, pp. 9365– 9374
work page 2019
-
[52]
B. Shi, X. Bai, and C. Yao, “An end-to-end trainable neural network for image-based sequence recognition and its application to scene text recognition,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no. 11, pp. 2298–2304, 2017
work page 2017
-
[53]
DeepTag: a general framework for fiducial marker design and detection,
Z. Zhang, Y . Hu, G. Yu, and J. Dai, “DeepTag: a general framework for fiducial marker design and detection,”IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 45, no. 3, pp. 2931–2944, 2023
work page 2023
-
[54]
RAFT: recurrent all-pairs field transforms for optical flow,
Z. Teed and J. Deng, “RAFT: recurrent all-pairs field transforms for optical flow,” inEuropean Conference on Computer Vision (ECCV), 2020, pp. 402–419
work page 2020
-
[55]
Neuromorphic imaging with density-based spatiotemporal denoising,
P. Zhang, Z. Ge, L. Song, and E. Y . Lam, “Neuromorphic imaging with density-based spatiotemporal denoising,”IEEE Transactions on Computational Imaging, vol. 9, pp. 530–541, 2023
work page 2023
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.