Protect the Brain When Treating the Heart: A Convolutional Neural Network for Detecting Emboli
Pith reviewed 2026-05-08 12:31 UTC · model grok-4.3
The pith
A 2.5D U-Net segments gaseous microemboli in cardiac ultrasound videos with robust accuracy and real-time speed.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
A 2.5D U-Net architecture applied to space-time connected ultrasound data yields robust detection of gaseous microemboli against background structures, high segmentation accuracy, and real-time execution speed, enabling integration into patient-monitoring surgical protocols that quantify GME area over time.
What carries the argument
The 2.5D U-Net, a convolutional network that processes short sequences of ultrasound frames to outline gaseous microemboli across both spatial position and time.
If this is right
- The pipeline can be added to existing surgical monitoring systems without changing workflow.
- GME area can be quantified continuously over the course of a procedure.
- Detection stays reliable even when background structures resemble bubbles.
- The same approach applies to both open-heart surgery and transcatheter interventions.
- Real-time performance is preserved while segmentation accuracy remains high.
Where Pith is reading between the lines
- Automated GME tracking could support future studies that link bubble volume directly to post-operative neurological outcomes.
- The method might be retrained on other ultrasound views or for different transient objects such as solid emboli.
- Hospitals could add threshold alerts that notify the team when cumulative GME area exceeds a chosen safety limit.
- Standardized AI monitoring might reduce differences in emboli reporting across different surgical centers.
Load-bearing premise
The 2.5D U-Net maintains robust detection and high accuracy when applied to real clinical ultrasound data that include varying operator views and high velocities.
What would settle it
A set of live operating-room ultrasound recordings from multiple patients and operators in which the network's bubble outlines match expert manual review less than 80 percent of the time or cannot keep up with video frame rate on standard hospital hardware.
Figures
read the original abstract
Gaseous microemboli (GME) represent a common complication of cardiac structural interventions across both surgical and transcatheter approaches. Transthoracic cardiac ultrasound imaging represents a convenient methodology to visualize the presence of circulating GME. However, their detection and quantification are far from trivial due to operator-dependent view, high velocity, and objects with similar structure in the background. Here, we propose an approach based on a 2.5D U-Net architecture to segment GME in space-time connected data. Such an approach yields robust detection against the background and high segmentation accuracy while retaining real-time execution speed. These properties facilitated the integration of the proposed pipeline into patient-monitoring surgical protocols, providing the quantification of GME area over time.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes a 2.5D U-Net architecture to segment gaseous microemboli (GME) in space-time connected transthoracic ultrasound data. It claims this yields robust background rejection, high segmentation accuracy (via Dice/IoU), and real-time inference (sub-30 ms) sufficient for integration into surgical monitoring protocols to quantify GME area over time.
Significance. If the reported metrics generalize, the work offers a practical tool for real-time emboli monitoring during cardiac interventions, addressing a clinically relevant complication. The temporal stacking in the 2.5D design and timing benchmarks are strengths that support feasibility; however, the overall significance is moderated by the absence of external validation or baselines, limiting claims of robustness across varying clinical conditions.
major comments (2)
- [Results] Results section: Performance (Dice/IoU) is reported only on an internal train-test split of the dataset. This does not address the operator-dependent views, high velocities, and background objects emphasized in the Introduction as core challenges; without patient-wise cross-validation, multi-center data, or external test sets, the central claim of 'robust detection' lacks sufficient support.
- [Methods] Methods section: The 2.5D U-Net with temporal stacking is described as standard, but no comparisons to 2D U-Net, 3D U-Net, or non-DL baselines (e.g., intensity thresholding) are provided. This makes it impossible to quantify the specific benefit of the proposed architecture for the space-time segmentation task.
minor comments (2)
- [Abstract] Abstract: Claims of 'high segmentation accuracy' and 'robust detection' are made without any numerical results (e.g., Dice score or timing); adding these would immediately ground the assertions.
- [Methods] Figure captions and Methods: Clarify the exact temporal stacking (number of frames) and augmentation strategy used for the space-time volumes; this detail is needed to reproduce the real-time performance.
Simulated Author's Rebuttal
We are grateful to the referee for their constructive comments on our manuscript. We have revised the paper to address the concerns raised and provide point-by-point responses below.
read point-by-point responses
-
Referee: [Results] Results section: Performance (Dice/IoU) is reported only on an internal train-test split of the dataset. This does not address the operator-dependent views, high velocities, and background objects emphasized in the Introduction as core challenges; without patient-wise cross-validation, multi-center data, or external test sets, the central claim of 'robust detection' lacks sufficient support.
Authors: We acknowledge the importance of robust evaluation for the claimed detection performance. In the revised manuscript, we have incorporated patient-wise cross-validation results to better account for variability across patients, operators, and views. These additional experiments, reported in the updated Results section, demonstrate consistent performance metrics, thereby supporting the robustness against the challenges outlined in the Introduction. We note that multi-center or external test sets are not available for this study and have explicitly discussed this as a limitation in the revised Discussion section. revision: partial
-
Referee: [Methods] Methods section: The 2.5D U-Net with temporal stacking is described as standard, but no comparisons to 2D U-Net, 3D U-Net, or non-DL baselines (e.g., intensity thresholding) are provided. This makes it impossible to quantify the specific benefit of the proposed architecture for the space-time segmentation task.
Authors: We agree that comparative analyses are necessary to highlight the benefits of the 2.5D architecture. Accordingly, we have added comparisons against 2D U-Net, 3D U-Net, and an intensity-based thresholding baseline in the revised Methods and Results sections. The quantitative results show that the temporal component in the 2.5D U-Net provides improved segmentation accuracy for high-velocity GME while maintaining real-time performance, as now detailed in the manuscript. revision: yes
- The lack of multi-center or external validation datasets, which cannot be addressed without collecting new data from additional centers.
Circularity Check
No circularity; empirical application of standard 2.5D U-Net
full rationale
The manuscript applies a 2.5D U-Net to space-time ultrasound data for GME segmentation. No derivation chain, uniqueness theorem, or parameter-fitting step is present that reduces a claimed prediction or result to its own inputs by construction. Methods consist of standard architecture, loss, augmentation, and internal-split evaluation; reported Dice/IoU and timing figures are direct empirical outcomes rather than self-referential. No self-citations are load-bearing for any mathematical claim, and no ansatz or renaming of known results occurs.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Ultrasound images can be treated as space-time connected data for segmentation.
Reference graph
Works this paper leans on
-
[1]
Christoph Angermann, Markus Haltmeier, Ruth Steiger, Sergiy Pereverzyev Jr., and Elke Ruth Gizewski. Projection- based 2.5d u-net architecture for fast volumetric segmenta- tion.CoRR, abs/1902.00347, 2019
-
[2]
Angino and K
A. Angino and K. Trotti. Aircatch. https://github.com/ AnginoA/Aircatch, 2025. GitHub repository
2025
-
[3]
Paola Antonello, Diego Morone, Edisa Pirani, Mariagrazia Uguccioni, Marcus Thelen, Rolf Krause, and Diego Piz- zagalli. Tracking unlabeled cancer cells imaged with low resolution in wide migration chambers via u-net class-1 probability (pseudofluorescence).Journal of Biological En- gineering, 17, 01 2023
2023
-
[4]
Picie: Unsupervised semantic segmentation using invariance and equivariance in clustering
Jang Hyun Cho, Utkarsh Mall, Kavita Bala, and Bharath Hariharan. Picie: Unsupervised semantic segmentation using invariance and equivariance in clustering. InProceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pages 16794–16804, June 2021
2021
-
[5]
Emma M. L. Chung, Caroline Banahan, Nikil Patel, Justyna Janus, David Marshall, Mark A. Horsfield, Cl ´ement Rousseau, Jonathan Keelan, David H. Evans, and James P. Hague. Size distribution of air bubbles entering the brain during cardiac surgery.PLOS ONE, 10(4):1–11, 04 2015
2015
-
[6]
Dueholm Vestergaard, U
C. Dueholm Vestergaard, U. Elstrøm Vindelev, L. P. Muren, J. Ren, O. Nørrevang, K. Jensen, and V . T. Taasti. Data augmentation for medical imaging: A systematic literature re- view.Physics Imaging and Radiation Oncology, 32:100658, 2024
2024
-
[7]
Pylv ¨an¨ainen, St´ephane U
Dmitry Ershov, Minh-Son Phan, Joanna W. Pylv ¨an¨ainen, St´ephane U. Rigaud, Laure Le Blanc, Arthur Charles-Orszag, James R. W. Conway, Romain F. Laine, Nathan H. Roy, Daria Bonazzi, Guillaume Dum ´enil, Guillaume Jacquemet, and Jean-Yves Tinevez. Trackmate 7: integrating state-of-the- art segmentation algorithms into tracking pipelines.Nature Methods, 19...
2022
-
[8]
Mask r-cnn
Kaiming He, Georgia Gkioxari, Piotr Dollar, and Ross Gir- shick. Mask r-cnn. InProceedings of the IEEE International Conference on Computer Vision (ICCV), October 2017
2017
-
[9]
Henriques, and Andrea Vedaldi
Xu Ji, Joao F. Henriques, and Andrea Vedaldi. Invariant infor- mation clustering for unsupervised image classification and segmentation. InProceedings of the IEEE/CVF International Conference on Computer Vision (ICCV), October 2019
2019
-
[10]
Transformers in medical image segmentation: a narrative review.Quantitative Imaging in Medicine and Surgery, 13(12), 2023
Rabeea Fatma Khan, Byoung-Dai Lee, and Mu Sook Lee. Transformers in medical image segmentation: a narrative review.Quantitative Imaging in Medicine and Surgery, 13(12), 2023
2023
-
[11]
Investigation of air bubble properties: Relevance to prevention of coronary air embolism during cardiac surgery.Artificial Organs, 45(9):E349–E358, September 2021
Kazuki Kihara and Kazumasa Orihashi. Investigation of air bubble properties: Relevance to prevention of coronary air embolism during cardiac surgery.Artificial Organs, 45(9):E349–E358, September 2021. © 2021 International Center for Artificial Organs and Transplantation and Wiley Periodicals, Inc
2021
-
[12]
A. Kurz, K. Hauser, H. A. Mehrtens, E. Krieghoff-Henning, A. Hekler, J. N. Kather, S. Fr ¨ohling, C. von Kalle, and T. J. Brinker. Uncertainty estimation in medical image classification: Systematic review.JMIR Medical Informatics, 10(8):e36427, 2022
2022
-
[13]
Self- supervised vision transformers are efficient segmentation learners for imperfect labels, 2024
Seungho Lee, Seoungyoon Kang, and Hyunjung Shim. Self- supervised vision transformers are efficient segmentation learners for imperfect labels, 2024
2024
-
[14]
Fully convolutional networks for semantic segmentation
Jonathan Long, Evan Shelhamer, and Trevor Darrell. Fully convolutional networks for semantic segmentation. InPro- ceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 2015
2015
-
[15]
An analytics-driven review of u-net for medical image seg- mentation.Healthcare Analytics, 8:100416, 2025
Fnu Neha, Deepshikha Bhati, Deepak Kumar Shukla, Son- avi Makarand Dalvi, Nikolaos Mantzou, and Safa Shubbar. An analytics-driven review of u-net for medical image seg- mentation.Healthcare Analytics, 8:100416, 2025
2025
-
[16]
Retained intrac- ardiac air in cardiovascular surgery: a re-visited problem
Kazumasa Orihashi and Tsuyoshi Miyata. Retained intrac- ardiac air in cardiovascular surgery: a re-visited problem. General Thoracic and Cardiovascular Surgery, 72(7):429– 438, July 2024. Published online 2024/07/01
2024
-
[17]
U-net: Convolutional networks for biomedical image segmentation
Olaf Ronneberger, Philipp Fischer, and Thomas Brox. U-net: Convolutional networks for biomedical image segmentation. In Nassir Navab, Joachim Hornegger, William M. Wells, and Alejandro F. Frangi, editors,Medical Image Computing and Computer-Assisted Intervention – MICCAI 2015, pages 234– 241, Cham, 2015. Springer International Publishing
2015
-
[18]
Elkin, and Vijay Devabhaktuni
Nahian Siddique, Sidike Paheding, Colin P. Elkin, and Vijay Devabhaktuni. U-net and its variants for medical image segmentation: A review of theory and applications.IEEE Access, 9:82031–82057, 2021
2021
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.