Classification of Epileptic iEEG using Topological Machine Learning
Pith reviewed 2026-05-10 16:16 UTC · model grok-4.3
The pith
Dimension-reduced topological features from iEEG classify preictal, ictal and interictal states at up to 80 percent balanced accuracy across 55 patients.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By deriving persistence diagrams from multichannel iEEG recordings and vectorizing them with Carlsson coordinates, persistence images and template functions, then applying dimensionality reduction, the authors obtain up to 80 percent balanced accuracy in classifying preictal, ictal and interictal states across 55 patients. Classical machine learning classifiers reach 79.17 percent balanced accuracy, matching deep learning performance and indicating that topological features can reduce model complexity. Pipelines that keep the full multichannel feature structure overfit badly because of the high-dimensional input space.
What carries the argument
Persistence diagrams computed on iEEG time series, vectorized by Carlsson coordinates, persistence images or template functions and then dimension-reduced before classification.
If this is right
- Classical machine learning models achieve accuracy comparable to deep learning when supplied with reduced topological features.
- Preserving the full multichannel feature structure produces severe overfitting in high-dimensional settings.
- Topological representations support three-class classification without requiring patient-specific model adjustments.
- Structure-preserving dimensionality reduction is required to make topology-based features practical for multichannel neural recordings.
Where Pith is reading between the lines
- The same reduced topological pipeline could be tested on other multichannel physiological signals to check whether the accuracy gains generalize beyond epilepsy.
- Lower model complexity might allow deployment of seizure-state classifiers on portable or implantable devices that cannot run deep networks.
- Further ablation on specific frequency bands could isolate which topological features carry the strongest state information.
Load-bearing premise
The chosen topological vectorizations and dimensionality reductions preserve enough discriminative information that works across different patients without patient-specific tuning.
What would settle it
Applying the identical pipeline to an independent collection of iEEG recordings from new epilepsy patients and finding that balanced accuracy falls well below 80 percent.
read the original abstract
Epileptic seizure detection from EEG signals remains challenging due to the high dimensionality and nonlinear, potentially stochastic, dynamics of neural activity. In this work, we investigate whether features derived from topological data analysis (TDA) can improve the classification of brain states in preictal, ictal and interictal iEEG recordings from epilepsy patients using multichannel data. We analyze data from 55 patients, significantly larger than many previous studies that rely on patient-specific models. Persistence diagrams derived from iEEG signals are vectorized using several TDA representations, including Carlsson coordinates, persistence images, and template functions. To understand how topological representations interact with modern machine learning pipelines, we conduct a large-scale ablation study across multiple iEEG frequency bands, dimensionality reduction techniques, feature representations, and classifier architectures. Our experiments show that dimension-reduced topological representations achieve up to 80\% balanced accuracy for three-class classification. Interestingly, classical machine learning models perform comparably to deep learning models, achieving up to 79.17\% balanced accuracy, suggesting that carefully designed topological features can substantially reduce model complexity requirements. In contrast, pipelines preserving the full multichannel feature structure exhibit severe overfitting due to the high-dimensional feature space. These findings highlight the importance of structure-preserving dimensionality reduction when applying topology-based representations to multichannel neural data.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents an empirical study on classifying epileptic iEEG recordings into preictal, ictal, and interictal states using topological data analysis (TDA) features derived from persistence diagrams. Vectorizations such as Carlsson coordinates, persistence images, and template functions are applied, followed by dimensionality reduction, and evaluated across various frequency bands and classifiers on data from 55 patients. The central claim is that dimension-reduced TDA representations achieve up to 80% balanced accuracy, with classical machine learning models performing comparably to deep learning (up to 79.17%), while full multichannel features lead to overfitting.
Significance. If the reported accuracies hold under proper validation, this work would be significant for demonstrating the effectiveness of TDA-based features in a large-scale, multi-patient iEEG classification task. It highlights the benefits of dimensionality reduction to mitigate overfitting in high-dimensional neural data and shows that well-engineered topological features can enable simpler models to match deep learning performance, potentially lowering computational requirements in clinical applications. The scale of the ablation study across multiple components is a strength.
major comments (3)
- [Abstract] Abstract: The peak balanced accuracy of 80% is reported from a large-scale ablation study without mention of a pre-specified primary pipeline or correction for multiple comparisons across the many tested combinations of frequency bands, TDA vectorizations, dimensionality reductions, and classifiers. This selection of the maximum after observing results risks inflating the reported performance.
- [Abstract and Results] Abstract and Results: No cross-validation scheme, patient-wise data splits (e.g., leave-one-patient-out to assess generalization across patients), statistical significance tests, or error bars are described for the accuracy numbers. These details are essential to support claims of performance across 55 patients.
- [Abstract] Abstract: The claim that 'pipelines preserving the full multichannel feature structure exhibit severe overfitting due to the high-dimensional feature space' lacks quantitative support, such as comparisons of training and validation accuracies or specific overfitting metrics for those pipelines.
minor comments (2)
- [Abstract] Abstract: The abstract mentions 'up to 80% balanced accuracy' and 'up to 79.17%'; it would be clearer to specify the exact configurations achieving these or provide a table reference.
- Ensure all TDA vectorization methods (Carlsson coordinates, persistence images, templates) are briefly defined or referenced in the main text for readers unfamiliar with TDA.
Simulated Author's Rebuttal
We thank the referee for the thoughtful and constructive comments, which highlight important aspects of reporting and validation in our empirical study. We address each major comment point by point below, indicating where revisions will be made to improve clarity and rigor without altering the core findings.
read point-by-point responses
-
Referee: [Abstract] Abstract: The peak balanced accuracy of 80% is reported from a large-scale ablation study without mention of a pre-specified primary pipeline or correction for multiple comparisons across the many tested combinations of frequency bands, TDA vectorizations, dimensionality reductions, and classifiers. This selection of the maximum after observing results risks inflating the reported performance.
Authors: We agree that selecting and highlighting the maximum performance from a large ablation study without pre-specification carries a risk of over-optimism, as is common in exploratory ML work. Our ablation was intended to systematically evaluate interactions between TDA representations and pipelines rather than to claim a single optimized result. In the revised manuscript, we will designate a primary pipeline upfront (e.g., the most physiologically motivated frequency band combined with a standard vectorization and dimensionality reduction), report its performance separately, and present the full ablation results as secondary analyses. We will also add a discussion of multiple comparisons and either apply a conservative correction or report the distribution of accuracies across configurations to contextualize the peak value. revision: yes
-
Referee: [Abstract and Results] Abstract and Results: No cross-validation scheme, patient-wise data splits (e.g., leave-one-patient-out to assess generalization across patients), statistical significance tests, or error bars are described for the accuracy numbers. These details are essential to support claims of performance across 55 patients.
Authors: The Methods section of the manuscript details a patient-stratified cross-validation procedure with patient-wise splits to ensure no leakage between training and test data from the same individual, along with statistical comparisons and variability measures. However, we acknowledge that these elements are not sufficiently highlighted in the abstract or results summary. In the revision, we will explicitly state the cross-validation scheme (including leave-one-patient-out where applied), add error bars or confidence intervals to all reported accuracies, and include statistical significance tests with p-values in the results. revision: yes
-
Referee: [Abstract] Abstract: The claim that 'pipelines preserving the full multichannel feature structure exhibit severe overfitting due to the high-dimensional feature space' lacks quantitative support, such as comparisons of training and validation accuracies or specific overfitting metrics for those pipelines.
Authors: The results section already contrasts performance of full multichannel pipelines against dimension-reduced ones, showing clear generalization gaps. To strengthen the claim, we will add explicit quantitative support in the revision, including side-by-side training versus validation accuracy tables or plots for representative full-feature pipelines, along with the train-validation gap as an overfitting metric. revision: yes
Circularity Check
No significant circularity in empirical ablation study
full rationale
The paper is a purely empirical study reporting results from a large-scale ablation over TDA vectorizations, frequency bands, dimensionality reductions, and classifiers on iEEG data from 55 patients. No derivation chain, equations, or first-principles predictions are claimed; accuracies (e.g., 80% balanced accuracy) are presented as observed experimental outcomes rather than quantities derived from fitted parameters or self-referential definitions. No load-bearing self-citations, uniqueness theorems, or ansatzes are invoked to justify a mathematical result. The work is self-contained against external benchmarks as an experimental comparison, with no reduction of outputs to inputs by construction.
Axiom & Free-Parameter Ledger
free parameters (1)
- hyperparameters of classifiers and dimensionality reduction
axioms (1)
- domain assumption iEEG time series can be meaningfully embedded as point clouds whose persistent homology captures clinically relevant state differences
Reference graph
Works this paper leans on
- [1]
-
[2]
Maria Tito, Mercedes Cabrerizo, Melvin Ayala, Armando Barreto, Ian Miller, Prasanna Jayakar, and Malek Adjouadi. Classification of electroencephalographic seizure recordings into ictal and interictal files using correlation sum.Computers in Biology and Medicine, 39(7):604–614, July 2009
work page 2009
-
[3]
Erdem Tuncer and Emine Do˘ gru Bolat. Channel based epilepsy seizure type detection from electroencephalography (eeg) signals with machine learning tech- niques.Biocybernetics and Biomedical Engineering, 42(2):575–595, April 2022. 15
work page 2022
-
[4]
Seungjun Ryu, Seunghyeok Back, Seongju Lee, Hyeon Seo, Chanki Park, Kyoobin Lee, and Dong-Seok Kim. Pilot study of a single-channel eeg seizure detection algorithm using machine learning.Child’s Nervous System, May 2021
work page 2021
-
[5]
Gabriella Panuccio, Narayan P Subramaniyam, Angel Canal-Alonso, Juan M Corchado, and Carlo Ierna. Using ai to steer brain regeneration: The enhanced regenerative medicine paradigm.Big Data Analysis and Artificial Intelligence for Medical Sciences, pages 273–307, 2024
work page 2024
-
[6]
G. Yogarajan, Najah Alsubaie, G. Rajasekaran, T. Revathi, Mohammed S. Alqah- tani, Mohamed Abbas, Madshush M. Alshahrani, and Ben Othman Soufiene. Eeg-based epileptic seizure detection using binary dragonfly algorithm and deep neural network.Scientific Reports, 13(1), October 2023
work page 2023
-
[7]
Epileptic eeg sig- nal classification using machine learning based model
Garima Chandel, Sandeep Kumar Saini, and Ashish Sharma. Epileptic eeg sig- nal classification using machine learning based model. In2023 International Conference on Disruptive Technologies (ICDT), pages 733–739. IEEE, 2023
work page 2023
-
[8]
Jeff Craley, Emily Johnson, Christophe Jouny, and Archana Venkataraman. Automated inter-patient seizure detection using multichannel convolutional and recurrent neural networks.Biomedical Signal Processing and Control, 64:102360, February 2021
work page 2021
-
[9]
Amir H. Ansari, Perumpillichira J. Cherian, Alexander Caicedo, Gunnar Naulaers, Maarten De Vos, and Sabine Van Huffel. Neonatal seizure detec- tion using deep convolutional neural networks.International Journal of Neural Systems, 29(04):1850011, May 2019
work page 2019
-
[10]
Yang Li, Yang Yang, Qinghe Zheng, Yunxia Liu, Hongjun Wang, Shangling Song, and Penghui Zhao. Dynamical graph neural network with attention mechanism for epilepsy detection using single channel eeg.Medical & Biological Engineering & Computing, 62(1):307–326, October 2023
work page 2023
-
[11]
Mingkan Shen, Fuwen Yang, Peng Wen, Bo Song, and Yan Li. A real-time epilepsy seizure detection approach based on eeg using short-time fourier transform and google-net convolutional neural network.Heliyon, 10(11):e31827, June 2024
work page 2024
-
[12]
Yang Li, Zuyi Yu, Yang Chen, Chunfeng Yang, Yue Li, X. Allen Li, and Baosheng Li. Automatic seizure detection using fully convolutional nested lstm. International Journal of Neural Systems, 30(04):2050019, March 2020
work page 2020
-
[13]
Guoyang Liu, Weidong Zhou, and Minxing Geng. Automatic seizure detec- tion based on s-transform and deep convolutional neural network.International Journal of Neural Systems, 30(04):1950024, September 2019
work page 2019
-
[14]
Wenna Chen, Yixing Wang, Yuhao Ren, Hongwei Jiang, Ganqin Du, Jincan Zhang, and Jinghua Li. An automated detection of epileptic seizures eeg using cnn classifier based on feature fusion with high accuracy.BMC Medical Informatics and Decision Making, 23(1), May 2023
work page 2023
-
[15]
Xiashuang Wang, Yinglei Wang, Dunwei Liu, Ying Wang, and Zhengjun Wang. Automated recognition of epilepsy from eeg signals using a combining space–time algorithm of cnn-lstm.Scientific Reports, 13(1), 9 2023
work page 2023
-
[16]
Ziyu Wang, Feifei Liu, Shuhua Shi, Shengxiang Xia, Fulai Peng, Lin Wang, Sen Ai, and Zheng Xu. Automatic epileptic seizure detection based on persistent homology.Frontiers in Physiology, 14, 12 2023. 16
work page 2023
-
[17]
Ziwei Wang and Paolo Mengoni. Seizure classification with selected frequency bands and eeg montages: a natural language processing approach.Brain Informatics, 9(1), May 2022
work page 2022
-
[18]
Rohan Akut. Wavelet based deep learning approach for epilepsy detection.Health Information Science and Systems, 7(1), April 2019
work page 2019
-
[19]
Mamun or Rashid and Mohiuddin Ahmad
Md. Mamun or Rashid and Mohiuddin Ahmad. Epileptic seizure classification using statistical features of eeg signal. In2017 International Conference on Electrical, Computer and Communication Engineering (ECCE), pages 308–312. IEEE, 2017
work page 2017
-
[20]
Alalayah, Ebrahim Mohammed Senan, Hany F
Khaled M. Alalayah, Ebrahim Mohammed Senan, Hany F. Atlam, Ibrahim Abdulrab Ahmed, and Hamzeh Salameh Ahmad Shatnawi. Effective early detection of epileptic seizures through eeg signals using classification algorithms based on t-distributed stochastic neighbor embedding and k-means. Diagnostics, 13(11):1957, June 2023
work page 1957
-
[21]
Milind Natu, Mrinal Bachute, Shilpa Gite, Ketan Kotecha, and Ankit Vid- yarthi. Review on epileptic seizure prediction: Machine learning and deep learning approaches.Computational and Mathematical Methods in Medicine, 2022:1–17, January 2022
work page 2022
-
[22]
Y. Yacoob and L.S. Davis. Recognizing human facial expressions from long image sequences using optical flow.IEEE Transactions on Pattern Analysis and Machine Intelligence, 18(6):636–642, June 1996
work page 1996
-
[23]
M. Winterhalder, T. Maiwald, H.U. Voss, R. Aschenbrenner-Scheibe, J. Timmer, and A. Schulze-Bonhage. The seizure prediction characteristic: a general frame- work to assess and compare seizure prediction methods.Epilepsy & Behavior, 4(3):318–325, 2003
work page 2003
-
[24]
Classification of epileptic seizures in eeg data based on iterative gated graph convolution network
Yue Hu, Jian Liu, Rencheng Sun, Yongqiang Yu, and Yi Sui. Classification of epileptic seizures in eeg data based on iterative gated graph convolution network. Frontiers in Computational Neuroscience, 18, August 2024
work page 2024
-
[25]
Topology and data.Bulletin of the American Mathematical Society, 46(2):255–308, 2009
Gunnar Carlsson. Topology and data.Bulletin of the American Mathematical Society, 46(2):255–308, 2009
work page 2009
-
[26]
Henry Adams, Tamal Emerson, Michael Kirby, Rachel Neville, Chris Peterson, Patrick Shipman, and Sophia Chepushtanova. Persistence images: A stable vector representation of persistent homology.Journal of Machine Learning Research, 18(8):1–35, 2017
work page 2017
- [27]
-
[28]
John M. Bernabei, Adam Li, Andrew Y. Revell, Rachel J. Smith, Kristin M. Gunnarsdottir, Ian Z. Ong, Kathryn A. Davis, Nishant Sinha, Sridevi Sarma, and Brian Litt. Hup ieeg epilepsy dataset. 2023
work page 2023
-
[29]
C. Wang et al. Artificial intelligence in electroencephalography analysis for epilepsy: A review.Frontiers in Neurology, 2025
work page 2025
-
[30]
Yuan Wang, Hernando Ombao, and Moo K. Chung. Topological data analysis of single-trial electroencephalographic signals.The Annals of Applied Statistics, 17 12(3), September 2018. 18
work page 2018
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.