pith. sign in

arxiv: 2604.11971 · v1 · submitted 2026-04-13 · 💻 cs.LG · stat.AP

Classification of Epileptic iEEG using Topological Machine Learning

Pith reviewed 2026-05-10 16:16 UTC · model grok-4.3

classification 💻 cs.LG stat.AP
keywords topological data analysisepilepsyiEEGseizure detectionpersistence diagramsmachine learningdimensionality reductionbrain state classification
0
0 comments X

The pith

Dimension-reduced topological features from iEEG classify preictal, ictal and interictal states at up to 80 percent balanced accuracy across 55 patients.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper investigates whether features drawn from topological data analysis of intracranial EEG can distinguish among pre-seizure, seizure, and non-seizure brain states. Persistence diagrams computed on multichannel recordings are turned into vectors through Carlsson coordinates, persistence images and template functions, then passed through dimensionality reduction before classification. Experiments on data from 55 patients show that these reduced representations reach 80 percent balanced accuracy for three-class problems, while classical machine learning models perform nearly as well as deep learning models at 79.17 percent. Retaining the full high-dimensional multichannel structure produces severe overfitting. The results indicate that structure-preserving reduction of topological summaries makes multichannel neural data more tractable for state classification.

Core claim

By deriving persistence diagrams from multichannel iEEG recordings and vectorizing them with Carlsson coordinates, persistence images and template functions, then applying dimensionality reduction, the authors obtain up to 80 percent balanced accuracy in classifying preictal, ictal and interictal states across 55 patients. Classical machine learning classifiers reach 79.17 percent balanced accuracy, matching deep learning performance and indicating that topological features can reduce model complexity. Pipelines that keep the full multichannel feature structure overfit badly because of the high-dimensional input space.

What carries the argument

Persistence diagrams computed on iEEG time series, vectorized by Carlsson coordinates, persistence images or template functions and then dimension-reduced before classification.

If this is right

  • Classical machine learning models achieve accuracy comparable to deep learning when supplied with reduced topological features.
  • Preserving the full multichannel feature structure produces severe overfitting in high-dimensional settings.
  • Topological representations support three-class classification without requiring patient-specific model adjustments.
  • Structure-preserving dimensionality reduction is required to make topology-based features practical for multichannel neural recordings.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same reduced topological pipeline could be tested on other multichannel physiological signals to check whether the accuracy gains generalize beyond epilepsy.
  • Lower model complexity might allow deployment of seizure-state classifiers on portable or implantable devices that cannot run deep networks.
  • Further ablation on specific frequency bands could isolate which topological features carry the strongest state information.

Load-bearing premise

The chosen topological vectorizations and dimensionality reductions preserve enough discriminative information that works across different patients without patient-specific tuning.

What would settle it

Applying the identical pipeline to an independent collection of iEEG recordings from new epilepsy patients and finding that balanced accuracy falls well below 80 percent.

read the original abstract

Epileptic seizure detection from EEG signals remains challenging due to the high dimensionality and nonlinear, potentially stochastic, dynamics of neural activity. In this work, we investigate whether features derived from topological data analysis (TDA) can improve the classification of brain states in preictal, ictal and interictal iEEG recordings from epilepsy patients using multichannel data. We analyze data from 55 patients, significantly larger than many previous studies that rely on patient-specific models. Persistence diagrams derived from iEEG signals are vectorized using several TDA representations, including Carlsson coordinates, persistence images, and template functions. To understand how topological representations interact with modern machine learning pipelines, we conduct a large-scale ablation study across multiple iEEG frequency bands, dimensionality reduction techniques, feature representations, and classifier architectures. Our experiments show that dimension-reduced topological representations achieve up to 80\% balanced accuracy for three-class classification. Interestingly, classical machine learning models perform comparably to deep learning models, achieving up to 79.17\% balanced accuracy, suggesting that carefully designed topological features can substantially reduce model complexity requirements. In contrast, pipelines preserving the full multichannel feature structure exhibit severe overfitting due to the high-dimensional feature space. These findings highlight the importance of structure-preserving dimensionality reduction when applying topology-based representations to multichannel neural data.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The manuscript presents an empirical study on classifying epileptic iEEG recordings into preictal, ictal, and interictal states using topological data analysis (TDA) features derived from persistence diagrams. Vectorizations such as Carlsson coordinates, persistence images, and template functions are applied, followed by dimensionality reduction, and evaluated across various frequency bands and classifiers on data from 55 patients. The central claim is that dimension-reduced TDA representations achieve up to 80% balanced accuracy, with classical machine learning models performing comparably to deep learning (up to 79.17%), while full multichannel features lead to overfitting.

Significance. If the reported accuracies hold under proper validation, this work would be significant for demonstrating the effectiveness of TDA-based features in a large-scale, multi-patient iEEG classification task. It highlights the benefits of dimensionality reduction to mitigate overfitting in high-dimensional neural data and shows that well-engineered topological features can enable simpler models to match deep learning performance, potentially lowering computational requirements in clinical applications. The scale of the ablation study across multiple components is a strength.

major comments (3)
  1. [Abstract] Abstract: The peak balanced accuracy of 80% is reported from a large-scale ablation study without mention of a pre-specified primary pipeline or correction for multiple comparisons across the many tested combinations of frequency bands, TDA vectorizations, dimensionality reductions, and classifiers. This selection of the maximum after observing results risks inflating the reported performance.
  2. [Abstract and Results] Abstract and Results: No cross-validation scheme, patient-wise data splits (e.g., leave-one-patient-out to assess generalization across patients), statistical significance tests, or error bars are described for the accuracy numbers. These details are essential to support claims of performance across 55 patients.
  3. [Abstract] Abstract: The claim that 'pipelines preserving the full multichannel feature structure exhibit severe overfitting due to the high-dimensional feature space' lacks quantitative support, such as comparisons of training and validation accuracies or specific overfitting metrics for those pipelines.
minor comments (2)
  1. [Abstract] Abstract: The abstract mentions 'up to 80% balanced accuracy' and 'up to 79.17%'; it would be clearer to specify the exact configurations achieving these or provide a table reference.
  2. Ensure all TDA vectorization methods (Carlsson coordinates, persistence images, templates) are briefly defined or referenced in the main text for readers unfamiliar with TDA.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the thoughtful and constructive comments, which highlight important aspects of reporting and validation in our empirical study. We address each major comment point by point below, indicating where revisions will be made to improve clarity and rigor without altering the core findings.

read point-by-point responses
  1. Referee: [Abstract] Abstract: The peak balanced accuracy of 80% is reported from a large-scale ablation study without mention of a pre-specified primary pipeline or correction for multiple comparisons across the many tested combinations of frequency bands, TDA vectorizations, dimensionality reductions, and classifiers. This selection of the maximum after observing results risks inflating the reported performance.

    Authors: We agree that selecting and highlighting the maximum performance from a large ablation study without pre-specification carries a risk of over-optimism, as is common in exploratory ML work. Our ablation was intended to systematically evaluate interactions between TDA representations and pipelines rather than to claim a single optimized result. In the revised manuscript, we will designate a primary pipeline upfront (e.g., the most physiologically motivated frequency band combined with a standard vectorization and dimensionality reduction), report its performance separately, and present the full ablation results as secondary analyses. We will also add a discussion of multiple comparisons and either apply a conservative correction or report the distribution of accuracies across configurations to contextualize the peak value. revision: yes

  2. Referee: [Abstract and Results] Abstract and Results: No cross-validation scheme, patient-wise data splits (e.g., leave-one-patient-out to assess generalization across patients), statistical significance tests, or error bars are described for the accuracy numbers. These details are essential to support claims of performance across 55 patients.

    Authors: The Methods section of the manuscript details a patient-stratified cross-validation procedure with patient-wise splits to ensure no leakage between training and test data from the same individual, along with statistical comparisons and variability measures. However, we acknowledge that these elements are not sufficiently highlighted in the abstract or results summary. In the revision, we will explicitly state the cross-validation scheme (including leave-one-patient-out where applied), add error bars or confidence intervals to all reported accuracies, and include statistical significance tests with p-values in the results. revision: yes

  3. Referee: [Abstract] Abstract: The claim that 'pipelines preserving the full multichannel feature structure exhibit severe overfitting due to the high-dimensional feature space' lacks quantitative support, such as comparisons of training and validation accuracies or specific overfitting metrics for those pipelines.

    Authors: The results section already contrasts performance of full multichannel pipelines against dimension-reduced ones, showing clear generalization gaps. To strengthen the claim, we will add explicit quantitative support in the revision, including side-by-side training versus validation accuracy tables or plots for representative full-feature pipelines, along with the train-validation gap as an overfitting metric. revision: yes

Circularity Check

0 steps flagged

No significant circularity in empirical ablation study

full rationale

The paper is a purely empirical study reporting results from a large-scale ablation over TDA vectorizations, frequency bands, dimensionality reductions, and classifiers on iEEG data from 55 patients. No derivation chain, equations, or first-principles predictions are claimed; accuracies (e.g., 80% balanced accuracy) are presented as observed experimental outcomes rather than quantities derived from fitted parameters or self-referential definitions. No load-bearing self-citations, uniqueness theorems, or ansatzes are invoked to justify a mathematical result. The work is self-contained against external benchmarks as an experimental comparison, with no reduction of outputs to inputs by construction.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

Abstract supplies no explicit free parameters or invented entities; relies on standard TDA and ML assumptions.

free parameters (1)
  • hyperparameters of classifiers and dimensionality reduction
    Standard ML pipeline choices (number of components, kernel parameters, etc.) that affect reported accuracy but are not enumerated.
axioms (1)
  • domain assumption iEEG time series can be meaningfully embedded as point clouds whose persistent homology captures clinically relevant state differences
    Implicit in the decision to compute persistence diagrams from multichannel signals.

pith-pipeline@v0.9.0 · 5542 in / 1221 out tokens · 63950 ms · 2026-05-10T16:16:23.114440+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

30 extracted references · 30 canonical work pages

  1. [1]

    Roy et al

    S. Roy et al. Evaluation of artificial intelligence systems for assisting diagnosis of epilepsy.EBioMedicine, 69:103443, 2021

  2. [2]

    Classification of electroencephalographic seizure recordings into ictal and interictal files using correlation sum.Computers in Biology and Medicine, 39(7):604–614, July 2009

    Maria Tito, Mercedes Cabrerizo, Melvin Ayala, Armando Barreto, Ian Miller, Prasanna Jayakar, and Malek Adjouadi. Classification of electroencephalographic seizure recordings into ictal and interictal files using correlation sum.Computers in Biology and Medicine, 39(7):604–614, July 2009

  3. [3]

    Erdem Tuncer and Emine Do˘ gru Bolat. Channel based epilepsy seizure type detection from electroencephalography (eeg) signals with machine learning tech- niques.Biocybernetics and Biomedical Engineering, 42(2):575–595, April 2022. 15

  4. [4]

    Pilot study of a single-channel eeg seizure detection algorithm using machine learning.Child’s Nervous System, May 2021

    Seungjun Ryu, Seunghyeok Back, Seongju Lee, Hyeon Seo, Chanki Park, Kyoobin Lee, and Dong-Seok Kim. Pilot study of a single-channel eeg seizure detection algorithm using machine learning.Child’s Nervous System, May 2021

  5. [5]

    Using ai to steer brain regeneration: The enhanced regenerative medicine paradigm.Big Data Analysis and Artificial Intelligence for Medical Sciences, pages 273–307, 2024

    Gabriella Panuccio, Narayan P Subramaniyam, Angel Canal-Alonso, Juan M Corchado, and Carlo Ierna. Using ai to steer brain regeneration: The enhanced regenerative medicine paradigm.Big Data Analysis and Artificial Intelligence for Medical Sciences, pages 273–307, 2024

  6. [6]

    Yogarajan, Najah Alsubaie, G

    G. Yogarajan, Najah Alsubaie, G. Rajasekaran, T. Revathi, Mohammed S. Alqah- tani, Mohamed Abbas, Madshush M. Alshahrani, and Ben Othman Soufiene. Eeg-based epileptic seizure detection using binary dragonfly algorithm and deep neural network.Scientific Reports, 13(1), October 2023

  7. [7]

    Epileptic eeg sig- nal classification using machine learning based model

    Garima Chandel, Sandeep Kumar Saini, and Ashish Sharma. Epileptic eeg sig- nal classification using machine learning based model. In2023 International Conference on Disruptive Technologies (ICDT), pages 733–739. IEEE, 2023

  8. [8]

    Automated inter-patient seizure detection using multichannel convolutional and recurrent neural networks.Biomedical Signal Processing and Control, 64:102360, February 2021

    Jeff Craley, Emily Johnson, Christophe Jouny, and Archana Venkataraman. Automated inter-patient seizure detection using multichannel convolutional and recurrent neural networks.Biomedical Signal Processing and Control, 64:102360, February 2021

  9. [9]

    Ansari, Perumpillichira J

    Amir H. Ansari, Perumpillichira J. Cherian, Alexander Caicedo, Gunnar Naulaers, Maarten De Vos, and Sabine Van Huffel. Neonatal seizure detec- tion using deep convolutional neural networks.International Journal of Neural Systems, 29(04):1850011, May 2019

  10. [10]

    Dynamical graph neural network with attention mechanism for epilepsy detection using single channel eeg.Medical & Biological Engineering & Computing, 62(1):307–326, October 2023

    Yang Li, Yang Yang, Qinghe Zheng, Yunxia Liu, Hongjun Wang, Shangling Song, and Penghui Zhao. Dynamical graph neural network with attention mechanism for epilepsy detection using single channel eeg.Medical & Biological Engineering & Computing, 62(1):307–326, October 2023

  11. [11]

    A real-time epilepsy seizure detection approach based on eeg using short-time fourier transform and google-net convolutional neural network.Heliyon, 10(11):e31827, June 2024

    Mingkan Shen, Fuwen Yang, Peng Wen, Bo Song, and Yan Li. A real-time epilepsy seizure detection approach based on eeg using short-time fourier transform and google-net convolutional neural network.Heliyon, 10(11):e31827, June 2024

  12. [12]

    Allen Li, and Baosheng Li

    Yang Li, Zuyi Yu, Yang Chen, Chunfeng Yang, Yue Li, X. Allen Li, and Baosheng Li. Automatic seizure detection using fully convolutional nested lstm. International Journal of Neural Systems, 30(04):2050019, March 2020

  13. [13]

    Automatic seizure detec- tion based on s-transform and deep convolutional neural network.International Journal of Neural Systems, 30(04):1950024, September 2019

    Guoyang Liu, Weidong Zhou, and Minxing Geng. Automatic seizure detec- tion based on s-transform and deep convolutional neural network.International Journal of Neural Systems, 30(04):1950024, September 2019

  14. [14]

    An automated detection of epileptic seizures eeg using cnn classifier based on feature fusion with high accuracy.BMC Medical Informatics and Decision Making, 23(1), May 2023

    Wenna Chen, Yixing Wang, Yuhao Ren, Hongwei Jiang, Ganqin Du, Jincan Zhang, and Jinghua Li. An automated detection of epileptic seizures eeg using cnn classifier based on feature fusion with high accuracy.BMC Medical Informatics and Decision Making, 23(1), May 2023

  15. [15]

    Automated recognition of epilepsy from eeg signals using a combining space–time algorithm of cnn-lstm.Scientific Reports, 13(1), 9 2023

    Xiashuang Wang, Yinglei Wang, Dunwei Liu, Ying Wang, and Zhengjun Wang. Automated recognition of epilepsy from eeg signals using a combining space–time algorithm of cnn-lstm.Scientific Reports, 13(1), 9 2023

  16. [16]

    Automatic epileptic seizure detection based on persistent homology.Frontiers in Physiology, 14, 12 2023

    Ziyu Wang, Feifei Liu, Shuhua Shi, Shengxiang Xia, Fulai Peng, Lin Wang, Sen Ai, and Zheng Xu. Automatic epileptic seizure detection based on persistent homology.Frontiers in Physiology, 14, 12 2023. 16

  17. [17]

    Seizure classification with selected frequency bands and eeg montages: a natural language processing approach.Brain Informatics, 9(1), May 2022

    Ziwei Wang and Paolo Mengoni. Seizure classification with selected frequency bands and eeg montages: a natural language processing approach.Brain Informatics, 9(1), May 2022

  18. [18]

    Wavelet based deep learning approach for epilepsy detection.Health Information Science and Systems, 7(1), April 2019

    Rohan Akut. Wavelet based deep learning approach for epilepsy detection.Health Information Science and Systems, 7(1), April 2019

  19. [19]

    Mamun or Rashid and Mohiuddin Ahmad

    Md. Mamun or Rashid and Mohiuddin Ahmad. Epileptic seizure classification using statistical features of eeg signal. In2017 International Conference on Electrical, Computer and Communication Engineering (ECCE), pages 308–312. IEEE, 2017

  20. [20]

    Alalayah, Ebrahim Mohammed Senan, Hany F

    Khaled M. Alalayah, Ebrahim Mohammed Senan, Hany F. Atlam, Ibrahim Abdulrab Ahmed, and Hamzeh Salameh Ahmad Shatnawi. Effective early detection of epileptic seizures through eeg signals using classification algorithms based on t-distributed stochastic neighbor embedding and k-means. Diagnostics, 13(11):1957, June 2023

  21. [21]

    Review on epileptic seizure prediction: Machine learning and deep learning approaches.Computational and Mathematical Methods in Medicine, 2022:1–17, January 2022

    Milind Natu, Mrinal Bachute, Shilpa Gite, Ketan Kotecha, and Ankit Vid- yarthi. Review on epileptic seizure prediction: Machine learning and deep learning approaches.Computational and Mathematical Methods in Medicine, 2022:1–17, January 2022

  22. [22]

    Yacoob and L.S

    Y. Yacoob and L.S. Davis. Recognizing human facial expressions from long image sequences using optical flow.IEEE Transactions on Pattern Analysis and Machine Intelligence, 18(6):636–642, June 1996

  23. [23]

    Winterhalder, T

    M. Winterhalder, T. Maiwald, H.U. Voss, R. Aschenbrenner-Scheibe, J. Timmer, and A. Schulze-Bonhage. The seizure prediction characteristic: a general frame- work to assess and compare seizure prediction methods.Epilepsy & Behavior, 4(3):318–325, 2003

  24. [24]

    Classification of epileptic seizures in eeg data based on iterative gated graph convolution network

    Yue Hu, Jian Liu, Rencheng Sun, Yongqiang Yu, and Yi Sui. Classification of epileptic seizures in eeg data based on iterative gated graph convolution network. Frontiers in Computational Neuroscience, 18, August 2024

  25. [25]

    Topology and data.Bulletin of the American Mathematical Society, 46(2):255–308, 2009

    Gunnar Carlsson. Topology and data.Bulletin of the American Mathematical Society, 46(2):255–308, 2009

  26. [26]

    Persistence images: A stable vector representation of persistent homology.Journal of Machine Learning Research, 18(8):1–35, 2017

    Henry Adams, Tamal Emerson, Michael Kirby, Rachel Neville, Chris Peterson, Patrick Shipman, and Sophia Chepushtanova. Persistence images: A stable vector representation of persistent homology.Journal of Machine Learning Research, 18(8):1–35, 2017

  27. [27]

    Khasawneh

    Sarah Tymochko, Elizabeth Munch, and Firas A. Khasawneh. Adaptive partition- ing for template functions on persistence diagrams.2019 18th IEEE International Conference On Machine Learning And Applications (ICMLA), page 1227–1234, December 2019

  28. [28]

    Bernabei, Adam Li, Andrew Y

    John M. Bernabei, Adam Li, Andrew Y. Revell, Rachel J. Smith, Kristin M. Gunnarsdottir, Ian Z. Ong, Kathryn A. Davis, Nishant Sinha, Sridevi Sarma, and Brian Litt. Hup ieeg epilepsy dataset. 2023

  29. [29]

    Wang et al

    C. Wang et al. Artificial intelligence in electroencephalography analysis for epilepsy: A review.Frontiers in Neurology, 2025

  30. [30]

    Yuan Wang, Hernando Ombao, and Moo K. Chung. Topological data analysis of single-trial electroencephalographic signals.The Annals of Applied Statistics, 17 12(3), September 2018. 18