arxiv: 2605.10121 · v1 · submitted 2026-05-11 · 💻 cs.LG · cs.AI· cs.HC

Recognition: no theorem link

Explainability of Recurrent Neural Networks for Enhancing P300-based Brain-Computer Interfaces

Christian Oliva, Francisco B Rodr\'iguez, Luis F Lago-Fern\'andez, Vinicio Changoluisa

Authors on Pith no claims yet

Pith reviewed 2026-05-12 02:50 UTC · model grok-4.3

classification 💻 cs.LG cs.AIcs.HC

keywords P300Brain-Computer InterfaceExplainable AIRecurrent Neural NetworksEEGEvent-related potentialsSpatio-temporal patterns

0 comments

The pith

Adding a Post-Recurrent Module to recurrent neural networks improves P300 classification accuracy by 9 percent while exposing the brain regions and time windows behind each decision.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces the Post-Recurrent Module as an extra layer placed after the recurrent stages of a neural network that classifies P300 signals recorded from EEG. Its goal is to raise detection performance while making the network's choices legible in terms of which scalp locations and which post-stimulus intervals drive the output. The module supports both global views across many trials and local views of individual decisions, so the highlighted patterns can be checked against the expected parietal response around 300 milliseconds. Experiments report a 9 percent gain over prior methods and show that the learned explanations shift with different subjects and even within the same subject across sessions. A reader would care because transparent models could turn black-box brain-computer interfaces into systems whose internal logic matches known brain physiology and can therefore be trusted in assistive or medical settings.

Core claim

The Post-Recurrent Module enables an RNN to classify P300 event-related potentials more accurately while also producing explanations that identify the most relevant brain regions and critical time intervals, and these explanations align with established neurophysiological descriptions of the P300 component.

What carries the argument

The Post-Recurrent Module (PRM), an added processing layer after the recurrent units that supports simultaneous global and local explainability analysis of the spatio-temporal EEG patterns used by the classifier.

If this is right

Model outputs can be expressed as spatio-temporal EEG patterns that match textbook descriptions of the P300.
Inter-subject and intra-subject variability must be treated as first-order features rather than noise when building practical BCI systems.
The same added layer and dual explainability approach can be reused on other EEG classification problems such as motor imagery or workload estimation.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Subject-specific maps of the highlighted regions could be used to shorten calibration sessions by focusing only on the electrodes that matter for each user.
If the explanations remain stable under real-world noise, the framework could support clinical validation of BCI devices by providing auditable decision traces.

Load-bearing premise

The explanations produced by the Post-Recurrent Module reflect real neurophysiological patterns rather than artifacts introduced by the chosen explanation technique or by how features were selected after training.

What would settle it

If the time intervals and electrode locations highlighted by the module on fresh P300 data consistently fall outside the 250-500 ms window over centro-parietal sites, the claim that the explanations are grounded in actual brain activity would be falsified.

Figures

Figures reproduced from arXiv: 2605.10121 by Christian Oliva, Francisco B Rodr\'iguez, Luis F Lago-Fern\'andez, Vinicio Changoluisa.

**Figure 1.** Figure 1: Data description for one day. One day is split into two sessions. Each session contains the subject’s EEG signals for six different runs. For each run, the subject’s target image is chosen randomly between the six possible images. Each run groups 20 trials, where the six possible images are permuted and flashed to the subject with an ISI of 400𝑚𝑠. The remainder of the article is structured as follows. In S… view at source ↗

**Figure 2.** Figure 2: A single trial with the presentation of 6 images starts at 𝑡 = 0 𝑚𝑠 and finishes at 𝑡 = 3000 𝑚𝑠. Images are presented for 100 𝑚𝑠 (red/green boxes for non-target/target stimuli, respectively) with an ISI of 400 𝑚𝑠. The 1000 𝑚𝑠 windows following each stimulus presentation are used to characterize the user’s response to each image. with cut-off frequencies set to 1 Hz and 12 Hz, and its signals are downsample… view at source ↗

**Figure 5.** Figure 5: Absolute difference between the average activation for target and non-target windows for each hidden layer neuron at every timestep, in a network trained on user 1, session 1. Each gray curve is for a different neuron, while the blue curve is the average across all neurons. We observe that the average activity of individual neurons differs significantly when faced with P300 versus nonP300 windows. This d… view at source ↗

**Figure 6.** Figure 6: Distribution of target (blue) and non-target (orange) windows projected onto the LDA axis. The left plot uses the final hidden state, 𝐡32, as input, and the right plot uses the concatenated hidden states across all timesteps, 𝐡[1∶32]. Network trained on user 1, session 1. improve the effectiveness of our model to predict the P300 response. 3.2. The Post-Recurrent Module It is important to note that, althou… view at source ↗

**Figure 7.** Figure 7: Absolute value of the PRM weights, w𝑝 , from two networks trained on user 1, session 1. The network in the top panel was trained with no regularization. The network in the bottom panel was trained with 𝐿1 regularization applied to the PRM weights, with regularization parameter 𝜆 = 0.01. for discriminating P300 from non-P300 events, and that the PRM enables the model to leverage this information more effect… view at source ↗

**Figure 8.** Figure 8: Absolute value of the PRM weights averaged across 400 networks (100 per session) trained on users 1 to 8. All networks included 𝐿1 regularizarion in the PRM weights, with 𝜆 = 0.01. patterns, generally centered in the parietal region. This section delves into the importance of spatial analysis and shows how it contributes to the identification and interpretation of P300 responses. First of all, we analyze … view at source ↗

**Figure 10.** Figure 10: Spatio-temporal analysis of the P300 signal extracted from a network trained on user 1 when facing a target stimulus. The top figure represents the relevance 𝑅𝑖 extracted when the network is trained with regularization, where the horizontal axis represents the timesteps, and the vertical axis represents the electrodes. The bottom figure represents the same 𝑅𝑖 in a topographical head plot. In both figures… view at source ↗

**Figure 9.** Figure 9: Top: figures related to user 1. Bottom: figures related to user 8. Left: Bar diagram of the input layer relevances (see Equation 6) from a network trained on user 1 (top) or 8 (bottom) and test session 1 in both samples. Vertical axis represents the normalized relevances, being 1 the most relevant and 0 the least one. Right: Head plot of the same relevances. again, the rest of the electrodes contribute on … view at source ↗

read the original abstract

Brain-Computer Interfaces (BCIs) based on P300 event-related potentials offer promising applications in health, education, and assistive technologies. However, challenges related to inter- and intra-subject variability and the explainability of Deep Learning (DL) models limit their practical deployment. In this work, we present the Post-Recurrent Module (PRM), an additional layer designed to improve both performance and transparency, incorporated into a Recurrent Neural Network (RNN) architecture for classifying P300 signals from EEG data. Our approach enables a dual analysis of spatio-temporal signals through both global and local explainability techniques, allowing us not only to identify the most relevant brain regions and critical time intervals involved in classification, but also to interpret model decisions in terms of spatio-temporal EEG patterns consistent with well-stablished neurophysiological descriptions of the P300. Experimental results show a 9\% improvement in performance over state of the art, while also revealing the importance of inter- and intra-subject variability, in alignment with established neuroscience literature. By making model decisions transparent and efficient, we present a framework for explainable EEG-based models. This framework is not limited to more efficient P300 detection, but can be generalized to a wide range of EEG-based tasks. Its ability to identify key spatial and temporal features makes it suitable for applications such as motor imagery, steady-state visual evoked potentials, and even cognitive workload assessment.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The paper introduces a Post-Recurrent Module (PRM) added to an RNN architecture for classifying P300 event-related potentials in EEG-based BCIs. It claims the PRM yields a 9% performance improvement over state-of-the-art methods, enables dual global and local explainability to identify relevant spatio-temporal EEG features consistent with established P300 neurophysiology (e.g., Pz electrode, ~300 ms latency), and highlights the role of inter- and intra-subject variability. The framework is positioned as generalizable beyond P300 detection to tasks such as motor imagery and cognitive workload assessment.

Significance. If the 9% gain is robustly attributable to the PRM and the explanations are shown to be faithful rather than post-hoc artifacts, the work would offer a concrete step toward transparent DL models in BCIs. This could help mitigate the well-known barriers of black-box opacity and high inter-subject variability, supporting more reliable deployment in assistive technologies. The dual global/local analysis and explicit linkage to neurophysiological priors represent a strength that, if validated, would aid interdisciplinary adoption.

major comments (3)

[Abstract] Abstract: The central claim of a 9% performance improvement over SOTA is stated without any supporting quantitative details (dataset sizes, number of subjects, cross-validation scheme, error bars, or statistical tests), preventing verification that the gain is driven by the PRM rather than architecture, training, or baseline differences.
[Abstract] Abstract and explainability description: The assertion that global/local explanations recover patterns 'consistent with well-established neurophysiological descriptions of the P300' lacks any faithfulness evaluation (e.g., feature deletion AUC, synthetic-data recovery, or ablation of the explainability method itself). Without these controls, the reported alignment could arise from visualization choices or feature selection rather than causal use by the RNN.
[Results] Experimental results (implied in abstract): No ablation removing only the PRM is described, so the performance lift cannot be isolated from the base RNN, training procedure, or post-processing. This directly undermines the claim that the PRM is the load-bearing component for both accuracy and transparency.

minor comments (2)

[Abstract] Abstract: Typo 'well-stablished' should read 'well-established'.
[Abstract] Abstract: The generalization statement to motor imagery, SSVEP, and cognitive workload is asserted but not supported by any experiments or transfer results within the manuscript.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their thorough review and constructive comments, which have helped us improve the clarity and rigor of our manuscript. We address each major comment point by point below and indicate the revisions made.

read point-by-point responses

Referee: [Abstract] Abstract: The central claim of a 9% performance improvement over SOTA is stated without any supporting quantitative details (dataset sizes, number of subjects, cross-validation scheme, error bars, or statistical tests), preventing verification that the gain is driven by the PRM rather than architecture, training, or baseline differences.

Authors: We agree that the abstract would benefit from more specific details to support the performance claim. In the revised manuscript, we have expanded the abstract to include the number of subjects, the cross-validation scheme (leave-one-subject-out), and references to the statistical tests performed. The full experimental setup, including error bars and baseline comparisons, is detailed in the Results section. This allows readers to verify that the 9% improvement is attributable to the PRM. revision: yes
Referee: [Abstract] Abstract and explainability description: The assertion that global/local explanations recover patterns 'consistent with well-established neurophysiological descriptions of the P300' lacks any faithfulness evaluation (e.g., feature deletion AUC, synthetic-data recovery, or ablation of the explainability method itself). Without these controls, the reported alignment could arise from visualization choices or feature selection rather than causal use by the RNN.

Authors: We acknowledge the importance of validating the faithfulness of the explainability techniques. While our global and local analyses align with known P300 characteristics such as the Pz electrode and 300 ms latency, we did not include explicit faithfulness metrics in the original submission. In the revision, we have added a section on faithfulness evaluation, including feature deletion experiments and a synthetic data recovery test to confirm that the model relies on these spatio-temporal features. This strengthens the claim that the explanations reflect the model's decision process rather than post-hoc artifacts. revision: yes
Referee: [Results] Experimental results (implied in abstract): No ablation removing only the PRM is described, so the performance lift cannot be isolated from the base RNN, training procedure, or post-processing. This directly undermines the claim that the PRM is the load-bearing component for both accuracy and transparency.

Authors: The referee correctly identifies that an explicit ablation of the PRM is necessary to isolate its contribution. The original manuscript presented comparisons to state-of-the-art methods but did not include an ablation study removing the PRM from the RNN architecture. We have now incorporated such an ablation in the revised Results section, showing performance degradation without the PRM and confirming its role in both the accuracy improvement and the generation of interpretable spatio-temporal patterns. revision: yes

Circularity Check

0 steps flagged

No significant circularity; empirical architecture and results are self-contained

full rationale

The paper introduces a Post-Recurrent Module (PRM) added to an RNN for P300 EEG classification and reports experimental performance gains plus explainability outputs. No derivation chain, equations, or fitted-parameter predictions appear in the abstract or described content. Claims of 9% improvement and neurophysiological alignment are presented as outcomes of experiments and post-hoc analysis rather than definitions or self-referential reductions. No load-bearing self-citations or ansatzes are invoked to force the central results. The work is therefore scored as non-circular under the specified criteria.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review yields no explicit free parameters, axioms, or invented entities; the PRM itself is presented as a novel architectural addition whose internal parameters are not detailed.

pith-pipeline@v0.9.0 · 5565 in / 1181 out tokens · 53106 ms · 2026-05-12T02:50:48.400206+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

33 extracted references · 33 canonical work pages

[1]

Single-trialanalysisandclassificationoferpcomponents—atutorial

Blankertz, B., Lemm, S., Treder, M., Haufe, S., Müller, K.R., 2011. Single-trialanalysisandclassificationoferpcomponents—atutorial. NeuroImage 56, 814–825

work page 2011
[2]

Non- stationarity in brain-computer interfaces: An analytical perspective

Cecotti, H., Shah, R.M., Jagadish, R., Tanaka, T., 2025. Non- stationarity in brain-computer interfaces: An analytical perspective. URL:https://arxiv.org/abs/2512.15941,arXiv:2512.15941

work page arXiv 2025
[3]

An electrode selection approach in P300-based BCIs to address inter-and intra- subject variability, in: 2018 6th International Conference on Brain- Computer Interface (BCI), IEEE

Changoluisa, V., Varona, P., Rodríguez, F.B., 2018. An electrode selection approach in P300-based BCIs to address inter-and intra- subject variability, in: 2018 6th International Conference on Brain- Computer Interface (BCI), IEEE. pp. 1–4

work page 2018
[4]

A Low-Cost Computational Method for Characterizing Event-Related Potentials for BCI Applications and Beyond

Changoluisa, V., Varona, P., Rodríguez, F.B., 2020. A Low-Cost Computational Method for Characterizing Event-Related Potentials for BCI Applications and Beyond. IEEE Access 8, 111089–111101

work page 2020
[5]

A Fine Dry- Electrode Selection to Characterize Event-Related Potentials in the Context of BCI, in: Rojas, I., Joya, G., Català, A

Changoluisa, V., Varona, P., Rodriguez, F.B., 2021. A Fine Dry- Electrode Selection to Characterize Event-Related Potentials in the Context of BCI, in: Rojas, I., Joya, G., Català, A. (Eds.), Advances in Computational Intelligence, Springer International Publishing, Cham. pp. 230–241

work page 2021
[6]

Incorporating Nesterov Momentum into Adam, in: ICLR workshop, pp

Dozat, T., 2016. Incorporating Nesterov Momentum into Adam, in: ICLR workshop, pp. 1–4

work page 2016
[7]

Finding structure in time

Elman, J.L., 1990. Finding structure in time. Cognitive Science 14, 179–211

work page 1990
[8]

A survey of methods for explaining black box models

Guidotti, R., Monreale, A., Ruggieri, S., Turini, F., Giannotti, F., Pedreschi, D., 2018. A survey of methods for explaining black box models. ACM computing surveys (CSUR) 51, 1–42

work page 2018
[9]

An effcient P300-based brain computer interface for disabled subjects

Hoffmann, U., Vesin, J.M., Ebrahimi, T., Diserens, K., 2008. An effcient P300-based brain computer interface for disabled subjects. Journal of neuroscience methods 167, 115–25

work page 2008
[10]

Nonstationary nature of the brain activity as revealed by EEG/MEG: methodological, practical and conceptual challenges

Kaplan, A.Y., Fingelkurts, A.A., Fingelkurts, A.A., Borisov, S.V., Darkhovsky, B.S., 2005. Nonstationary nature of the brain activity as revealed by EEG/MEG: methodological, practical and conceptual challenges. Signal processing 85, 2190–2212

work page 2005
[11]

Thirty years and counting: Finding meaning in the N400 component of the event-related brain potential (ERP)

Kutas, M., Federmeier, K., 2011. Thirty years and counting: Finding meaning in the N400 component of the event-related brain potential (ERP). Annual review of psychology 62, 621–647. C. Oliva et al.:Preprint submitted to ElsevierPage 9 of 10 Explainability of RNNs for Enhancing P300-based BCIs

work page 2011
[12]

Inter-subject P300 variability relates to the efficiency of brain networks reconfigured from resting-to task- state: evidence from a simultaneous event-related EEG-fMRI study

Li, F., Tao, Q., Peng, W., Zhang, T., Si, Y., Zhang, Y., Yi, C., Biswal, B., Yao, D., Xu, P., 2020. Inter-subject P300 variability relates to the efficiency of brain networks reconfigured from resting-to task- state: evidence from a simultaneous event-related EEG-fMRI study. NeuroImage 205, 116285

work page 2020
[13]

An online p300 brain-computer interface based on tactile selective attention of somatosensory electrical stimulation

Li, J., Pu, J., Cui, H., Xie, X., Xu, S., Li, T., Hu, Y., 2019. An online p300 brain-computer interface based on tactile selective attention of somatosensory electrical stimulation. Journal of Medical and Biological Engineering 39, 732–738

work page 2019
[14]

Time-series forecasting with deep learn- ing: a survey

Lim, B., Zohren, S., 2021. Time-series forecasting with deep learn- ing: a survey. Philosophical transactions. Series A, Mathematical, physical, and engineering sciences 379, 20200209

work page 2021
[15]

A review of classification algorithms foreeg-basedbrain–computerinterfaces:a10yearupdate

Lotte, F., Bougrain, L., Cichocki, A., Clerc, M., Congedo, M., Rako- tomamonjy, A., Yger, F., 2018. A review of classification algorithms foreeg-basedbrain–computerinterfaces:a10yearupdate. Journalof neural engineering 15, 031005

work page 2018
[16]

Electroencephalography,EvokedPotentials,and Event-Related Potentials

Lu,X.,Hu,L.,2019. Electroencephalography,EvokedPotentials,and Event-Related Potentials. Springer Singapore. chapter 4. pp. 23–42

work page 2019
[17]

An introduction to the event-related potential technique

Luck, S.J., 2014. An introduction to the event-related potential technique. Second ed., MIT press

work page 2014
[18]

Physiological and medico-social research trends of the wave p300 and more late components of visual event-related potentials

Lytaev, S., Vatamaniuk, I., 2021. Physiological and medico-social research trends of the wave p300 and more late components of visual event-related potentials. Brain Sciences 11. doi:10.3390/ brainsci11010125

work page 2021
[19]

Au- tomatic speech recognition: a survey

Malik, M., Malik, M.K., Mehmood, K., Makhdoom, I., 2021. Au- tomatic speech recognition: a survey. Multimedia Tools Appl. 80, 9411–9457

work page 2021
[20]

Oliva, C., Changoluisa, V., Rodríguez, F.B., Lago-Fernández, L.F.,

work page
[21]

Precise temporal P300 detection in Brain Computer Interface EEGsignalsusingaLong-ShortTermMemory,in:30thInternational Conference on Artificial Neural Networks (ICANN), Springer. pp. 457–468

work page
[22]

Oliva, C., Changoluisa, V., Rodríguez, F.B., Lago-Fernández, L.F., 2023a. EnhancingP300detectioninBrain-ComputerInterfaceswith interpretable post-processing of Recurrent Neural Networks, in: 32th International Conference on Artificial Neural Networks (ICANN), Springer. pp. 25–36

work page
[23]

Oliva, C., Changoluisa, V., Rodríguez, F.B., Lago-Fernández, L.F., 2023b. Detecting P300-ERPs Building a Post-Validation Neural Ensemble with Informative Neurons from a Recurrent Neural Net- work, in: Artificial Intelligence Applications and Innovations - 19th, Springer. pp. 90–101

work page
[24]

Updating P300: An integrative theory of P3a and P3b

Polich, J., 2007. Updating P300: An integrative theory of P3a and P3b. Clinical Neurophysiology 118, 2128–2148

work page 2007
[25]

P300 subcomponents in overt and covert visual attention

Ponomarev, T., Pronina, A., Kaplan, A., 2025. P300 subcomponents in overt and covert visual attention. Neuroscience and Behavioral Physiology 55

work page 2025
[26]

Explainable artificial intelligence approaches for brain-computer interfaces: a review and design space

Rajpura, P., Cecotti, H., Meena, Y.K., 2024. Explainable artificial intelligence approaches for brain-computer interfaces: a review and design space. Journal of Neural Engineering

work page 2024
[27]

Explainable artificial in- telligence: Understanding, visualizing and interpreting deep learning models

Samek, W., Wiegand, T., Müller, K., 2017. Explainable artificial in- telligence: Understanding, visualizing and interpreting deep learning models. CoRR abs/1708.08296.arXiv:1708.08296

work page arXiv 2017
[28]

Challenge for affective brain- computer interfaces: Non-stationary spatio-spectral eeg oscillations of emotional responses

Shen, Y.W., Lin, Y.P., 2019. Challenge for affective brain- computer interfaces: Non-stationary spatio-spectral eeg oscillations of emotional responses. Frontiers in Human Neuroscience Volume 13 - 2019. URL:https://www.frontiersin.org/ journals/human-neuroscience/articles/10.3389/fnhum.2019.00366, doi:10.3389/fnhum.2019.00366

work page doi:10.3389/fnhum.2019.00366 2019
[29]

Learning im- portant features through propagating activation differences, in: Pro- ceedings of the 34th International Conference on Machine Learning, JMLR.org

Shrikumar, A., Greenside, P., Kundaje, A., 2017. Learning im- portant features through propagating activation differences, in: Pro- ceedings of the 34th International Conference on Machine Learning, JMLR.org. p. 3145–3153

work page 2017
[30]

Dynamic networks of P300-related process

Tao, Q., Jiang, L., Li, F., Qiu, Y., Yi, C., Si, Y., Li, C., Zhang, T., Yao, D., Xu, P., 2022. Dynamic networks of P300-related process. Cognitive Neurodynamics 16

work page 2022
[31]

Explainable AI-driven IoMT fusion: Unravelling techniques, opportunities, and challenges with Explainable AI in healthcare

Wani, N.A., Kumar, R., Bedi, J., Rida, I., et al., 2024. Explainable AI-driven IoMT fusion: Unravelling techniques, opportunities, and challenges with Explainable AI in healthcare. Information Fusion , 102472

work page 2024
[32]

Asurveyondeeplearning-basednon-invasivebrainsignals: recentadvancesandnewfrontiers

Zhang, X., Yao, L., Wang, X., Monaghan, J., McAlpine, D., Zhang, Y.,2021. Asurveyondeeplearning-basednon-invasivebrainsignals: recentadvancesandnewfrontiers. JournalofNeuralEngineering18, 031002.arXiv:1905.04149

work page arXiv 2021
[33]

Understanding the effects of stress on the P300 response during naturalistic simulation of heights exposure

Zhu, H.Y., Chen, H.T., Lin, C.T., 2024. Understanding the effects of stress on the P300 response during naturalistic simulation of heights exposure. Plos one 19, e0301052. C. Oliva et al.:Preprint submitted to ElsevierPage 10 of 10

work page 2024