pith. sign in

arxiv: 2507.22867 · v2 · submitted 2025-07-30 · 📊 stat.ME

Hawkes Processes with Variable Length Memory: Existence, Inference and Application to Neuronal Activity

Pith reviewed 2026-05-19 02:41 UTC · model grok-4.3

classification 📊 stat.ME
keywords Hawkes processesvariable length memorynonlinear point processesexistence and uniquenesslikelihood inferenceneuronal spikingmultivariate point processes
0
0 comments X

The pith

Nonlinear Hawkes processes with variable length memory exist and can be inferred by likelihood maximization, generalizing models for reset-prone systems like neurons.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper defines a new class of nonlinear Hawkes processes in which each subprocess can have its memory length reset after its own most recent event. This setup lets the intensity ignore earlier history in some cases while still allowing both excitation and inhibition effects. The authors prove that well-defined versions of these processes exist and supply a practical likelihood maximization procedure that recovers parameters for both the new variable-memory case and ordinary fixed-memory Hawkes models. They test the method on synthetic data and on recordings of neuronal spiking activity. The approach is motivated by neuroscience, where a neuron's firing can effectively clear dependence on events before its last spike.

Core claim

The central claim is that a multivariate nonlinear Hawkes process with variable length memory exists under suitable conditions on the intensity functions, and that a likelihood-based estimation procedure can identify whether a given subprocess follows classical or variable-memory dynamics. In the variable-memory case the intensity of a subprocess depends only on the history after its own last event, while still satisfying the integrability requirements needed for the point process to be well-defined.

What carries the argument

The variable-length-memory intensity function, which renders a subprocess independent of all history before its most recent event while preserving the overall existence conditions of the multivariate point process.

If this is right

  • The model can represent neuronal spiking where each spike resets dependence on earlier activity.
  • The same likelihood procedure distinguishes classical Hawkes behavior from reset behavior on the same dataset.
  • Existence is guaranteed for both excitatory and inhibitory nonlinear intensities that satisfy the variable-memory independence condition.
  • The framework applies directly to any multivariate point process in which some components exhibit event-triggered memory resets.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • Similar reset mechanisms could be introduced into other point-process families such as Cox or renewal processes for applications outside neuroscience.
  • The inference method might be adapted to streaming data for online detection of memory-length changes.
  • Connections to marked point processes or self-correcting processes could be explored by treating the last-event time as an explicit mark.
  • Empirical tests on additional datasets would show in which regimes the variable-memory extension improves predictive accuracy over standard Hawkes fits.

Load-bearing premise

The intensity of each subprocess can be defined to ignore history before its last event without violating the conditions required for the existence proof to go through.

What would settle it

Generating data from the proposed variable-memory model and then checking whether the likelihood maximizer recovers the true parameters or whether the simulated process satisfies the existence criteria would confirm or refute the claims.

Figures

Figures reproduced from arXiv: 2507.22867 by Anna Bonnet, Maxime Sangnier, Sacha Quayle.

Figure 1
Figure 1. Figure 1: Test results from estimations under (GVM) for Scenarios (HP) and (VM). Colours [PITH_FULL_IMAGE:figures/full_fig_p011_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Boxplots of the relative squared error for each group of parameters ( [PITH_FULL_IMAGE:figures/full_fig_p011_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Heatmaps of true parameters µ, α, β and αr. For each scenario, we display the outcomes from Tests 1 to 3, and the relative squared errors for each group of parameters (µ, α, β and αr) after Step 5, by considering vector norms (unlike the bivariate case, errors after Steps 1 and 3 are omitted). As shown in [PITH_FULL_IMAGE:figures/full_fig_p012_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Test results from estimations under (GVM) for Scenarios (HP), (VM), (GVM) and [PITH_FULL_IMAGE:figures/full_fig_p013_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Boxplots of the relative squared error for each group of parameters ( [PITH_FULL_IMAGE:figures/full_fig_p014_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Test results from GVM estimations for 25 resampled trials. On the left: red (0) [PITH_FULL_IMAGE:figures/full_fig_p015_6.png] view at source ↗
Figure 7
Figure 7. Figure 7: Heatmaps of estimated parameters after Step 5. For [PITH_FULL_IMAGE:figures/full_fig_p015_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Transition graph of the Markov chain pZ plq qlPN for d “ 2. Denote P1 :“ Pz p1q 1 . Our goal is to show that P1pTz p1q 1 “ 8q “ 0, meaning that the chain returns to z p1q 1 with probability 1. Examining the graph of the Markov chain, for Tz p1q 1 to be infinite, one of two scenarios must occur: either only event times from N 1 are observed indefinitely, or at some point, an event time from N 2 occurs, afte… view at source ↗
Figure 9
Figure 9. Figure 9: Normalised cumulative spike counts Ni ptq{t for the 10 original trials, shown on the normalised time interval r0, 10s, obtained after Step 1 [PITH_FULL_IMAGE:figures/full_fig_p024_9.png] view at source ↗
Figure 10
Figure 10. Figure 10: Normalised cumulative spike counts Ni ptq{t for the 7 trials retained after Step 3 [PITH_FULL_IMAGE:figures/full_fig_p024_10.png] view at source ↗
Figure 11
Figure 11. Figure 11: Normalised cumulative spike counts Ni ptq{t for the 25 trials obtained after the resampling procedure in Step 6. 24 [PITH_FULL_IMAGE:figures/full_fig_p024_11.png] view at source ↗
read the original abstract

Multivariate Hawkes processes are past-dependant point processes originally introduced to model excitation effects, later extended to a nonlinear framework to account for the opposite effect, known as inhibition. Motivated by applications in neuroscience, where the memory of a neuron may reset upon firing, we introduce a new class of nonlinear Hawkes processes with variable length memory. Our model generalises classical Hawkes processes, with or without inhibition, describing the situation where the probability of an event occurring within a given subprocess may depend differently on the history before and after its last event. In particular, if the subprocess does not depend on the history before its last event, it is said to have a variable length memory. Our main contributions are to prove existence of such processes, and to derive a workable likelihood maximisation method, capable of identifying both classical and variable memory dynamics. We demonstrate the effectiveness of our approach both on synthetic data, and on a neuronal activity dataset.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The manuscript introduces a new class of nonlinear Hawkes processes with variable length memory, generalizing classical models to allow the intensity of a subprocess to depend differently on history before and after its last event. The main contributions are a proof of existence for such processes and the derivation of a likelihood maximization method for inference that can identify both classical and variable memory dynamics. The approach is illustrated on synthetic data and a neuronal activity dataset.

Significance. Should the existence result hold under the stated conditions and the inference method prove consistent, this work offers a meaningful extension to Hawkes process modeling, particularly suited for neuroscience applications where neuronal memory may reset upon firing. The capacity to distinguish memory types through likelihood maximization adds practical value for analyzing spiking data.

major comments (2)
  1. [Existence section (likely §3)] The proof of existence needs to address whether the variable-length memory truncation preserves the necessary Lipschitz continuity or boundedness conditions for the intensity function. Standard arguments for nonlinear Hawkes processes rely on these to ensure no finite-time explosion; the paper should explicitly verify that the reset mechanism does not violate them, perhaps by providing a uniform bound independent of the number of events.
  2. [Inference section (likely §4)] The derivation of the likelihood function for the variable memory model should include a discussion of identifiability and consistency of the maximizer, especially when the model can reduce to classical Hawkes under certain parameter choices.
minor comments (2)
  1. [Abstract] Change 'past-dependant' to 'past-dependent'.
  2. [Throughout] Ensure consistent notation for the intensity function and history measures.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments, which help strengthen the presentation of our results on variable-length-memory Hawkes processes. We address each major comment below.

read point-by-point responses
  1. Referee: [Existence section (likely §3)] The proof of existence needs to address whether the variable-length memory truncation preserves the necessary Lipschitz continuity or boundedness conditions for the intensity function. Standard arguments for nonlinear Hawkes processes rely on these to ensure no finite-time explosion; the paper should explicitly verify that the reset mechanism does not violate them, perhaps by providing a uniform bound independent of the number of events.

    Authors: We agree that the existence argument should explicitly confirm preservation of the required conditions. In the revised manuscript we will add a short lemma showing that, under the same Lipschitz and boundedness assumptions used for the classical nonlinear Hawkes process, the variable-length truncation yields an intensity that remains Lipschitz continuous with a constant independent of history length and admits a uniform bound independent of the number of events, thereby ruling out finite-time explosion by the standard arguments. revision: yes

  2. Referee: [Inference section (likely §4)] The derivation of the likelihood function for the variable memory model should include a discussion of identifiability and consistency of the maximizer, especially when the model can reduce to classical Hawkes under certain parameter choices.

    Authors: We thank the referee for this suggestion. The revised manuscript will expand the inference section with a paragraph on identifiability, explicitly noting that the variable-memory model nests the classical Hawkes process (recovered when the pre-reset kernel coefficients are set to zero) and that the parameter space remains identifiable under mild separation conditions. We will also sketch consistency of the maximum-likelihood estimator by appealing to standard regularity conditions for point-process likelihoods, with a reference to existing results in the literature. revision: yes

Circularity Check

0 steps flagged

Derivation chain is self-contained; no circular reductions identified

full rationale

The paper defines a new class of nonlinear Hawkes processes with variable-length memory, proves existence under stated conditions, and derives a likelihood maximization procedure for inference. These steps rely on standard point-process theory (compensators, intensity functions) and do not reduce any claimed existence result or estimator to a fitted quantity, self-citation chain, or input by construction. The model specification (truncation at last event) is logically prior to both the existence argument and the subsequent parameter estimation; no equation or theorem is shown to be equivalent to its own inputs. The paper is therefore self-contained against external benchmarks for the purpose of this circularity check.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Based on abstract only; the paper invokes standard point-process existence conditions and introduces the variable-memory cutoff as a modeling primitive without detailing free parameters or invented entities.

axioms (1)
  • domain assumption The intensity function satisfies conditions that guarantee existence of the point process even when memory is truncated after the last event.
    Existence proof is listed as a main contribution, implying this background regularity is assumed or verified.

pith-pipeline@v0.9.0 · 5692 in / 1150 out tokens · 32064 ms · 2026-05-19T02:41:42.839711+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

41 extracted references · 41 canonical work pages

  1. [1]

    write newline

    " write newline "" before.all 'output.state := FUNCTION fin.entry add.period write newline FUNCTION new.block output.state before.all = 'skip after.block 'output.state := if FUNCTION new.sentence output.state after.block = 'skip output.state before.all = 'skip after.sentence 'output.state := if if FUNCTION not #0 #1 if FUNCTION and 'skip pop #0 if FUNCTIO...

  2. [2]

    Asmussen, S. (2003). Applied Probability and Queues , volume 51 of Stochastic Modelling and Applied Probability . New York: Springer

  3. [3]

    Bacry, E., Bompaire, M., Gaïffas, S., & Muzy, J.-F. (2020). Sparse and low-rank multivariate Hawkes processes. Journal of Machine Learning Research , 21(50), 1--32

  4. [4]

    & Hochberg, Y

    Benjamini, Y. & Hochberg, Y. (1995). Controlling the False Discovery Rate : A Practical and Powerful Approach to Multiple Testing . Journal of the Royal Statistical Society: Series B (Methodological) , 57(1), 289--300

  5. [5]

    Bonnet, A., Dion, C., Gindraud, F., & Lemler, S. (2022). Neuronal Network Inference and Membrane Potential Model using Multivariate Hawkes Processes . Journal of Neuroscience Methods , 372, 109550

  6. [6]

    M., & Sangnier, M

    Bonnet, A., Herrera, M. M., & Sangnier, M. (2023). Inference of multivariate exponential Hawkes processes with inhibition and application to neuronal activity. Statistics and Computing , 33(4), 91

  7. [7]

    & Sangnier, M

    Bonnet, A. & Sangnier, M. (2025). Nonparametric estimation of Hawkes processes with RKHSs . In Proceedings of The 28th International Conference on Artificial Intelligence and Statistics (pp.\ 3574--3582).: PMLR

  8. [8]

    & Massoulié, L

    Brémaud, P. & Massoulié, L. (1996). Stability of nonlinear Hawkes processes. The Annals of Probability , 24(3), 1563--1588

  9. [9]

    Costa, M., Graham, C., Marsalle, L., & Tran, V. C. (2020). Renewal in Hawkes processes with self-excitation and inhibition. Advances in Applied Probability , 52(3), 879--915

  10. [10]

    Daley, D. J. & Vere-Jones (2003). An Introduction to the Theory of Point Processes , volume 1 of Probability and its Applications . New York: Springer-Verlag, 2 edition

  11. [11]

    Daley, D. J. & Vere-Jones, D. (2008). An introduction to the Theory of Point Processes , volume 2 of Probability and its Applications . Springer-Verlag, 2 edition

  12. [12]

    & Ross, G

    Deutsch, I. & Ross, G. J. (2025). Estimating Product Cannibalisation in Wholesale using Multivariate Hawkes Processes with Inhibition . Annals of Applied Statistics , 19(1), 235--260

  13. [13]

    Duval, C., Luçon, E., & Pouzat, C. (2022). Interacting Hawkes processes with multiplicative inhibition. Stochastic Processes and their Applications , 148, 180--226

  14. [14]

    & Löcherbach, E

    Galves, A. & Löcherbach, E. (2013). Infinite Systems of Interacting Chains with Memory of Variable Length — A Stochastic Model for Biological Neural Nets . Journal of Statistical Physics , 151(5), 896--921

  15. [15]

    & Löcherbach, E

    Galves, A. & Löcherbach, E. (2016). Modeling networks of spiking neurons as interacting processes with memory of variable length. Journal de la Société Française de Statistique , 157(1), 17--32

  16. [16]

    Galves, A., Löcherbach, E., & Pouzat, C. (2024). Probabilistic Spiking Neuronal Nets : Neuromathematics for the Computer Era . Lecture Notes on Mathematical Modelling in the Life Sciences . Cham: Springer International Publishing

  17. [17]

    Hawkes, A. G. (1971). Spectra of some self-exciting and mutually exciting point processes. Biometrika , 58(1), 83--90

  18. [18]

    & Löcherbach, E

    Hodara, P. & Löcherbach, E. (2017). Hawkes processes with variable length memory and an infinite number of components. Advances in Applied Probability , 49, 84--107

  19. [19]

    & Jain, S

    Joseph, S. & Jain, S. (2024a). A neural network based model for multi-dimensional non-linear Hawkes processes. Journal of Computational and Applied Mathematics , 447, 115889

  20. [20]

    & Jain, S

    Joseph, S. & Jain, S. (2024b). Non- Parametric Estimation of Multi -dimensional Marked Hawkes Processes

  21. [21]

    C., Tuleau-Malot, C., Bessaih, T., Rivoirard, V., Bouret, Y., Leresche, N., & Reynaud-Bouret, P

    Lambert, R. C., Tuleau-Malot, C., Bessaih, T., Rivoirard, V., Bouret, Y., Leresche, N., & Reynaud-Bouret, P. (2018). Reconstructing the functional connectivity of multiple spike trains using Hawkes models. Journal of Neuroscience Methods , 297, 9--21

  22. [22]

    & Vayatis, N

    Lemonnier, R. & Vayatis, N. (2014). Nonparametric Markovian Learning of Triggering Kernels for Mutually Exciting and Mutually Inhibiting Multivariate Hawkes Processes . In T. Calders, F. Esposito, E. Hüllermeier, & R. Meo (Eds.), Machine Learning and Knowledge Discovery in Databases (pp.\ 161--176). Berlin, Heidelberg: Springer

  23. [23]

    Lotz, A. (2024). A sparsity test for multivariate Hawkes processes

  24. [24]

    & Eisner, J

    Mei, H. & Eisner, J. (2017). The neural hawkes process: a neurally self-modulating multivariate point process. In Proceedings of the 31st International Conference on Neural Information Processing Systems , NIPS '17 (pp.\ 6757--6767). Red Hook, NY, USA: Curran Associates Inc

  25. [25]

    Meng, Z., Wan, K., Huang, Y., Li, Z., Wang, Y., & Zhou, F. (2024). Interpretable Transformer Hawkes Processes : Unveiling Complex Interactions in Social Networks . In Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining , KDD '24 (pp.\ 2200--2211). New York, NY, USA: Association for Computing Machinery

  26. [26]

    J., Swanepoel, L

    Nicvert, L., Donnet, S., Keith, M., Peel, M., Somers, M. J., Swanepoel, L. H., Venter, J., Fritz, H., & Dray, S. (2024). Using the multivariate Hawkes process to study interactions between multiple species from camera trap data. Ecology , 105(4), e4237

  27. [27]

    Ogata, Y. (1978). The asymptotic behaviour of maximum likelihood estimators for stationary point processes. Annals of the Institute of Statistical Mathematics , 30(2), 243--261

  28. [28]

    Ogata, Y. (1981). On Lewis ' simulation method for point processes. IEEE Transactions on Information Theory , 27(1), 23--31

  29. [29]

    Ogata, Y. (1988). Statistical Models for Earthquake Occurrences and Residual Analysis for Point Processes . Journal of the American Statistical Association , 83(401), 9--27

  30. [30]

    & Short, M

    Olinde, J. & Short, M. (2020). A Self -limiting Hawkes Process : Interpretation , Estimation , and Use in Crime Modeling . 2020 IEEE International Conference on Big Data , (pp.\ 3219)

  31. [31]

    M., & Zhe, S

    Pan, Z., Wang, Z., Phillips, J. M., & Zhe, S. (2021). Self-adaptable point processes with nonparametric time decays. In Proceedings of the 35th International Conference on Neural Information Processing Systems , NIPS '21 (pp.\ 4594--4606). Red Hook, NY, USA: Curran Associates Inc

  32. [32]

    Petersen, P. C. & Berg, R. W. (2016). Lognormal firing rate distribution reveals prominent fluctuation–driven regime in spinal motor networks. eLife , 5, e18805

  33. [33]

    C., Lindén, H., Vestergaard, M., & Berg, R

    Radosevic, M., Willumsen, A., Petersen, P. C., Lindén, H., Vestergaard, M., & Berg, R. W. (2019). Decoupling of timescales reveals sparse convergent CPG network in the adult spinal cord. Nature Communications , 10(1)

  34. [34]

    Reynaud-Bouret, P., Rivoirard, V., Grammont, F., & Tuleau-Malot, C. (2014). Goodness-of-fit tests and nonparametric adaptive estimation for spike train analysis. Journal of Mathematical Neuroscience , 4, 3

  35. [35]

    Rizoiu, M.-A., Lee, Y., Mishra, S., & Xie, L. (2017). A Tutorial on Hawkes Processes for Events in Social Media

  36. [36]

    & Harvey, R

    Strata, P. & Harvey, R. (1999). Dale's principle. Brain Res. Bull. , 50, 349--50

  37. [37]

    Sulem, D., Rivoirard, V., & Rousseau, J. (2023). Scalable and adaptive variational Bayes methods for Hawkes processes

  38. [38]

    Sulem, D., Rivoirard, V., & Rousseau, J. (2024). Bayesian estimation of nonlinear Hawkes processes. Bernoulli , 30(2), 1257--1286

  39. [39]

    E., Haberland, M., Reddy, T., Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., Bright, J., van der Walt, S

    Virtanen, P., Gommers, R., Oliphant, T. E., Haberland, M., Reddy, T., Cournapeau, D., Burovski, E., Peterson, P., Weckesser, W., Bright, J., van der Walt, S. J., Brett, M., Wilson, J., Millman, K. J., Mayorov, N., Nelson, A. R. J., Jones, E., Kern, R., Larson, E., Carey, C. J., Polat, I., Feng, Y., Moore, E. W., VanderPlas, J., Laxalde, D., Perktold, J., ...

  40. [40]

    Zhang, Q., Lipani, A., Lipani, A., & Yilmaz, E. (2020). Self-attentive Hawkes process. In Proceedings of the 37th International Conference on Machine Learning , volume 119 of ICML '20 (pp.\ 11183--11193).: JMLR.org

  41. [41]

    Zuo, S., Jiang, H., Li, Z., Zhao, T., & Zha, H. (2020). Transformer Hawkes process. In Proceedings of the 37th International Conference on Machine Learning , volume 119 of ICML '20 (pp.\ 11692--11702).: JMLR.org