pith. sign in

arxiv: 2503.15821 · v4 · submitted 2025-03-20 · 📊 stat.AP · stat.OT

Temporal Point Process Modeling of Aggressive Behavior Onset in Psychiatric Inpatient Youths with Autism

Pith reviewed 2026-05-22 23:54 UTC · model grok-4.3

classification 📊 stat.AP stat.OT
keywords temporal point processesHawkes processaggressive behaviorautismpsychiatric inpatientself-exciting processesbehavior predictionpoint process modeling
0
0 comments X

The pith

Self-exciting temporal point processes capture the clustered timing of aggressive behavior onsets in autistic youth more accurately than Poisson models.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper applies temporal point processes to model the timing of aggressive behavior onsets in psychiatric inpatient youths with autism. Short-window machine learning predictors exist but do not estimate longer-term probabilities or explain event clustering. Self-exciting Hawkes processes are benchmarked against other TPPs and Poisson models using goodness-of-fit and predictive metrics on patient data. The self-exciting models show superior capture of irregular, clustered patterns. This yields probabilistic forecasts over a time continuum that could support clinical decisions.

Core claim

Self-exciting TPPs more accurately capture the irregular and clustered nature of aggression onsets compared to traditional Poisson models, enabling interpretable probabilistic forecasts of aggression onset along a time continuum.

What carries the argument

Self-exciting Hawkes processes, which increase future event intensity based on past events through an excitation kernel.

If this is right

  • TPPs can estimate expected numbers of aggression onsets over longer periods than short-window predictors.
  • The models supply interpretable insights into the dynamics that drive onset clustering.
  • Improved accuracy over Poisson models supports use in clinical decision-making.
  • Probabilistic forecasts along a time line enable preemptive intervention planning.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same self-exciting framework could be tested on other episodic behaviors such as self-injury or elopement.
  • Real-time biosensor streams could be used to update the intensity function dynamically.
  • Fitting separate parameters per individual patient would test whether excitation strength varies across cases.

Load-bearing premise

The aggression onset times recorded in the psychiatric inpatient sample are sufficiently complete and accurately timestamped to allow reliable estimation of self-excitation parameters.

What would settle it

A direct model comparison on the same dataset in which a standard Poisson process achieves equal or superior goodness-of-fit and predictive scores to the self-exciting Hawkes process.

Figures

Figures reproduced from arXiv: 2503.15821 by Ahmet Demirkaya, Ashutosh Singh, Deniz Erdogmus, Georgios Stratis, Matthew S. Goodwin, Michael Everett, Michael Potter, Tales Imbiriba, Yuna Watanabe.

Figure 1
Figure 1. Figure 1: Three stages of data preprocessing. (a) Aggregating SIB,ATO,ED behavior labels into a condition label. (b) Preproccessing only the aggressive behavior onset timestamps. (c) Normalizing the aggressive behavior onset timestamps to be at the begining of the observation session. (a) (b) [PITH_FULL_IMAGE:figures/full_fig_p017_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: TPPs conditional intensity curves for two observation sessions. (a) The conditional intensity curve for each TPP model in participant 1206.01, observation session 1. (b) The conditional intensity curve for each TPP model in participant 1234.01 observation session 13. 17/34 [PITH_FULL_IMAGE:figures/full_fig_p017_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Goodness-of-fit evaluation metrics for the TPPs. Subfigure (a): The empirical data count distribution (orange) versus TPP generated count distribution (blue) for the number of aggressive behavior onsets in an observation session. Subfigure (b): the QQ plot of RTC theorem inter-arrivals for different TPP fits. The x-axis is the theoretical quantiles of an exponential distribution with mean 1, and the y-axis… view at source ↗
Figure 4
Figure 4. Figure 4: Visualization of the forecasted number of aggressive behavior onsets within a future time window, conditioned on the history of prior onsets. The blue line represents the observed history of onset counts, while the black line denotes the true future onset count. The black vertical dotted line marks the start of forecasting. The red line corresponds to the median prediction over 250 sampled forecasts, with … view at source ↗
Figure 5
Figure 5. Figure 5: Posterior PDF over the branching factor for the HawkesPPs. (a), (b), (c) are the branching factors for the HawkesExpPP, Hawkes2ExpPP and HawkesPLPP, respectively. (a) intensity (b) intensity [PITH_FULL_IMAGE:figures/full_fig_p020_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: TPPs conditional CDF curves for two observation sessions. (a) The conditional CDF curve for each TPP model in participant 1206.01, observation session 1. (b) The conditional CDF curve for each TPP model in participant 1234.01 observation session 13. 20/34 [PITH_FULL_IMAGE:figures/full_fig_p020_6.png] view at source ↗
read the original abstract

Aggressive behavior, including aggression towards others and self-injury, occurs in up to 80% of children and adolescents with autism, making it a leading cause of behavioral health referrals and a major driver of healthcare costs. Predicting when autistic youth will exhibit aggression can be challenging due to their communication difficulties. Many are minimally verbal or have poor emotional insight. Recent advances in Machine Learning and wearable biosensing demonstrate the ability to predict aggression within a limited future window (typically one to three minutes) in autistic individuals. However, existing works don't estimate aggression onset probability or the expected number of aggression onsets over longer periods, nor do they provide interpretable insights into onset dynamics. To address these limitations, we apply Temporal Point Processes (TPPs) - particularly self-exciting Hawkes processes - to model the timing of aggressive behavior onsets in psychiatric inpatient autistic youth. We benchmark several TPP models by evaluating their goodness-of-fit and predictive metrics. Our results demonstrate that self-exciting TPPs more accurately captures the irregular and clustered nature of aggression onsets, especially compared to traditional Poisson models. These incipient findings suggest that TPPs can provide interpretable, probabilistic forecasts of aggression onset along a time continuum, supporting future clinical decision-making and preemptive intervention.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 1 minor

Summary. The paper applies temporal point processes, specifically self-exciting Hawkes processes, to model the timing of aggressive behavior onsets in psychiatric inpatient youths with autism. It benchmarks these models against traditional Poisson models, claiming superior goodness-of-fit and predictive performance for capturing clustered onsets, and suggests they can provide interpretable probabilistic forecasts for clinical use.

Significance. If the empirical results hold after proper validation and reporting, this introduces a framework for continuous-time probabilistic modeling of irregular behavioral events, addressing gaps in short-window ML predictions and potentially supporting longer-term clinical forecasting. The application of TPPs to this domain is novel, but the absence of any quantitative details in the abstract prevents assessment of practical significance.

major comments (3)
  1. [Abstract] Abstract: The central claim that self-exciting TPPs 'more accurately captures the irregular and clustered nature of aggression onsets, especially compared to traditional Poisson models' is asserted without any reported numerical values for goodness-of-fit or predictive metrics, sample size, cross-validation procedure, or censoring handling. This directly undermines evaluation of the empirical benchmark that supports the paper's main contribution.
  2. [Abstract] Abstract / Data description: No information is given on timestamp granularity, logging delays by staff, inter-rater reliability, or validation against continuous observation (e.g., video). In inpatient settings, such measurement error or under-recording can induce apparent clustering that a Hawkes model attributes to self-excitation, threatening the validity of the claimed superiority over Poisson baselines.
  3. [Abstract] Abstract: The assertion that results demonstrate superior performance does not address whether the improvement survives multiple-testing correction or alternative baselines, leaving the strength of the cross-model comparison unverified.
minor comments (1)
  1. [Abstract] Abstract: Grammatical agreement error in 'self-exciting TPPs more accurately captures' (should be 'capture').

Simulated Author's Rebuttal

3 responses · 1 unresolved

We thank the referee for their constructive comments, which highlight important issues with the abstract and potential data limitations. We agree that the abstract requires quantitative support and will revise it accordingly. We address each major comment below and indicate planned changes to the manuscript.

read point-by-point responses
  1. Referee: [Abstract] Abstract: The central claim that self-exciting TPPs 'more accurately captures the irregular and clustered nature of aggression onsets, especially compared to traditional Poisson models' is asserted without any reported numerical values for goodness-of-fit or predictive metrics, sample size, cross-validation procedure, or censoring handling. This directly undermines evaluation of the empirical benchmark that supports the paper's main contribution.

    Authors: We agree the abstract lacks supporting numbers. The full manuscript (Results and Methods sections) reports log-likelihood values, AIC comparisons, predictive metrics on held-out intervals, sample size (patients and events), 5-fold cross-validation, and right-censoring at discharge. We will revise the abstract to include these specifics (e.g., sample size, key metric improvements) while keeping it concise. revision: yes

  2. Referee: [Abstract] Abstract / Data description: No information is given on timestamp granularity, logging delays by staff, inter-rater reliability, or validation against continuous observation (e.g., video). In inpatient settings, such measurement error or under-recording can induce apparent clustering that a Hawkes model attributes to self-excitation, threatening the validity of the claimed superiority over Poisson baselines.

    Authors: This is a valid concern about measurement error potentially mimicking self-excitation. The manuscript states events are staff-logged at minute-level timestamps in the EHR; we will expand the Data section to detail this granularity and add a Limitations paragraph acknowledging possible logging delays and lack of inter-rater or video validation data. We cannot quantify delays retrospectively, so this remains a caveat on interpretation rather than a full rebuttal. revision: partial

  3. Referee: [Abstract] Abstract: The assertion that results demonstrate superior performance does not address whether the improvement survives multiple-testing correction or alternative baselines, leaving the strength of the cross-model comparison unverified.

    Authors: The model comparisons involve a small, pre-specified set (Poisson vs. Hawkes variants). We will add a sentence noting that no multiple-testing correction was applied due to the limited number of primary contrasts and will report bootstrap confidence intervals for metric differences. Alternative baselines (e.g., renewal processes) are outside the paper's scope but could be noted as future work. revision: yes

standing simulated objections not resolved
  • Quantitative assessment of logging delays and inter-rater reliability is not available in the retrospective clinical dataset and cannot be added.

Circularity Check

0 steps flagged

No circularity detected; empirical benchmark is self-contained

full rationale

The paper applies standard temporal point process models (including Hawkes self-exciting processes) to timestamped aggression onsets and reports empirical goodness-of-fit and predictive metrics against Poisson baselines. No equations, parameter definitions, or derivations are supplied that reduce the claimed superior fit to a tautology by construction, self-citation chain, or fitted-input renaming. The central result rests on observable data patterns rather than any self-referential step, making the derivation chain independent and non-circular.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review supplies no explicit free parameters, axioms, or invented entities. The modeling choice itself (Hawkes self-excitation) is treated as a standard tool imported from the TPP literature.

pith-pipeline@v0.9.0 · 5782 in / 1068 out tokens · 21899 ms · 2026-05-22T23:54:49.743573+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

54 extracted references · 54 canonical work pages · 3 internal anchors

  1. [1]

    Maenner, M. J. Prevalence and characteristics of autism spectrum disorder among children aged 8 years—autism and developmental disabilities monitoring network, 11 sites, united states, 2020. MMWR. Surveillance Summ. 72 (2023)

  2. [2]

    A., Matson, J

    Hattier, M. A., Matson, J. L., Belva, B. C. & Horovitz, M. The occurrence of challenging behaviours in children with autism spectrum disorders and atypical development. Dev. Neurorehabilitation 14, 221–229 (2011)

  3. [3]

    Kanne, S. M. & Mazurek, M. O. Aggression in children and adolescents with asd: Prevalence and risk factors. J. autism developmental disorders 41, 926–937 (2011)

  4. [4]

    Matson, J. L. & Cervantes, P. E. Assessing aggression in persons with autism spectrum disorders: An overview. Res. developmental disabilities 35, 3269–3275 (2014)

  5. [5]

    Arnold, L. E. et al. Parent-defined target symptoms respond to risperidone in rupp autism study: customer approach to clinical trials. J. Am. Acad. Child & Adolesc. Psychiatry 42, 1443–1450 (2003)

  6. [6]

    A., Najjar, D

    Croen, L. A., Najjar, D. V ., Ray, G. T., Lotspeich, L. & Bernal, P. A comparison of health care utilization and costs of children with and without autism spectrum disorders in a large group-model health plan. Pediatrics 118, e1203–e1211 (2006)

  7. [7]

    Mazefsky, C. A. & White, S. W. Emotion regulation: Concepts & practice in autism spectrum disorder. Child adolescent psychiatric clinics North Am. 23, 10–1016 (2013)

  8. [8]

    Tager-Flusberg, H. et al. Conducting research with minimally verbal participants with autism spectrum disorder. Autism 21, 852–861 (2017)

  9. [9]

    Davis, N. O. & Carter, A. S. Parenting stress in mothers and fathers of toddlers with autism spectrum disorders: Associations with child characteristics. J. autism developmental disorders 38, 1278–1291 (2008)

  10. [10]

    & Zwaigenbaum, L

    Hodgetts, S., Nicholas, D. & Zwaigenbaum, L. Home sweet home? families’ experiences with aggression in children with autism spectrum disorders. Focus. on autism other developmental disabilities 28, 166–174 (2013)

  11. [11]

    & Pankhurst, H

    Kiely, J. & Pankhurst, H. Violence faced by staff in a learning disability service. Disabil. Rehabil. 20, 81–89 (1998)

  12. [12]

    Statistical models for earthquake occurrences and residual analysis for point processes.J

    Ogata, Y . Statistical models for earthquake occurrences and residual analysis for point processes.J. Am. Stat. association 83, 9–27 (1988)

  13. [13]

    & Sornette, D

    Filimonov, V . & Sornette, D. Apparent criticality and calibration issues in the hawkes self-excited point process model: application to high-frequency financial data. Quant. Finance 15, 1293–1314 (2015)

  14. [14]

    C., Bailey, T

    Gatrell, A. C., Bailey, T. C., Diggle, P. J. & Rowlingson, B. S. Spatial point pattern analysis and its application in geographical epidemiology. Transactions Inst. Br. geographers 256–274 (1996)

  15. [15]

    Recent advance in temporal point process: from machine learning perspective

    Yan, J. Recent advance in temporal point process: from machine learning perspective. SJTU Tech. Rep. (2019)

  16. [16]

    Shchur, A

    Shchur, O., Türkmen, A. C., Januschowski, T. & Günnemann, S. Neural temporal point processes: A review.arXiv preprint arXiv:2104.03528 (2021). 14/34

  17. [17]

    M., Hussein, Q

    Fadhel, Z. M., Hussein, Q. M. & Askari, A. O. A. R. A. Detecting aggressive behavior in children with autism spectrum disorder using machine learning. In AIP Conference Proceedings, vol. 2845 (AIP Publishing, 2023)

  18. [18]

    Khullar, V ., Singh, H. P. & Bala, M. Meltdown/tantrum detection system for individuals with autism spectrum disorder. Appl. Artif. Intell. 35, 1708–1732 (2021)

  19. [19]

    & Goodwin, M

    Imbiriba, T., Demirkaya, A., Singh, A., Erdogmus, D. & Goodwin, M. S. Wearable biosensing to predict imminent aggressive behavior in psychiatric inpatient youths with autism. JAMA network open 6, e2348898–e2348898 (2023)

  20. [20]

    Goodwin, M. S. et al. Predicting imminent aggression onset in minimally-verbal youth with autism spectrum disorder using preceding physiological signals. In Proceedings of the 12th EAI International Conference on Pervasive Computing Technologies for Healthcare, PervasiveHealth ’18, 201–207, DOI: 10.1145/3240925.3240980 (Association for Computing Machinery...

  21. [21]

    Imbiriba, T. et al. Biosensor prediction of aggression in youth with autism using kernel-based methods. In Proceedings of the 13th ACM international conference on PErvasive technologies related to assistive environments, 1–6 (2020)

  22. [22]

    Xiao, S. et al. Wasserstein learning of deep generative point process models. Adv. neural information processing systems 30 (2017)

  23. [23]

    Chaspari, T. et al. A non-homogeneous poisson process model of skin conductance responses integrated with observed regulatory behaviors for autism intervention. In 2014 IEEE international conference on acoustics, speech and signal processing (ICASSP), 1611–1615 (IEEE, 2014)

  24. [24]

    O., Tung, F

    Bae, W., Ahmed, M. O., Tung, F. & Oliveira, G. L. Meta temporal point processes. arXiv preprint arXiv:2301.12023 (2023)

  25. [25]

    Shachter, R. D. & Peot, M. A. Decision making using probabilistic inference methods. In Uncertainty in Artificial Intelligence, 276–283 (Elsevier, 1992)

  26. [26]

    Lindsay, J. J. & Anderson, C. A. From antecedent conditions to violent actions: A general affective aggression model. Pers. Soc. Psychol. Bull. 26, 533–547 (2000)

  27. [27]

    Physiological arousal and its dysregulation in child maladjustment

    Scarpa, A. Physiological arousal and its dysregulation in child maladjustment. Curr. Dir. Psychol. Sci. 24, 345–351 (2015)

  28. [28]

    L., Yoo, J

    Cohen, I. L., Yoo, J. H., Goodwin, M. S. & Moskowitz, L. Assessing challenging behaviors in autism spectrum disorders: Prevalence, rating scales, and autonomic indicators. Int. handbook autism pervasive developmental disorders 247–270 (2011)

  29. [29]

    Rasmussen, J. G. Lecture notes: Temporal point processes and the conditional intensity function. arXiv preprint arXiv:1806.00221 (2018)

  30. [30]

    & Nakashima, N

    Nohara, Y ., Matsumoto, K., Soejima, H. & Nakashima, N. Explanation of machine learning models using shapley additive explanation and application for real data in hospital. Comput. Methods Programs Biomed. 214, 106584 (2022)

  31. [31]

    Variable selection with stepwise and best subset approaches

    Zhang, Z. Variable selection with stepwise and best subset approaches. Annals translational medicine 4, 136 (2016)

  32. [32]

    Gallager, R. G. Discrete stochastic processes. J. Oper. Res. Soc. 48, 103–103 (1997)

  33. [33]

    J., Lee, Y ., Pollett, P

    Laub, P. J., Lee, Y ., Pollett, P. K. & Taimre, T. Hawkes models and their applications.Annu. Rev. Stat. Its Appl. 12 (2024)

  34. [34]

    Hawkes, A. G. Point spectra of some mutually exciting point processes. J. Royal Stat. Soc. Ser. B: Stat. Methodol. 33, 438–443 (1971)

  35. [35]

    Multi-kernel property in high-frequency price dynamics under hawkes model

    Lee, K. Multi-kernel property in high-frequency price dynamics under hawkes model. Stud. Nonlinear Dyn. & Econom. 28, 605–624 (2024)

  36. [36]

    D., Gelman, A

    Hoffman, M. D., Gelman, A. et al. The no-u-turn sampler: adaptively setting path lengths in hamiltonian monte carlo. J. Mach. Learn. Res. 15, 1593–1623 (2014)

  37. [37]

    A Conceptual Introduction to Hamiltonian Monte Carlo

    Betancourt, M. A conceptual introduction to hamiltonian monte carlo. arXiv preprint arXiv:1701.02434 (2017)

  38. [38]

    Bayesian reasoning and machine learning (Cambridge University Press, 2012)

    Barber, D. Bayesian reasoning and machine learning (Cambridge University Press, 2012)

  39. [39]

    Composable Effects for Flexible and Accelerated Probabilistic Programming in NumPyro

    Phan, D., Pradhan, N. & Jankowiak, M. Composable effects for flexible and accelerated probabilistic programming in numpyro. arXiv preprint arXiv:1912.11554 (2019)

  40. [40]

    Carpenter, B. et al. Stan: A probabilistic programming language. J. statistical software 76, 1–32 (2017)

  41. [41]

    Wu, J., Smith, A. L. & Zheng, T. Diagnostics and visualization of point process models for event times on a social network. Appl. modeling techniques data analysis 1: computational data analysis methods tools 7, 129–145 (2021). 15/34

  42. [42]

    & Günnemann, S

    Lüdke, D., Biloš, M., Shchur, O., Lienen, M. & Günnemann, S. Add and thin: Diffusion for temporal point processes. Adv. Neural Inf. Process. Syst. 36, 56784–56801 (2023)

  43. [43]

    & Gabry, J

    Vehtari, A., Gelman, A. & Gabry, J. Practical bayesian model evaluation using leave-one-out cross-validation and waic. Stat. computing 27, 1413–1432 (2017)

  44. [44]

    Brown, T. C. & Nair, M. G. A simple proof of the multivariate random time change theorem for point processes. J. Appl. Probab. 25, 210–214 (1988)

  45. [45]

    K., Borgan, O., Gill, R

    Andersen, P. K., Borgan, O., Gill, R. D. & Keiding, N. Statistical models based on counting processes (Springer Science & Business Media, 2012)

  46. [46]

    Kolouri, S. et al. Optimal transport and wasserstein distance

  47. [47]

    & Gabry, J

    Vehtari, A., Simpson, D., Gelman, A., Yao, Y . & Gabry, J. Pareto smoothed importance sampling.J. Mach. Learn. Res. 25, 1–58 (2024)

  48. [48]

    J., Bercot, N

    Hardiman, S. J., Bercot, N. & Bouchaud, J.-P. Critical reflexivity in financial markets: a hawkes process analysis. The Eur. Phys. J. B 86, 1–9 (2013)

  49. [49]

    Understanding auc-roc curve

    Narkhede, S. Understanding auc-roc curve. Towards data science26, 220–227 (2018)

  50. [50]

    Karlis, D. Proposer of the vote of thanks to narayanan, kosmidis and dellaportas and contribution to the discussion of ‘flexible marked spatio-temporal point processes with applications to event sequences from association football’.J. Royal Stat. Soc. Ser. C: Appl. Stat. 72, 1127–1128 (2023)

  51. [51]

    Multivariate hawkes processes

    Liniger, T. Multivariate hawkes processes. Ph.D. thesis, ETH Zurich (2009)

  52. [52]

    Drivendata: Data science & ai competitions to build a better world

    DrivenData. Drivendata: Data science & ai competitions to build a better world. https://www.drivendata.org/ (2025). Accessed: 2025-07-11

  53. [53]

    Bradbury, J. et al. JAX: composable transformations of Python+NumPy programs (2018)

  54. [54]

    2019, Journal of Open Source Software, 4, 1143, doi: 10.21105/joss.01143

    Kumar, R., Carroll, C., Hartikainen, A. & Martin, O. Arviz a unified library for exploratory analysis of bayesian models in python. J. Open Source Softw. 4, 1143, DOI: 10.21105/joss.01143 (2019). 16/34 0 1AGG 0 1ED 0 1SIB 2019-05-08 13:00:002019-05-08 13:10:002019-05-08 13:20:002019-05-08 13:30:002019-05-08 13:40:00 Time 0 1CONDITION Trench t3 - Patient 1...