WaveletInception Networks for on-board Vibration-Based Infrastructure Health Monitoring
Pith reviewed 2026-05-21 23:52 UTC · model grok-4.3
The pith
A learnable wavelet network processes raw on-board vibration signals at varying speeds to produce high-resolution infrastructure health maps without explicit preprocessing.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The WaveletInception-BiGRU network performs early spectral feature extraction with a Learnable Wavelet Packet Transform, followed by 1D Inception-ResNet modules for multi-scale feature learning and BiGRU modules that integrate temporal dependencies while incorporating operational conditions such as measurement speed. This sequence enables direct analysis of vibration signals recorded at varying speeds without explicit preprocessing and produces accurate localized health assessments together with high-resolution profiles spatially mapped to the physical infrastructure layout.
What carries the argument
WaveletInception-BiGRU network that uses a learnable wavelet packet transform for initial spectral features, Inception-ResNet blocks for multi-scale extraction, and speed-aware BiGRU layers for temporal integration before a sequential estimation head.
If this is right
- Vibration data collected at different vehicle speeds can be used directly for health assessment without separate normalization steps.
- High-resolution health profiles can be generated that align with the physical geometry of the monitored structure.
- The same pipeline supports both regression tasks such as track stiffness estimation and classification tasks such as transition-zone detection.
- On-board monitoring becomes feasible because the network operates on raw signals and accounts for operational variables internally.
Where Pith is reading between the lines
- The approach may extend to other linear infrastructures such as pipelines or overhead lines if similar vibration patterns are available.
- Real-time deployment on inspection vehicles could reduce the need for periodic manual surveys by providing continuous spatial health maps.
- Integration with existing sensor networks on trains or maintenance cars would allow the method to operate without dedicated data-collection campaigns.
Load-bearing premise
That embedding measurement speed inside the BiGRU layers along with learnable wavelet features is enough to deliver accurate localized health estimates on new infrastructures and unseen operating conditions without extra tuning or filtering.
What would settle it
Performance drop on vibration recordings from a different infrastructure type or from speeds far outside the training distribution while keeping the same network architecture and training procedure.
Figures
read the original abstract
This paper presents a deep learning framework for analyzing on board vibration response signals in infrastructure health monitoring. The proposed WaveletInception-BiGRU network uses a Learnable Wavelet Packet Transform (LWPT) for early spectral feature extraction, followed by one-dimensional Inception-Residual Network (1D Inception-ResNet) modules for multi-scale, high-level feature learning. Bidirectional Gated Recurrent Unit (BiGRU) modules then integrate temporal dependencies and incorporate operational conditions, such as the measurement speed. This approach enables effective analysis of vibration signals recorded at varying speeds, eliminating the need for explicit signal preprocessing. The sequential estimation head further leverages bidirectional temporal information to produce an accurate, localized assessment of infrastructure health. Ultimately, the framework generates high-resolution health profiles spatially mapped to the physical layout of the infrastructure. Case studies involving track stiffness regression and transition zone classification using real-world measurements demonstrate that the proposed framework significantly outperforms state-of-the-art methods, underscoring its potential for accurate, localized, and automated on-board infrastructure health monitoring.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript presents a deep learning framework called WaveletInception-BiGRU for on-board vibration-based infrastructure health monitoring. It combines a Learnable Wavelet Packet Transform (LWPT) for early spectral feature extraction, 1D Inception-ResNet modules for multi-scale feature learning, and BiGRU modules that integrate temporal dependencies and operational conditions such as measurement speed. The approach claims to handle vibration signals at varying speeds without explicit preprocessing and to produce high-resolution, spatially mapped health profiles. Case studies on real-world track stiffness regression and transition zone classification demonstrate significant outperformance over state-of-the-art methods.
Significance. If the empirical results hold under rigorous validation, the work could meaningfully advance automated on-board monitoring by reducing preprocessing requirements and directly conditioning on operational variables. The combination of learnable wavelets with speed-aware recurrent modules represents a targeted technical contribution for vibration analysis under variable conditions. No mention is made of open code, reproducible pipelines, or parameter-free derivations, which limits the assessed impact relative to papers that provide such assets.
major comments (2)
- [§4 (Case Studies) and §3.3 (BiGRU Integration)] The central claim that LWPT features plus BiGRU conditioning on measurement speed suffice to produce accurate localized health maps from raw varying-speed signals (without preprocessing) is load-bearing but under-supported. No evidence is presented of speed-stratified cross-validation, ablation removing the speed input, or testing on infrastructure/speed ranges outside the training distribution. Physical scaling of vibration amplitude and frequency with speed (via wheel-rail dynamics) creates a risk that the model exploits spurious speed-health correlations rather than invariant features.
- [Abstract and §4] The abstract and results sections assert significant outperformance on track stiffness regression and transition classification, yet the manuscript provides no quantitative metrics, error bars, baseline implementation details, or description of data splits and train/test partitioning. This prevents verification of the performance claims against the paper's own evidence and directly affects the soundness of the empirical contribution.
minor comments (2)
- [§3.1] Notation for the LWPT is introduced without a clear equation defining the learnable parameters or the wavelet packet decomposition tree; a single equation or diagram would improve clarity.
- [Figure 2 and Figure 4] Figure captions for the network architecture and health-profile outputs should explicitly state the input dimensions, output resolution, and any spatial mapping procedure used to align predictions with infrastructure layout.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed comments on our manuscript. These have prompted us to strengthen the empirical validation and presentation of results. We address each major comment below and indicate the revisions planned for the next version.
read point-by-point responses
-
Referee: [§4 (Case Studies) and §3.3 (BiGRU Integration)] The central claim that LWPT features plus BiGRU conditioning on measurement speed suffice to produce accurate localized health maps from raw varying-speed signals (without preprocessing) is load-bearing but under-supported. No evidence is presented of speed-stratified cross-validation, ablation removing the speed input, or testing on infrastructure/speed ranges outside the training distribution. Physical scaling of vibration amplitude and frequency with speed (via wheel-rail dynamics) creates a risk that the model exploits spurious speed-health correlations rather than invariant features.
Authors: We agree that additional targeted validation is needed to substantiate the claim and mitigate concerns about spurious correlations. In the revised manuscript we will add (i) an ablation study that removes the speed input from the BiGRU while keeping all other components fixed, (ii) speed-stratified cross-validation results on the existing datasets to demonstrate consistent performance across speed bins, and (iii) an expanded discussion of the physical scaling effects together with evidence that the learned features remain informative after speed conditioning. We acknowledge that testing on entirely new infrastructure types and speed ranges outside the current collection is not feasible with the data at hand; we will therefore add a limitations paragraph highlighting this generalization gap and the value of future multi-site validation. revision: partial
-
Referee: [Abstract and §4] The abstract and results sections assert significant outperformance on track stiffness regression and transition classification, yet the manuscript provides no quantitative metrics, error bars, baseline implementation details, or description of data splits and train/test partitioning. This prevents verification of the performance claims against the paper's own evidence and directly affects the soundness of the empirical contribution.
Authors: We apologize for the insufficient detail in the abstract and the compressed presentation of experimental settings. The full results in Section 4 contain the underlying numbers, but we will revise the abstract to report the key quantitative metrics (e.g., RMSE and classification accuracy with standard deviations across folds). In the revised Section 4 we will also provide explicit descriptions of baseline re-implementations (following the original authors' protocols), the precise train/test partitioning strategy, and the cross-validation procedure used to generate error bars. These additions will make the performance claims directly verifiable from the manuscript. revision: yes
- Testing on infrastructure and speed ranges completely outside the current training distribution cannot be performed because additional real-world datasets from new sites are not available to the authors at this time.
Circularity Check
No circularity in derivation chain
full rationale
The paper presents an empirical deep learning architecture (LWPT + 1D Inception-ResNet + BiGRU with speed conditioning) evaluated on real-world track stiffness and transition classification tasks. No equations, uniqueness theorems, or self-citations are invoked to derive performance claims; results are reported from direct training and testing on measured data without reduction to fitted inputs or prior author work by construction. The framework is self-contained as a data-driven model whose validity rests on external benchmarks rather than internal redefinition.
Axiom & Free-Parameter Ledger
invented entities (1)
-
Learnable Wavelet Packet Transform (LWPT)
no independent evidence
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
The WaveletInception feature extractor utilizes a Learnable Wavelet Packet Transform (LWPT) as the stem for extracting vibration signal features, followed by 1D Inception networks... BiLSTM... integrate operational conditions such as the measurement speed.
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
C. from the Commission to the European Parliament, the Council, Eighth monitoring report on the development of the rail market un- der Article 15(4) of Directive 2012/34/EU of the European Parliament and of the Council (2023)
work page 2012
-
[2]
H. A. Fern´ andez-Bobadilla, U. Martin, Modern tendencies in vehicle- based condition monitoring of the railway track, IEEE Trans- actions on Instrumentation and Measurement 72 (2023) 1–44. doi:10.1109/TIM.2023.3243673
-
[3]
A. Sansi˜ nena, B. Rodr´ ıguez-Arana, S. Arrizabalaga, A systematic review of acceleration-based estimation of railway track quality, Vehicle System Dynamics (2025) 1–28doi:10.1080/00423114.2025.2483972
-
[4]
W. Phusakulkajorn, S. Unsiwilai, L. Chang, Z. Li, A. N´ u˜ nez, A hybrid neural model approach for health assessment of railway transition zones with multiple data sources, IEEE Transactions on Instrumentation and Measurement 74 (2025) 1–17. doi:10.1109/TIM.2025.3575986
-
[5]
W. Phusakulkajorn, A. N´ u˜ nez, H. Wang, A. Jamshidi, A. Zoeteman, B. Ripke, R. Dollevoet, B. De Schutter, Z. Li, Artificial intelligence in railway infrastructure: Current research, challenges, and future op- portunities, Intelligent Transportation Infrastructure 2 (2023) liad016. doi:10.1093/iti/liad016
-
[6]
G. Lederman, S. Chen, J. Garrett, J. Kovaˇ cevi´ c, H. Y. Noh, J. Bielak, Track-monitoring from the dynamic response of an operational train 87 (2017-03) 1–16. doi:10.1016/j.ymssp.2016.06.041
-
[7]
C. Shen, P. Zhang, R. Dollevoet, A. Zoeteman, Z. Li, Evaluating rail- way track stiffness using axle box accelerations: A digital twin ap- proach, Mechanical Systems and Signal Processing 204 (2023) 110730. doi:10.1016/j.ymssp.2023.110730
-
[8]
A. C. Lamprea-Pineda, D. P. Connolly, A. Castanheira-Pinto, P. Alves- Costa, M. F. Hussein, P. K. Woodward, On railway track receptance 177 (2024-02) 108331. doi:10.1016/j.soildyn.2023.108331. 36
-
[9]
S. Unsiwilai, L. Wang, A. N´ u˜ nez, Z. Li, Multiple-axle box accelera- tion measurements at railway transition zones 213 (2023-05) 112688. doi:10.1016/j.measurement.2023.112688
-
[10]
Y. B. Yang, J. P. Yang, State-of-the-Art Review on Modal Identification and Damage Detection of Bridges by Moving Test Vehicles 18 (2018-02) 1850025. doi:10.1142/S0219455418500256
-
[11]
P. Quirke, D. Cantero, E. J. OBrien, C. Bowe, Drive-by detection of railway track stiffness variation using in-service vehicles, Proceedings of the Institution of Mechanical Engineers, Part F: Journal of Rail and Rapid Transit 231 (2017) 498–514. doi:10.1177/0954409716634752
-
[12]
X. Q. Zhu, S. S. Law, L. Huang, Identification of Railway Ballasted Track Systems from Dynamic Responses of In-Service Trains, Journal of Aerospace Engineering 31 (2018) 04018060. doi:10.1061/(ASCE)AS.1943-5525.0000898
-
[13]
A. Caprioli, A. Cigada, D. Raveglia, Rail inspection in track maintenance: A benchmark between the wavelet approach and the more conventional Fourier analysis 21 (2007-02) 631–652. doi:10.1016/j.ymssp.2005.12.001
-
[14]
C. Shen, R. Dollevoet, Z. Li, Fast and robust identification of railway track stiffness from simple field measurement, Mechanical Systems and Signal Processing 152 (2021) 107431
work page 2021
-
[15]
C. Hoelzl, V. Dertimanis, L. Ancu, A. Kollros, E. Chatzi, Vold–Kalman filter order tracking of axle box accelerations for track stiffness assess- ment, Mechanical Systems and Signal Processing 204 (2023) 110817. doi:10.1016/j.ymssp.2023.110817
-
[16]
A. Malekjafarian, E. OBrien, P. Quirke, C. Bowe, Railway Track Mon- itoring Using Train Measurements: An Experimental Case Study 9 (2019-01) 4859. doi:10.3390/app9224859
-
[17]
X. Mao, C. Xia, J. Liu, H. Zhang, Y. Ding, Y. Yao, Z. Liu, A novel similarity measure based on dispersion-transition matrix and Jensen–Fisher divergence and its application on the detection of rail short-wave defects, Chaos, Solitons & Fractals 192 (2025) 115988. doi:10.1016/j.chaos.2025.115988. 37
-
[18]
W. Locke, J. Sybrandt, L. Redmond, I. Safro, S. Atamturktur, Using drive-by health monitoring to detect bridge damage consid- ering environmental and operational effects 468 (2020-03) 115088. doi:10.1016/j.jsv.2019.115088
-
[19]
J. Huang, X. Yin, S. Kaewunruen, Quantification of Dynamic Track Stiffness Using Machine Learning 10 (2022) 78747–78753. doi:10.1109/ACCESS.2022.3191278
-
[20]
D. Hajializadeh, Deep learning-based indirect bridge damage identifica- tion system 22 (2023-03) 897–912. doi:10.1177/14759217221087147
-
[21]
R. Corbally, A. Malekjafarian, A deep-learning framework for classify- ing the type, location, and severity of bridge damage using drive-by measurements 39 (2024) 852–871. doi:10.1111/mice.13104
-
[22]
W. Phusakulkajorn, Y. Zeng, Z. Li, A. N´ u˜ nez, Unsupervised represen- tation learning for monitoring rail infrastructures with high-frequency moving vibration sensors, IEEE Transactions on Intelligent Transporta- tion Systems (2025) 1–15doi:10.1109/TITS.2025.3557712
-
[23]
R. R. Samani, A. Nunez, B. D. Schutter, A Bidirectional Long Short Term Memory Approach for Infrastructure Health Monitor- ing Using On-board Vibration Response (2024-12). arXiv:2412.02643, doi:10.48550/arXiv.2412.02643
- [24]
-
[25]
G. Frusque, O. Fink, Robust time series denoising with learnable wavelet packet transform 62 (2024-10) 102669. doi:10.1016/j.aei.2024.102669
-
[26]
P. Liu, H. Zhang, W. Lian, W. Zuo, Multi-level wavelet convolutional neural networks 7 (2019) 74973–74985
work page 2019
-
[27]
T. Li, Z. Zhao, C. Sun, L. Cheng, X. Chen, R. Yan, R. X. Gao, WaveletKernelNet: An Interpretable Deep Neural Net- work for Industrial Intelligent Diagnosis 52 (2022-04) 2302–2312. doi:10.1109/TSMC.2020.3048950. 38
-
[28]
L. E. bouny, M. Khalil, A. Adib, An End-to-End Multi-Level Wavelet Convolutional Neural Networks for heart diseases diagnosis 417 (2020-
work page 2020
-
[29]
doi:10.1016/j.neucom.2020.07.056
187–201. doi:10.1016/j.neucom.2020.07.056
-
[30]
T. Yao, Y. Pan, Y. Li, C.-W. Ngo, T. Mei, Wave-ViT: Unifying Wavelet and Transformers for Visual Representation Learning, in: S. Avidan, G. Brostow, M. Ciss´ e, G. M. Farinella, T. Hassner (Eds.), Computer Vision – ECCV 2022, 2022, pp. 328–345. doi:10.1007/978-3-031-19806- 9 19
-
[31]
J. Wang, Z. Wang, J. Li, J. Wu, Multilevel Wavelet Decomposi- tion Network for Interpretable Time Series Analysis, in: Proceed- ings of the 24th ACM SIGKDD International Conference on Knowl- edge Discovery & Data Mining, KDD ’18, 2018-07, pp. 2437–2446. doi:10.1145/3219819.3220060
-
[32]
Mallat, A Wavelet Tour of Signal Processing, Elsevier, 1999
S. Mallat, A Wavelet Tour of Signal Processing, Elsevier, 1999. doi:10.1016/B978-0-12-374370-1.X0001-8
-
[33]
C. C. Aggarwal, et al., Neural Networks and Deep Learning, Vol. 10, Springer, 2018
work page 2018
-
[34]
Palma, Long-memory Time Series: Theory and Methods, John Wi- ley & Sons, 2007
W. Palma, Long-memory Time Series: Theory and Methods, John Wi- ley & Sons, 2007. doi:10.1002/9780470131466
-
[35]
S. Hochreiter, J. Schmidhuber, Long Short-Term Memory 9 (1997-11) 1735–1780. doi:10.1162/neco.1997.9.8.1735
- [36]
-
[37]
R. Zhang, F. Nie, X. Li, X. Wei, Feature selection with multi-view data: A survey 50 (2019) 158–167. doi:10.1016/j.inffus.2018.11.019
-
[38]
H. V. Dang, H. Tran-Ngoc, T. V. Nguyen, T. Bui-Tien, G. De Roeck, H. X. Nguyen, Data-Driven Structural Health Monitoring Using Fea- ture Fusion and Hybrid Deep Learning 18 (2021-10) 2087–2103. doi:10.1109/TASE.2020.3034401. 39
-
[39]
L. Mou, C. Zhou, P. Zhao, B. Nakisa, M. N. Rastgoo, R. Jain, W. Gao, Driver stress detection via multimodal fusion using attention-based CNN-LSTM 173 (2021-07) 114693. doi:10.1016/j.eswa.2021.114693
-
[40]
S. Unsiwilai, W. Phusakulkajorn, C. Shen, A. Zoeteman, R. Dollevoet, A. N´ u˜ nez, Z. Li, Enhanced vertical railway track quality index with dynamic responses from moving trains 10 (2024-10) e38670. doi:10.1016/j.heliyon.2024.e38670
-
[41]
C. Shi, Y. Zhou, L. Xu, X. Zhang, Y. Guo, A critical review on the vertical stiffness irregularity of railway ballasted track 400 (2023-10) 132715. doi:10.1016/j.conbuildmat.2023.132715
-
[42]
R. R. Samani, Data underlying the publication: Waveletinception networks for drive-by vibration-based infrastructure health monitor- ing, 4tu.researchdata (2025). doi:10.4121/8d6245b5-2d81-436f-896c- 83a78e22d213
-
[43]
A. Gonz´ alez, K. Feng, M. Casero, Effective separation of vehicle, road and bridge information from drive-by acceleration data via the power spectral density resulting from crossings at various speeds 14 (2023-04) 100162. doi:10.1016/j.dibe.2023.100162
-
[44]
A. Malekjafarian, F. Golpayegani, C. Moloney, S. Clarke, A Machine Learning Approach to Bridge-Damage Detection Using Responses Mea- sured on a Passing Vehicle 19 (2019-01) 4035
work page 2019
-
[45]
D. P. Kingma, J. Ba, Adam: A Method for Stochastic Optimization (2017). arXiv:1412.6980, doi:10.48550/arXiv.1412.6980
work page internal anchor Pith review Pith/arXiv arXiv doi:10.48550/arxiv.1412.6980 2017
-
[46]
Biewald, Experiment tracking with weights and biases, software avail- able from wandb.com (2020)
L. Biewald, Experiment tracking with weights and biases, software avail- able from wandb.com (2020). URL https://www.wandb.com/
work page 2020
-
[47]
G. Fan, J. Li, H. Hao, Vibration signal denoising for structural health monitoring by residual convolutional neural networks 157 (2020-06) 107651. doi:10.1016/j.measurement.2020.107651
-
[48]
K. He, X. Zhang, S. Ren, J. Sun, Deep Residual Learning for Image Recognition (2015-12-10). arXiv:1512.03385. 40
work page internal anchor Pith review Pith/arXiv arXiv 2015
-
[49]
Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling
J. Chung, C. Gulcehre, K. Cho, Y. Bengio, Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling (2014-12-11). arXiv:1412.3555
work page internal anchor Pith review Pith/arXiv arXiv 2014
-
[50]
J. Shi, H. Shi, J. Li, Z. Yu, Train-induced vibration response recon- struction for bridge damage detection with a deep learning methodology, Structures 64 (2024) 106496
work page 2024
-
[51]
A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A. N. Gomez, L. Kaiser, I. Polosukhin, Attention is all you need, CoRR abs/1706.03762 (2017). arXiv:1706.03762. URL http://arxiv.org/abs/1706.03762 41
work page internal anchor Pith review Pith/arXiv arXiv 2017
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.