Survival Analysis with Machine Learning for Predicting Li-ion Battery Remaining Useful Life

Dongjing Jiang; Jianfei Zhang; Jingyuan Xue; Qingchong Jiao; Redouane EL Bouchtaoui; Xiaozhen Zhao

arxiv: 2503.13558 · v9 · submitted 2025-03-17 · 📡 eess.SP · cs.AI· cs.LG

Survival Analysis with Machine Learning for Predicting Li-ion Battery Remaining Useful Life

Jingyuan Xue , Xiaozhen Zhao , Dongjing Jiang , Qingchong Jiao , Redouane EL Bouchtaoui , Jianfei Zhang This is my paper

Pith reviewed 2026-05-23 00:31 UTC · model grok-4.3

classification 📡 eess.SP cs.AIcs.LG

keywords survival analysisremaining useful lifelithium-ion batterypath signaturesDeepHitMTLRbattery degradationtime-to-failure

0 comments

The pith

Path signatures turn battery voltage time series into time-to-failure data that survival models use to estimate remaining useful life.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a hybrid survival analysis approach for lithium-ion battery remaining useful life prediction. It first converts raw voltage measurements into survival-style time-to-failure records via path signatures. Multiple Cox models plus machine-learning survival techniques such as DeepHit and MTLR are then trained to output time-dependent failure-free probability curves. Experiments on the Toyota and NASA battery datasets report strong discrimination (high time-dependent AUC and C-Index) together with well-calibrated probabilities (low integrated Brier score). The method is offered as a practical way to handle nonlinear degradation while supplying uncertainty estimates useful for maintenance planning.

Core claim

The central claim is that reconstructing battery voltage trajectories as path signatures produces inputs from which Cox-based and deep survival models can learn accurate, time-varying failure-free probability functions, demonstrated by superior AUC, concordance, and Brier scores on the Toyota and NASA datasets.

What carries the argument

Path-signature transformation of voltage time series into time-to-failure survival data, followed by training of Cox proportional-hazards models together with DeepHit and MTLR.

If this is right

Maintenance decisions can be based on continuous failure-probability curves rather than fixed capacity thresholds.
The same reconstruction-plus-survival pipeline supplies both point estimates and uncertainty bands for remaining life.
Public code release allows direct comparison against other RUL methods on the same Toyota and NASA traces.
The framework extends in principle to any sensor time series that ends in a failure event.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

Path signatures may reduce sensitivity to irregular sampling intervals common in real-world battery monitoring.
The approach could be tested on other degradation processes such as fuel-cell or supercapacitor aging.
Embedding the models in an online filtering loop would allow updating predictions as new voltage samples arrive.

Load-bearing premise

The path-signature features extracted from voltage series contain the information needed for survival models to recover the true degradation trajectory and produce reliable probability estimates.

What would settle it

Retraining the same pipeline on a new set of batteries whose voltage traces differ markedly in sampling rate or noise level and finding that time-dependent AUC drops below 0.7 or integrated Brier score rises above 0.2 would falsify the claim.

read the original abstract

Battery degradation significantly impacts the reliability and efficiency of energy storage systems, particularly in electric vehicles and industrial applications. Predicting the remaining useful life (RUL) of lithium-ion batteries is crucial for optimizing maintenance schedules, reducing costs, and improving safety. Traditional RUL prediction methods often struggle with nonlinear degradation patterns and uncertainty quantification. To address these challenges, we propose a hybrid survival analysis framework integrating survival data reconstruction, survival model learning, and survival probability estimation. Our approach transforms battery voltage time series into time-to-failure data using path signatures. The multiple Cox-based survival models and machine-learning-based methods, such as DeepHit and MTLR, are learned to predict battery failure-free probabilities over time. Experiments conducted on the Toyota battery and NASA battery datasets demonstrate the effectiveness of our approach, achieving high time-dependent AUC and concordance index (C-Index) while maintaining a low integrated Brier score. The data and source codes are available to the public at https://github.com/okic-ca/rul

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes a hybrid survival analysis framework for Li-ion battery RUL prediction. Voltage time series are transformed into time-to-failure data via path signatures; Cox-based models together with DeepHit and MTLR are then trained to output time-dependent failure-free probabilities. Experiments on the Toyota and NASA datasets are reported to yield high time-dependent AUC and C-Index together with low integrated Brier score; code and data are released publicly.

Significance. If the path-signature step is shown to be responsible for the reported gains, the framework could improve uncertainty-aware RUL estimation for nonlinear battery degradation, with direct relevance to EV and grid-storage reliability. Public release of code and data is a clear strength for reproducibility.

major comments (2)

[Abstract / §3] Abstract / §3 (path-signature construction): the central claim that path signatures produce inputs allowing survival models to capture degradation patterns rests on the reported AUC/C-Index/IBS numbers, yet no ablation replaces the signature features with standard summary statistics or raw down-sampled voltage and re-runs the identical Cox/DeepHit/MTLR pipelines; without this comparison the contribution of the featurization step cannot be isolated from the survival heads.
[Experiments] Experiments section: no description is given of train/test splits, cross-validation procedure, hyperparameter search, or censoring handling for the survival models; these omissions are load-bearing because the headline metrics are obtained by fitting directly to the Toyota and NASA datasets and could reflect overfitting rather than generalizable performance.

minor comments (1)

[Abstract] The abstract states 'multiple Cox-based survival models' without enumerating the variants or their differences from the baseline Cox model.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback. The two major comments identify important gaps in isolating the path-signature contribution and in documenting experimental procedures. We address both below and will revise the manuscript to incorporate the requested elements.

read point-by-point responses

Referee: [Abstract / §3] Abstract / §3 (path-signature construction): the central claim that path signatures produce inputs allowing survival models to capture degradation patterns rests on the reported AUC/C-Index/IBS numbers, yet no ablation replaces the signature features with standard summary statistics or raw down-sampled voltage and re-runs the identical Cox/DeepHit/MTLR pipelines; without this comparison the contribution of the featurization step cannot be isolated from the survival heads.

Authors: We agree that the current experiments do not isolate the contribution of the path-signature step. In the revised manuscript we will add an ablation study that replaces the signature features with (i) standard summary statistics (mean, variance, min, max, slope) and (ii) raw down-sampled voltage series, then re-trains the identical Cox, DeepHit and MTLR pipelines on the same Toyota and NASA splits. The resulting time-dependent AUC, C-Index and IBS values will be reported side-by-side with the original signature-based results. revision: yes
Referee: [Experiments] Experiments section: no description is given of train/test splits, cross-validation procedure, hyperparameter search, or censoring handling for the survival models; these omissions are load-bearing because the headline metrics are obtained by fitting directly to the Toyota and NASA datasets and could reflect overfitting rather than generalizable performance.

Authors: We acknowledge the omission. The revised Experiments section will explicitly describe: (1) the train/test partitioning (70/30 random split per battery, with time-based hold-out for the second dataset), (2) 5-fold cross-validation for hyper-parameter tuning, (3) the grid-search ranges and selection criterion for each model, and (4) the standard right-censoring mechanism used by all three survival heads. We will also state that all headline metrics are computed on the held-out test folds. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper describes a standard supervised pipeline: path-signature featurization of voltage trajectories to produce time-to-failure inputs, followed by training of Cox/DeepHit/MTLR models and reporting of AUC/C-Index/Brier metrics on the Toyota and NASA datasets. No equations, definitions, or steps are shown that reduce by construction to the inputs (no self-definitional mappings, no fitted parameters renamed as predictions, no load-bearing self-citations). The derivation remains independent of the reported performance numbers and is self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The framework rests on the domain assumption that path signatures preserve the information needed for survival modeling of battery degradation and on the standard assumption that the chosen ML survival models can be trained to produce calibrated probabilities on the given datasets. No new entities are introduced.

free parameters (1)

hyperparameters and fitted weights of DeepHit, MTLR, and Cox models
Machine learning survival models contain numerous parameters whose values are determined by training on the battery datasets.

axioms (1)

domain assumption Path signatures can be used to transform voltage time series into suitable time-to-failure representations for survival analysis
Invoked in the first stage of the proposed framework to prepare input for the survival models.

pith-pipeline@v0.9.0 · 5723 in / 1279 out tokens · 70172 ms · 2026-05-23T00:31:44.275251+00:00 · methodology

Survival Analysis with Machine Learning for Predicting Li-ion Battery Remaining Useful Life

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)