E4GEN: Event-level Explainable Extreme-Enhanced Time-series Generation

Dahai Yu; Guang Wang; Lin Jiang; Ximiao Li

arxiv: 2606.01634 · v1 · pith:4BSCFLQKnew · submitted 2026-06-01 · 💻 cs.LG · cs.AI

E4GEN: Event-level Explainable Extreme-Enhanced Time-series Generation

Lin Jiang , Dahai Yu , Ximiao Li , Guang Wang This is my paper

Pith reviewed 2026-06-28 15:40 UTC · model grok-4.3

classification 💻 cs.LG cs.AI

keywords time series generationdiffusion modelsextreme eventsexplainable generationdenoising processself-driven predictionevent-level control

0 comments

The pith

E4GEN is an explainable diffusion framework that generates time series with improved fidelity for both regular patterns and extreme events.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents E4GEN as a diffusion-based method for creating realistic time series that specifically addresses the common failure of prior models to capture extreme events accurately. It introduces three components that provide control over when extremes activate during denoising, what semantic signals to apply, and how to inject those signals layer by layer. A sympathetic reader would care because many real applications rely on generated data that includes rare but high-impact events without distorting normal trends or seasonality. The approach uses self-driven prediction to derive control signals from the data itself, avoiding the need for explicit extreme labels during training. Experiments across six datasets and seventeen metrics indicate gains in overall fidelity, extreme-event fidelity, and performance on downstream tasks.

Core claim

E4GEN provides systematic insights into when, what, and how to control extreme-event generation through three key components: E-Activator learns the dataset-adaptive extreme-control signal activation step during the denoising process; E-Predictor determines what control signal to enforce through Self-Driven Semantic Prediction and a novel Data-Conditioned Training, Noise-Initiated Sampling mechanism; E-Control specifies how to control extreme-event generation through a trainable Extreme Control Network that transforms the semantic control signal into layer-wise signals and injects it into the denoising process.

What carries the argument

The E-Activator, E-Predictor, and E-Control components that learn and apply dataset-adaptive extreme-control signals during the diffusion denoising process without interfering with regular temporal components.

If this is right

Generated time series achieve higher overall distributional fidelity while also improving fidelity on extreme events.
The generated data yields better results on downstream utility tasks compared with prior methods.
Control over extreme events is achieved through separate, interpretable decisions about activation timing, signal selection, and injection mechanism.
Training proceeds without requiring labeled extreme events thanks to the self-driven semantic prediction approach.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The self-driven control mechanism could be adapted to other generative tasks where rare events must be modeled without explicit labels.
Layer-wise injection of control signals might offer a template for adding targeted constraints in other diffusion or autoregressive models.
Datasets with different extreme-event characteristics could be used to test whether the activation step remains dataset-adaptive across domains.
The separation of activation, prediction, and control steps could help diagnose failure modes when generated extremes still deviate from real data.

Load-bearing premise

The Self-Driven Semantic Prediction and Data-Conditioned Training mechanism can reliably infer and apply latent extreme-event control signals without access to training labels for extremes.

What would settle it

Running E4GEN on a dataset with independently verified extreme events and finding that the generated series show no measurable improvement over baselines on extreme-specific fidelity metrics or that the inferred control signals fail to align with the timing of actual extremes.

Figures

Figures reproduced from arXiv: 2606.01634 by Dahai Yu, Guang Wang, Lin Jiang, Ximiao Li.

**Figure 2.** Figure 2: Value-level Extreme Enhancement [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

**Figure 3.** Figure 3: A Sample Denoising Process [PITH_FULL_IMAGE:figures/full_fig_p003_3.png] view at source ↗

**Figure 5.** Figure 5: shows the overall framework of our proposed E4GEN, which integrates three key components that systematically address when, what, and how to control extreme-event generation during the denoising process. Section 4.1 introduces E-Activator, which learns to decide the dataset-adaptive control activation step tCA. Section 4.2 presents E-Predictor, which estimates the extreme-control signal at tCA with two co… view at source ↗

**Figure 6.** Figure 6: Control Activation Window via BD and OES [PITH_FULL_IMAGE:figures/full_fig_p006_6.png] view at source ↗

**Figure 7.** Figure 7: Overall distribution comparison [PITH_FULL_IMAGE:figures/full_fig_p009_7.png] view at source ↗

**Figure 9.** Figure 9: GEV Distribution of Block Maxima Value-level extreme-aware generation is a common line of research in existing studies on extreme enhancement [3, 57, 21, 22, 23], and it is often framed as heavy-tailed generation. Its core idea is to directly strengthen tail behavior at the value-distribution level, e.g., by replacing light-tailed Gaussian noise with heavy-tailed alternatives such as Student-t noise [23]. … view at source ↗

**Figure 10.** Figure 10: Heavy-tail methods exhibit aggregation effects across all five dimensions, namely a [PITH_FULL_IMAGE:figures/full_fig_p018_10.png] view at source ↗

**Figure 11.** Figure 11: Coarse-to-fine generation trajectories of 10 randomly selected samples from a 500-step [PITH_FULL_IMAGE:figures/full_fig_p019_11.png] view at source ↗

**Figure 12.** Figure 12: Intrinsic patterns in extreme events [PITH_FULL_IMAGE:figures/full_fig_p019_12.png] view at source ↗

**Figure 13.** Figure 13: Illustration of Threshold Fluctuations in Extreme-event Definition. [PITH_FULL_IMAGE:figures/full_fig_p021_13.png] view at source ↗

**Figure 14.** Figure 14: Control Activation Window via BD and OES in More Datasets [PITH_FULL_IMAGE:figures/full_fig_p022_14.png] view at source ↗

**Figure 15.** Figure 15: Visualizations for Syn-Data Dataset • Wea-Temp: We use the Hourly Weather Data provided by Dewey [62], which contains hourly climate observations across the United States since 2018. Here, an extreme event refers to a low-temperature event, represented by a consecutive period with temperature below a predefined threshold. We select five nearby stations around Jacksonville in northeastern Florida, a regio… view at source ↗

**Figure 16.** Figure 16: Temperature series for five stations in April 2021. [PITH_FULL_IMAGE:figures/full_fig_p027_16.png] view at source ↗

**Figure 17.** Figure 17: Visualizations for Wea-Temp Dataset across Florida and record the daily total precipitation at each station. Daily precipitation is used instead of hourly precipitation because the hourly records contain substantial missing values. The data are partitioned into 90-day samples. Using records from January 1, 2023 to December 31, 2025, we construct a dataset of shape (1056, 90, 1). The extreme threshold is s… view at source ↗

**Figure 18.** Figure 18: Visualizations for Wea-Prec Dataset • LTST-ECG: We use the Long-Term ST Database (LTST) from PhysioNet [64, 65], which contains long-duration ambulatory ECG recordings with expert-provided ST-related annotations. In this dataset, we first identify abnormal ST-related intervals based on the provided annotations, and then define extreme events within these intervals as consecutive periods during which the … view at source ↗

**Figure 19.** Figure 19: Visualizations for LTST-ECG Dataset by a consecutive period during which the power values remain above a predefined threshold. We focus on the Global_active_power variable and select a continuous three-year period from January 1, 2007 to December 31, 2009. After temporal alignment and missing-value imputation, the data are resampled at 10-minute intervals, so that each day contains 144 observations. We th… view at source ↗

**Figure 20.** Figure 20: Visualizations for HH-Power Dataset 29 [PITH_FULL_IMAGE:figures/full_fig_p029_20.png] view at source ↗

**Figure 21.** Figure 21: Visualizations for PEMS-SF Dataset L.2 Descriptions of Metrics To comprehensively evaluate the performance of our generation model, we utilize seventeen distinct metrics assessed from two primary perspectives: overall generation quality and extreme-event generation quality. Below are the detailed definitions and implementations of each metric. For overall generation quality, we employ eight metrics to ass… view at source ↗

**Figure 22.** Figure 22: Example for interpretable generation dynamics. The six figures visualize the evolution of [PITH_FULL_IMAGE:figures/full_fig_p037_22.png] view at source ↗

**Figure 23.** Figure 23: Comparison of the evolution of extreme-value point proportions during the denoising process, [PITH_FULL_IMAGE:figures/full_fig_p038_23.png] view at source ↗

**Figure 24.** Figure 24: Visualization of E-Predictor predictions for extreme-event semantics on Syn-Data, LTST [PITH_FULL_IMAGE:figures/full_fig_p039_24.png] view at source ↗

**Figure 25.** Figure 25: Sensitivity analysis of E4GEN with respect to the alignment start step [PITH_FULL_IMAGE:figures/full_fig_p041_25.png] view at source ↗

**Figure 26.** Figure 26: Controllable extreme-event generation results under three user-specified semantic configura [PITH_FULL_IMAGE:figures/full_fig_p043_26.png] view at source ↗

read the original abstract

Generating realistic time series is essential for scientific research and real-world applications. However, existing methods often emphasize overall distributional fidelity while failing to faithfully capture extreme events. To advance existing research, we propose E4GEN, an explainable diffusion framework for extreme event-aware time-series generation. E4GEN provides systematic insights into when, what, and how to control extreme-event generation through three key components. First, E-Activator learns the dataset-adaptive extreme-control signal activation step during the denoising process without interfering with regular temporal components, including trend and seasonality. Second, E-Predictor determines what control signal to enforce through Self-Driven Semantic Prediction, where each sample derives its own control signal by inferring latent extreme-event information during generation. It also includes a novel Data-Conditioned Training, Noise-Initiated Sampling mechanism to address the issue of unavailable training labels. Third, E-Control specifies how to control extreme-event generation through a trainable Extreme Control Network, which transforms the semantic control signal into layer-wise signals and injects it into the denoising process. We evaluate E4GEN on six datasets with 17 metrics, and extensive experiments show that E4GEN outperforms state-of-the-art models across multiple dimensions, including overall fidelity, extreme-event fidelity, and downstream utility.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

E4GEN adds three diffusion modules aimed at extreme-event control in time series, but the self-driven inference step lacks independent checks that would confirm it actually captures extremes rather than dataset artifacts.

read the letter

The main takeaway is that this paper builds a diffusion generator with targeted modules for when, what, and how to handle extremes, yet the central claim of improved extreme fidelity rests on an unverified inference step.

What is new is the split into E-Activator for timing the control signal without messing up trends or seasonality, E-Predictor that runs self-driven semantic prediction plus data-conditioned training to create its own control signal from unlabeled data, and E-Control that turns the signal into layer-wise injections. The setup tries to keep the generation explainable at the event level while still using a standard diffusion backbone.

The work does address a practical gap. Most time-series generators optimize for overall distribution and can underperform on tails, which matters for risk or anomaly applications. Running on six datasets with 17 metrics and reporting gains in both fidelity and downstream utility shows they took the evaluation seriously rather than stopping at toy visuals.

The soft spots sit mainly in the validation of the self-driven part. The abstract describes inferring latent extreme information without training labels, but gives no separate test—such as correlation against known extreme timestamps or an ablation on how accurate the inferred signals are. If those signals are just picking up other statistics already in the diffusion process, the extreme-fidelity wins could be overstated. The lack of equations, error bars, or ablation tables in the provided text makes it difficult to judge how much the new components actually move the needle versus the base model. The stress-test concern about circular evaluation holds up here.

This is for people working on conditional or controllable time-series generation in applied domains. A reader who needs a starting point for extreme-aware diffusion would get some concrete module ideas to try. It is coherent enough on its own terms to deserve a serious referee, though the authors would need to add the missing checks on signal accuracy before acceptance.

Referee Report

3 major / 2 minor

Summary. The paper proposes E4GEN, an explainable diffusion framework for extreme-event-aware time-series generation. It introduces three components: E-Activator (learns dataset-adaptive extreme-control signal activation during denoising without interfering with trend/seasonality), E-Predictor (uses Self-Driven Semantic Prediction to derive per-sample control signals by inferring latent extreme information, plus Data-Conditioned Training and Noise-Initiated Sampling to handle missing extreme labels), and E-Control (trainable Extreme Control Network that transforms semantic signals into layer-wise injections). Experiments on six datasets using 17 metrics claim outperformance over SOTA in overall fidelity, extreme-event fidelity, and downstream utility.

Significance. If the empirical claims and the validity of the inferred control signals hold, the work would advance time-series generation by addressing a key weakness in capturing extremes while adding explainability; this has potential value in domains like finance, climate, and healthcare where extremes drive decisions. The parameter-free aspects of the activation and the label-free training mechanism are notable strengths if independently verified.

major comments (3)

[Abstract, §3] Abstract and §3 (E-Predictor): The central claim that Self-Driven Semantic Prediction plus Data-Conditioned Training reliably infers latent extreme-event control signals without any extreme labels lacks an independent validation step (e.g., correlation of inferred signals with known extreme timestamps or an ablation measuring signal accuracy against ground-truth extremes). Without this, gains on the 17 metrics could be attributable to the Extreme Control Network alone rather than true event-level control.
[Abstract] Abstract: The outperformance claim on extreme-event fidelity and overall utility is stated without any equations, ablation tables, error bars, dataset names, or statistical significance tests. This prevents assessment of whether the reported gains are load-bearing or could be explained by implementation details of the diffusion backbone.
[§3.2] §3.2 (Data-Conditioned Training): The mechanism for addressing unavailable training labels via Noise-Initiated Sampling is described at a high level; it is unclear whether the inferred signals are grounded externally or risk circularity by deriving control from the same generation process they are meant to condition.

minor comments (2)

[§3] Notation for the three invented entities (E-Activator, E-Predictor, E-Control) should be introduced with explicit mathematical definitions in §3 rather than descriptive prose only.
[Abstract, Experiments] The abstract mentions 'explainable' but does not specify what form the explanations take (e.g., visualizations of activation steps or signal attributions); this should be clarified with an example in the experiments section.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive feedback on E4GEN. The comments highlight important areas for strengthening the validation of the inferred control signals and the clarity of our claims and mechanisms. We address each major comment below and commit to revisions that enhance the manuscript without altering its core contributions.

read point-by-point responses

Referee: [Abstract, §3] Abstract and §3 (E-Predictor): The central claim that Self-Driven Semantic Prediction plus Data-Conditioned Training reliably infers latent extreme-event control signals without any extreme labels lacks an independent validation step (e.g., correlation of inferred signals with known extreme timestamps or an ablation measuring signal accuracy against ground-truth extremes). Without this, gains on the 17 metrics could be attributable to the Extreme Control Network alone rather than true event-level control.

Authors: We agree that an explicit independent validation step would more convincingly isolate the contribution of Self-Driven Semantic Prediction. The current manuscript demonstrates the value of the full E4GEN pipeline through ablations and downstream metrics, but does not include direct correlation of inferred signals against ground-truth extremes (as the framework is designed for label-free settings). In revision we will add experiments on synthetic data with known extreme timestamps to report correlation metrics and an ablation that disables the predictor while retaining E-Control, thereby addressing whether gains are attributable to true event-level control. revision: yes
Referee: [Abstract] Abstract: The outperformance claim on extreme-event fidelity and overall utility is stated without any equations, ablation tables, error bars, dataset names, or statistical significance tests. This prevents assessment of whether the reported gains are load-bearing or could be explained by implementation details of the diffusion backbone.

Authors: Abstracts are necessarily concise, and the manuscript already contains the requested details (dataset names, ablation tables, 17 metrics, error bars, and significance tests) in Sections 4 and 5. To improve accessibility we will revise the abstract to name the six datasets and cite the key quantitative gains with references to the corresponding tables and statistical tests, while leaving the full equations and ablations in the body. revision: partial
Referee: [§3.2] §3.2 (Data-Conditioned Training): The mechanism for addressing unavailable training labels via Noise-Initiated Sampling is described at a high level; it is unclear whether the inferred signals are grounded externally or risk circularity by deriving control from the same generation process they are meant to condition.

Authors: We will expand §3.2 with additional pseudocode and a step-by-step diagram clarifying the training and sampling flow. Data-Conditioned Training learns the predictor from the empirical data distribution during the forward process; Noise-Initiated Sampling begins from pure noise but conditions the predictor on progressively denoised samples drawn from the same distribution. This is not circular because the predictor is trained to recover latent extreme semantics from the data itself, independent of the final generated output. The revised text will make this grounding explicit. revision: yes

Circularity Check

0 steps flagged

No circularity detected; derivation self-contained

full rationale

The paper presents E4GEN as a diffusion-based framework with three components (E-Activator, E-Predictor via Self-Driven Semantic Prediction and Data-Conditioned Training, E-Control) for extreme-event time-series generation. No equations, derivations, or parameter-fitting steps are visible in the provided text that would allow reduction of any claimed prediction or control signal to its inputs by construction. The central claims rest on experimental outperformance across six datasets and 17 metrics rather than on any self-referential mathematical identity or unverified self-citation chain. The absence of load-bearing self-citations or ansatz smuggling in the abstract supports treating the described mechanisms as independently motivated.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 3 invented entities

Abstract-only; no access to methods, equations, or experiments to enumerate free parameters, axioms, or invented entities beyond the three named components.

invented entities (3)

E-Activator no independent evidence
purpose: Learns dataset-adaptive extreme-control signal activation step
New named component introduced to control when extremes activate
E-Predictor no independent evidence
purpose: Determines control signal via Self-Driven Semantic Prediction
New named component for inferring extreme signals without labels
E-Control no independent evidence
purpose: Transforms semantic control signal into layer-wise signals
New named component for injecting control into denoising

pith-pipeline@v0.9.1-grok · 5759 in / 1074 out tokens · 22341 ms · 2026-06-28T15:40:11.613785+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

70 extracted references · 6 canonical work pages · 2 internal anchors

[1]

Time-series generative adversarial networks.Advances in neural information processing systems, 32, 2019

Jinsung Yoon, Daniel Jarrett, and Mihaela Van der Schaar. Time-series generative adversarial networks.Advances in neural information processing systems, 32, 2019

2019
[2]

Time-series generation by contrastive imitation.Advances in neural information processing systems, 34:28968–28982, 2021

Daniel Jarrett, Ioana Bica, and Mihaela van der Schaar. Time-series generation by contrastive imitation.Advances in neural information processing systems, 34:28968–28982, 2021

2021
[3]

Fide: Frequency-inflated conditional diffusion model for extreme-aware time series generation.Advances in Neural Information Processing Systems, 37:114434–114457, 2024

Asadullah Hill Galib, Pang-Ning Tan, and Lifeng Luo. Fide: Frequency-inflated conditional diffusion model for extreme-aware time series generation.Advances in Neural Information Processing Systems, 37:114434–114457, 2024

2024
[4]

Diffusion-ts: Interpretable diffusion for general time series genera- tion

Xinyu Yuan and Yan Qiao. Diffusion-ts: Interpretable diffusion for general time series genera- tion. InThe Twelfth International Conference on Learning Representations (ICLR), 2024

2024
[5]

Forging time series with language: A large language model approach to synthetic data generation.Advances in Neural Information Processing Systems, 2025

Cécile Rousseau, Tobia Boschi, Giandomenico Cornacchia, Dhaval Salwala, Alessandra Pascale, and Juan Bernabe Moreno. Forging time series with language: A large language model approach to synthetic data generation.Advances in Neural Information Processing Systems, 2025

2025
[6]

Pcf-gan: generating sequential data via the characteristic function of measures on the path space.Advances in Neural Information Processing Systems, 36:39755–39781, 2023

Hang Lou, Siran Li, and Hao Ni. Pcf-gan: generating sequential data via the characteristic function of measures on the path space.Advances in Neural Information Processing Systems, 36:39755–39781, 2023

2023
[7]

Tsgm: a flexible framework for generative modeling of synthetic time series.Advances in Neural Information Processing Systems, 37:129042–129061, 2024

Alexander Nikitin, Letizia Iannucci, and Samuel Kaski. Tsgm: a flexible framework for generative modeling of synthetic time series.Advances in Neural Information Processing Systems, 37:129042–129061, 2024

2024
[8]

Self-interpretable time series prediction with counterfactual explanations

Jingquan Yan and Hao Wang. Self-interpretable time series prediction with counterfactual explanations. InInternational Conference on Machine Learning, pages 39110–39125. PMLR, 2023

2023
[9]

Surrogate time series.Physica D: Nonlinear Phenom- ena, 142(3-4):346–382, 2000

Thomas Schreiber and Andreas Schmitz. Surrogate time series.Physica D: Nonlinear Phenom- ena, 142(3-4):346–382, 2000

2000
[10]

Gt-gan: General purpose time series synthesis with generative adversarial networks.Advances in Neural Information Processing Systems, 35:36999–37010, 2022

Jinsung Jeon, Jeonghak Kim, Haryong Song, Seunghyeon Cho, and Noseong Park. Gt-gan: General purpose time series synthesis with generative adversarial networks.Advances in Neural Information Processing Systems, 35:36999–37010, 2022. 10

2022
[11]

TimeVAE: A variational auto-encoder for multivariate time series generation.arXiv preprint arXiv:2111.08095, 2021

Abhyuday Desai, Cynthia Freeman, Zuhui Wang, and Ian Beaver. Timevae: A variational auto-encoder for multivariate time series generation.arXiv preprint arXiv:2111.08095, 2021

work page arXiv 2021
[12]

Gen- erative modeling of regular and irregular time series data via koopman vaes

Ilan Naiman, N Benjamin Erichson, Pu Ren, Michael W Mahoney, and Omri Azencot. Gen- erative modeling of regular and irregular time series data via koopman vaes. InThe Twelfth International Conference on Learning Representations, 2024

2024
[13]

Generative time-series modeling with fourier flows

Ahmed Alaa, Alex James Chan, and Mihaela van der Schaar. Generative time-series modeling with fourier flows. InInternational Conference on Learning Representations, 2021

2021
[14]

Are language models actually useful for time series forecasting?Advances in Neural Information Processing Systems, 37:60162–60191, 2024

Mingtian Tan, Mike Merrill, Vinayak Gupta, Tim Althoff, and Tom Hartvigsen. Are language models actually useful for time series forecasting?Advances in Neural Information Processing Systems, 37:60162–60191, 2024

2024
[15]

Diffwave: A versatile diffusion model for audio synthesis

Zhifeng Kong, Wei Ping, Jiaji Huang, Kexin Zhao, and Bryan Catanzaro. Diffwave: A versatile diffusion model for audio synthesis. InInternational Conference on Learning Representations, 2021

2021
[16]

On the constrained time-series generation problem.Advances in Neural Information Processing Systems, 36:61048–61059, 2023

Andrea Coletta, Sriram Gopalakrishnan, Daniel Borrajo, and Svitlana Vyetrenko. On the constrained time-series generation problem.Advances in Neural Information Processing Systems, 36:61048–61059, 2023

2023
[17]

Generative adversarial networks in time series: A systematic literature review.ACM Computing Surveys, 55(10):1–31, 2023

Eoin Brophy, Zhengwei Wang, Qi She, and Tomás Ward. Generative adversarial networks in time series: A systematic literature review.ACM Computing Surveys, 55(10):1–31, 2023

2023
[18]

Beyond the norm: A survey of synthetic data generation for rare events.arXiv preprint arXiv:2506.06380, 2025

Jingyi Gu, Xuan Zhang, and Guiling Wang. Beyond the norm: A survey of synthetic data generation for rare events.arXiv preprint arXiv:2506.06380, 2025

work page arXiv 2025
[19]

Long-tailed diffusion models with oriented calibration

Tianjiao Zhang, Huangjie Zheng, Jiangchao Yao, Xiangfeng Wang, Mingyuan Zhou, Ya Zhang, and Yanfeng Wang. Long-tailed diffusion models with oriented calibration. InThe twelfth international conference on learning representations, 2024

2024
[20]

Improving generation quality of long-tailed diffusion via disentangled latent representations

Esther Rodriguez, Monica Welfert, Samuel McDowell, Nathan Stromberg, Julian Antolin Camarena, and Lalitha Sankar. Improving generation quality of long-tailed diffusion via disentangled latent representations. InNeurIPS Workshop on Structured Probabilistic Inference & Generative Modeling, 2025

2025
[21]

Tails of lipschitz triangular flows

Priyank Jaini, Ivan Kobyzev, Yaoliang Yu, and Marcus Brubaker. Tails of lipschitz triangular flows. InInternational Conference on Machine Learning, pages 4673–4681. PMLR, 2020

2020
[22]

t3-variational autoencoder: Learning heavy-tailed data with student’s t and power divergence

Juno Kim, Jaehyuk Kwon, Mincheol Cho, Hyunjong Lee, and Joong-Ho Won. t3-variational autoencoder: Learning heavy-tailed data with student’s t and power divergence. InInternational Conference on Learning Representations (ICLR), 2024

2024
[23]

Heavy-tailed diffusion models

Kushagra Pandey, Jaideep Pathak, Yilun Xu, Stephan Mandt, Michael Pritchard, Arash Vahdat, and Morteza Mardani. Heavy-tailed diffusion models. InInternational Conference on Learning Representations (ICLR), 2025

2025
[24]

Back to Basics: Let Denoising Generative Models Denoise

Tianhong Li and Kaiming He. Back to basics: Let denoising generative models denoise.arXiv preprint arXiv:2511.13720, 2025

work page internal anchor Pith review Pith/arXiv arXiv 2025
[25]

Deterministic nonperiodic flow 1

Edward N Lorenz. Deterministic nonperiodic flow 1. InUniversality in Chaos, 2nd edition, pages 367–378. Routledge, 2017

2017
[26]

Recurrence time analysis, long-term correlations, and extreme events.Physical Review E—Statistical, Nonlinear, and Soft Matter Physics, 71(5):056106, 2005

Eduardo G Altmann and Holger Kantz. Recurrence time analysis, long-term correlations, and extreme events.Physical Review E—Statistical, Nonlinear, and Soft Matter Physics, 71(5):056106, 2005

2005
[27]

The effect of long-term correlations on the return periods of rare events.Physica A: Statistical Mechanics and its Applications, 330(1-2):1–7, 2003

Armin Bunde, Jan F Eichner, Shlomo Havlin, and Jan W Kantelhardt. The effect of long-term correlations on the return periods of rare events.Physica A: Statistical Mechanics and its Applications, 330(1-2):1–7, 2003

2003
[28]

Improved denoising diffusion probabilistic models

Alexander Quinn Nichol and Prafulla Dhariwal. Improved denoising diffusion probabilistic models. InInternational conference on machine learning, pages 8162–8171. PMLR, 2021. 11

2021
[29]

Timebridge: Better diffusion prior design with bridge models for time series generation

Jinseong Park, Seungyun Lee, Woojin Jeong, Yujin Choi, and Jaewook Lee. Timebridge: Better diffusion prior design with bridge models for time series generation. InProceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V . 1, pages 1135–1146, 2026

2026
[30]

Stl: A seasonal-trend decomposition.J

Robert B Cleveland, William S Cleveland, Jean E McRae, Irma Terpenning, et al. Stl: A seasonal-trend decomposition.J. off. Stat, 6(1):3–73, 1990

1990
[31]

Defining extreme events: A cross-disciplinary review.Earth’s Future, 6(3):441–455, 2018

Lauren E McPhillips, Heejun Chang, Mikhail V Chester, Yaella Depietri, Erin Friedman, Nancy B Grimm, John S Kominoski, Timon McPhearson, Pablo Méndez-Lázaro, Emma J Rosi, et al. Defining extreme events: A cross-disciplinary review.Earth’s Future, 6(3):441–455, 2018

2018
[32]

Models for exceedances over high thresholds.Journal of the Royal Statistical Society Series B: Statistical Methodology, 52(3):393–425, 1990

Anthony C Davison and Richard L Smith. Models for exceedances over high thresholds.Journal of the Royal Statistical Society Series B: Statistical Methodology, 52(3):393–425, 1990

1990
[33]

Time series shapelets: a new primitive for data mining

Lexiang Ye and Eamonn Keogh. Time series shapelets: a new primitive for data mining. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 947–956, 2009

2009
[34]

Denoising diffusion probabilistic models.Advances in neural information processing systems, 33:6840–6851, 2020

Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising diffusion probabilistic models.Advances in neural information processing systems, 33:6840–6851, 2020

2020
[35]

Springer, 2009

Cédric Villani et al.Optimal transport: old and new, volume 338. Springer, 2009

2009
[36]

On the estimation of the discrepancy between empirical curves of distribution for two independent samples.Bull

Nikolai V Smirnov et al. On the estimation of the discrepancy between empirical curves of distribution for two independent samples.Bull. Math. Univ. Moscou, 2(2):3–14, 1939

1939
[37]

Divergence measures based on the shannon entropy.IEEE Transactions on Information theory, 37(1):145–151, 2002

Jianhua Lin. Divergence measures based on the shannon entropy.IEEE Transactions on Information theory, 37(1):145–151, 2002

2002
[38]

A kernel two-sample test.The journal of machine learning research, 13(1):723–773, 2012

Arthur Gretton, Karsten M Borgwardt, Malte J Rasch, Bernhard Schölkopf, and Alexander Smola. A kernel two-sample test.The journal of machine learning research, 13(1):723–773, 2012

2012
[39]

Data series similarity using correlation-aware measures

Katsiaryna Mirylenka, Michele Dallachiesa, and Themis Palpanas. Data series similarity using correlation-aware measures. InProceedings of the 29th International Conference on Scientific and Statistical Database Management, pages 1–12, 2017

2017
[40]

Tsgbench: Time series generation benchmark.Proceedings of the VLDB Endowment, 17(3):305–318, 2023

Yihao Ang, Qiang Huang, Yifan Bao, Anthony KH Tung, and Zhiyong Huang. Tsgbench: Time series generation benchmark.Proceedings of the VLDB Endowment, 17(3):305–318, 2023

2023
[41]

Psa-gan: Progressive self attention gans for synthetic time series

Paul Jeha, Michael Bohlke-Schneider, Pedro Mercado, Shubham Kapoor, Rajbir Singh Nirwan, Valentin Flunkert, Jan Gasthaus, and Tim Januschowski. Psa-gan: Progressive self attention gans for synthetic time series. InThe tenth international conference on learning representations, 2022

2022
[42]

LSTM-based Encoder-Decoder for Multi-sensor Anomaly Detection

Pankaj Malhotra, Anusha Ramakrishnan, Gaurangi Anand, Lovekesh Vig, Puneet Agarwal, and Gautam Shroff. Lstm-based encoder-decoder for multi-sensor anomaly detection.arXiv preprint arXiv:1607.00148, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016
[43]

C-rnn-gan: Continuous recurrent neural networks with adversarial training, 2016

Olof Mogren. C-rnn-gan: Continuous recurrent neural networks with adversarial training, 2016

2016
[44]

Hyland, and Gunnar Rätsch

Cristóbal Esteban, Stephanie L. Hyland, and Gunnar Rätsch. Real-valued (medical) time series generation with recurrent conditional gans, 2017

2017
[45]

A recurrent latent variable model for sequential data.arXiv, 2015

Junyoung Chung, Kyle Kastner, Laurent Dinh, Kratarth Goel, Aaron Courville, and Yoshua Bengio. A recurrent latent variable model for sequential data.arXiv, 2015

2015
[46]

Diffwave: A versatile diffusion model for audio synthesis.arXiv, 2020

Zhifeng Kong, Wei Ping, Jiaji Huang, Kexin Zhao, and Bryan Catanzaro. Diffwave: A versatile diffusion model for audio synthesis.arXiv, 2020

2020
[47]

Kashif Rasul, Calvin Seward, Ingmar Schuster, and Roland V ollgraf. Autoregressive denoising diffusion models for multivariate probabilistic time series forecasting.Proceedings of the 38th International Conference on Machine Learning, PMLR 139:8857-8868, 2021, 2021. 12

2021
[48]

Csdi: Conditional score-based diffusion models for probabilistic time series imputation.arXiv, 2021

Yusuke Tashiro, Jiaming Song, Yang Song, and Stefano Ermon. Csdi: Conditional score-based diffusion models for probabilistic time series imputation.arXiv, 2021

2021
[49]

Diffusion-ts: Interpretable diffusion for general time series genera- tion.arXiv, 2024

Xinyu Yuan and Yan Qiao. Diffusion-ts: Interpretable diffusion for general time series genera- tion.arXiv, 2024

2024
[50]

Hao Xue and Flora D. Salim. Promptcast: A new prompt-based learning paradigm for time series forecasting.IEEE Transactions on Knowledge and Data Engineering, 2023

2023
[51]

Hua, R., Liu, Z., Zhang, K., and Yang, Y

Nate Gruver, Marc Finzi, Shikai Qiu, and Andrew Gordon Wilson. Large language models are zero-shot time series forecasters.arXiv preprint arXiv:2310.07820, 2023

work page arXiv 2023
[52]

One fits all: Power general time series analysis by pretrained lm

Tian Zhou, Peisong Niu, Xue Wang, Liang Sun, and Rong Jin. One fits all: Power general time series analysis by pretrained lm. InAdvances in Neural Information Processing Systems, volume 36, 2023

2023
[53]

Time-LLM: Time series forecasting by reprogramming large language models

Ming Jin, Shiyu Wang, Lintao Ma, Zhixuan Chu, James Y Zhang, Xiaoming Shi, Pin-Yu Chen, Yuxuan Liang, Yuan-Fang Li, Shirui Pan, and Qingsong Wen. Time-LLM: Time series forecasting by reprogramming large language models. InInternational Conference on Learning Representations (ICLR), 2024

2024
[54]

Maddix, and Yuyang Wang

Abdul Fatir Ansari, Lorenzo Stella, Caner Turkmen, Xiyuan Zhang, Pedro Mercado, Huibin Shen, Oleksandr Shchur, Syama Sundar Rangapuram, Sebastian Pineda Arango, Shubham Kapoor, Jasper Zschiegner, Danielle C. Maddix, and Yuyang Wang. Chronos: Learning the language of time series.arXiv, 2024

2024
[55]

Heavy-tailed diffusion with denoising levy probabilistic models

Dario Shariatian, Umut Simsekli, and Alain Durmus. Heavy-tailed diffusion with denoising levy probabilistic models. InInternational Conference on Learning Representations, 2025

2025
[56]

Exgan: Adversarial generation of extreme samples

Siddharth Bhatia, Arjit Jain, and Bryan Hooi. Exgan: Adversarial generation of extreme samples. InProceedings of the AAAI Conference on Artificial Intelligence, volume 35, pages 6841–6849, 2021

2021
[57]

Pareto gan: Extending the representational power of gans to heavy-tailed distributions

Todd Huster, Jeremy Cohen, Zinan Lin, Kevin Chan, Charles Kamhoua, Nandi O Leslie, Cho- Yu Jason Chiang, and Vyas Sekar. Pareto gan: Extending the representational power of gans to heavy-tailed distributions. InInternational Conference on Machine Learning, pages 4523–4532. PMLR, 2021

2021
[58]

Springer, 2006

Laurens De Haan and Ana Ferreira.Extreme value theory: an introduction. Springer, 2006

2006
[59]

Oreshkin, Dmitri Carpov, Nicolas Chapados, and Yoshua Bengio

Boris N. Oreshkin, Dmitri Carpov, Nicolas Chapados, and Yoshua Bengio. N-beats: Neural basis expansion analysis for interpretable time series forecasting. InThe Eighth International Conference on Learning Representations (ICLR), 2020

2020
[60]

An algorithm for the machine calculation of complex fourier series.Mathematics of computation, 19(90):297–301, 1965

James W Cooley and John W Tukey. An algorithm for the machine calculation of complex fourier series.Mathematics of computation, 19(90):297–301, 1965

1965
[61]

Adding conditional control to text-to-image diffusion models

Lvmin Zhang, Anyi Rao, and Maneesh Agrawala. Adding conditional control to text-to-image diffusion models. InProceedings of the IEEE/CVF international conference on computer vision, pages 3836–3847, 2023

2023
[62]

Hourly weather data [dataset], 2022

Custom Weather. Hourly weather data [dataset], 2022. Dewey Data. https://doi.org/10. 82551/YX6H-K352

2022
[63]

Daily weather data [dataset]], 2022

Custom Weather. Daily weather data [dataset]], 2022. Dewey Data. https://doi.org/10. 82551/VBWQ-AQ20

2022
[64]

Moody, Michele Emdin, Gorazd Antolic, Roman Dorn, Ales Smrdel, Carlo Marchesi, and Roger G

Franc Jager, Alessandro Taddei, George B. Moody, Michele Emdin, Gorazd Antolic, Roman Dorn, Ales Smrdel, Carlo Marchesi, and Roger G. Mark. Long-term ST database: a reference for the development and evaluation of automated ischaemia detectors and for the study of the dynamics of myocardial ischaemia.Medical & Biological Engineering & Computing, 41(2):172–...

2003
[65]

Physiobank, physiotoolkit, and physionet: components of a new research resource for complex physiologic signals.circulation, 101(23):e215–e220, 2000

Ary L Goldberger, Luis AN Amaral, Leon Glass, Jeffrey M Hausdorff, Plamen Ch Ivanov, Roger G Mark, Joseph E Mietus, George B Moody, Chung-Kang Peng, and H Eugene Stanley. Physiobank, physiotoolkit, and physionet: components of a new research resource for complex physiologic signals.circulation, 101(23):e215–e220, 2000

2000
[66]

Individual household electric power consumption [dataset],

Georges Hebrail and Alice Berard. Individual household electric power consumption [dataset],
[67]

https://archive.ics.uci.edu/ml/datasets/ individual+household+electric+power+consumption

UCI Machine Learning Repository. https://archive.ics.uci.edu/ml/datasets/ individual+household+electric+power+consumption
[68]

Pems-sf [dataset], 2011

Marco Cuturi. Pems-sf [dataset], 2011. UCI Machine Learning Repository. https://archive. ics.uci.edu/ml/datasets/pems-sf

2011
[69]

Performance measurement system (pems), 2026

California Department of Transportation. Performance measurement system (pems), 2026. https://dot.ca.gov/programs/traffic-operations/mpr/pems-source

2026
[70]

Ts2vec: Towards universal representation of time series

Zhihan Yue, Yujing Wang, Juanyong Duan, Tianmeng Yang, Congrui Huang, Yunhai Tong, and Bixiong Xu. Ts2vec: Towards universal representation of time series. InProceedings of the AAAI conference on artificial intelligence, volume 36, pages 8980–8987, 2022. 14 A Reproducibility Statement To improve reproducibility, we release an anonymous GitHub repository a...

work page arXiv 2022

[1] [1]

Time-series generative adversarial networks.Advances in neural information processing systems, 32, 2019

Jinsung Yoon, Daniel Jarrett, and Mihaela Van der Schaar. Time-series generative adversarial networks.Advances in neural information processing systems, 32, 2019

2019

[2] [2]

Time-series generation by contrastive imitation.Advances in neural information processing systems, 34:28968–28982, 2021

Daniel Jarrett, Ioana Bica, and Mihaela van der Schaar. Time-series generation by contrastive imitation.Advances in neural information processing systems, 34:28968–28982, 2021

2021

[3] [3]

Fide: Frequency-inflated conditional diffusion model for extreme-aware time series generation.Advances in Neural Information Processing Systems, 37:114434–114457, 2024

Asadullah Hill Galib, Pang-Ning Tan, and Lifeng Luo. Fide: Frequency-inflated conditional diffusion model for extreme-aware time series generation.Advances in Neural Information Processing Systems, 37:114434–114457, 2024

2024

[4] [4]

Diffusion-ts: Interpretable diffusion for general time series genera- tion

Xinyu Yuan and Yan Qiao. Diffusion-ts: Interpretable diffusion for general time series genera- tion. InThe Twelfth International Conference on Learning Representations (ICLR), 2024

2024

[5] [5]

Forging time series with language: A large language model approach to synthetic data generation.Advances in Neural Information Processing Systems, 2025

Cécile Rousseau, Tobia Boschi, Giandomenico Cornacchia, Dhaval Salwala, Alessandra Pascale, and Juan Bernabe Moreno. Forging time series with language: A large language model approach to synthetic data generation.Advances in Neural Information Processing Systems, 2025

2025

[6] [6]

Pcf-gan: generating sequential data via the characteristic function of measures on the path space.Advances in Neural Information Processing Systems, 36:39755–39781, 2023

Hang Lou, Siran Li, and Hao Ni. Pcf-gan: generating sequential data via the characteristic function of measures on the path space.Advances in Neural Information Processing Systems, 36:39755–39781, 2023

2023

[7] [7]

Tsgm: a flexible framework for generative modeling of synthetic time series.Advances in Neural Information Processing Systems, 37:129042–129061, 2024

Alexander Nikitin, Letizia Iannucci, and Samuel Kaski. Tsgm: a flexible framework for generative modeling of synthetic time series.Advances in Neural Information Processing Systems, 37:129042–129061, 2024

2024

[8] [8]

Self-interpretable time series prediction with counterfactual explanations

Jingquan Yan and Hao Wang. Self-interpretable time series prediction with counterfactual explanations. InInternational Conference on Machine Learning, pages 39110–39125. PMLR, 2023

2023

[9] [9]

Surrogate time series.Physica D: Nonlinear Phenom- ena, 142(3-4):346–382, 2000

Thomas Schreiber and Andreas Schmitz. Surrogate time series.Physica D: Nonlinear Phenom- ena, 142(3-4):346–382, 2000

2000

[10] [10]

Gt-gan: General purpose time series synthesis with generative adversarial networks.Advances in Neural Information Processing Systems, 35:36999–37010, 2022

Jinsung Jeon, Jeonghak Kim, Haryong Song, Seunghyeon Cho, and Noseong Park. Gt-gan: General purpose time series synthesis with generative adversarial networks.Advances in Neural Information Processing Systems, 35:36999–37010, 2022. 10

2022

[11] [11]

TimeVAE: A variational auto-encoder for multivariate time series generation.arXiv preprint arXiv:2111.08095, 2021

Abhyuday Desai, Cynthia Freeman, Zuhui Wang, and Ian Beaver. Timevae: A variational auto-encoder for multivariate time series generation.arXiv preprint arXiv:2111.08095, 2021

work page arXiv 2021

[12] [12]

Gen- erative modeling of regular and irregular time series data via koopman vaes

Ilan Naiman, N Benjamin Erichson, Pu Ren, Michael W Mahoney, and Omri Azencot. Gen- erative modeling of regular and irregular time series data via koopman vaes. InThe Twelfth International Conference on Learning Representations, 2024

2024

[13] [13]

Generative time-series modeling with fourier flows

Ahmed Alaa, Alex James Chan, and Mihaela van der Schaar. Generative time-series modeling with fourier flows. InInternational Conference on Learning Representations, 2021

2021

[14] [14]

Are language models actually useful for time series forecasting?Advances in Neural Information Processing Systems, 37:60162–60191, 2024

Mingtian Tan, Mike Merrill, Vinayak Gupta, Tim Althoff, and Tom Hartvigsen. Are language models actually useful for time series forecasting?Advances in Neural Information Processing Systems, 37:60162–60191, 2024

2024

[15] [15]

Diffwave: A versatile diffusion model for audio synthesis

Zhifeng Kong, Wei Ping, Jiaji Huang, Kexin Zhao, and Bryan Catanzaro. Diffwave: A versatile diffusion model for audio synthesis. InInternational Conference on Learning Representations, 2021

2021

[16] [16]

On the constrained time-series generation problem.Advances in Neural Information Processing Systems, 36:61048–61059, 2023

Andrea Coletta, Sriram Gopalakrishnan, Daniel Borrajo, and Svitlana Vyetrenko. On the constrained time-series generation problem.Advances in Neural Information Processing Systems, 36:61048–61059, 2023

2023

[17] [17]

Generative adversarial networks in time series: A systematic literature review.ACM Computing Surveys, 55(10):1–31, 2023

Eoin Brophy, Zhengwei Wang, Qi She, and Tomás Ward. Generative adversarial networks in time series: A systematic literature review.ACM Computing Surveys, 55(10):1–31, 2023

2023

[18] [18]

Beyond the norm: A survey of synthetic data generation for rare events.arXiv preprint arXiv:2506.06380, 2025

Jingyi Gu, Xuan Zhang, and Guiling Wang. Beyond the norm: A survey of synthetic data generation for rare events.arXiv preprint arXiv:2506.06380, 2025

work page arXiv 2025

[19] [19]

Long-tailed diffusion models with oriented calibration

Tianjiao Zhang, Huangjie Zheng, Jiangchao Yao, Xiangfeng Wang, Mingyuan Zhou, Ya Zhang, and Yanfeng Wang. Long-tailed diffusion models with oriented calibration. InThe twelfth international conference on learning representations, 2024

2024

[20] [20]

Improving generation quality of long-tailed diffusion via disentangled latent representations

Esther Rodriguez, Monica Welfert, Samuel McDowell, Nathan Stromberg, Julian Antolin Camarena, and Lalitha Sankar. Improving generation quality of long-tailed diffusion via disentangled latent representations. InNeurIPS Workshop on Structured Probabilistic Inference & Generative Modeling, 2025

2025

[21] [21]

Tails of lipschitz triangular flows

Priyank Jaini, Ivan Kobyzev, Yaoliang Yu, and Marcus Brubaker. Tails of lipschitz triangular flows. InInternational Conference on Machine Learning, pages 4673–4681. PMLR, 2020

2020

[22] [22]

t3-variational autoencoder: Learning heavy-tailed data with student’s t and power divergence

Juno Kim, Jaehyuk Kwon, Mincheol Cho, Hyunjong Lee, and Joong-Ho Won. t3-variational autoencoder: Learning heavy-tailed data with student’s t and power divergence. InInternational Conference on Learning Representations (ICLR), 2024

2024

[23] [23]

Heavy-tailed diffusion models

Kushagra Pandey, Jaideep Pathak, Yilun Xu, Stephan Mandt, Michael Pritchard, Arash Vahdat, and Morteza Mardani. Heavy-tailed diffusion models. InInternational Conference on Learning Representations (ICLR), 2025

2025

[24] [24]

Back to Basics: Let Denoising Generative Models Denoise

Tianhong Li and Kaiming He. Back to basics: Let denoising generative models denoise.arXiv preprint arXiv:2511.13720, 2025

work page internal anchor Pith review Pith/arXiv arXiv 2025

[25] [25]

Deterministic nonperiodic flow 1

Edward N Lorenz. Deterministic nonperiodic flow 1. InUniversality in Chaos, 2nd edition, pages 367–378. Routledge, 2017

2017

[26] [26]

Recurrence time analysis, long-term correlations, and extreme events.Physical Review E—Statistical, Nonlinear, and Soft Matter Physics, 71(5):056106, 2005

Eduardo G Altmann and Holger Kantz. Recurrence time analysis, long-term correlations, and extreme events.Physical Review E—Statistical, Nonlinear, and Soft Matter Physics, 71(5):056106, 2005

2005

[27] [27]

The effect of long-term correlations on the return periods of rare events.Physica A: Statistical Mechanics and its Applications, 330(1-2):1–7, 2003

Armin Bunde, Jan F Eichner, Shlomo Havlin, and Jan W Kantelhardt. The effect of long-term correlations on the return periods of rare events.Physica A: Statistical Mechanics and its Applications, 330(1-2):1–7, 2003

2003

[28] [28]

Improved denoising diffusion probabilistic models

Alexander Quinn Nichol and Prafulla Dhariwal. Improved denoising diffusion probabilistic models. InInternational conference on machine learning, pages 8162–8171. PMLR, 2021. 11

2021

[29] [29]

Timebridge: Better diffusion prior design with bridge models for time series generation

Jinseong Park, Seungyun Lee, Woojin Jeong, Yujin Choi, and Jaewook Lee. Timebridge: Better diffusion prior design with bridge models for time series generation. InProceedings of the 32nd ACM SIGKDD Conference on Knowledge Discovery and Data Mining V . 1, pages 1135–1146, 2026

2026

[30] [30]

Stl: A seasonal-trend decomposition.J

Robert B Cleveland, William S Cleveland, Jean E McRae, Irma Terpenning, et al. Stl: A seasonal-trend decomposition.J. off. Stat, 6(1):3–73, 1990

1990

[31] [31]

Defining extreme events: A cross-disciplinary review.Earth’s Future, 6(3):441–455, 2018

Lauren E McPhillips, Heejun Chang, Mikhail V Chester, Yaella Depietri, Erin Friedman, Nancy B Grimm, John S Kominoski, Timon McPhearson, Pablo Méndez-Lázaro, Emma J Rosi, et al. Defining extreme events: A cross-disciplinary review.Earth’s Future, 6(3):441–455, 2018

2018

[32] [32]

Models for exceedances over high thresholds.Journal of the Royal Statistical Society Series B: Statistical Methodology, 52(3):393–425, 1990

Anthony C Davison and Richard L Smith. Models for exceedances over high thresholds.Journal of the Royal Statistical Society Series B: Statistical Methodology, 52(3):393–425, 1990

1990

[33] [33]

Time series shapelets: a new primitive for data mining

Lexiang Ye and Eamonn Keogh. Time series shapelets: a new primitive for data mining. In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 947–956, 2009

2009

[34] [34]

Denoising diffusion probabilistic models.Advances in neural information processing systems, 33:6840–6851, 2020

Jonathan Ho, Ajay Jain, and Pieter Abbeel. Denoising diffusion probabilistic models.Advances in neural information processing systems, 33:6840–6851, 2020

2020

[35] [35]

Springer, 2009

Cédric Villani et al.Optimal transport: old and new, volume 338. Springer, 2009

2009

[36] [36]

On the estimation of the discrepancy between empirical curves of distribution for two independent samples.Bull

Nikolai V Smirnov et al. On the estimation of the discrepancy between empirical curves of distribution for two independent samples.Bull. Math. Univ. Moscou, 2(2):3–14, 1939

1939

[37] [37]

Divergence measures based on the shannon entropy.IEEE Transactions on Information theory, 37(1):145–151, 2002

Jianhua Lin. Divergence measures based on the shannon entropy.IEEE Transactions on Information theory, 37(1):145–151, 2002

2002

[38] [38]

A kernel two-sample test.The journal of machine learning research, 13(1):723–773, 2012

Arthur Gretton, Karsten M Borgwardt, Malte J Rasch, Bernhard Schölkopf, and Alexander Smola. A kernel two-sample test.The journal of machine learning research, 13(1):723–773, 2012

2012

[39] [39]

Data series similarity using correlation-aware measures

Katsiaryna Mirylenka, Michele Dallachiesa, and Themis Palpanas. Data series similarity using correlation-aware measures. InProceedings of the 29th International Conference on Scientific and Statistical Database Management, pages 1–12, 2017

2017

[40] [40]

Tsgbench: Time series generation benchmark.Proceedings of the VLDB Endowment, 17(3):305–318, 2023

Yihao Ang, Qiang Huang, Yifan Bao, Anthony KH Tung, and Zhiyong Huang. Tsgbench: Time series generation benchmark.Proceedings of the VLDB Endowment, 17(3):305–318, 2023

2023

[41] [41]

Psa-gan: Progressive self attention gans for synthetic time series

Paul Jeha, Michael Bohlke-Schneider, Pedro Mercado, Shubham Kapoor, Rajbir Singh Nirwan, Valentin Flunkert, Jan Gasthaus, and Tim Januschowski. Psa-gan: Progressive self attention gans for synthetic time series. InThe tenth international conference on learning representations, 2022

2022

[42] [42]

LSTM-based Encoder-Decoder for Multi-sensor Anomaly Detection

Pankaj Malhotra, Anusha Ramakrishnan, Gaurangi Anand, Lovekesh Vig, Puneet Agarwal, and Gautam Shroff. Lstm-based encoder-decoder for multi-sensor anomaly detection.arXiv preprint arXiv:1607.00148, 2016

work page internal anchor Pith review Pith/arXiv arXiv 2016

[43] [43]

C-rnn-gan: Continuous recurrent neural networks with adversarial training, 2016

Olof Mogren. C-rnn-gan: Continuous recurrent neural networks with adversarial training, 2016

2016

[44] [44]

Hyland, and Gunnar Rätsch

Cristóbal Esteban, Stephanie L. Hyland, and Gunnar Rätsch. Real-valued (medical) time series generation with recurrent conditional gans, 2017

2017

[45] [45]

A recurrent latent variable model for sequential data.arXiv, 2015

Junyoung Chung, Kyle Kastner, Laurent Dinh, Kratarth Goel, Aaron Courville, and Yoshua Bengio. A recurrent latent variable model for sequential data.arXiv, 2015

2015

[46] [46]

Diffwave: A versatile diffusion model for audio synthesis.arXiv, 2020

Zhifeng Kong, Wei Ping, Jiaji Huang, Kexin Zhao, and Bryan Catanzaro. Diffwave: A versatile diffusion model for audio synthesis.arXiv, 2020

2020

[47] [47]

Kashif Rasul, Calvin Seward, Ingmar Schuster, and Roland V ollgraf. Autoregressive denoising diffusion models for multivariate probabilistic time series forecasting.Proceedings of the 38th International Conference on Machine Learning, PMLR 139:8857-8868, 2021, 2021. 12

2021

[48] [48]

Csdi: Conditional score-based diffusion models for probabilistic time series imputation.arXiv, 2021

Yusuke Tashiro, Jiaming Song, Yang Song, and Stefano Ermon. Csdi: Conditional score-based diffusion models for probabilistic time series imputation.arXiv, 2021

2021

[49] [49]

Diffusion-ts: Interpretable diffusion for general time series genera- tion.arXiv, 2024

Xinyu Yuan and Yan Qiao. Diffusion-ts: Interpretable diffusion for general time series genera- tion.arXiv, 2024

2024

[50] [50]

Hao Xue and Flora D. Salim. Promptcast: A new prompt-based learning paradigm for time series forecasting.IEEE Transactions on Knowledge and Data Engineering, 2023

2023

[51] [51]

Hua, R., Liu, Z., Zhang, K., and Yang, Y

Nate Gruver, Marc Finzi, Shikai Qiu, and Andrew Gordon Wilson. Large language models are zero-shot time series forecasters.arXiv preprint arXiv:2310.07820, 2023

work page arXiv 2023

[52] [52]

One fits all: Power general time series analysis by pretrained lm

Tian Zhou, Peisong Niu, Xue Wang, Liang Sun, and Rong Jin. One fits all: Power general time series analysis by pretrained lm. InAdvances in Neural Information Processing Systems, volume 36, 2023

2023

[53] [53]

Time-LLM: Time series forecasting by reprogramming large language models

Ming Jin, Shiyu Wang, Lintao Ma, Zhixuan Chu, James Y Zhang, Xiaoming Shi, Pin-Yu Chen, Yuxuan Liang, Yuan-Fang Li, Shirui Pan, and Qingsong Wen. Time-LLM: Time series forecasting by reprogramming large language models. InInternational Conference on Learning Representations (ICLR), 2024

2024

[54] [54]

Maddix, and Yuyang Wang

Abdul Fatir Ansari, Lorenzo Stella, Caner Turkmen, Xiyuan Zhang, Pedro Mercado, Huibin Shen, Oleksandr Shchur, Syama Sundar Rangapuram, Sebastian Pineda Arango, Shubham Kapoor, Jasper Zschiegner, Danielle C. Maddix, and Yuyang Wang. Chronos: Learning the language of time series.arXiv, 2024

2024

[55] [55]

Heavy-tailed diffusion with denoising levy probabilistic models

Dario Shariatian, Umut Simsekli, and Alain Durmus. Heavy-tailed diffusion with denoising levy probabilistic models. InInternational Conference on Learning Representations, 2025

2025

[56] [56]

Exgan: Adversarial generation of extreme samples

Siddharth Bhatia, Arjit Jain, and Bryan Hooi. Exgan: Adversarial generation of extreme samples. InProceedings of the AAAI Conference on Artificial Intelligence, volume 35, pages 6841–6849, 2021

2021

[57] [57]

Pareto gan: Extending the representational power of gans to heavy-tailed distributions

Todd Huster, Jeremy Cohen, Zinan Lin, Kevin Chan, Charles Kamhoua, Nandi O Leslie, Cho- Yu Jason Chiang, and Vyas Sekar. Pareto gan: Extending the representational power of gans to heavy-tailed distributions. InInternational Conference on Machine Learning, pages 4523–4532. PMLR, 2021

2021

[58] [58]

Springer, 2006

Laurens De Haan and Ana Ferreira.Extreme value theory: an introduction. Springer, 2006

2006

[59] [59]

Oreshkin, Dmitri Carpov, Nicolas Chapados, and Yoshua Bengio

Boris N. Oreshkin, Dmitri Carpov, Nicolas Chapados, and Yoshua Bengio. N-beats: Neural basis expansion analysis for interpretable time series forecasting. InThe Eighth International Conference on Learning Representations (ICLR), 2020

2020

[60] [60]

An algorithm for the machine calculation of complex fourier series.Mathematics of computation, 19(90):297–301, 1965

James W Cooley and John W Tukey. An algorithm for the machine calculation of complex fourier series.Mathematics of computation, 19(90):297–301, 1965

1965

[61] [61]

Adding conditional control to text-to-image diffusion models

Lvmin Zhang, Anyi Rao, and Maneesh Agrawala. Adding conditional control to text-to-image diffusion models. InProceedings of the IEEE/CVF international conference on computer vision, pages 3836–3847, 2023

2023

[62] [62]

Hourly weather data [dataset], 2022

Custom Weather. Hourly weather data [dataset], 2022. Dewey Data. https://doi.org/10. 82551/YX6H-K352

2022

[63] [63]

Daily weather data [dataset]], 2022

Custom Weather. Daily weather data [dataset]], 2022. Dewey Data. https://doi.org/10. 82551/VBWQ-AQ20

2022

[64] [64]

Moody, Michele Emdin, Gorazd Antolic, Roman Dorn, Ales Smrdel, Carlo Marchesi, and Roger G

Franc Jager, Alessandro Taddei, George B. Moody, Michele Emdin, Gorazd Antolic, Roman Dorn, Ales Smrdel, Carlo Marchesi, and Roger G. Mark. Long-term ST database: a reference for the development and evaluation of automated ischaemia detectors and for the study of the dynamics of myocardial ischaemia.Medical & Biological Engineering & Computing, 41(2):172–...

2003

[65] [65]

Physiobank, physiotoolkit, and physionet: components of a new research resource for complex physiologic signals.circulation, 101(23):e215–e220, 2000

Ary L Goldberger, Luis AN Amaral, Leon Glass, Jeffrey M Hausdorff, Plamen Ch Ivanov, Roger G Mark, Joseph E Mietus, George B Moody, Chung-Kang Peng, and H Eugene Stanley. Physiobank, physiotoolkit, and physionet: components of a new research resource for complex physiologic signals.circulation, 101(23):e215–e220, 2000

2000

[66] [66]

Individual household electric power consumption [dataset],

Georges Hebrail and Alice Berard. Individual household electric power consumption [dataset],

[67] [67]

https://archive.ics.uci.edu/ml/datasets/ individual+household+electric+power+consumption

UCI Machine Learning Repository. https://archive.ics.uci.edu/ml/datasets/ individual+household+electric+power+consumption

[68] [68]

Pems-sf [dataset], 2011

Marco Cuturi. Pems-sf [dataset], 2011. UCI Machine Learning Repository. https://archive. ics.uci.edu/ml/datasets/pems-sf

2011

[69] [69]

Performance measurement system (pems), 2026

California Department of Transportation. Performance measurement system (pems), 2026. https://dot.ca.gov/programs/traffic-operations/mpr/pems-source

2026

[70] [70]

Ts2vec: Towards universal representation of time series

Zhihan Yue, Yujing Wang, Juanyong Duan, Tianmeng Yang, Congrui Huang, Yunhai Tong, and Bixiong Xu. Ts2vec: Towards universal representation of time series. InProceedings of the AAAI conference on artificial intelligence, volume 36, pages 8980–8987, 2022. 14 A Reproducibility Statement To improve reproducibility, we release an anonymous GitHub repository a...

work page arXiv 2022