OpenFinGym is a multi-task verifiable gym environment for quant-finance agents with automated task construction from publications, containerised runtime, paper trading engine, and support for SFT/RL training.
hub
Real-valued (Medical) Time Series Generation with Recurrent Conditional GANs
22 Pith papers cite this work. Polarity classification is still indexing.
abstract
Generative Adversarial Networks (GANs) have shown remarkable success as a framework for training models to produce realistic-looking data. In this work, we propose a Recurrent GAN (RGAN) and Recurrent Conditional GAN (RCGAN) to produce realistic real-valued multi-dimensional time series, with an emphasis on their application to medical data. RGANs make use of recurrent neural networks in the generator and the discriminator. In the case of RCGANs, both of these RNNs are conditioned on auxiliary information. We demonstrate our models in a set of toy datasets, where we show visually and quantitatively (using sample likelihood and maximum mean discrepancy) that they can successfully generate realistic time-series. We also describe novel evaluation methods for GANs, where we generate a synthetic labelled training dataset, and evaluate on a real test set the performance of a model trained on the synthetic data, and vice-versa. We illustrate with these metrics that RCGANs can generate time-series data useful for supervised training, with only minor degradation in performance on real test data. This is demonstrated on digit classification from 'serialised' MNIST and by training an early warning system on a medical dataset of 17,000 patients from an intensive care unit. We further discuss and analyse the privacy concerns that may arise when using RCGANs to generate realistic synthetic medical time series data.
hub tools
citation-role summary
citation-polarity summary
verdicts
UNVERDICTED 22roles
background 1polarities
background 1representative citing papers
Defines outcome-conformant synthesis as exact closed-form generation of relational data matching declared aggregates via Gamma conditional-sum sampling, introduces SpecBench for measuring conformance, and shows it is orthogonal to fidelity.
CaTSG is a unified diffusion model for causal time series generation that handles observational, interventional, and counterfactual tasks via backdoor adjustment and abduction-action-prediction.
EMGFlow is the first application of flow matching to synthesize sEMG data, outperforming GAN and diffusion baselines in fidelity, distributional metrics, and downstream gesture recognition utility under TSTR evaluation.
Introduces SOCK (SOft Competing Kernels), a differentiable random convolutional feature map, to train generative models of financial time series via feature matching and shows outperformance over signature and diffusion baselines on small-sample datasets.
ReGeN decomposes references into periodic, stochastic, and causal components to generate synthetic multivariate time series that preserve domain structure and support improved forecasting in low-data settings.
FedEHR-Gen is a federated two-stage autoencoder plus TCVAE system that aligns latent spaces via layer-wise matching and uses distribution-aware aggregation to produce synthetic EHR time-series data matching centralized performance on eICU and MIMIC-III.
PrismFlow augments flow matching with residual dynamical experts and a winner-take-all objective to reduce spectral distortion and improve mode coverage in time-series generation.
GenTS is a modular benchmark library providing unified data pipelines, generative models, and evaluation metrics for time series synthesis, forecasting, and imputation, with open-source code and initial benchmarking experiments.
TabSCM produces causally consistent tabular data by orienting a CPDAG into a DAG, fitting root marginals with KDE, and using conditional diffusion plus trees for child nodes, outperforming GANs and diffusion baselines on fidelity, utility, and privacy across seven datasets.
A new framework evaluates utility of synthetic mobility trajectories while a membership inference attack reveals privacy vulnerabilities in generative models thought to be safe.
Fed-CausalDiff proposes decoupled synchronization in a federated causal diffusion model to improve do-simulation and policy-value estimation across heterogeneous decentralized datasets.
UPLOTS proposes a unified prompt-guided pretrained transformer for generating constrained time-series data across diverse domains using dynamic multi-dataset loss re-weighting.
Introduces a sequential forward-backward diffusion framework that generates adapted time series by conditioning on prior history, with a parallelizable score-matching objective and statistical guarantees for ReLU networks.
AugMask is a plug-and-play training framework that lets diffusion models on incomplete tabular data use stochastic augmentation for conditioning and observed-only supervision, outperforming missing-aware baselines via a Rao-Blackwellized objective.
MOSAIC structures LLM-based model selection via memory-grounded blueprints and failure-aware RL, reporting gains in performance and traceability on financial time-series tasks over AutoML and agent baselines.
Hybrid CoMeTS-GAN plus diffusion model generates multivariate financial time series claimed to better reproduce stylized facts and inter-asset correlations than prior generative methods.
MSDformer introduces a multi-scale discrete transformer that tokenizes time series at multiple scales and models them autoregressively in discrete space, claiming superior performance over prior DTM methods with rate-distortion theoretical support.
Off-the-shelf image diffusion models can be repurposed to create synthetic structured data capable of inducing ground truth drift in machine pipelines.
An MCMC framework enforces empirical transition laws on GAN outputs to reduce temporal drift in synthetic multivariate time series.
TriHead-GAN is a GAN framework whose triple-head discriminator supervises distributional authenticity, cross-variable dependency via regression, and temporal smoothness via adjacent-difference prediction for carbon emission time series.
CGANs with LSTM generator can produce synthetic crypto price series that reproduce temporal patterns and preserve market trends and dynamics.
citing papers explorer
-
A Dual Perspective on Synthetic Trajectory Generators: Utility Framework and Privacy Vulnerabilities
A new framework evaluates utility of synthetic mobility trajectories while a membership inference attack reveals privacy vulnerabilities in generative models thought to be safe.