Time-LLM: Time Series Forecasting by Reprogramming Large Language Models

Ming Jin, Shiyu Wang, Lintao Ma, Zhixuan Chu, James Y. Zhang, Xiaoming Shi · 2023 · cs.LG · arXiv 2310.01728

Mixed citation behavior. Most common role is background (67%).

38 Pith papers citing it

Background 67% of classified citations

abstract

Time series forecasting holds significant importance in many real-world dynamic systems and has been extensively studied. Unlike natural language processing (NLP) and computer vision (CV), where a single large model can tackle multiple tasks, models for time series forecasting are often specialized, necessitating distinct designs for different tasks and applications. While pre-trained foundation models have made impressive strides in NLP and CV, their development in time series domains has been constrained by data sparsity. Recent studies have revealed that large language models (LLMs) possess robust pattern recognition and reasoning abilities over complex sequences of tokens. However, the challenge remains in effectively aligning the modalities of time series data and natural language to leverage these capabilities. In this work, we present Time-LLM, a reprogramming framework to repurpose LLMs for general time series forecasting with the backbone language models kept intact. We begin by reprogramming the input time series with text prototypes before feeding it into the frozen LLM to align the two modalities. To augment the LLM's ability to reason with time series data, we propose Prompt-as-Prefix (PaP), which enriches the input context and directs the transformation of reprogrammed input patches. The transformed time series patches from the LLM are finally projected to obtain the forecasts. Our comprehensive evaluations demonstrate that Time-LLM is a powerful time series learner that outperforms state-of-the-art, specialized forecasting models. Moreover, Time-LLM excels in both few-shot and zero-shot learning scenarios.
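The pipeline the abstract describes (patch the series, reprogram patches with text prototypes, pass them through a frozen LLM, project to a forecast) can be illustrated with a small sketch. This is one plausible reading of the abstract, not the authors' implementation: single-head cross-attention stands in for the reprogramming step, the prototypes and all weights are random, `frozen_backbone` is a hypothetical identity placeholder for the frozen LLM, and the Prompt-as-Prefix step is omitted. All function names, shapes, and hyperparameters are assumptions made for illustration.

```python
import math
import random


def make_patches(series, patch_len, stride):
    """Split a 1-D series into (possibly overlapping) fixed-length patches."""
    return [series[i:i + patch_len]
            for i in range(0, len(series) - patch_len + 1, stride)]


def matvec(matrix, vec):
    """Multiply a (rows x cols) matrix by a length-cols vector."""
    return [sum(w * x for w, x in zip(row, vec)) for row in matrix]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def reprogram(patches, prototypes, w_query):
    """Express each patch as a convex combination of text prototypes.

    Assumed simplification: single-head cross-attention in which the patch
    supplies the query and the prototypes serve as both keys and values.
    """
    d_model = len(prototypes[0])
    tokens = []
    for patch in patches:
        query = matvec(w_query, patch)
        scores = [sum(q * p for q, p in zip(query, proto)) / math.sqrt(d_model)
                  for proto in prototypes]
        weights = softmax(scores)
        tokens.append([sum(w * proto[j] for w, proto in zip(weights, prototypes))
                       for j in range(d_model)])
    return tokens


def frozen_backbone(tokens):
    """Hypothetical stand-in for the frozen LLM: returns its input unchanged."""
    return tokens


# Toy setup: every size below is an illustrative assumption.
rng = random.Random(0)
patch_len, stride, d_model, n_protos, horizon = 8, 4, 6, 5, 4
series = [math.sin(0.3 * t) for t in range(32)]

patches = make_patches(series, patch_len, stride)
prototypes = [[rng.gauss(0, 1) for _ in range(d_model)] for _ in range(n_protos)]
w_query = [[rng.gauss(0, 1) for _ in range(patch_len)] for _ in range(d_model)]

tokens = reprogram(patches, prototypes, w_query)
hidden = frozen_backbone(tokens)

# Output projection: flatten the patch representations, map to the horizon.
flat = [x for token in hidden for x in token]
w_out = [[rng.gauss(0, 0.1) for _ in range(len(flat))] for _ in range(horizon)]
forecast = matvec(w_out, flat)
print(len(patches), len(tokens[0]), len(forecast))
```

In this sketch only `w_query` and `w_out` would be trainable, which mirrors the abstract's point that the backbone language model is kept intact while the input and output mappings are learned.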

citation-role summary

background: 4 · method: 1 · other: 1

citation-polarity summary

background: 4 · unclear: 1 · use method: 1

representative citing papers

CausalPOI: Spatio-Temporal Graph-Based Causal Modeling for Cold-Start POI Check-in Forecasting

cs.LG · 2026-06-03 · unverdicted · novelty 7.0

CausalPOI proposes a spatio-temporal graph causal learning method for cold-start POI check-in forecasting that builds functional interaction graphs and treatment-control pairs to outperform baselines on SafeGraph data.

Bridging the Last Mile of Time Series Forecasting with LLM Agents

cs.AI · 2026-06-01 · unverdicted · novelty 7.0

An LLM-agent system is proposed to revise statistical time series forecasts with weakly structured business context via tool use and constrained reasoning actions.

Olivia: Harmonizing Time Series Foundation Models with Power Spectral Density

cs.LG · 2026-05-17 · unverdicted · novelty 7.0

Olivia harmonizes time series datasets via normalized power spectral density using a Harmonizer module and resonator-based HarmonicAttention, achieving state-of-the-art zero-shot, few-shot, and full-shot forecasting on TSLib, GIFT-Eval, and GluonTS benchmarks.

What if Tomorrow is the World Cup Final? Counterfactual Time Series Forecasting with Textual Conditions

cs.LG · 2026-05-14 · unverdicted · novelty 7.0

Introduces the task of counterfactual time series forecasting with textual conditions plus a text-attribution mechanism that improves accuracy by distinguishing mutable from immutable factors.

TailedTS: Benchmark Dataset for Heavy-Tailed Time Series Prediction and Periodicity Quantification

cs.LG · 2026-05-09 · unverdicted · novelty 7.0

TailedTS supplies 24.69 billion Wikipedia page-view records as a public benchmark for heavy-tailed time series forecasting and periodicity analysis, revealing weaker periodic structure in high-traffic pages.

LLaTiSA: Towards Difficulty-Stratified Time Series Reasoning from Visual Perception to Semantics

cs.AI · 2026-04-19 · unverdicted · novelty 7.0

LLaTiSA is a vision-language model trained on a new 83k-sample hierarchical time series reasoning dataset that shows superior performance and out-of-distribution generalization on stratified TSR tasks.

TimeSeriesExamAgent: Creating Time Series Reasoning Benchmarks at Scale

cs.AI · 2026-04-11 · conditional · novelty 7.0

TimeSeriesExamAgent combines templates and LLM agents to generate scalable time series reasoning benchmarks, demonstrating that current LLMs have limited performance on both abstract and domain-specific tasks.

Discrete Prototypical Memories for Federated Time Series Foundation Models

cs.LG · 2026-04-06 · unverdicted · novelty 7.0

FeDPM learns and aligns local discrete prototypical memories across domains to create a unified discrete latent space for LLM-based time series foundation models in a federated setting.

Overcoming the Modality Gap in Context-Aided Forecasting

cs.LG · 2026-03-12 · unverdicted · novelty 7.0

A semi-synthetic augmentation creates the CAF-7M dataset and demonstrates that improved context data enables multimodal models to outperform unimodal baselines in context-aided forecasting.

Is Flow Matching Just Trajectory Replay for Sequential Data?

stat.ML · 2026-02-09 · unverdicted · novelty 7.0

Flow matching on time series targets a closed-form nonparametric velocity field that is a similarity-weighted mixture of observed transition velocities, making neural models approximations to an ideal memory-augmented dynamical system sampler.

TSVer: A Benchmark for Fact Verification Against Time-Series Evidence

cs.CL · 2025-11-02 · unverdicted · novelty 7.0

TSVer is a new benchmark dataset for fact verification against time-series evidence, with 304 annotated real-world claims, 400 time series, verdicts, and justifications, plus baseline results showing current models struggle.

TS-Reasoner: Domain-Oriented Time Series Inference Agents for Reasoning and Automated Analysis

cs.LG · 2024-10-05 · unverdicted · novelty 7.0

TS-Reasoner is a domain-oriented agent using LLMs, computational tools, and error feedback for multi-step time series inference, showing better performance than general LLMs on understanding and reasoning benchmarks.

Deep Time Series Models: A Comprehensive Survey and Benchmark

cs.LG · 2024-07-18 · unverdicted · novelty 7.0

This survey and benchmark of deep time series models using the released TSLib library finds that models with specific structures perform well only on distinct analysis tasks.

EpiEvolve: Self-Evolving Agents for Streaming Pandemic Forecasting under Regime Shifts

cs.AI · 2026-06-03 · unverdicted · novelty 6.0

EpiEvolve achieves 0.629 accuracy in streaming COVID-19 forecasting by using episodic memory, reflection on delayed labels, and regime-aware retrieval, outperforming static LLMs (0.561) and CDC ensembles (0.325) while halving recovery lag after regime shifts.

MULTISEISMO: A Multimodal Seismic Dataset and Model for Cross-Modal Seismic Understanding

cs.LG · 2026-05-25 · unverdicted · novelty 6.0

MultiSeismo is a new multimodal seismic dataset with 16K events and SeisModal is a domain-adapted model that outperforms general multimodal models on seismic reasoning tasks.

TimeSRL: Generalizable Time-Series Behavioral Modeling via Semantic RL-Tuned LLMs -- A Case Study in Mental Health

cs.LG · 2026-05-20 · unverdicted · novelty 6.0

TimeSRL uses semantic abstractions from time-series data optimized via reinforcement learning to achieve better cross-dataset generalization than standard ML or LLM baselines in mental health prediction.

Agent-Based Post-Hoc Correction of Agricultural Yield Forecasts

cs.LG · 2026-05-12 · unverdicted · novelty 6.0

Structured LLM agents correct agricultural yield forecasts from models like XGBoost, cutting MAE by 20-28% and MASE by up to 66% on strawberry and corn datasets.

GazeMind: A Gaze-Guided LLM Agent for Personalized Cognitive Load Assessment

cs.HC · 2026-05-07 · unverdicted · novelty 6.0

GazeMind encodes gaze data for LLM reasoning to deliver interpretable, personalized cognitive load predictions that generalize across tasks without fine-tuning and outperform baselines by over 20% on a new 152-person dataset.

Exploring the Potential of Probabilistic Transformer for Time Series Modeling: A Report on the ST-PT Framework

cs.LG · 2026-04-29 · unverdicted · novelty 6.0

ST-PT turns transformers into explicit factor graphs for time series, enabling structural injection of symbolic priors, per-sample conditional generation, and principled latent autoregressive forecasting via MFVI iterations.

CAARL: In-Context Learning for Interpretable Co-Evolving Time Series Forecasting

cs.LG · 2026-04-20 · unverdicted · novelty 6.0

CAARL decomposes co-evolving time series into autoregressive segments, builds a temporal dependency graph, serializes it into a narrative, and uses LLMs for interpretable forecasting via chain-of-thought reasoning.

Semantic Communication with an LLM-enabled Knowledge Base

eess.SP · 2026-04-07 · unverdicted · novelty 6.0

SC-LMKB uses LLM-generated data with cross-domain fusion to cut hallucinations and delivers up to 72.6% gains on cross-modality retrieval tasks over standard semantic communication.

Uncertainty-Aware Foundation Models for Clinical Data

cs.LG · 2026-04-05 · unverdicted · novelty 6.0

The work introduces uncertainty-aware foundation models for clinical data by learning set-valued patient representations that enforce consistency across partial observations and integrate multimodal self-supervised objectives.

AlphaCast: A Human Wisdom-LLM Intelligence Co-Reasoning Framework for Interactive Time Series Forecasting

cs.AI · 2025-11-12 · conditional · novelty 6.0

AlphaCast is a training-free LLM framework that performs interactive multi-stage reasoning for time series forecasting by integrating feature extraction, knowledge bases, case libraries, and contextual pools.

MAP4TS: A Multi-Aspect Prompting Framework for Time-Series Forecasting with Large Language Models

cs.CL · 2025-10-27 · unverdicted · novelty 6.0

MAP4TS combines global, local, statistical, and temporal prompts derived from classical time-series analysis with raw embeddings via cross-modality alignment to improve LLM forecasting performance across eight datasets.

citing papers explorer

Showing 27 of 27 citing papers after filters.

CausalPOI: Spatio-Temporal Graph-Based Causal Modeling for Cold-Start POI Check-in Forecasting

cs.LG · 2026-06-03 · unverdicted · none · ref 8 · internal anchor

CausalPOI proposes a spatio-temporal graph causal learning method for cold-start POI check-in forecasting that builds functional interaction graphs and treatment-control pairs to outperform baselines on SafeGraph data.

Bridging the Last Mile of Time Series Forecasting with LLM Agents

cs.AI · 2026-06-01 · unverdicted · none · ref 1 · internal anchor

An LLM-agent system is proposed to revise statistical time series forecasts with weakly structured business context via tool use and constrained reasoning actions.

Olivia: Harmonizing Time Series Foundation Models with Power Spectral Density

cs.LG · 2026-05-17 · unverdicted · none · ref 8 · internal anchor

Olivia harmonizes time series datasets via normalized power spectral density using a Harmonizer module and resonator-based HarmonicAttention, achieving state-of-the-art zero-shot, few-shot, and full-shot forecasting on TSLib, GIFT-Eval, and GluonTS benchmarks.

What if Tomorrow is the World Cup Final? Counterfactual Time Series Forecasting with Textual Conditions

cs.LG · 2026-05-14 · unverdicted · none · ref 21 · internal anchor

Introduces the task of counterfactual time series forecasting with textual conditions plus a text-attribution mechanism that improves accuracy by distinguishing mutable from immutable factors.

TailedTS: Benchmark Dataset for Heavy-Tailed Time Series Prediction and Periodicity Quantification

cs.LG · 2026-05-09 · unverdicted · none · ref 22 · internal anchor

TailedTS supplies 24.69 billion Wikipedia page-view records as a public benchmark for heavy-tailed time series forecasting and periodicity analysis, revealing weaker periodic structure in high-traffic pages.

LLaTiSA: Towards Difficulty-Stratified Time Series Reasoning from Visual Perception to Semantics

cs.AI · 2026-04-19 · unverdicted · none · ref 4 · internal anchor

LLaTiSA is a vision-language model trained on a new 83k-sample hierarchical time series reasoning dataset that shows superior performance and out-of-distribution generalization on stratified TSR tasks.

TimeSeriesExamAgent: Creating Time Series Reasoning Benchmarks at Scale

cs.AI · 2026-04-11 · conditional · none · ref 19 · internal anchor

TimeSeriesExamAgent combines templates and LLM agents to generate scalable time series reasoning benchmarks, demonstrating that current LLMs have limited performance on both abstract and domain-specific tasks.

Discrete Prototypical Memories for Federated Time Series Foundation Models

cs.LG · 2026-04-06 · unverdicted · none · ref 11 · internal anchor

FeDPM learns and aligns local discrete prototypical memories across domains to create a unified discrete latent space for LLM-based time series foundation models in a federated setting.

Overcoming the Modality Gap in Context-Aided Forecasting

cs.LG · 2026-03-12 · unverdicted · none · ref 4 · internal anchor

A semi-synthetic augmentation creates the CAF-7M dataset and demonstrates that improved context data enables multimodal models to outperform unimodal baselines in context-aided forecasting.

Is Flow Matching Just Trajectory Replay for Sequential Data?

stat.ML · 2026-02-09 · unverdicted · none · ref 52 · internal anchor

Flow matching on time series targets a closed-form nonparametric velocity field that is a similarity-weighted mixture of observed transition velocities, making neural models approximations to an ideal memory-augmented dynamical system sampler.

EpiEvolve: Self-Evolving Agents for Streaming Pandemic Forecasting under Regime Shifts

cs.AI · 2026-06-03 · unverdicted · none · ref 6 · internal anchor

EpiEvolve achieves 0.629 accuracy in streaming COVID-19 forecasting by using episodic memory, reflection on delayed labels, and regime-aware retrieval, outperforming static LLMs (0.561) and CDC ensembles (0.325) while halving recovery lag after regime shifts.

MULTISEISMO: A Multimodal Seismic Dataset and Model for Cross-Modal Seismic Understanding

cs.LG · 2026-05-25 · unverdicted · none · ref 20 · internal anchor

MultiSeismo is a new multimodal seismic dataset with 16K events and SeisModal is a domain-adapted model that outperforms general multimodal models on seismic reasoning tasks.

TimeSRL: Generalizable Time-Series Behavioral Modeling via Semantic RL-Tuned LLMs -- A Case Study in Mental Health

cs.LG · 2026-05-20 · unverdicted · none · ref 23 · internal anchor

TimeSRL uses semantic abstractions from time-series data optimized via reinforcement learning to achieve better cross-dataset generalization than standard ML or LLM baselines in mental health prediction.

Agent-Based Post-Hoc Correction of Agricultural Yield Forecasts

cs.LG · 2026-05-12 · unverdicted · none · ref 14 · internal anchor

Structured LLM agents correct agricultural yield forecasts from models like XGBoost, cutting MAE by 20-28% and MASE by up to 66% on strawberry and corn datasets.

GazeMind: A Gaze-Guided LLM Agent for Personalized Cognitive Load Assessment

cs.HC · 2026-05-07 · unverdicted · none · ref 21 · internal anchor

GazeMind encodes gaze data for LLM reasoning to deliver interpretable, personalized cognitive load predictions that generalize across tasks without fine-tuning and outperform baselines by over 20% on a new 152-person dataset.

Exploring the Potential of Probabilistic Transformer for Time Series Modeling: A Report on the ST-PT Framework

cs.LG · 2026-04-29 · unverdicted · none · ref 34 · internal anchor

ST-PT turns transformers into explicit factor graphs for time series, enabling structural injection of symbolic priors, per-sample conditional generation, and principled latent autoregressive forecasting via MFVI iterations.

CAARL: In-Context Learning for Interpretable Co-Evolving Time Series Forecasting

cs.LG · 2026-04-20 · unverdicted · none · ref 12 · internal anchor

CAARL decomposes co-evolving time series into autoregressive segments, builds a temporal dependency graph, serializes it into a narrative, and uses LLMs for interpretable forecasting via chain-of-thought reasoning.

Semantic Communication with an LLM-enabled Knowledge Base

eess.SP · 2026-04-07 · unverdicted · none · ref 34 · internal anchor

SC-LMKB uses LLM-generated data with cross-domain fusion to cut hallucinations and delivers up to 72.6% gains on cross-modality retrieval tasks over standard semantic communication.

Uncertainty-Aware Foundation Models for Clinical Data

cs.LG · 2026-04-05 · unverdicted · none · ref 51 · internal anchor

The work introduces uncertainty-aware foundation models for clinical data by learning set-valued patient representations that enforce consistency across partial observations and integrate multimodal self-supervised objectives.

Continuity and Ordinality Matter: Constraining Time Series Tokens for Effective Time Series Analysis with Large Language Models

cs.LG · 2026-05-22 · unverdicted · none · ref 4 · internal anchor

COM integrates geometric constraints into token initialization and training to preserve continuity and ordinality in time series tokens, improving token-based TS-LLM performance on benchmarks.

Reasoning through Verifiable Forecast Actions: Consistency-Grounded RL for Financial LLMs

cs.LG · 2026-05-21 · unverdicted · none · ref 30 · internal anchor

StockR1 unifies LLM-based financial reasoning and time-series forecasting by emitting verifiable forecast actions that condition a decoder, optimized via consistency-grounded RL to improve accuracy on QA and prediction tasks.

Teaching Large Language Models When Not to Know: Learning Temporal Critique for Ex-Ante Reasoning

cs.AI · 2026-05-14 · unverdicted · none · ref 23 · internal anchor

TCFT trains LLMs on temporal critique tasks to reduce post-cutoff knowledge leakage by 37-42 percentage points over prompting and standard SFT on Qwen models.

Heterogeneous Scientific Foundation Model Collaboration

cs.AI · 2026-04-30 · unverdicted · none · ref 73 · internal anchor

Eywa enables language-based agentic AI systems to collaborate with specialized scientific foundation models for improved performance on structured data tasks.

Frozen LLMs as Map-Aware Spatio-Temporal Reasoners for Vehicle Trajectory Prediction

cs.CV · 2026-04-23 · unverdicted · none · ref 14 · internal anchor

A framework encodes observed trajectories and HD maps into tokens for frozen LLMs to perform spatio-temporal reasoning and predict future vehicle paths with a linear decoder.

A Review of Large Language Models for Stock Price Forecasting from a Hedge-Fund Perspective

q-fin.PR · 2026-04-10 · unverdicted · none · ref 41 · internal anchor

This review synthesizes LLM uses in stock forecasting and catalogs key practical pitfalls from a hedge-fund viewpoint.

TS-Haystack: A Multi-Task Retrieval Benchmark for Long-Context Time-Series Reasoning

cs.LG · 2026-02-15 · unreviewed · ref 4 · internal anchor

Probabilistic NDVI Forecasting from Sparse Satellite Time Series and Weather Covariates

cs.LG · 2026-02-04 · unreviewed · ref 33 · internal anchor
