The authors create VisAnomBench with VLM-generated anomaly explanations and fine-tune VisAnomReasoner, reporting precision and F1 gains of at least 21 and 23 points on the new benchmark plus cross-benchmark improvements.
Time-RA: Towards Time Series Reasoning for Anomaly Diagnosis with LLM Feedback
1 Pith paper cite this work. Polarity classification is still indexing.
abstract
Time series anomaly detection (TSAD) has traditionally focused on binary classification and often lacks the fine-grained categorization and explanatory reasoning required for transparent decision-making. To address these limitations, we propose Time-series Reasoning for Anomaly (Time-RA), a novel task that reformulates TSAD from a discriminative into a generative, reasoning-intensive paradigm. To facilitate this, we introduce RATs40K, the first real-world large-scale multimodal benchmark with ~40,000 samples across 10 domains, integrating raw time series, textual context, and visual plots with structured reasoning annotations. Extensive benchmarking shows that while supervised fine-tuning and visual representations boost diagnostic accuracy and reasoning consistency, performance varies across complex scenarios. Notably, fine-tuned models demonstrate strong "plug-and-play" transferability, outperforming traditional baselines on unseen real-world datasets. Our work establishes a foundation for interpretable, multimodal time series analysis. All code (https://github.com/yyysjz1997/Time-RA) and the RATs40K dataset (https://huggingface.co/datasets/Time-RA/RATs40K) are fully open-sourced to facilitate future research.
fields
cs.AI 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
Tiny but Trusted: Efficient Vision-Language Reasoning for Time-Series Anomaly Detection
The authors create VisAnomBench with VLM-generated anomaly explanations and fine-tune VisAnomReasoner, reporting precision and F1 gains of at least 21 and 23 points on the new benchmark plus cross-benchmark improvements.