TFRBench is a new benchmark and multi-agent synthesis method that generates reasoning traces for time-series forecasting and shows these traces raise average accuracy from ~40% to ~57% when used to prompt LLMs.
score":<your 1-5 score>, “feedback
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
cs.AI 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
TFRBench: A Reasoning Benchmark for Evaluating Forecasting Systems
TFRBench is a new benchmark and multi-agent synthesis method that generates reasoning traces for time-series forecasting and shows these traces raise average accuracy from ~40% to ~57% when used to prompt LLMs.