Latent chain-of-thought via recurrent feedback tokens from compressed hidden states improves transformer performance on time-series forecasting and tabular prediction across 36 datasets.
Is One Layer Enough? Understanding Inference Dynamics in Tabular Foundation Models, 2026
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.LG (1)
Years: 2026 (1)
Verdicts: CONDITIONAL (1)
Representative citing papers:
Latent Chain-of-Thought Improves Structured-Data Transformers