Temporal reasoning is not the core bottleneck for LLMs on time-based QA; the real issue is unstructured text-to-event mapping, addressed by a neuro-symbolic system with PIS that reaches 100% accuracy on benchmarks when representations are correct.
Towards benchmarking and improving the temporal reasoning capability of large language models
2 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
years
2026 2verdicts
UNVERDICTED 2roles
background 1polarities
background 1representative citing papers
Late fusion of absolute and relative temporal metadata into Transformer NER models produces more robust performance than early fusion on French and German historical datasets, especially in early noisy periods.
citing papers explorer
-
Temporal Reasoning Is Not the Bottleneck: A Probabilistic Inconsistency Framework for Neuro-Symbolic QA
Temporal reasoning is not the core bottleneck for LLMs on time-based QA; the real issue is unstructured text-to-event mapping, addressed by a neuro-symbolic system with PIS that reaches 100% accuracy on benchmarks when representations are correct.
-
A Study of Temporal Fusion Strategies for Named Entity Recognition in Historical Texts
Late fusion of absolute and relative temporal metadata into Transformer NER models produces more robust performance than early fusion on French and German historical datasets, especially in early noisy periods.