TypewriterLM is a 7.24B language model pretrained exclusively on pre-1913 English text using a 54B-token corpus, lexically grounded instruction tuning, and the History-Event benchmark for temporal consistency.
A Careful Examination of Large Language Model Performance on Grade School Arithmetic
1 Pith paper cites this work.
Fields: cs.CL (1)
Years: 2026 (1)
Verdicts: ACCEPT (1)
Representative citing papers
Pretraining Language Models on Historical Text