pith. sign in

arxiv: 2111.07408 · v2 · pith:HYBNHOUTnew · submitted 2021-11-14 · 💻 cs.CL

Time Waits for No One! Analysis and Challenges of Temporal Misalignment

classification 💻 cs.CL
keywords temporalmisalignmenttimecontinueddataacrossdomainseffects
0
0 comments X
read the original abstract

When an NLP model is trained on text data from one time period and tested or deployed on data from another, the resulting temporal misalignment can degrade end-task performance. In this work, we establish a suite of eight diverse tasks across different domains (social media, science papers, news, and reviews) and periods of time (spanning five years or more) to quantify the effects of temporal misalignment. Our study is focused on the ubiquitous setting where a pretrained model is optionally adapted through continued domain-specific pretraining, followed by task-specific finetuning. We establish a suite of tasks across multiple domains to study temporal misalignment in modern NLP systems. We find stronger effects of temporal misalignment on task performance than have been previously reported. We also find that, while temporal adaptation through continued pretraining can help, these gains are small compared to task-specific finetuning on data from the target time period. Our findings motivate continued research to improve temporal robustness of NLP models.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. ChatGPT as a Time Capsule: The Limits of Price Discovery

    q-fin.GN 2026-04 unverdicted novelty 6.0

    Frozen LLM checkpoints serve as time capsules of public text and generate outlook scores that forecast equity returns and analyst actions beyond contemporaneous valuations.