arxiv: 2605.27564 · v1 · pith:WFRH6CU4new · submitted 2026-05-26 · 💻 cs.CL · cs.AI · cs.LG

The Future of Facts: Tracing the Factual Generation-Verification Gap

keywords: factual, models, verification, generation, across, continual, dynamics, facts

Language models are becoming the default interface to factual knowledge, yet they often verify outputs more reliably than they generate them. This generation-verification gap (GV-gap) underlies many recent advances in self-improvement and reasoning, but its dynamics on factual knowledge specifically remain poorly understood. We focus on the training mechanisms underlying factual GV-gaps, distinguishing them from their computational and aesthetic counterparts. We trace generation and verification capabilities through three training phases (acquisition, continual learning, and updating) across four open-source model families at two scales each. Three findings recur across models: (i) verification is consistently learned before generation; (ii) verification is more robust to continual learning than generation; and (iii) factual updates can leave models in a "multi-verse" state, simultaneously verifying both old and new answers as correct. Natural experiments on frontier models reproduce these dynamics at scale and reveal residual verification biases on well-covered facts.
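
As an illustration of the quantities the abstract refers to, the sketch below computes a GV-gap as verification accuracy minus generation accuracy over a set of facts, plus the share of facts in a "multi-verse" state. This is a minimal sketch under stated assumptions, not the paper's evaluation protocol: the abstract does not specify its metrics, and the names `FactRecord`, `gv_gap`, and `multiverse_rate` are hypothetical.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class FactRecord:
    """Per-fact outcomes (hypothetical schema, not taken from the paper)."""
    generated_correct: bool      # model produced the current answer when asked
    verified_correct: bool       # model judged the current answer to be true
    verified_old: bool = False   # model judged a superseded answer to be true


def gv_gap(records: Sequence[FactRecord]) -> float:
    """Verification accuracy minus generation accuracy (assumed definition)."""
    if not records:
        raise ValueError("need at least one record")
    n = len(records)
    generation_acc = sum(r.generated_correct for r in records) / n
    verification_acc = sum(r.verified_correct for r in records) / n
    return verification_acc - generation_acc


def multiverse_rate(records: Sequence[FactRecord]) -> float:
    """Fraction of facts where both the old and the new answer are verified."""
    if not records:
        raise ValueError("need at least one record")
    both = sum(r.verified_correct and r.verified_old for r in records)
    return both / len(records)
```

A positive `gv_gap` means the model verifies more reliably than it generates on that set of facts; a nonzero `multiverse_rate` flags updated facts for which the old answer is still accepted alongside the new one.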
