How do language models learn facts? dynamics, curricula and hallucinations

URLhttps://arxiv · 2025 · arXiv 2503.21676

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

representative citing papers

Why Fine-Tuning Encourages Hallucinations and How to Fix It

cs.CL · 2026-04-16 · unverdicted · novelty 6.0

Supervised fine-tuning increases LLM hallucinations via interference among overlapping semantic representations; self-distillation mitigates this by regularizing output-distribution drift while freezing parameters preserves performance when new facts are unnecessary.

Cram Less to Fit More: Training Data Pruning Improves Memorization of Facts

cs.CL · 2026-04-09 · conditional · novelty 6.0

Loss-based pruning of training data to limit facts and flatten their frequency distribution enables a 110M-parameter GPT-2 model to memorize 1.3 times more entity facts than standard training, matching a 1.3B-parameter model on the full dataset.

Deep sequence models tend to memorize geometrically; it is unclear why

cs.LG · 2025-10-30 · unverdicted · novelty 6.0

Deep sequence models develop geometric memory in embeddings that encodes novel global relationships, transforming l-fold composition tasks into 1-step navigation via a natural spectral bias connected to Node2Vec.

How Training Data Shapes the Use of Parametric and In-Context Knowledge in Language Models

cs.CL · 2025-09-29 · unverdicted · novelty 6.0

Balanced parametric and in-context knowledge use in LLMs is an emergent property requiring intra-document repetition, moderate inconsistency, and skewed distributions in training data.

Fine-Tuning Without Forgetting via Loss-Adaptive Learning Rates

cs.LG · 2026-05-19 · unverdicted · novelty 5.0

FINCH is a loss-adaptive learning-rate schedule that reduces forgetting by 93% on average during LLM fine-tuning while matching standard task performance across several benchmarks.

Do Activation Verbalization Methods Convey Privileged Information?

cs.CL · 2025-09-16 · unverdicted · novelty 5.0

Activation verbalization methods for LLMs largely reflect the verbalizer model's parametric knowledge rather than privileged information from the target model's activations.

citing papers explorer

Showing 6 of 6 citing papers.

Why Fine-Tuning Encourages Hallucinations and How to Fix It cs.CL · 2026-04-16 · unverdicted · none · ref 4
Supervised fine-tuning increases LLM hallucinations via interference among overlapping semantic representations; self-distillation mitigates this by regularizing output-distribution drift while freezing parameters preserves performance when new facts are unnecessary.
Cram Less to Fit More: Training Data Pruning Improves Memorization of Facts cs.CL · 2026-04-09 · conditional · none · ref 103
Loss-based pruning of training data to limit facts and flatten their frequency distribution enables a 110M-parameter GPT-2 model to memorize 1.3 times more entity facts than standard training, matching a 1.3B-parameter model on the full dataset.
Deep sequence models tend to memorize geometrically; it is unclear why cs.LG · 2025-10-30 · unverdicted · none · ref 211
Deep sequence models develop geometric memory in embeddings that encodes novel global relationships, transforming l-fold composition tasks into 1-step navigation via a natural spectral bias connected to Node2Vec.
How Training Data Shapes the Use of Parametric and In-Context Knowledge in Language Models cs.CL · 2025-09-29 · unverdicted · none · ref 20
Balanced parametric and in-context knowledge use in LLMs is an emergent property requiring intra-document repetition, moderate inconsistency, and skewed distributions in training data.
Fine-Tuning Without Forgetting via Loss-Adaptive Learning Rates cs.LG · 2026-05-19 · unverdicted · none · ref 76
FINCH is a loss-adaptive learning-rate schedule that reduces forgetting by 93% on average during LLM fine-tuning while matching standard task performance across several benchmarks.
Do Activation Verbalization Methods Convey Privileged Information? cs.CL · 2025-09-16 · unverdicted · none · ref 57
Activation verbalization methods for LLMs largely reflect the verbalizer model's parametric knowledge rather than privileged information from the target model's activations.

How do language models learn facts? dynamics, curricula and hallucinations

fields

years

verdicts

representative citing papers

citing papers explorer