Is model collapse inevitable? Breaking the curse of recursion by accumulating real and synthetic data

2024 · arXiv 2404.01413

10 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

The Economics of Model Collapse: Equilibrium, Welfare, and Optimal Provenance Subsidies in Synthetic Data Markets

econ.GN · 2026-05-19 · unverdicted · novelty 8.0

Introduces the Synthetic Data Contamination Equilibrium and derives closed-form optimal provenance subsidies, s* = KL(q‖p)/(2κ), along with watermark strengths to mitigate model collapse; OLS estimates on C4 data match the structural predictions.

The Impact of AI-Generated Text on the Internet

cs.CY · 2026-04-14 · unverdicted · novelty 7.0

By mid-2025 roughly 35% of new websites are AI-generated or AI-assisted, correlating with lower semantic diversity and higher positive sentiment but showing no significant drop in factual accuracy or stylistic diversity.

Drift and selection in LLM text ecosystems

cs.CL · 2026-03-15 · unverdicted · novelty 7.0

Recursive LLM text generation drives public corpora toward shallow equilibria via drift unless normative selection for quality sustains deeper structure with a bounded divergence.

FuXi-TC: A generative framework integrating deep learning and physics-based models for improved tropical cyclone forecasts

physics.ao-ph · 2025-08-22 · unverdicted · novelty 6.0

FuXi-TC combines the FuXi global DL model with a diffusion generative framework to downscale and improve TC intensity and precipitation forecasts, matching ECMWF skill while being faster and generalizing zero-shot to North Atlantic hurricanes.

Filter Babel: The Challenge of Synthetic Media to Authenticity and Common Ground in AI-Mediated Communication

cs.HC · 2026-04-17 · unverdicted · novelty 5.0

Filter Babel explores a future of AI-personalized private experiences that may erode common ground in communication while supporting individual identity and selfhood.

Cognitive Atrophy and Systemic Collapse in AI-Dependent Software Engineering

cs.SE · 2026-04-29 · unverdicted · novelty 4.0

LLM integration in software engineering builds epistemological debt that erodes mental models and homogenizes code via recursive training, risking systemic fragility as illustrated by 2026 Amazon outages.

Knowledge Distillation Must Account for What It Loses

cs.LG · 2026-04-28 · unverdicted · novelty 4.0 · 2 refs

Knowledge distillation evaluations must report lost teacher capabilities via a Distillation Loss Statement rather than relying solely on task scores.

Reinforcement Learning from Human Feedback

cs.LG · 2025-04-16 · unverdicted · novelty 2.0

The book introduces the origins, mathematical setup, and optimization stages of RLHF including reward modeling, reinforcement learning, rejection sampling, and direct alignment algorithms.

Curated Synthetic Data Doesn't Have to Collapse: A Theoretical Study of Generative Retraining with Pluralistic Preferences

cs.LG · 2026-05-08

Position: the Stochastic Parrot in the Coal Mine. Model Collapse is a Threat to Low-Resource Communities

cs.LG · 2026-05-05

citing papers explorer

Showing 4 of 4 citing papers after filters.

Knowledge Distillation Must Account for What It Loses

cs.LG · 2026-04-28 · unverdicted · none · ref 43 · 2 links

Knowledge distillation evaluations must report lost teacher capabilities via a Distillation Loss Statement rather than relying solely on task scores.

Reinforcement Learning from Human Feedback

cs.LG · 2025-04-16 · unverdicted · none · ref 274

The book introduces the origins, mathematical setup, and optimization stages of RLHF including reward modeling, reinforcement learning, rejection sampling, and direct alignment algorithms.

Curated Synthetic Data Doesn't Have to Collapse: A Theoretical Study of Generative Retraining with Pluralistic Preferences

cs.LG · 2026-05-08 · unreviewed · ref 78

Position: the Stochastic Parrot in the Coal Mine. Model Collapse is a Threat to Low-Resource Communities

cs.LG · 2026-05-05 · unreviewed · ref 11
