Self-consuming generative models go mad

Sina Alemohammad, Jose Casco-Rodriguez, Lorenzo Luzi, Ahmed Imtiaz Humayun, Hossein Babaei, Daniel LeJeune, Ali Siahkoohi, Richard G Baraniuk · 2023 · arXiv 2307.01850

9 Pith papers cite this work. Polarity classification is still indexing.

9 Pith papers citing it

read on arXiv browse 9 citing papers

citation-role summary

background 1 baseline 1

citation-polarity summary

background 1 baseline 1

representative citing papers

Perturbation Dose Responses in Recursive LLM Loops: Raw Switching, Stochastic Floors, and Persistent Escape under Append, Replace, and Dialog Updates

cs.AI · 2026-05-04 · unverdicted · novelty 7.0

In 30-step recursive LLM loops, append-mode persistent escape from source basins reaches 50% near 400 tokens under full history but plateaus below 50% under tail-clip memory policy, while replace-mode switching largely reflects state reset.

The Impact of AI-Generated Text on the Internet

cs.CY · 2026-04-14 · unverdicted · novelty 7.0

By mid-2025 roughly 35% of new websites are AI-generated or AI-assisted, correlating with lower semantic diversity and higher positive sentiment but showing no significant drop in factual accuracy or stylistic diversity.

Self-Training Doesn't Flatten Language -- It Restructures It: Surface Markers Amplify While Deep Syntax Dies

cs.CL · 2026-05-20 · unverdicted · novelty 6.0

Self-training restructures language by amplifying surface markers and collapsing deep syntax according to structural depth rather than frequency, as evidenced by correlations across multiple models and a human fine-tuning control.

Alice v1: Distillation-Enhanced Video Generation Surpassing Closed-Source Models

cs.GR · 2026-04-27 · unverdicted · novelty 6.0

Alice v1 is an open video model that surpasses its teacher and closed-source systems like Veo3 and Sora2 in quality while running 7x faster through specialized distillation.

Multi-LLM Systems Exhibit Robust Semantic Collapse

cs.MA · 2026-05-16 · unverdicted · novelty 5.0

Closed-loop multi-LLM systems exhibit robust semantic collapse across model families and interventions, consistent with intrinsic properties of autoregressive generation.

Filter Babel: The Challenge of Synthetic Media to Authenticity and Common Ground in AI-Mediated Communication

cs.HC · 2026-04-17 · unverdicted · novelty 5.0

Filter Babel explores a future of AI-personalized private experiences that may erode common ground in communication while supporting individual identity and selfhood.

On Inverse Problems, Parameter Estimation, and Domain Generalization

cs.IT · 2025-06-06 · unverdicted · novelty 5.0

A theoretical framework for parameter estimation in inverse problems shows inversion does not necessarily improve accuracy per the data processing inequality and reveals a vulnerability in domain generalization via the Double Meaning Theorem.

Losing our Tail, Again: (Un)Natural Selection & Multilingual LLMs

cs.CL · 2025-07-05 · unverdicted · novelty 4.0

Position paper warns that model collapse in self-consuming multilingual LLM training loops risks flattening linguistic diversity and cultural nuance.

Will LLMs Scaling Hit the Wall? Breaking Barriers via Distributed Resources on Massive Edge Devices

cs.DC · 2025-03-11 · unverdicted · novelty 2.0

Position paper claiming that distributed training across massive edge devices can overcome data depletion and centralized compute monopolies in LLM scaling.

citing papers explorer

Showing 9 of 9 citing papers.

Perturbation Dose Responses in Recursive LLM Loops: Raw Switching, Stochastic Floors, and Persistent Escape under Append, Replace, and Dialog Updates cs.AI · 2026-05-04 · unverdicted · none · ref 4
In 30-step recursive LLM loops, append-mode persistent escape from source basins reaches 50% near 400 tokens under full history but plateaus below 50% under tail-clip memory policy, while replace-mode switching largely reflects state reset.
The Impact of AI-Generated Text on the Internet cs.CY · 2026-04-14 · unverdicted · none · ref 2
By mid-2025 roughly 35% of new websites are AI-generated or AI-assisted, correlating with lower semantic diversity and higher positive sentiment but showing no significant drop in factual accuracy or stylistic diversity.
Self-Training Doesn't Flatten Language -- It Restructures It: Surface Markers Amplify While Deep Syntax Dies cs.CL · 2026-05-20 · unverdicted · none · ref 1
Self-training restructures language by amplifying surface markers and collapsing deep syntax according to structural depth rather than frequency, as evidenced by correlations across multiple models and a human fine-tuning control.
Alice v1: Distillation-Enhanced Video Generation Surpassing Closed-Source Models cs.GR · 2026-04-27 · unverdicted · none · ref 19
Alice v1 is an open video model that surpasses its teacher and closed-source systems like Veo3 and Sora2 in quality while running 7x faster through specialized distillation.
Multi-LLM Systems Exhibit Robust Semantic Collapse cs.MA · 2026-05-16 · unverdicted · none · ref 16
Closed-loop multi-LLM systems exhibit robust semantic collapse across model families and interventions, consistent with intrinsic properties of autoregressive generation.
Filter Babel: The Challenge of Synthetic Media to Authenticity and Common Ground in AI-Mediated Communication cs.HC · 2026-04-17 · unverdicted · none · ref 3
Filter Babel explores a future of AI-personalized private experiences that may erode common ground in communication while supporting individual identity and selfhood.
On Inverse Problems, Parameter Estimation, and Domain Generalization cs.IT · 2025-06-06 · unverdicted · none · ref 2
A theoretical framework for parameter estimation in inverse problems shows inversion does not necessarily improve accuracy per the data processing inequality and reveals a vulnerability in domain generalization via the Double Meaning Theorem.
Losing our Tail, Again: (Un)Natural Selection & Multilingual LLMs cs.CL · 2025-07-05 · unverdicted · none · ref 1
Position paper warns that model collapse in self-consuming multilingual LLM training loops risks flattening linguistic diversity and cultural nuance.
Will LLMs Scaling Hit the Wall? Breaking Barriers via Distributed Resources on Massive Edge Devices cs.DC · 2025-03-11 · unverdicted · none · ref 36
Position paper claiming that distributed training across massive edge devices can overcome data depletion and centralized compute monopolies in LLM scaling.

Self-consuming generative models go mad

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer