When New Generators Arrive: Lifelong Machine-Generated Text Attribution via Ridge Feature Transfer

Cheng Hong; Jiaheng Wei; Xinlei He; Yifan Liao; Yutao Yue; Zhen Sun; Zhicong Huang

arXiv: 2606.05626 · v1 · pith:FJXZ2GXLnew · submitted 2026-06-04 · 💻 cs.CL · cs.AI · cs.LG

Pith reviewed 2026-06-28 01:36 UTC · model grok-4.3

keywords: machine-generated text, lifelong learning, attribution, ridge regression, incremental learning, text classification, feature transfer

The pith

RidgeFT adds new generators to machine-generated text attribution via replay-free ridge updates on a frozen encoder.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper establishes that a task-aware encoder trained only on an initial set of generators can be frozen while still supporting effective lifelong attribution through stored class-wise sufficient statistics and closed-form ridge regression updates. RidgeFT applies covariance calibration to reduce irrelevant variation and augments with fixed random features before performing the analytic updates for each new generator class. This produces better macro-F1 scores than baselines while improving retention of old classes and adaptation to new ones across multiple domains, backbones, and incremental settings. A sympathetic reader would care because emerging language models require attribution systems that can incorporate new sources without replaying or retraining on all prior data.

Core claim

RidgeFT trains a task-aware encoder on the initial generator set, stores compact class-wise sufficient statistics when each generator class is first observed, freezes the encoder, suppresses generator-irrelevant variation through covariance calibration, improves representation capacity with fixed random features, and updates new classes through closed-form ridge regression based on class-level sufficient statistics.
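
The "fixed random features" step can be illustrated with a small sketch. This is not the paper's feature map, which this page does not specify. It assumes random Fourier features in the style of Rahimi and Recht, and the names `make_random_projection`, `expand`, `d_in`, and `d_phi` are hypothetical. The point it shows is that the projection is drawn once from a seed and never trained, so it adds capacity without introducing anything that could drift between increments.

```python
import numpy as np

def make_random_projection(d_in, d_phi, seed=0):
    """Draw a random projection once from a seed; it is never trained or updated."""
    rng = np.random.default_rng(seed)
    W = rng.standard_normal((d_in, d_phi)) / np.sqrt(d_in)  # assumed scaling
    b = rng.uniform(0.0, 2.0 * np.pi, size=d_phi)           # random phases
    return W, b

def expand(features, W, b):
    """Map (n, d_in) encoder features to (n, d_phi) random Fourier features."""
    d_phi = W.shape[1]
    return np.sqrt(2.0 / d_phi) * np.cos(features @ W + b)

# Toy usage: 5 stand-in encoder outputs of dimension 8, expanded to 32 features.
W, b = make_random_projection(d_in=8, d_phi=32, seed=0)
x = np.random.default_rng(1).standard_normal((5, 8))
phi = expand(x, W, b)
print(phi.shape)  # (5, 32)
```

Because the same seed always reproduces the same `W` and `b`, statistics stored for old classes stay comparable with features computed for classes that arrive later.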

What carries the argument

Closed-form ridge regression updates on class-wise sufficient statistics from a frozen task-aware encoder, preceded by covariance calibration and fixed random feature expansion.
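
A minimal sketch of how a ridge classifier can be refit from class-wise statistics alone, without touching old examples. It assumes one-hot regression targets and stores, per class, a feature sum and a second-moment matrix. The class name `RidgeHead` and its methods are hypothetical, and the paper's calibration and reweighting steps are left out.

```python
import numpy as np

class RidgeHead:
    """Ridge classifier rebuilt from per-class sums and second moments."""

    def __init__(self, dim, lam=1.0):
        self.dim = dim
        self.lam = lam
        self.stats = {}    # label -> (feature sum, second-moment matrix)
        self.labels = []   # class order defines the output columns
        self.W = None

    def add_class(self, label, feats):
        """Store statistics for a newly observed class, then re-solve."""
        s = feats.sum(axis=0)   # (dim,)
        A = feats.T @ feats     # (dim, dim)
        self.stats[label] = (s, A)
        self.labels.append(label)
        self._solve()

    def _solve(self):
        # With one-hot targets, X^T X is the sum of the per-class A matrices
        # and column c of X^T Y is the feature sum of class c.
        G = sum(A for _, A in self.stats.values())
        S = np.stack([self.stats[l][0] for l in self.labels], axis=1)
        self.W = np.linalg.solve(G + self.lam * np.eye(self.dim), S)

    def predict(self, feats):
        scores = feats @ self.W
        return [self.labels[i] for i in scores.argmax(axis=1)]
```

Under these assumptions the incrementally rebuilt weights equal the ones a batch ridge fit on all data would give, which is why no replay buffer is needed for the classifier itself.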

If this is right

New generator classes can be incorporated without storing or replaying any previous text examples.
Both old-class retention and new-class adaptation improve simultaneously compared with prior lifelong methods.
The same analytic procedure works across different text domains, encoder backbones, and incremental learning protocols.
Only compact class-level statistics need to be stored after the initial training phase.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same sufficient-statistic plus ridge update pattern could be tested on other incremental text classification tasks such as topic or author attribution.
Memory cost scales with the number of classes rather than the number of examples, which may become advantageous as the number of generators grows.
If the initial encoder captures sufficiently general features, the method could be applied to generators that appear long after the initial training period.
Direct comparison of wall-clock update time versus full retraining would quantify the efficiency gain in a production setting.
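
The memory point above can be made concrete with a little arithmetic. The helper below is a hypothetical illustration. The dimension 4096 and 8-byte floats are assumptions, chosen only because they reproduce the 768 MiB for six class-specific matrices quoted in the Figure 5 text. The paper's actual dimension and precision are not stated on this page.

```python
def second_moment_mib(num_classes, dim, bytes_per_value=8):
    """MiB needed to hold one dim x dim matrix per class."""
    return num_classes * dim * dim * bytes_per_value / 2**20

# Assumed values: 6 classes, dim=4096, float64.
print(second_moment_mib(num_classes=6, dim=4096))  # 768.0
```

The cost grows linearly in the number of classes and quadratically in the feature dimension, and it does not depend on how many texts each class contributed.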

Load-bearing premise

An encoder trained only on the initial generator set remains sufficiently discriminative when frozen so that class-wise sufficient statistics plus ridge updates can handle new generators without major loss of power or need for replay.

What would settle it

An evaluation in which new generators are added sequentially and RidgeFT shows either a large drop in old-class accuracy or lower overall macro-F1 than a full retraining baseline on the same data.

Figures

Figures reproduced from arXiv: 2606.05626 by Cheng Hong, Jiaheng Wei, Xinlei He, Yifan Liao, Yutao Yue, Zhen Sun, Zhicong Huang.

**Figure 2.** Overview of RidgeFT. RidgeFT only processes newly arriving class data and does not revisit old raw texts. Covariance Calibration. Base representations often capture generator-irrelevant variations (e.g., topic, length, domain) as high-variance directions, which impairs subsequent inner-product classifiers. To mitigate this, RidgeFT applies a fractional whitening transformation to suppress within-class noi…
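
The fractional whitening idea in the caption above can be sketched as follows. The exact transform used by RidgeFT is cut off in the caption, so the formula here is an assumption: shrink the covariance toward a scaled identity with coefficient `alpha`, then raise it to the power `-delta` through an eigendecomposition. The function name and default values are hypothetical.

```python
import numpy as np

def fractional_whitener(cov, delta=0.25, alpha=0.1):
    """Return a symmetric matrix T that damps high-variance directions.

    delta=0 leaves features unchanged and delta=0.5 is full whitening.
    alpha shrinks the covariance toward (trace/d) * I for numerical stability.
    """
    d = cov.shape[0]
    shrunk = (1.0 - alpha) * cov + alpha * (np.trace(cov) / d) * np.eye(d)
    eigvals, eigvecs = np.linalg.eigh(shrunk)
    # V diag(eigvals^-delta) V^T
    return (eigvecs * eigvals ** (-delta)) @ eigvecs.T

# Toy usage: features whose first axis has much larger variance than the rest.
rng = np.random.default_rng(0)
X = rng.standard_normal((500, 4)) * np.array([5.0, 2.0, 1.0, 0.5])
cov = np.cov(X, rowvar=False)
X_cal = X @ fractional_whitener(cov, delta=0.25, alpha=0.1)
```

An intermediate exponent is a compromise: it reduces the dominance of directions such as topic or length without inflating low-variance directions as strongly as full whitening would.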

**Figure 3.** Experiments on academic topics. P3 starts with …

**Figure 4.** Full-F1 under varying target-class data proportions.

**Figure 5.** Parameter sensitivity of RidgeFT. We vary one hyperparameter at a time while keeping the others fixed, including the covariance calibration exponent δ, trace shrinkage coefficient α, random feature dimension d_φ, class-reweighting strength β under the 20% setting, smoothing constant τ, and ridge regularization coefficient λ. … six class-specific A_c matrices alone accounting for 768 MiB. To reduce this bottlen…

**Figure 6.** Lifelong MGT attribution experiments on the social …

**Figure 7.** t-SNE visualization of different frozen feature spaces. We compare the raw frozen representation …

Original abstract

Machine-generated text (MGT) attribution aims to identify the specific generator responsible for a given text, thereby providing fine-grained evidence for model accountability and misuse investigation. As new large language models continue to emerge, attribution models must continuously incorporate new generators while preserving their ability to recognize previously seen ones. Prior works have shown that this lifelong MGT attribution setting is challenging, and existing methods often struggle to achieve a stable balance between adapting to new classes and retaining old ones. To address this issue, we propose RidgeFT, a lightweight analytic update framework that does not rely on exemplar replay. RidgeFT trains a task-aware encoder on the initial generator set, stores compact class-wise sufficient statistics when each generator class is first observed, and then freezes the encoder for replay-free closed-form updates. It then suppresses generator-irrelevant variation through covariance calibration, improves representation capacity with fixed random features, and updates new classes through closed-form ridge regression based on class-level sufficient statistics. Across multi-topic evaluations with varying initial generator setups, RidgeFT consistently outperforms baselines. It achieves the best macro-F1 across domains, backbones, and incremental protocols, while also improving both old-class retention and new-class adaptation. These results suggest that feature-stable analytic updates provide a simple yet effective approach to lifelong MGT attribution.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note (private letter to a colleague)

RidgeFT gives a replay-free analytic update for lifelong MGT attribution but rests on an unproven assumption about the initial encoder's features.

The main idea is RidgeFT: train an encoder on the first batch of generators, freeze it, store per-class means and covariances, calibrate the covariance, add random features, and then do closed-form ridge regression when new generators arrive. No replay of old examples.

This combination of sufficient statistics, covariance calibration, and analytic ridge updates is not in the prior work the abstract cites, so the framework itself is new. It targets a real operational need in attribution and forensics where storing old data may not be feasible.

The abstract claims consistent outperformance on macro-F1 across domains, backbones, and incremental setups while improving both retention and adaptation. That would be useful if the numbers hold.

The soft spots are bigger. The whole thing assumes the frozen encoder's feature space already makes new-generator classes linearly separable from each other and from the old ones. The stress-test note is right: if later models produce stylistic or topical shifts outside that span, the ridge update on the stored stats cannot recover discriminative power. The abstract gives no analysis or ablation showing the feature distributions stay aligned. It also supplies no quantitative results, protocols, or error breakdowns, so the performance claims cannot be checked.

This is for people working on MGT detection systems that need to keep adding generators over time. A reader in that niche might pick up the analytic-update trick, but only after seeing whether the experiments actually test the feature-stability assumption.

I would bring it to a reading group as a maybe, to talk through the method. I would not cite it yet. It deserves peer review because the problem is practical and the analytic approach is distinct enough to merit referee scrutiny of the experiments and the generalization question.

Referee Report

2 major / 2 minor

Summary. The manuscript proposes RidgeFT, a replay-free framework for lifelong machine-generated text attribution. An encoder is trained on the initial generator set and frozen. Class-wise sufficient statistics (means and covariances) are stored upon first observation of each generator. For new generators, closed-form ridge regression updates are performed after covariance calibration and augmentation with fixed random features. The paper reports that this approach consistently achieves the highest macro-F1 scores across domains, backbones, and incremental protocols while balancing retention of old classes and adaptation to new ones.

Significance. If the reported results are robust, the work offers a computationally efficient alternative to replay-based or fine-tuning methods for continual attribution, which is relevant as new LLMs proliferate. The analytic nature of the updates and avoidance of exemplar storage are notable strengths, providing a parameter-efficient way to handle incremental classes without catastrophic forgetting.

major comments (2)

[Method] The core assumption that the feature space learned from the initial generator set remains discriminative for subsequently introduced generators is not validated. The ridge update is derived under the premise of linear separability in this fixed space, but no experiment or analysis demonstrates that stylistic or topical shifts from new LLMs lie within the span of the initial encoder's representation. This assumption is load-bearing for the central claim of effective closed-form adaptation.
[Experiments] Table reporting macro-F1 results across incremental protocols: while outperformance is claimed, there is no ablation isolating the contribution of covariance calibration versus random features, nor any diagnostic measuring how much new-generator variance falls outside the initial encoder span. This weakens the ability to attribute gains specifically to the analytic update mechanism.

minor comments (2)

[Abstract] The abstract asserts quantitative superiority (best macro-F1, improved retention and adaptation) without supplying any numerical values, dataset sizes, or protocol details, reducing its standalone informativeness.
[Method] Notation for the class-wise sufficient statistics and the exact closed-form ridge solution could be presented with numbered equations to improve reproducibility.
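
For orientation, the textbook closed form that such numbered equations would likely build on is sketched below. The notation is hypothetical and not taken from the paper: φ is the frozen (and possibly calibrated and expanded) feature map, s_c and A_c are the per-class feature sum and second moment, λ is the ridge coefficient, and targets are assumed to be one-hot. Class means and covariances carry the same information once class counts are stored.

```latex
\begin{aligned}
s_c &= \sum_{i:\,y_i = c} \phi(x_i), \qquad
A_c = \sum_{i:\,y_i = c} \phi(x_i)\,\phi(x_i)^{\top}, \\
G &= \sum_{c=1}^{C} A_c, \qquad
S = [\, s_1 \;\cdots\; s_C \,], \\
W &= (G + \lambda I)^{-1} S
   = \arg\min_{W} \; \lVert X W - Y \rVert_F^2 + \lambda \lVert W \rVert_F^2 .
\end{aligned}
```

Here X stacks the features φ(x_i) as rows and Y stacks the one-hot labels, so that X^T X = G and X^T Y = S. Adding class C+1 only adds A_{C+1} to G and appends the column s_{C+1} to S.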

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback highlighting the importance of validating the core feature-space assumption and providing targeted ablations. We address each major comment below and will incorporate revisions to strengthen the manuscript.

Referee: [Method] The core assumption that the feature space learned from the initial generator set remains discriminative for subsequently introduced generators is not validated. The ridge update is derived under the premise of linear separability in this fixed space, but no experiment or analysis demonstrates that stylistic or topical shifts from new LLMs lie within the span of the initial encoder's representation. This assumption is load-bearing for the central claim of effective closed-form adaptation.

Authors: We agree that the manuscript does not contain a direct diagnostic validating that new-generator features remain within the discriminative span of the initial encoder. While the reported results across domains, backbones, and protocols show that the closed-form updates yield strong macro-F1 and retention-adaptation balance, this does not substitute for an explicit test (e.g., variance explained by the initial principal components or alignment of new class statistics). In revision we will add such an analysis to quantify how much new-generator variation projects onto the frozen feature space.

Revision: yes

Referee: [Experiments] Table reporting macro-F1 results across incremental protocols: while outperformance is claimed, there is no ablation isolating the contribution of covariance calibration versus random features, nor any diagnostic measuring how much new-generator variance falls outside the initial encoder span. This weakens the ability to attribute gains specifically to the analytic update mechanism.

Authors: We concur that the current experiments do not isolate the individual contributions of covariance calibration and random-feature augmentation, nor do they include the out-of-span variance diagnostic. To address this, the revised manuscript will add controlled ablations that remove or vary each component while keeping the ridge update fixed, together with the diagnostic requested in the method comment. These additions will allow clearer attribution of performance to the analytic mechanism.

Revision: yes
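
One way the diagnostic mentioned in the first response (variance explained by the initial principal components) might look is sketched here. This is an illustration, not the authors' planned analysis. The function name and the choice of `k` are hypothetical, and the inputs are assumed to be frozen-encoder features for the initial classes and for one new generator.

```python
import numpy as np

def in_span_variance_ratio(base_feats, new_feats, k):
    """Fraction of a new class's variance captured by the top-k
    principal directions of the initial-class features."""
    base_centered = base_feats - base_feats.mean(axis=0)
    _, _, vt = np.linalg.svd(base_centered, full_matrices=False)
    basis = vt[:k]  # (k, d), orthonormal rows
    new_centered = new_feats - new_feats.mean(axis=0)
    total = np.sum(new_centered ** 2)
    projected = np.sum((new_centered @ basis.T) ** 2)
    return projected / total

# Toy usage: base features vary only along the first two axes.
rng = np.random.default_rng(0)
base = np.zeros((200, 5)); base[:, :2] = rng.standard_normal((200, 2))
inside = np.zeros((50, 5)); inside[:, :2] = rng.standard_normal((50, 2))
outside = np.zeros((50, 5)); outside[:, 4] = rng.standard_normal(50)
print(in_span_variance_ratio(base, inside, k=2))   # close to 1
print(in_span_variance_ratio(base, outside, k=2))  # close to 0
```

A ratio near 1 would support the frozen-encoder premise for that generator, while a low ratio would flag variation the initial feature space does not represent. The sketch centers each new class on its own mean, so it measures within-class spread only and a shift in the class mean would need a separate check.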

Circularity Check

0 steps flagged

No circularity; method is an empirical proposal using standard analytic updates

The paper proposes RidgeFT as a practical method: train encoder on initial generators, freeze it, store class-wise sufficient statistics (means/covariances), then apply closed-form ridge regression for new classes after covariance calibration and random features. No derivation chain, first-principles claim, or prediction is shown to reduce by construction to fitted inputs or self-citations. Performance claims rest on multi-domain empirical evaluations rather than any self-referential definition or load-bearing prior result from the same authors. This is self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review yields no identifiable free parameters, axioms, or invented entities; method description implies standard ridge regression and encoder training but provides no explicit ledger entries.

[1] Lucas Caccia, Rahaf Aljundi, Nader Asadi, Tinne Tuytelaars, Joelle Pineau, and Eugene Belilovsky. New insights on reducing abrupt representation change in online continual learning. arXiv preprint arXiv:2104.05025, 2021.

[2] Lucio La Cava and Andrea Tagarelli. OpenTuringBench: An open-model-based benchmark and framework for machine-generated text detection and attribution. In Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 26655–26671. Association for Computational Linguistics, 2025.

[3] Zhihui Chen, Kai He, Yucheng Huang, Yunxiao Zhu, and Mengling Feng. Divscore: Zero-shot detection of LLM-generated text in specialized domains. In Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, and Violet Peng, editors, Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 19231–19253. ACL, 2025.

[4] Hongchao Fang, Yixin Liu, Jiangshu Du, Can Qin, Ran Xu, Feng Liu, Lichao Sun, Dongwon Lee, Lifu Huang, and Wenpeng Yin. Could AI trace and explain the origins of AI-generated images and text? arXiv preprint arXiv:2504.04279, 2025.

[5] Robert M French. Catastrophic forgetting in connectionist networks. Trends in cognitive sciences, 3(4):128–135,

[6] Aaron Grattafiori, Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Alex Vaughan, et al. The Llama 3 Herd of Models. arXiv preprint arXiv:2407.21783, 2024.

[7] Wei Hao, Ran Li, Weiliang Zhao, Junfeng Yang, and Chengzhi Mao. Learning to rewrite: Generalized LLM-generated text detection. In Wanxiang Che, Joyce Nabende, Ekaterina Shutova, and Mohammad Taher Pilehvar, editors, Annual Meeting of the Association for Computational Linguistics (ACL), pages 6421–6434. ACL, 2025.

[8] Pengcheng He, Jianfeng Gao, and Weizhu Chen. DeBERTaV3: Improving DeBERTa using ELECTRA-Style Pre-Training with Gradient-Disentangled Embedding Sharing. arXiv preprint arXiv:2111.09543, 2021.

[9] Xinlei He, Xinyue Shen, Zeyuan Chen, Michael Backes, and Yang Zhang. MGTBench: Benchmarking machine-generated text detection. In Bo Luo, Xiaojing Liao, Jun Xu, Engin Kirda, and David Lie, editors, ACM SIGSAC Conference on Computer and Communications Security (CCS), pages 2251–2265. ACM, 2024.

[10] Baixiang Huang, Canyu Chen, and Kai Shu. Authorship attribution in the era of LLMs: Problems, methodologies, and challenges. ACM SIGKDD Explorations Newsletter, 26(2):21–43, 2025.

[11] Jianheng Huang, Leyang Cui, Ante Wang, Chengyi Yang, Xinting Liao, Linfeng Song, Junfeng Yao, and Jinsong Su. Mitigating catastrophic forgetting in large language models with self-synthesized rehearsal. In Annual Meeting of the Association for Computational Linguistics (ACL), pages 1416–1428, 2024.

[12] Aaron Hurst, Adam Lerer, Adam P Goucher, Adam Perelman, Aditya Ramesh, Aidan Clark, AJ Ostrow, Akila Welihinda, Alan Hayes, Alec Radford, et al. GPT-4o System Card. arXiv preprint arXiv:2410.21276, 2024.

[13] Albert Q Jiang, Alexandre Sablayrolles, Antoine Roux, Arthur Mensch, Blanche Savary, Chris Bamford, Devendra Singh Chaplot, Diego de las Casas, Emma Bou Hanna, Florian Bressand, et al. Mixtral of Experts. arXiv preprint arXiv:2401.04088, 2024.

[14] Kaijie Jiao, Quan Wang, Licheng Zhang, Zikang Guo, and Zhendong Mao. M-rangedetector: Enhancing generalization in machine-generated text detection through multi-range attention masks. In Wanxiang Che, Joyce Nabende, Ekaterina Shutova, and Mohammad Taher Pilehvar, editors, Findings of the Association for Computational Linguistics: ACL, pages 8971–8983. ACL,

[15] Tharindu Kumarage, Garima Agrawal, Paras Sheth, Raha Moraffah, Aman Chadha, Joshua Garland, and Huan Liu. A survey of AI-generated text forensic systems: Detection, attribution, and characterization. arXiv preprint arXiv:2403.01152, 2024.

[16] Lucio La Cava, Dominik Macko, Róbert Móro, Ivan Srba, and Andrea Tagarelli. Authorship Attribution in Multilingual Machine-Generated Texts. arXiv preprint arXiv:2508.01656, 2025.

[17] Xiang Li, Zhiyi Yin, Hexiang Tan, Shaoling Jing, Du Su, Yi Cheng, Huawei Shen, and Fei Sun. Prdetect: Perturbation-robust LLM-generated text detection based on syntax tree. In Luis Chiruzzo, Alan Ritter, and Lu Wang, editors, Findings of the Association for Computational Linguistics: NAACL, pages 8290–8301. ACL, 2025.

[18] Yuanfan Li, Zhaohan Zhang, Chengzhengxu Li, Chao Shen, and Xiaoming Liu. Iron sharpens iron: Defending against attacks in machine-generated text detection with adversarial training. In Wanxiang Che, Joyce Nabende, Ekaterina Shutova, and Mohammad Taher Pilehvar, editors, Annual Meeting of the Association for Computational Linguistics (ACL), pages 3091... 2025.

[19] Zhizhong Li and Derek Hoiem. Learning without forgetting. IEEE transactions on pattern analysis and machine intelligence, 40(12):2935–2947, 2017.

[20] Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. RoBERTa: A Robustly Optimized BERT Pretraining Approach. arXiv preprint arXiv:1907.11692, 2019.

[21] Yule Liu, Zhiyuan Zhong, Yifan Liao, Zhen Sun, Jingyi Zheng, Jiaheng Wei, Qingyuan Gong, Fenghua Tong, Yang Chen, Yang Zhang, and Xinlei He. On the generalization and adaptation ability of machine-generated text detectors in academic writing. In ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), pages 5674–5685. ACM, 2025.

[22] Dominik Macko, Jakub Kopal, Róbert Móro, and Ivan Srba. Multisocial: Multilingual benchmark of machine-generated text detection of social-media texts. In Wanxiang Che, Joyce Nabende, Ekaterina Shutova, and Mohammad Taher Pilehvar, editors, Annual Meeting of the Association for Computational Linguistics (ACL), pages 727–752. ACL, 2025.

[23] Michael McCloskey and Neal J Cohen. Catastrophic interference in connectionist networks: The sequential learning problem. In Psychology of learning and motivation, volume 24, pages 109–165. Elsevier, 1989.

[24] Moonshot AI. Moonshot AI. https://www.moonshot.ai/, 2026. Accessed: 2026-05-15.

[25] Ayat A Najjar, Huthaifa I Ashqar, Omar Darwish, and Eman Hammad. Leveraging explainable AI for LLM text attribution: Differentiating human-written and multiple LLM-generated text. Information, 16(9):767, 2025.

[26] OpenClaw. Openclaw docs. https://docs.openclaw.ai/, 2026. Accessed: 2026-05-14.

[27] Adetoun A Oyelude. Artificial intelligence (AI) tools for academic research. Library Hi Tech News, 41(8):18–20,

[28] Andrea Pedrotti, Michele Papucci, Cristiano Ciaccio, Alessio Miaschi, Giovanni Puccetti, Felice Dell’Orletta, and Andrea Esuli. Stress-testing machine generated text detection: Shifting language models writing style to fool detectors. In Wanxiang Che, Joyce Nabende, Ekaterina Shutova, and Mohammad Taher Pilehvar, editors, Findings of the Association for ... 2025.

[29] Ali Rahimi and Benjamin Recht. Random features for large-scale kernel machines. Advances in neural information processing systems, 20, 2007.

[30] Sylvestre-Alvise Rebuffi, Alexander Kolesnikov, Georg Sperl, and Christoph H Lampert. iCaRL: Incremental classifier and representation learning. In Proceedings of the IEEE conference on Computer Vision and Pattern Recognition, pages 2001–2010, 2017.

[31] Shoumik Saha and Soheil Feizi. Almost AI, almost human: The challenge of detecting AI-polished writing. In Wanxiang Che, Joyce Nabende, Ekaterina Shutova, and Mohammad Taher Pilehvar, editors, Findings of the Association for Computational Linguistics: ACL, pages 25414–25431. ACL, 2025.

[32] Areg Mikael Sarvazyan, José Ángel González, Marc Franco-Salvador, Francisco Rangel, Berta Chulvi, and Paolo Rosso. Overview of AuTexTification at IberLEF 2023: Detection and attribution of machine-generated text in multiple domains. arXiv preprint arXiv:2309.11285, 2023.

[33] Zhixiong Su, Yichen Wang, Herun Wan, Zhaohan Zhang, and Minnan Luo. Haco-det: A study towards fine-grained machine-generated text detection under human-AI coauthoring. In Wanxiang Che, Joyce Nabende, Ekaterina Shutova, and Mohammad Taher Pilehvar, editors, Annual Meeting of the Association for Computational Linguistics (ACL), pages 22015–22036. ACL, 2025.

[34] Zhen Sun, Zongmin Zhang, Xinyue Shen, Ziyi Zhang, Yule Liu, Michael Backes, Yang Zhang, and Xinlei He. Are we in the AI-generated text world already? Quantifying and monitoring AIGT on social media. In Wanxiang Che, Joyce Nabende, Ekaterina Shutova, and Mohammad Taher Pilehvar, editors, Annual Meeting of the Association for Computational Linguistics ... 2025.

[35] Hugo Touvron, Thibaut Lavril, Gautier Izacard, Xavier Martinet, Marie-Anne Lachaux, Timothée Lacroix, Baptiste Rozière, Naman Goyal, Eric Hambro, Faisal Azhar, et al. LLaMA: Open and Efficient Foundation Language Models. arXiv preprint arXiv:2302.13971, 2023.

[36] Hugo Touvron, Louis Martin, Kevin Stone, Peter Albert, Amjad Almahairi, Yasmine Babaei, Nikolay Bashlykov, Soumya Batra, Prajjwal Bhargava, Shruti Bhosale, et al. Llama 2: Open Foundation and Fine-Tuned Chat Models. arXiv preprint arXiv:2307.09288, 2023.

[37] Eli Verwimp, Rahaf Aljundi, Shai Ben-David, Matthias Bethge, Andrea Cossu, Alexander Gepperth, Tyler L Hayes, Eyke Hüllermeier, Christopher Kanan, Dhireesha Kudithipudi, et al. Continual learning: Applications and the road forward. arXiv preprint arXiv:2311.11908,

[38] Junchao Wu, Shu Yang, Runzhe Zhan, Yulin Yuan, Lidia S. Chao, and Derek Fai Wong. A survey on LLM-generated text detection: Necessity, methods, and future directions. Comput. Linguistics, 51(1):275–338, 2025.

[39] Yue Wu, Yinpeng Chen, Lijuan Wang, Yuancheng Ye, Zicheng Liu, Yandong Guo, and Yun Fu. Large scale incremental learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 374–382, 2019.

[40] Lu Yu, Bartlomiej Twardowski, Xialei Liu, Luis Herranz, Kai Wang, Yongmei Cheng, Shangling Jui, and Joost van de Weijer. Semantic drift compensation for class-incremental learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 6982–6991, 2020.
