LOKI: Memory-Free Null-Space Constrained Lifelong Knowledge Editing

Jennifer Dy; Masih Eskandar; Miquel Sirera Perell\'o; Stratis Ioannidis

arxiv: 2606.19679 · v1 · pith:7R6BR6XAnew · submitted 2026-06-18 · 💻 cs.LG · cs.AI

LOKI: Memory-Free Null-Space Constrained Lifelong Knowledge Editing

Masih Eskandar , Miquel Sirera Perell\'o , Stratis Ioannidis , Jennifer Dy This is my paper

Pith reviewed 2026-06-26 18:44 UTC · model grok-4.3

classification 💻 cs.LG cs.AI

keywords lifelong knowledge editingnull-space projectiondynamic layer selectionHilbert-Schmidt Independence Criterionlanguage modelscatastrophic forgettinggradient projection

0 comments

The pith

LOKI updates language models sequentially with new knowledge by selecting layers dynamically via HSIC and projecting gradients into the null-space of weights without any prior data access.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents LOKI as a method for lifelong knowledge editing that avoids two common limitations in current approaches. Fixed layer choices across edits reduce flexibility and raise forgetting, while many techniques need stored past examples or precomputed statistics. LOKI instead picks layers on the fly using the Hilbert-Schmidt Independence Criterion and forces each gradient update to lie in the null-space of the current model weights. Experiments across benchmarks report higher average accuracy than baselines, with gains reaching 14 percent. Readers would care because the approach makes repeated model corrections feasible in settings where historical data cannot be retained.

Core claim

LOKI achieves superior performance to existing approaches across a wide variety of experiments, achieving up to a 14% improvement in average accuracy, by using HSIC-based dynamic layer selection and projecting gradient updates onto the null-space of the model weights, bypassing the requirement for previous knowledge access.

What carries the argument

Dynamic layer selection via the Hilbert-Schmidt Independence Criterion together with projection of gradient updates onto the null-space of the model weights.

If this is right

Sequential edits can be performed without storing or replaying any previous knowledge samples.
Different knowledge updates can modify different layers, avoiding the rigidity of fixed-layer schemes.
Gradient steps remain orthogonal to existing weight directions, limiting unintended changes to prior edits.
The method supports editing under strict privacy or storage constraints that bar access to historical data.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If null-space projections remain effective over dozens of successive edits, the approach could support very long editing histories without explicit regularization terms.
HSIC-based selection might be replaced by cheaper proxies such as activation variance if the independence criterion proves expensive at scale.
The same null-space idea could be tested on non-transformer architectures where weight matrices have different spectral properties.
One could measure whether the chosen layers correlate with attention heads that encode the edited facts, providing a post-hoc interpretability check.

Load-bearing premise

That projecting gradient updates onto the null-space of model weights, combined with HSIC-based dynamic layer selection, sufficiently prevents interference with previously edited knowledge even without access to any prior data or statistics.

What would settle it

A sequence of edits on a held-out benchmark where accuracy on earlier facts falls below the levels achieved by methods that retain past data, or where average accuracy does not exceed the best baseline by a measurable margin.

Figures

Figures reproduced from arXiv: 2606.19679 by Jennifer Dy, Masih Eskandar, Miquel Sirera Perell\'o, Stratis Ioannidis.

**Figure 2.** Figure 2: Average accuracy of various methods every 100 edits for 1000 sequential edits on the Llama-3-8B-Instruct and ZsRE dataset. LOKI suffers the least performance degradation over time across competitors. (less than 15% degradation). It is worth noting that methods with higher locality scores suffer from a minimum 47% degradation in perplexity for multiple edits. Similarly, in [PITH_FULL_IMAGE:figures/full_fig… view at source ↗

**Figure 3.** Figure 3: TSNE visualization of Llama-3-8BInstruct final layer embeddings before and after 1000 sequential edits on the ZsRE dataset. LOKI manages to avoid a significant shift in model embeddings without access to previous knowledge. Layer 4 5 6 7 8 Llama-3 W-Null (Ours) 2.42 3.19 2.56 2.40 2.42 Random 4.28 5.01 4.18 3.69 4.82 Mistral-7B W-Null (Ours) 2.88 2.85 2.56 2.41 2.79 Random 4.68 4.43 4.10 3.59 4.33 [PITH… view at source ↗

**Figure 4.** Figure 4: Visualization of the layer selection fre [PITH_FULL_IMAGE:figures/full_fig_p009_4.png] view at source ↗

**Figure 5.** Figure 5: Average edit time per method measured on an NVIDIA H200 GPU. We report the average [PITH_FULL_IMAGE:figures/full_fig_p014_5.png] view at source ↗

read the original abstract

Lifelong knowledge editing aims to efficiently and sequentially update language models over time, as new knowledge becomes available or when the model makes mistakes, while preserving acceptable performance on past knowledge. One unresolved challenge is that existing methods modify a fixed set of layers for all new knowledge samples, reducing flexibility and increasing catastrophic forgetting. Another is requiring access to previous knowledge and extensive pre-processing to obtain data statistics. To address these challenges, we introduce LOKI, a novel approach that uses dynamic layer selection based on the Hilbert-Schmidt Independence Criterion and projects gradient updates onto the null-space of the model weights, bypassing the requirement for previous knowledge access. We show that LOKI achieves superior performance to existing approaches across a wide variety of experiments, achieving up to a 14\% improvement in average accuracy.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

LOKI's abstract sketches a memory-free editing method via HSIC layer choice and null-space projection but supplies zero experimental details, so the 14% claim can't be assessed yet.

read the letter

The paper targets two practical headaches in lifelong knowledge editing for language models: methods that always edit the same layers and methods that need stored old data or precomputed statistics. LOKI picks layers on the fly with the Hilbert-Schmidt Independence Criterion and projects the gradient update into the null space of the current weights so no prior examples are required.

That combination is the actual novelty. It directly attacks the flexibility and memory issues the abstract flags, and the null-space step is a clean way to try to keep old knowledge untouched without replay.

The paper does a reasonable job naming the bottlenecks and sketching a solution that could scale better in applied settings. If the full experiments back it up, the approach would be worth testing in other editing pipelines.

The soft spot is obvious from the abstract alone: it states superior performance and up to 14% average accuracy gains but gives no baselines, no dataset sizes, no statistical tests, no ablation on the HSIC choice or the projection, and no numbers on how much forgetting actually occurs. Without those, the central claim that the projection plus dynamic selection prevents interference remains an untested assumption. The soundness score in the pith report is low for exactly this reason.

This is for people working on continual model updates who care about memory footprint. A reader already deep in the editing literature would get value from the specific technique once the experiments are visible.

I'd send it to peer review if the full paper contains reproducible tables and code; the idea is practical enough to deserve a proper check even if revisions are needed.

Referee Report

2 major / 2 minor

Summary. The paper proposes LOKI, a memory-free method for lifelong knowledge editing of language models. It dynamically selects layers using the Hilbert-Schmidt Independence Criterion (HSIC) and projects gradient updates onto the null-space of the model weights to enable sequential edits while preserving prior knowledge, without requiring access to previous data or pre-computed statistics. The central claim is that this yields superior performance, with up to 14% higher average accuracy than existing approaches across experiments.

Significance. If the experimental results hold, the contribution would be meaningful for practical lifelong editing: it removes the memory and preprocessing requirements that limit prior methods and adds flexibility via dynamic layer choice. The approach builds on standard tools (HSIC, null-space projection) rather than introducing new primitives, which lowers the barrier to adoption but also means the novelty lies primarily in the combination and the no-memory guarantee.

major comments (2)

[Abstract and §4] Abstract and §4 (Experiments): the claim of up to 14% average-accuracy improvement is presented without any mention of the number of runs, standard deviations, statistical tests, or the precise baselines and editing benchmarks used. This information is load-bearing for the performance claim and must be supplied with concrete numbers and controls.
[§3.2] §3.2 (Null-space projection): the argument that projection onto the null-space of current weights is sufficient to avoid interference with prior edits, even without any stored statistics, requires an explicit argument or bound showing that the projected update cannot alter the previously edited directions. The current description relies on the geometric intuition alone.

minor comments (2)

Define HSIC on first use and state the precise kernel and regularization choices used in the layer-selection step.
Add a short pseudocode block or explicit algorithm listing the per-edit steps (HSIC computation, null-space projection, update application) to improve reproducibility.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on our manuscript. We address the two major comments below and will incorporate revisions to strengthen the experimental reporting and theoretical justification.

read point-by-point responses

Referee: [Abstract and §4] Abstract and §4 (Experiments): the claim of up to 14% average-accuracy improvement is presented without any mention of the number of runs, standard deviations, statistical tests, or the precise baselines and editing benchmarks used. This information is load-bearing for the performance claim and must be supplied with concrete numbers and controls.

Authors: We agree that the performance claims require supporting statistical details for rigor. In the revised manuscript, we will update the abstract and Section 4 to specify the number of independent runs (e.g., 5), report mean accuracies with standard deviations, note any statistical significance tests performed, and explicitly detail the baselines (such as SERAC, MEND, and others) along with the editing benchmarks (ZsRE, CounterFact, etc.) used in each experiment. The reported 'up to 14%' figure represents the largest observed gain in average accuracy across the evaluated settings and will be contextualized with these controls. revision: yes
Referee: [§3.2] §3.2 (Null-space projection): the argument that projection onto the null-space of current weights is sufficient to avoid interference with prior edits, even without any stored statistics, requires an explicit argument or bound showing that the projected update cannot alter the previously edited directions. The current description relies on the geometric intuition alone.

Authors: We acknowledge the need for a more formal justification beyond geometric intuition. In the revision to §3.2, we will include an explicit argument and bound: because prior edits are already embedded in the current weight matrix W, any update ΔW projected onto the null space of W (i.e., satisfying W ΔW = 0) cannot modify the linear mappings established by previous edits. We will provide a short proof sketch showing that the projected update preserves the output behavior on previously edited inputs without requiring stored statistics, along with a bound on residual interference under standard assumptions on the weight matrix rank. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The derivation relies on standard external techniques (HSIC for layer selection and null-space projection of gradients) applied to the editing objective without re-deriving or fitting quantities from the same data in a self-referential loop. No self-citation chains, ansatz smuggling, or fitted-input-as-prediction patterns are present in the abstract or described method. The central claim of memory-free interference prevention is supported by the algorithmic construction rather than reducing to its own inputs by definition.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Review performed on abstract only; no explicit free parameters, axioms, or invented entities are described in the provided text.

pith-pipeline@v0.9.1-grok · 5674 in / 1069 out tokens · 23461 ms · 2026-06-26T18:44:40.727968+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

33 extracted references · 9 canonical work pages · 2 internal anchors

[1]

Correcting diverse factual errors in abstractive summarization via post-editing and language model infilling

Vidhisha Balachandran, Hannaneh Hajishirzi, William Cohen, and Yulia Tsvetkov. Correcting diverse factual errors in abstractive summarization via post-editing and language model infilling. InProceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 9818–9830, 2022

2022
[2]

Language models are few-shot learners

Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, et al. Language models are few-shot learners. InAdvances in Neural Information Processing Systems, volume 33, pages 1877–1901, 2020

1901
[3]

On lazy training in differentiable program- ming.Advances in neural information processing systems, 32, 2019

Lenaic Chizat, Edouard Oyallon, and Francis Bach. On lazy training in differentiable program- ming.Advances in neural information processing systems, 32, 2019

2019
[4]

Alphaedit: Null-space constrained knowledge editing for language models.arXiv preprint arXiv:2410.02355,

Junfeng Fang, Houcheng Jiang, Kun Wang, Yunshan Ma, Shi Jie, Xiang Wang, Xiangnan He, and Tat-Seng Chua. Alphaedit: Null-space constrained knowledge editing for language models. arXiv preprint arXiv:2410.02355, 2024

work page arXiv 2024
[5]

Orthogonal gradient descent for continual learning

Mehrdad Farajtabar, Navid Azizan, Alex Mott, and Ang Li. Orthogonal gradient descent for continual learning. InInternational Conference on Artificial Intelligence and Statistics, pages 3762–3773. PMLR, 2020

2020
[6]

Should chatgpt be biased? challenges and risks of bias in large language models

Emilio Ferrara. Should chatgpt be biased? challenges and risks of bias in large language models. arXiv preprint arXiv:2304.03738, 2023

work page arXiv 2023
[7]

The Pile: An 800GB Dataset of Diverse Text for Language Modeling

Leo Gao, Stella Biderman, Sid Black, Laurence Golding, Travis Hoppe, Charles Foster, Jason Phang, Horace He, Anish Thite, Noa Nabeshima, Shawn Presser, and Connor Leahy. The Pile: An 800GB dataset of diverse text for language modeling.arXiv preprint arXiv:2101.00027, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2020
[8]

The Llama 3 Herd of Models

Aaron Grattafiori, Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Alex Vaughan, et al. The llama 3 herd of models.arXiv preprint arXiv:2407.21783, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024
[9]

Measuring statistical dependence with Hilbert-Schmidt norms

Arthur Gretton, Olivier Bousquet, Alex Smola, and Bernhard Schölkopf. Measuring statistical dependence with Hilbert-Schmidt norms. InInternational Conference on Algorithmic Learning Theory, pages 63–77. Springer, 2005

2005
[10]

Towards lifelong model editing via simulating ideal editor

Yaming Guo, Siyang Guo, Hengshu Zhu, and Ying Sun. Towards lifelong model editing via simulating ideal editor. InForty-second International Conference on Machine Learning, 2025

2025
[11]

Aging with grace: Lifelong model editing with discrete key-value adaptors.Advances in Neural Information Processing Systems, 36:47934–47959, 2023

Tom Hartvigsen, Swami Sankaranarayanan, Hamid Palangi, Yoon Kim, and Marzyeh Ghassemi. Aging with grace: Lifelong model editing with discrete key-value adaptors.Advances in Neural Information Processing Systems, 36:47934–47959, 2023

2023
[12]

Model editing with canonical examples.arXiv preprint arXiv:2402.06155, 2024

John Hewitt, Sarah Chen, Lanruo Lora Xie, Edward Adams, Percy Liang, and Christopher D Manning. Model editing with canonical examples.arXiv preprint arXiv:2402.06155, 2024

work page arXiv 2024
[13]

Survey of hallucination in natural language generation

Ziwei Ji, Nayeon Lee, Rita Frieske, Tiezheng Yu, Dan Su, Yan Xu, Etsuko Ishii, Ye Jin Bang, Andrea Madotto, and Pascale Fung. Survey of hallucination in natural language generation. ACM computing surveys, 55(12):1–38, 2023

2023
[14]

Albert Q. Jiang, Alexandre Sablayrolles, Arthur Mensch, Chris Bamford, Devendra Singh Chaplot, Diego de las Casas, Florian Bressand, Gianna Lengyel, Guillaume Lample, Lucile Saulnier, Lélio Renard Lavaud, Marie-Anne Lachaux, Pierre Stock, Teven Le Scao, Thibaut Lavril, Thomas Wang, Timothée Lacroix, and William El Sayed. Mistral 7b, 2023. 10

2023
[15]

Zero-shot relation extraction via reading comprehension

Omer Levy, Minjoon Seo, Eunsol Choi, and Luke Zettlemoyer. Zero-shot relation extraction via reading comprehension. InProceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), pages 333–342, 2017

2017
[16]

Wan-Duo Kurt Ma, J. P. Lewis, and W. Bastiaan Kleijn. The HSIC bottleneck: Deep learning without back-propagation. InProceedings of the AAAI Conference on Artificial Intelligence, volume 34, pages 5085–5092, 2020

2020
[17]

SelfCheckGPT: Zero-resource black-box hallucination detection for generative large language models

Potsawee Manakul, Adian Liusie, and Mark Gales. SelfCheckGPT: Zero-resource black-box hallucination detection for generative large language models. InProceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 9004–9017, 2023

2023
[18]

Catastrophic interference in connectionist networks: The sequential learning problem

Michael McCloskey and Neal J Cohen. Catastrophic interference in connectionist networks: The sequential learning problem. InPsychology of learning and motivation, volume 24, pages 109–165. Elsevier, 1989

1989
[19]

Locating and editing factual associations in gpt.Advances in neural information processing systems, 35:17359–17372, 2022

Kevin Meng, David Bau, Alex Andonian, and Yonatan Belinkov. Locating and editing factual associations in gpt.Advances in neural information processing systems, 35:17359–17372, 2022

2022
[20]

Mass editing memory in a transformer.The Eleventh International Conference on Learning Represen- tations (ICLR), 2023

Kevin Meng, Arnab Sen Sharma, Alex Andonian, Yonatan Belinkov, and David Bau. Mass editing memory in a transformer.The Eleventh International Conference on Learning Represen- tations (ICLR), 2023

2023
[21]

Memory-based model editing at scale

Eric Mitchell, Charles Lin, Antoine Bosselut, Christopher D Manning, and Chelsea Finn. Memory-based model editing at scale. InInternational Conference on Machine Learning, pages 15817–15831. PMLR, 2022

2022
[22]

Gradient projection memory for continual learning

Gobinda Saha, Isha Garg, and Kaushik Roy. Gradient projection memory for continual learning. InInternational Conference on Learning Representations, 2021

2021
[23]

Editable neural networks.arXiv preprint arXiv:2004.00345, 2020

Anton Sinitsin, Vsevolod Plokhotnyuk, Dmitriy Pyrkin, Sergei Popov, and Artem Babenko. Editable neural networks.arXiv preprint arXiv:2004.00345, 2020

work page arXiv 2004
[24]

Deep learning and the information bottleneck principle

Naftali Tishby and Noga Zaslavsky. Deep learning and the information bottleneck principle. In IEEE Information Theory Workshop, pages 1–5. IEEE, 2015

2015
[25]

GPT-J-6B: A 6 Billion Parameter Autoregressive Language Model.https://github.com/kingoflolz/mesh-transformer-jax, May 2021

Ben Wang and Aran Komatsuzaki. GPT-J-6B: A 6 Billion Parameter Autoregressive Language Model.https://github.com/kingoflolz/mesh-transformer-jax, May 2021

2021
[26]

Wise: Rethinking the knowledge memory for lifelong model editing of large language models.Advances in Neural Information Processing Systems, 37:53764– 53797, 2024

Peng Wang, Zexi Li, Ningyu Zhang, Ziwen Xu, Yunzhi Yao, Yong Jiang, Pengjun Xie, Fei Huang, and Huajun Chen. Wise: Rethinking the knowledge memory for lifelong model editing of large language models.Advances in Neural Information Processing Systems, 37:53764– 53797, 2024

2024
[27]

Easyedit: An easy-to-use knowledge editing framework for large language models.arXiv preprint arXiv:2308.07269, 2023

Peng Wang, Ningyu Zhang, Xin Xie, Yunzhi Yao, Bozhong Tian, Mengru Wang, Zekun Xi, Siyuan Cheng, Kangwei Liu, Guozhou Zheng, et al. Easyedit: An easy-to-use knowledge editing framework for large language models.arXiv preprint arXiv:2308.07269, 2023

work page arXiv 2023
[28]

Editing large language models: Problems, methods, and opportunities

Yunzhi Yao, Peng Wang, Bozhong Tian, Siyuan Cheng, Zhoubo Li, Shumin Deng, Huajun Chen, and Ningyu Zhang. Editing large language models: Problems, methods, and opportunities. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 10222–10240, 2023

2023
[29]

Zhang, Y

Ningyu Zhang, Yunzhi Yao, Bozhong Tian, Peng Wang, Shumin Deng, Mengru Wang, Zekun Xi, Shengyu Mao, Jintian Zhang, Yuansheng Ni, et al. A comprehensive study of knowledge editing for large language models.arXiv preprint arXiv:2401.01286, 2024

work page arXiv 2024
[30]

Adaptive budget allocation for parameter-efficient fine-tuning

Qingru Zhang, Minshuo Chen, Alexander Bukharin, Pengcheng He, Yu Cheng, Weizhu Chen, and Tuo Zhao. Adaptive budget allocation for parameter-efficient fine-tuning. InThe Eleventh International Conference on Learning Representations, 2023

2023
[31]

Can we edit factual knowledge by in-context learning?arXiv preprint arXiv:2305.12740, 2023

Ce Zheng, Lei Li, Qingxiu Dong, Yuxuan Fan, Zhiyong Wu, Jingjing Xu, and Baobao Chang. Can we edit factual knowledge by in-context learning?arXiv preprint arXiv:2305.12740, 2023. 11

work page arXiv 2023
[32]

subject is a {insert subject}

Zexuan Zhong, Zhengxuan Wu, Christopher D Manning, Christopher Potts, and Danqi Chen. MQuAKE: Assessing knowledge editing in language models via multi-hop questions. InThe 2023 Conference on Empirical Methods in Natural Language Processing, 2023. 12 A HSIC Definition Let F and G be reproducing kernel Hilbert spaces (RKHSs) on domains X and Y with associat...

2023
[33]

What was the date of Air France Flight 447?

prompts generated by the model from scratch, to improve generalization. The resulting inputs are then treated as a batch and used in the loss terms instead of Xt by itself. That is ¯Xt = [[G1, Xt],[G 2, Xt], ...], whereG is are generated by the LLM from scratch. 15 Table 7: LOKI hyperparameters for each benchmark and backbone reported in the main results....

2009

[1] [1]

Correcting diverse factual errors in abstractive summarization via post-editing and language model infilling

Vidhisha Balachandran, Hannaneh Hajishirzi, William Cohen, and Yulia Tsvetkov. Correcting diverse factual errors in abstractive summarization via post-editing and language model infilling. InProceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 9818–9830, 2022

2022

[2] [2]

Language models are few-shot learners

Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, et al. Language models are few-shot learners. InAdvances in Neural Information Processing Systems, volume 33, pages 1877–1901, 2020

1901

[3] [3]

On lazy training in differentiable program- ming.Advances in neural information processing systems, 32, 2019

Lenaic Chizat, Edouard Oyallon, and Francis Bach. On lazy training in differentiable program- ming.Advances in neural information processing systems, 32, 2019

2019

[4] [4]

Alphaedit: Null-space constrained knowledge editing for language models.arXiv preprint arXiv:2410.02355,

Junfeng Fang, Houcheng Jiang, Kun Wang, Yunshan Ma, Shi Jie, Xiang Wang, Xiangnan He, and Tat-Seng Chua. Alphaedit: Null-space constrained knowledge editing for language models. arXiv preprint arXiv:2410.02355, 2024

work page arXiv 2024

[5] [5]

Orthogonal gradient descent for continual learning

Mehrdad Farajtabar, Navid Azizan, Alex Mott, and Ang Li. Orthogonal gradient descent for continual learning. InInternational Conference on Artificial Intelligence and Statistics, pages 3762–3773. PMLR, 2020

2020

[6] [6]

Should chatgpt be biased? challenges and risks of bias in large language models

Emilio Ferrara. Should chatgpt be biased? challenges and risks of bias in large language models. arXiv preprint arXiv:2304.03738, 2023

work page arXiv 2023

[7] [7]

The Pile: An 800GB Dataset of Diverse Text for Language Modeling

Leo Gao, Stella Biderman, Sid Black, Laurence Golding, Travis Hoppe, Charles Foster, Jason Phang, Horace He, Anish Thite, Noa Nabeshima, Shawn Presser, and Connor Leahy. The Pile: An 800GB dataset of diverse text for language modeling.arXiv preprint arXiv:2101.00027, 2020

work page internal anchor Pith review Pith/arXiv arXiv 2020

[8] [8]

The Llama 3 Herd of Models

Aaron Grattafiori, Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Alex Vaughan, et al. The llama 3 herd of models.arXiv preprint arXiv:2407.21783, 2024

work page internal anchor Pith review Pith/arXiv arXiv 2024

[9] [9]

Measuring statistical dependence with Hilbert-Schmidt norms

Arthur Gretton, Olivier Bousquet, Alex Smola, and Bernhard Schölkopf. Measuring statistical dependence with Hilbert-Schmidt norms. InInternational Conference on Algorithmic Learning Theory, pages 63–77. Springer, 2005

2005

[10] [10]

Towards lifelong model editing via simulating ideal editor

Yaming Guo, Siyang Guo, Hengshu Zhu, and Ying Sun. Towards lifelong model editing via simulating ideal editor. InForty-second International Conference on Machine Learning, 2025

2025

[11] [11]

Aging with grace: Lifelong model editing with discrete key-value adaptors.Advances in Neural Information Processing Systems, 36:47934–47959, 2023

Tom Hartvigsen, Swami Sankaranarayanan, Hamid Palangi, Yoon Kim, and Marzyeh Ghassemi. Aging with grace: Lifelong model editing with discrete key-value adaptors.Advances in Neural Information Processing Systems, 36:47934–47959, 2023

2023

[12] [12]

Model editing with canonical examples.arXiv preprint arXiv:2402.06155, 2024

John Hewitt, Sarah Chen, Lanruo Lora Xie, Edward Adams, Percy Liang, and Christopher D Manning. Model editing with canonical examples.arXiv preprint arXiv:2402.06155, 2024

work page arXiv 2024

[13] [13]

Survey of hallucination in natural language generation

Ziwei Ji, Nayeon Lee, Rita Frieske, Tiezheng Yu, Dan Su, Yan Xu, Etsuko Ishii, Ye Jin Bang, Andrea Madotto, and Pascale Fung. Survey of hallucination in natural language generation. ACM computing surveys, 55(12):1–38, 2023

2023

[14] [14]

Albert Q. Jiang, Alexandre Sablayrolles, Arthur Mensch, Chris Bamford, Devendra Singh Chaplot, Diego de las Casas, Florian Bressand, Gianna Lengyel, Guillaume Lample, Lucile Saulnier, Lélio Renard Lavaud, Marie-Anne Lachaux, Pierre Stock, Teven Le Scao, Thibaut Lavril, Thomas Wang, Timothée Lacroix, and William El Sayed. Mistral 7b, 2023. 10

2023

[15] [15]

Zero-shot relation extraction via reading comprehension

Omer Levy, Minjoon Seo, Eunsol Choi, and Luke Zettlemoyer. Zero-shot relation extraction via reading comprehension. InProceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), pages 333–342, 2017

2017

[16] [16]

Wan-Duo Kurt Ma, J. P. Lewis, and W. Bastiaan Kleijn. The HSIC bottleneck: Deep learning without back-propagation. InProceedings of the AAAI Conference on Artificial Intelligence, volume 34, pages 5085–5092, 2020

2020

[17] [17]

SelfCheckGPT: Zero-resource black-box hallucination detection for generative large language models

Potsawee Manakul, Adian Liusie, and Mark Gales. SelfCheckGPT: Zero-resource black-box hallucination detection for generative large language models. InProceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 9004–9017, 2023

2023

[18] [18]

Catastrophic interference in connectionist networks: The sequential learning problem

Michael McCloskey and Neal J Cohen. Catastrophic interference in connectionist networks: The sequential learning problem. InPsychology of learning and motivation, volume 24, pages 109–165. Elsevier, 1989

1989

[19] [19]

Locating and editing factual associations in gpt.Advances in neural information processing systems, 35:17359–17372, 2022

Kevin Meng, David Bau, Alex Andonian, and Yonatan Belinkov. Locating and editing factual associations in gpt.Advances in neural information processing systems, 35:17359–17372, 2022

2022

[20] [20]

Mass editing memory in a transformer.The Eleventh International Conference on Learning Represen- tations (ICLR), 2023

Kevin Meng, Arnab Sen Sharma, Alex Andonian, Yonatan Belinkov, and David Bau. Mass editing memory in a transformer.The Eleventh International Conference on Learning Represen- tations (ICLR), 2023

2023

[21] [21]

Memory-based model editing at scale

Eric Mitchell, Charles Lin, Antoine Bosselut, Christopher D Manning, and Chelsea Finn. Memory-based model editing at scale. InInternational Conference on Machine Learning, pages 15817–15831. PMLR, 2022

2022

[22] [22]

Gradient projection memory for continual learning

Gobinda Saha, Isha Garg, and Kaushik Roy. Gradient projection memory for continual learning. InInternational Conference on Learning Representations, 2021

2021

[23] [23]

Editable neural networks.arXiv preprint arXiv:2004.00345, 2020

Anton Sinitsin, Vsevolod Plokhotnyuk, Dmitriy Pyrkin, Sergei Popov, and Artem Babenko. Editable neural networks.arXiv preprint arXiv:2004.00345, 2020

work page arXiv 2004

[24] [24]

Deep learning and the information bottleneck principle

Naftali Tishby and Noga Zaslavsky. Deep learning and the information bottleneck principle. In IEEE Information Theory Workshop, pages 1–5. IEEE, 2015

2015

[25] [25]

GPT-J-6B: A 6 Billion Parameter Autoregressive Language Model.https://github.com/kingoflolz/mesh-transformer-jax, May 2021

Ben Wang and Aran Komatsuzaki. GPT-J-6B: A 6 Billion Parameter Autoregressive Language Model.https://github.com/kingoflolz/mesh-transformer-jax, May 2021

2021

[26] [26]

Wise: Rethinking the knowledge memory for lifelong model editing of large language models.Advances in Neural Information Processing Systems, 37:53764– 53797, 2024

Peng Wang, Zexi Li, Ningyu Zhang, Ziwen Xu, Yunzhi Yao, Yong Jiang, Pengjun Xie, Fei Huang, and Huajun Chen. Wise: Rethinking the knowledge memory for lifelong model editing of large language models.Advances in Neural Information Processing Systems, 37:53764– 53797, 2024

2024

[27] [27]

Easyedit: An easy-to-use knowledge editing framework for large language models.arXiv preprint arXiv:2308.07269, 2023

Peng Wang, Ningyu Zhang, Xin Xie, Yunzhi Yao, Bozhong Tian, Mengru Wang, Zekun Xi, Siyuan Cheng, Kangwei Liu, Guozhou Zheng, et al. Easyedit: An easy-to-use knowledge editing framework for large language models.arXiv preprint arXiv:2308.07269, 2023

work page arXiv 2023

[28] [28]

Editing large language models: Problems, methods, and opportunities

Yunzhi Yao, Peng Wang, Bozhong Tian, Siyuan Cheng, Zhoubo Li, Shumin Deng, Huajun Chen, and Ningyu Zhang. Editing large language models: Problems, methods, and opportunities. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 10222–10240, 2023

2023

[29] [29]

Zhang, Y

Ningyu Zhang, Yunzhi Yao, Bozhong Tian, Peng Wang, Shumin Deng, Mengru Wang, Zekun Xi, Shengyu Mao, Jintian Zhang, Yuansheng Ni, et al. A comprehensive study of knowledge editing for large language models.arXiv preprint arXiv:2401.01286, 2024

work page arXiv 2024

[30] [30]

Adaptive budget allocation for parameter-efficient fine-tuning

Qingru Zhang, Minshuo Chen, Alexander Bukharin, Pengcheng He, Yu Cheng, Weizhu Chen, and Tuo Zhao. Adaptive budget allocation for parameter-efficient fine-tuning. InThe Eleventh International Conference on Learning Representations, 2023

2023

[31] [31]

Can we edit factual knowledge by in-context learning?arXiv preprint arXiv:2305.12740, 2023

Ce Zheng, Lei Li, Qingxiu Dong, Yuxuan Fan, Zhiyong Wu, Jingjing Xu, and Baobao Chang. Can we edit factual knowledge by in-context learning?arXiv preprint arXiv:2305.12740, 2023. 11

work page arXiv 2023

[32] [32]

subject is a {insert subject}

Zexuan Zhong, Zhengxuan Wu, Christopher D Manning, Christopher Potts, and Danqi Chen. MQuAKE: Assessing knowledge editing in language models via multi-hop questions. InThe 2023 Conference on Empirical Methods in Natural Language Processing, 2023. 12 A HSIC Definition Let F and G be reproducing kernel Hilbert spaces (RKHSs) on domains X and Y with associat...

2023

[33] [33]

What was the date of Air France Flight 447?

prompts generated by the model from scratch, to improve generalization. The resulting inputs are then treated as a batch and used in the loss terms instead of Xt by itself. That is ¯Xt = [[G1, Xt],[G 2, Xt], ...], whereG is are generated by the LLM from scratch. 15 Table 7: LOKI hyperparameters for each benchmark and backbone reported in the main results....

2009