pith. sign in

arxiv: 2606.19679 · v1 · pith:7R6BR6XAnew · submitted 2026-06-18 · 💻 cs.LG · cs.AI

LOKI: Memory-Free Null-Space Constrained Lifelong Knowledge Editing

Pith reviewed 2026-06-26 18:44 UTC · model grok-4.3

classification 💻 cs.LG cs.AI
keywords lifelong knowledge editingnull-space projectiondynamic layer selectionHilbert-Schmidt Independence Criterionlanguage modelscatastrophic forgettinggradient projection
0
0 comments X

The pith

LOKI updates language models sequentially with new knowledge by selecting layers dynamically via HSIC and projecting gradients into the null-space of weights without any prior data access.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper presents LOKI as a method for lifelong knowledge editing that avoids two common limitations in current approaches. Fixed layer choices across edits reduce flexibility and raise forgetting, while many techniques need stored past examples or precomputed statistics. LOKI instead picks layers on the fly using the Hilbert-Schmidt Independence Criterion and forces each gradient update to lie in the null-space of the current model weights. Experiments across benchmarks report higher average accuracy than baselines, with gains reaching 14 percent. Readers would care because the approach makes repeated model corrections feasible in settings where historical data cannot be retained.

Core claim

LOKI achieves superior performance to existing approaches across a wide variety of experiments, achieving up to a 14% improvement in average accuracy, by using HSIC-based dynamic layer selection and projecting gradient updates onto the null-space of the model weights, bypassing the requirement for previous knowledge access.

What carries the argument

Dynamic layer selection via the Hilbert-Schmidt Independence Criterion together with projection of gradient updates onto the null-space of the model weights.

If this is right

  • Sequential edits can be performed without storing or replaying any previous knowledge samples.
  • Different knowledge updates can modify different layers, avoiding the rigidity of fixed-layer schemes.
  • Gradient steps remain orthogonal to existing weight directions, limiting unintended changes to prior edits.
  • The method supports editing under strict privacy or storage constraints that bar access to historical data.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • If null-space projections remain effective over dozens of successive edits, the approach could support very long editing histories without explicit regularization terms.
  • HSIC-based selection might be replaced by cheaper proxies such as activation variance if the independence criterion proves expensive at scale.
  • The same null-space idea could be tested on non-transformer architectures where weight matrices have different spectral properties.
  • One could measure whether the chosen layers correlate with attention heads that encode the edited facts, providing a post-hoc interpretability check.

Load-bearing premise

That projecting gradient updates onto the null-space of model weights, combined with HSIC-based dynamic layer selection, sufficiently prevents interference with previously edited knowledge even without access to any prior data or statistics.

What would settle it

A sequence of edits on a held-out benchmark where accuracy on earlier facts falls below the levels achieved by methods that retain past data, or where average accuracy does not exceed the best baseline by a measurable margin.

Figures

Figures reproduced from arXiv: 2606.19679 by Jennifer Dy, Masih Eskandar, Miquel Sirera Perell\'o, Stratis Ioannidis.

Figure 1
Figure 1. Figure 1: (a) Overview of an LLM, (b) the lifelong knowledge editing pipeline for existing works, [PITH_FULL_IMAGE:figures/full_fig_p002_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Average accuracy of various methods every 100 edits for 1000 sequential edits on the Llama-3-8B-Instruct and ZsRE dataset. LOKI suffers the least performance degradation over time across competitors. (less than 15% degradation). It is worth noting that methods with higher locality scores suffer from a minimum 47% degradation in perplexity for multiple edits. Similarly, in [PITH_FULL_IMAGE:figures/full_fig… view at source ↗
Figure 3
Figure 3. Figure 3: TSNE visualization of Llama-3-8B￾Instruct final layer embeddings before and after 1000 sequential edits on the ZsRE dataset. LOKI manages to avoid a significant shift in model em￾beddings without access to previous knowledge. Layer 4 5 6 7 8 Llama-3 W-Null (Ours) 2.42 3.19 2.56 2.40 2.42 Random 4.28 5.01 4.18 3.69 4.82 Mistral-7B W-Null (Ours) 2.88 2.85 2.56 2.41 2.79 Random 4.68 4.43 4.10 3.59 4.33 [PITH… view at source ↗
Figure 4
Figure 4. Figure 4: Visualization of the layer selection fre [PITH_FULL_IMAGE:figures/full_fig_p009_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Average edit time per method measured on an NVIDIA H200 GPU. We report the average [PITH_FULL_IMAGE:figures/full_fig_p014_5.png] view at source ↗
read the original abstract

Lifelong knowledge editing aims to efficiently and sequentially update language models over time, as new knowledge becomes available or when the model makes mistakes, while preserving acceptable performance on past knowledge. One unresolved challenge is that existing methods modify a fixed set of layers for all new knowledge samples, reducing flexibility and increasing catastrophic forgetting. Another is requiring access to previous knowledge and extensive pre-processing to obtain data statistics. To address these challenges, we introduce LOKI, a novel approach that uses dynamic layer selection based on the Hilbert-Schmidt Independence Criterion and projects gradient updates onto the null-space of the model weights, bypassing the requirement for previous knowledge access. We show that LOKI achieves superior performance to existing approaches across a wide variety of experiments, achieving up to a 14\% improvement in average accuracy.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 2 minor

Summary. The paper proposes LOKI, a memory-free method for lifelong knowledge editing of language models. It dynamically selects layers using the Hilbert-Schmidt Independence Criterion (HSIC) and projects gradient updates onto the null-space of the model weights to enable sequential edits while preserving prior knowledge, without requiring access to previous data or pre-computed statistics. The central claim is that this yields superior performance, with up to 14% higher average accuracy than existing approaches across experiments.

Significance. If the experimental results hold, the contribution would be meaningful for practical lifelong editing: it removes the memory and preprocessing requirements that limit prior methods and adds flexibility via dynamic layer choice. The approach builds on standard tools (HSIC, null-space projection) rather than introducing new primitives, which lowers the barrier to adoption but also means the novelty lies primarily in the combination and the no-memory guarantee.

major comments (2)
  1. [Abstract and §4] Abstract and §4 (Experiments): the claim of up to 14% average-accuracy improvement is presented without any mention of the number of runs, standard deviations, statistical tests, or the precise baselines and editing benchmarks used. This information is load-bearing for the performance claim and must be supplied with concrete numbers and controls.
  2. [§3.2] §3.2 (Null-space projection): the argument that projection onto the null-space of current weights is sufficient to avoid interference with prior edits, even without any stored statistics, requires an explicit argument or bound showing that the projected update cannot alter the previously edited directions. The current description relies on the geometric intuition alone.
minor comments (2)
  1. Define HSIC on first use and state the precise kernel and regularization choices used in the layer-selection step.
  2. Add a short pseudocode block or explicit algorithm listing the per-edit steps (HSIC computation, null-space projection, update application) to improve reproducibility.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on our manuscript. We address the two major comments below and will incorporate revisions to strengthen the experimental reporting and theoretical justification.

read point-by-point responses
  1. Referee: [Abstract and §4] Abstract and §4 (Experiments): the claim of up to 14% average-accuracy improvement is presented without any mention of the number of runs, standard deviations, statistical tests, or the precise baselines and editing benchmarks used. This information is load-bearing for the performance claim and must be supplied with concrete numbers and controls.

    Authors: We agree that the performance claims require supporting statistical details for rigor. In the revised manuscript, we will update the abstract and Section 4 to specify the number of independent runs (e.g., 5), report mean accuracies with standard deviations, note any statistical significance tests performed, and explicitly detail the baselines (such as SERAC, MEND, and others) along with the editing benchmarks (ZsRE, CounterFact, etc.) used in each experiment. The reported 'up to 14%' figure represents the largest observed gain in average accuracy across the evaluated settings and will be contextualized with these controls. revision: yes

  2. Referee: [§3.2] §3.2 (Null-space projection): the argument that projection onto the null-space of current weights is sufficient to avoid interference with prior edits, even without any stored statistics, requires an explicit argument or bound showing that the projected update cannot alter the previously edited directions. The current description relies on the geometric intuition alone.

    Authors: We acknowledge the need for a more formal justification beyond geometric intuition. In the revision to §3.2, we will include an explicit argument and bound: because prior edits are already embedded in the current weight matrix W, any update ΔW projected onto the null space of W (i.e., satisfying W ΔW = 0) cannot modify the linear mappings established by previous edits. We will provide a short proof sketch showing that the projected update preserves the output behavior on previously edited inputs without requiring stored statistics, along with a bound on residual interference under standard assumptions on the weight matrix rank. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The derivation relies on standard external techniques (HSIC for layer selection and null-space projection of gradients) applied to the editing objective without re-deriving or fitting quantities from the same data in a self-referential loop. No self-citation chains, ansatz smuggling, or fitted-input-as-prediction patterns are present in the abstract or described method. The central claim of memory-free interference prevention is supported by the algorithmic construction rather than reducing to its own inputs by definition.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Review performed on abstract only; no explicit free parameters, axioms, or invented entities are described in the provided text.

pith-pipeline@v0.9.1-grok · 5674 in / 1069 out tokens · 23461 ms · 2026-06-26T18:44:40.727968+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

33 extracted references · 9 canonical work pages · 2 internal anchors

  1. [1]

    Correcting diverse factual errors in abstractive summarization via post-editing and language model infilling

    Vidhisha Balachandran, Hannaneh Hajishirzi, William Cohen, and Yulia Tsvetkov. Correcting diverse factual errors in abstractive summarization via post-editing and language model infilling. InProceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, pages 9818–9830, 2022

  2. [2]

    Language models are few-shot learners

    Tom Brown, Benjamin Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, et al. Language models are few-shot learners. InAdvances in Neural Information Processing Systems, volume 33, pages 1877–1901, 2020

  3. [3]

    On lazy training in differentiable program- ming.Advances in neural information processing systems, 32, 2019

    Lenaic Chizat, Edouard Oyallon, and Francis Bach. On lazy training in differentiable program- ming.Advances in neural information processing systems, 32, 2019

  4. [4]

    Alphaedit: Null-space constrained knowledge editing for language models.arXiv preprint arXiv:2410.02355,

    Junfeng Fang, Houcheng Jiang, Kun Wang, Yunshan Ma, Shi Jie, Xiang Wang, Xiangnan He, and Tat-Seng Chua. Alphaedit: Null-space constrained knowledge editing for language models. arXiv preprint arXiv:2410.02355, 2024

  5. [5]

    Orthogonal gradient descent for continual learning

    Mehrdad Farajtabar, Navid Azizan, Alex Mott, and Ang Li. Orthogonal gradient descent for continual learning. InInternational Conference on Artificial Intelligence and Statistics, pages 3762–3773. PMLR, 2020

  6. [6]

    Should chatgpt be biased? challenges and risks of bias in large language models

    Emilio Ferrara. Should chatgpt be biased? challenges and risks of bias in large language models. arXiv preprint arXiv:2304.03738, 2023

  7. [7]

    The Pile: An 800GB Dataset of Diverse Text for Language Modeling

    Leo Gao, Stella Biderman, Sid Black, Laurence Golding, Travis Hoppe, Charles Foster, Jason Phang, Horace He, Anish Thite, Noa Nabeshima, Shawn Presser, and Connor Leahy. The Pile: An 800GB dataset of diverse text for language modeling.arXiv preprint arXiv:2101.00027, 2020

  8. [8]

    The Llama 3 Herd of Models

    Aaron Grattafiori, Abhimanyu Dubey, Abhinav Jauhri, Abhinav Pandey, Abhishek Kadian, Ahmad Al-Dahle, Aiesha Letman, Akhil Mathur, Alan Schelten, Alex Vaughan, et al. The llama 3 herd of models.arXiv preprint arXiv:2407.21783, 2024

  9. [9]

    Measuring statistical dependence with Hilbert-Schmidt norms

    Arthur Gretton, Olivier Bousquet, Alex Smola, and Bernhard Schölkopf. Measuring statistical dependence with Hilbert-Schmidt norms. InInternational Conference on Algorithmic Learning Theory, pages 63–77. Springer, 2005

  10. [10]

    Towards lifelong model editing via simulating ideal editor

    Yaming Guo, Siyang Guo, Hengshu Zhu, and Ying Sun. Towards lifelong model editing via simulating ideal editor. InForty-second International Conference on Machine Learning, 2025

  11. [11]

    Aging with grace: Lifelong model editing with discrete key-value adaptors.Advances in Neural Information Processing Systems, 36:47934–47959, 2023

    Tom Hartvigsen, Swami Sankaranarayanan, Hamid Palangi, Yoon Kim, and Marzyeh Ghassemi. Aging with grace: Lifelong model editing with discrete key-value adaptors.Advances in Neural Information Processing Systems, 36:47934–47959, 2023

  12. [12]

    Model editing with canonical examples.arXiv preprint arXiv:2402.06155, 2024

    John Hewitt, Sarah Chen, Lanruo Lora Xie, Edward Adams, Percy Liang, and Christopher D Manning. Model editing with canonical examples.arXiv preprint arXiv:2402.06155, 2024

  13. [13]

    Survey of hallucination in natural language generation

    Ziwei Ji, Nayeon Lee, Rita Frieske, Tiezheng Yu, Dan Su, Yan Xu, Etsuko Ishii, Ye Jin Bang, Andrea Madotto, and Pascale Fung. Survey of hallucination in natural language generation. ACM computing surveys, 55(12):1–38, 2023

  14. [14]

    Albert Q. Jiang, Alexandre Sablayrolles, Arthur Mensch, Chris Bamford, Devendra Singh Chaplot, Diego de las Casas, Florian Bressand, Gianna Lengyel, Guillaume Lample, Lucile Saulnier, Lélio Renard Lavaud, Marie-Anne Lachaux, Pierre Stock, Teven Le Scao, Thibaut Lavril, Thomas Wang, Timothée Lacroix, and William El Sayed. Mistral 7b, 2023. 10

  15. [15]

    Zero-shot relation extraction via reading comprehension

    Omer Levy, Minjoon Seo, Eunsol Choi, and Luke Zettlemoyer. Zero-shot relation extraction via reading comprehension. InProceedings of the 21st Conference on Computational Natural Language Learning (CoNLL 2017), pages 333–342, 2017

  16. [16]

    Wan-Duo Kurt Ma, J. P. Lewis, and W. Bastiaan Kleijn. The HSIC bottleneck: Deep learning without back-propagation. InProceedings of the AAAI Conference on Artificial Intelligence, volume 34, pages 5085–5092, 2020

  17. [17]

    SelfCheckGPT: Zero-resource black-box hallucination detection for generative large language models

    Potsawee Manakul, Adian Liusie, and Mark Gales. SelfCheckGPT: Zero-resource black-box hallucination detection for generative large language models. InProceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 9004–9017, 2023

  18. [18]

    Catastrophic interference in connectionist networks: The sequential learning problem

    Michael McCloskey and Neal J Cohen. Catastrophic interference in connectionist networks: The sequential learning problem. InPsychology of learning and motivation, volume 24, pages 109–165. Elsevier, 1989

  19. [19]

    Locating and editing factual associations in gpt.Advances in neural information processing systems, 35:17359–17372, 2022

    Kevin Meng, David Bau, Alex Andonian, and Yonatan Belinkov. Locating and editing factual associations in gpt.Advances in neural information processing systems, 35:17359–17372, 2022

  20. [20]

    Mass editing memory in a transformer.The Eleventh International Conference on Learning Represen- tations (ICLR), 2023

    Kevin Meng, Arnab Sen Sharma, Alex Andonian, Yonatan Belinkov, and David Bau. Mass editing memory in a transformer.The Eleventh International Conference on Learning Represen- tations (ICLR), 2023

  21. [21]

    Memory-based model editing at scale

    Eric Mitchell, Charles Lin, Antoine Bosselut, Christopher D Manning, and Chelsea Finn. Memory-based model editing at scale. InInternational Conference on Machine Learning, pages 15817–15831. PMLR, 2022

  22. [22]

    Gradient projection memory for continual learning

    Gobinda Saha, Isha Garg, and Kaushik Roy. Gradient projection memory for continual learning. InInternational Conference on Learning Representations, 2021

  23. [23]

    Editable neural networks.arXiv preprint arXiv:2004.00345, 2020

    Anton Sinitsin, Vsevolod Plokhotnyuk, Dmitriy Pyrkin, Sergei Popov, and Artem Babenko. Editable neural networks.arXiv preprint arXiv:2004.00345, 2020

  24. [24]

    Deep learning and the information bottleneck principle

    Naftali Tishby and Noga Zaslavsky. Deep learning and the information bottleneck principle. In IEEE Information Theory Workshop, pages 1–5. IEEE, 2015

  25. [25]

    GPT-J-6B: A 6 Billion Parameter Autoregressive Language Model.https://github.com/kingoflolz/mesh-transformer-jax, May 2021

    Ben Wang and Aran Komatsuzaki. GPT-J-6B: A 6 Billion Parameter Autoregressive Language Model.https://github.com/kingoflolz/mesh-transformer-jax, May 2021

  26. [26]

    Wise: Rethinking the knowledge memory for lifelong model editing of large language models.Advances in Neural Information Processing Systems, 37:53764– 53797, 2024

    Peng Wang, Zexi Li, Ningyu Zhang, Ziwen Xu, Yunzhi Yao, Yong Jiang, Pengjun Xie, Fei Huang, and Huajun Chen. Wise: Rethinking the knowledge memory for lifelong model editing of large language models.Advances in Neural Information Processing Systems, 37:53764– 53797, 2024

  27. [27]

    Easyedit: An easy-to-use knowledge editing framework for large language models.arXiv preprint arXiv:2308.07269, 2023

    Peng Wang, Ningyu Zhang, Xin Xie, Yunzhi Yao, Bozhong Tian, Mengru Wang, Zekun Xi, Siyuan Cheng, Kangwei Liu, Guozhou Zheng, et al. Easyedit: An easy-to-use knowledge editing framework for large language models.arXiv preprint arXiv:2308.07269, 2023

  28. [28]

    Editing large language models: Problems, methods, and opportunities

    Yunzhi Yao, Peng Wang, Bozhong Tian, Siyuan Cheng, Zhoubo Li, Shumin Deng, Huajun Chen, and Ningyu Zhang. Editing large language models: Problems, methods, and opportunities. In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 10222–10240, 2023

  29. [29]

    Zhang, Y

    Ningyu Zhang, Yunzhi Yao, Bozhong Tian, Peng Wang, Shumin Deng, Mengru Wang, Zekun Xi, Shengyu Mao, Jintian Zhang, Yuansheng Ni, et al. A comprehensive study of knowledge editing for large language models.arXiv preprint arXiv:2401.01286, 2024

  30. [30]

    Adaptive budget allocation for parameter-efficient fine-tuning

    Qingru Zhang, Minshuo Chen, Alexander Bukharin, Pengcheng He, Yu Cheng, Weizhu Chen, and Tuo Zhao. Adaptive budget allocation for parameter-efficient fine-tuning. InThe Eleventh International Conference on Learning Representations, 2023

  31. [31]

    Can we edit factual knowledge by in-context learning?arXiv preprint arXiv:2305.12740, 2023

    Ce Zheng, Lei Li, Qingxiu Dong, Yuxuan Fan, Zhiyong Wu, Jingjing Xu, and Baobao Chang. Can we edit factual knowledge by in-context learning?arXiv preprint arXiv:2305.12740, 2023. 11

  32. [32]

    subject is a {insert subject}

    Zexuan Zhong, Zhengxuan Wu, Christopher D Manning, Christopher Potts, and Danqi Chen. MQuAKE: Assessing knowledge editing in language models via multi-hop questions. InThe 2023 Conference on Empirical Methods in Natural Language Processing, 2023. 12 A HSIC Definition Let F and G be reproducing kernel Hilbert spaces (RKHSs) on domains X and Y with associat...

  33. [33]

    What was the date of Air France Flight 447?

    prompts generated by the model from scratch, to improve generalization. The resulting inputs are then treated as a batch and used in the loss terms instead of Xt by itself. That is ¯Xt = [[G1, Xt],[G 2, Xt], ...], whereG is are generated by the LLM from scratch. 15 Table 7: LOKI hyperparameters for each benchmark and backbone reported in the main results....