Orthogonal Representation Editing: Decoupling Semantic Entanglement in Batch Knowledge Editing of LLMs

Bo Lv; Fangyin Ma; KaiWen Wei; Nayu Liu; Shihao Yang; Wenhao Yu; Zhicong Lu

arxiv: 2606.22627 · v1 · pith:BRN7ELWGnew · submitted 2026-06-21 · 💻 cs.CL · cs.AI

Pith reviewed 2026-06-26 10:10 UTC · model grok-4.3

keywords: knowledge editing · large language models · batch editing · representation editing · semantic entanglement · orthogonal constraints · LLM updates

The pith

Orthogonal constraints on edit vectors in hidden representations decouple semantic entanglement for batch LLM knowledge editing.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper claims that overlapping concepts and shared patterns create accumulating interference in LLM representation space, which lowers precision when multiple facts are edited together. It addresses this by moving edits into the hidden representation space, building a general semantic subspace, and forcing the edit vectors to remain orthogonal to one another. A gated non-linear representation head is added so the model can learn suitable editing locations and control the injection of new knowledge. Experiments indicate the method raises editing success rates over prior batch approaches and works better across languages. If the account is accurate, simultaneous updates to many facts could become more reliable without full retraining or broad capability loss.

Core claim

ORE performs edits in the hidden representation space of LLMs by constructing a general semantic subspace and enforcing orthogonal constraints on edit vectors, effectively decoupling semantic entanglement, and introduces a gated non-linear representation head for adaptive editing locations and precise control over knowledge injection.

What carries the argument

Orthogonal constraints on edit vectors inside a constructed general semantic subspace of the LLM hidden representations, which separate overlapping semantic signals.
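As a rough illustration of what mutually orthogonal edit vectors look like, the sketch below applies Gram-Schmidt orthogonalization to a small set of vectors. This is an assumption made for illustration only: the summary does not say how ORE enforces its constraint (it may well be a soft penalty during training), and the names `dot` and `gram_schmidt` are hypothetical.

```python
def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def gram_schmidt(vectors, eps=1e-12):
    """Return mutually orthogonal vectors spanning the same space.

    Hypothetical helper: a hard orthogonalization used only to illustrate
    the idea, not the constraint the paper actually implements.
    """
    ortho = []
    for v in vectors:
        w = list(v)
        # Remove the component of w along every vector kept so far.
        for u in ortho:
            coeff = dot(w, u) / dot(u, u)
            w = [wi - coeff * ui for wi, ui in zip(w, u)]
        # Drop vectors that were linearly dependent on earlier ones.
        if dot(w, w) > eps:
            ortho.append(w)
    return ortho


# Three toy "edit vectors" that all share the first coordinate.
edits = [[1.0, 0.0, 0.0], [1.0, 1.0, 0.0], [1.0, 1.0, 1.0]]
ortho = gram_schmidt(edits)
for i in range(len(ortho)):
    for j in range(i + 1, len(ortho)):
        print(i, j, dot(ortho[i], ortho[j]))
```

After orthogonalization every pairwise inner product is zero, so no vector carries a component along another. That is the property the orthogonality constraint is meant to secure.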

If this is right

Batch knowledge editing maintains higher precision when multiple facts are updated at once.
Cross-lingual knowledge editing reaches stronger results than previous methods.
Knowledge injection occurs with finer location control through the gated head.
Overall editing success improves relative to existing batch techniques without requiring full retraining.
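The gated head in the list above can be pictured with a minimal sketch. It assumes one common form of gating, h' = h + g · tanh(W h) with g = sigmoid(w · h + b). The summary does not give ORE's actual equations, so the functional form, the function `gated_edit`, and all parameter names here are hypothetical.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def matvec(W, h):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, h)) for row in W]


def gated_edit(h, W_edit, w_gate, b_gate):
    """Return (h + g * tanh(W_edit h), g) with g = sigmoid(w_gate . h + b_gate).

    Assumed form for illustration; not taken from the paper.
    """
    gate = sigmoid(sum(w * x for w, x in zip(w_gate, h)) + b_gate)
    delta = [math.tanh(z) for z in matvec(W_edit, h)]
    return [x + gate * d for x, d in zip(h, delta)], gate


h = [1.0, -1.0]                      # toy hidden representation
W_edit = [[0.5, 0.0], [0.0, 0.5]]    # toy edit projection
w_gate = [1.0, 1.0]                  # w_gate . h == 0 for this h

open_h, open_g = gated_edit(h, W_edit, w_gate, b_gate=20.0)
closed_h, closed_g = gated_edit(h, W_edit, w_gate, b_gate=-20.0)
print(open_g, open_h)
print(closed_g, closed_h)
```

When the gate is near zero the hidden state passes through almost unchanged, and when it is near one the full non-linear update is injected. A learned gate of this kind is one way to control where an edit applies.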

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same subspace-plus-orthogonality pattern could be tested on sequential editing pipelines to see whether it reduces cumulative drift.
If the decoupling holds, larger batch sizes might become feasible before interference reappears.
The gated head mechanism might transfer to other representation-level interventions such as targeted fine-tuning steps.

Load-bearing premise

Semantic representation entanglement is the main source of interference that degrades batch editing performance, and orthogonal constraints plus a gated head can separate those signals without creating fresh interference or harming unrelated model behavior.

What would settle it

A controlled comparison in which edit vectors are made orthogonal yet batch editing accuracy and interference metrics show no improvement over a non-orthogonal baseline using the same subspace and head.

Figures

Figures reproduced from arXiv: 2606.22627 by Bo Lv, Fangyin Ma, KaiWen Wei, Nayu Liu, Shihao Yang, Wenhao Yu, Zhicong Lu.

**Figure 2.** Editing efficacy of MEMIT and AlphaEdit on entangled (Fr, Zh, Fr-Zh) and random samples. Performance drops on entangled samples, especially in cross-lingual (Fr-Zh) settings. This result supports our hypothesis: in batch editing, shared general semantic structures cause update vectors to conflict and accumulate noise within the representation subspace, thereby leading to a decline in the performance of ex…

**Figure 3.** Overview of the proposed ORE framework. It edits a frozen LLM by applying gated, non-linear …

**Figure 4.** Cosine similarity between edit representations …

**Figure 5.** Ablation study of Representation Subspace …

**Figure 6.** Performance Comparison between ORE and Existing Methods.

Original abstract

Knowledge editing aims to efficiently update factual information in Large Language Models (LLMs) without full retraining. However, existing methods still suffer from performance degradation in batch knowledge editing. We identify that semantic representation entanglement, such as overlapping concepts and shared syntactic patterns, accumulates interference in the representation space and reduces editing precision. To bridge this gap, in this paper, we propose Orthogonal Representation Editing (ORE), which performs edits in the hidden representation space of LLMs by constructing a general semantic subspace and enforcing orthogonal constraints on edit vectors, effectively decoupling semantic entanglement. Furthermore, we introduce a gated non-linear representation head to enable adaptive learning of editing locations and precise control over knowledge injection. Extensive experiments show that ORE outperforms existing methods and achieves superior performance in cross-lingual knowledge editing scenarios. We release our code at https://github.com/YVVH/ORE.
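To make the abstract's idea of a shared semantic subspace concrete, the sketch below removes from each edit vector the component that lies in a given subspace. It assumes the subspace is supplied as an orthonormal basis (for example, top principal directions of hidden states). The abstract does not describe how ORE builds or uses its subspace, so `remove_shared_component` and the toy data are hypothetical.

```python
def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


def remove_shared_component(edit, basis):
    """Subtract the projection of `edit` onto span(basis).

    `basis` is assumed to be orthonormal. Hypothetical helper for
    illustration; not the paper's construction.
    """
    residual = list(edit)
    for b in basis:
        coeff = dot(edit, b)
        residual = [r - coeff * bi for r, bi in zip(residual, b)]
    return residual


# Toy setup: the first axis stands in for a shared "general semantic" direction.
shared_basis = [[1.0, 0.0, 0.0]]
edit_a = [0.9, 0.4, 0.0]
edit_b = [0.9, 0.0, 0.4]

res_a = remove_shared_component(edit_a, shared_basis)
res_b = remove_shared_component(edit_b, shared_basis)

print("overlap before:", dot(edit_a, edit_b))
print("overlap after: ", dot(res_a, res_b))
```

In this toy case the two edits overlap only through the shared direction, so removing it leaves residuals that no longer interfere. Real edit vectors would generally keep some residual overlap, which is where an explicit orthogonality constraint would come in.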

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note (private letter to a colleague)

ORE frames orthogonal constraints on edit vectors plus a gated head as a fix for entanglement-driven interference in batch LLM editing, but the abstract supplies no data or controls to evaluate whether that mechanism actually works.

ORE claims that semantic entanglement from overlapping concepts and shared syntax builds up interference during batch knowledge edits, and it counters this by constructing a general semantic subspace, enforcing orthogonal constraints on the edit vectors, and adding a gated non-linear head for choosing edit locations adaptively. That specific pairing of orthogonality and gating is the main new element on offer.

The paper does a reasonable job naming the interference problem in multi-edit settings and sketching a representation-space intervention that builds on existing editing lines. The gated head is a practical addition for controlling where and how knowledge gets injected, and the code release helps with checking the implementation later.

The clear limitation is the missing evidence. The abstract states outperformance and better cross-lingual results but gives no numbers, ablations on the orthogonal term, measurements of reduced entanglement, or comparisons to prior orthogonal regularization work. Without those, it is impossible to tell whether the claimed gains come from decoupling entanglement or from extra parameters and regularization. The stress-test point is fair: the paper needs to show that entanglement is the dominant cause and that the constraints do not create new interference elsewhere.

This is for researchers working on knowledge editing and representation interventions in LLMs. Someone already following that literature would get value from the method description and could test the released code themselves. It shows straightforward engagement with the editing problem rather than incoherence, so it is worth a referee's time to examine the experiments and see whether the mechanism holds up.

Referee Report

3 major / 1 minor

Summary. The paper claims that semantic representation entanglement (overlapping concepts and shared syntactic patterns) accumulates interference and degrades performance in batch knowledge editing of LLMs. It proposes Orthogonal Representation Editing (ORE), which performs edits in the hidden representation space by constructing a general semantic subspace, enforcing orthogonal constraints on edit vectors to decouple entanglement, and adding a gated non-linear representation head for adaptive editing locations and precise knowledge injection. The method is asserted to outperform prior approaches and achieve superior results in cross-lingual editing scenarios.

Significance. If the central claims hold with rigorous empirical support, the work could advance batch knowledge editing by offering a representation-space mechanism to reduce cross-edit interference via orthogonality, with potential benefits for maintaining unrelated capabilities. The public code release supports reproducibility.

major comments (3)

[Abstract] The claims of outperformance and 'superior performance in cross-lingual knowledge editing scenarios' are stated without any quantitative results, ablation studies, error bars, tables, or statistical details, so the central performance assertions cannot be evaluated from the manuscript text.
[Introduction/Method] The premise that semantic entanglement is the dominant cause of degradation (rather than optimization conflicts or capacity limits) is asserted but not supported by any entanglement measurement or by a control experiment that separates the effect of the orthogonal constraints and gated head from generic regularization or capacity benefits.
[Experiments] (Implied by the abstract's claims.) No tables, figures, or specific results are supplied to demonstrate that the orthogonality mechanism plus gating reduces cross-edit interference while preserving unrelated capabilities, a demonstration the headline claims depend on.

minor comments (1)

[Abstract] The code repository link is provided, which aids reproducibility; the abstract could briefly note the models and datasets used to contextualize the cross-lingual results.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the constructive feedback. We address each major comment below and will revise the manuscript to improve clarity and support for the claims where appropriate.

Referee: [Abstract] The claims of outperformance and 'superior performance in cross-lingual knowledge editing scenarios' are stated without any quantitative results, ablation studies, error bars, tables, or statistical details, so the central performance assertions cannot be evaluated from the manuscript text.

Authors: We agree the abstract is high-level by design. The full manuscript's Experiments section provides the requested quantitative results, tables, figures, ablations, and error bars. We will revise the abstract to include one or two key numerical highlights (e.g., relative gains on batch editing metrics) while remaining within length limits.
Revision: yes

Referee: [Introduction/Method] The premise that semantic entanglement is the dominant cause of degradation (rather than optimization conflicts or capacity limits) is asserted but not supported by any entanglement measurement or by a control experiment that separates the effect of the orthogonal constraints and gated head from generic regularization or capacity benefits.

Authors: The premise is motivated by observed interference patterns in batch editing, but we acknowledge the absence of a direct entanglement metric or isolation control. The orthogonal constraints and gated head are validated via targeted ablations in the experiments. We will add a short analysis subsection with a simple entanglement proxy (e.g., cosine similarity of edit vectors) and an isolation experiment to better separate the contribution of orthogonality from generic regularization.
Revision: partial

Referee: [Experiments] (Implied by the abstract's claims.) No tables, figures, or specific results are supplied to demonstrate that the orthogonality mechanism plus gating reduces cross-edit interference while preserving unrelated capabilities, a demonstration the headline claims depend on.

Authors: The Experiments section does contain tables and figures with batch and cross-lingual results plus ablations. If the reviewed version omitted these, we will ensure the next version clearly presents all tables/figures with error bars, statistical tests, and explicit discussion of interference reduction (via before/after edit vector orthogonality metrics) and capability preservation.
Revision: yes
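The entanglement proxy the simulated authors mention, cosine similarity of edit vectors, could be computed along the following lines. This is a minimal sketch: the choice of mean absolute pairwise cosine as the summary statistic and the name `mean_abs_cosine` are assumptions, not something the paper or the rebuttal specifies.

```python
import math


def cosine(u, v):
    """Cosine similarity of two non-zero vectors."""
    num = sum(a * b for a, b in zip(u, v))
    den = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return num / den


def mean_abs_cosine(vectors):
    """Mean |cosine| over all unordered pairs; 0.0 means fully orthogonal.

    Hypothetical proxy for entanglement between edit vectors.
    """
    pairs = [(i, j) for i in range(len(vectors)) for j in range(i + 1, len(vectors))]
    return sum(abs(cosine(vectors[i], vectors[j])) for i, j in pairs) / len(pairs)


# Toy edit vectors before and after an orthogonality constraint is applied.
before = [[1.0, 0.0], [1.0, 1.0]]
after = [[1.0, 0.0], [0.0, 1.0]]

print("proxy before:", mean_abs_cosine(before))
print("proxy after: ", mean_abs_cosine(after))
```

Reporting this number for a non-orthogonal baseline and for ORE, on the same batch of edits, would be one way to present the before/after orthogonality comparison promised in the third response.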

Circularity Check

0 steps flagged

No circularity in derivation chain

The provided abstract and description contain no equations, quantitative derivations, or self-citations that could reduce claims to inputs by construction. The paper identifies semantic entanglement as an issue and proposes ORE (general semantic subspace + orthogonal constraints + gated head) as a solution, with performance claims resting on experimental results rather than any self-referential math or fitted-parameter renaming. No load-bearing steps match the enumerated circularity patterns.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review yields no explicit free parameters, axioms, or invented entities; the method description remains at the level of high-level technique names.

