virtual samples

Zhong, Zexuan, Wu, Zhengxuan, Manning, Christopher, Potts, Christopher, Chen, Danqi · 2023 · DOI 10.18653/v1/2023.emnlp-main.971

5 Pith papers cite this work. Polarity classification is still indexing.

5 Pith papers citing it

open at publisher browse 5 citing papers

representative citing papers

More Edits, More Stable: Understanding the Lifelong Normalization in Sequential Model Editing

cs.LG · 2026-05-12 · unverdicted · novelty 7.0

Lifelong Normalization combined with ridge-regularized regression produces asymptotically orthogonal and bounded parameter updates that mitigate forgetting and collapse in lifelong model editing.

MixSD: Mixed Contextual Self-Distillation for Knowledge Injection

cs.CL · 2026-05-16 · unverdicted · novelty 6.0 · 2 refs

MixSD mixes tokens from the base model's expert and naive conditionals to create distribution-aligned supervision for knowledge injection, yielding better memorization-retention trade-offs than SFT across scales and benchmarks.

From Backward Spreading to Forward Replay: Revisiting Target Construction in LLM Parameter Editing

cs.CL · 2026-05-01 · unverdicted · novelty 6.0

Forward replay replaces backward spreading in LLM parameter editing by optimizing the target hidden state at the first editing layer and propagating it forward, yielding more accurate layer-wise targets at the same computational cost.

Grounding Multi-Hop Reasoning in Structural Causal Models via Group Relative Policy Optimization

cs.AI · 2026-05-02 · unverdicted · novelty 5.0

SCM-GRPO grounds multi-hop fact verification in structural causal models and applies GRPO reinforcement learning to optimize reasoning chain length, outperforming baselines on HoVer and EX-FEVER.

Towards Scalable Lifelong Knowledge Editing with Selective Knowledge Suppression

cs.AI · 2026-04-21 · unverdicted · novelty 5.0

LightEdit enables scalable lifelong knowledge editing in LLMs via selective knowledge retrieval and probability suppression during decoding, outperforming prior methods on ZSRE, Counterfact, and RIPE while reducing training costs.

citing papers explorer

Showing 5 of 5 citing papers.

More Edits, More Stable: Understanding the Lifelong Normalization in Sequential Model Editing cs.LG · 2026-05-12 · unverdicted · none · ref 9
Lifelong Normalization combined with ridge-regularized regression produces asymptotically orthogonal and bounded parameter updates that mitigate forgetting and collapse in lifelong model editing.
MixSD: Mixed Contextual Self-Distillation for Knowledge Injection cs.CL · 2026-05-16 · unverdicted · none · ref 49 · 2 links
MixSD mixes tokens from the base model's expert and naive conditionals to create distribution-aligned supervision for knowledge injection, yielding better memorization-retention trade-offs than SFT across scales and benchmarks.
From Backward Spreading to Forward Replay: Revisiting Target Construction in LLM Parameter Editing cs.CL · 2026-05-01 · unverdicted · none · ref 29
Forward replay replaces backward spreading in LLM parameter editing by optimizing the target hidden state at the first editing layer and propagating it forward, yielding more accurate layer-wise targets at the same computational cost.
Grounding Multi-Hop Reasoning in Structural Causal Models via Group Relative Policy Optimization cs.AI · 2026-05-02 · unverdicted · none · ref 42
SCM-GRPO grounds multi-hop fact verification in structural causal models and applies GRPO reinforcement learning to optimize reasoning chain length, outperforming baselines on HoVer and EX-FEVER.
Towards Scalable Lifelong Knowledge Editing with Selective Knowledge Suppression cs.AI · 2026-04-21 · unverdicted · none · ref 87
LightEdit enables scalable lifelong knowledge editing in LLMs via selective knowledge retrieval and probability suppression during decoding, outperforming prior methods on ZSRE, Counterfact, and RIPE while reducing training costs.

virtual samples

fields

years

verdicts

representative citing papers

citing papers explorer