Towards effective evaluations and comparison for llm unlearning methods

Qizhou Wang, Bo Han, Puning Yang, Jianing Zhu, Tongliang Liu, Masashi Sugiyama · 2025

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

cs.LG · 2025-11-30 · unverdicted · novelty 5.0

Gradient analysis and ablations show DPO and PPO have different target directions and component roles in preference optimization for LLMs.

Showing 1 of 1 citing paper.

What Is Preference Optimization Doing, and Why? cs.LG · 2025-11-30 · unverdicted · none · ref 6
Gradient analysis and ablations show DPO and PPO have different target directions and component roles in preference optimization for LLMs.