To simplify the pipeline, Direct Preference Optimization (DPO) (Rafailov et al.,

rely on reward modeling, policy optimization, which incur high computational costs, can be sensitive to noisy supervision (Gao et al · 2023

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Revisiting Robustness for LLM Safety Alignment via Selective Geometry Control

cs.LG · 2026-02-07 · unverdicted · novelty 5.0

ShaPO improves LLM safety robustness over standard preference optimization by enforcing worst-case objectives via selective geometry control at token and reward levels.

citing papers explorer

Showing 1 of 1 citing paper.

Revisiting Robustness for LLM Safety Alignment via Selective Geometry Control cs.LG · 2026-02-07 · unverdicted · none · ref 33
ShaPO improves LLM safety robustness over standard preference optimization by enforcing worst-case objectives via selective geometry control at token and reward levels.

To simplify the pipeline, Direct Preference Optimization (DPO) (Rafailov et al.,

fields

years

verdicts

representative citing papers

citing papers explorer