write at least 25 sentences,

are released under the Apache-2 · 2025 · arXiv 3007.1380

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

Recovering Diversity Without Losing Alignment: A DPO Recipe for Post-Trained LLMs

cs.CL · 2026-05-28 · conditional · novelty 7.0

REDIPO constructs DPO preference data from base-model generations rewritten by the instruct model to increase output diversity on NoveltyBench while preserving alignment metrics across three LLMs.

citing papers explorer

Showing 1 of 1 citing paper.

Recovering Diversity Without Losing Alignment: A DPO Recipe for Post-Trained LLMs cs.CL · 2026-05-28 · conditional · none · ref 5
REDIPO constructs DPO preference data from base-model generations rewritten by the instruct model to increase output diversity on NoveltyBench while preserving alignment metrics across three LLMs.

write at least 25 sentences,

fields

years

verdicts

representative citing papers

citing papers explorer