Hyung Gyu Rho

Identifiers

  • name variant Hyung Gyu Rho 0.60 · backfill

Papers (1)

  1. Margin Adaptive DPO: Leveraging Reward Model for Granular Control in Preference Optimization cs.LG · 2025 · author #1

Mentions

  • 2510.05342 #1 · arxiv_oai · confidence 0.70 Hyung Gyu Rho