pith. sign in

Sixian Li

Identifiers

No identifiers captured yet.

Papers (2)

  1. DFPO: Scaling Value Modeling via Distributional Flow towards Robust and Generalizable LLM Post-Training cs.LG · 2026 · author #7
  2. DVPO: Distributional Value Modeling-based Policy Optimization for LLM Post-Training cs.LG · 2025 · author #5

Mentions

No mention provenance yet.

Frequent Coauthors