Zhuoran Zhuang

Identifiers

No identifiers captured yet.

Papers (1)

  1. Teaching LLM to be Persuasive: Reward-Enhanced Policy Optimization for Alignment from Heterogeneous Rewards · cs.CL · 2025 · author #6

Mentions

No mention provenance yet.

Frequent Coauthors