Zhisheng Yang
Identifiers
No identifiers captured yet.
Papers (2)
- EP-GRPO: Entropy-Progress Aligned Group Relative Policy Optimization with Implicit Process Guidance cs.LG · 2026 · author #4
- ERPO: Token-Level Entropy-Regulated Policy Optimization for Large Reasoning Models cs.LG · 2026 · author #4
Mentions
No mention provenance yet.
Frequent Coauthors
- Li Li 2 shared papers
- Song Yu 2 shared papers
- Wenwen Zhao 2 shared papers