Zihua Zhao

Identifiers

  • name variant Zihua Zhao 0.60 · backfill

Papers (1)

  1. Focal Reward: Balanced Reinforcement Learning under Rubric-Based Rewards · cs.LG · 2026 · author #2

Mentions

  • 2605.26579 #2 · arxiv_oai · confidence 0.70 · Zihua Zhao

Frequent Coauthors