pith. sign in

Tiehua Mei

Identifiers

  • name variant Tiehua Mei 0.60 · backfill

Papers (3)

  1. ProRL: Effective Reinforcement Learning for Proactive Recommendation via Rectified Policy Gradient Estimation cs.LG · 2026 · author #2
  2. GoLongRL: Capability-Oriented Long Context Reinforcement Learning with Multitask Alignment cs.CL · 2026 · author #2
  3. Entropy Ratio Clipping as a Soft Global Constraint for Stable Reinforcement Learning cs.LG · 2025 · author #4

Mentions

  • 2605.28293 #2 · arxiv_oai · confidence 0.70 Tiehua Mei
  • 2605.19577 #2 · arxiv_oai · confidence 0.70 Tiehua Mei

Frequent Coauthors