pith. sign in

Hejie Cui

Identifiers

No identifiers captured yet.

Papers (2)

  1. T$^2$PO: Uncertainty-Guided Exploration Control for Stable Multi-Turn Agentic Reinforcement Learning cs.AI · 2026 · author #2
  2. Alternating Reinforcement Learning with Contextual Rubric Rewards: Beyond the Scalarization Strategy cs.LG · 2026 · author #4

Mentions

No mention provenance yet.

Frequent Coauthors