pith. sign in

Ai Jian

Identifiers

No identifiers captured yet.

Papers (2)

  1. PaTaRM: Bridging Pairwise and Pointwise Signals via Preference-Aware Task-Adaptive Reward Modeling cs.LG · 2025 · author #1
  2. Revisiting Entropy Regularization: Adaptive Coefficient Unlocks Its Potential for LLM Reinforcement Learning cs.LG · 2025 · author #7

Mentions

No mention provenance yet.

Frequent Coauthors