Aaron Tu

Identifiers

  • name variant Aaron Tu 0.60 · backfill

Papers (3)

  1. NoiseRater: Meta-Learned Noise Valuation for Diffusion Model Training · cs.LG · 2026 · author #9
  2. DeepSearch: Overcome the Bottleneck of Reinforcement Learning with Verifiable Rewards via Monte Carlo Tree Search · cs.AI · 2025 · author #5
  3. Position: The Hidden Costs and Measurement Gaps of Reinforcement Learning with Verifiable Rewards · cs.LG · 2025 · author #2

Mentions

  • 2509.21882 · author #2 · arxiv_oai · confidence 0.70 · Aaron Tu

Frequent Coauthors