Rei Higuchi

Identifiers

  • name variant Rei Higuchi 0.60 · backfill

Papers (1)

  1. How Neural Reward Models Learn Features for Policy Optimization: A Single-Index Analysis stat.ML · 2026 · author #1

Mentions

  • 2605.24749 #1 · arxiv_oai · confidence 0.70 Rei Higuchi

Frequent Coauthors