pith. sign in

Stewart Slocum

Identifiers

  • name variant Stewart Slocum 0.60 · backfill

Papers (2)

  1. Beyond Binary Rewards: Training LMs to Reason About Their Uncertainty cs.LG · 2025 · author #3
  2. Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback cs.AI · 2023 · author #18

Mentions

  • 2507.16806 #3 · arxiv_oai · confidence 0.70 Stewart Slocum

Frequent Coauthors