pith. sign in

Mehul Damani

Identifiers

  • name variant Mehul Damani 0.60 · backfill

Papers (4)

  1. Vector Policy Optimization: Training for Diversity Improves Test-Time Search cs.LG · 2026 · author #5
  2. Self-Distillation Enables Continual Learning cs.LG · 2026 · author #2
  3. Beyond Binary Rewards: Training LMs to Reason About Their Uncertainty cs.LG · 2025 · author #1
  4. Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback cs.AI · 2023 · author #17

Mentions

  • 2605.22817 #5 · arxiv_oai · confidence 0.70 Mehul Damani
  • 2507.16806 #1 · arxiv_oai · confidence 0.70 Mehul Damani

Frequent Coauthors