Stephane Hatgis-Kessell

Identifiers

  • name variant Stephane Hatgis-Kessell 0.60 · backfill

Papers (2)

  1. When are LLMs Sufficient Policy Optimizers for Sequential RL Tasks? · cs.LG · 2026 · author #1
  2. Influencing Humans to Conform to Preference Models for RLHF · cs.LG · 2025 · author #1

Mentions

  • 2605.30719 #1 · arxiv_oai · confidence 0.70 Stephane Hatgis-Kessell

Frequent Coauthors