Frank Xiao
Identifiers
- name variant Frank Xiao 0.60 · backfill
Papers (4)
- Generalization Hacking: Models Can Game Reinforcement Learning by Preventing Behavioral Generalization cs.LG · 2026 · author #1
- Bootstrapped Monitoring: Leveraging Transparent Reasoning to Oversee Stronger AI Agents cs.LG · 2026 · author #1
- Probe-Based Data Attribution: Discovering and Mitigating Undesirable Behaviors in LLM Post-Training cs.LG · 2026 · author #1
- Why Do Language Model Agents Whistleblow? cs.LG · 2025 · author #2
Mentions
- 2606.12016 #1 · arxiv_oai · confidence 0.70 Frank Xiao
- 2606.11998 #1 · arxiv_oai · confidence 0.70 Frank Xiao
Frequent Coauthors
- Mary Phuong 2 shared papers
- Asa Cooper Stickland 1 shared papers
- Guido Bergman 1 shared papers
- Kushal Agrawal 1 shared papers
- Santiago Aranguri 1 shared papers