pith. sign in

Lei M Zhang

Identifiers

  • name variant Lei M Zhang 0.60 · backfill

Papers (1)

  1. Training Language Models to Self-Correct via Reinforcement Learning cs.LG · 2024 · author #11

Mentions

  • 2409.12917 #11 · arxiv_oai · confidence 0.70 Lei M Zhang

Frequent Coauthors