Kellie Lu

Identifiers

  • name variant Kellie Lu 0.60 · backfill

Papers (1)

  1. RLAIF vs. RLHF: Scaling Reinforcement Learning from Human Feedback with AI Feedback · cs.CL · 2023 · author #6

Mentions

  • 2309.00267 #6 · arxiv_oai · confidence 0.70 Kellie Lu

Frequent Coauthors