Lanjihong Ma
Identifiers
No identifiers captured yet.
Papers (1)
- Data-dependent Exploration for Online Reinforcement Learning from Human Feedback · cs.LG · 2026 · author #4
Mentions
No mention provenance yet.
Frequent Coauthors
- Jiandong Zhang 1 shared paper
- Masashi Sugiyama 1 shared paper
- Yuting Tang 1 shared paper
- Zhen-Yu Zhang 1 shared paper