Rohan Surana
Identifiers
No identifiers captured yet.
Papers (5)
- F-GRPO: Factorized Group-Relative Policy Optimization for Unified Candidate Generation and Ranking · cs.LG · 2026 · author #1
- MASS-DPO: Multi-negative Active Sample Selection for Direct Policy Optimization · cs.LG · 2026 · author #1
- Skill-R1: Agent Skill Evolution via Reinforcement Learning · cs.LG · 2026 · author #2
- Skill-CMIB: Multimodal Agent Skill for Consistent Action via Conditional Multimodal Information Bottleneck · cs.LG · 2026 · author #5
- Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning · cs.LG · 2026 · author #1
Mentions
No mention provenance yet.
Frequent Coauthors
- Julian McAuley · 5 shared papers
- Junda Wu · 5 shared papers
- Tong Yu · 5 shared papers
- Jingbo Shang · 4 shared papers
- Xintong Li · 4 shared papers
- Zihan Huang · 3 shared papers
- Bowen Jin · 2 shared papers
- Chuhan Wang · 2 shared papers
- Gagan Mundada · 2 shared papers
- Jiawei Han · 2 shared papers
- Sheldon Yu · 2 shared papers
- Sizhe Zhou · 2 shared papers
- Xunyi Jiang · 2 shared papers
- Difan Jiao · 1 shared paper
- Kuan-Hao Huang · 1 shared paper
- Lina Yao · 1 shared paper
- Nikki Kuang · 1 shared paper
- Nikki Lijing Kuang · 1 shared paper
- Prithviraj Ammanabrolu · 1 shared paper
- Qianqi Yan · 1 shared paper