Rohan Surana

Identifiers

No identifiers captured yet.

Papers (5)

F-GRPO: Factorized Group-Relative Policy Optimization for Unified Candidate Generation and Ranking · cs.LG · 2026 · author #1
MASS-DPO: Multi-negative Active Sample Selection for Direct Policy Optimization · cs.LG · 2026 · author #1
Skill-R1: Agent Skill Evolution via Reinforcement Learning · cs.LG · 2026 · author #2
Skill-CMIB: Multimodal Agent Skill for Consistent Action via Conditional Multimodal Information Bottleneck · cs.LG · 2026 · author #5
Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning · cs.LG · 2026 · author #1

Mentions

No mention provenance yet.

Frequent Coauthors

Julian McAuley 5 shared papers
Junda Wu 5 shared papers
Tong Yu 5 shared papers
Jingbo Shang 4 shared papers
Xintong Li 4 shared papers
Zihan Huang 3 shared papers
Bowen Jin 2 shared papers
Chuhan Wang 2 shared papers
Gagan Mundada 2 shared papers
Jiawei Han 2 shared papers
Sheldon Yu 2 shared papers
Sizhe Zhou 2 shared papers
Xunyi Jiang 2 shared papers
Difan Jiao 1 shared paper
Kuan-Hao Huang 1 shared paper
Lina Yao 1 shared paper
Nikki Kuang 1 shared paper
Nikki Lijing Kuang 1 shared paper
Prithviraj Ammanabrolu 1 shared paper
Qianqi Yan 1 shared paper