Gagan Mundada

Identifiers

No identifiers captured yet.

Papers (2)

  1. F-GRPO: Factorized Group-Relative Policy Optimization for Unified Candidate Generation and Ranking · cs.LG · 2026 · author #2
  2. Generate, Filter, Control, Replay: A Comprehensive Survey of Rollout Strategies for LLM Reinforcement Learning · cs.LG · 2026 · author #2

Mentions

No mention provenance yet.

Frequent Coauthors