Mohammad Mahdi Salmani-Zarchi
Identifiers
- name variant Mohammad Mahdi Salmani-Zarchi 0.60 · backfill
Papers (1)
- MDP-GRPO: Stabilized Group Relative Policy Optimization for Multi-Constraint Instruction Following cs.LG · 2026 · author #1
Mentions
- 2606.06058 #1 · arxiv_oai · confidence 0.70 Mohammad Mahdi Salmani-Zarchi
Frequent Coauthors
- Heshaam Faili 1 shared paper
- Mohammad Javad Dousti 1 shared paper
- Zahra Rahimi 1 shared paper