Review history
Transformation-Augmented GRPO for Enhancing Exploration in Reasoning of Large Language Models
-
2026-05-21 CONDITIONAL
-
2026-05-16 UNVERDICTED
Transformation-Augmented GRPO for Enhancing Exploration in Reasoning of Large Language Models