pith. sign in

Yunfei Xie

Identifiers

  • name variant Yunfei Xie 0.60 · backfill

Papers (4)

  1. How Off-Policy Can GRPO Be? Mu-GRPO for Efficient LLM Reinforcement Learning cs.LG · 2026 · author #2
  2. PhysNote: Self-Knowledge Notes for Evolvable Physical Reasoning in Vision-Language Model cs.AI · 2026 · author #2
  3. Correct Answers from Sound Reasoning: Verifiable Process Supervision for Language Models cs.CL · 2026 · author #3
  4. Geometric realization of stress-tensor deformed field theory hep-th · 2025 · author #2

Mentions

  • 2605.17570 #2 · arxiv_oai · confidence 0.70 Yunfei Xie

Frequent Coauthors