pith. sign in

Xiangyang Qu

Identifiers

  • name variant Xiangyang Qu 0.60 · backfill

Papers (2)

  1. Reinforcement Learning with Robust Rubric Rewards cs.CV · 2026 · author #4
  2. Visual Preference Optimization with Rubric Rewards cs.CV · 2026 · author #3

Mentions

  • 2605.30244 #4 · arxiv_oai · confidence 0.70 Xiangyang Qu

Frequent Coauthors