Xin-Qiang Cai

Identifiers

  • name variant Xin-Qiang Cai 0.60 · backfill

Papers (2)

  1. VI-CuRL: Stabilizing Verifier-Independent RL Reasoning via Confidence-Guided Variance Reduction · cs.LG · 2026 · author #1
  2. Reinforcement Learning with Verifiable yet Noisy Rewards under Imperfect Verifiers · cs.LG · 2025 · author #1

Mentions

  • 2602.12579 #1 · arxiv_oai · confidence 0.70 Xin-Qiang Cai
  • 2510.00915 #1 · arxiv_oai · confidence 0.70 Xin-Qiang Cai

Frequent Coauthors