pith. sign in

Kai Xiong

Identifiers

  • name variant Kai Xiong 0.60 · backfill

Papers (6)

  1. DeepTool: Scaling Interleaved Deliberation in Tool-Integrated Reasoning via Process-Supervised Reinforcement Learning cs.AI · 2026 · author #5
  2. X-Imitator: Spatial-Aware Imitation Learning via Bidirectional Action-Pose Interaction cs.RO · 2026 · author #1
  3. GR-Ben: A General Reasoning Benchmark for Evaluating Process Reward Models cs.AI · 2026 · author #6
  4. Consolidation or Adaptation? PRISM: Disentangling SFT and RL Data via Gradient Concentration cs.AI · 2026 · author #6
  5. MAESTRO: Meta-learning Adaptive Estimation of Scalarization Trade-offs for Reward Optimization cs.LG · 2026 · author #6
  6. PuzzleClone: A DSL-Powered Framework for Synthesizing Verifiable Data cs.AI · 2025 · author #1

Mentions

  • 2508.15180 #1 · arxiv_oai · confidence 0.70 Kai Xiong
  • 2605.29568 #5 · arxiv_oai · confidence 0.70 Kai Xiong

Frequent Coauthors