pith. sign in

Biqing Qi

Identifiers

  • name variant Biqing Qi 0.60 · backfill

Papers (13)

  1. UnityMAS-O: A General RL Optimization Framework for LLM-Based Multi-Agent Systems cs.AI · 2026 · author #12
  2. FlexDraft: Flexible Speculative Decoding via Attention Tuning and Bonus-Guided Calibration cs.CL · 2026 · author #7
  3. CKT-WAM: Parameter-Efficient Context Knowledge Transfer Between World Action Models cs.RO · 2026 · author #10
  4. Auto-FlexSwitch: Efficient Dynamic Model Merging via Learnable Task Vector Compression cs.LG · 2026 · author #4
  5. MARS$^2$: Scaling Multi-Agent Tree Search via Reinforcement Learning for Code Generation cs.AI · 2026 · author #9
  6. DARE: Diffusion Large Language Models Alignment and Reinforcement Executor cs.CL · 2026 · author #5
  7. AsyncVLA: Asynchronous Flow Matching for Vision-Language-Action Models cs.RO · 2025 · author #5
  8. Nirvana: A Specialized Generalist Model With Task-Aware Memory Mechanism cs.LG · 2025 · author #9
  9. A Survey of Inductive Reasoning for Large Language Models cs.CL · 2025 · author #11
  10. A Survey of Reinforcement Learning for Large Reasoning Models cs.CL · 2025 · author #37
  11. InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency cs.CV · 2025 · author #48
  12. ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows cs.AI · 2025 · author #19
  13. TTRL: Test-Time Reinforcement Learning cs.CL · 2025 · author #11

Mentions

  • 2605.26646 #12 · arxiv_oai · confidence 0.70 Biqing Qi
  • 2605.20022 #7 · arxiv_oai · confidence 0.70 Biqing Qi
  • 2509.08827 #37 · arxiv_oai · confidence 0.70 Biqing Qi

Frequent Coauthors