Xiang Yue

Identifiers

  • name variant Xiang Yue 0.60 · backfill

Papers (14)

  1. JobBench: Aligning Agent Work With Human Will · cs.AI · 2026 · author #23
  2. On the limits and opportunities of AI reviewers: Reviewing the reviews of Nature-family papers with 45 expert scientists · cs.CL · 2026 · author #54
  3. VisCoder2: Building Multi-Language Visualization Coding Agents · cs.SE · 2025 · author #10
  4. Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning · cs.AI · 2025 · author #9
  5. Unlearning Isn't Deletion: Investigating Reversibility of Machine Unlearning in LLMs · cs.CL · 2025 · author #2
  6. Scaling Evaluation-time Compute with Reasoning Models as Evaluators · cs.CL · 2025 · author #4
  7. SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines · cs.CL · 2025 · author #91
  8. Demystifying Long Chain-of-Thought Reasoning in LLMs · cs.CL · 2025 · author #5
  9. Video-MMMU: Evaluating Knowledge Acquisition from Multi-Discipline Professional Videos · cs.CV · 2025 · author #6
  10. MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark · cs.CL · 2024 · author #1
  11. MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark · cs.CL · 2024 · author #16
  12. MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI · cs.CL · 2023 · author #1
  13. MAmmoTH: Building Math Generalist Models through Hybrid Instruction Tuning · cs.CL · 2023 · author #1
  14. SurfCon: Synonym Discovery on Privacy-Aware Clinical Data · cs.CL · 2019 · author #2

Mentions

  • 2605.26329 #23 · arxiv_oai · confidence 0.70 · Xiang Yue
  • 2605.20668 #54 · arxiv_oai · confidence 0.70 · Xiang Yue
  • 2503.19877 #4 · arxiv_oai · confidence 0.70 · Xiang Yue
  • 2505.16831 #2 · arxiv_oai · confidence 0.70 · Xiang Yue
  • 2502.03373 #5 · arxiv_oai · confidence 0.70 · Xiang Yue
  • 2507.00432 #9 · arxiv_oai · confidence 0.70 · Xiang Yue
  • 2309.05653 #1 · arxiv_oai · confidence 0.70 · Xiang Yue
  • 2502.14739 #91 · arxiv_oai · confidence 0.70 · Xiang Yue

Frequent Coauthors