pith. sign in

Xuezhi Cao

Identifiers

  • name variant Xuezhi Cao 0.60 · backfill

Papers (13)

  1. OPERA: Aligning Open-Ended Reasoning via Objective Perplexity-based Reinforcement Learning cs.CL · 2026 · author #8
  2. DailyReport: An Open-ended Benchmark for Evaluating Search Agents on Daily Search Tasks cs.AI · 2026 · author #7
  3. Asuka-Bench: Benchmarking Code Agents on Underspecified User Intent and Multi-Round Refinement cs.SE · 2026 · author #8
  4. SAGE: A Quantitative Evaluation of Socialized Evolution in Agent Ecosystems cs.AI · 2026 · author #4
  5. ATLAS: All-round Testing of Long-context Abilities across Scales cs.CL · 2026 · author #15
  6. WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation cs.CV · 2026 · author #7
  7. SWE-Cycle: Benchmarking Code Agents across the Complete Issue Resolution Cycle cs.SE · 2026 · author #8
  8. AgentEscapeBench: Evaluating Out-of-Domain Tool-Grounded Reasoning in LLM Agents cs.AI · 2026 · author #9
  9. General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks cs.CL · 2026 · author #12
  10. LARY: A Latent Action Representation Yielding Benchmark for Generalizable Vision-to-Action Alignment cs.CV · 2026 · author #6
  11. A Machine Learning Approach To Prevent Malicious Calls Over Telephony Networks cs.CR · 2018 · author #6
  12. Collaborative Filtering with Graph-based Implicit Feedback cs.IR · 2018 · author #4
  13. Revealing Multiple Layers of Hidden Community Structure in Networks cs.SI · 2015 · author #3

Mentions

  • 2606.25757 #8 · arxiv_oai · confidence 0.70 Xuezhi Cao
  • 2606.12871 #7 · arxiv_oai · confidence 0.70 Xuezhi Cao
  • 2606.05920 #8 · arxiv_oai · confidence 0.70 Xuezhi Cao
  • 2606.03544 #4 · arxiv_oai · confidence 0.70 Xuezhi Cao
  • 1501.05700 #3 · backfill · confidence 0.70 Xuezhi Cao
  • 2605.28079 #15 · arxiv_oai · confidence 0.70 Xuezhi Cao
  • 2605.25874 #7 · arxiv_oai · confidence 0.70 Xuezhi Cao
  • 2605.07926 #9 · arxiv_oai · confidence 0.70 Xuezhi Cao

Frequent Coauthors