pith. sign in

Yunze Xiao

Identifiers

  • name variant Yunze Xiao 0.60 · backfill

Papers (9)

  1. Knowledge Index of Noah's Ark cs.AI · 2026 · author #3
  2. Dr. DocBench: A Comprehensive Benchmark for Expert-Level and Difficult Document Parsing cs.CL · 2026 · author #13
  3. Validated Hypotheses as a Lens for Human-Likeness Evaluation in AI Agents cs.CY · 2026 · author #6
  4. The Chameleon's Limit: Investigating Persona Collapse and Homogenization in Large Language Models cs.CL · 2026 · author #1
  5. Superminds Test: Actively Evaluating Collective Intelligence of Agent Society via Probing Agents cs.AI · 2026 · author #3
  6. Say Something Else: Rethinking Contextual Privacy as Information Sufficiency cs.CR · 2026 · author #1
  7. Sentipolis: Emotion-Aware Agents for Social Simulations cs.AI · 2026 · author #3
  8. Can LLMs Estimate Student Struggles? Human-AI Difficulty Alignment with Proficiency Simulation for Item Difficulty Prediction cs.CL · 2025 · author #3
  9. Humanity's Last Exam cs.LG · 2025 · author #910

Mentions

  • 2606.05104 #3 · arxiv_oai · confidence 0.70 Yunze Xiao
  • 2606.01393 #13 · arxiv_oai · confidence 0.70 Yunze Xiao
  • 2605.15473 #6 · arxiv_oai · confidence 0.70 Yunze Xiao

Frequent Coauthors