pith. sign in

Wei-Lin Chiang

Identifiers

  • name variant Wei-Lin Chiang 0.60 · backfill

Papers (6)

  1. Arena-T2I Hard: Benchmarking and Improving Faithfulness with Dependency-Aware Checklist cs.AI · 2026 · author #7
  2. DualEval: Joint Model-Item Calibration for Unified LLM Evaluation cs.LG · 2026 · author #5
  3. RouteLLM: Learning to Route LLMs with Preference Data cs.LG · 2024 · author #4
  4. From Crowdsourced Data to High-Quality Benchmarks: Arena-Hard and BenchBuilder Pipeline cs.LG · 2024 · author #2
  5. Chatbot Arena: An Open Platform for Evaluating LLMs by Human Preference cs.AI · 2024 · author #1
  6. Judging LLM-as-a-Judge with MT-Bench and Chatbot Arena cs.CL · 2023 · author #2

Mentions

  • 2606.31711 #7 · arxiv_oai · confidence 0.70 Wei-Lin Chiang
  • 2606.26429 #5 · arxiv_oai · confidence 0.70 Wei-Lin Chiang
  • 2406.11939 #2 · arxiv_oai · confidence 0.70 Wei-Lin Chiang

Frequent Coauthors