pith. sign in

Yotam Perlitz

Identifiers

  • name variant Yotam Perlitz 0.60 · backfill

Papers (9)

  1. A Matter of TASTE: Improving Coverage and Difficulty of Agent Benchmarks cs.AI · 2026 · author #4
  2. Instructions Shape Production of Language, not Processing cs.CL · 2026 · author #4
  3. PolySQL: Scaling Text-to-SQL Evaluation Across SQL Dialects via Automated Backend Isomorphism cs.CL · 2026 · author #1
  4. Growing Pains: Extensible and Efficient LLM Benchmarking Via Fixed Parameter Calibration cs.CL · 2026 · author #4
  5. General Agent Evaluation cs.AI · 2026 · author #5
  6. DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM Evaluation cs.CL · 2025 · author #4
  7. Humanity's Last Exam cs.LG · 2025 · author #849
  8. Holmes: A Benchmark to Assess the Linguistic Competence of Language Models cs.CL · 2024 · author #2
  9. Helical liquid in carbon nanotubes wrapped with DNA molecules cond-mat.mes-hall · 2017 · author #1

Mentions

  • 2605.28556 #4 · arxiv_oai · confidence 0.70 Yotam Perlitz

Frequent Coauthors