pith. sign in

Leshem Choshen

Identifiers

  • name variant Leshem Choshen 0.60 · backfill

Papers (13)

  1. Instructions Shape Production of Language, not Processing cs.CL · 2026 · author #2
  2. Growing Pains: Extensible and Efficient LLM Benchmarking Via Fixed Parameter Calibration cs.CL · 2026 · author #7
  3. General Agent Evaluation cs.AI · 2026 · author #12
  4. Beyond Binary Rewards: Training LMs to Reason About Their Uncertainty cs.LG · 2025 · author #5
  5. LLM Hypnosis: Exploiting User Feedback for Unauthorized Knowledge Injection to All Users cs.CL · 2025 · author #5
  6. DOVE: A Large-Scale Multi-Dimensional Predictions Dataset Towards Meaningful LLM Evaluation cs.CL · 2025 · author #6
  7. Holmes: A Benchmark to Assess the Linguistic Competence of Language Models cs.CL · 2024 · author #3
  8. Are You Convinced? Choosing the More Convincing Evidence with a Siamese Network cs.LG · 2019 · author #3
  9. Learning to combine Grammatical Error Corrections cs.CL · 2019 · author #3
  10. The Language of Legal and Illegal Activity on the Darknet cs.CL · 2019 · author #1
  11. Automatic Metric Validation for Grammatical Error Correction cs.CL · 2018 · author #1
  12. DORA The Explorer: Directed Outreaching Reinforcement Action-Selection cs.LG · 2018 · author #1
  13. Reference-less Measure of Faithfulness for Grammatical Error Correction cs.CL · 2018 · author #1

Mentions

  • 2507.16806 #5 · arxiv_oai · confidence 0.70 Leshem Choshen

Frequent Coauthors