pith. sign in

Liantong Yu

Identifiers

  • name variant Liantong Yu 0.60 · backfill

Papers (1)

  1. How Much Do Large Language Model Cheat on Evaluation? Benchmarking Overestimation under the One-Time-Pad-Based Framework cs.CL · 2025 · author #2

Mentions

  • 2507.19219 #2 · arxiv_oai · confidence 0.70 Liantong Yu

Frequent Coauthors