TadA-Bench: A Million-Variant Benchmark for Future-Round Discovery Toward Agentic Protein Engineering

Dequan Wang; Dukun Zhao; Jiaqi Shen; Jin Gao; Junhao Shi; Juntu Zhao; Yuming Lu; Zirui Zeng

arxiv: 2606.02624 · v1 · pith:ONYVXMFXnew · submitted 2026-05-29 · 🧬 q-bio.QM · cs.AI· cs.LG

TadA-Bench: A Million-Variant Benchmark for Future-Round Discovery Toward Agentic Protein Engineering

Jin Gao , Juntu Zhao , Zirui Zeng , Jiaqi Shen , Junhao Shi , Dukun Zhao , Yuming Lu , Dequan Wang This is my paper

classification 🧬 q-bio.QM cs.AIcs.LG

keywords agenticdiscoveryfuture-roundproteintada-benchengineeringreplayrounds

0 comments

read the original abstract

AI for scientific discovery is entering an agentic era, where protein-engineering systems are expected to prioritize future wet-lab experiments rather than merely fit static measurements. We introduce TadA-Bench, a million-variant wet-lab replay benchmark from 31 TadA directed-evolution rounds for future-round discovery toward agentic protein engineering. TadA-Bench preserves the campaign chronology and defines a fixed-data replay task: given earlier experimental rounds, models rank variants that appear only in later rounds. It provides aligned DNA, RNA, and protein views, and uses Seq2Graph, a graph-based label-unification pipeline, to reconcile noisy enrichment measurements into consistent cross-round activity labels. Random-split controls show strong interpolation, but future-round ranking and finite-budget candidate selection are much weaker. Controlled analyses suggest that evolutionary coverage is more informative than local data density, positioning TadA-Bench as a reproducible wet-lab replay substrate for future-round discovery toward agentic protein engineering; the data and code are released on Hugging Face and GitHub.

This paper has not been read by Pith yet.

TadA-Bench: A Million-Variant Benchmark for Future-Round Discovery Toward Agentic Protein Engineering

discussion (0)