Inputs (provided at evaluation time) •1

You are given the prompt which the agent was given to complete

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

BioAgent Bench: An AI Agent Evaluation Suite for Bioinformatics

cs.AI · 2026-01-29 · accept · novelty 7.0

BioAgent Bench is a new evaluation suite that tests AI agents on end-to-end bioinformatics pipelines and finds that frontier models often complete tasks reliably but fail under controlled perturbations like corrupted inputs or prompt bloat.

citing papers explorer

Showing 1 of 1 citing paper.

BioAgent Bench: An AI Agent Evaluation Suite for Bioinformatics cs.AI · 2026-01-29 · accept · none · ref 8
BioAgent Bench is a new evaluation suite that tests AI agents on end-to-end bioinformatics pipelines and finds that frontier models often complete tasks reliably but fail under controlled perturbations like corrupted inputs or prompt bloat.

Inputs (provided at evaluation time) •1

fields

years

verdicts

representative citing papers

citing papers explorer