GenomeQA benchmark shows general LLMs outperform random guessing on raw DNA sequences by detecting local patterns like GC content but struggle with multi-step or indirect genomic inferences.
Title resolution pending
1 Pith paper cite this work. Polarity classification is still indexing.
1
Pith paper citing it
fields
q-bio.GN 1years
2026 1verdicts
UNVERDICTED 1representative citing papers
citing papers explorer
-
GenomeQA: Benchmarking General Large Language Models for Genome Sequence Understanding
GenomeQA benchmark shows general LLMs outperform random guessing on raw DNA sequences by detecting local patterns like GC content but struggle with multi-step or indirect genomic inferences.