BioXArena benchmarks LLM agents on generating end-to-end ML pipelines for 76 multi-modal biomedical tasks, with MLEvolve plus Gemini-3.1-Pro scoring highest at 0.666.
The cancer genome atlas pan-cancer analysis project.Nature genetics, 45(10):1113–1120, 2013
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
citation-role summary
background 1
citation-polarity summary
years
2026 2verdicts
UNVERDICTED 2roles
background 1polarities
unclear 1representative citing papers
MIST augments MIL projection layers with cross-modal gene-expression prototypes derived from spatial transcriptomics, yielding consistent gains on survival, subtyping, and biomarker tasks across 23 endpoints and 8 aggregators.
citing papers explorer
-
BioXArena: Benchmarking LLM Agents on Multi-Modal Biomedical Machine Learning Tasks
BioXArena benchmarks LLM agents on generating end-to-end ML pipelines for 76 multi-modal biomedical tasks, with MLEvolve plus Gemini-3.1-Pro scoring highest at 0.666.
-
Bridging the Modality Bottleneck in Pathology MIL through Virtual Molecular Staining
MIST augments MIL projection layers with cross-modal gene-expression prototypes derived from spatial transcriptomics, yielding consistent gains on survival, subtyping, and biomarker tasks across 23 endpoints and 8 aggregators.