Large language models surpass human experts in predicting neuroscience results

Akilles Rechardt; Alessandro Salatiello; Alexandra O. Cohen; Anna Behler; Anton Pashkov; Bati Yilmaz; Bradley C. Love; Chloe M. Hall; Daniele Marinazzo; Elkhan Yusifov

arxiv: 2403.03230 · v4 · pith:JQAOP3TLnew · submitted 2024-03-04 · 🧬 q-bio.NC · cs.AI

Large language models surpass human experts in predicting neuroscience results

Xiaoliang Luo , Akilles Rechardt , Guangzhi Sun , Kevin K. Nejad , Felipe Y\'a\~nez , Bati Yilmaz , Kangjoo Lee , Alexandra O. Cohen

show 31 more authors

Valentina Borghesani Anton Pashkov Daniele Marinazzo Jonathan Nicholas Alessandro Salatiello Ilia Sucholutsky Pasquale Minervini Sepehr Razavi Roberta Rocca Elkhan Yusifov Tereza Okalova Nianlong Gu Martin Ferianc Mikail Khona Kaustubh R. Patil Pui-Shee Lee Rui Mata Nicholas E. Myers Jennifer K Bizley Sebastian Musslick Isil Poyraz Bilgin Guiomar Niso Justin M. Ales Michael Gaebler N Apurva Ratan Murty Leyla Loued-Khenissi Anna Behler Chloe M. Hall Jessica Dafflon Sherry Dongqi Bao Bradley C. Love

This is my paper

classification 🧬 q-bio.NC cs.AI

keywords llmsexpertshumanneurosciencepredictingresultsbetterdiscoveries

0 comments

read the original abstract

Scientific discoveries often hinge on synthesizing decades of research, a task that potentially outstrips human information processing capacities. Large language models (LLMs) offer a solution. LLMs trained on the vast scientific literature could potentially integrate noisy yet interrelated findings to forecast novel results better than human experts. To evaluate this possibility, we created BrainBench, a forward-looking benchmark for predicting neuroscience results. We find that LLMs surpass experts in predicting experimental outcomes. BrainGPT, an LLM we tuned on the neuroscience literature, performed better yet. Like human experts, when LLMs were confident in their predictions, they were more likely to be correct, which presages a future where humans and LLMs team together to make discoveries. Our approach is not neuroscience-specific and is transferable to other knowledge-intensive endeavors.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Lynx: Enabling Efficient MoE Inference through Dynamic Batch-Aware Expert Selection
cs.LG 2024-11 unverdicted novelty 6.0

Lynx exploits training-induced batch-level expert activation skews via AffinityBinning to reduce invoked experts per batch, delivering up to 1.30x throughput with under 1% accuracy loss across four model families.