LitXBench is a new benchmark for extracting complete experiments from scientific papers, with results showing frontier LLMs outperform multi-turn pipelines by up to 0.37 F1 due to better handling of processing steps.
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.IR (1)
Years: 2026 (1)
Verdicts: CONDITIONAL (1)
Representative citing papers
LitXBench: A Benchmark for Extracting Experiments from Scientific Literature