SciCustom constructs application-specific benchmarks for LLM scientific capabilities from large-scale data using ontology-grounded units, automated tagging, consensus retrieval, and proxy selection.
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CL (1). Years: 2026 (1). Verdicts: UNVERDICTED (1).
Representative citing papers:
SciCustom: A Framework for Custom Evaluation of Scientific Capabilities in Large Language Models