Structured knowledge extracted from corpora enables test-driven data engineering for LLMs by mapping training data to source code, model training to compilation, benchmarking to unit testing, and failures to targeted data repairs, demonstrated across 16 disciplines.
STaR: Bootstrapping reasoning with reasoning
2 Pith papers cite this work. Polarity classification is still indexing.