Dependency count.Of the 200 repositories, 171 (85.5%) contain a recognized package manifest file; among these, the median repository declares 17 total dependencies (12 runtime)

contains over 850 directories · 2025

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

ProgramBench: Can Language Models Rebuild Programs From Scratch?

cs.SE · 2026-05-05 · unverdicted · novelty 7.0

ProgramBench introduces 200 tasks where models must reconstruct full programs like FFmpeg or SQLite from docs alone; none of 9 evaluated LMs fully solve any task and the best passes 95% tests on only 3% of tasks while favoring monolithic code.

citing papers explorer

Showing 1 of 1 citing paper.

ProgramBench: Can Language Models Rebuild Programs From Scratch? cs.SE · 2026-05-05 · unverdicted · none · ref 21
ProgramBench introduces 200 tasks where models must reconstruct full programs like FFmpeg or SQLite from docs alone; none of 9 evaluated LMs fully solve any task and the best passes 95% tests on only 3% of tasks while favoring monolithic code.

Dependency count.Of the 200 repositories, 171 (85.5%) contain a recognized package manifest file; among these, the median repository declares 17 total dependencies (12 runtime)

fields

years

verdicts

representative citing papers

citing papers explorer