National Academies Press

Committee on Reproducibility and Replicability in Science et al. · 2019 · DOI 10.17226/25303

6 Pith papers cite this work. Polarity classification is still indexing.


citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

When Representative Samples Produce Worse Outcomes: Scale-up Decisions and Testing in Small-Budget RCTs

stat.ME · 2026-06-11 · unverdicted · novelty 7.0

In small-budget RCTs where significance tests decide scale-up, optimal pilot sampling shifts from a representative sample to a single homogeneous subpopulation as the budget shrinks.

ARA: Agentic Reproducibility Assessment For Scalable Support Of Scientific Peer-Review

cs.DL · 2026-05-04 · unverdicted · novelty 6.0 · 2 refs

ARA uses LLMs to build workflow graphs linking sources, methods, and outputs in papers, then scores reproducibility, reaching ~61% accuracy on 213 ReScience C articles and outperforming prior approaches on ReproBench and GoldStandardDB.

Automating Computational Reproducibility in Social Science: Comparing Prompt-Based and Agent-Based Approaches

cs.SE · 2026-02-09 · unverdicted · novelty 6.0

Agent-based AI workflows repair injected reproducibility failures in R social-science code at 69-96% success, substantially outperforming prompt-based LLM approaches at 31-79%.

Reflections on Traceability for Visualization Research

cs.HC · 2026-04-15 · conditional · novelty 5.0

Visualization researchers propose traceability—recording abundant annotated artifacts, reporting curated research threads, and enabling reading via interfaces—as a way to ensure rigor and transparency in inherently unreproducible design processes.

Is K-fold cross validation the best model selection method for Machine Learning?

stat.ML · 2024-01-29 · unverdicted · novelty 5.0

K-fold CUBV combines cross-validation with PAC-Bayesian upper bounds on actual risk to provide a more robust criterion for validating ML accuracy and reducing false positives than standard CV.

Replicable Bandits with UCB based Exploration

cs.LG · 2026-04-21

citing papers explorer

Showing 4 of 4 citing papers after filters.

When Representative Samples Produce Worse Outcomes: Scale-up Decisions and Testing in Small-Budget RCTs
stat.ME · 2026-06-11 · unverdicted · none · ref 38

ARA: Agentic Reproducibility Assessment For Scalable Support Of Scientific Peer-Review
cs.DL · 2026-05-04 · unverdicted · none · ref 21 · 2 links

Automating Computational Reproducibility in Social Science: Comparing Prompt-Based and Agent-Based Approaches
cs.SE · 2026-02-09 · unverdicted · none · ref 20

Is K-fold cross validation the best model selection method for Machine Learning?
stat.ML · 2024-01-29 · unverdicted · none · ref 1
