A composable DSL for describing sampling workflows on code repositories enables explicit specification and statistical reasoning about the generalizability of empirical software engineering findings.
Title resolution pending
2 Pith papers cite this work. Polarity classification is still indexing.
fields
cs.SE 2years
2026 2representative citing papers
CoCoMUT is a reusable pipeline that discovers project structure, constructs call graphs, extracts source, reconciles bytecode to source, and emits versioned JSON datasets of method contexts, demonstrated on 20 Java repositories with 97.8% reconciliation and 99% audit accuracy.
citing papers explorer
-
Modeling Sampling Workflows for Code Repositories
A composable DSL for describing sampling workflows on code repositories enables explicit specification and statistical reasoning about the generalizability of empirical software engineering findings.
-
CoCoMUT: A Tool for Code-Context Mining and Automated Dataset Generation
CoCoMUT is a reusable pipeline that discovers project structure, constructs call graphs, extracts source, reconciles bytecode to source, and emits versioned JSON datasets of method contexts, demonstrated on 20 Java repositories with 97.8% reconciliation and 99% audit accuracy.