BioDefect is a new dataset for defect detection in bioinformatics software that improves average F1-scores by 29.61% to 38.04% over existing datasets when evaluated on nine language models.
Proceedings of the 2019 ACM SIGPLAN International Symposium on New Ideas, New Paradigms, and Reflections on Programming and Software , pages =
6 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
roles
background 2polarities
background 2representative citing papers
Novel compositional theory and software support enable modular design, verification, and reuse of swarm protocols for distributed agent systems.
The paper delivers a taxonomy of seven LLM study types in software engineering along with eight guidelines that separate mandatory requirements from recommended practices to address reproducibility challenges.
A semantic conflict model enables explicit local-first resolution in collaborative data structures by using operation dependencies and three-way merges on a replicated journal, demonstrated on registers including Last-Writer-Wins.
ERA adds asynchronous epoch-based arbitration to CRDTs to resolve duelling admins by imposing bounded total order within epochs while keeping availability.
StarCoderBase matches or beats OpenAI's code-cushman-001 on multi-language code benchmarks; the Python-fine-tuned StarCoder reaches 40% pass@1 on HumanEval while retaining other-language performance.
citing papers explorer
-
BioDefect: The First Dataset for Defect Detection in Bioinformatics Software
BioDefect is a new dataset for defect detection in bioinformatics software that improves average F1-scores by 29.61% to 38.04% over existing datasets when evaluated on nine language models.
-
Compositional Design, Implementation, and Verification of Swarms (Technical Report)
Novel compositional theory and software support enable modular design, verification, and reuse of swarm protocols for distributed agent systems.
-
Guidelines for Empirical Studies in Software Engineering involving Large Language Models
The paper delivers a taxonomy of seven LLM study types in software engineering along with eight guidelines that separate mandatory requirements from recommended practices to address reproducibility challenges.
-
Semantic Conflict Model for Collaborative Data Structures
A semantic conflict model enables explicit local-first resolution in collaborative data structures by using operation dependencies and three-way merges on a replicated journal, demonstrated on registers including Last-Writer-Wins.
-
ERA: Epoch-Resolved Arbitration for Duelling Admins in Group Management CRDTs
ERA adds asynchronous epoch-based arbitration to CRDTs to resolve duelling admins by imposing bounded total order within epochs while keeping availability.
-
StarCoder: may the source be with you!
StarCoderBase matches or beats OpenAI's code-cushman-001 on multi-language code benchmarks; the Python-fine-tuned StarCoder reaches 40% pass@1 on HumanEval while retaining other-language performance.