AgenticFlict is a public dataset of 29K+ textual merge conflicts from AI agent PRs, collected via merge simulation on 107K processed PRs and showing a 27.67% conflict rate with variation across agents.
German, and Daniela Damian
7 Pith papers cite this work. Polarity classification is still indexing.
citation-role summary
citation-polarity summary
fields
cs.SE 7roles
background 1polarities
background 1representative citing papers
Empirical analysis of 4707 MoltBook posts shows AI-only technical discourse focuses on security, trust, and abstract topics while lacking concrete runtime and project details found in human GitHub discussions.
A paraphrase-robust duplicate-step detector for Gherkin BDD suites, built on a new 1.1M-step public corpus, reports F1 scores up to 0.906 and estimates 893k eliminable step occurrences corpus-wide.
A composable DSL for describing sampling workflows on code repositories enables explicit specification and statistical reasoning about the generalizability of empirical software engineering findings.
Empirical study of 1,454 Java OSS projects finds weak correlations among quality assurance practices and greater intensity in mature projects for ASAT and code review but not CI.
Study of 362 Java projects finds MySQL and PostgreSQL dominate relational use while Redis and MongoDB lead non-relational, with frequent multi-DBM co-use and ORM mediation.
Developers most frequently reference the full Log4j migration guide in pull request descriptions (82.81% of cases) and continue consulting it during post-update maintenance tasks.
citing papers explorer
-
AgenticFlict: A Large-Scale Dataset of Merge Conflicts in AI Coding Agent Pull Requests on GitHub
AgenticFlict is a public dataset of 29K+ textual merge conflicts from AI agent PRs, collected via merge simulation on 107K processed PRs and showing a 27.67% conflict rate with variation across agents.
-
What Software Engineering Looks Like to AI Agents? -- An Empirical Study of AI-Only Technical Discourse on MoltBook
Empirical analysis of 4707 MoltBook posts shows AI-only technical discourse focuses on security, trust, and abstract topics while lacking concrete runtime and project details found in human GitHub discussions.
-
Reducing Maintenance Burden in Behaviour-Driven Development: A Paraphrase-Robust Duplicate-Step Detector with a 1.1M-Step Open Benchmark
A paraphrase-robust duplicate-step detector for Gherkin BDD suites, built on a new 1.1M-step public corpus, reports F1 scores up to 0.906 and estimates 893k eliminable step occurrences corpus-wide.
-
Modeling Sampling Workflows for Code Repositories
A composable DSL for describing sampling workflows on code repositories enables explicit specification and statistical reasoning about the generalizability of empirical software engineering findings.
-
State-Of-The-Practice in Quality Assurance in Java-Based Open Source Software Development
Empirical study of 1,454 Java OSS projects finds weak correlations among quality assurance practices and greater intensity in mature projects for ASAT and code review but not CI.
-
Analyzing the Adoption of Database Management Systems Throughout the History of Open Source Projects
Study of 362 Java projects finds MySQL and PostgreSQL dominate relational use while Redis and MongoDB lead non-relational, with frequent multi-DBM co-use and ORM mediation.
-
How Do Developers Use Migration Guides? A Case Study of Log4j
Developers most frequently reference the full Log4j migration guide in pull request descriptions (82.81% of cases) and continue consulting it during post-update maintenance tasks.