AIDev: Studying AI Coding Agents on GitHub

· 2026 · arXiv 2602.09185

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

read on arXiv browse 8 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

A Dataset of Agentic AI Coding Tool Configurations

cs.SE · 2026-05-08 · accept · novelty 8.0

A publicly released dataset of 15,591 configuration artifacts for five agentic AI coding tools, drawn from 4,738 GitHub repositories along with associated files and AI-co-authored commits.

Govern the Repository, Not the Agent: Measuring Ecosystem-Level Risk in AI-Native Software

cs.SE · 2026-06-26 · unverdicted · novelty 6.0

Study of 930k+ agent PRs shows repository explains ~50% of integration friction variance, with agents concentrating it twice as much as humans (ICC 0.30 vs 0.16) after controls.

All Smoke, No Alarm: Oracle Signals in Agent-Authored Test Code

cs.SE · 2026-06-16 · unverdicted · novelty 6.0

An empirical study of 86,156 test patches from five AI agents finds 80.2% lack strong oracle signals, with strong oracles linked to higher merge rates (OR=1.28) after regression controls.

Software Delegation Contracts: Measuring Reviewability in AI Coding-Agent Work

cs.SE · 2026-06-14 · unverdicted · novelty 6.0

Explicit delegation contracts improve reviewability metrics for AI coding agents without changing objective correctness in a 64-run pilot study.

From Assistance to Agency: Rethinking Autonomy and Control in CI/CD Pipelines

cs.SE · 2026-05-08 · unverdicted · novelty 5.0

The central challenge in AI-augmented CI/CD is designing authority transfer from humans to agents under constraints, as current systems remain limited to bounded data-plane autonomy backed by external governance.

Position: Coding Benchmarks Are Misaligned with Agentic Software Engineering

cs.SE · 2026-06-16 · unverdicted · novelty 4.0

Coding benchmarks misalign with agentic software engineering because they conflate model and harness, grade against single references, and provide no component-level iteration signals.

Quality and Security Signals in AI-Generated Python Refactoring Pull Requests

cs.SE · 2026-05-20 · unverdicted · novelty 4.0

Empirical analysis of AI refactoring PRs shows quality attribute improvements in 22.5% of cases with new Pylint issues in 24.17% and Bandit findings in 4.7%, yet 73.5% developer acceptance.

Agentic Agile-V: From Vibe Coding to Verified Engineering in Software and Hardware Development

cs.SE · 2026-05-19 · unverdicted · novelty 4.0

Agentic Agile-V uses Agile-V as backbone and a Specify-Constrain-Orchestrate-Prove-Evolve-Verify loop to convert AI agent conversations into traceable engineering artifacts with acceptance evidence.

citing papers explorer

Showing 7 of 7 citing papers after filters.

Govern the Repository, Not the Agent: Measuring Ecosystem-Level Risk in AI-Native Software cs.SE · 2026-06-26 · unverdicted · none · ref 2
Study of 930k+ agent PRs shows repository explains ~50% of integration friction variance, with agents concentrating it twice as much as humans (ICC 0.30 vs 0.16) after controls.
All Smoke, No Alarm: Oracle Signals in Agent-Authored Test Code cs.SE · 2026-06-16 · unverdicted · none · ref 14
An empirical study of 86,156 test patches from five AI agents finds 80.2% lack strong oracle signals, with strong oracles linked to higher merge rates (OR=1.28) after regression controls.
Software Delegation Contracts: Measuring Reviewability in AI Coding-Agent Work cs.SE · 2026-06-14 · unverdicted · none · ref 11
Explicit delegation contracts improve reviewability metrics for AI coding agents without changing objective correctness in a 64-run pilot study.
From Assistance to Agency: Rethinking Autonomy and Control in CI/CD Pipelines cs.SE · 2026-05-08 · unverdicted · none · ref 27
The central challenge in AI-augmented CI/CD is designing authority transfer from humans to agents under constraints, as current systems remain limited to bounded data-plane autonomy backed by external governance.
Position: Coding Benchmarks Are Misaligned with Agentic Software Engineering cs.SE · 2026-06-16 · unverdicted · none · ref 24
Coding benchmarks misalign with agentic software engineering because they conflate model and harness, grade against single references, and provide no component-level iteration signals.
Quality and Security Signals in AI-Generated Python Refactoring Pull Requests cs.SE · 2026-05-20 · unverdicted · none · ref 19
Empirical analysis of AI refactoring PRs shows quality attribute improvements in 22.5% of cases with new Pylint issues in 24.17% and Bandit findings in 4.7%, yet 73.5% developer acceptance.
Agentic Agile-V: From Vibe Coding to Verified Engineering in Software and Hardware Development cs.SE · 2026-05-19 · unverdicted · none · ref 8
Agentic Agile-V uses Agile-V as backbone and a Specify-Constrain-Orchestrate-Prove-Evolve-Verify loop to convert AI agent conversations into traceable engineering artifacts with acceptance evidence.

AIDev: Studying AI Coding Agents on GitHub

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer