hub

TOSEM https: //arxiv.org/abs/2509.14745, forthcoming

Miku Watanabe, Hao Li, Yutaro Kashiwa, Brittany Reid, Hajimu Iida, Ahmed E · 2025 · arXiv 2509.14745

13 Pith papers cite this work. Polarity classification is still indexing.

13 Pith papers citing it

read on arXiv browse 13 citing papers

hub tools

JSON dossier citing papers JSON arXiv source

citation-role summary

background 2

citation-polarity summary

background 1 support 1

representative citing papers

A Dataset of Agentic AI Coding Tool Configurations

cs.SE · 2026-05-08 · accept · novelty 8.0

A publicly released dataset of 15,591 configuration artifacts for five agentic AI coding tools, drawn from 4,738 GitHub repositories along with associated files and AI-co-authored commits.

Do AI Coding Agents Log Like Humans? An Empirical Study

cs.SE · 2026-04-10 · unverdicted · novelty 7.0

AI agents modify logging less often than humans in 58.4% of repositories but produce higher log density when they change it; explicit logging instructions are rare (4.7%) and ignored 67% of the time, with humans performing 72.5% of post-generation log repairs.

AgenticFlict: A Large-Scale Dataset of Merge Conflicts in AI Coding Agent Pull Requests on GitHub

cs.SE · 2026-04-04 · accept · novelty 7.0

AgenticFlict is a public dataset of 29K+ textual merge conflicts from AI agent PRs, collected via merge simulation on 107K processed PRs and showing a 27.67% conflict rate with variation across agents.

How AI Coding Agents Modify Code: A Large-Scale Study of GitHub Pull Requests

cs.SE · 2026-01-24 · unverdicted · novelty 7.0

AI coding agents produce pull requests with substantially more commits and slightly higher description-to-diff similarity than human developers, based on analysis of 29,095 merged PRs.

Why Are Agentic Pull Requests Merged or Rejected? An Empirical Study

cs.SE · 2026-05-21 · unverdicted · novelty 6.0

Analysis of 9,799 human-reviewed agentic PRs shows only 35.7% of rejections reflect clear agent failures, with 31.2% due to workflow constraints and 33.1% lacking clear rationale, plus notable interaction differences across agents.

To What Extent Does Agent-generated Code Require Maintenance? An Empirical Study

cs.SE · 2026-05-07 · unverdicted · novelty 6.0 · 2 refs

AI-generated code requires less maintenance than human-written code, mostly involving feature additions by humans rather than bug fixes.

Hot Fixing in the Wild

cs.SE · 2026-04-29 · unverdicted · novelty 6.0

Hot fixes show urgency patterns with reduced collaboration and testing, differing from regular fixes, and human versus AI agents display over 10 distinct repair behaviors in large-scale GitHub data.

On the Footprints of Reviewer Bots Feedback on Agentic Pull Requests in OSS GitHub Repositories

cs.SE · 2026-04-27 · unverdicted · novelty 6.0

Reviewer bots' higher comment volume on AI agent PRs is associated with slower resolutions and poorer average feedback quality, while feedback quality itself has no association with PR outcomes.

Insights into Security-Related AI-Generated Pull Requests

cs.SE · 2026-04-21 · unverdicted · novelty 6.0

AI-generated security pull requests frequently contain a small set of recurring weaknesses, with many flawed ones merged and rejections driven by process factors rather than technical issues.

Debt Behind the AI Boom: A Large-Scale Empirical Study of AI-Generated Code in the Wild

cs.SE · 2026-03-30 · unverdicted · novelty 6.0

AI coding assistants introduce code issues that persist in 22.7% of cases across real projects, creating measurable long-term technical debt.

Rethinking Code Review in the Age of AI: A Vision for Agentic Code Review

cs.SE · 2026-05-17 · unverdicted · novelty 5.0

The paper presents a vision for an agentic code review framework spanning PR Creation, Augmentation, Reviewer Selection, AI-Assisted Review, and Retrospective, with humans retained at quality gates.

From Industry Claims to Empirical Reality: An Empirical Study of Code Review Agents in Pull Requests

cs.SE · 2026-04-03 · conditional · novelty 5.0

Code review agents achieve 45.20% merge rate on PRs versus 68.37% for humans, with 60.2% of agent-only closed PRs showing 0-30% signal quality.

Agentic Agile-V: From Vibe Coding to Verified Engineering in Software and Hardware Development

cs.SE · 2026-05-19 · unverdicted · novelty 4.0

Agentic Agile-V uses Agile-V as backbone and a Specify-Constrain-Orchestrate-Prove-Evolve-Verify loop to convert AI agent conversations into traceable engineering artifacts with acceptance evidence.

citing papers explorer

Showing 13 of 13 citing papers.

A Dataset of Agentic AI Coding Tool Configurations cs.SE · 2026-05-08 · accept · none · ref 19
A publicly released dataset of 15,591 configuration artifacts for five agentic AI coding tools, drawn from 4,738 GitHub repositories along with associated files and AI-co-authored commits.
Do AI Coding Agents Log Like Humans? An Empirical Study cs.SE · 2026-04-10 · unverdicted · none · ref 33
AI agents modify logging less often than humans in 58.4% of repositories but produce higher log density when they change it; explicit logging instructions are rare (4.7%) and ignored 67% of the time, with humans performing 72.5% of post-generation log repairs.
AgenticFlict: A Large-Scale Dataset of Merge Conflicts in AI Coding Agent Pull Requests on GitHub cs.SE · 2026-04-04 · accept · none · ref 58
AgenticFlict is a public dataset of 29K+ textual merge conflicts from AI agent PRs, collected via merge simulation on 107K processed PRs and showing a 27.67% conflict rate with variation across agents.
How AI Coding Agents Modify Code: A Large-Scale Study of GitHub Pull Requests cs.SE · 2026-01-24 · unverdicted · none · ref 42
AI coding agents produce pull requests with substantially more commits and slightly higher description-to-diff similarity than human developers, based on analysis of 29,095 merged PRs.
Why Are Agentic Pull Requests Merged or Rejected? An Empirical Study cs.SE · 2026-05-21 · unverdicted · none · ref 19
Analysis of 9,799 human-reviewed agentic PRs shows only 35.7% of rejections reflect clear agent failures, with 31.2% due to workflow constraints and 33.1% lacking clear rationale, plus notable interaction differences across agents.
To What Extent Does Agent-generated Code Require Maintenance? An Empirical Study cs.SE · 2026-05-07 · unverdicted · none · ref 26 · 2 links
AI-generated code requires less maintenance than human-written code, mostly involving feature additions by humans rather than bug fixes.
Hot Fixing in the Wild cs.SE · 2026-04-29 · unverdicted · none · ref 21
Hot fixes show urgency patterns with reduced collaboration and testing, differing from regular fixes, and human versus AI agents display over 10 distinct repair behaviors in large-scale GitHub data.
On the Footprints of Reviewer Bots Feedback on Agentic Pull Requests in OSS GitHub Repositories cs.SE · 2026-04-27 · unverdicted · none · ref 20
Reviewer bots' higher comment volume on AI agent PRs is associated with slower resolutions and poorer average feedback quality, while feedback quality itself has no association with PR outcomes.
Insights into Security-Related AI-Generated Pull Requests cs.SE · 2026-04-21 · unverdicted · none · ref 9
AI-generated security pull requests frequently contain a small set of recurring weaknesses, with many flawed ones merged and rejections driven by process factors rather than technical issues.
Debt Behind the AI Boom: A Large-Scale Empirical Study of AI-Generated Code in the Wild cs.SE · 2026-03-30 · unverdicted · none · ref 16
AI coding assistants introduce code issues that persist in 22.7% of cases across real projects, creating measurable long-term technical debt.
Rethinking Code Review in the Age of AI: A Vision for Agentic Code Review cs.SE · 2026-05-17 · unverdicted · none · ref 144
The paper presents a vision for an agentic code review framework spanning PR Creation, Augmentation, Reviewer Selection, AI-Assisted Review, and Retrospective, with humans retained at quality gates.
From Industry Claims to Empirical Reality: An Empirical Study of Code Review Agents in Pull Requests cs.SE · 2026-04-03 · conditional · none · ref 19
Code review agents achieve 45.20% merge rate on PRs versus 68.37% for humans, with 60.2% of agent-only closed PRs showing 0-30% signal quality.
Agentic Agile-V: From Vibe Coding to Verified Engineering in Software and Hardware Development cs.SE · 2026-05-19 · unverdicted · none · ref 10
Agentic Agile-V uses Agile-V as backbone and a Specify-Constrain-Orchestrate-Prove-Evolve-Verify loop to convert AI agent conversations into traceable engineering artifacts with acceptance evidence.

TOSEM https: //arxiv.org/abs/2509.14745, forthcoming

hub tools

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer