EM-Assist: Safe Automated ExtractMethod Refactoring with LLMs

Yinghao Chen, Zehao Hu, Chen Zhi, Junxiao Han, Shuiguang Deng, Jianwei Yin · 2024 · arXiv 3529.366380

8 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background 3

citation-polarity summary

support 2 background 1

representative citing papers

ConCovUp: Effective Agent-Based Test Driver Generation for Concurrency Testing

cs.SE · 2026-05-10 · unverdicted · novelty 7.0

ConCovUp uses static analysis to ground LLM test generation and backward tracing to produce concurrent test drivers that raise average shared-memory access pair coverage from 36.6% to 68.1% on nine real-world libraries.

Do AI Models Dream of Faster Code? An Empirical Study on LLM-Proposed Performance Improvements in Real-World Software

cs.SE · 2025-10-17 · unverdicted · novelty 7.0

LLMs propose volatile performance improvements on real-world Java tasks that lag human developers on average, showing algorithmic benchmarks overestimate capabilities.

Program Structure-aware Language Models: Targeted Software Testing beyond Textual Semantics

cs.SE · 2026-04-20 · unverdicted · novelty 6.0

GLMTest integrates code property graphs and GNNs with LLMs to steer test case generation toward targeted branches, raising branch accuracy from 27.4% to 50.2% on the TestGenEval benchmark.

Learned or Memorized ? Quantifying Memorization Advantage in Code LLMs

cs.SE · 2026-04-15 · unverdicted · novelty 6.0

A perturbation method shows memorization advantage in code LLMs varies widely by model and task, remaining low on CVEFixes and Defects4J benchmarks.

CoCoMUT: A Tool for Code-Context Mining and Automated Dataset Generation

cs.SE · 2026-06-30 · unverdicted · novelty 5.0

CoCoMUT is a reusable pipeline that discovers project structure, constructs call graphs, extracts source, reconciles bytecode to source, and emits versioned JSON datasets of method contexts, demonstrated on 20 Java repositories with 97.8% reconciliation and 99% audit accuracy.

AI-Assisted Unit Test Writing and Test-Driven Code Refactoring: A Case Study

cs.SE · 2026-04-03 · conditional · novelty 5.0

AI models generated nearly 16,000 lines of unit tests in hours and enabled safe large-scale refactoring with up to 78% branch coverage in a case study.

A Blueprint for AI-Driven Software Quality: Integrating LLMs with Established Standards

cs.SE · 2025-05-19 · unverdicted · novelty 3.0

Survey mapping LLM applications in software quality assurance to established standards including ISO/IEC 12207, ISO 25010, CMMI, and TMM, with case studies, challenges, and future directions.

To Vibe Research or Not to Vibe Research? Generative AI in Qualitative Research

cs.SE · 2026-04-30 · unverdicted · novelty 2.0

Generative AI suitability in qualitative research depends primarily on the approach (small-q positivist/post-positivist or Big Q non-positivist) along with skills, ethics, and personal preferences.

citing papers explorer

Showing 7 of 7 citing papers after filters.

ConCovUp: Effective Agent-Based Test Driver Generation for Concurrency Testing · cs.SE · 2026-05-10 · unverdicted · none · ref 4
Do AI Models Dream of Faster Code? An Empirical Study on LLM-Proposed Performance Improvements in Real-World Software · cs.SE · 2025-10-17 · unverdicted · none · ref 11
Program Structure-aware Language Models: Targeted Software Testing beyond Textual Semantics · cs.SE · 2026-04-20 · unverdicted · none · ref 23
Learned or Memorized ? Quantifying Memorization Advantage in Code LLMs · cs.SE · 2026-04-15 · unverdicted · none · ref 2
CoCoMUT: A Tool for Code-Context Mining and Automated Dataset Generation · cs.SE · 2026-06-30 · unverdicted · none · ref 4
A Blueprint for AI-Driven Software Quality: Integrating LLMs with Established Standards · cs.SE · 2025-05-19 · unverdicted · none · ref 55
To Vibe Research or Not to Vibe Research? Generative AI in Qualitative Research · cs.SE · 2026-04-30 · unverdicted · none · ref 249
