Automating code review activities by large-scale pre-training

Deng, Y · 2022 · arXiv 0250.354908

8 Pith papers cite this work. Polarity classification is still indexing.


citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

The Correctness Illusion in LLM-Generated GPU Kernels

cs.SE · 2026-06-18 · accept · novelty 7.0

Controlled corpus testing shows that fixed allclose oracles in LLM kernel benchmarks certify transcription-buggy kernels as correct while seeded fuzzing with fp64 references does not.

Choose Your Own Adventure: Non-Linear AI-Assisted Programming with EvoGraph

cs.HC · 2026-04-20 · unverdicted · novelty 7.0

EvoGraph turns linear AI-assisted programming into a manipulable graph of branching histories, reducing cognitive load and enabling better iteration according to a user study with 20 developers.

Measuring and Exploiting Contextual Bias in LLM-Assisted Security Code Review

cs.SE · 2026-03-19 · accept · novelty 7.0

LLM-based security code review is vulnerable to framing bias, with a novel iterative refinement attack achieving 100% success in reintroducing vulnerabilities across real projects.

Habituation at the Gate: Rising Approval and Declining Scrutiny in Human Review of AI Agent Code

cs.SE · 2026-06-21 · unverdicted · novelty 6.0

Within-reviewer analysis of 11,429 reviews shows AI code approval rising from 30.1% to 36.8% with experience, with reduced inline comments and increased latency, consistent with habituation.

AgentModernize: Preserving Business Logic in Legacy Modernization with Multi-Agent LLMs and Behavioral Specification Graphs

cs.SE · 2026-05-17 · unverdicted · novelty 6.0

A multi-agent LLM framework with Behavioral Specification Graphs preserves business logic in legacy modernization, achieving non-zero mean BER on all tested scenarios where baseline LLM approaches scored zero.

Guiding Symbolic Execution with Static Analysis and LLMs for Vulnerability Discovery

cs.CR · 2026-04-07 · unverdicted · novelty 6.0

SAILOR combines static analysis and LLM-orchestrated synthesis to automatically generate symbolic execution harnesses, discovering 379 previously unknown memory-safety vulnerabilities across 10 large open-source C/C++ projects where the strongest baseline found only 12.

Test-Input Generation for Tensor Programs: What Actually Finds Kernel Bugs

cs.SE · 2026-06-23 · unverdicted · novelty 5.0

Boundary shape sampling for tensor kernel testing achieves 78% recall on seeded bugs with 0% false positives on correct kernels, while adversarial value sampling reaches 99% recall at the cost of 94% false positives.

Mapping NVD Records to Their Vulnerability-fixing Commits: How Hard is It?

cs.SE · 2025-06-11 · accept · novelty 4.0

Empirical study finds Git references enable over 86% success in mapping NVD records to vulnerability-fixing commits while non-Git references succeed under 14%, yielding an automated pipeline and external mining that together cover only 11.3% of records at 87% precision.

