CVEfixes: automated collection of vulnerabilities and their fixes from open-source software

Guru Bhandari, Amara Naseer, Leon Moonen · 2021 · arXiv 5960.347598

9 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background: 2 · dataset: 1

citation-polarity summary

background: 2 · use dataset: 1

representative citing papers

ASSEMBLAGE-DEEPHISTORY: A Cross-Build Binary Dataset with Temporal Coverage

cs.CR · 2026-05-20 · unverdicted · novelty 7.0

A new queryable binary dataset combining cross-build diversity, temporal history, and CVE labels with linked metadata for vulnerability research.

Longitudinal Analyses of SAST Tools: A CodeQL Case Study

cs.CR · 2026-05-08 · unverdicted · novelty 7.0

CodeQL detected 171 CVEs total, with 83 caught by a prior version before the fix; detections were often actionable within the vulnerable file but not stable across tool versions.

VulKey: Automated Vulnerability Repair Guided by Domain-Specific Repair Patterns

cs.CR · 2026-05-03 · unverdicted · novelty 7.0 · 2 refs

VulKey introduces hierarchical expert knowledge abstractions to guide LLMs in vulnerability repair, reporting 31.5% accuracy on PrimeVul (7.6% above best baseline) and strong results on Vul4J.

CrossCommitVuln-Bench: A Dataset of Multi-Commit Python Vulnerabilities Invisible to Per-Commit Static Analysis

cs.CR · 2026-04-23 · conditional · novelty 7.0

CrossCommitVuln-Bench shows that 87% of 15 multi-commit Python CVEs are invisible to per-commit static analysis, a detection rate of only 13%.

PromptAudit: Auditing Prompt Sensitivity in LLM-Based Vulnerability Detection

cs.LG · 2026-05-22 · unverdicted · novelty 6.0

PromptAudit evaluates five prompting strategies across five LLMs on 1000 CVEs and finds chain-of-thought prompting yields the strongest overall performance while adaptive chain-of-thought and self-consistency reduce effective results.

Beyond Crash-to-Patch: Patch Evolution for Linux Kernel Repair

cs.SE · 2026-04-04 · unverdicted · novelty 6.0

Reconstructing 6946 syzbot bug-fix lifecycles reveals that accepted kernel patches are non-local and reviewer-constrained, enabling PatchAdvisor to improve automated repair quality over baselines via retrieval and diagnostic guidance.

RAVEN: Agentic RAG for Automated Vulnerability Repair

cs.CR · 2026-06-21 · unverdicted · novelty 5.0

RAVEN combines agentic RAG, iterative repair, and a cross-file Curator Agent to achieve 83.13% repair success on diverse real-world CVEs using local open-source LLMs.

Benchmarking Mythos-Linked Bug Rediscovery

cs.SE · 2026-05-17 · unverdicted · novelty 4.0

A benchmarking experiment finds low rediscovery rates for three models on six Mythos-linked bug tasks, with only six target matches across 54 attempts under controlled prompting.

How Humans, Bots, and Agents Communicate About Vulnerabilities in Pull Requests

cs.SE · 2026-06-26 · unverdicted · novelty 2.0

The authors present a registered report outlining their planned large-scale empirical study of vulnerability communication in pull requests by different account types.
