In: Proceedings of the 17th International Conference on Mining Software Repositories

Jiahao Fan, Yi Li, Shaohua Wang, Tien N · 2020 · arXiv 9597.338750

10 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

background: 1 · dataset: 1

citation-polarity summary

background: 1 · use dataset: 1

representative citing papers

BioDefect: The First Dataset for Defect Detection in Bioinformatics Software

cs.SE · 2026-05-20 · unverdicted · novelty 7.0

BioDefect is a new dataset for defect detection in bioinformatics software that improves average F1-scores by 29.61% to 38.04% over existing datasets when evaluated on nine language models.

VulKey: Automated Vulnerability Repair Guided by Domain-Specific Repair Patterns

cs.CR · 2026-05-03 · unverdicted · novelty 7.0 · 2 refs

VulKey introduces hierarchical expert knowledge abstractions to guide LLMs in vulnerability repair, reporting 31.5% accuracy on PrimeVul (7.6% above best baseline) and strong results on Vul4J.

CrossCommitVuln-Bench: A Dataset of Multi-Commit Python Vulnerabilities Invisible to Per-Commit Static Analysis

cs.CR · 2026-04-23 · conditional · novelty 7.0

CrossCommitVuln-Bench shows that 87% of 15 multi-commit Python CVEs are invisible to per-commit static analysis, with only 13% detection rate.

Dissecting the Black Box: Circuit-Level Analysis of LLM Vulnerability Detection

cs.CR · 2026-05-28 · unverdicted · novelty 6.0

LLM vulnerability detection in Gemma-2-2b relies on sparse safety-detector circuits in early layers rather than direct vulnerability signatures, identified via circuit tracing and ablation on 472 C/C++ samples.

PromptAudit: Auditing Prompt Sensitivity in LLM-Based Vulnerability Detection

cs.LG · 2026-05-22 · unverdicted · novelty 6.0

PromptAudit evaluates five prompting strategies across five LLMs on 1000 CVEs and finds chain-of-thought prompting yields the strongest overall performance while adaptive chain-of-thought and self-consistency reduce effective results.

SAGE: Signal-Amplified Guided Embeddings for LLM-based Vulnerability Detection

cs.CR · 2026-04-21 · unverdicted · novelty 6.0

SAGE uses sparse autoencoders to boost vulnerability signals in LLMs, raising internal SNR 12.7x and delivering up to 318% MCC gains on vulnerability detection benchmarks.

UntrustVul: An Automated Approach for Identifying Untrustworthy Alerts in Vulnerability Detection Models

cs.SE · 2025-03-19 · unverdicted · novelty 6.0

UntrustVul identifies untrustworthy vulnerability predictions by marking lines that neither match historical vulnerability patterns nor influence vulnerable lines through dependencies, reporting AUC 70-88% and F1 82-94% on 115K predictions.

Reinforcement Learning for Software Vulnerability Analysis: A Systematic Review with Emphasis on C/C++ Source Code and Static Analysis

cs.SE · 2026-06-24 · unverdicted · novelty 4.0

A PRISMA-guided review of 21 papers shows RL work on C/C++ vulnerabilities focuses on fuzzing rather than detection or localization, proposes a taxonomy, and flags the lack of CFG-based state representations for vulnerable node identification.

Benchmarking Mythos-Linked Bug Rediscovery

cs.SE · 2026-05-17 · unverdicted · novelty 4.0

A benchmarking experiment finds low rediscovery rates for three models on six Mythos-linked bug tasks, with only six target matches across 54 attempts under controlled prompting.

HYDRA: A Hybrid Heuristic-Guided Deep Representation Architecture for Predicting Latent Zero-Day Vulnerabilities in Patched Functions

cs.CR · 2025-11-09 · unverdicted · novelty 4.0

HYDRA is a hybrid model that uses heuristics plus deep embeddings and a VAE to predict latent zero-day vulnerabilities in patched functions from Chrome, Android, and ImageMagick.

