Insights and Current Gaps in Open-Source LLM Vulnerability Scan- ners: A Comparative Analysis

Yifeng He, Ethan Wang, Yuyang Rong, Zifei Cheng, Hao Chen · 2025 · arXiv 6699.2025

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

read on arXiv browse 4 citing papers

citation-role summary

background 3

citation-polarity summary

background 3

representative citing papers

AVISE: Framework for Evaluating the Security of AI Systems

cs.CR · 2026-04-22 · unverdicted · novelty 6.0

AVISE provides a new framework and automated SET that identifies jailbreak vulnerabilities in language models with 92% accuracy, finding all nine tested models vulnerable to an augmented Red Queen attack.

A Case Study on the Impact of Anonymization Along the RAG Pipeline

cs.CR · 2026-04-17 · unverdicted · novelty 6.0

Anonymization placement in RAG—at the dataset or at the generated answer—creates observable differences in privacy protection versus response utility.

Reliability of AI Bots Footprints in GitHub Actions CI/CD Workflows

cs.SE · 2026-04-20 · unverdicted · novelty 5.0

Large-scale analysis of AI bot PRs shows Copilot and Codex achieve the highest CI/CD success rates but more frequent AI contributions correlate with reduced workflow reliability.

Toward Secure LLM Agents: Threat Surfaces, Attacks, Defenses, and Evaluation

cs.CR · 2026-06-09 · unverdicted · novelty 3.0

A synthesis of 247 papers on LLM agent security identifies prompt injection and tool hijacking as dominant threats, notes weakly compositional defenses, and argues for trust boundaries and realistic evaluations.

citing papers explorer

Showing 4 of 4 citing papers.

AVISE: Framework for Evaluating the Security of AI Systems cs.CR · 2026-04-22 · unverdicted · none · ref 35
AVISE provides a new framework and automated SET that identifies jailbreak vulnerabilities in language models with 92% accuracy, finding all nine tested models vulnerable to an augmented Red Queen attack.
A Case Study on the Impact of Anonymization Along the RAG Pipeline cs.CR · 2026-04-17 · unverdicted · none · ref 15
Anonymization placement in RAG—at the dataset or at the generated answer—creates observable differences in privacy protection versus response utility.
Reliability of AI Bots Footprints in GitHub Actions CI/CD Workflows cs.SE · 2026-04-20 · unverdicted · none · ref 10
Large-scale analysis of AI bot PRs shows Copilot and Codex achieve the highest CI/CD success rates but more frequent AI contributions correlate with reduced workflow reliability.
Toward Secure LLM Agents: Threat Surfaces, Attacks, Defenses, and Evaluation cs.CR · 2026-06-09 · unverdicted · none · ref 59
A synthesis of 247 papers on LLM agent security identifies prompt injection and tool hijacking as dominant threats, notes weakly compositional defenses, and argues for trust boundaries and realistic evaluations.

Insights and Current Gaps in Open-Source LLM Vulnerability Scan- ners: A Comparative Analysis

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer