Insights and Current Gaps in Open-Source LLM Vulnerability Scan- ners: A Comparative Analysis

Waqar Hussain · 2025 · arXiv 6699.2025

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

read on arXiv browse 3 citing papers

citation-role summary

background 2

citation-polarity summary

background 2

representative citing papers

AVISE: Framework for Evaluating the Security of AI Systems

cs.CR · 2026-04-22 · unverdicted · novelty 6.0

AVISE provides a new framework and automated SET that identifies jailbreak vulnerabilities in language models with 92% accuracy, finding all nine tested models vulnerable to an augmented Red Queen attack.

A Case Study on the Impact of Anonymization Along the RAG Pipeline

cs.CR · 2026-04-17 · unverdicted · novelty 6.0

Anonymization placement in RAG—at the dataset or at the generated answer—creates observable differences in privacy protection versus response utility.

Reliability of AI Bots Footprints in GitHub Actions CI/CD Workflows

cs.SE · 2026-04-20 · unverdicted · novelty 5.0

Large-scale analysis of AI bot PRs shows Copilot and Codex achieve the highest CI/CD success rates but more frequent AI contributions correlate with reduced workflow reliability.

citing papers explorer

Showing 3 of 3 citing papers.

AVISE: Framework for Evaluating the Security of AI Systems cs.CR · 2026-04-22 · unverdicted · none · ref 35
AVISE provides a new framework and automated SET that identifies jailbreak vulnerabilities in language models with 92% accuracy, finding all nine tested models vulnerable to an augmented Red Queen attack.
A Case Study on the Impact of Anonymization Along the RAG Pipeline cs.CR · 2026-04-17 · unverdicted · none · ref 15
Anonymization placement in RAG—at the dataset or at the generated answer—creates observable differences in privacy protection versus response utility.
Reliability of AI Bots Footprints in GitHub Actions CI/CD Workflows cs.SE · 2026-04-20 · unverdicted · none · ref 10
Large-scale analysis of AI bot PRs shows Copilot and Codex achieve the highest CI/CD success rates but more frequent AI contributions correlate with reduced workflow reliability.

Insights and Current Gaps in Open-Source LLM Vulnerability Scan- ners: A Comparative Analysis

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer