Gentel-safe: A uni- fied benchmark and shielding framework for defend- ing against prompt injection attacks

· 2024 · arXiv 2409.19521

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Gate AI: LLM Security Benchmark Evaluation Methodology and Results

cs.LG · 2026-06-01 · unverdicted · novelty 7.0

Introduces a cross-validation-based evaluation methodology for LLM security detectors using a global threshold and group-fold leakage checks to avoid per-dataset tuning.

Progent: Securing AI Agents with Privilege Control

cs.CR · 2025-04-16 · unverdicted · novelty 6.0

Progent introduces a privilege-control framework for AI agents that uses LLM-generated symbolic rules over tools, SMT-solver-enforced monotonic updates, and deterministic checks to reduce attack success rates on AgentDojo and ASB benchmarks.

citing papers explorer

Showing 2 of 2 citing papers after filters.

Gate AI: LLM Security Benchmark Evaluation Methodology and Results cs.LG · 2026-06-01 · unverdicted · none · ref 18
Introduces a cross-validation-based evaluation methodology for LLM security detectors using a global threshold and group-fold leakage checks to avoid per-dataset tuning.
Progent: Securing AI Agents with Privilege Control cs.CR · 2025-04-16 · unverdicted · none · ref 38
Progent introduces a privilege-control framework for AI agents that uses LLM-generated symbolic rules over tools, SMT-solver-enforced monotonic updates, and deterministic checks to reduce attack success rates on AgentDojo and ASB benchmarks.

Gentel-safe: A uni- fied benchmark and shielding framework for defend- ing against prompt injection attacks

fields

years

verdicts

representative citing papers

citing papers explorer