Salami Attack chains low-risk inputs to cumulatively trigger high-risk LLM behaviors, achieving over 90% success on GPT-4o and Gemini while resisting some defenses.
Jailbreaking Text-to-Image Models with LLM-Based Agents
1 Pith paper cites this work. Polarity classification is still indexing.
fields: cs.CR
years: 2026
verdicts: UNVERDICTED
The Salami Slicing Threat: Exploiting Cumulative Risks in LLM Systems