Dialogue Injection Attack: Jailbreaking LLMs through Context Manipulation. arXiv preprint arXiv:2503.08195, 2025

Wenlong Meng, Fan Zhang, Wendao Yao, Zhenyuan Guo, Yuwei Li, Chengkun Wei, Wenzhi Chen · 2025 · arXiv 2503.08195

1 Pith paper cites this work. Polarity classification is still indexing.


representative citing papers

Persona Attack: Incremental Memory Injection Jailbreak Attack against Large Language Models

cs.CR · 2026-05-29 · unverdicted · novelty 7.0

Persona Attack uses step-by-step memory injections to achieve up to 95% success in making LLMs ignore safety alignments, with effectiveness depending on model memory and instruction combinations.

citing papers explorer


Persona Attack: Incremental Memory Injection Jailbreak Attack against Large Language Models · cs.CR · 2026-05-29 · unverdicted · none · ref 16
