{harm_instruction}

Preserve the core meaning: Ensure that the new prompt still conveys the intent of "{harm_instruction}"

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

EvoJail: Evolutionary Diverse Jailbreak Prompt Generation for Large Language Models

cs.NE · 2026-04-22 · unverdicted · novelty 6.0

EvoJail uses evolutionary algorithms with field-aware instruction fusion and multi-level mutations to generate adaptable, diverse jailbreak prompts for LLMs, claiming over 93% attack success rate and 5.6% diversity gains over prior methods.

EvoJail: Evolutionary Diverse Jailbreak Prompt Generation for Large Language Models · cs.NE · 2026-04-22 · unverdicted · none · ref 25
