Tree of attacks: Jailbreaking black-box LLMs automatically

Anay Mehrotra, Manolis Zampetakis, Paul Kassianik, Blaine Nelson, Hyrum Anderson, Y aron Singer, Amin Karbasi · 2024

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

The Great Pretender: A Stochasticity Problem in LLM Jailbreak

cs.CR · 2026-05-14 · conditional · novelty 6.0

ASR metrics for LLM jailbreaks are inflated by stochasticity; CAS-eval reveals up to 30pp drops under multi-attempt criteria while CAS-gen recovers the performance loss.

citing papers explorer

Showing 1 of 1 citing paper.

The Great Pretender: A Stochasticity Problem in LLM Jailbreak cs.CR · 2026-05-14 · conditional · none · ref 4
ASR metrics for LLM jailbreaks are inflated by stochasticity; CAS-eval reveals up to 30pp drops under multi-attempt criteria while CAS-gen recovers the performance loss.

Tree of attacks: Jailbreaking black-box LLMs automatically

fields

years

verdicts

representative citing papers

citing papers explorer