arXiv preprint arXiv:2412.05934

Heuristic-induced multimodal risk distribution jailbreak attack for multimodal large language models · 2021 · arXiv 2412.05934

2 Pith papers cite this work. Polarity classification is still indexing.


representative citing papers

PRISM: Programmatic Reasoning with Image Sequence Manipulation for LVLM Jailbreaking

cs.CR · 2025-07-29 · unverdicted · novelty 6.0

PRISM decomposes harmful instructions into benign visual gadgets and uses prompts to direct LVLMs to compose them through reasoning into harmful outputs, achieving an attack success rate (ASR) above 0.90 on SafeBench.

One Shot Dominance: Knowledge Poisoning Attack on Retrieval-Augmented Generation Systems

cs.CR · 2025-05-15 · unverdicted · novelty 6.0

AuthChain poisons a single document to achieve high-success attacks on RAG systems for multi-hop queries across six LLMs while evading defenses.

citing papers explorer

Showing 2 of 2 citing papers.

PRISM: Programmatic Reasoning with Image Sequence Manipulation for LVLM Jailbreaking

cs.CR · 2025-07-29 · unverdicted · none · ref 34

PRISM decomposes harmful instructions into benign visual gadgets and directs LVLMs via prompts to compose them through reasoning into harmful outputs, achieving ASR over 0.90 on SafeBench.

One Shot Dominance: Knowledge Poisoning Attack on Retrieval-Augmented Generation Systems

cs.CR · 2025-05-15 · unverdicted · none · ref 7

AuthChain poisons a single document to achieve high-success attacks on RAG systems for multi-hop queries across six LLMs while evading defenses.
