Title resolution pending

demonstrated that optimizing visual adversarial examples can bypass textual safety filters, effectively acting as a "visual key" to unlock harmful model behaviors · 2024

1 Pith paper cites this work. Polarity classification is still being indexed.

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

Making MLLMs Blind: Adversarial Smuggling Attacks in MLLM Content Moderation

cs.CV · 2026-04-08 · unverdicted · novelty 8.0

Adversarial smuggling attacks encode harmful content into human-readable visuals that evade MLLM detection, achieving over 90% attack success rates on models like GPT-5 and Qwen3-VL via the new SmuggleBench benchmark.

citing papers explorer

Showing 1 of 1 citing paper.

Making MLLMs Blind: Adversarial Smuggling Attacks in MLLM Content Moderation

cs.CV · 2026-04-08 · unverdicted · none · ref 7
