arXiv preprint arXiv:2509.25896 , year=

LLaVAShield: Safeguarding Multimodal Multi-Turn Dialogues in Vision-Language Models , author= · arXiv 2509.25896

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

representative citing papers

cs.CR · 2026-06-04 · unverdicted · novelty 6.0

RedEdit finds that fewer than two photo edits on average let 76.2% of unsafe images evade detectors while retaining 93.0% of malicious semantics.

Showing 1 of 1 citing paper.

RedEdit: Agentic Red-Teaming of Image Safety Classifiers via MCTS-Guided Photo-Editing cs.CR · 2026-06-04 · unverdicted · none · ref 20
RedEdit finds that fewer than two photo edits on average let 76.2% of unsafe images evade detectors while retaining 93.0% of malicious semantics.