Title resolution pending

Zhao, Wei, Li, Zhe, Li, Yige, Zhang, Ye, Sun, Jun , editor = · 2024 · DOI 10.18653/v1/2024.findings-emnlp.293

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

Title metadata for this work has not finished resolving. The hub is built from the citation graph; the title resolver retries DOI and OpenAlex on its next pass.

representative citing papers

PRISM: Recovering Instruction Sets from Language Model Activations

cs.AI · 2026-06-08 · unverdicted · novelty 7.0

PRISM is a new activation-conditioned model that recovers full sets of simultaneous instructions from LLM hidden states via judge-guided GRPO training and outperforms prior activation-to-language methods on security-relevant tasks.

Hard to Read, Easy to Jailbreak: How Visual Degradation Bypasses MLLM Safety Alignment

cs.CV · 2026-05-08 · conditional · novelty 6.0

Degraded image resolution in MLLMs bypasses safety alignments via cognitive overload, raising jailbreak rates across perturbations.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Hard to Read, Easy to Jailbreak: How Visual Degradation Bypasses MLLM Safety Alignment cs.CV · 2026-05-08 · conditional · none · ref 58
Degraded image resolution in MLLMs bypasses safety alignments via cognitive overload, raising jailbreak rates across perturbations.

Title resolution pending

fields

years

verdicts

representative citing papers

citing papers explorer