pith. sign in

Visualpuzzles: Decoupling multimodal reasoning evaluation from domain knowledge

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

citation-role summary

background 2 dataset 1

citation-polarity summary

years

2026 6

representative citing papers

SALLIE: Safeguarding Against Latent Language & Image Exploits

cs.CR · 2026-04-06 · unverdicted · novelty 5.0

SALLIE detects jailbreaks in text and vision-language models by extracting residual stream activations, scoring maliciousness per layer with k-NN, and ensembling predictions, outperforming baselines on multiple datasets.

citing papers explorer

Showing 6 of 6 citing papers.