hub Mixed citations

arXiv preprint arXiv:2404.02151 (2024)

Mixed citation behavior. Most common role is background (62%).

28 Pith papers citing it
Background 62% of classified citations

citation-role summary

background: 5 · baseline: 1 · dataset: 1 · method: 1

representative citing papers

GuardPhish: Securing Open-Source LLMs from Phishing Abuse

cs.CR · 2026-04-19 · unverdicted · novelty 7.0

Open-source LLMs detect phishing intent at high rates but still generate actionable phishing content, and GuardPhish supplies a dataset plus modular classifiers to close the gap.

SALLIE: Safeguarding Against Latent Language & Image Exploits

cs.CR · 2026-04-06 · unverdicted · novelty 5.0

SALLIE detects jailbreaks in text and vision-language models by extracting residual stream activations, scoring maliciousness per layer with k-NN, and ensembling predictions, outperforming baselines on multiple datasets.
