Wildteaming at scale: From in-the-wild jailbreaks to (adversarially) safer language models. Advances in Neural Information Processing Systems, 37:47094–47165

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

fields

cs.AI 2 cs.LG 2

years

2026 4

verdicts

UNVERDICTED 4

representative citing papers

Bayesian Model Merging

cs.LG · 2026-05-13 · unverdicted · novelty 6.0

Bayesian Model Merging introduces a bi-level optimization framework that merges task-specific models via closed-form Bayesian regression with an anchor prior and global hyperparameter search, outperforming baselines and nearly matching expert averages on up to 20-task vision and 5-task language Merg

citing papers explorer

Showing 4 of 4 citing papers.