hub Mixed citations

Salad-bench: A hierarchical and comprehensive safety benchmark for large language models

Mixed citation behavior. Most common role is background (40%).

13 Pith papers citing it
Background 40% of classified citations

citation-role summary

background 2 · dataset 2 · other 1

representative citing papers

ShieldGemma: Generative AI Content Moderation Based on Gemma

cs.CL · 2024-07-31 · unverdicted · novelty 4.0

ShieldGemma delivers a family of Gemma2-based classifiers that outperform Llama Guard and WildCard on public safety benchmarks while introducing a synthetic-data curation pipeline for safety tasks.
