SafetyPrompts: a systematic review of open datasets for evaluating and improving large language model safety , year =

· 2025 · DOI 10.1609/aaai.v39i26.34975

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

representative citing papers

RedVox: Safety and Fairness Gaps in Speech Models Across Languages

cs.CL · 2026-06-25 · unverdicted · novelty 7.0

RedVox benchmark shows speech model safety and fairness vulnerabilities persist under non-adversarial conditions, worsen in non-English languages, and increase with spoken inputs.

Beyond Third-Person Audits: Situated Interaction Auditing for User-Centered LLM Bias Research

cs.CY · 2026-06-10 · unverdicted · novelty 7.0

Introduces Situated Interaction Auditing (SIA) to examine how user sociodemographic signals affect LLM response quality, content, and tone in personal interactions.

Discriminatory Compliance: How LLMs Answer Queries from Protected Groups

cs.CY · 2026-06-19 · unverdicted · novelty 4.0

State-of-the-art LLMs respond inconsistently to queries from protected-group personas, with some responses omitting key information that should be provided.

citing papers explorer

Showing 1 of 1 citing paper after filters.

RedVox: Safety and Fairness Gaps in Speech Models Across Languages cs.CL · 2026-06-25 · unverdicted · none · ref 146
RedVox benchmark shows speech model safety and fairness vulnerabilities persist under non-adversarial conditions, worsen in non-English languages, and increase with spoken inputs.

SafetyPrompts: a systematic review of open datasets for evaluating and improving large language model safety , year =

fields

years

verdicts

representative citing papers

citing papers explorer