RedVox benchmark shows speech model safety and fairness vulnerabilities persist under non-adversarial conditions, worsen in non-English languages, and increase with spoken inputs.
SafetyPrompts: a systematic review of open datasets for evaluating and improving large language model safety , year =
3 Pith papers cite this work. Polarity classification is still indexing.
3
Pith papers citing it
years
2026 3verdicts
UNVERDICTED 3representative citing papers
Introduces Situated Interaction Auditing (SIA) to examine how user sociodemographic signals affect LLM response quality, content, and tone in personal interactions.
State-of-the-art LLMs respond inconsistently to queries from protected-group personas, with some responses omitting key information that should be provided.
citing papers explorer
-
Beyond Third-Person Audits: Situated Interaction Auditing for User-Centered LLM Bias Research
Introduces Situated Interaction Auditing (SIA) to examine how user sociodemographic signals affect LLM response quality, content, and tone in personal interactions.
-
Discriminatory Compliance: How LLMs Answer Queries from Protected Groups
State-of-the-art LLMs respond inconsistently to queries from protected-group personas, with some responses omitting key information that should be provided.