Pipelines for Social Bias Testing of Large Language Models

Nozza, Debora, Bianchi, Federico, Hovy, Dirk · 2022 · DOI 10.18653/v1/2022.bigscience-1.6

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

RedVox: Safety and Fairness Gaps in Speech Models Across Languages

cs.CL · 2026-06-25 · unverdicted · novelty 7.0

RedVox benchmark shows speech model safety and fairness vulnerabilities persist under non-adversarial conditions, worsen in non-English languages, and increase with spoken inputs.

Can Reasoning Models Detect Changes to their Chains of Thought?

cs.AI · 2026-06-20 · unverdicted · novelty 5.0

Reasoning models detect modifications to their chains of thought with only modest accuracy and cannot reliably identify the nature of those modifications.

citing papers explorer

Showing 1 of 1 citing paper after filters.

RedVox: Safety and Fairness Gaps in Speech Models Across Languages cs.CL · 2026-06-25 · unverdicted · none · ref 137
RedVox benchmark shows speech model safety and fairness vulnerabilities persist under non-adversarial conditions, worsen in non-English languages, and increase with spoken inputs.

Pipelines for Social Bias Testing of Large Language Models

fields

years

verdicts

representative citing papers

citing papers explorer