Pipelines for Social Bias Testing of Large Language Models

Nozza, Debora, Bianchi, Federico, Hovy, Dirk · 2022 · DOI 10.18653/v1/2022.bigscience-1.6

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open at publisher browse 2 citing papers

representative citing papers

RedVox: Safety and Fairness Gaps in Speech Models Across Languages

cs.CL · 2026-06-25 · unverdicted · novelty 7.0

RedVox benchmark shows speech model safety and fairness vulnerabilities persist under non-adversarial conditions, worsen in non-English languages, and increase with spoken inputs.

Can Reasoning Models Detect Changes to their Chains of Thought?

cs.AI · 2026-06-20 · unverdicted · novelty 5.0

Reasoning models detect modifications to their chains of thought with only modest accuracy and cannot reliably identify the nature of those modifications.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Can Reasoning Models Detect Changes to their Chains of Thought? cs.AI · 2026-06-20 · unverdicted · none · ref 51
Reasoning models detect modifications to their chains of thought with only modest accuracy and cannot reliably identify the nature of those modifications.

Pipelines for Social Bias Testing of Large Language Models

fields

years

verdicts

representative citing papers

citing papers explorer