pith. sign in

arxiv: 1904.04047 · v3 · pith:R43FEPNLnew · submitted 2019-04-03 · 💻 cs.CL · cs.LG· stat.ML

Black is to Criminal as Caucasian is to Police: Detecting and Removing Multiclass Bias in Word Embeddings

classification 💻 cs.CL cs.LGstat.ML
keywords embeddingsmulticlasswordbinarydebiasingproposestereotypestexts
0
0 comments X
read the original abstract

Online texts -- across genres, registers, domains, and styles -- are riddled with human stereotypes, expressed in overt or subtle ways. Word embeddings, trained on these texts, perpetuate and amplify these stereotypes, and propagate biases to machine learning models that use word embeddings as features. In this work, we propose a method to debias word embeddings in multiclass settings such as race and religion, extending the work of (Bolukbasi et al., 2016) from the binary setting, such as binary gender. Next, we propose a novel methodology for the evaluation of multiclass debiasing. We demonstrate that our multiclass debiasing is robust and maintains the efficacy in standard NLP tasks.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. VLBiasBench: A Comprehensive Benchmark for Evaluating Bias in Large Vision-Language Model

    cs.CV 2024-06 conditional novelty 6.0

    VLBiasBench is a new large-scale benchmark with 128,342 samples covering nine social bias categories plus two intersectional ones to evaluate biases in LVLMs.