Gender Bias in Contextualized Word Embeddings

Jieyu Zhao; Kai-Wei Chang; Mark Yatskar; Ryan Cotterell; Tianlu Wang; Vicente Ordonez

arxiv: 1904.03310 · v1 · pith:H6QVQCUWnew · submitted 2019-04-05 · 💻 cs.CL

Gender Bias in Contextualized Word Embeddings

Jieyu Zhao , Tianlu Wang , Mark Yatskar , Ryan Cotterell , Vicente Ordonez , Kai-Wei Chang This is my paper

classification 💻 cs.CL

keywords biaselmogendercontextualizedembeddingsentitiesfemaleinformation

0 comments

read the original abstract

In this paper, we quantify, analyze and mitigate gender bias exhibited in ELMo's contextualized word vectors. First, we conduct several intrinsic analyses and find that (1) training data for ELMo contains significantly more male than female entities, (2) the trained ELMo embeddings systematically encode gender information and (3) ELMo unequally encodes gender information about male and female entities. Then, we show that a state-of-the-art coreference system that depends on ELMo inherits its bias and demonstrates significant bias on the WinoBias probing corpus. Finally, we explore two methods to mitigate such gender bias and show that the bias demonstrated on WinoBias can be eliminated.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Bias in Large Language Models: Origin, Evaluation, and Mitigation
cs.CL 2024-11 unverdicted novelty 2.0

A literature review that categorizes bias in LLMs, surveys evaluation and mitigation techniques, and discusses ethical implications.