pith. sign in

arxiv: 1904.03310 · v1 · pith:H6QVQCUWnew · submitted 2019-04-05 · 💻 cs.CL

Gender Bias in Contextualized Word Embeddings

classification 💻 cs.CL
keywords biaselmogendercontextualizedembeddingsentitiesfemaleinformation
0
0 comments X
read the original abstract

In this paper, we quantify, analyze and mitigate gender bias exhibited in ELMo's contextualized word vectors. First, we conduct several intrinsic analyses and find that (1) training data for ELMo contains significantly more male than female entities, (2) the trained ELMo embeddings systematically encode gender information and (3) ELMo unequally encodes gender information about male and female entities. Then, we show that a state-of-the-art coreference system that depends on ELMo inherits its bias and demonstrates significant bias on the WinoBias probing corpus. Finally, we explore two methods to mitigate such gender bias and show that the bias demonstrated on WinoBias can be eliminated.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Good Secretaries, Bad Truck Drivers? Occupational Gender Stereotypes in Sentiment Analysis

    cs.CL 2019-06 unverdicted novelty 6.0

    Authors release a new 800-sentence gender-balanced profession dataset and use it to test occupational gender stereotypes in three sentiment analysis models.

  2. Bias in Large Language Models: Origin, Evaluation, and Mitigation

    cs.CL 2024-11 unverdicted novelty 2.0

    A literature review that categorizes bias in LLMs, surveys evaluation and mitigation techniques, and discusses ethical implications.