pith. sign in

arxiv: 1906.04760 · v1 · pith:JIF4U53Hnew · submitted 2019-06-11 · 💻 cs.CL

Unsupervised Discovery of Gendered Language through Latent-Variable Modeling

classification 💻 cs.CL
keywords describelanguageusedfemalemaleadjectivesdifferencesdifferent
0
0 comments X
read the original abstract

Studying the ways in which language is gendered has long been an area of interest in sociolinguistics. Studies have explored, for example, the speech of male and female characters in film and the language used to describe male and female politicians. In this paper, we aim not to merely study this phenomenon qualitatively, but instead to quantify the degree to which the language used to describe men and women is different and, moreover, different in a positive or negative way. To that end, we introduce a generative latent-variable model that jointly represents adjective (or verb) choice, with its sentiment, given the natural gender of a head (or dependent) noun. We find that there are significant differences between descriptions of male and female nouns and that these differences align with common gender stereotypes: Positive adjectives used to describe women are more often related to their bodies than adjectives used to describe men.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Speak Your Mind: The Speech Continuation Task as a Probe of Voice-Based Model Bias

    eess.AS 2025-09 unverdicted novelty 7.0

    The authors perform the first systematic bias evaluation in speech continuation tasks across three models, revealing gender interactions in text metrics and stronger reversion to modal phonation for female prompts.