J ob F air: A Framework for Benchmarking Gender Hiring Bias in Large Language Models

Wang, Ze, Wu, Zekun, Guan, Xin, Thaler, Michael, Koshiyama, Adriano, Lu, Skylar · 2024 · DOI 10.18653/v1/2024.findings-emnlp.184

3 Pith papers cite this work. Polarity classification is still indexing.

3 Pith papers citing it

open at publisher browse 3 citing papers

representative citing papers

StylisticBias: A Few Human Visual Cues Drive Most Social Biases in MLLMs

cs.CL · 2026-06-18 · unverdicted · novelty 7.0

StylisticBias benchmark shows 15 visual attributes explain nearly 80% of bias variation in six MLLMs by isolating single cues like age and fashion in generated images.

Decisive: Guiding User Decisions with Optimal Preference Elicitation from Unstructured Documents

cs.CL · 2026-04-20 · unverdicted · novelty 6.0

Decisive combines document-grounded option scoring with adaptive Bayesian preference elicitation to achieve up to 20% higher decision accuracy than LLMs and existing frameworks across domains.

Topics as Proxies for Sociodemographics: How Conversational Context Affects LLM Answers

cs.CL · 2026-06-01 · unverdicted · novelty 4.0

LLMs show minimal sociodemographic disparities in advice because they infer user demographics poorly from history; conversation topics are the main predictor and act as proxies for groups.

citing papers explorer

Showing 3 of 3 citing papers.

StylisticBias: A Few Human Visual Cues Drive Most Social Biases in MLLMs cs.CL · 2026-06-18 · unverdicted · none · ref 12
StylisticBias benchmark shows 15 visual attributes explain nearly 80% of bias variation in six MLLMs by isolating single cues like age and fashion in generated images.
Decisive: Guiding User Decisions with Optimal Preference Elicitation from Unstructured Documents cs.CL · 2026-04-20 · unverdicted · none · ref 41
Decisive combines document-grounded option scoring with adaptive Bayesian preference elicitation to achieve up to 20% higher decision accuracy than LLMs and existing frameworks across domains.
Topics as Proxies for Sociodemographics: How Conversational Context Affects LLM Answers cs.CL · 2026-06-01 · unverdicted · none · ref 39
LLMs show minimal sociodemographic disparities in advice because they infer user demographics poorly from history; conversation topics are the main predictor and act as proxies for groups.

J ob F air: A Framework for Benchmarking Gender Hiring Bias in Large Language Models

fields

years

verdicts

representative citing papers

citing papers explorer