DeepSeek also shows high acceptance (4.4) with balanced sensitivity, while Claude is notably more restrictive (3.0)

Affiliative humoris widely accepted, with GPT-4o, Grok, Gemini scoring near the maximum (4

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Investigating Counterfactual Unfairness in LLMs towards Identities through Humor

cs.CL · 2026-04-20 · unverdicted · novelty 5.0

LLMs refuse jokes from privileged speakers up to 67.5% more often, judge them malicious 64.7% more, and rate them up to 1.5 points higher in social harm.

citing papers explorer

Showing 1 of 1 citing paper.

Investigating Counterfactual Unfairness in LLMs towards Identities through Humor cs.CL · 2026-04-20 · unverdicted · none · ref 18
LLMs refuse jokes from privileged speakers up to 67.5% more often, judge them malicious 64.7% more, and rate them up to 1.5 points higher in social harm.

DeepSeek also shows high acceptance (4.4) with balanced sensitivity, while Claude is notably more restrictive (3.0)

fields

years

verdicts

representative citing papers

citing papers explorer