Antonios Kalloniatis and Panagiotis Adamidis

Kamruzzaman, M · 2023 · arXiv 2309.08902

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Investigating Counterfactual Unfairness in LLMs towards Identities through Humor

cs.CL · 2026-04-20 · unverdicted · novelty 5.0

LLMs refuse jokes from privileged speakers up to 67.5% more often, judge them malicious 64.7% more, and rate them up to 1.5 points higher in social harm.

A closer look at how large language models trust humans: patterns and biases

cs.CL · 2025-04-22 · unverdicted · novelty 5.0

Across 43,200 simulations with five LLMs and five scenarios, model trust in humans aligns with human-like patterns driven by trustworthiness dimensions and is sometimes biased by age, gender, and religion.

citing papers explorer

Showing 2 of 2 citing papers.

Investigating Counterfactual Unfairness in LLMs towards Identities through Humor cs.CL · 2026-04-20 · unverdicted · none · ref 1
LLMs refuse jokes from privileged speakers up to 67.5% more often, judge them malicious 64.7% more, and rate them up to 1.5 points higher in social harm.
A closer look at how large language models trust humans: patterns and biases cs.CL · 2025-04-22 · unverdicted · none · ref 34
Across 43,200 simulations with five LLMs and five scenarios, model trust in humans aligns with human-like patterns driven by trustworthiness dimensions and is sometimes biased by age, gender, and religion.

Antonios Kalloniatis and Panagiotis Adamidis

fields

years

verdicts

representative citing papers

citing papers explorer