arXiv preprint arXiv:2507.02799 , year=

Is reasoning all you need? probing bias in the age of reasoning language models · arXiv 2507.02799

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

representative citing papers

Calibrated? Not for Everyone: How Sexual Orientation and Religious Markers Distort LLM Accuracy and Confidence in Medical QA

cs.CL · 2026-04-19 · unverdicted · novelty 6.0

Social identity markers in medical questions degrade LLM accuracy and uncertainty calibration, producing a calibration crisis that is non-additive for intersectional cases.

Investigating Thinking Behaviours of Reasoning-Based Language Models for Social Bias Mitigation

cs.CL · 2025-10-20 · unverdicted · novelty 5.0

Reasoning LLMs aggregate social biases through stereotype repetition and irrelevant information injection in their thinking processes, and a self-review prompt mitigates this on BBQ, StereoSet, and BOLD benchmarks.

citing papers explorer

Showing 2 of 2 citing papers.

Calibrated? Not for Everyone: How Sexual Orientation and Religious Markers Distort LLM Accuracy and Confidence in Medical QA cs.CL · 2026-04-19 · unverdicted · none · ref 1
Social identity markers in medical questions degrade LLM accuracy and uncertainty calibration, producing a calibration crisis that is non-additive for intersectional cases.
Investigating Thinking Behaviours of Reasoning-Based Language Models for Social Bias Mitigation cs.CL · 2025-10-20 · unverdicted · none · ref 3
Reasoning LLMs aggregate social biases through stereotype repetition and irrelevant information injection in their thinking processes, and a self-review prompt mitigates this on BBQ, StereoSet, and BOLD benchmarks.

arXiv preprint arXiv:2507.02799 , year=

fields

years

verdicts

representative citing papers

citing papers explorer