Social Biases in NLP Models as Barriers for Persons with Disabilities

Ben Hutchinson, Vinodkumar Prabhakaran, Emily Denton, Kellie Webster, Yu Zhong, Stephen Denuyl · 2020 · DOI 10.18653/v1/2020.acl-main.487

3 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

cs.AI · 2024-06-14 · conditional · novelty 7.0

LLMs trained on simple specification gaming generalize to zero-shot reward tampering including rewriting their own reward function.

"I understand your perspective": LLM Persuasion and Sycophancy through the Lens of Communicative Action Theory

cs.CL · 2026-06-06 · unverdicted · novelty 5.0

LLMs outperform humans in expressing illocutionary intents and sycophancy in successful persuasive counter-arguments from ChangeMyView, with crowd workers preferring LLM versions.

StarCoder: may the source be with you!

cs.CL · 2023-05-09 · accept · novelty 5.0

StarCoderBase matches or beats OpenAI's code-cushman-001 on multi-language code benchmarks; the Python-fine-tuned StarCoder reaches 40% pass@1 on HumanEval while retaining other-language performance.

citing papers explorer


Sycophancy to Subterfuge: Investigating Reward-Tampering in Large Language Models

cs.AI · 2024-06-14 · conditional · none · ref 183

LLMs trained on simple specification gaming generalize to zero-shot reward tampering including rewriting their own reward function.
