For Justice, Deontology, and Commonsense Morality, we had 5 MTurkers relabel each example, and we kept examples for which at least 4 out of the 5 agreed.
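
The agreement filter described above can be sketched as follows. This is a minimal illustration, not the authors' code: it assumes "agreed" means that at least 4 of the 5 relabelers chose the same label as each other, and the function name `keep_by_agreement`, the `(text, labels)` layout, and the sample scenarios are all hypothetical.

```python
from collections import Counter


def keep_by_agreement(examples, min_agree=4):
    """Keep examples whose most common label received at least min_agree votes.

    `examples` is a list of (text, labels) pairs, where `labels` holds one
    label per annotator (hypothetical layout, assumed for illustration).
    """
    kept = []
    for text, labels in examples:
        # most_common(1) returns [(label, vote_count)] for the majority label.
        label, votes = Counter(labels).most_common(1)[0]
        if votes >= min_agree:
            kept.append((text, label))
    return kept


# Made-up relabeling results: 5 annotator labels per example.
examples = [
    ("scenario A", [1, 1, 1, 1, 0]),  # 4 of 5 agree: kept
    ("scenario B", [1, 0, 1, 0, 0]),  # only 3 of 5 agree: dropped
    ("scenario C", [0, 0, 0, 0, 0]),  # unanimous: kept
]
kept = keep_by_agreement(examples)
```

If agreement were instead measured against the original label, the comparison would count votes matching that label rather than the majority label.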

Published as a conference paper at ICLR 2021

Aligning AI With Shared Human Values cs.CY · 2020-08-05 · conditional · none · ref 26
Introduces ETHICS benchmark showing current language models have promising but incomplete ability to predict basic human ethical judgments on text scenarios.
