AI alignment should aim for fair principles that can win reflective endorsement despite variation in people's moral beliefs, rather than try to identify the true moral principles. A principle-based approach of this kind can combine different elements of alignment.
1 Pith paper cites this work. Polarity classification is still indexing.
Fields: cs.CY (1)
Years: 2020 (1)
Verdicts: ACCEPT (1)
Representative citing papers:
Artificial Intelligence, Values and Alignment