Personal Universes: A Solution to the Multi-Agent Value Alignment Problem

Roman V. Yampolskiy

Personal Universes: A Solution to the Multi-Agent Value Alignment Problem

Not yet reviewed by Pith; the record is open.

Re-run · record.json Download PDF Read on arXiv ↗

This paper has not been read by Pith yet. Machine review is queued; the pith claim, tier, and objections will appear here once it completes.

SPECIMEN: schema-true, not a live event

T0 review · schema-true

One-sentence machine reading of the paper's core claim.

pith:XXXXXXXX · record.json · timestamp

arxiv 1901.01851 v1 pith:B3I7GJYX submitted 2019-01-01 cs.AI

Personal Universes: A Solution to the Multi-Agent Value Alignment Problem

Roman V. Yampolskiy This is my paper

classification cs.AI

keywords valuealignmentextractionmergermulti-agentpersonalpreferencesproblem

verification ladder T0 review T1 audit T2 compute T3 formal T4 reserved

0 comments

read the original abstract

AI Safety researchers attempting to align values of highly capable intelligent systems with those of humanity face a number of challenges including personal value extraction, multi-agent value merger and finally in-silico encoding. State-of-the-art research in value alignment shows difficulties in every stage in this process, but merger of incompatible preferences is a particularly difficult challenge to overcome. In this paper we assume that the value extraction problem will be solved and propose a possible way to implement an AI solution which optimally aligns with individual preferences of each user. We conclude by analyzing benefits and limitations of the proposed approach.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Unexplainability and Incomprehensibility of Artificial Intelligence
cs.CY 2019-06 unverdicted novelty 3.0

Advanced AI systems are unexplainable in full and produce explanations that humans cannot comprehend.