pith:RFZR3QBS
AI Alignment: A Comprehensive Survey
AI alignment research can be structured around four principles and split into forward training versus backward assurance.
arxiv:2310.19852 v6 · 2023-10-30 · cs.AI
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{RFZR3QBSBPJSZTMJWL5BRKQE7Z}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Claims
We identify four principles as the key objectives of AI alignment: Robustness, Interpretability, Controllability, and Ethicality (RICE). Guided by these four principles, we outline the landscape of current alignment research and decompose them into two key components: forward alignment and backward alignment.
This framing rests on the premise that the four RICE principles comprehensively capture the essential objectives of AI alignment, and that the forward/backward decomposition provides a useful, largely non-overlapping categorization of the existing literature.
The paper surveys AI alignment by proposing the RICE principles and sorting research into two groups: forward alignment (training-based approaches) and backward alignment (assurance and governance approaches).
Receipt and verification
| Field | Value |
|---|---|
| First computed | 2026-05-17T23:38:13.826924Z |
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
89731dc0320bd32ccd89b2fa18aa04fe7d12472eb6034ce52229d7b01d876ede
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/RFZR3QBSBPJSZTMJWL5BRKQE7Z \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 89731dc0320bd32ccd89b2fa18aa04fe7d12472eb6034ce52229d7b01d876ede
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "71a187f0c4ce7bfd2c1937bf3c1f3edbe4d30a000688c961a7f23470d31deaa0",
    "cross_cats_sorted": [],
    "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
    "primary_cat": "cs.AI",
    "submitted_at": "2023-10-30T15:52:15Z",
    "title_canon_sha256": "cdaea1ef56d9f162173f93e128895eccd8b87d13027ab82760ec2de0f01c0492"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2310.19852",
    "kind": "arxiv",
    "version": 6
  }
}
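The hashing step of the verification command can also be run offline against the canonical record shown above. The following is a minimal sketch, not the official Pith tooling: the names `CANONICAL_RECORD`, `EXPECTED_HASH`, `canonical_bytes`, and `canonical_hash` are hypothetical, and the record literal is assumed to be an exact copy of what the API serves. It applies the same serialization as the one-liner (sorted keys, compact separators, `ensure_ascii=False`, UTF-8 encoding) and then computes the SHA-256 digest.

```python
import hashlib
import json

# Assumption: this literal is an exact copy of the "Canonical record JSON" above.
CANONICAL_RECORD = {
    "metadata": {
        "abstract_canon_sha256": "71a187f0c4ce7bfd2c1937bf3c1f3edbe4d30a000688c961a7f23470d31deaa0",
        "cross_cats_sorted": [],
        "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
        "primary_cat": "cs.AI",
        "submitted_at": "2023-10-30T15:52:15Z",
        "title_canon_sha256": "cdaea1ef56d9f162173f93e128895eccd8b87d13027ab82760ec2de0f01c0492",
    },
    "schema_version": "1.0",
    "source": {"id": "2310.19852", "kind": "arxiv", "version": 6},
}

# Value listed under "Canonical hash" on this page.
EXPECTED_HASH = "89731dc0320bd32ccd89b2fa18aa04fe7d12472eb6034ce52229d7b01d876ede"


def canonical_bytes(record: dict) -> bytes:
    """Serialize with sorted keys and no whitespace, as in the one-liner above."""
    text = json.dumps(
        record, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    )
    return text.encode("utf-8")


def canonical_hash(record: dict) -> str:
    """Return the hex SHA-256 digest of the canonical serialization."""
    return hashlib.sha256(canonical_bytes(record)).hexdigest()


if __name__ == "__main__":
    digest = canonical_hash(CANONICAL_RECORD)
    print("computed:", digest)
    print("expected:", EXPECTED_HASH)
    print("match" if digest == EXPECTED_HASH else "mismatch")
```

Because keys are sorted before hashing, the digest does not depend on the order in which the fields appear in the source JSON; only the field names and values matter. A mismatch would suggest that the local copy differs from the served record, or that the serialization rules assumed here differ from the builder's.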