pith:23IZB7K2
Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback
RLHF, the dominant method for aligning large language models with human goals, carries fundamental limitations that incremental fixes cannot fully resolve.
arxiv:2307.15217 v2 · 2023-07-27 · cs.AI · cs.CL · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{23IZB7K2H2STHS5OGU3SX45Y43}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
RLHF has emerged as the central method used to finetune state-of-the-art large language models but has fundamental limitations, emphasizing the importance of a multi-faceted approach to the development of safer AI systems.
That the identified open problems represent fundamental limitations of RLHF rather than challenges that can be resolved through incremental improvements or better implementation.
RLHF has significant open problems and fundamental limitations that require a multi-faceted approach for safer AI development.
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:39:21.418541Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
d6d190fd5a3ea533cbae35372bf3b8e6ddf630437eb272f01b14c5a437f010e0
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/23IZB7K2H2STHS5OGU3SX45Y43 \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: d6d190fd5a3ea533cbae35372bf3b8e6ddf630437eb272f01b14c5a437f010e0
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "b2d5a1ffd2f126b3687464bdd54878590cc00c30071a32d643bc8ba98db2121f",
"cross_cats_sorted": [
"cs.CL",
"cs.LG"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.AI",
"submitted_at": "2023-07-27T22:29:25Z",
"title_canon_sha256": "f64b8e348d873883aaed9189c75a52a859f0648b3f54a9627b55a100e94a8e16"
},
"schema_version": "1.0",
"source": {
"id": "2307.15217",
"kind": "arxiv",
"version": 2
}
}