pith:AD4OU3S5
RLAIF vs. RLHF: Scaling Reinforcement Learning from Human Feedback with AI Feedback
Reinforcement learning from AI feedback matches human feedback performance for aligning large language models.
arxiv:2309.00267 v3 · 2023-09-01 · cs.CL · cs.AI · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{AD4OU3S57S5Y2GK4FCGKHPM5GD}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Claims
Across the tasks of summarization, helpful dialogue generation, and harmless dialogue generation, we show that RLAIF achieves comparable performance to RLHF. ... we introduce direct-RLAIF (d-RLAIF) ... which achieves superior performance to canonical RLAIF.
That the preferences generated by an off-the-shelf LLM are high-quality enough to serve as a substitute for human preferences in training the reward model.
RLAIF matches RLHF on summarization and dialogue tasks, with a direct-RLAIF variant achieving superior results by using LLM rewards directly during training.
Receipt and verification
| Field | Value |
|---|---|
| First computed | 2026-05-17T23:38:50.098142Z |
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
00f8ea6e5dfcbb8d195c288ca3bd9d30ccec365083e1091ebe19ac2b0a61252f
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/AD4OU3S57S5Y2GK4FCGKHPM5GD \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 00f8ea6e5dfcbb8d195c288ca3bd9d30ccec365083e1091ebe19ac2b0a61252f
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "e06fbc5dabdd57615fbf708dac24235e084d3e237bd3abd328f2bc19edfd90ee",
    "cross_cats_sorted": [
      "cs.AI",
      "cs.LG"
    ],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.CL",
    "submitted_at": "2023-09-01T05:53:33Z",
    "title_canon_sha256": "30a71bef573f4df6e33a747f2c8824790bde2c0d53c2ef6779f1df973bc3eb36"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2309.00267",
    "kind": "arxiv",
    "version": 3
  }
}
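The verification command can also be reproduced offline against the record printed above. The following is a minimal sketch, assuming the JSON shown on this page is byte-for-byte the same `canonical_record` that the endpoint returns; the helper names `canonical_bytes` and `canonical_sha256` are illustrative and not part of any Pith tooling. It applies the same serialization as the one-liner (sorted keys, compact separators, no ASCII escaping, UTF-8) before hashing.

```python
import hashlib
import json

# Record copied from the "Canonical record JSON" section of this page.
RECORD_JSON = """
{
  "metadata": {
    "abstract_canon_sha256": "e06fbc5dabdd57615fbf708dac24235e084d3e237bd3abd328f2bc19edfd90ee",
    "cross_cats_sorted": ["cs.AI", "cs.LG"],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.CL",
    "submitted_at": "2023-09-01T05:53:33Z",
    "title_canon_sha256": "30a71bef573f4df6e33a747f2c8824790bde2c0d53c2ef6779f1df973bc3eb36"
  },
  "schema_version": "1.0",
  "source": {"id": "2309.00267", "kind": "arxiv", "version": 3}
}
"""


def canonical_bytes(record: dict) -> bytes:
    """Serialize with sorted keys and compact separators, then encode as UTF-8.

    Mirrors the json.dumps arguments used in the verification one-liner.
    """
    text = json.dumps(
        record, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    )
    return text.encode("utf-8")


def canonical_sha256(record: dict) -> str:
    """Return the hex SHA-256 digest of the canonical serialization."""
    return hashlib.sha256(canonical_bytes(record)).hexdigest()


record = json.loads(RECORD_JSON)
digest = canonical_sha256(record)
print(digest)  # compare by eye with the "Canonical hash" value on this page
```

Because keys are sorted before hashing, the digest does not depend on the key order or whitespace of the input JSON, only on its contents. If the printed digest differs from the canonical hash, the record on the page may not match the served `canonical_record` exactly.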