pith:KATRKKFL
From Feedback Loops to Policy Updates: Reinforcement Fine-Tuning for LLM-Based Alpha Factor Discovery
Reinforcement fine-tuning converts quantitative evaluations into policy updates so an LLM internalizes alpha factor optimization experience instead of accumulating prompt feedback.
arxiv:2605.15412 v1 · 2026-05-14 · cs.CE · cs.AI · cs.CL
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{KATRKKFLRXSSKV4IGUQUS2OXEN}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
QuantEvolver consistently improves the primary evaluation metric of each task over existing LLM-based alpha factor discovery baselines, produces higher-quality and more complementary factor pools.
That converting executable quantitative evaluation results into reinforcement policy updates allows the Miner LLM to internalize historical optimization experience without introducing new biases or failing to generalize beyond the regime backtests used during training.
QuantEvolver applies reinforcement fine-tuning to evolve an LLM policy for generating executable alpha factor expressions, yielding higher-quality and more complementary factors than prompt-based baselines on market benchmarks.
References
Formal links
Receipt and verification
| First computed | 2026-05-20T00:00:57.298359Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
50271528ab8de525578835214969d7237388e4425fe23cc0362480fae7afa191
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/KATRKKFLRXSSKV4IGUQUS2OXEN \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 50271528ab8de525578835214969d7237388e4425fe23cc0362480fae7afa191
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "0050df30ec955b63eaddfad66649b248143a5f57d70eadcc71fe842a910bd4a3",
"cross_cats_sorted": [
"cs.AI",
"cs.CL"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CE",
"submitted_at": "2026-05-14T20:54:40Z",
"title_canon_sha256": "d1e0186de0a1193da710b18dcf521f68e6ae266dea061730c86d009283934046"
},
"schema_version": "1.0",
"source": {
"id": "2605.15412",
"kind": "arxiv",
"version": 1
}
}