pith. machine review for the scientific record.
sign in
Pith Number

pith:LWH4ADZ4

pith:2025:LWH4ADZ4MWN3KIIC5FXC6ZKBCM
not attested not anchored not stored refs pending

A Survey of Reinforcement Learning for Large Reasoning Models

Bingxiang He, Biqing Qi, Bowen Zhou, Che Jiang, Dong Li, Ermo Hua, Fangfu Liu, Ganqu Cui, Guoli Jia, Haozhan Li, Huayu Chen, Jiaze Ma, Junqi Gao, Kai Tian, Kaiyan Zhang, Ning Ding, Pengfei Li, Runze Liu, Shang Qu, Shijie Wang, Sihang Zeng, Weize Chen, Xiang Xu, Xiaoye Qu, Xingtai Lv, Xinwei Long, Xuekai Zhu, Yafu Li, Yihao Liu, Youbang Sun, Yuchen Fan, Yuchen Zhang, Yu Fu, Yuru Wang, Yuxin Zuo, Zhenzhao Yuan, Zhiyuan Liu, Zhiyuan Ma, Zonglin Li

arxiv:2509.08827 v3 · 2025-09-10 · cs.CL · cs.AI · cs.LG

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Cited by

21 papers in Pith

Receipt and verification
First computed 2026-05-17T23:57:53.181737Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

5d8fc00f3c659bb52102e96e2f6541132aad902fe978735a8188def52dde9f83

Aliases

arxiv: 2509.08827 · arxiv_version: 2509.08827v3 · doi: 10.48550/arxiv.2509.08827 · pith_short_12: LWH4ADZ4MWN3 · pith_short_16: LWH4ADZ4MWN3KIIC · pith_short_8: LWH4ADZ4
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/LWH4ADZ4MWN3KIIC5FXC6ZKBCM \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 5d8fc00f3c659bb52102e96e2f6541132aad902fe978735a8188def52dde9f83
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "747d44ea4d84f76f8029daeb4f0509180301360c427349ac35b30c4ce34b38b4",
    "cross_cats_sorted": [
      "cs.AI",
      "cs.LG"
    ],
    "license": "http://creativecommons.org/licenses/by-nc-sa/4.0/",
    "primary_cat": "cs.CL",
    "submitted_at": "2025-09-10T17:59:43Z",
    "title_canon_sha256": "c7c07720025a1b339b5e8562879eacd8dfe59a7c8fdbd461faa42c4b2f10849e"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2509.08827",
    "kind": "arxiv",
    "version": 3
  }
}