pith. sign in
Pith Number

pith:7ZABN6EF

pith:2026:7ZABN6EF4P5PJM3ZIYRJ7YQCCM
not attested not anchored not stored refs pending

Offline Reinforcement Learning for Rotation Profile Control in Tokamaks

Andrew Rothstein, Egemen Kolemen, Hiro Josep Farre Kaga, Ian Char, Jeff Schneider, Jiayu Chen, Ricardo Shousha, Rohit Sonker

Offline reinforcement learning policies trained on historical tokamak data can control plasma rotation profiles when deployed on a real device.

arxiv:2605.05857 v2 · 2026-05-07 · cs.LG

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{7ZABN6EF4P5PJM3ZIYRJ7YQCCM}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1strongest claim

Our final method uses probabilistic models of plasma dynamics to generate rollouts for RL training. We deploy this policy on the DIII-D Tokamak and observe promising real-world results.

C2weakest assumption

That historical data from past plasma conditions is representative enough for the learned policy to generalize safely to new operating points and that the probabilistic models capture the relevant dynamics without dangerous extrapolation errors.

C3one line summary

Offline RL policies trained solely on DIII-D historical data were deployed on the tokamak and produced promising real-world control of the plasma rotation profile.

Receipt and verification
First computed 2026-06-10T01:11:01.060030Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

fe4016f885e3faf4b37946229fe202133546f597ce4f0b36a3d0581625598ed2

Aliases

arxiv: 2605.05857 · arxiv_version: 2605.05857v2 · doi: 10.48550/arxiv.2605.05857 · pith_short_12: 7ZABN6EF4P5P · pith_short_16: 7ZABN6EF4P5PJM3Z · pith_short_8: 7ZABN6EF
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/7ZABN6EF4P5PJM3ZIYRJ7YQCCM \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: fe4016f885e3faf4b37946229fe202133546f597ce4f0b36a3d0581625598ed2
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "b68d4c719e878b5af22a74e8088ff26e4598a43d36c7c980fe98899a5f17e379",
    "cross_cats_sorted": [],
    "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
    "primary_cat": "cs.LG",
    "submitted_at": "2026-05-07T08:26:59Z",
    "title_canon_sha256": "0c9d6f4ffe701b12f204d593eb74f386f999c9b21b662c8c02c7a162d591e027"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.05857",
    "kind": "arxiv",
    "version": 2
  }
}