pith:4MCCMGFO
Learning When to Act: Communication-Efficient Reinforcement Learning via Run-Time Assurance
A single RL policy learns both control actions and sparse timing decisions while a Lyapunov run-time assurance shield enforces stability via LQR overrides.
arxiv:2605.12561 v1 · 2026-05-11 · cs.LG · cs.RO
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{4MCCMGFO5LQX7QQZVB737RVPGY}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
On an inverted pendulum, cart-pole, and planar quadrotor, the learned policy achieves 1.91×, 1.45×, and 3.51× higher mean inter-sample interval (MSI) than a Lyapunov-triggered baseline; a fixed LQR controller at the same average rate is unstable on all three plants, showing that adaptive timing, not a lower average rate, makes sparsity safe.
The approach assumes a known equilibrium point where CARE-based LQR backups and Lyapunov certificates are well-defined, allowing the RTA to provide strictly stronger safety guarantees than expectation-based methods.
Learned policies with runtime Lyapunov shields achieve substantially higher communication intervals than baselines while maintaining stability on inverted pendulum, cart-pole, and quadrotor systems.
References
Receipt and verification
| First computed | 2026-05-18T03:10:01.975671Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
e3042618aeeae17fc219a87fbfc6af36284ed772a4b249bbd9f183c6fb2f23aa
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/4MCCMGFO5LQX7QQZVB737RVPGY \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: e3042618aeeae17fc219a87fbfc6af36284ed772a4b249bbd9f183c6fb2f23aa
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "d524fae40de3911617501cc4d78c9be2d8101223b16693832f708c804f284830",
"cross_cats_sorted": [
"cs.RO"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.LG",
"submitted_at": "2026-05-11T23:55:15Z",
"title_canon_sha256": "441fded9b6c9b07b04ea76f6df2be6887a1930b077af32f3af176b487fc2417b"
},
"schema_version": "1.0",
"source": {
"id": "2605.12561",
"kind": "arxiv",
"version": 1
}
}