pith:DTOJ6LA4
Beyond Safety Filtering: Control Barrier Function-Informed Reinforcement Learning for Connected and Automated Vehicles
Converting Control Barrier Function constraints into rewards guides multi-agent reinforcement learning to higher performance with reduced hyperparameter sensitivity in connected vehicle intersections.
arxiv:2605.16894 v1 · 2026-05-16 · cs.RO · cs.SY · eess.SY
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{DTOJ6LA4GNU76LWDO7JI5CNJKP}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Results show that our method achieves the highest task performance and is less sensitive to reward hyperparameters, yielding consistently strong performance across the tested hyperparameter range.
That converting CBF constraint values under joint MARL actions into a reward signal will reliably guide safe learning without introducing new instabilities or performance trade-offs in the multi-agent intersection setting.
CBF-informed rewards for multi-agent RL achieve higher task performance and lower sensitivity to hyperparameters than heuristic baselines in a simulated four-way intersection with connected automated vehicles.
References
Formal links
Receipt and verification
| First computed | 2026-05-20T00:03:28.767960Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
1cdc9f2c1c3369ff2ec377d28e89a953e4d25a72954390a2d65591bbdb4e19f9
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/DTOJ6LA4GNU76LWDO7JI5CNJKP \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 1cdc9f2c1c3369ff2ec377d28e89a953e4d25a72954390a2d65591bbdb4e19f9
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "5f85ffaa2ee6e319efaa1018677c86535ef4427802f65f93bc42a18383be1b65",
"cross_cats_sorted": [
"cs.SY",
"eess.SY"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.RO",
"submitted_at": "2026-05-16T09:12:59Z",
"title_canon_sha256": "2a9e2e8c6745166ad0f680ab9e26e381104876c61ab53032a5784727f3070290"
},
"schema_version": "1.0",
"source": {
"id": "2605.16894",
"kind": "arxiv",
"version": 1
}
}