pith:MZ2Q6ZWW
AlphaDrive: Unleashing the Power of VLMs in Autonomous Driving via Reinforcement Learning and Reasoning
Reinforcement learning with tailored rewards and a two-stage strategy improves vision-language models for autonomous driving planning.
arxiv:2503.07608 v1 · 2025-03-10 · cs.CV · cs.RO
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{MZ2Q6ZWWEBZ2M4XGYHF72E5SOH}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
AlphaDrive significantly improves both planning performance and training efficiency compared to using only SFT or without reasoning, and following RL training exhibits emergent multimodal planning capabilities.
That the four GRPO-based RL rewards and two-stage training strategy produce generalizable, safe improvements on real-world driving data rather than overfitting to the training distribution.
AlphaDrive uses GRPO-based RL rewards and two-stage SFT+RL training on VLMs to improve autonomous driving planning performance and efficiency while producing emergent multimodal capabilities.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:46.738477Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
66750f66d62073a672e6c1cbfd13b271d5568d74846ccc3c164489e69622a1e9
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/MZ2Q6ZWWEBZ2M4XGYHF72E5SOH \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 66750f66d62073a672e6c1cbfd13b271d5568d74846ccc3c164489e69622a1e9
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "251251be26a88dc4ba247abe36d2d3fb3589f84bb6d3ebad2a96ee22e776a2fb",
"cross_cats_sorted": [
"cs.RO"
],
"license": "http://creativecommons.org/licenses/by-nc-sa/4.0/",
"primary_cat": "cs.CV",
"submitted_at": "2025-03-10T17:59:42Z",
"title_canon_sha256": "ffbef9e71aead583bc8fd0ecc333658dd73e98f7124c6a20764b1750183a9904"
},
"schema_version": "1.0",
"source": {
"id": "2503.07608",
"kind": "arxiv",
"version": 1
}
}