pith:C32FX3ZN
GAIA-2: A Controllable Multi-View Generative World Model for Autonomous Driving
GAIA-2 generates high-resolution multi-camera driving videos from structured inputs like vehicle dynamics, agent positions, and road semantics.
arxiv:2503.20523 v1 · 2025-03-26 · cs.CV · cs.AI · cs.RO
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{C32FX3ZNYSA4UYVQ3E2E3C6UAA}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
GAIA-2 supports controllable video generation conditioned on a rich set of structured inputs: ego-vehicle dynamics, agent configurations, environmental factors, and road semantics. It generates high-resolution, spatiotemporally consistent multi-camera videos across geographically diverse driving environments.
That the generated videos are sufficiently realistic, consistent, and free of artifacts to serve as effective training data for autonomous driving systems without introducing biases or failures when transferred to real vehicles.
GAIA-2 is a controllable latent diffusion world model that produces spatiotemporally consistent multi-view videos for autonomous driving simulation across diverse geographies.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:52.407613Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
16f45bef2dc481ca62b0d9344d8bd40035eaae2c01e698f7c62fd2e37dd2db93
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/C32FX3ZNYSA4UYVQ3E2E3C6UAA \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 16f45bef2dc481ca62b0d9344d8bd40035eaae2c01e698f7c62fd2e37dd2db93
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "fb599b61127c4e4ed010c5302ce50dd0aec525513754619b7e36da42bc535029",
"cross_cats_sorted": [
"cs.AI",
"cs.RO"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CV",
"submitted_at": "2025-03-26T13:11:35Z",
"title_canon_sha256": "da0c2c4ad3f32678ee5c887c2e1236f46fc1af634d54bd819e57d92fa8043270"
},
"schema_version": "1.0",
"source": {
"id": "2503.20523",
"kind": "arxiv",
"version": 1
}
}