pith:CE6UHSSY
VideoCrafter1: Open Diffusion Models for High-Quality Video Generation
Open diffusion models generate realistic videos at 1024x576 resolution from text, with an image-to-video version that preserves input content.
arxiv:2310.19512 v1 · 2023-10-30 · cs.CV
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{CE6UHSSYIZOIZQSPP6JTFWKE4A}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Our proposed T2V model can generate realistic and cinematic-quality videos with a resolution of 1024 × 576, outperforming other open-source T2V models in terms of quality. The I2V model is the first open-source I2V foundation model capable of transforming a given image into a video clip while maintaining content preservation constraints.
The assumption that the models achieve the stated quality, outperformance, and strict content preservation, which rests on unspecified training details, evaluation metrics, and comparisons not provided in the abstract.
Open-source text-to-video and image-to-video diffusion models generate high-quality 1024x576 videos, with the I2V variant claimed as the first to strictly preserve reference image content.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:39:21.595918Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
113d43ca58465c8cc24f7f9332d944e00cd0458a20c85d8c0b7cfb37f6313405
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/CE6UHSSYIZOIZQSPP6JTFWKE4A \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 113d43ca58465c8cc24f7f9332d944e00cd0458a20c85d8c0b7cfb37f6313405
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "2bca427249d4b40857a3991cb40c3ce214a50909dd8a5bd269297185d7728356",
"cross_cats_sorted": [],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.CV",
"submitted_at": "2023-10-30T13:12:40Z",
"title_canon_sha256": "dbef2c651c70aba99234257294d313bd01fe92e9f497281450091b5eb06da62f"
},
"schema_version": "1.0",
"source": {
"id": "2310.19512",
"kind": "arxiv",
"version": 1
}
}