pith. the verified trust layer for science. sign in
Pith Number

pith:BQDFLROG

pith:2025:BQDFLROG4STK3HB6IUQSMANU74
not attested not anchored not stored refs resolved

Video Generators are Robot Policies

Carl Vondrick, Junbang Liang, Paarth Shah, Pavel Tokmakov, Rares Ambrus, Ruoshi Liu, Sruthi Sudhakar

Video generation models can serve as robot policies by predicting future behavior frames and extracting actions from them.

arxiv:2508.00795 v1 · 2025-08-01 · cs.RO

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{BQDFLROG4STK3HB6IUQSMANU74}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1strongest claim

learning to generate videos of robot behavior allows for the extraction of policies with minimal demonstration data, significantly improving robustness and sample efficiency

C2weakest assumption

that the video generator produces videos whose implied actions are both feasible and optimal for the robot, without introducing dynamics that do not match the physical system

C3one line summary

Training models to generate videos of robot actions produces policies that generalize better to new objects and tasks while using far less demonstration data than standard behavior cloning.

References

63 extracted · 63 resolved · 7 Pith anchors

[1] M. Bain and C. Sammut. A framework for behavioural cloning. In Machine intelligence 15 , pages 103–129, 1995 1995
[2] C. Chi, S. Feng, Y . Du, Z. Xu, E. Cousineau, B. Burchfiel, and S. Song. Diffusion policy: Visuomotor policy learning via action diffusion. In RSS, 2023 2023
[3] A. Brohan, N. Brown, J. Carbajal, Y . Chebotar, J. Dabis, C. Finn, K. Gopalakrishnan, K. Haus- man, A. Herzog, J. Hsu, et al. RT-1: Robotics transformer for real-world control at scale. In RSS, 2022 2022
[4] O. M. Team, D. Ghosh, H. Walke, K. Pertsch, K. Black, O. Mees, S. Dasari, J. Hejna, T. Kreiman, C. Xu, et al. Octo: An open-source generalist robot policy. In RSS, 2024 2024
[5] K. Black, N. Brown, D. Driess, A. Esmail, M. Equi, C. Finn, N. Fusai, L. Groom, K. Hausman, B. Ichter, et al. π0: A vision-language-action flow model for general robot control. RSS, 2025 2025

Formal links

2 machine-checked theorem links

Cited by

20 papers in Pith

Receipt and verification
First computed 2026-05-17T23:38:50.056926Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

0c0655c5c6e4a6ad9c3e45212601b4ff3af1553ebbe9880dfd774649f1429ae8

Aliases

arxiv: 2508.00795 · arxiv_version: 2508.00795v1 · doi: 10.48550/arxiv.2508.00795 · pith_short_12: BQDFLROG4STK · pith_short_16: BQDFLROG4STK3HB6 · pith_short_8: BQDFLROG
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/BQDFLROG4STK3HB6IUQSMANU74 \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 0c0655c5c6e4a6ad9c3e45212601b4ff3af1553ebbe9880dfd774649f1429ae8
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "5e72f95ab4a2684d67183c98a995b8cb49ff1d51ed5898e203b6071544df66aa",
    "cross_cats_sorted": [],
    "license": "http://creativecommons.org/licenses/by/4.0/",
    "primary_cat": "cs.RO",
    "submitted_at": "2025-08-01T17:23:49Z",
    "title_canon_sha256": "a444d10e3f74deb76778d265a9478199e5e5cd0b7f6e5b90f7db6678eca95ff5"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2508.00795",
    "kind": "arxiv",
    "version": 1
  }
}