pith:GTXFFVUW
R3M: A Universal Visual Representation for Robot Manipulation
Pre-trained visual features from human videos enable more data-efficient robot manipulation.
arxiv:2203.12601 v3 · 2022-03-23 · cs.RO · cs.AI · cs.CV · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{GTXFFVUW5IAGGYDDGCDKIZJSH5}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Across a suite of 12 simulated robot manipulation tasks, we find that R3M improves task success by over 20% compared to training from scratch and by over 10% compared to state-of-the-art visual representations like CLIP and MoCo. Furthermore, R3M enables a Franka Emika Panda arm to learn a range of manipulation tasks in a real, cluttered apartment given just 20 demonstrations.
That visual features learned from human video data will transfer effectively to robotic camera inputs and task distributions without any robot-specific fine-tuning or domain adaptation.
A visual encoder pre-trained on diverse human videos with contrastive and language objectives improves simulated robot manipulation success by over 20% versus training from scratch and enables real Franka arm tasks from 20 demonstrations.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:52.454622Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
34ee52d696ea006360633086a465323f42504e698f7a5184bb4ac4f8e17a2bd4
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/GTXFFVUW5IAGGYDDGCDKIZJSH5 \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 34ee52d696ea006360633086a465323f42504e698f7a5184bb4ac4f8e17a2bd4
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "2e12f3f0f536b483f3c663217324ecee07bd9809f73caf8399664ec0c02148d6",
"cross_cats_sorted": [
"cs.AI",
"cs.CV",
"cs.LG"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.RO",
"submitted_at": "2022-03-23T17:55:09Z",
"title_canon_sha256": "923da15624e8a8f8846da5becb700e1523a5a0058db29deff7fbab2dd3adcab3"
},
"schema_version": "1.0",
"source": {
"id": "2203.12601",
"kind": "arxiv",
"version": 3
}
}