pith:C757O2MR
MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action
A textual prompt design lets ChatGPT collaborate with vision experts to handle advanced multimodal reasoning and action in zero-shot settings.
arxiv:2303.11381 v1 · 2023-03-20 · cs.CV · cs.CL · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{C757O2MRLHQBAEEYAMLEYHRIXC}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Zero-shot experiments demonstrate MM-REACT's effectiveness in addressing the specified capabilities of interests and its wide application in different scenarios that require advanced visual understanding.
The textual prompt design can faithfully represent and allow language models to process dense visual signals such as images and videos without loss of critical information.
MM-REACT uses textual prompts to let ChatGPT collaborate with external vision experts for zero-shot multimodal reasoning and action on advanced visual tasks.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-18T03:00:13.087518Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
17fbf7699159e010109803164c1e28b8be7a9d986cdbd49dc4371790e0fd38f7
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/C757O2MRLHQBAEEYAMLEYHRIXC \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 17fbf7699159e010109803164c1e28b8be7a9d986cdbd49dc4371790e0fd38f7
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "ad55e24432253d2a7fd679fd3f5d8e67b783e447d9f8098fe700f8965e46239c",
"cross_cats_sorted": [
"cs.CL",
"cs.LG"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CV",
"submitted_at": "2023-03-20T18:31:47Z",
"title_canon_sha256": "d8fc08a05575b94e41ebafb15d423ba6457301a1ecb6e571ce2d3f5d4f47bbb0"
},
"schema_version": "1.0",
"source": {
"id": "2303.11381",
"kind": "arxiv",
"version": 1
}
}