Inspect these assets only when essential for task comprehension or result verification using available viewer tools

Multimodal Data Handling • Selective Inspection: You have access to multimodal inputs (text, images, audio, video)

1 Pith paper cites this work. Polarity classification is still indexing.

representative citing papers

TOBench: A Task-Oriented Omni-Modal Benchmark for Real-World Tool-Using Agents

cs.AI · 2026-05-16 · unverdicted · novelty 5.0

MM-ToolBench introduces 100 closed-loop multimodal tasks across two domains with 27 MCP servers and 324 tools, where agents must execute, inspect artifacts, and revise before final output.

citing papers explorer

TOBench: A Task-Oriented Omni-Modal Benchmark for Real-World Tool-Using Agents

cs.AI · 2026-05-16 · unverdicted · none · ref 41