Employ any necessary tools—such as code-based parsing scripts or vision-based image conversion—to accurately extract content

Robustness & Tolerance Rules Multimodal File Inspection: You are required to read, parse diverse file formats (including plain text, JSON, CSV, Excel, PPT, PDF, etc

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Workspace-Bench 1.0: Benchmarking AI Agents on Workspace Tasks with Large-Scale File Dependencies

cs.AI · 2026-05-05 · unverdicted · novelty 8.0 · 3 refs

Workspace-Bench reveals that AI agents achieve only 43.3% average success on workspace tasks with large-scale file dependencies, compared to 80.7% for humans.

citing papers explorer

Showing 1 of 1 citing paper.

Workspace-Bench 1.0: Benchmarking AI Agents on Workspace Tasks with Large-Scale File Dependencies cs.AI · 2026-05-05 · unverdicted · none · ref 7 · 3 links
Workspace-Bench reveals that AI agents achieve only 43.3% average success on workspace tasks with large-scale file dependencies, compared to 80.7% for humans.

Employ any necessary tools—such as code-based parsing scripts or vision-based image conversion—to accurately extract content

fields

years

verdicts

representative citing papers

citing papers explorer