pith. sign in

← back to paper

Review history

arxiv: 2602.00933 · 2 revisions

MCP-Atlas: A Large-Scale Benchmark for Tool-Use Competency with Real MCP Servers

  1. 2026-05-21 ACCEPT LOW v0.9.0 novelty 8.0
    66028 ms 5952 in 1609 out 2026-05-21T15:03:13.690263+00:00
  2. 2026-05-16 UNVERDICTED LOW v0.9.0 novelty 7.0
    26495 ms 5573 in 1268 out 2026-05-16T08:25:29.700997+00:00