pith. sign in

← back to paper

Review history

arxiv: 2605.15104 · 2 revisions

From Text to Voice: A Reproducible and Verifiable Framework for Evaluating Tool Calling LLM Agents

  1. 2026-05-21 UNVERDICTED LOW v0.9.0 novelty 6.0
    29691 ms 5833 in 1412 out 2026-05-21T08:42:11.009892+00:00
  2. 2026-05-15 UNVERDICTED LOW v0.9.0 novelty 7.0
    82732 ms 5602 in 1392 out 2026-05-15T03:18:59.044985+00:00