pith. sign in

Can ai freelancers compete? benchmarking earnings, reliability, and task success at scale.arXiv preprint arXiv:2505.13511, 2025

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

stat.OT 1

years

2026 1

verdicts

UNVERDICTED 1

clear filters

representative citing papers

Flaws in the LLM Automation Narrative

stat.OT · 2026-06-09 · unverdicted · novelty 7.0

A new code-writing data analysis benchmark shows human experts outperforming a frontier LLM on average with lower performance variance.

citing papers explorer

Showing 1 of 1 citing paper after filters.

  • Flaws in the LLM Automation Narrative stat.OT · 2026-06-09 · unverdicted · none · ref 42

    A new code-writing data analysis benchmark shows human experts outperforming a frontier LLM on average with lower performance variance.