pith:2JJKI5UQ
WebThinker: Empowering Large Reasoning Models with Deep Research Capability
WebThinker lets large reasoning models search the web and draft reports autonomously during reasoning.
arxiv:2504.21776 v2 · 2025-04-30 · cs.CL · cs.AI · cs.IR
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{2JJKI5UQGMKXPVPXS5K573WHRC}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
Extensive experiments on complex reasoning benchmarks (GPQA, GAIA, WebWalkerQA, HLE) and scientific report generation tasks (Glaive) demonstrate that WebThinker significantly outperforms existing methods and strong proprietary systems.
That the Deep Web Explorer module can reliably locate, navigate, and extract accurate information from arbitrary web pages without introducing navigation errors or factual hallucinations that propagate into the final report.
WebThinker equips large reasoning models with autonomous web exploration and interleaved reasoning-drafting via a Deep Web Explorer and RL-based DPO training, yielding gains on GPQA, GAIA, and report-generation benchmarks.
References
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:46.873797Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
d252a47690331577d5f79755dfeec78889b87740c75130c472c25ff2b5a61c87
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/2JJKI5UQGMKXPVPXS5K573WHRC \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: d252a47690331577d5f79755dfeec78889b87740c75130c472c25ff2b5a61c87
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "e99d68991c3e046a23e457994dc6f977a81a18aab14292ab79ad9980dc1b958f",
"cross_cats_sorted": [
"cs.AI",
"cs.IR"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CL",
"submitted_at": "2025-04-30T16:25:25Z",
"title_canon_sha256": "d518db51547776410cc9728128f4fd47506bcf25e3bad8c45c165fe41daa4a18"
},
"schema_version": "1.0",
"source": {
"id": "2504.21776",
"kind": "arxiv",
"version": 2
}
}