pith. sign in
Pith Number

pith:474OF7OJ

pith:2026:474OF7OJGDEYBUPH6Z7HJHZDUW
not attested not anchored not stored refs pending

SciResearcher: Scaling Deep Research Agents for Frontier Scientific Reasoning

Kelvin Kiu Wai Tam, Newt Nguyen Kim Hue Nam, Rui Wang, Tianqing Fang, Tianshi Zheng, Wei Fan, Xiyun Li, Yangqiu Song

Automated synthesis of conceptual and computational tasks trains an 8B model to set new records on frontier biology and chemistry reasoning benchmarks.

arxiv:2605.01489 v2 · 2026-05-02 · cs.AI · cs.CL

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{474OF7OJGDEYBUPH6Z7HJHZDUW}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1strongest claim

SciResearcher-8B achieves 19.46% on the HLE-Bio/Chem-Gold benchmark, establishing a new state of the art at its parameter scale and surpassing several larger proprietary agents. It further achieves 13-15% absolute gains on SuperGPQA-Hard-Biology and TRQA-Literature benchmarks.

C2weakest assumption

That tasks synthesized by the agentic framework accurately reflect the computational and reasoning demands of actual frontier scientific problems rather than simplified or proxy versions.

C3one line summary

SciResearcher automates creation of diverse scientific reasoning tasks from academic evidence to train an 8B model that sets new SOTA at 19.46% on HLE-Bio/Chem-Gold and gains 13-15% on SuperGPQA-Hard-Biology and TRQA-Literature.

Receipt and verification
First computed 2026-05-27T01:05:55.655135Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

e7f8e2fdc930c980d1e7f67e749f23a5a594411b314f889839663588cb8554cf

Aliases

arxiv: 2605.01489 · arxiv_version: 2605.01489v2 · doi: 10.48550/arxiv.2605.01489 · pith_short_12: 474OF7OJGDEY · pith_short_16: 474OF7OJGDEYBUPH · pith_short_8: 474OF7OJ
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/474OF7OJGDEYBUPH6Z7HJHZDUW \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: e7f8e2fdc930c980d1e7f67e749f23a5a594411b314f889839663588cb8554cf
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "3b47ba40dd0e0666a765efc80157afb6b97d519cfd9d13d598ae51c9eec8202c",
    "cross_cats_sorted": [
      "cs.CL"
    ],
    "license": "http://creativecommons.org/licenses/by-sa/4.0/",
    "primary_cat": "cs.AI",
    "submitted_at": "2026-05-02T15:26:45Z",
    "title_canon_sha256": "7ceb00cfd966388e770212fbb394a4642b787e096360daca5bfea7ab18ec741d"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.01489",
    "kind": "arxiv",
    "version": 2
  }
}