pith:Q4F7RUBK
GRACE: Gradient-aligned Reasoning Data Curation for Efficient Post-training
GRACE scores each reasoning step by its alignment with the answer gradient and by its trajectory consistency, then selects data subsets that match or exceed full-data performance with 5-20 percent of the samples.
arxiv:2605.13130 v1 · 2026-05-13 · cs.AI
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{Q4F7RUBKK2IZHFPIAJTVEUKXGM}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv.
Claims
Post-training Qwen3-VL-2B-Instruct on MMathCoT-1M, GRACE reaches 108.8% of the full-data performance with 20% of the data and retains 100.2% with only 5%, with subsets that transfer effectively across model backbones.
The approach rests on the premise that the representation-level gradient proxy accurately captures step-level alignment with the answer-oriented gradient, and that the two signals (alignment and consistency) reliably identify valuable reasoning steps without external reward models or step annotations.
GRACE scores reasoning steps via gradient alignment and trajectory consistency to select data subsets that match full performance with 5% of the data on Qwen3-VL-2B-Instruct.
Receipt and verification
| Field | Value |
|---|---|
| First computed | 2026-05-18T03:08:57.800483Z |
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519 (pith-v1-2026-05) |
| Schema | pith-number/v1.0 |
Canonical hash
870bf8d02a56919395e802675251573321a41d43e4aff08d897bd86a8641ff65
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/Q4F7RUBKK2IZHFPIAJTVEUKXGM \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 870bf8d02a56919395e802675251573321a41d43e4aff08d897bd86a8641ff65
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "4227439fb8a5dcf0e7f10a0e15349f543aa9bcf7c65d3f3e5d9a89473a50b7bf",
"cross_cats_sorted": [],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.AI",
"submitted_at": "2026-05-13T07:55:39Z",
"title_canon_sha256": "8d985cead699e72f8e855647819a7df04204444661269631e7f85e2af3ba6cf1"
},
"schema_version": "1.0",
"source": {
"id": "2605.13130",
"kind": "arxiv",
"version": 1
}
}
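The verification one-liner above can also be read as a small offline script. The sketch below mirrors its canonicalization (sorted keys, compact separators, non-ASCII characters kept as-is, UTF-8 encoding) and hashes the record shown above. The names `RECORD`, `EXPECTED`, `canonical_bytes`, and `pith_sha256` are illustrative and not part of any Pith API. Whether the digest matches depends on the record displayed here being identical to what the endpoint returns, which this sketch does not check against the live service.

```python
import hashlib
import json

# Record copied from the "Canonical record JSON" section above.
RECORD = {
    "metadata": {
        "abstract_canon_sha256": "4227439fb8a5dcf0e7f10a0e15349f543aa9bcf7c65d3f3e5d9a89473a50b7bf",
        "cross_cats_sorted": [],
        "license": "http://creativecommons.org/licenses/by/4.0/",
        "primary_cat": "cs.AI",
        "submitted_at": "2026-05-13T07:55:39Z",
        "title_canon_sha256": "8d985cead699e72f8e855647819a7df04204444661269631e7f85e2af3ba6cf1",
    },
    "schema_version": "1.0",
    "source": {"id": "2605.13130", "kind": "arxiv", "version": 1},
}

# Hash listed under "Canonical hash" on this page.
EXPECTED = "870bf8d02a56919395e802675251573321a41d43e4aff08d897bd86a8641ff65"


def canonical_bytes(record):
    """Serialize with the same options as the one-liner: sorted keys,
    no whitespace between tokens, non-ASCII left unescaped, UTF-8 bytes."""
    text = json.dumps(
        record, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    )
    return text.encode("utf-8")


def pith_sha256(record):
    """Return the hex SHA-256 digest of the canonical serialization."""
    return hashlib.sha256(canonical_bytes(record)).hexdigest()


if __name__ == "__main__":
    digest = pith_sha256(RECORD)
    print(digest)
    print("matches expected:", digest == EXPECTED)
```

Because keys are sorted before hashing, the digest does not depend on the order in which the JSON object's fields arrive; any change to a value, however small, produces a different digest.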