pith. sign in
Pith Number

pith:GIDL7EDI

pith:2026:GIDL7EDIPK547X4WFQF343VGDI
not attested not anchored not stored refs pending

Memorize Theorems, Not Instances: Probing SFT Generalization through Mathematical Reasoning

Jing Lei, Mengyu Yang, Ruiying Peng, Xiaohui Li, Xinlei Chen, Xueyu Wu

Supervised fine-tuning for math reasoning succeeds when models learn to apply theorems explicitly instead of memorizing individual problem-answer pairs.

arxiv:2605.09270 v2 · 2026-05-10 · cs.LG · cs.AI

Add to your LaTeX paper
\usepackage{pith}
\pithnumber{GIDL7EDIPK547X4WFQF343VGDI}

Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge

Record completeness

1 Bitcoin timestamp
2 Internet Archive
3 Author claim open · sign in to claim
4 Citations open
5 Replications open
Portable graph bundle live · download bundle · merged state
The bundle contains the canonical record plus signed events. A mirror can host it anywhere and recompute the same current state with the deterministic merge algorithm.

Claims

C1strongest claim

Generalization failures stem not from memorization as a mechanism, but from memorizing the wrong inductive targets.

C2weakest assumption

That the reported performance gains are caused by the shift to theorem-level supervision rather than by other unspecified differences in data construction, prompting, or training hyperparameters between vanilla SFT and Theorem-SFT.

C3one line summary

Theorem-SFT improves mathematical reasoning generalization by teaching theorem application rather than instance memorization, delivering gains of +8.8% on MATH and +20.27% on GeoQA across model families.

Formal links

2 machine-checked theorem links

Receipt and verification
First computed 2026-05-26T02:04:12.658778Z
Builder pith-number-builder-2026-05-17-v1
Signature Pith Ed25519 (pith-v1-2026-05) · public key
Schema pith-number/v1.0

Canonical hash

3206bf90687abbcfdf962c0bbe6ea61a259ffc8a711e099e8cd6a8c01927ea9e

Aliases

arxiv: 2605.09270 · arxiv_version: 2605.09270v2 · doi: 10.48550/arxiv.2605.09270 · pith_short_12: GIDL7EDIPK54 · pith_short_16: GIDL7EDIPK547X4W · pith_short_8: GIDL7EDI
Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/GIDL7EDIPK547X4WFQF343VGDI \
  | jq -c '.canonical_record' \
  | python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 3206bf90687abbcfdf962c0bbe6ea61a259ffc8a711e099e8cd6a8c01927ea9e
Canonical record JSON
{
  "metadata": {
    "abstract_canon_sha256": "881f5c372f9beeea9741eb229f4420493808e42c9867d83c4256600ac50ab2cc",
    "cross_cats_sorted": [
      "cs.AI"
    ],
    "license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
    "primary_cat": "cs.LG",
    "submitted_at": "2026-05-10T02:39:05Z",
    "title_canon_sha256": "2157ed097ea7cbccee21b89c015f0df8544df3e69a5bc8306a22aa8959162c6d"
  },
  "schema_version": "1.0",
  "source": {
    "id": "2605.09270",
    "kind": "arxiv",
    "version": 2
  }
}