pith:KW7XYM5B
Learning to Foresee: Unveiling the Unlocking Efficiency of On-Policy Distillation
On-policy distillation locks onto a stable update trajectory toward the final model early in training.
arxiv:2605.11739 v3 · 2026-05-12 · cs.CL
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{KW7XYM5B6RYRB4ECLKAOQB534H}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
OPD's efficiency stems from a form of ``foresight'': it establishes a stable update trajectory toward the final model early in training. This manifests at the Module-Allocation Level by concentrating updates on critical modules and at the Update-Direction Level by stronger low-rank concentration aligning with the final subspace, enabling EffOPD to achieve an average 3x training acceleration.
That the observed module utility patterns and low-rank alignment are causal drivers of efficiency rather than correlated side effects, and that adaptive extrapolation along the current direction generalizes without degrading final performance across diverse tasks and model scales.
On-policy distillation gains efficiency from early foresight in module allocation and low-rank update directions, enabling EffOPD to accelerate training by 3x via adaptive extrapolation without extra modules or tuning.
Receipt and verification
| First computed | 2026-05-22T01:04:06.080374Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
55bf7c33a1f47110f0825a80e807bbe1df4d20a11d0fda0a615ccbf8dd5f3608
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/KW7XYM5B6RYRB4ECLKAOQB534H \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 55bf7c33a1f47110f0825a80e807bbe1df4d20a11d0fda0a615ccbf8dd5f3608
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "29be5cf5a4a7a59eae794005e1ab80404060a346f322416b0b85aff6eda66c38",
"cross_cats_sorted": [],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.CL",
"submitted_at": "2026-05-12T08:19:15Z",
"title_canon_sha256": "24a4e720c67fa9bfcfb3d2ff0a86b4c0190ab5ee5944ed8d591d6b7e1b54d7b9"
},
"schema_version": "1.0",
"source": {
"id": "2605.11739",
"kind": "arxiv",
"version": 3
}
}