pith:KWFZ7LKC
MixSD: Mixed Contextual Self-Distillation for Knowledge Injection
Aligning fine-tuning targets with a language model's own generation distribution prevents catastrophic forgetting of pretrained capabilities.
arxiv:2605.16865 v1 · 2026-05-16 · cs.CL
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{KWFZ7LKCVDH2UJMDUDL7SIR6CL}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
aligning supervision with the model's native generation distribution is a simple and effective principle for knowledge injection that mitigates catastrophic forgetting.
The mixed supervision sequences preserve the factual learning signal while remaining substantially closer to the base model's distribution, as constructed from the expert and naive conditionals.
MixSD achieves superior memorization-retention trade-off in knowledge injection by using mixed self-generated supervision from the base model's conditionals, retaining up to 100% held-out capability versus 1% for standard SFT.
References
Formal links
Receipt and verification
| First computed | 2026-05-20T00:03:27.143636Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
558b9fad42a8cfaa2583a0d7f9223e12fd89bae3676dcccdca66ce398399cc8a
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/KWFZ7LKCVDH2UJMDUDL7SIR6CL \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 558b9fad42a8cfaa2583a0d7f9223e12fd89bae3676dcccdca66ce398399cc8a
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "525996c2af35114b47f73523be53e16451c1d75fb47c025a9525adaa2e02f7bc",
"cross_cats_sorted": [],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.CL",
"submitted_at": "2026-05-16T07:57:09Z",
"title_canon_sha256": "d91fa992df7169978a7e2800ca98bd851dbc95e62d7f4f7e1bb7ffd198ddadab"
},
"schema_version": "1.0",
"source": {
"id": "2605.16865",
"kind": "arxiv",
"version": 1
}
}