pith:NFR7D3TK
Self-Distilled Trajectory-Aware Boltzmann Modeling: Bridging the Training-Inference Discrepancy in Diffusion Language Models
A new optimization framework for diffusion language models uses self-distilled inference trajectories and Boltzmann modeling of entropies to close the gap with standard supervised fine-tuning.
arxiv:2605.11854 v2 · 2026-05-12 · cs.CL
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{NFR7D3TKF5FJXHBHE35KARRZL3}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
TABOM achieves substantial gains in new domains, expands the effective knowledge boundary of DLMs, and significantly mitigates catastrophic forgetting compared with standard SFT.
That modeling the inference unmasking preference as a Boltzmann distribution over predictive entropies and deriving a pairwise ranking objective from it will produce genuine knowledge acquisition rather than marginal or illusory gains.
TABOM models inference unmasking preferences as a Boltzmann distribution over predictive entropies and derives a ranking loss to align DLM training with observed trajectories, yielding gains in new domains and reduced catastrophic forgetting versus standard SFT.
Cited by
Receipt and verification
| First computed | 2026-05-20T00:04:36.307179Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
6963f1ee6a2f4a9b9c2726faa046395efde5b325c11605e8d6fbf224148341e0
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/NFR7D3TKF5FJXHBHE35KARRZL3 \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 6963f1ee6a2f4a9b9c2726faa046395efde5b325c11605e8d6fbf224148341e0
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "dc78199a10256301f0dcd74b28737650a982dce09dcd11f273d65eb6aabd7b7c",
"cross_cats_sorted": [],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.CL",
"submitted_at": "2026-05-12T09:39:06Z",
"title_canon_sha256": "81b1fcd3984a8bed55856acf3b3d8e6dbe00b7b2cfc8a57fc276301cb0e49e40"
},
"schema_version": "1.0",
"source": {
"id": "2605.11854",
"kind": "arxiv",
"version": 2
}
}