pith:QXWMAQZU
Improving Dictionary Learning with Gated Sparse Autoencoders
Gated Sparse Autoencoders separate feature selection from magnitude estimation to eliminate L1-induced shrinkage in language model dictionary learning.
arxiv:2404.16014 v2 · 2024-04-24 · cs.LG · cs.AI
Record completeness
Claims
Through training SAEs on LMs of up to 7B parameters we find that, in typical hyper-parameter ranges, Gated SAEs solve shrinkage, are similarly interpretable, and require half as many firing features to achieve comparable reconstruction fidelity.
That restricting the L1 penalty to the gating branch does not introduce new biases or degrade feature quality in dimensions not measured by the reported reconstruction and interpretability metrics.
Gated SAEs decouple which features to use from how large their activations should be, applying the L1 penalty only to selection and thereby eliminating shrinkage while halving the number of firing features needed for good fidelity.
References
Formal links
Cited by
Receipt and verification
| First computed | 2026-05-17T23:38:13.270899Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
85ecc043348c1085758bb26ca90394baf62ecde2b4a65f0faa10053019f8335c
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/QXWMAQZURQIIK5MLWJWKSA4UXL \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 85ecc043348c1085758bb26ca90394baf62ecde2b4a65f0faa10053019f8335c
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "a5a5978bca297540afaf137cdc1c11e59dd3aa7ff92132d2dba627675ae9dca9",
"cross_cats_sorted": [
"cs.AI"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.LG",
"submitted_at": "2024-04-24T17:47:22Z",
"title_canon_sha256": "de78f0873097f3b9f45e65322afe73347a1488e485186984f4ef162891cec806"
},
"schema_version": "1.0",
"source": {
"id": "2404.16014",
"kind": "arxiv",
"version": 2
}
}