pith:E4XUT7DC
Energy-Regularized Spatial Masking: A Novel Approach to Enhancing Robustness and Interpretability in Vision Models
Embedding a differentiable energy minimization layer inside convolutional networks lets them autonomously select sparse, coherent spatial features for improved robustness and interpretability.
arxiv:2604.06893 v3 · 2026-04-08 · cs.CV · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{E4XUT7DCORXNTYCWPBSMFOIO6R}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
We validate ERSM on convolutional architectures and demonstrate that it produces emergent sparsity, improved robustness to structured occlusion, and highly interpretable spatial masks, while preserving classification accuracy. Furthermore, we show that the learned energy ranking significantly outperforms magnitude-based pruning in deletion-based robustness tests.
That the proposed unary importance cost and pairwise spatial coherence penalty can be combined into a differentiable energy function whose minimization inside standard backbones yields stable training and semantically meaningful masks without additional supervision or post-hoc tuning.
ERSM reformulates spatial feature selection in vision models as energy minimization with unary importance and pairwise coherence terms, producing emergent sparsity and better occlusion robustness.
Formal links
Receipt and verification
| First computed | 2026-06-09T02:08:41.779309Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
272f49fc62746ed9e0567864c2b90ef47d3aa4298c4cd439809b1e85a3890bd3
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/E4XUT7DCORXNTYCWPBSMFOIO6R \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 272f49fc62746ed9e0567864c2b90ef47d3aa4298c4cd439809b1e85a3890bd3
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "5586c1b3fa59b57633a4bdb5f4448cfa91bb0f12ba54c89eda943f7420264279",
"cross_cats_sorted": [
"cs.LG"
],
"license": "http://arxiv.org/licenses/nonexclusive-distrib/1.0/",
"primary_cat": "cs.CV",
"submitted_at": "2026-04-08T09:48:31Z",
"title_canon_sha256": "07683473bfea90c02f26fa281ab5069e6c583591810975e203374feafd9e0a4f"
},
"schema_version": "1.0",
"source": {
"id": "2604.06893",
"kind": "arxiv",
"version": 3
}
}