pith:TKNCVVC7
To See is Not to Learn: Protecting Multimodal Data from Unauthorized Fine-Tuning of Large Vision-Language Model
Data owners can add invisible perturbations to images and text to stop large vision-language models from learning real content during unauthorized fine-tuning.
arxiv:2605.14291 v1 · 2026-05-14 · cs.CR · cs.AI · cs.CL · cs.CV · cs.LG
Add to your LaTeX paper
\usepackage{pith}
\pithnumber{TKNCVVC76J3KRK2NEPNL3GJYPY}
Prints a linked badge after your title and injects PDF metadata. Compiles on arXiv. Learn more · Embed verified badge
Record completeness
Claims
MMGuard is evaluated against nine open-source LVLMs across six datasets. Our comprehensive results demonstrate effective, stealthy, and robust protection under white-box, gray-box, and black-box threat models, establishing a mechanistic advantage in proactively defending against aggressive fine-tuning exploitation.
The injected perturbations create an optimization shortcut that forces the model to overfit to noise rather than content, and the cross-modal binding disruption enforces spurious correlations with theoretical guarantees, assuming the LVLM's attention and loss landscape behave predictably under the chosen perturbation strategy.
MMGuard generates unlearnable multimodal examples via perturbations that exploit LVLM optimization shortcuts and disrupt cross-modal bindings, providing robust protection against unauthorized fine-tuning across threat models.
References
Receipt and verification
| First computed | 2026-05-17T23:39:10.196919Z |
|---|---|
| Builder | pith-number-builder-2026-05-17-v1 |
| Signature | Pith Ed25519
(pith-v1-2026-05) · public key |
| Schema | pith-number/v1.0 |
Canonical hash
9a9a2ad45ff276a8ab4d23dabd99387e212d2cff4a32ced6f211e825a2efd6c1
Aliases
· · · · ·Agent API
Verify this Pith Number yourself
curl -sH 'Accept: application/ld+json' https://pith.science/pith/TKNCVVC76J3KRK2NEPNL3GJYPY \
| jq -c '.canonical_record' \
| python3 -c "import sys,json,hashlib; b=json.dumps(json.loads(sys.stdin.read()), sort_keys=True, separators=(',',':'), ensure_ascii=False).encode(); print(hashlib.sha256(b).hexdigest())"
# expect: 9a9a2ad45ff276a8ab4d23dabd99387e212d2cff4a32ced6f211e825a2efd6c1
Canonical record JSON
{
"metadata": {
"abstract_canon_sha256": "c32aac0e42aa4742efa0365c16d87960036e04a988d3cf4df2c932d6f71f6a00",
"cross_cats_sorted": [
"cs.AI",
"cs.CL",
"cs.CV",
"cs.LG"
],
"license": "http://creativecommons.org/licenses/by/4.0/",
"primary_cat": "cs.CR",
"submitted_at": "2026-05-14T02:49:27Z",
"title_canon_sha256": "29eeeb6ea16a1c46e35bdb94b5ba65d1cda12753ab4858d3691072b88df22a92"
},
"schema_version": "1.0",
"source": {
"id": "2605.14291",
"kind": "arxiv",
"version": 1
}
}