UniMark: Unified Adaptive Multi-bit Watermarking for Autoregressive Image Generators

Amir Rahman; Elena Petrova; Lucia Rossi; Mehmet Kaya; Yigit Yilmaz

arxiv: 2604.11843 · v1 · submitted 2026-04-12 · 💻 cs.CV

UniMark: Unified Adaptive Multi-bit Watermarking for Autoregressive Image Generators

Yigit Yilmaz , Elena Petrova , Mehmet Kaya , Lucia Rossi , Amir Rahman This is my paper

Pith reviewed 2026-05-10 15:44 UTC · model grok-4.3

classification 💻 cs.CV

keywords watermarkingautoregressive generationmulti-bit messagessemantic groupingtoken replacementimage robustnesstraining-free embedding

0 comments

The pith

UniMark introduces a training-free framework to embed multi-bit watermarks in autoregressive image generators across different architectures while maintaining quality and robustness.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes UniMark to solve limitations in watermarking for autoregressive image generators, specifically the lack of multi-bit support, vulnerability of static partitions, and lack of generality across models. It does this through three components that allow dynamic partitioning of codebooks, block-based message encoding with error correction, and an abstract interface for token replacement. A sympathetic reader would care because it enables reliable tracing of AI-generated images with embedded messages that survive common edits, addressing ownership and accountability issues in generative AI.

Core claim

UniMark is a unified adaptive multi-bit watermarking method for autoregressive image generators that uses adaptive semantic grouping of codebook entries based on similarity and a secret key for security, block-wise multi-bit encoding with error-correcting codes for reliable extraction, and a unified token-replacement interface to work with both next-token and next-scale prediction paradigms, delivering state-of-the-art performance in image fidelity, detection rates, and robustness to distortions.

What carries the argument

Adaptive Semantic Grouping that dynamically partitions codebook entries using semantic similarity and a secret key, enabling secure multi-bit embedding without training.

If this is right

Multi-bit messages can be embedded and extracted reliably from generated images.
The watermark remains detectable after image transformations such as cropping and compression.
Image generation quality, measured by FID, stays at or above baseline levels.
The method generalizes to different autoregressive architectures without retraining or tuning.
Theoretical bounds on detection error and capacity are provided for the encoding scheme.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

This framework might inspire similar adaptive techniques in other token-based generative systems for content authentication.
Adjusting the block size or error-correcting code strength could trade off message length against robustness in specific use cases.
Integration into production image generators could provide a practical way to label outputs for regulatory compliance.

Load-bearing premise

Dynamic semantic grouping based on similarity and a secret key can preserve perceptual quality and security against partition-exposing attacks without needing architecture-specific adjustments.

What would settle it

If applying the adaptive grouping leads to a noticeable increase in FID scores or if an attacker who knows the similarity metric can remove the watermark without the key, the central performance claims would be falsified.

Figures

Figures reproduced from arXiv: 2604.11843 by Amir Rahman, Elena Petrova, Lucia Rossi, Mehmet Kaya, Yigit Yilmaz.

**Figure 1.** Figure 1: Embedding capacity analysis: bit accuracy vs. message length across three AR models. JPEG Blur Noise Crop Color Erasing 80 85 90 95 100 UniMark (Ours) IndexMark AR-Watermark [PITH_FULL_IMAGE:figures/full_fig_p010_1.png] view at source ↗

**Figure 3.** Figure 3: Parameter sensitivity analysis: effect of green ratio [PITH_FULL_IMAGE:figures/full_fig_p010_3.png] view at source ↗

**Figure 4.** Figure 4: Security analysis: (a) green ratio distributions under correct/wrong keys; (b) [PITH_FULL_IMAGE:figures/full_fig_p011_4.png] view at source ↗

**Figure 5.** Figure 5: Efficiency analysis: (a) absolute time comparison (log scale); (b) relative overhead [PITH_FULL_IMAGE:figures/full_fig_p011_5.png] view at source ↗

**Figure 6.** Figure 6: Attack strength analysis: (a) JPEG quality, (b) Gaussian noise [PITH_FULL_IMAGE:figures/full_fig_p012_6.png] view at source ↗

**Figure 8.** Figure 8: Component contribution analysis showing ∆FID, TPR, and bit accuracy for each ablation variant. TPR down to quality 20. Under Gaussian noise, the critical threshold is around σ = 0.10, beyond which the re-tokenization becomes unreliable due to significant pixel-level perturbation. Under cropping, maintaining at least 60% of the image area ensures TPR above 85%. These degradation curves provide deployment gu… view at source ↗

read the original abstract

Invisible watermarking for autoregressive (AR) image generation has recently gained attention as a means of protecting image ownership and tracing AI-generated content. However, existing approaches suffer from three key limitations: (1) they embed only zero-bit watermarks for binary verification, lacking the ability to convey multi-bit messages; (2) they rely on static codebook partitioning strategies that are vulnerable to security attacks once the partition is exposed; and (3) they are designed for specific AR architectures, failing to generalize across diverse AR paradigms. We propose \method{}, a training-free, unified watermarking framework for autoregressive image generators that addresses all three limitations. \method{} introduces three core components: \textbf{Adaptive Semantic Grouping (ASG)}, which dynamically partitions codebook entries based on semantic similarity and a secret key, ensuring both image quality preservation and security; \textbf{Block-wise Multi-bit Encoding (BME)}, which divides the token sequence into blocks and encodes different bits across blocks with error-correcting codes for reliable message transmission; and \textbf{a Unified Token-Replacement Interface (UTRI)} that abstracts the watermark embedding process to support both next-token prediction (e.g., LlamaGen) and next-scale prediction (e.g., VAR) paradigms. We provide theoretical analysis on detection error rates and embedding capacity. Extensive experiments on three AR models demonstrate that \method{} achieves state-of-the-art performance in image quality (FID), watermark detection accuracy, and multi-bit message extraction, while maintaining robustness against cropping, JPEG compression, Gaussian noise, blur, color jitter, and random erasing attacks.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

UniMark gives a training-free way to embed multi-bit watermarks across next-token and next-scale AR generators via adaptive grouping and block encoding, but the quality claims depend on how much the semantic replacements shift the original token distributions.

read the letter

UniMark's main advance is a unified, training-free framework that handles multi-bit messages in autoregressive image generators. It uses Adaptive Semantic Grouping to partition the codebook by similarity plus a secret key, Block-wise Multi-bit Encoding with error-correcting codes across token blocks, and a Unified Token-Replacement Interface that works for both next-token models like LlamaGen and next-scale ones like VAR. This directly fixes the three limits called out in prior work: zero-bit only, static vulnerable partitions, and architecture-specific designs. The paper adds theoretical analysis on detection error rates and embedding capacity, then reports experiments on three models with SOTA FID, detection accuracy, and robustness to cropping, JPEG, noise, blur, jitter, and erasing. Those results look like the strongest part if the numbers hold up in the full tables. The soft spot is the central assumption in ASG that similarity-based replacement keeps perceptual quality and security without per-model tuning. The stress-test point about possible distribution shift is fair to raise; if the experiments show only small FID increases and no obvious low-probability token forcing, it lands, but I'd want the exact before-and-after token probability stats and how the secret key entropy was measured. Security against an adversary seeing many outputs also needs the attack simulations to be thorough rather than just listed. This is the kind of paper that fits a reading group on generative model security or provenance tools. It deserves peer review because the problem is practical, the components are clearly described, and the cross-paradigm unification is new enough to get useful referee feedback even if some robustness numbers need tightening.

Referee Report

0 major / 3 minor

Summary. The manuscript proposes UniMark, a training-free, unified adaptive multi-bit watermarking framework for autoregressive image generators. It introduces Adaptive Semantic Grouping (ASG) to dynamically partition the codebook based on semantic similarity and a secret key, Block-wise Multi-bit Encoding (BME) to divide token sequences into blocks and encode bits with error-correcting codes, and a Unified Token-Replacement Interface (UTRI) to support both next-token prediction and next-scale prediction paradigms. The paper includes theoretical analysis on detection error rates and embedding capacity, and reports extensive experiments on three AR models claiming state-of-the-art performance in FID for image quality, watermark detection accuracy, multi-bit message extraction, and robustness to attacks including cropping, JPEG compression, Gaussian noise, blur, color jitter, and random erasing.

Significance. If the central claims hold, this work would be significant for enabling secure, multi-bit watermarking in a variety of autoregressive image generation models without requiring model-specific training or tuning. The combination of semantic grouping for quality preservation and secret-key based security, along with the unified interface, addresses key limitations in the field. The theoretical analysis and broad experimental validation on multiple models and attack types are positive aspects. The skeptic concern about potential token probability degradation from ASG does not land on the basis of the reported SOTA FID results, which indicate that quality is maintained.

minor comments (3)

[Abstract] The abstract claims 'theoretical analysis' and 'extensive experiments' but does not include any specific quantitative results or error rates; moving some key metrics to the abstract would improve accessibility.
[§5] The robustness results would benefit from reporting standard deviations or confidence intervals across multiple generations to demonstrate consistency.
[§3.2] The definition of semantic similarity measure used in ASG should be explicitly stated with reference to the embedding space employed.

Simulated Author's Rebuttal

0 responses · 0 unresolved

We thank the referee for their positive summary and significance assessment of UniMark. We appreciate the recognition that our training-free framework with ASG, BME, and UTRI addresses key limitations in multi-bit watermarking for autoregressive generators, supported by theoretical analysis and experiments across models and attacks. No major comments were provided in the report.

Circularity Check

0 steps flagged

No significant circularity; claims rest on external experiments and stated theoretical analysis

full rationale

The paper introduces UniMark as a training-free framework with three explicitly defined components (ASG for dynamic partitioning, BME for block-wise encoding, UTRI for paradigm abstraction). It states that theoretical analysis is provided on detection error rates and embedding capacity, and that SOTA performance is demonstrated via experiments on three distinct AR models under multiple attacks. No equations, derivations, or self-citations in the abstract or described structure reduce any performance claim, capacity bound, or robustness result to a fitted parameter or input defined by the method itself. The derivation chain remains self-contained because the core claims are positioned as outcomes of the proposed algorithms plus independent validation rather than tautological re-statements of the inputs.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review yields no explicit free parameters, axioms, or invented entities beyond the three named algorithmic components; no numerical constants or post-hoc fits are mentioned.

pith-pipeline@v0.9.0 · 5599 in / 1074 out tokens · 45325 ms · 2026-05-10T15:44:37.356674+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Cost/FunctionalEquation washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Adaptive Semantic Grouping (ASG), which dynamically partitions codebook entries based on semantic similarity and a secret key
IndisputableMonolith/Foundation/RealityFromDistinction reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Block-wise Multi-bit Encoding (BME) ... with error-correcting codes

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

4 extracted references · 4 canonical work pages

[1]

write newline

" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION format.date year duplicate empty "emp...

work page
[2]

@esa (Ref

\@ifxundefined[1] #1\@undefined \@firstoftwo \@secondoftwo \@ifnum[1] #1 \@firstoftwo \@secondoftwo \@ifx[1] #1 \@firstoftwo \@secondoftwo [2] @ #1 \@temptokena #2 #1 @ \@temptokena \@ifclassloaded agu2001 natbib The agu2001 class already includes natbib coding, so you should not add it explicitly Type <Return> for now, but then later remove the command n...

work page
[3]

\@lbibitem[] @bibitem@first@sw\@secondoftwo \@lbibitem[#1]#2 \@extra@b@citeb \@ifundefined br@#2\@extra@b@citeb \@namedef br@#2 \@nameuse br@#2\@extra@b@citeb \@ifundefined b@#2\@extra@b@citeb @num @parse #2 @tmp #1 NAT@b@open@#2 NAT@b@shut@#2 \@ifnum @merge>\@ne @bibitem@first@sw \@firstoftwo \@ifundefined NAT@b*@#2 \@firstoftwo @num @NAT@ctr \@secondoft...

work page
[4]

GATEAU : Selecting Influential Samples for Long Context Alignment

@open @close @open @close and [1] URL: #1 \@ifundefined chapter * \@mkboth \@ifxundefined @sectionbib * \@mkboth * \@mkboth\@gobbletwo \@ifclassloaded amsart * \@ifclassloaded amsbook * \@ifxundefined @heading @heading NAT@ctr thebibliography [1] @ \@biblabel @NAT@ctr \@bibsetup #1 @NAT@ctr @ @openbib .11em \@plus.33em \@minus.07em 4000 4000 `\.\@m @bibit...

work page doi:10.18653/v1/2025.emnlp-main.375 2025

[1] [1]

write newline

" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION format.date year duplicate empty "emp...

work page

[2] [2]

@esa (Ref

\@ifxundefined[1] #1\@undefined \@firstoftwo \@secondoftwo \@ifnum[1] #1 \@firstoftwo \@secondoftwo \@ifx[1] #1 \@firstoftwo \@secondoftwo [2] @ #1 \@temptokena #2 #1 @ \@temptokena \@ifclassloaded agu2001 natbib The agu2001 class already includes natbib coding, so you should not add it explicitly Type <Return> for now, but then later remove the command n...

work page

[3] [3]

\@lbibitem[] @bibitem@first@sw\@secondoftwo \@lbibitem[#1]#2 \@extra@b@citeb \@ifundefined br@#2\@extra@b@citeb \@namedef br@#2 \@nameuse br@#2\@extra@b@citeb \@ifundefined b@#2\@extra@b@citeb @num @parse #2 @tmp #1 NAT@b@open@#2 NAT@b@shut@#2 \@ifnum @merge>\@ne @bibitem@first@sw \@firstoftwo \@ifundefined NAT@b*@#2 \@firstoftwo @num @NAT@ctr \@secondoft...

work page

[4] [4]

GATEAU : Selecting Influential Samples for Long Context Alignment

@open @close @open @close and [1] URL: #1 \@ifundefined chapter * \@mkboth \@ifxundefined @sectionbib * \@mkboth * \@mkboth\@gobbletwo \@ifclassloaded amsart * \@ifclassloaded amsbook * \@ifxundefined @heading @heading NAT@ctr thebibliography [1] @ \@biblabel @NAT@ctr \@bibsetup #1 @NAT@ctr @ @openbib .11em \@plus.33em \@minus.07em 4000 4000 `\.\@m @bibit...

work page doi:10.18653/v1/2025.emnlp-main.375 2025