SEAL: Semantic Aware Image Watermarking
Pith reviewed 2026-05-23 00:08 UTC · model grok-4.3
The pith
Embedding semantic information into watermarks allows verification without a key database and improves robustness to forgery attacks on generated images.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
By embedding semantic information about the generated image directly into the watermark, the key pattern can be inferred from the semantic embedding using locality-sensitive hashing, allowing verification without a database of key patterns, and conditioning detection on image content improves robustness against forgery attacks such as noise extraction and object insertion.
What carries the argument
Semantic embedding of the image combined with locality-sensitive hashing to derive the watermark key pattern without database lookup.
If this is right
- Watermark detection becomes possible without searching through dictionaries of used keys.
- Verification is distortion-free and tied directly to the image content.
- Robustness increases against attackers who extract initial noise to create new images with the same pattern.
- Greater resistance to insertion of unrelated objects while attempting to preserve the watermark.
Where Pith is reading between the lines
- The method may scale to watermarking at very large volumes of generated images by removing database requirements.
- It could be tested on non-diffusion generative models to check if semantic conditioning transfers.
- Combining it with existing perceptual quality metrics might further reduce any residual distortion risks.
Load-bearing premise
Locality-sensitive hashing on the semantic embedding will reliably recover the correct key pattern without false positives or negatives.
What would settle it
A demonstration that locality-sensitive hashing on the semantic embedding produces a collision or mismatch leading to failed key recovery on a watermarked image, or that an attacker can successfully extract noise or insert an object to forge without detection.
Figures
read the original abstract
Generative models have rapidly evolved to generate realistic outputs. However, their synthetic outputs increasingly challenge the clear distinction between natural and AI-generated content, necessitating robust watermarking techniques. Watermarks are typically expected to preserve the integrity of the target image, withstand removal attempts, and prevent unauthorized replication onto unrelated images. To address this need, recent methods embed persistent watermarks into images produced by diffusion models using the initial noise. Yet, to do so, they either distort the distribution of generated images or rely on searching through a long dictionary of used keys for detection. In this paper, we propose a novel watermarking method that embeds semantic information about the generated image directly into the watermark, enabling a distortion-free watermark that can be verified without requiring a database of key patterns. Instead, the key pattern can be inferred from the semantic embedding of the image using locality-sensitive hashing. Furthermore, conditioning the watermark detection on the original image content improves robustness against forgery attacks. To demonstrate that, we consider two largely overlooked attack strategies: (i) an attacker extracting the initial noise and generating a novel image with the same pattern; (ii) an attacker inserting an unrelated (potentially harmful) object into a watermarked image, possibly while preserving the watermark. We empirically validate our method's increased robustness to these attacks. Taken together, our results suggest that content-aware watermarks can mitigate risks arising from image-generative models.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes SEAL, a semantic-aware watermarking method for diffusion-generated images. It embeds semantic information about the image content into the watermark via locality-sensitive hashing (LSH) on semantic embeddings, enabling database-free detection by inferring the key pattern from the image itself. The central claim is that conditioning watermark detection on the original image content improves robustness against two forgery attacks: (i) extracting the initial noise to generate a new image using the same pattern, and (ii) inserting an unrelated object into a watermarked image (possibly while preserving the watermark). The abstract states that the method is distortion-free and that empirical validation demonstrates increased robustness to these attacks.
Significance. If the robustness claims hold after proper quantification and stability analysis, the approach would offer a meaningful contribution to watermarking for generative models by removing the need for key-pattern databases while addressing specific forgery vectors that exploit noise reuse or semantic edits.
major comments (3)
- [Abstract] Abstract: the claim of 'empirical validation' of increased robustness to the two attack strategies provides no quantitative results, metrics, error bars, dataset details, or baseline comparisons. This absence is load-bearing for the central robustness claim.
- [Abstract] Abstract (attacks paragraph): the robustness claim requires that LSH on the semantic embedding recovers the exact key used at embedding time. Attack (ii) explicitly perturbs semantics via unrelated object insertion, which changes the embedding and LSH bucket; no bound on collision probability, false-negative rate for legitimate images, or embedding stability under these edits is supplied.
- [Abstract] Abstract: attack (i) re-uses the key pattern on a different image whose semantics differ from the original; the manuscript supplies no analysis showing that LSH still matches the original key or that false-positive rates across unrelated images remain negligible.
minor comments (1)
- [Abstract] The abstract refers to 'two largely overlooked attack strategies' without citing prior literature on noise-extraction or object-insertion attacks to substantiate the claim of oversight.
Simulated Author's Rebuttal
We thank the referee for the constructive feedback on our manuscript. We address each major comment point by point below, clarifying the presentation in the abstract and the nature of our empirical and analytical support for the robustness claims.
read point-by-point responses
-
Referee: [Abstract] Abstract: the claim of 'empirical validation' of increased robustness to the two attack strategies provides no quantitative results, metrics, error bars, dataset details, or baseline comparisons. This absence is load-bearing for the central robustness claim.
Authors: The abstract serves as a concise summary and therefore omits specific numerical results. The full manuscript presents the empirical validation in Section 4, including quantitative metrics, error bars, dataset details, and baseline comparisons. We will revise the abstract to incorporate a brief reference to these key quantitative outcomes. revision: yes
-
Referee: [Abstract] Abstract (attacks paragraph): the robustness claim requires that LSH on the semantic embedding recovers the exact key used at embedding time. Attack (ii) explicitly perturbs semantics via unrelated object insertion, which changes the embedding and LSH bucket; no bound on collision probability, false-negative rate for legitimate images, or embedding stability under these edits is supplied.
Authors: We acknowledge that the manuscript does not supply theoretical bounds on collision probability or false-negative rates. Our contribution centers on empirical demonstration of LSH stability under the specified semantic edits, as shown in the experiments. We will add a dedicated discussion paragraph on LSH collision properties and observed stability in the revised version. revision: yes
-
Referee: [Abstract] Abstract: attack (i) re-uses the key pattern on a different image whose semantics differ from the original; the manuscript supplies no analysis showing that LSH still matches the original key or that false-positive rates across unrelated images remain negligible.
Authors: The manuscript provides empirical results demonstrating successful key recovery via LSH for attack (i) and reports false-positive behavior on unrelated images. We agree that an expanded explicit analysis of these rates would strengthen the presentation and will include it in the revised manuscript. revision: yes
Circularity Check
No circularity: method uses standard LSH on external semantic embeddings
full rationale
The paper proposes embedding semantic information via LSH on image embeddings for database-free detection and claims improved robustness to two forgery attacks. This relies on established LSH properties and semantic embedding models from prior literature, with empirical validation rather than any derivation that reduces to the authors' own inputs by construction. No self-definitional equations, fitted parameters renamed as predictions, or load-bearing self-citations appear in the abstract or description. The central claim remains independent of the paper's own fitted values.
Axiom & Free-Parameter Ledger
Forward citations
Cited by 3 Pith papers
-
PGID: Progressive Guided Inversion and Denoising for Robust Watermark Detection
PGID restores watermark detection in diffusion models by using progressive inversion-denoising cycles to correct latents displaced by removal or forgery attacks.
-
Removing the Watermark Is Not Enough: Forensic Stealth in Generative-AI Watermark Removal
Current AI image watermark removal attacks replace the watermark with a different forensic signal, allowing independent detectors to distinguish processed outputs from clean images at over 98% true-positive rate under...
-
Dual-Guard: Dual-Channel Latent Watermarking for Provenance and Tamper Localization in Diffusion Images
Dual-Guard embeds complementary watermarks in diffusion image generation to verify provenance and localize tampering with low error rates on a 2400-sample benchmark under reprompting and editing attacks.
Reference graph
Works this paper leans on
-
[1]
Combined dwt-dct digital image watermarking
Ali Al-Haj. Combined dwt-dct digital image watermarking. Journal of computer science, 3(9):740–746, 2007. 1
work page 2007
-
[2]
Waves: Bench- marking the robustness of image watermarks
Bang An, Mucong Ding, Tahseen Rabbani, Aakriti Agrawal, Yuancheng Xu, Chenghao Deng, Sicheng Zhu, Abdirisak Mohamed, Yuxin Wen, Tom Goldstein, et al. Waves: Bench- marking the robustness of image watermarks. arXiv preprint arXiv:2401.08573, 2024. 1, 2
-
[3]
Near-optimal hashing algo- rithms for approximate nearest neighbor in high dimensions
Alexandr Andoni and Piotr Indyk. Near-optimal hashing algo- rithms for approximate nearest neighbor in high dimensions. Communications of the ACM, 51(1):117–122, 2008. 3
work page 2008
-
[4]
Hidden in the noise: Two-stage robust water- marking for images
Kasra Arabi, Benjamin Feuer, R Teal Witter, Chinmay Hegde, and Niv Cohen. Hidden in the noise: Two-stage robust water- marking for images. arXiv preprint arXiv:2412.04653, 2024. 1, 2, 7, 8, 3
-
[5]
Ty- pology of risks of generative text-to-image models
Charlotte Bird, Eddie Ungless, and Atoosa Kasirzadeh. Ty- pology of risks of generative text-to-image models. In Pro- ceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society, pages 396–410, 2023. 1, 3
work page 2023
-
[6]
Trustmark: Universal watermarking for arbitrary resolution images, 2023
Tu Bui, Shruti Agarwal, and John Collomosse. Trustmark: Universal watermarking for arbitrary resolution images, 2023. 4
work page 2023
-
[7]
Similarity estimation techniques from rounding algorithms
Moses S Charikar. Similarity estimation techniques from rounding algorithms. In Proceedings of the thiry-fourth an- nual ACM symposium on Theory of computing, pages 380– 388, 2002. 5, 1
work page 2002
-
[8]
Ringid: Rethinking tree-ring watermarking for enhanced multi-key identification
Hai Ci, Pei Yang, Yiren Song, and Mike Zheng Shou. Ringid: Rethinking tree-ring watermarking for enhanced multi-key identification. In European Conference on Computer Vision, pages 338–354. Springer, 2024. 2, 7
work page 2024
-
[9]
Locality-sensitive hashing scheme based on p- stable distributions
Mayur Datar, Nicole Immorlica, Piotr Indyk, and Vahab S Mirrokni. Locality-sensitive hashing scheme based on p- stable distributions. In Proceedings of the twentieth an- nual symposium on Computational geometry, pages 253–262,
-
[10]
Swift: Semantic watermarking for image forgery thwarting
Gautier Evennou, Vivien Chappelier, Ewa Kijak, and Teddy Furon. Swift: Semantic watermarking for image forgery thwarting. In 2024 IEEE International Workshop on Informa- tion Forensics and Security (WIFS), pages 1–6. IEEE, 2024. 4
work page 2024
-
[11]
Watermarking images in self-supervised latent spaces
Pierre Fernandez, Alexandre Sablayrolles, Teddy Furon, Herv´e J ´egou, and Matthijs Douze. Watermarking images in self-supervised latent spaces. In ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 3054–3058. IEEE, 2022. 1
work page 2022
-
[12]
The stable signature: Rooting watermarks in latent diffusion models
Pierre Fernandez, Guillaume Couairon, Herv´e J´egou, Matthijs Douze, and Teddy Furon. The stable signature: Rooting watermarks in latent diffusion models. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 22466–22477, 2023. 1, 2, 4
work page 2023
-
[13]
Simi- larity search in high dimensions via hashing
Aristides Gionis, Piotr Indyk, Rajeev Motwani, et al. Simi- larity search in high dimensions via hashing. In Vldb, pages 518–529, 1999. 2
work page 1999
-
[14]
An unde- tectable watermark for generative image models
Sam Gunn, Xuandong Zhao, and Dawn Song. An unde- tectable watermark for generative image models. arXiv preprint arXiv:2410.07369, 2024. 2, 8, 3
-
[15]
Approximate nearest neigh- bors: towards removing the curse of dimensionality
Piotr Indyk and Rajeev Motwani. Approximate nearest neigh- bors: towards removing the curse of dimensionality. In Pro- ceedings of the thirtieth annual ACM symposium on Theory of computing, pages 604–613, 1998. 2
work page 1998
-
[16]
Misinformation, disinformation, and generative ai: Implications for perception and policy
Kokil Jaidka, Tsuhan Chen, Simon Chesterman, Wynne Hsu, Min-Yen Kan, Mohan Kankanhalli, Mong Li Lee, Gyula Seres, Terence Sim, Araz Taeihagh, et al. Misinformation, disinformation, and generative ai: Implications for perception and policy. Digital Government: Research and Practice, 6 (1):1–15, 2025. 1
work page 2025
-
[17]
Forging and removing latent-noise dif- fusion watermarks using a single image
Anubhav Jain, Yuya Kobayashi, Naoki Murata, Yuhta Takida, Takashi Shibuya, Yuki Mitsufuji, Niv Cohen, Nasir Memon, and Julian Togelius. Forging and removing latent-noise dif- fusion watermarks using a single image. arXiv preprint arXiv:2504.20111, 2025. 2, 7
-
[18]
Changhoon Kim, Kyle Min, Maitreya Patel, Sheng Cheng, and Yezhou Yang. Wouaf: Weight modulation for user attri- bution and fingerprinting in text-to-image diffusion models,
-
[19]
Junnan Li, Dongxu Li, Silvio Savarese, and Steven Hoi. Blip- 2: Bootstrapping language-image pre-training with frozen image encoders and large language models. In International conference on machine learning, pages 19730–19742. PMLR,
-
[20]
Microsoft coco: Common objects in context
Tsung-Yi Lin, Michael Maire, Serge Belongie, James Hays, Pietro Perona, Deva Ramanan, Piotr Doll´ar, and C Lawrence Zitnick. Microsoft coco: Common objects in context. In Computer vision–ECCV 2014: 13th European conference, zurich, Switzerland, September 6-12, 2014, proceedings, part v 13, pages 740–755. Springer, 2014. 5
work page 2014
-
[21]
Prompting hard or hardly prompting: Prompt inversion for text-to-image diffusion models
Shweta Mahajan, Tanzila Rahman, Kwang Moo Yi, and Leonid Sigal. Prompting hard or hardly prompting: Prompt inversion for text-to-image diffusion models. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 6808–6817, 2024. 3
work page 2024
-
[22]
Black-box forgery attacks on seman- tic watermarks for diffusion models
Andreas M¨uller, Denis Lukovnikov, Jonas Thietke, Asja Fis- cher, and Erwin Quiring. Black-box forgery attacks on seman- tic watermarks for diffusion models. In Proceedings of the Computer Vision and Pattern Recognition Conference, pages 20937–20946, 2025. 2, 7
work page 2025
-
[23]
Dwt-dct-svd based watermark- ing
KA Navas, Mathews Cheriyan Ajay, M Lekshmi, Tampy S Archana, and M Sasikumar. Dwt-dct-svd based watermark- ing. In 2008 3rd international conference on communica- tion systems software and middleware and workshops (COM- SWARE’08), pages 271–274. IEEE, 2008. 1
work page 2008
-
[24]
Sentence-bert: Sentence embeddings using siamese bert-networks
Nils Reimers and Iryna Gurevych. Sentence-bert: Sentence embeddings using siamese bert-networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Lan- guage Processing. Association for Computational Linguistics,
work page 2019
-
[25]
High-resolution image synthesis with latent diffusion models
Robin Rombach, Andreas Blattmann, Dominik Lorenz, Patrick Esser, and Bj ¨orn Ommer. High-resolution image synthesis with latent diffusion models. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 10684–10695, 2022. 7
work page 2022
-
[26]
Watermark anything with localized messages
Tom Sander, Pierre Fernandez, Alain Durmus, Teddy Furon, and Matthijs Douze. Watermark anything with localized messages. arXiv preprint arXiv:2411.07231, 2024. 2, 7 9
-
[27]
Gustavo Santana. Stable-diffusion-prompts. https : / / huggingface . co / datasets / Gustavosta / Stable-Diffusion-Prompts, 2024. Accessed: 2024- 11-20. 7, 5
work page 2024
-
[28]
arXiv preprint arXiv:2306.05949 , year=
Irene Solaiman, Zeerak Talat, William Agnew, Lama Ahmad, Dylan Baker, Su Lin Blodgett, Canyu Chen, Hal Daum´e III, Jesse Dodge, Isabella Duan, et al. Evaluating the social impact of generative ai systems in systems and society. arXiv preprint arXiv:2306.05949, 2023. 1
-
[29]
Denoising Diffusion Implicit Models
Jiaming Song, Chenlin Meng, and Stefano Ermon. Denoising diffusion implicit models. arXiv preprint arXiv:2010.02502,
work page internal anchor Pith review Pith/arXiv arXiv 2010
-
[30]
Stegastamp: Invisible hyperlinks in physical photographs
Matthew Tancik, Ben Mildenhall, and Ren Ng. Stegastamp: Invisible hyperlinks in physical photographs. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 2117–2126, 2020. 1
work page 2020
-
[31]
Tree-ring watermarks: Fingerprints for diffu- sion images that are invisible and robust
Yuxin Wen, John Kirchenbauer, Jonas Geiping, and Tom Goldstein. Tree-ring watermarks: Fingerprints for diffu- sion images that are invisible and robust. arXiv preprint arXiv:2305.20030, 2023. 1, 2, 7, 8
-
[32]
Pei Yang, Hai Ci, Yiren Song, and Mike Zheng Shou. Can simple averaging defeat modern watermarks? Advances in Neural Information Processing Systems , 37:56644–56673,
-
[33]
Pei Yang, Hai Ci, Yiren Song, and Mike Zheng Shou. Ste- ganalysis on digital watermarking: Is your defense truly im- pervious? arXiv preprint arXiv:2406.09026, 2024. 8
-
[34]
Gaussian shading: Provable performance-lossless image watermarking for diffusion mod- els
Zijin Yang, Kai Zeng, Kejiang Chen, Han Fang, Weim- ing Zhang, and Nenghai Yu. Gaussian shading: Provable performance-lossless image watermarking for diffusion mod- els. In Proceedings of the IEEE/CVF Conference on Com- puter Vision and Pattern Recognition, pages 12162–12171,
-
[35]
Editguard: Versatile image watermarking for tamper localization and copyright protection
Xuanyu Zhang, Runyi Li, Jiwen Yu, Youmin Xu, Weiqi Li, and Jian Zhang. Editguard: Versatile image watermarking for tamper localization and copyright protection. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 11964–11974, 2024. 2, 7
work page 2024
-
[36]
Invisible image watermarks are provably removable using generative ai
Xuandong Zhao, Kexun Zhang, Zihao Su, Saastha Vasan, Ilya Grishchenko, Christopher Kruegel, Giovanni Vigna, Yu- Xiang Wang, and Lei Li. Invisible image watermarks are provably removable using generative ai. Advances in Neural Information Processing Systems, 37:8643–8672, 2025. 1, 2, 8, 4
work page 2025
-
[37]
Hidden: Hiding data with deep networks
Jiren Zhu, Russell Kaplan, Justin Johnson, and Li Fei-Fei. Hidden: Hiding data with deep networks. In Proceedings of the European conference on computer vision (ECCV), pages 657–672, 2018. 1, 4 10 SEAL: Semantic Aware Image Watermarking Supplementary Material Clean Rotate JPEG C&S Blur Noise Bright Image Transformation 0.0 0.2 0.4 0.6 0.8 1.0 Detection Ac...
work page 2018
-
[38]
Additional Related Works Post-Processing Methods. Post-processing techniques embed watermarks after the image generation stage, provid- ing model-agnostic flexibility at the cost of potential quality degradation. Frequency-domain methods, such as methods using the Discrete Wavelet Transform (DWT) and Discrete Cosine Transform (DCT) [1, 23], embed watermar...
-
[39]
Proof of Lemma 3.2 Proof of Lemma 3.2. The angle between the original seman- tic vector v used to generate the watermark and extracted semantic vector ˜v of the suspect image is θ(v, ˜v) = cos−1 ⟨v, ˜v⟩ ∥v∥2∥˜v∥2 ∈ [−90◦, 90◦]. By the property of SimHash 1 and Assumption 3.1, the 1See Section 3 of [7] for details on why Pr r∼N (0,I) (sign(⟨v, r⟩) = sign(⟨...
-
[40]
Implementation Details 9.1. Key Parameters Unless otherwise stated, the results are reported with the following parameters: number of patch matching threshold nmatch = 12; patch-wise matching threshold τ = 2.3; num- ber of projection per noise patch: b = 7; number of noise patches k = 1024. All parameters were chosen to optimize the overall performance. 9...
work page 2000
-
[41]
The results are presented in Figure 7
Ablation of Number of Patches and Bits To investigate the impact of the number of patches (n) and the number of bits (b) used to generate the initial noise, we conducted an exhaustive ablation study across various parameter combinations. The results are presented in Figure 7
-
[42]
Resilience to Latent Forgery Attacks We evaluate SEAL under the Latent Forgery Attack [17]. This attack aims to adversarially perturb non-watermarked images such that they appear watermarked by mimicking the latent representation of an originally watermarked image. This type of attacks assumes access to at least one watermarked image and attempts to shift...
-
[43]
Additional Limitations and Discussion Distortion-Free Property for Sets of Images Our watermarking scheme securely generates the noise for each patch from a normal distribution, ensuring that each individual noise is distributed from a normal distribution. However, multiple watermarked images corresponding to related prompts may leak information about the...
-
[44]
Additional Experiments 13.1. CatAttack Performance vs. Object Scale We varied the size of the pasted object in the CatAttack from 10% to 40% of the image area and evaluated detection performance at each scale. Table 4 reports the ROC-AUC (%) for each object scale, showing a gradual improvement from 95.4% at a 10% scale to 98.0% at a 40% scale. Table 4. Ca...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.