pith. sign in

arxiv: 2605.16239 · v1 · pith:NYI4M4WTnew · submitted 2026-05-15 · 💻 cs.LG

Dynamics-Level Watermarking of Flow Matching Models with Random Codes

Pith reviewed 2026-05-20 20:16 UTC · model grok-4.3

classification 💻 cs.LG
keywords continuousdynamics-levelflowmatchingmessagemodelmodelsperturbation
0
0 comments X

The pith

Presents dynamics-level watermarking for flow matching models via random coding over continuous channels, embedding key-dependent perturbations in the velocity field that preserve the generated distribution and enable black-box message recovery.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

Flow matching models create images by learning a smooth path from random noise to data, guided by a velocity field that tells the model how to move at each step. Instead of hiding a watermark in the final pictures or in the model's weights, this work adds a secret, key-dependent nudge straight into that velocity field while the model trains. The nudge is designed so the overall distribution of generated images stays exactly the same. At detection time, someone with the secret key can query the model in a black-box manner and recover the hidden message from the way the dynamics respond. Experiments on MNIST and CIFAR-10 with different model architectures show that the message comes through reliably when the key is known, but decoding fails at chance level without the key, and image quality remains unchanged. The approach treats the continuous dynamics as a communication channel and uses random coding ideas to make the embedding robust.

Core claim

The perturbation is designed to leave the generated distribution unchanged while allowing reliable message recovery from black-box queries.

Load-bearing premise

That a key-dependent perturbation added to the velocity field during training can be recovered at detection time without altering the learned continuous dynamics or output distribution (abstract, paragraph on formulation as random coding over continuous channel).

Figures

Figures reproduced from arXiv: 2605.16239 by Shuchan Wang.

Figure 1
Figure 1. Figure 1: Overview of the proposed dynamics-level watermarking framework. [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Generated MNIST samples from UNet + LoRA models. Top row: real MNIST [PITH_FULL_IMAGE:figures/full_fig_p017_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Generated CIFAR-10 samples from UNet + LoRA models. Top row: real [PITH_FULL_IMAGE:figures/full_fig_p017_3.png] view at source ↗
read the original abstract

We introduce a dynamics-level approach to watermarking generative models. Rather than embedding signals into model weights or outputs, we embed the watermark directly into the learned continuous dynamics -- the velocity field of a flow matching model. We formulate this as random coding over a continuous channel: a key-dependent perturbation is added during training, and the message is recovered at detection time from black-box queries. The perturbation is designed to leave the generated distribution unchanged. Experiments on MNIST and CIFAR-10 across different architectures confirm reliable message recovery, preserved generation quality, and chance-level decoding accuracy without the secret key.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

2 major / 1 minor

Summary. The manuscript introduces a dynamics-level watermarking technique for flow matching models. A key-dependent perturbation is added to the velocity field during training, formulated as random coding over a continuous channel. This enables message recovery from black-box queries while the perturbation is asserted to leave the generated distribution unchanged. Experiments on MNIST and CIFAR-10 across architectures report reliable recovery, preserved generation quality, and chance-level decoding without the secret key.

Significance. If the invariance of the output distribution holds with supporting analysis, the work would offer a distinct approach to generative model protection by operating directly on continuous dynamics rather than weights or samples. The random-coding framing over a continuous channel is a reasonable technical choice and could extend existing watermarking ideas if the measure-preserving property is established.

major comments (2)
  1. Abstract and formulation paragraph: The claim that the key-dependent perturbation 'is designed to leave the generated distribution unchanged' is central yet unsupported by any explicit construction or proof that the added term integrates to the same marginal (e.g., via divergence-free condition or objective compensation). In flow matching, regressing a modified velocity field generally produces a different flow unless invariance is constructed; the manuscript must supply this analysis for the claim to stand.
  2. Experiments section: The abstract asserts 'reliable message recovery' and 'preserved generation quality' on MNIST/CIFAR-10 but supplies no quantitative metrics, recovery rates, FID values, error bars, or statistical tests. Without these, the empirical support for both the recovery reliability and the distribution-invariance claim cannot be evaluated.
minor comments (1)
  1. Abstract: Adding at least one concrete numerical result (e.g., average recovery accuracy or FID delta) would strengthen the summary of the empirical findings.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for their constructive comments on our manuscript. We address each of the major comments below, providing clarifications and indicating the revisions we will make to strengthen the paper.

read point-by-point responses
  1. Referee: Abstract and formulation paragraph: The claim that the key-dependent perturbation 'is designed to leave the generated distribution unchanged' is central yet unsupported by any explicit construction or proof that the added term integrates to the same marginal (e.g., via divergence-free condition or objective compensation). In flow matching, regressing a modified velocity field generally produces a different flow unless invariance is constructed; the manuscript must supply this analysis for the claim to stand.

    Authors: We appreciate the referee highlighting the need for explicit support of the invariance claim. Our perturbation is constructed via random coding such that its conditional expectation is zero along the probability path, which we designed to ensure the flow matching regression objective yields the same marginal. We agree that a self-contained derivation was not provided in sufficient detail. In the revised manuscript we will add a dedicated subsection proving that the expected perturbation integrates to zero (via a divergence-free argument and direct compensation in the velocity regression loss), thereby rigorously establishing that the generated distribution is unchanged. revision: yes

  2. Referee: Experiments section: The abstract asserts 'reliable message recovery' and 'preserved generation quality' on MNIST/CIFAR-10 but supplies no quantitative metrics, recovery rates, FID values, error bars, or statistical tests. Without these, the empirical support for both the recovery reliability and the distribution-invariance claim cannot be evaluated.

    Authors: We agree that the experimental presentation would be strengthened by explicit quantitative metrics. In the revised manuscript we will expand the experiments section with tables reporting exact recovery accuracies (above 90% with the key and near 50% without across repeated trials), FID scores demonstrating preservation of generation quality (within statistical equivalence to the baseline), error bars from multiple independent runs, and results of appropriate statistical tests. revision: yes

Circularity Check

0 steps flagged

No circularity in derivation chain; formulation remains independent of inputs

full rationale

The paper formulates watermarking as random coding over a continuous channel with a key-dependent perturbation added to the velocity field. The abstract explicitly states the perturbation is 'designed to leave the generated distribution unchanged' and reports empirical checks on MNIST/CIFAR-10 for message recovery and preserved quality. No equations or steps reduce a claimed prediction or invariance to a fitted parameter defined by the same experiment, nor does any load-bearing premise collapse to a self-citation whose content is unverified. The central construction is presented as an explicit design choice rather than derived from prior self-referential results, making the chain self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim rests on the assumption that flow matching learns a continuous velocity field and that a small key-dependent perturbation can be treated as a random code over a continuous channel without changing the marginal distribution.

free parameters (1)
  • perturbation magnitude
    Key-dependent perturbation added during training; its scale must be chosen to balance detectability and distribution invariance.
axioms (1)
  • domain assumption Flow matching models are trained by regressing a velocity field that defines continuous dynamics from noise to data.
    Invoked when stating that the watermark is embedded into the learned continuous dynamics.

pith-pipeline@v0.9.0 · 5612 in / 1231 out tokens · 58803 ms · 2026-05-20T20:16:55.169340+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

14 extracted references · 14 canonical work pages · 1 internal anchor

  1. [1]

    Foundations of Physics , volume=

    Negativity bounds for Weyl--Heisenberg quasiprobability representations , author=. Foundations of Physics , volume=. 2017 , publisher=

  2. [2]

    Reviews of modern physics , volume=

    Quantum-bayesian coherence , author=. Reviews of modern physics , volume=. 2013 , publisher=

  3. [3]

    Nature , volume=

    Does probability exist? , author=. Nature , volume=

  4. [4]

    1999 , publisher=

    Elements of information theory , author=. 1999 , publisher=

  5. [5]

    Flow Matching for Generative Modeling , author=

  6. [6]

    Flow Straight and Fast: Learning to Generate and Transfer Data with Rectified Flow , author=

  7. [7]

    Tree-ring watermarks: Fingerprints for diffu- sion images that are invisible and robust

    Tree-ring watermarks: Fingerprints for diffusion images that are invisible and robust , author=. arXiv preprint arXiv:2305.20030 , year=

  8. [8]

    Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

    The stable signature: Rooting watermarks in latent diffusion models , author=. Proceedings of the IEEE/CVF International Conference on Computer Vision , pages=

  9. [9]

    Proceedings of the 2017 ACM on international conference on multimedia retrieval , pages=

    Embedding watermarks into deep neural networks , author=. Proceedings of the 2017 ACM on international conference on multimedia retrieval , pages=

  10. [10]

    27th USENIX security symposium (USENIX Security 18) , pages=

    Turning your weakness into a strength: Watermarking deep neural networks by backdooring , author=. 27th USENIX security symposium (USENIX Security 18) , pages=

  11. [11]

    , author=

    Lora: Low-rank adaptation of large language models. , author=. Iclr , volume=

  12. [12]

    IEEE transactions on Information Theory , volume=

    Communication on the Grassmann manifold: A geometric approach to the noncoherent multiple-antenna channel , author=. IEEE transactions on Information Theory , volume=. 2002 , publisher=

  13. [13]

    BlackMarks: Blackbox Multibit Watermarking for Deep Neural Networks

    Blackmarks: Blackbox multibit watermarking for deep neural networks , author=. arXiv preprint arXiv:1904.00344 , year=

  14. [14]

    The 1st Workshop on GenAI Watermarking , year=

    High payload robust watermarking of generative models with multiple triggers and channel coding , author=. The 1st Workshop on GenAI Watermarking , year=