Watermarking pre- trained language models with backdooring

Chenxi Gu, Chengsong Huang, Xiaoqing Zheng, Kai- Wei Chang, Cho-Jui Hsieh · 2022 · arXiv 2210.07543

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

read on arXiv browse 4 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

SWAN: Semantic Watermarking with Abstract Meaning Representation

cs.CL · 2026-05-05 · unverdicted · novelty 7.0

SWAN uses AMR to embed semantic watermarks that persist through paraphrases, matching SOTA detection on original text and improving AUC by 13.9 points on paraphrased RealNews data.

cs.CR · 2025-08-15 · accept · novelty 7.0

A survey of LLM copyright protection that unifies text watermarking, model watermarking, and model fingerprinting while presenting new coverage of fingerprint transfer and removal.

Perturb and Recover: Fine-tuning for Effective Backdoor Removal from CLIP

cs.LG · 2024-12-01 · conditional · novelty 7.0

PAR fine-tunes CLIP to remove backdoors from structured triggers while preserving standard performance, and works even with only synthetic image-text pairs.

Prompt2Fingerprint: Plug-and-Play LLM Fingerprinting via Text-to-Weight Generation

cs.CR · 2026-05-18 · unverdicted · novelty 6.0

P2F generates low-rank parameter increments for LLM fingerprinting directly from textual descriptions in a single forward pass.

citing papers explorer

Showing 4 of 4 citing papers.

SWAN: Semantic Watermarking with Abstract Meaning Representation cs.CL · 2026-05-05 · unverdicted · none · ref 45
SWAN uses AMR to embed semantic watermarks that persist through paraphrases, matching SOTA detection on original text and improving AUC by 13.9 points on paraphrased RealNews data.
Copyright Protection for Large Language Models: A Survey of Methods, Challenges, and Trends cs.CR · 2025-08-15 · accept · none · ref 49
A survey of LLM copyright protection that unifies text watermarking, model watermarking, and model fingerprinting while presenting new coverage of fingerprint transfer and removal.
Perturb and Recover: Fine-tuning for Effective Backdoor Removal from CLIP cs.LG · 2024-12-01 · conditional · none · ref 17
PAR fine-tunes CLIP to remove backdoors from structured triggers while preserving standard performance, and works even with only synthetic image-text pairs.
Prompt2Fingerprint: Plug-and-Play LLM Fingerprinting via Text-to-Weight Generation cs.CR · 2026-05-18 · unverdicted · none · ref 4
P2F generates low-rank parameter increments for LLM fingerprinting directly from textual descriptions in a single forward pass.

Watermarking pre- trained language models with backdooring

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer