Watermarking pre- trained language models with backdooring

Chenxi Gu, Chengsong Huang, Xiaoqing Zheng, Kai- Wei Chang, Cho-Jui Hsieh · 2022 · arXiv 2210.07543

4 Pith papers cite this work. Polarity classification is still indexing.

4 Pith papers citing it

read on arXiv browse 4 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

SWAN: Semantic Watermarking with Abstract Meaning Representation

cs.CL · 2026-05-05 · unverdicted · novelty 7.0

SWAN uses AMR to embed semantic watermarks that persist through paraphrases, matching SOTA detection on original text and improving AUC by 13.9 points on paraphrased RealNews data.

cs.CR · 2025-08-15 · accept · novelty 7.0

A survey of LLM copyright protection that unifies text watermarking, model watermarking, and model fingerprinting while presenting new coverage of fingerprint transfer and removal.

Perturb and Recover: Fine-tuning for Effective Backdoor Removal from CLIP

cs.LG · 2024-12-01 · conditional · novelty 7.0

PAR fine-tunes CLIP to remove backdoors from structured triggers while preserving standard performance, and works even with only synthetic image-text pairs.

Prompt2Fingerprint: Plug-and-Play LLM Fingerprinting via Text-to-Weight Generation

cs.CR · 2026-05-18 · unverdicted · novelty 6.0

P2F generates low-rank parameter increments for LLM fingerprinting directly from textual descriptions in a single forward pass.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Perturb and Recover: Fine-tuning for Effective Backdoor Removal from CLIP cs.LG · 2024-12-01 · conditional · none · ref 17
PAR fine-tunes CLIP to remove backdoors from structured triggers while preserving standard performance, and works even with only synthetic image-text pairs.

Watermarking pre- trained language models with backdooring

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer