O’Reilly Media, Inc
2 Pith papers cite this work.
Citing papers by year: 2026 (2)
Representative citing papers:
- Robust LLM Watermarking with Minimal Semantic Distortion for IP Protection
  SAFESEAL is a key-conditioned LLM watermarking framework using tournament sampling for synonym substitution and a contrastive detector that reports 98.2% detection, 0.983 BERTScore, and 0.963 entity similarity while claiming robustness to attacks.
- Prefix Teach, Suffix Fade: Local Teachability Collapse in Strong-to-Weak On-Policy Distillation