Guanhong Tao

Identifiers

No identifiers captured yet.

Papers (4)

  1. A Sentence Relation-Based Approach to Sanitizing Malicious Instructions · cs.CR · 2026 · author #4
  2. How Vulnerable Is My Learned Policy? Universal Adversarial Perturbation Attacks On Modern Behavior Cloning Policies · cs.LG · 2025 · author #3
  3. PromptGuard: Soft Prompt-Guided Unsafe Content Moderation for Text-to-Image Models · cs.CV · 2025 · author #4
  4. Attacks Meet Interpretability: Attribute-steered Detection of Adversarial Samples · cs.LG · 2018 · author #1

Mentions

No mention provenance yet.

Frequent Coauthors