pith. sign in

arxiv: 1808.09408 · v1 · pith:6NILX4V2new · submitted 2018-08-28 · 💻 cs.CL

Privacy-preserving Neural Representations of Text

classification 💻 cs.CL
keywords neuralprivacyrepresentationshiddentextattackerinformationrepresentation
0
0 comments X
read the original abstract

This article deals with adversarial attacks towards deep learning systems for Natural Language Processing (NLP), in the context of privacy protection. We study a specific type of attack: an attacker eavesdrops on the hidden representations of a neural text classifier and tries to recover information about the input text. Such scenario may arise in situations when the computation of a neural network is shared across multiple devices, e.g. some hidden representation is computed by a user's device and sent to a cloud-based model. We measure the privacy of a hidden representation by the ability of an attacker to predict accurately specific private information from it and characterize the tradeoff between the privacy and the utility of neural representations. Finally, we propose several defense methods based on modified training objectives and show that they improve the privacy of neural representations.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Look Twice before You Leap: A Rational Framework for Localized Adversarial Anonymization

    cs.CR 2025-12 unverdicted novelty 5.0

    RLAA is a localized adversarial anonymization framework that adds an arbitrator to filter ghost leaks and enforce rational early stopping, yielding superior privacy-utility trade-offs on benchmarks compared to greedy ...