pith. machine review for the scientific record. sign in

arxiv: 1508.01420 · v1 · submitted 2015-08-06 · 💻 cs.IR · cs.CL· cs.CR

Recognition: unknown

Privacy-Preserving Multi-Document Summarization

Authors on Pith no claims yet
classification 💻 cs.IR cs.CLcs.CR
keywords multi-documentdocumentssummarizationapproachpartiesprivacy-preservingwithoutallowing
0
0 comments X
read the original abstract

State-of-the-art extractive multi-document summarization systems are usually designed without any concern about privacy issues, meaning that all documents are open to third parties. In this paper we propose a privacy-preserving approach to multi-document summarization. Our approach enables other parties to obtain summaries without learning anything else about the original documents' content. We use a hashing scheme known as Secure Binary Embeddings to convert documents representation containing key phrases and bag-of-words into bit strings, allowing the computation of approximate distances, instead of exact ones. Our experiments indicate that our system yields similar results to its non-private counterpart on standard multi-document evaluation datasets.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.