Squeezing bottlenecks: exploring the limits of autoencoder semantic representation capabilities

Paolo Rosso; Parth Gupta; Rafael E. Banchs

arxiv: 1402.3070 · v1 · pith:5222CP53new · submitted 2014-02-13 · 💻 cs.IR · cs.LG· stat.ML

Squeezing bottlenecks: exploring the limits of autoencoder semantic representation capabilities

Parth Gupta , Rafael E. Banchs , Paolo Rosso This is my paper

classification 💻 cs.IR cs.LGstat.ML

keywords autoencoderstextcapabilitiesdataproposeassessingattentionautoencoder

0 comments

read the original abstract

We present a comprehensive study on the use of autoencoders for modelling text data, in which (differently from previous studies) we focus our attention on the following issues: i) we explore the suitability of two different models bDA and rsDA for constructing deep autoencoders for text data at the sentence level; ii) we propose and evaluate two novel metrics for better assessing the text-reconstruction capabilities of autoencoders; and iii) we propose an automatic method to find the critical bottleneck dimensionality for text language representations (below which structural information is lost).

This paper has not been read by Pith yet.

Squeezing bottlenecks: exploring the limits of autoencoder semantic representation capabilities

discussion (0)