pith. machine review for the scientific record. sign in

arxiv: 1805.03435 · v1 · submitted 2018-05-09 · 💻 cs.AI · cs.CL· cs.LG

Recognition: unknown

Decoding Decoders: Finding Optimal Representation Spaces for Unsupervised Similarity Tasks

Authors on Pith no claims yet
classification 💻 cs.AI cs.CLcs.LG
keywords modelssimilaritytasksclosedeepoptimalrepresentationrepresentations
0
0 comments X
read the original abstract

Experimental evidence indicates that simple models outperform complex deep networks on many unsupervised similarity tasks. We provide a simple yet rigorous explanation for this behaviour by introducing the concept of an optimal representation space, in which semantically close symbols are mapped to representations that are close under a similarity measure induced by the model's objective function. In addition, we present a straightforward procedure that, without any retraining or architectural modifications, allows deep recurrent models to perform equally well (and sometimes better) when compared to shallow models. To validate our analysis, we conduct a set of consistent empirical evaluations and introduce several new sentence embedding models in the process. Even though this work is presented within the context of natural language processing, the insights are readily applicable to other domains that rely on distributed representations for transfer tasks.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.