Efficient Parallel Learning of Word2Vec

Arjen P. de Vries; Carsten Eickhoff; Jeroen B.P. Vuurens

arxiv: 1606.07822 · v1 · pith:RVPBJJA2new · submitted 2016-06-24 · 💻 cs.CL · cs.DC

Efficient Parallel Learning of Word2Vec

Jeroen B.P. Vuurens , Carsten Eickhoff , Arjen P. de Vries This is my paper

classification 💻 cs.CL cs.DC

keywords parallelcollisionsefficiencylearnlearningmemoryusedword2vec

0 comments

read the original abstract

Since its introduction, Word2Vec and its variants are widely used to learn semantics-preserving representations of words or entities in an embedding space, which can be used to produce state-of-art results for various Natural Language Processing tasks. Existing implementations aim to learn efficiently by running multiple threads in parallel while operating on a single model in shared memory, ignoring incidental memory update collisions. We show that these collisions can degrade the efficiency of parallel learning, and propose a straightforward caching strategy that improves the efficiency by a factor of 4.

This paper has not been read by Pith yet.

Efficient Parallel Learning of Word2Vec

discussion (0)