pith. sign in

arxiv: 1809.07615 · v1 · pith:CRTMREVRnew · submitted 2018-09-20 · 💻 cs.CL

Lessons learned in multilingual grounded language learning

classification 💻 cs.CL
keywords languagemultilingualtraininggroundedlanguageslearningmodeladditional
0
0 comments X
read the original abstract

Recent work has shown how to learn better visual-semantic embeddings by leveraging image descriptions in more than one language. Here, we investigate in detail which conditions affect the performance of this type of grounded language learning model. We show that multilingual training improves over bilingual training, and that low-resource languages benefit from training with higher-resource languages. We demonstrate that a multilingual model can be trained equally well on either translations or comparable sentence pairs, and that annotating the same set of images in multiple language enables further improvements via an additional caption-caption ranking objective.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.