Probing Multilingual Sentence Representations With X-Probe

Erik Velldal; Lilja {\O}vrelid; Vinit Ravishankar

arxiv: 1906.05061 · v1 · pith:UXG3C2DCnew · submitted 2019-06-12 · 💻 cs.CL

Probing Multilingual Sentence Representations With X-Probe

Vinit Ravishankar , Lilja {\O}vrelid , Erik Velldal This is my paper

classification 💻 cs.CL

keywords representationssentenceenglishmultilingualprobingderivedencoderslanguage

0 comments

read the original abstract

This paper extends the task of probing sentence representations for linguistic insight in a multilingual domain. In doing so, we make two contributions: first, we provide datasets for multilingual probing, derived from Wikipedia, in five languages, viz. English, French, German, Spanish and Russian. Second, we evaluate six sentence encoders for each language, each trained by mapping sentence representations to English sentence representations, using sentences in a parallel corpus. We discover that cross-lingually mapped representations are often better at retaining certain linguistic information than representations derived from English encoders trained on natural language inference (NLI) as a downstream task.

This paper has not been read by Pith yet.

Probing Multilingual Sentence Representations With X-Probe

discussion (0)