FinEst BERT and CroSloEngual BERT: less is more in multilingual models

Marko Robnik-\v{S}ikonja; Matej Ul\v{c}ar

arxiv: 2006.07890 · v1 · pith:GA2FTDXOnew · submitted 2020-06-14 · 💻 cs.CL

FinEst BERT and CroSloEngual BERT: less is more in multilingual models

Matej Ul\v{c}ar , Marko Robnik-\v{S}ikonja This is my paper

classification 💻 cs.CL

keywords bertmodelsenglishmultilingualcrosloengualfinestlanguagemonolingual

0 comments

read the original abstract

Large pretrained masked language models have become state-of-the-art solutions for many NLP problems. The research has been mostly focused on English language, though. While massively multilingual models exist, studies have shown that monolingual models produce much better results. We train two trilingual BERT-like models, one for Finnish, Estonian, and English, the other for Croatian, Slovenian, and English. We evaluate their performance on several downstream tasks, NER, POS-tagging, and dependency parsing, using the multilingual BERT and XLM-R as baselines. The newly created FinEst BERT and CroSloEngual BERT improve the results on all tasks in most monolingual and cross-lingual situations

This paper has not been read by Pith yet.

FinEst BERT and CroSloEngual BERT: less is more in multilingual models

discussion (0)