pith. machine review for the scientific record. sign in

arxiv: 1906.01037 · v2 · submitted 2019-06-03 · 💻 cs.CL

Recognition: unknown

Better Character Language Modeling Through Morphology

Authors on Pith no claims yet
classification 💻 cs.CL
keywords languagemodelingdatamorphologicalmorphologyperformancesupervisionacross
0
0 comments X
read the original abstract

We incorporate morphological supervision into character language models (CLMs) via multitasking and show that this addition improves bits-per-character (BPC) performance across 24 languages, even when the morphology data and language modeling data are disjoint. Analyzing the CLMs shows that inflected words benefit more from explicitly modeling morphology than uninflected words, and that morphological supervision improves performance even as the amount of language modeling data grows. We then transfer morphological supervision across languages to improve language modeling performance in the low-resource setting.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.