String Transduction with Target Language Models and Insertion Handling
classification
💻 cs.CL
keywords
targetlanguagemodelstransductionalignmentcharacter-levelcognatecombined
read the original abstract
Many character-level tasks can be framed as sequence-to-sequence transduction, where the target is a word from a natural language. We show that leveraging target language models derived from unannotated target corpora, combined with a precise alignment of the training data, yields state-of-the art results on cognate projection, inflection generation, and phoneme-to-grapheme conversion.
This paper has not been read by Pith yet.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.