pith. sign in

arxiv: 1906.05447 · v1 · pith:FOIQEF5Lnew · submitted 2019-06-11 · 💻 cs.CL

Cued@wmt19:ewc&lms

classification 💻 cs.CL
keywords cuedgainsgramtransformerwmt19architectureaveragingbaselines
0
0 comments X
read the original abstract

Two techniques provide the fabric of the Cambridge University Engineering Department's (CUED) entry to the WMT19 evaluation campaign: elastic weight consolidation (EWC) and different forms of language modelling (LMs). We report substantial gains by fine-tuning very strong baselines on former WMT test sets using a combination of checkpoint averaging and EWC. A sentence-level Transformer LM and a document-level LM based on a modified Transformer architecture yield further gains. As in previous years, we also extract $n$-gram probabilities from SMT lattices which can be seen as a source-conditioned $n$-gram LM.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.