Grammar as a Foreign Language

Geoffrey Hinton; Ilya Sutskever; Lukasz Kaiser; Oriol Vinyals; Slav Petrov; Terry Koo

arxiv: 1412.7449 · v3 · pith:ZIYS5PMHnew · submitted 2014-12-23 · 💻 cs.CL · cs.LG· stat.ML

Grammar as a Foreign Language

Oriol Vinyals , Lukasz Kaiser , Terry Koo , Slav Petrov , Ilya Sutskever , Geoffrey Hinton This is my paper

classification 💻 cs.CL cs.LGstat.ML

keywords parsersconstituencydatasetdomainlanguagemodelparsingprocessing

0 comments

read the original abstract

Syntactic constituency parsing is a fundamental problem in natural language processing and has been the subject of intensive research and engineering for decades. As a result, the most accurate parsers are domain specific, complex, and inefficient. In this paper we show that the domain agnostic attention-enhanced sequence-to-sequence model achieves state-of-the-art results on the most widely used syntactic constituency parsing dataset, when trained on a large synthetic corpus that was annotated using existing parsers. It also matches the performance of standard parsers when trained only on a small human-annotated dataset, which shows that this model is highly data-efficient, in contrast to sequence-to-sequence models without the attention mechanism. Our parser is also fast, processing over a hundred sentences per second with an unoptimized CPU implementation.

This paper has not been read by Pith yet.

Grammar as a Foreign Language

discussion (0)