A Fast Unified Model for Parsing and Sentence Understanding

Abhinav Rastogi; Christopher D. Manning; Christopher Potts; Jon Gauthier; Raghav Gupta; Samuel R. Bowman

arxiv: 1603.06021 · v3 · pith:X2OP5AUKnew · submitted 2016-03-19 · 💻 cs.CL

A Fast Unified Model for Parsing and Sentence Understanding

Samuel R. Bowman , Jon Gauthier , Abhinav Rastogi , Raghav Gupta , Christopher D. Manning , Christopher Potts This is my paper

classification 💻 cs.CL

keywords theymodeltree-structuredbatchedcomputationinterpretationmodelsneural

0 comments

read the original abstract

Tree-structured neural networks exploit valuable syntactic parse information as they interpret the meanings of sentences. However, they suffer from two key technical problems that make them slow and unwieldy for large-scale NLP tasks: they usually operate on parsed sentences and they do not directly support batched computation. We address these issues by introducing the Stack-augmented Parser-Interpreter Neural Network (SPINN), which combines parsing and interpretation within a single tree-sequence hybrid model by integrating tree-structured sentence interpretation into the linear sequential structure of a shift-reduce parser. Our model supports batched computation for a speedup of up to 25 times over other tree-structured models, and its integrated parser can operate on unparsed data with little loss in accuracy. We evaluate it on the Stanford NLI entailment task and show that it significantly outperforms other sentence-encoding models.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

DropAttention: A Regularization Method for Fully-Connected Self-Attention Networks
cs.CL 2019-07 unverdicted novelty 6.0

DropAttention regularizes attention weights in fully-connected self-attention networks to reduce overfitting and improve performance.