arxiv: 1804.05922 · v1 · pith:H6GU2KPCnew · submitted 2018-04-16 · 💻 cs.CL · cs.LG

Neural Models for Reasoning over Multiple Mentions using Coreference

Bhuwan Dhingra , Qiao Jin , Zhilin Yang , William W. Cohen , Ruslan Salakhutdinov This is my paper

classification 💻 cs.CL cs.LG

keywords layermentionsbiasedcoreferencedependenciesentitymultipleneural

0 comments

read the original abstract

Many problems in NLP require aggregating information from multiple mentions of the same entity which may be far apart in the text. Existing Recurrent Neural Network (RNN) layers are biased towards short-term dependencies and hence not suited to such tasks. We present a recurrent layer which is instead biased towards coreferent dependencies. The layer uses coreference annotations extracted from an external system to connect entity mentions belonging to the same cluster. Incorporating this layer into a state-of-the-art reading comprehension model improves performance on three datasets -- Wikihop, LAMBADA and the bAbi AI tasks -- with large gains when training data is scarce.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Universal Transformers
cs.CL 2018-07 unverdicted novelty 6.0

Universal Transformers combine Transformer parallelism with recurrent updates and dynamic halting to achieve Turing-completeness under assumptions and outperform standard Transformers on algorithmic and language tasks.
Retrieval-Augmented Generation with Graphs (GraphRAG)
cs.IR 2024-12 unverdicted novelty 5.0

A survey proposing a holistic GraphRAG framework with components including query processor, retriever, organizer, generator, and data source, plus domain-tailored reviews, challenges, and future directions.