arxiv: 1707.07045 · v2 · pith:MMGLA2FYnew · submitted 2017-07-21 · 💻 cs.CL

End-to-end Neural Coreference Resolution

classification 💻 cs.CL
keywords coreference · model · end-to-end · first · mentions · potential · resolution · spans

We introduce the first end-to-end coreference resolution model and show that it significantly outperforms all previous work without using a syntactic parser or hand-engineered mention detector. The key idea is to directly consider all spans in a document as potential mentions and learn distributions over possible antecedents for each. The model computes span embeddings that combine context-dependent boundary representations with a head-finding attention mechanism. It is trained to maximize the marginal likelihood of gold antecedent spans from coreference clusters and is factored to enable aggressive pruning of potential mentions. Experiments demonstrate state-of-the-art performance, with a gain of 1.5 F1 on the OntoNotes benchmark and by 3.1 F1 using a 5-model ensemble, despite the fact that this is the first approach to be successfully trained with no external resources.
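
The two ideas in the abstract, treating every span as a candidate mention and maximizing the marginal likelihood of gold antecedents, can be illustrated with a small sketch. This is not the authors' implementation: the function names are hypothetical, the antecedent scores are taken as given numbers rather than computed from span embeddings, and the "no antecedent" slot at index 0 is an assumption of this sketch.

```python
import math


def enumerate_spans(num_tokens, max_width):
    """Return every (start, end) token span, end inclusive, of at most max_width tokens."""
    return [
        (start, end)
        for start in range(num_tokens)
        for end in range(start, min(start + max_width, num_tokens))
    ]


def log_sum_exp(values):
    """Numerically stable log(sum(exp(v) for v in values))."""
    peak = max(values)
    return peak + math.log(sum(math.exp(v - peak) for v in values))


def marginal_log_likelihood(antecedent_scores, gold_antecedents):
    """Sum over spans of log P(any gold antecedent | span).

    antecedent_scores[i]: scores for span i's candidate antecedents.
        Assumption of this sketch: index 0 is a "no antecedent" slot.
    gold_antecedents[i]: non-empty set of candidate indices that are correct
        for span i (just {0} when the span has no gold antecedent).
    """
    total = 0.0
    for scores, gold in zip(antecedent_scores, gold_antecedents):
        # Softmax normalizer over all candidates for this span.
        log_normalizer = log_sum_exp(scores)
        # Marginalize over every correct antecedent instead of picking one.
        log_gold_mass = log_sum_exp([scores[j] for j in sorted(gold)])
        total += log_gold_mass - log_normalizer
    return total


# A 3-token document with spans of up to 2 tokens yields 5 candidate spans.
spans = enumerate_spans(3, 2)

# One span with two equally scored candidates, only the first being correct.
loss_term = marginal_log_likelihood([[0.0, 0.0]], [{0}])
```

Marginalizing over all gold antecedents means the model is not forced to commit to one particular antecedent within a cluster. Because the number of spans grows quickly with document length, a real system would also need the pruning step the abstract mentions.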

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Bias in Large Language Models: Origin, Evaluation, and Mitigation

    cs.CL · 2024-11 · unverdicted · novelty 2.0

    A literature review that categorizes bias in LLMs, surveys evaluation and mitigation techniques, and discusses ethical implications.