Simultaneously Self-Attending to All Mentions for Full-Abstract Biological Relation Extraction

Andrew McCallum; Emma Strubell; Patrick Verga

arxiv: 1802.10569 · v1 · pith:GBPFYPLCnew · submitted 2018-02-28 · 💻 cs.CL

Patrick Verga, Emma Strubell, Andrew McCallum

classification 💻 cs.CL

keywords mention, biological, extraction, mentions, pair, relation, across, annotation

Most work in relation extraction forms a prediction by looking at a short span of text within a single sentence containing a single entity pair mention. This approach often does not consider interactions across mentions, requires redundant computation for each mention pair, and ignores relationships expressed across sentence boundaries. These problems are exacerbated by the document- (rather than sentence-) level annotation common in biological text. In response, we propose a model that simultaneously predicts relationships between all mention pairs in a document. We form pairwise predictions over entire paper abstracts using an efficient self-attention encoder. All-pairs mention scores allow us to perform multi-instance learning by aggregating over mentions to form entity pair representations. We further adapt to settings without mention-level annotation by jointly training to predict named entities and adding a corpus of weakly labeled data. In experiments on two BioCreative benchmark datasets, we achieve state-of-the-art performance on the BioCreative V Chemical Disease Relation dataset for models without external KB resources. We also introduce a new dataset an order of magnitude larger than existing human-annotated biological information extraction datasets and more accurate than distantly supervised alternatives.
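
The aggregation step mentioned in the abstract, pooling all-pairs mention scores into entity pair scores, can be illustrated with a minimal sketch. The abstract does not name the pooling function, so log-sum-exp (a smooth maximum often used in multi-instance learning) is an assumption here, as are the hypothetical names `aggregate_entity_pairs`, `mention_scores`, and `mention_entity`, and the simplification to a single relation type.

```python
import math
from collections import defaultdict


def logsumexp(values):
    """Numerically stable log(sum(exp(v))) over a non-empty list."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def aggregate_entity_pairs(mention_scores, mention_entity):
    """Pool mention-pair scores into entity-pair scores.

    mention_scores: dict mapping (i, j) mention-index pairs to a score
                    (assumed to come from some pairwise scorer).
    mention_entity: list mapping each mention index to its entity id.
    Pooling by log-sum-exp is an assumption, not taken from the abstract.
    """
    buckets = defaultdict(list)
    for (i, j), score in mention_scores.items():
        head, tail = mention_entity[i], mention_entity[j]
        if head != tail:  # skip pairs of mentions of the same entity
            buckets[(head, tail)].append(score)
    return {pair: logsumexp(scores) for pair, scores in buckets.items()}


# Toy document: mentions 0 and 2 both refer to "aspirin", mention 1 to "ulcer".
mention_entity = ["aspirin", "ulcer", "aspirin"]
mention_scores = {(0, 1): 2.0, (2, 1): 0.0, (0, 2): 5.0}
entity_scores = aggregate_entity_pairs(mention_scores, mention_entity)
```

In this toy input the two aspirin-ulcer mention pairs are pooled into one entity pair score that is at least as large as the strongest single mention pair, while the aspirin-aspirin pair is dropped.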

Forward citations

Cited by 3 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Relational Probing: LM-to-Graph Adaptation for Financial Prediction
cs.CL 2026-04 unverdicted novelty 6.0

Relational Probing replaces the LM output head with a trainable relation head that induces graphs from hidden states and optimizes them end-to-end for stock trend prediction, showing gains over co-occurrence baselines.

Construct Dynamic Graphs for Hand Gesture Recognition via Spatial-Temporal Attention
cs.CV 2019-07 unverdicted novelty 6.0

DG-STA builds dynamic graphs from hand skeletons, applies spatial-temporal self-attention to learn features, and uses a mask to cut cost by 99%, outperforming prior methods on DHG-14/28 and SHREC'17.

AMI-Net+: A Novel Multi-Instance Neural Network for Medical Diagnosis from Incomplete and Imbalanced Data
cs.LG 2019-07 unverdicted novelty 4.0

AMI-Net+ extends AMI-Net by swapping cross-entropy for focal loss and adding self-adaptive instance-level pooling, then reports better performance than baselines on two real medical datasets.