arxiv: 1808.08622 · v1 · pith:RTGESAQMnew · submitted 2018-08-26 · 💻 cs.CL

Semi-Supervised Event Extraction with Paraphrase Clusters

classification 💻 cs.CL
keywords: event, extraction, mentions, multiple, training, data, systems, accuracy

Supervised event extraction systems are limited in accuracy by the lack of available training data. We present a method for self-training event extraction systems by bootstrapping additional training data. The method takes advantage of multiple mentions of the same event instance occurring across newswire articles from multiple sources. If our system can make a high-confidence extraction of some mentions in such a cluster, it can then acquire diverse training examples by adding the other mentions as well. Our experiments show significant performance improvements on multiple event extractors over the ACE 2005 and TAC-KBP 2015 datasets.
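
The bootstrapping idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `bootstrap_examples`, the 0.9 confidence threshold, the toy keyword extractor, and the example sentences are all assumptions made for illustration. The sketch assumes that mentions have already been grouped into clusters describing the same event, and that the extractor returns a label together with a confidence score.

```python
from typing import Callable, List, Tuple

# Hypothetical interface: an extractor maps a sentence to (event_label, confidence).
Extractor = Callable[[str], Tuple[str, float]]


def bootstrap_examples(
    clusters: List[List[str]],
    extract: Extractor,
    threshold: float = 0.9,  # assumed cutoff, not taken from the paper
) -> List[Tuple[str, str]]:
    """Return new (mention, label) training pairs harvested from clusters."""
    new_examples: List[Tuple[str, str]] = []
    for cluster in clusters:
        predictions = [extract(mention) for mention in cluster]
        confident = [p for p in predictions if p[1] >= threshold]
        if not confident:
            continue  # no high-confidence seed in this cluster; skip it
        # Use the label of the most confident extraction as the cluster label.
        seed_label = max(confident, key=lambda p: p[1])[0]
        # Add the remaining (low-confidence) mentions with the seed label.
        for mention, (_, conf) in zip(cluster, predictions):
            if conf < threshold:
                new_examples.append((mention, seed_label))
    return new_examples


def toy_extract(sentence: str) -> Tuple[str, float]:
    """Toy stand-in for a trained extractor (keyword match only)."""
    if "attacked" in sentence:
        return ("Conflict.Attack", 0.95)
    return ("None", 0.2)


clusters = [
    ["Rebels attacked the convoy on Monday.",
     "A convoy came under fire from rebels."],
    ["The company opened a new office."],
]
examples = bootstrap_examples(clusters, toy_extract)
```

In this toy run, the second mention of the first cluster is labeled through its confidently extracted paraphrase, while the second cluster contributes nothing because it has no confident seed. A real system would retrain the extractor on the augmented data and would need some safeguard against noisy clusters; neither step is shown here.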
