Improving Span-based Question Answering Systems with Coarsely Labeled Data

Ankur Parikh; Hao Cheng; Kenton Lee; Kristina Toutanova; Michael Collins; Ming-Wei Chang

arxiv: 1811.02076 · v1 · pith:56KMIKLTnew · submitted 2018-11-05 · 💻 cs.CL

Improving Span-based Question Answering Systems with Coarsely Labeled Data

Hao Cheng , Ming-Wei Chang , Kenton Lee , Ankur Parikh , Michael Collins , Kristina Toutanova This is my paper

classification 💻 cs.CL

keywords dataannotatedansweransweringcoarsecoarse-grainedcoarselyfine-grained

0 comments

read the original abstract

We study approaches to improve fine-grained short answer Question Answering models by integrating coarse-grained data annotated for paragraph-level relevance and show that coarsely annotated data can bring significant performance gains. Experiments demonstrate that the standard multi-task learning approach of sharing representations is not the most effective way to leverage coarse-grained annotations. Instead, we can explicitly model the latent fine-grained short answer variables and optimize the marginal log-likelihood directly or use a newly proposed \emph{posterior distillation} learning objective. Since these latent-variable methods have explicit access to the relationship between the fine and coarse tasks, they result in significantly larger improvements from coarse supervision.

This paper has not been read by Pith yet.

Improving Span-based Question Answering Systems with Coarsely Labeled Data

discussion (0)