pith. sign in

arxiv: 1811.02076 · v1 · pith:56KMIKLTnew · submitted 2018-11-05 · 💻 cs.CL

Improving Span-based Question Answering Systems with Coarsely Labeled Data

classification 💻 cs.CL
keywords dataannotatedansweransweringcoarsecoarse-grainedcoarselyfine-grained
0
0 comments X
read the original abstract

We study approaches to improve fine-grained short answer Question Answering models by integrating coarse-grained data annotated for paragraph-level relevance and show that coarsely annotated data can bring significant performance gains. Experiments demonstrate that the standard multi-task learning approach of sharing representations is not the most effective way to leverage coarse-grained annotations. Instead, we can explicitly model the latent fine-grained short answer variables and optimize the marginal log-likelihood directly or use a newly proposed \emph{posterior distillation} learning objective. Since these latent-variable methods have explicit access to the relationship between the fine and coarse tasks, they result in significantly larger improvements from coarse supervision.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.