Semi-Supervised QA with Generative Domain-Adaptive Nets

Junjie Hu; Ruslan Salakhutdinov; William W. Cohen; Zhilin Yang

arxiv: 1702.02206 · v2 · pith:K2PQMM27new · submitted 2017-02-07 · 💻 cs.CL · cs.LG

Semi-Supervised QA with Generative Domain-Adaptive Nets

Zhilin Yang , Junjie Hu , Ruslan Salakhutdinov , William W. Cohen This is my paper

classification 💻 cs.CL cs.LG

keywords frameworkgenerativequestionquestionstextunlabeledansweringdata

0 comments

read the original abstract

We study the problem of semi-supervised question answering----utilizing unlabeled text to boost the performance of question answering models. We propose a novel training framework, the Generative Domain-Adaptive Nets. In this framework, we train a generative model to generate questions based on the unlabeled text, and combine model-generated questions with human-generated questions for training question answering models. We develop novel domain adaptation algorithms, based on reinforcement learning, to alleviate the discrepancy between the model-generated data distribution and the human-generated data distribution. Experiments show that our proposed framework obtains substantial improvement from unlabeled text.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

The False Promise of Imitating Proprietary LLMs
cs.CL 2023-05 conditional novelty 6.0

Finetuning open LMs on ChatGPT outputs creates models that mimic style and fool human raters but fail to close the performance gap to proprietary systems on tasks not well-represented in the imitation data.