
arxiv: 1606.02858 · v2 · pith:MYBZIBZDnew · submitted 2016-06-09 · 💻 cs.CL · cs.AI

A Thorough Examination of the CNN/Daily Mail Reading Comprehension Task

classification 💻 cs.CL cs.AI
keywords task, comprehension, daily, examination, mail, performance, reading, systems

Enabling a computer to understand a document so that it can answer comprehension questions is a central, yet unsolved goal of NLP. A key factor impeding its solution by machine learned systems is the limited availability of human-annotated data. Hermann et al. (2015) seek to solve this problem by creating over a million training examples by pairing CNN and Daily Mail news articles with their summarized bullet points, and show that a neural network can then be trained to give good performance on this task. In this paper, we conduct a thorough examination of this new reading comprehension task. Our primary aim is to understand what depth of language understanding is required to do well on this task. We approach this from one side by doing a careful hand-analysis of a small subset of the problems and from the other by showing that simple, carefully designed systems can obtain accuracies of 73.6% and 76.6% on these two datasets, exceeding current state-of-the-art results by 7-10% and approaching what we believe is the ceiling for performance on this task.



Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Be Consistent! Improving Procedural Text Comprehension using Label Consistency

    cs.CL 2019-06 unverdicted novelty 5.0

    A label consistency training framework improves F1 on the ProPara benchmark for procedural text comprehension by using multiple independent descriptions of the same process.

  2. Machine Reading Comprehension: a Literature Review

    cs.CL 2019-06 unverdicted novelty 1.0

    A 2019 survey of machine reading comprehension corpora and methods.