Teaching Machines to Read and Comprehend

Edward Grefenstette; Karl Moritz Hermann; Lasse Espeholt; Mustafa Suleyman; Phil Blunsom; Tomáš Kočiský; Will Kay

arxiv: 1506.03340 · v3 · pith:3A6GKUTXnew · submitted 2015-06-10 · cs.CL · cs.AI · cs.NE

Karl Moritz Hermann, Tomáš Kočiský, Edward Grefenstette, Lasse Espeholt, Will Kay, Mustafa Suleyman, Phil Blunsom

classification cs.CL cs.AI cs.NE

keywords documents, read, answer, language, large, machines, questions, reading

Teaching machines to read natural language documents remains an elusive challenge. Machine reading systems can be tested on their ability to answer questions posed on the contents of documents that they have seen, but until now large-scale training and test datasets have been missing for this type of evaluation. In this work we define a new methodology that resolves this bottleneck and provides large-scale supervised reading comprehension data. This allows us to develop a class of attention-based deep neural networks that learn to read real documents and answer complex questions with minimal prior knowledge of language structure.

Forward citations

Cited by 5 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
cs.CL 2017-05 accept novelty 8.0

TriviaQA is a new large-scale dataset for reading comprehension that features complex compositional questions, high lexical variability, and cross-sentence reasoning requirements, where current baselines reach only 40...

The Partial Testimony of Logs: Evaluation of Language Model Generation under Confounded Model Choice
cs.LG 2026-05 unverdicted novelty 7.0

An identification theorem shows that a randomized experiment and simulator together recover causal model values from confounded logs, with logs used only afterward to reduce estimation error.

MS MARCO: A Human Generated MAchine Reading COmprehension Dataset
cs.CL 2016-11 accept novelty 7.0

MS MARCO is a new large-scale machine reading comprehension dataset built from real Bing search queries, human-generated answers, and web passages, supporting three tasks including answer synthesis and passage ranking.

PromptSuite: A Task-Agnostic Framework for Multi-Prompt Generation
cs.CL 2025-07 unverdicted novelty 6.0

PromptSuite is a modular, extensible, task-agnostic framework for automatically generating diverse prompt variations to support robust multi-prompt LLM evaluation.

Inspection and Control of Self-Generated-Text Recognition Ability in Llama3-8b-Instruct
cs.LG 2024-10 unverdicted novelty 6.0

Llama3-8b-Instruct recognizes its own outputs via a residual-stream vector associated with self-authorship that can be steered to control authorship claims and perceptions.