Large-scale Simple Question Answering with Memory Networks

Antoine Bordes; Jason Weston; Nicolas Usunier; Sumit Chopra

arxiv: 1506.02075 · v1 · pith:F3LMMSKKnew · submitted 2015-06-05 · 💻 cs.LG · cs.CL

Large-scale Simple Question Answering with Memory Networks

Antoine Bordes , Nicolas Usunier , Sumit Chopra , Jason Weston This is my paper

classification 💻 cs.LG cs.CL

keywords questionansweringlarge-scalememorynetworksbecausequestionsreasoning

0 comments

read the original abstract

Training large-scale question answering systems is complicated because training sources usually cover a small portion of the range of possible questions. This paper studies the impact of multitask and transfer learning for simple question answering; a setting for which the reasoning required to answer is quite easy, as long as one can retrieve the correct evidence given a question, which can be difficult in large-scale conditions. To this end, we introduce a new dataset of 100k questions that we use in conjunction with existing benchmarks. We conduct our study within the framework of Memory Networks (Weston et al., 2015) because this perspective allows us to eventually scale up to more complex reasoning, and show that Memory Networks can be successfully trained to achieve excellent performance.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 7 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Reformer: The Efficient Transformer
cs.LG 2020-01 accept novelty 8.0

Reformer matches standard Transformer accuracy on long sequences while using far less memory and running faster via LSH attention and reversible residual layers.
TriviaQA: A Large Scale Distantly Supervised Challenge Dataset for Reading Comprehension
cs.CL 2017-05 accept novelty 8.0

TriviaQA is a new large-scale dataset for reading comprehension that features complex compositional questions, high lexical variability, and cross-sentence reasoning requirements, where current baselines reach only 40...
The Wikidata Query Logs Dataset
cs.CL 2026-02 accept novelty 7.0

The authors release the Wikidata Query Logs dataset containing 335k real question-query pairs constructed via an agent-based de-anonymization process from query service logs.
Constructing A Multi-hop QA Dataset for Comprehensive Evaluation of Reasoning Steps
cs.CL 2020-11 conditional novelty 7.0

Introduces 2WikiMultiHopQA, a multi-hop QA dataset with explicit evidence chains generated via templates and Wikidata logical rules to force and evaluate multi-hop reasoning.
Graph Alignment Topology as an Inductive Bias for Grounding Detection
cs.CL 2026-05 unverdicted novelty 6.0

A GNN trained on bipartite alignment graphs between references and LLM generations reports state-of-the-art hallucination detection across four datasets, beating prior methods and GPT-4o.
KoRe: Compact Knowledge Representations for Large Language Models
cs.CL 2026-05 unverdicted novelty 6.0

KoRe encodes 1-hop knowledge graph subgraphs as compact discrete tokens for injection into LLMs, achieving competitive benchmark performance with up to 10x token reduction.
Efficient and Transferable Agentic Knowledge Graph RAG via Reinforcement Learning
cs.CL 2025-09 unverdicted novelty 6.0

KG-R1 trains a single RL agent to retrieve from and reason over knowledge graphs in one loop, achieving higher accuracy with fewer tokens than multi-module baselines and transferring to unseen graphs.