QANet: Combining Local Convolution with Global Self-Attention for Reading Comprehension

Adams Wei Yu, David Dohan, Minh-Thang Luong, Rui Zhao, Kai Chen, Mohammad Norouzi, Quoc V · 2018 · cs.CL · arXiv 1804.09541

6 Pith papers cite this work. Polarity classification is still indexing.

abstract

Current end-to-end machine reading and question answering (Q&A) models are primarily based on recurrent neural networks (RNNs) with attention. Despite their success, these models are often slow for both training and inference due to the sequential nature of RNNs. We propose a new Q&A architecture called QANet, which does not require recurrent networks: Its encoder consists exclusively of convolution and self-attention, where convolution models local interactions and self-attention models global interactions. On the SQuAD dataset, our model is 3x to 13x faster in training and 4x to 9x faster in inference, while achieving equivalent accuracy to recurrent models. The speed-up gain allows us to train the model with much more data. We hence combine our model with data generated by backtranslation from a neural machine translation model. On the SQuAD dataset, our single model, trained with augmented data, achieves 84.6 F1 score on the test set, which is significantly better than the best published F1 score of 81.8.
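
The abstract's split between local convolution and global self-attention can be illustrated with a small pure-Python sketch. This is not the authors' implementation: the function names (`conv1d_local`, `self_attention`, `encoder_block`), the fixed smoothing kernel, the identity query/key/value projections, and the plain residual connections are all assumptions made for illustration, and learned weights, normalization, and position information are left out.

```python
import math


def conv1d_local(seq, kernel=(0.25, 0.5, 0.25)):
    """Zero-padded 1D convolution applied to each feature channel.

    output[i] depends only on tokens within len(kernel) // 2 of position i,
    which is the "local interaction" part. The kernel is a hypothetical
    fixed filter, not a learned one.
    """
    half = len(kernel) // 2
    dim = len(seq[0])
    out = []
    for i in range(len(seq)):
        acc = [0.0] * dim
        for k, w in enumerate(kernel):
            j = i + k - half
            if 0 <= j < len(seq):
                for d in range(dim):
                    acc[d] += w * seq[j][d]
        out.append(acc)
    return out


def self_attention(seq):
    """Scaled dot-product self-attention with identity projections.

    Every output position is a softmax-weighted mix of ALL tokens,
    which is the "global interaction" part.
    """
    dim = len(seq[0])
    out = []
    for q in seq:
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(dim) for k in seq]
        top = max(scores)  # subtract the max for numerical stability
        exps = [math.exp(s - top) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        out.append([sum(w * v[d] for w, v in zip(weights, seq)) for d in range(dim)])
    return out


def add(a, b):
    """Element-wise sum of two sequences of vectors (residual connection)."""
    return [[x + y for x, y in zip(r1, r2)] for r1, r2 in zip(a, b)]


def encoder_block(seq):
    """Convolution followed by self-attention, each with a residual connection."""
    h = add(seq, conv1d_local(seq))
    return add(h, self_attention(h))


tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [0.5, -0.5], [2.0, 0.0]]
changed = tokens[:-1] + [[-3.0, 4.0]]  # alter only the last token

# The convolution output at position 0 ignores the distant change ...
local_same = conv1d_local(tokens)[0] == conv1d_local(changed)[0]
# ... while the self-attention output at position 0 reacts to it.
global_differs = self_attention(tokens)[0] != self_attention(changed)[0]
encoded = encoder_block(tokens)
```

Neither step carries a recurrent dependency between positions, so each output position can be computed independently of the others, which is the property the abstract credits for the speed-up over RNN-based encoders.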

representative citing papers

Passage Re-ranking with BERT

cs.IR · 2019-01-13 · unverdicted · novelty 8.0

Fine-tuning BERT for query-passage relevance classification achieves state-of-the-art results on TREC-CAR and MS MARCO, with a 27% relative gain in MRR@10 over prior methods.

Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer

cs.LG · 2019-10-23 · unverdicted · novelty 7.0

T5 casts all NLP tasks as text-to-text generation, systematically explores pre-training choices, and reaches strong performance on summarization, QA, classification and other tasks via large-scale training on the Colossal Clean Crawled Corpus.

RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval

cs.CL · 2024-01-31 · unverdicted · novelty 6.0

RAPTOR introduces a tree-organized retrieval method using recursive abstractive summaries, achieving a 20% absolute accuracy improvement on the QuALITY benchmark when paired with GPT-4.

Attending to Emotional Narratives

cs.LG · 2019-07-08 · unverdicted · novelty 4.0

Transformer and Memory Fusion Network attention mechanisms generalize to multimodal time-series emotion recognition on emotional autobiographical narratives, achieving performance comparable to human raters in some cases.

EQuANt (Enhanced Question Answer Network)

cs.CL · 2019-06-24 · unverdicted · novelty 4.0

EQuANt extends QANet to SQuAD 2, achieving nearly twice the performance of a lightweight QANet baseline while also improving SQuAD 1.1 results via multi-task learning.

Machine Reading Comprehension: a Literature Review

cs.CL · 2019-06-30 · unverdicted · novelty 1.0

A 2019 survey of machine reading comprehension corpora and methods.

citing papers explorer

Showing 6 of 6 citing papers.

Passage Re-ranking with BERT · cs.IR · 2019-01-13 · unverdicted · none · ref 12
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer · cs.LG · 2019-10-23 · unverdicted · none · ref 80
RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval · cs.CL · 2024-01-31 · unverdicted · none · ref 125 · internal anchor
Attending to Emotional Narratives · cs.LG · 2019-07-08 · unverdicted · none · ref 21 · internal anchor
EQuANt (Enhanced Question Answer Network) · cs.CL · 2019-06-24 · unverdicted · none · ref 12 · internal anchor
Machine Reading Comprehension: a Literature Review · cs.CL · 2019-06-30 · unverdicted · none · ref 73 · internal anchor
