
QANet: Combining local convolution with global self-attention for reading comprehension

6 Pith papers cite this work. Polarity classification is still indexing.

abstract

Current end-to-end machine reading and question answering (Q&A) models are primarily based on recurrent neural networks (RNNs) with attention. Despite their success, these models are often slow for both training and inference due to the sequential nature of RNNs. We propose a new Q&A architecture called QANet, which does not require recurrent networks: Its encoder consists exclusively of convolution and self-attention, where convolution models local interactions and self-attention models global interactions. On the SQuAD dataset, our model is 3x to 13x faster in training and 4x to 9x faster in inference, while achieving equivalent accuracy to recurrent models. The speed-up gain allows us to train the model with much more data. We hence combine our model with data generated by backtranslation from a neural machine translation model. On the SQuAD dataset, our single model, trained with augmented data, achieves 84.6 F1 score on the test set, which is significantly better than the best published F1 score of 81.8.
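
The abstract's distinction between local and global interactions can be illustrated with a minimal NumPy sketch. This is not the paper's implementation: the per-channel convolution, the single attention head without learned projections, the residual connections, and all function names (`depthwise_conv1d`, `self_attention`, `encoder_block`) are assumptions chosen for illustration only.

```python
import numpy as np

def depthwise_conv1d(x, kernel):
    """Local mixing: each output position depends only on a small window.

    x: (seq_len, dim) array; kernel: (width, dim) array with odd width.
    Assumption: a per-channel convolution with zero padding.
    """
    width = kernel.shape[0]
    pad = width // 2
    padded = np.pad(x, ((pad, pad), (0, 0)))
    return np.stack([
        (padded[i:i + width] * kernel).sum(axis=0)
        for i in range(x.shape[0])
    ])

def self_attention(x):
    """Global mixing: every position attends to every other position.

    Assumption: one head and no learned query/key/value projections.
    """
    scores = x @ x.T / np.sqrt(x.shape[1])
    scores -= scores.max(axis=1, keepdims=True)  # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=1, keepdims=True)
    return weights @ x

def encoder_block(x, kernel):
    """Hypothetical block: convolution, then self-attention, each with a residual."""
    x = x + depthwise_conv1d(x, kernel)
    x = x + self_attention(x)
    return x

rng = np.random.default_rng(0)
x = rng.normal(size=(8, 4))       # 8 tokens, 4 features
kernel = rng.normal(size=(3, 4))  # window of 3 positions
out = encoder_block(x, kernel)
print(out.shape)
```

In this sketch, perturbing the last token leaves the convolution output at the first position unchanged (it lies outside the window) but does change the self-attention output there, which is the local versus global contrast the abstract describes. Neither function has a step-by-step dependency across positions, unlike an RNN.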

years

2024: 1 · 2019: 5

verdicts

unverdicted: 6

representative citing papers

Passage Re-ranking with BERT

cs.IR · 2019-01-13 · unverdicted · novelty 8.0

Fine-tuning BERT for query-passage relevance classification achieves state-of-the-art results on TREC-CAR and MS MARCO, with a 27% relative gain in MRR@10 over prior methods.

Attending to Emotional Narratives

cs.LG · 2019-07-08 · unverdicted · novelty 4.0

Transformer and Memory Fusion Network attention mechanisms generalize to multimodal time-series emotion recognition on emotional autobiographical narratives, achieving performance comparable to human raters in some cases.

EQuANt (Enhanced Question Answer Network)

cs.CL · 2019-06-24 · unverdicted · novelty 4.0

EQuANt extends QANet to SQuAD 2, achieving nearly twice the performance of a lightweight QANet baseline while also improving SQuAD 1.1 results via multi-task learning.

citing papers explorer

Showing 6 of 6 citing papers.

  • Passage Re-ranking with BERT cs.IR · 2019-01-13 · unverdicted · none · ref 12

    Fine-tuning BERT for query-passage relevance classification achieves state-of-the-art results on TREC-CAR and MS MARCO, with a 27% relative gain in MRR@10 over prior methods.

  • Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer cs.LG · 2019-10-23 · unverdicted · none · ref 80

    T5 casts all NLP tasks as text-to-text generation, systematically explores pre-training choices, and reaches strong performance on summarization, QA, classification and other tasks via large-scale training on the Colossal Clean Crawled Corpus.

  • RAPTOR: Recursive Abstractive Processing for Tree-Organized Retrieval cs.CL · 2024-01-31 · unverdicted · none · ref 125 · internal anchor

    RAPTOR introduces a tree-organized retrieval method using recursive abstractive summaries, achieving a 20% absolute accuracy improvement on the QuALITY benchmark when paired with GPT-4.

  • Attending to Emotional Narratives cs.LG · 2019-07-08 · unverdicted · none · ref 21 · internal anchor

    Transformer and Memory Fusion Network attention mechanisms generalize to multimodal time-series emotion recognition on emotional autobiographical narratives, achieving performance comparable to human raters in some cases.

  • EQuANt (Enhanced Question Answer Network) cs.CL · 2019-06-24 · unverdicted · none · ref 12 · internal anchor

    EQuANt extends QANet to SQuAD 2, achieving nearly twice the performance of a lightweight QANet baseline while also improving SQuAD 1.1 results via multi-task learning.

  • Machine Reading Comprehension: a Literature Review cs.CL · 2019-06-30 · unverdicted · none · ref 73 · internal anchor

    A 2019 survey of machine reading comprehension corpora and methods.