pith. sign in

N., Kaiser, Ł., and Polosukhin, I

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

fields

cs.LG 1

years

2019 1

verdicts

UNVERDICTED 1

representative citing papers

Generating Long Sequences with Sparse Transformers

cs.LG · 2019-04-23 · unverdicted · novelty 7.0

Sparse Transformers factorize attention to handle sequences tens of thousands long, achieving new SOTA density modeling on Enwik8, CIFAR-10, and ImageNet-64.

citing papers explorer

Showing 1 of 1 citing paper.

  • Generating Long Sequences with Sparse Transformers cs.LG · 2019-04-23 · unverdicted · none · ref 22

    Sparse Transformers factorize attention to handle sequences tens of thousands long, achieving new SOTA density modeling on Enwik8, CIFAR-10, and ImageNet-64.