pith. sign in

Sequence to sequence learning with neural networks

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

fields

cs.CL 2

years

2019 2

verdicts

UNVERDICTED 2

representative citing papers

Sharing Attention Weights for Fast Transformer

cs.CL · 2019-06-26 · unverdicted · novelty 4.0

Sharing attention weights in adjacent Transformer layers yields 1.3X inference speedup with negligible BLEU loss on ten WMT and NIST tasks.

citing papers explorer

Showing 2 of 2 citing papers.