Noam Shazeer
- 3 works
- 2 Pith-reviewed
- 66.7% Recognition coverage
- 0 queued
works
- GLU Variants Improve Transformer Pith 2020 · cs.LG · verdict UNVERDICTED · 194 Pith citing
- Attention Is All You Need. Advances in Neural Information Processing Systems, 30, 2017 metadata 2017 · 138 Pith citing
- Outrageously Large Neural Networks: The Sparsely-Gated Mixture-of-Experts Layer Pith 2017 · cs.LG · verdict ACCEPT · 220 Pith citing