Outrageously large neural networks: The sparsely-gated mixture-of- experts layer

Noam Shazeer, *Azalia Mirhoseini, *Krzysztof Maziarz, Andy Davis, Quoc Le, Geoffrey Hinton, Jeff Dean · 2017

1 Pith paper cite this work. Polarity classification is still indexing.

1 Pith paper citing it

browse 1 citing papers

representative citing papers

Unified Deployment-Aware Evaluation of Open Reasoning Language Models

cs.CL · 2026-04-08 · unverdicted · novelty 4.0 · 2 refs

A controlled multi-model evaluation on shared data subsets shows that deployment metrics and prompting choices create important tradeoffs and alter model rankings beyond accuracy alone.

citing papers explorer

Showing 1 of 1 citing paper.

Unified Deployment-Aware Evaluation of Open Reasoning Language Models cs.CL · 2026-04-08 · unverdicted · none · ref 14 · 2 links
A controlled multi-model evaluation on shared data subsets shows that deployment metrics and prompting choices create important tradeoffs and alter model rankings beyond accuracy alone.

Outrageously large neural networks: The sparsely-gated mixture-of- experts layer

fields

years

verdicts

representative citing papers

citing papers explorer