pith. sign in

From sparse to soft mixtures of experts

11 Pith papers cite this work. Polarity classification is still indexing.

11 Pith papers citing it

citation-role summary

background 2

citation-polarity summary

years

2026 10 2025 1

roles

background 2

polarities

background 2

clear filters

representative citing papers

Path-Constrained Mixture-of-Experts

cs.LG · 2026-03-18 · unverdicted · novelty 7.0

PathMoE constrains expert paths in MoE models by sharing router parameters across layer blocks, yielding more concentrated paths, better performance on perplexity and tasks, and no need for auxiliary losses.

FaceMoE: Mixture of Experts for Low-Resolution Face Recognition

cs.CV · 2026-06-30 · unverdicted · novelty 6.0

FaceMoE introduces a MoE transformer with top-k routed specialized FFN experts for resolution-aware feature extraction in low-resolution face recognition, outperforming prior methods on eleven datasets.

Tight Clusters Make Specialized Experts

cs.LG · 2025-02-21 · unverdicted · novelty 6.0

Introduces Adaptive Clustering router for MoE models that scales features to identify tight expert clusters, yielding faster convergence, robustness to corruption, and performance gains.

citing papers explorer

Showing 1 of 1 citing paper after filters.