Infinite Mixture Prototypes for Few-Shot Learning

Evan Shelhamer; Hanul Shin; Joshua B. Tenenbaum; Kelsey R. Allen

arxiv: 1902.04552 · v1 · pith:O43EQZCEnew · submitted 2019-02-12 · 💻 cs.LG · stat.ML

Infinite Mixture Prototypes for Few-Shot Learning

Kelsey R. Allen , Evan Shelhamer , Hanul Shin , Joshua B. Tenenbaum This is my paper

classification 💻 cs.LG stat.ML

keywords infinitemixtureprototypesaccuracyprototypicalclusteringdatafew-shot

0 comments

read the original abstract

We propose infinite mixture prototypes to adaptively represent both simple and complex data distributions for few-shot learning. Our infinite mixture prototypes represent each class by a set of clusters, unlike existing prototypical methods that represent each class by a single cluster. By inferring the number of clusters, infinite mixture prototypes interpolate between nearest neighbor and prototypical representations, which improves accuracy and robustness in the few-shot regime. We show the importance of adaptive capacity for capturing complex data distributions such as alphabets, with 25% absolute accuracy improvements over prototypical networks, while still maintaining or improving accuracy on the standard Omniglot and mini-ImageNet benchmarks. In clustering labeled and unlabeled data by the same clustering rule, infinite mixture prototypes achieves state-of-the-art semi-supervised accuracy. As a further capability, we show that infinite mixture prototypes can perform purely unsupervised clustering, unlike existing prototypical methods.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Learning to Theorize the World from Observation
cs.LG 2026-05 unverdicted novelty 6.0

NEO induces compositional latent programs as world theories from observations and executes them to enable explanation-driven generalization.