arXiv: 1606.05579 · v3 · pith:BELOSC5Znew · submitted 2016-06-17 · stat.ML · cs.LG · q-bio.NC

Early Visual Concept Learning with Unsupervised Deep Learning

classification: stat.ML · cs.LG · q-bio.NC
keywords: learning · unsupervised · visual · approach · data · disentangled · early · factors

Automated discovery of early visual concepts from raw image data is a major open challenge in AI research. Addressing this problem, we propose an unsupervised approach for learning disentangled representations of the underlying factors of variation. We draw inspiration from neuroscience, and show how this can be achieved in an unsupervised generative model by applying the same learning pressures as have been suggested to act in the ventral visual stream in the brain. By enforcing redundancy reduction, encouraging statistical independence, and exposure to data with transform continuities analogous to those to which human infants are exposed, we obtain a variational autoencoder (VAE) framework capable of learning disentangled factors. Our approach makes few assumptions and works well across a wide variety of datasets. Furthermore, our solution has useful emergent properties, such as zero-shot inference and an intuitive understanding of "objectness".
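One common way to express the redundancy-reduction and independence pressures described above in a VAE is to weight the KL divergence between the approximate posterior and an isotropic unit Gaussian prior more heavily than in the standard objective. The following is a minimal sketch under that assumption, not the authors' implementation: the weight `beta`, its default value, and the function names are illustrative and do not appear in the abstract.

```python
import math


def gaussian_kl(mu, log_var):
    """KL( N(mu, diag(exp(log_var))) || N(0, I) ), summed over latent dimensions."""
    return 0.5 * sum(
        math.exp(lv) + m * m - 1.0 - lv for m, lv in zip(mu, log_var)
    )


def weighted_vae_loss(recon_error, mu, log_var, beta=4.0):
    """Loss to minimize: reconstruction error plus a beta-weighted KL penalty.

    `beta` is an assumed hyperparameter; beta = 1 gives the standard VAE loss,
    and larger values press the posterior harder towards the factorized prior.
    """
    return recon_error + beta * gaussian_kl(mu, log_var)
```

A posterior that already matches the prior (zero mean, unit variance) incurs no penalty, so in that case the loss reduces to the reconstruction error alone. Any latent dimension that departs from the prior must pay for it with improved reconstruction.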

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Autoencoding sensory substitution

    q-bio.NC · 2019-07 · unverdicted · novelty 4.0

    Deep recurrent autoencoders convert images to shortened audio signals that incorporate hearing models, enabling above-chance hand posture discrimination and object reaching after a few hours of training instead of months.