pith. sign in

arxiv: 1705.02193 · v2 · pith:MTDBQ2VTnew · submitted 2017-05-05 · 💻 cs.CV · stat.ML

Unsupervised learning of object landmarks by factorized spatial embeddings

classification 💻 cs.CV stat.ML
keywords landmarksobjectlearningunsupervisedapproachcategoriesstructureaccuracy
0
0 comments X
read the original abstract

Learning automatically the structure of object categories remains an important open problem in computer vision. In this paper, we propose a novel unsupervised approach that can discover and learn landmarks in object categories, thus characterizing their structure. Our approach is based on factorizing image deformations, as induced by a viewpoint change or an object deformation, by learning a deep neural network that detects landmarks consistently with such visual effects. Furthermore, we show that the learned landmarks establish meaningful correspondences between different object instances in a category without having to impose this requirement explicitly. We assess the method qualitatively on a variety of object types, natural and man-made. We also show that our unsupervised landmarks are highly predictive of manually-annotated landmarks in face benchmark datasets, and can be used to regress these with a high degree of accuracy.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.