pith. sign in

arxiv: 2101.07988 · v1 · pith:H4L67IXVnew · submitted 2021-01-20 · 💻 cs.CV

Semi-supervised Keypoint Localization

classification 💻 cs.CV
keywords keypointimagerepresentationssemi-supervisedanimaldatasetdetectionimages
0
0 comments X
read the original abstract

Knowledge about the locations of keypoints of an object in an image can assist in fine-grained classification and identification tasks, particularly for the case of objects that exhibit large variations in poses that greatly influence their visual appearance, such as wild animals. However, supervised training of a keypoint detection network requires annotating a large image dataset for each animal species, which is a labor-intensive task. To reduce the need for labeled data, we propose to learn simultaneously keypoint heatmaps and pose invariant keypoint representations in a semi-supervised manner using a small set of labeled images along with a larger set of unlabeled images. Keypoint representations are learnt with a semantic keypoint consistency constraint that forces the keypoint detection network to learn similar features for the same keypoint across the dataset. Pose invariance is achieved by making keypoint representations for the image and its augmented copies closer together in feature space. Our semi-supervised approach significantly outperforms previous methods on several benchmarks for human and animal body landmark localization.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. GKDT: General Keypoint Detection Transformer

    cs.CV 2026-07 unverdicted novelty 6.0

    Creates MegaKPT dataset and GKDT promptable transformer model for general keypoint detection across diverse objects with reported high accuracy on 22 test sets.