Tagger: Deep Unsupervised Perceptual Grouping

Antti Rasmus; Harri Valpola; Jürgen Schmidhuber; Klaus Greff; Mathias Berglund; Tele Hotloo Hao

arxiv: 1606.06724 · v2 · pith:RL6QGHRKnew · submitted 2016-06-21 · 💻 cs.CV · cs.NE

Klaus Greff, Antti Rasmus, Mathias Berglund, Tele Hotloo Hao, Jürgen Schmidhuber, Harri Valpola

keywords: segmentation, system, classification, framework, grouping, images, inference, inputs

We present a framework for efficient perceptual inference that explicitly reasons about the segmentation of its inputs and features. Rather than being trained for any specific segmentation, our framework learns the grouping process in an unsupervised manner or alongside any supervised task. By enriching the representations of a neural network, we enable it to group the representations of different objects in an iterative manner. By allowing the system to amortize the iterative inference of the groupings, we achieve very fast convergence. In contrast to many other recently proposed methods for addressing multi-object scenes, our system does not assume the inputs to be images and can therefore directly handle other modalities. For multi-digit classification of very cluttered images that require texture segmentation, our method offers improved classification performance over convolutional networks despite being fully connected. Furthermore, we observe that our system greatly improves on the semi-supervised result of a baseline Ladder network on our dataset, indicating that segmentation can also improve sample efficiency.
