arxiv: 1703.00767 · v3 · pith:IUI3YE6Tnew · submitted 2017-03-02 · 💻 cs.CV · cs.LG

Attentive Recurrent Comparators

Pranav Shyam , Shubham Gupta , Ambedkar Dukkipati This is my paper

classification 💻 cs.CV cs.LG

keywords representationsarcsattentivecomparatorsdeveloplearningone-shotrecurrent

0 comments

read the original abstract

Rapid learning requires flexible representations to quickly adopt to new evidence. We develop a novel class of models called Attentive Recurrent Comparators (ARCs) that form representations of objects by cycling through them and making observations. Using the representations extracted by ARCs, we develop a way of approximating a \textit{dynamic representation space} and use it for one-shot learning. In the task of one-shot classification on the Omniglot dataset, we achieve the state of the art performance with an error rate of 1.5\%. This represents the first super-human result achieved for this task with a generic model that uses only pixel information.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Compressive Transformers for Long-Range Sequence Modelling
cs.LG 2019-11 unverdicted novelty 6.0

Compressive Transformer sets new records on WikiText-103 (17.1 ppl) and Enwik8 (0.97 bpc) via memory compression and introduces the PG-19 long-range language benchmark.