pith. sign in

arxiv: 1510.08973 · v1 · pith:V5Y4LMTMnew · submitted 2015-10-30 · 💻 cs.CV

VISALOGY: Answering Visual Analogy Questions

classification 💻 cs.CV
keywords imagequestionsanalogyansweringimagesvisualmappingnatural
0
0 comments X
read the original abstract

In this paper, we study the problem of answering visual analogy questions. These questions take the form of image A is to image B as image C is to what. Answering these questions entails discovering the mapping from image A to image B and then extending the mapping to image C and searching for the image D such that the relation from A to B holds for C to D. We pose this problem as learning an embedding that encourages pairs of analogous images with similar transformations to be close together using convolutional neural networks with a quadruple Siamese architecture. We introduce a dataset of visual analogy questions in natural images, and show first results of its kind on solving analogy questions on natural images.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.