
arxiv: 1607.08764 · v1 · pith:HVRCX3JInew · submitted 2016-07-29 · 💻 cs.CV

SwiDeN : Convolutional Neural Networks For Depiction Invariant Object Recognition

classification 💻 cs.CV
keywords swiden, object, recognition, architectures, convolutional, depicted, depiction-invariant, depictive

Current state-of-the-art object recognition architectures achieve impressive performance but are typically specialized for a single depictive style (e.g., photos only or sketches only). In this paper, we present SwiDeN, our Convolutional Neural Network (CNN) architecture that recognizes objects regardless of how they are visually depicted (line drawing, realistic shaded drawing, photograph, etc.). In SwiDeN, we utilize a novel "deep" depictive style-based switching mechanism that addresses both the depiction-specific and the depiction-invariant aspects of the problem. We compare SwiDeN with alternative architectures and prior work on a 50-category Photo-Art dataset containing objects depicted in multiple styles. Experimental results show that SwiDeN outperforms other approaches on the depiction-invariant object recognition problem.
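
The abstract does not say where the switch sits in the network or how the depictive style is determined, so the following is only a toy sketch of the general idea of style-based switching, not the SwiDeN architecture itself. It assumes a shared (depiction-invariant) feature stage followed by one branch per style, with the style supplied as an input. All names (`shared_features`, `make_branch`, `BRANCHES`, `classify`), the style labels, and the weights are hypothetical.

```python
from typing import Callable, Dict, List

Vector = List[float]


def shared_features(image: Vector) -> Vector:
    """Hypothetical depiction-invariant stage shared by every style.

    A ReLU stands in for whatever shared layers a real network would use.
    """
    return [max(0.0, x) for x in image]


def make_branch(weights: List[Vector]) -> Callable[[Vector], Vector]:
    """Build a hypothetical depiction-specific scorer (one weight row per category)."""
    def branch(features: Vector) -> Vector:
        return [sum(w * f for w, f in zip(row, features)) for row in weights]
    return branch


# Illustrative, hand-picked weights; nothing here is learned or taken from the paper.
BRANCHES: Dict[str, Callable[[Vector], Vector]] = {
    "photo": make_branch([[1.0, 0.0], [0.0, 1.0]]),
    "line_drawing": make_branch([[0.0, 1.0], [1.0, 0.0]]),
}


def classify(image: Vector, style: str) -> int:
    """Route the shared features through the branch selected by `style`.

    Assumption: the style label is given; a real system might predict it.
    Returns the index of the highest-scoring category.
    """
    features = shared_features(image)
    scores = BRANCHES[style](features)  # the "switch": pick a style-specific branch
    return max(range(len(scores)), key=scores.__getitem__)
```

Calling `classify(image, "photo")` and `classify(image, "line_drawing")` on the same input runs the same shared stage but scores it with different branches, which is the separation between depiction-invariant and depiction-specific processing that the abstract refers to.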
