Common Representation Learning Using Step-based Correlation Multi-Modal CNN

Balasubramanian Raman; Gaurav Bhatt; Piyush Jha

arxiv: 1711.00003 · v1 · pith:LMNSEIJKnew · submitted 2017-10-31 · 💻 cs.CV

Common Representation Learning Using Step-based Correlation Multi-Modal CNN

Gaurav Bhatt , Piyush Jha , Balasubramanian Raman This is my paper

classification 💻 cs.CV

keywords learningcommonrepresentationdataperformancetechniquesapproachesautoencoder

0 comments

read the original abstract

Deep learning techniques have been successfully used in learning a common representation for multi-view data, wherein the different modalities are projected onto a common subspace. In a broader perspective, the techniques used to investigate common representation learning falls under the categories of canonical correlation-based approaches and autoencoder based approaches. In this paper, we investigate the performance of deep autoencoder based methods on multi-view data. We propose a novel step-based correlation multi-modal CNN (CorrMCNN) which reconstructs one view of the data given the other while increasing the interaction between the representations at each hidden layer or every intermediate step. Finally, we evaluate the performance of the proposed model on two benchmark datasets - MNIST and XRMB. Through extensive experiments, we find that the proposed model achieves better performance than the current state-of-the-art techniques on joint common representation learning and transfer learning tasks.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Deep Multi-View Learning via Task-Optimal CCA
cs.LG 2019-07 unverdicted novelty 6.0

End-to-end deep optimization of CCA plus task loss produces discriminative shared representations that outperform prior multi-view methods on classification and semi-supervised tasks.