
arXiv: 1702.02519 · v2 · pith:MFEKSWVNnew · submitted 2017-02-08 · cs.LG · cs.AI · stat.ML

Deep Generalized Canonical Correlation Analysis

classification: cs.LG, cs.AI, stat.ML
keywords: dgcca, learning, deep, representation, generalized, nonlinear, analysis, canonical

We present Deep Generalized Canonical Correlation Analysis (DGCCA), a method for learning nonlinear transformations of arbitrarily many views of data, such that the resulting transformations are maximally informative of each other. While methods exist for nonlinear two-view representation learning (Deep CCA; Andrew et al., 2013) and for linear many-view representation learning (Generalized CCA; Horst, 1961), DGCCA is the first CCA-style multiview representation learning technique that combines the flexibility of nonlinear (deep) representation learning with the statistical power of incorporating information from many independent sources, or views. We present the DGCCA formulation as well as an efficient stochastic optimization algorithm for solving it. We learn DGCCA representations on two distinct datasets for three downstream tasks: phonetic transcription from acoustic and articulatory measurements, and recommending hashtags and friends on a dataset of Twitter users. We find that DGCCA representations soundly beat existing methods at phonetic transcription and hashtag recommendation, and in general perform no worse than standard linear many-view techniques.
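The abstract does not spell out the formulation, so the following is only a minimal sketch of the linear many-view baseline it builds on, assuming the common MAX-VAR form of Generalized CCA: find a shared representation G with orthonormal columns and per-view linear maps U_j such that each view, after mapping, is close to G. This is not the DGCCA algorithm itself; under that assumption, a deep variant would feed the outputs of per-view neural networks into the same step instead of the raw views. The function name `linear_gcca`, the ridge term `reg`, and the synthetic data are hypothetical and chosen for illustration.

```python
import numpy as np


def linear_gcca(views, k, reg=1e-8):
    """Linear MAX-VAR GCCA sketch (illustrative, not the paper's code).

    views: list of arrays, each (n_samples, d_j), rows aligned across views.
    k:     dimensionality of the shared representation.
    reg:   small ridge term added to each view's covariance
           (an assumption, for numerical stability).
    Returns (G, Us): G is (n_samples, k) with orthonormal columns;
    Us[j] is (d_j, k) and maps centered view j toward G.
    """
    n = views[0].shape[0]
    centered = [X - X.mean(axis=0) for X in views]

    # Sum of (regularized) projection matrices onto each view's column space.
    M = np.zeros((n, n))
    inv_covs = []
    for X in centered:
        C_inv = np.linalg.inv(X.T @ X + reg * np.eye(X.shape[1]))
        inv_covs.append(C_inv)
        M += X @ C_inv @ X.T

    # Shared representation: top-k eigenvectors of the symmetric matrix M.
    # eigh returns eigenvalues in ascending order, so take the last k columns.
    _, eigvecs = np.linalg.eigh(M)
    G = eigvecs[:, -k:][:, ::-1]

    # Per-view least-squares maps from each centered view to G.
    Us = [C_inv @ X.T @ G for C_inv, X in zip(inv_covs, centered)]
    return G, Us


# Synthetic example: three views that are noisy linear functions of one
# hidden 2-dimensional signal.
rng = np.random.default_rng(0)
z = rng.normal(size=(200, 2))
views = [z @ rng.normal(size=(2, d)) + 0.01 * rng.normal(size=(200, d))
         for d in (5, 7, 4)]
G, Us = linear_gcca(views, k=2)
```

The sketch builds an n-by-n matrix, so it only suits small sample counts; the stochastic optimization the abstract mentions presumably exists to avoid that kind of cost, but its details are not given here.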


Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. From Classical Machine Learning to Emerging Foundation Models: Review on Multimodal Data Integration for Cancer Research

    q-bio.QM 2025-07 unverdicted novelty 3.0

    A review mapping the transition from classical machine learning to foundation models for multimodal data integration in cancer research.