How deep learning works --The geometry of deep learning

· 2017 · cs.LG · arXiv 1710.10784

2 Pith papers cite this work. Polarity classification is still indexing.

2 Pith papers citing it

open full Pith review browse 2 citing papers arXiv PDF

abstract

Why and how that deep learning works well on different tasks remains a mystery from a theoretical perspective. In this paper we draw a geometric picture of the deep learning system by finding its analogies with two existing geometric structures, the geometry of quantum computations and the geometry of the diffeomorphic template matching. In this framework, we give the geometric structures of different deep learning systems including convolutional neural networks, residual networks, recursive neural networks, recurrent neural networks and the equilibrium prapagation framework. We can also analysis the relationship between the geometrical structures and their performance of different networks in an algorithmic level so that the geometric framework may guide the design of the structures and algorithms of deep learning systems.

representative citing papers

Deep network as memory space: complexity, generalization, disentangled representation and interpretability

cs.LG · 2019-07-12 · unverdicted · novelty 5.0

Deep networks are framed as memory spaces whose complexity is defined by a Fisher metric, with the least action principle linking this complexity to generalization and disentanglement for better interpretability.

Gauge theory and twins paradox of disentangled representations

cs.LG · 2019-06-24 · unverdicted · novelty 3.0

Authors propose a fibre bundle gauge theory model for disentangled representations and connect it to the relativity twins paradox.

citing papers explorer

Showing 2 of 2 citing papers.

Deep network as memory space: complexity, generalization, disentangled representation and interpretability cs.LG · 2019-07-12 · unverdicted · none · ref 13 · internal anchor
Deep networks are framed as memory spaces whose complexity is defined by a Fisher metric, with the least action principle linking this complexity to generalization and disentanglement for better interpretability.
Gauge theory and twins paradox of disentangled representations cs.LG · 2019-06-24 · unverdicted · none · ref 3 · internal anchor
Authors propose a fibre bundle gauge theory model for disentangled representations and connect it to the relativity twins paradox.

How deep learning works --The geometry of deep learning

fields

years

verdicts

representative citing papers

citing papers explorer