Connectivity-Optimized Representation Learning via Persistent Homology

Christoph Hofer; Mandar Dixit; Marc Niethammer; Roland Kwitt

arxiv: 1906.09003 · v1 · pith:BW4D5X2Gnew · submitted 2019-06-21 · 💻 cs.LG · cs.CG· math.AT· stat.ML

Connectivity-Optimized Representation Learning via Persistent Homology

Christoph Hofer , Roland Kwitt , Mandar Dixit , Marc Niethammer This is my paper

Pith reviewed 2026-05-25 19:14 UTC · model grok-4.3

classification 💻 cs.LG cs.CGmath.ATstat.ML

keywords persistent homologyrepresentation learningautoencodersone-class learningkernel density estimationlatent space connectivityanomaly detection

0 comments

The pith

A persistent homology loss controls the connectivity of an autoencoder's latent space to support one-class learning with kernel density estimators.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops a loss based on persistent homology to enforce controllable connectivity properties in the latent space of an autoencoder. This structure then supports one-class learning by allowing informed parameter selection when modeling the in-class distribution via kernel density estimators. On computer vision data the resulting models match existing methods in overall performance and outperform them substantially when the number of samples is small. The work further shows that a single autoencoder trained on auxiliary unlabeled data produces a latent mapping that can be reused across different one-class tasks.

Core claim

A novel loss operating on persistent homology information controls the connectivity of an autoencoder's latent space under mild conditions that keep the loss differentiable. The controlled connectivity enables informed parameter selection for kernel density estimators that model the in-class distribution. One-class models built this way achieve competitive results on vision benchmarks and large gains in the low-sample regime. A single autoencoder trained once on auxiliary data suffices to produce reusable latent mappings for multiple one-class problems.

What carries the argument

persistent homology loss - a loss term that uses information from persistent homology to impose desired connectivity properties on an autoencoder's latent space.

If this is right

The loss is differentiable under mild conditions.
The imposed connectivity enables informed parameter selection for kernel density estimators in one-class learning.
The resulting one-class models achieve competitive performance on computer vision data.
Performance advantages are largest in the low sample size regime.
A single autoencoder trained on auxiliary unlabeled data yields a reusable latent mapping across datasets.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The reusability of the encoder points to potential use in transfer settings for anomaly detection across related domains.
Connectivity optimization may extend to other density-based downstream tasks such as clustering or semi-supervised learning.
The low-sample gains could be checked on non-image data to test whether the benefit is specific to computer vision.

Load-bearing premise

The persistent homology loss is differentiable under mild conditions and the connectivity properties it induces can be leveraged for parameter selection in kernel density estimation for one-class learning.

What would settle it

Experiments on standard computer vision datasets that show the one-class models fail to outperform other methods by a large margin in the low-sample regime would falsify the performance claims.

Figures

Figures reproduced from arXiv: 1906.09003 by Christoph Hofer, Mandar Dixit, Marc Niethammer, Roland Kwitt.

**Figure 1.** Figure 1: Vietoris-Rips complex built from S = {z1, z2, z3} with only zero- and one-dimensional simplices, i.e., vertices and edges. 2.1. Filtration/Persistent homology To study point clouds of latent representations, zi , from a topological perspective, consider the union of closed balls (with radius r) around zi w.r.t. some metric δ on R n, i.e., Sr = [ b i=1 B(zi , r) with r ≥ 0 . (2) Sr induces a topological (su… view at source ↗

**Figure 2.** Figure 2: 2D toy example of a connectivity-optimized mapping, mlp : R 2 → R 2 (see §3.2), learned on 1,500 samples, xi, from three Gaussians (left). The figure highlights the homogenization effect enforced by the proposed loss, at 20 (middle) / 60 (right) training epochs and lists the mean min./avg./max. values of εt, i.e., (α, ˆ ε,ˆ βˆ), computed over 3,000 batches of size 50. 3.2. Toy example We demonstrate the ef… view at source ↗

**Figure 3.** Figure 3: Autoencoder architecture with B independent branches mapping into latent space Z ⊂ R n = R D × · · · × R D. The connectivity loss Lη is computed per branch, summed, and added to the reconstruction loss (here k · k1). of a new class C, we first compute zi = fθ(xi) and then split zi into its D-dimensional parts z 1 i , . . . , zB i , provided by each branch (see [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗

**Figure 4.** Figure 4: Connectivity (left) and reconstruction (right) loss over all training iterations on CIFAR-100 w/ and w/o branching. Thus, with respect to reconstruction, the latent space carries equivalent information with and without branching, but is structurally different. Further evidence is provided when using fθ for one-class learning on CIFAR-10. Branching leads to an average AUC of 0.78 and 0.75 (for 16/32 branch… view at source ↗

**Figure 5.** Figure 5: (Left) Connectivity loss over training iterations on CIFAR-100 for 16 branches and varying λ; (Right) One-class performance (AUC) on CIFAR-10 over the number of training samples, 10 ≤ m ≤ 5,000, per class. During training, the behavior of Lη is almost equal for λ ≥ 10.0. For λ = 1.0, however, the loss noticeably converges to a higher value. In fact, reconstruction error dominates in the latter case, leadin… view at source ↗

**Figure 8.** Figure 8: Average εd, d ∈ †(S), per branch, computed from batches, S, of size 100 over CIFAR-100 (test split) and TinyImageNet (test split); fθ is learned from the training portion of CIFAR-100 with η = 2. in training neural networks (e.g., 32, 64, 128), the runtime difference is negligible, especially compared to the overall cost of backpropagation. Importantly, our method integrates well into existing deep learni… view at source ↗

read the original abstract

We study the problem of learning representations with controllable connectivity properties. This is beneficial in situations when the imposed structure can be leveraged upstream. In particular, we control the connectivity of an autoencoder's latent space via a novel type of loss, operating on information from persistent homology. Under mild conditions, this loss is differentiable and we present a theoretical analysis of the properties induced by the loss. We choose one-class learning as our upstream task and demonstrate that the imposed structure enables informed parameter selection for modeling the in-class distribution via kernel density estimators. Evaluated on computer vision data, these one-class models exhibit competitive performance and, in a low sample size regime, outperform other methods by a large margin. Notably, our results indicate that a single autoencoder, trained on auxiliary (unlabeled) data, yields a mapping into latent space that can be reused across datasets for one-class learning.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper gives a concrete differentiable PH loss to enforce latent connectivity in autoencoders for better one-class KDE, but lacks checks that training stays in the differentiable regime.

read the letter

The main contribution is a loss term that pulls information from persistent homology to penalize unwanted connectivity in an autoencoder's latent space. They show this can be differentiated under mild conditions, analyze the induced properties, and then reuse the encoder for one-class learning where the controlled connectivity helps choose KDE bandwidths. On vision data the low-sample results look competitive and sometimes substantially better than baselines, with the added practical point that one auxiliary-trained encoder works across multiple target datasets.

Referee Report

2 major / 0 minor

Summary. The paper proposes a persistent homology loss to impose controllable connectivity on the latent space of an autoencoder. It asserts differentiability under mild conditions, provides a theoretical analysis of the induced properties, and applies the resulting representations to one-class learning by enabling informed bandwidth selection for kernel density estimators. On computer vision datasets the one-class models are competitive overall and outperform baselines by a large margin in the low-sample regime; a single auxiliary-data autoencoder is shown to be reusable across target datasets.

Significance. If the differentiability claim and the causal link between the imposed connectivity and improved KDE parameter selection both hold, the work would offer a concrete route for injecting topological control into representation learning pipelines, with immediate relevance to anomaly detection and low-data regimes. The reuse of a single auxiliary model across datasets is a practical strength.

major comments (2)

[Abstract / theoretical analysis] Abstract and theoretical analysis section: the claim that the PH loss is differentiable under mild conditions and that the resulting connectivity can be leveraged for KDE bandwidth selection is load-bearing, yet the manuscript provides no empirical verification that training trajectories remain inside the differentiable strata of the persistence diagram throughout optimization with standard gradient descent.
[Experimental evaluation] One-class learning experiments: the attribution of performance gains to the connectivity properties (rather than generic representation learning) requires an ablation that isolates the PH loss; without it, the central claim that the imposed structure enables informed parameter selection cannot be fully substantiated.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive comments and the positive assessment of the work's potential. We address each major comment below.

read point-by-point responses

Referee: [Abstract / theoretical analysis] Abstract and theoretical analysis section: the claim that the PH loss is differentiable under mild conditions and that the resulting connectivity can be leveraged for KDE bandwidth selection is load-bearing, yet the manuscript provides no empirical verification that training trajectories remain inside the differentiable strata of the persistence diagram throughout optimization with standard gradient descent.

Authors: The manuscript provides a theoretical analysis establishing differentiability of the PH loss under mild conditions on the persistence diagram (no critical pairs crossing during optimization). These conditions are generic for the loss formulation and are expected to hold under standard gradient descent, as the loss penalizes deviations from the target connectivity. We agree, however, that explicit empirical verification would strengthen the claim. In the revised version we will add monitoring of persistence diagrams along training trajectories to confirm that the strata remain differentiable. revision: yes
Referee: [Experimental evaluation] One-class learning experiments: the attribution of performance gains to the connectivity properties (rather than generic representation learning) requires an ablation that isolates the PH loss; without it, the central claim that the imposed structure enables informed parameter selection cannot be fully substantiated.

Authors: The reported experiments compare against standard autoencoders and other one-class methods, showing gains especially in the low-sample regime that align with the theoretical link to KDE bandwidth selection. To isolate the PH loss contribution more directly, we will include an ablation (with vs. without the PH term) in the revised manuscript. This will provide clearer evidence that the connectivity properties, rather than generic representation learning, enable the informed parameter selection. revision: yes

Circularity Check

0 steps flagged

No circularity; derivation relies on external persistent homology and independent empirical evaluation

full rationale

The paper introduces a novel loss operating on persistent homology information to control autoencoder latent connectivity, asserts differentiability under mild conditions with accompanying theoretical analysis, and applies the resulting structure to KDE parameter selection for one-class learning. No quoted equations or steps in the abstract reduce any claimed prediction or result to a fitted input by construction, nor do they rely on load-bearing self-citations or imported uniqueness theorems. Performance is assessed via standard computer vision datasets and comparisons to other methods, rendering the chain self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The approach rests on the differentiability assumption for the homology loss and on hyperparameters that balance the new loss against reconstruction; no new entities are postulated.

free parameters (1)

homology loss weight
Hyperparameter balancing the persistent homology term against the standard reconstruction loss, chosen to achieve desired connectivity.

axioms (1)

domain assumption Persistent homology loss is differentiable under mild conditions
Invoked to justify end-to-end training of the autoencoder.

pith-pipeline@v0.9.0 · 5689 in / 1146 out tokens · 45913 ms · 2026-05-25T19:14:40.593269+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

52 extracted references · 52 canonical work pages · 1 internal anchor

[1]

write newline

" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION format.date year duplicate empty "emp...

work page
[2]

On the surprising behavior of distance metrics in high dimensional space

Aggarwal, C., Hinneburg, A., and Keim, D. On the surprising behavior of distance metrics in high dimensional space. In ICDT, 2001

work page 2001
[3]

Distributed computation of persistent homology

Bauer, U., Kerber, M., and Reininghaus, J. Distributed computation of persistent homology. In ALENEX, 2014 a

work page 2014
[4]

Clear and compress: Computing persistent homology in chunks

Bauer, U., Kerber, M., and Reininghaus, J. Clear and compress: Computing persistent homology in chunks. In Topological Methods in Data Analysis and Visualization III, pp.\ 103--117. Springer, 2014 b

work page 2014
[5]

A topological regularizer for classifiers via persistent homology

Chen, C., Ni, X., Bai, Q., and Wang, Y. A topological regularizer for classifiers via persistent homology. In AISTATS, 2019

work page 2019
[6]

R-FCN : Object detection via region-based fully convolutional networks

Dai, J., Li, Y., He, K., and Sun, J. R-FCN : Object detection via region-based fully convolutional networks. In NIPS, 2016

work page 2016
[7]

Dualities in persistent (co)homology

de Silva , V., Morozov, D., and Vejdemo-Johansson, M. Dualities in persistent (co)homology. Inverse Problems, 27 0 (12): 0 124003, 2011

work page 2011
[8]

Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and Fei, L. F. Imagenet: A large-scale hierarchical image database. In CVPR, 2009

work page 2009
[9]

SimBa : An efficient tool for approximating Rips -filtration persistence via simplicial batch-collapse

Dey, T., Shi, D., and Wang, Y. SimBa : An efficient tool for approximating Rips -filtration persistence via simplicial batch-collapse. In ESA, 2016

work page 2016
[10]

and Harer, J

Edelsbrunner, H. and Harer, J. L. Computational Topology : An Introduction. American Mathematical Society, 2010

work page 2010
[11]

and El-Yaniv, R

Goland, I. and El-Yaniv, R. Deep anomaly detection using geometric transformations. In NIPS, 2018

work page 2018
[12]

Generating Sequences With Recurrent Neural Networks

Graves, A. Generating sequences with recurrent neural networks. CoRR, 2013. https://arxiv.org/abs/1308.0850

work page internal anchor Pith review Pith/arXiv arXiv 2013
[13]

Deep residual learning for image recognition

He, K., Zhang, X., Ren, S., and Sun, J. Deep residual learning for image recognition. In CVPR, 2016

work page 2016
[14]

and Gimpel, K

Hendrycks, D. and Gimpel, K. A baseline for detecting misclassified and out-of-distribution examples in neural networks. In ICLR, 2017

work page 2017
[15]

Densely connected convolutional networks

Huang, G., Liu, Z., van der Maaten , L., and Weinberger, K. Densely connected convolutional networks. In CVPR, 2017

work page 2017
[16]

and Yamada, M

Iwata, T. and Yamada, M. Multi-view anomaly detection via robust probabilistic latent variable models. In NIPS, 2016

work page 2016
[17]

and Ba, J

Kingma, D. and Ba, J. Adam: A method for stochastic optimization. In ICLR, 2014

work page 2014
[18]

and Welling, M

Kingma, D. and Welling, M. Auto-encoding variational Bayes . In ICLR, 2014

work page 2014
[19]

and Hinton, G

Krizhevsky, A. and Hinton, G. Learning multiple layers of features from tiny images. Technical report, University of Toronto, 2009

work page 2009
[20]

Krizhevsky, A., Sutskever, I., and Hinton, G. E. Imagenet classification with deep convolutional neural networks. In NIPS, 2012

work page 2012
[21]

Training confidence-calibrated classifiers for detecting out-of-distribution samples

Lee, K., Lee, H., Lee, K., and Shin, J. Training confidence-calibrated classifiers for detecting out-of-distribution samples. In ICLR, 2018

work page 2018
[22]

Enhancing the reliability of out-of-distribution image detection in neural networks

Liang, S., Y.Li, and Srikant, R. Enhancing the reliability of out-of-distribution image detection in neural networks. In ICLR, 2018

work page 2018
[23]

SSD: single shot multibox detector

Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., and Berg, A. SSD: single shot multibox detector. In ECCV, 2016

work page 2016
[24]

and Frey, B

Makhzani, A. and Frey, B. k -sparse autoencoders. In ICLR, 2014

work page 2014
[25]

Jaitly, J

Makhzani, A., annd N. Jaitly, J. S., and Goodfellow, I. Adversarial autoencoders. In ICLR, 2016

work page 2016
[26]

Automatic differentiation in PyTorch

Paszke, A., Gross, S., Chintala, S., Chanan, G., Yang, E., DeVito, Z., Lin, Z., Demaison, A., Antiga, L., and Lerer, A. Automatic differentiation in PyTorch . In NIPS Autodiff WS, 2017

work page 2017
[27]

A review of novelty detection

Pimentel, M., D.A.Clifton, Clifton, L., and Tarassenko, L. A review of novelty detection. Sig. Proc., 99: 0 215--249, 2014

work page 2014
[28]

Persistent homology for learning densities with bounded support

Pokorny, F., Ek, C., Kjellstr\"om, H., and Kragic, D. Persistent homology for learning densities with bounded support. In NIPS, 2012 a

work page 2012
[29]

Topological constraints and kernel-based density estimation

Pokorny, F., Ek, C., Kjellstr\"om, H., and Kragic, D. Topological constraints and kernel-based density estimation. In NIPS WS on Algebraic Topology and Machine Learning, 2012 b

work page 2012
[30]

Unsupervised representation learning with deep convolutional generative adversarial networks

Radford, A., Metz, L., and Chintala, S. Unsupervised representation learning with deep convolutional generative adversarial networks. In ICLR, 2016

work page 2016
[31]

Faster R-CNN: towards real-time object detection with region proposal networks

Ren, S., He, K., Girshick, R., and Sun, J. Faster R-CNN: towards real-time object detection with region proposal networks. In NIPS, 2015

work page 2015
[32]

Contractive auto-encoders: Explicit inveriance during feature extraction

Rifai, S., Vincent, P., Muller, X., Glorot, X., and Bengio, Y. Contractive auto-encoders: Explicit inveriance during feature extraction. In ICML, 2011

work page 2011
[33]

Deep one-class classification

Ruff, L., Vandermeulen, R., Goernitz, N., Deecke, L., Siddiqui, S., Bindern, A., M\"uller, E., and Kloft, M. Deep one-class classification. In ICML, 2018

work page 2018
[34]

Learning representations by backpropagating errors

Rumelhart, D., Hinton, G., and Williams, R. Learning representations by backpropagating errors. Nature, 323: 0 533--536, 1986

work page 1986
[35]

Adversarially learned one-class classifier for novelty detection

Sabokrou, M., Khalooei, M., Fathy, M., and Adeli, E. Adversarially learned one-class classifier for novelty detection. In CVPR, 2018

work page 2018
[36]

Estimating the support of a highdimensional distribution

Sch\"olkof, B., Platt, J., Shawe-Taylor, J., Smola, A., and Williamson, R. Estimating the support of a highdimensional distribution. Neural computation, 13 0 (7): 0 1443–1471, 2001

work page 2001
[37]

Sequence to sequence learning with neural networks

Sutskever, I., Vinyals, O., and Le, Q. Sequence to sequence learning with neural networks. In NIPS, 2014

work page 2014
[38]

Metric entropy analogues of sum set theory

Tao, T. Metric entropy analogues of sum set theory. Online: https://bit.ly/2zRAKUy, 2014

work page 2014
[39]

Java P lex: A research software package for persistent (co)homology

Tausz, A., Vejdemo-Johansson, M., and Adams, H. Java P lex: A research software package for persistent (co)homology. In ICMS, 2014

work page 2014
[40]

and Duin, R

Tax, D. and Duin, R. Support vector data description. Machine learning, 54 0 (1): 0 45--66, 2004

work page 2004
[41]

and Duin, R

Tax, D. and Duin, R. Growing multi-class classifiers with a reject option. Pattern Recognition Letters, 29: 0 1565--1570, 2008

work page 2008
[42]

Wasserstein auto-encoders

Tolstikhin, I., Bousquet, O., Gelly, S., and Sch\"olkopf, B. Wasserstein auto-encoders. In ICLR, 2018

work page 2018
[43]

Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion

Vincent, P., Larochele, H., Lajoie, I., Bengio, Y., and Manzagol, P.-A. Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. JMLR, 11: 0 3371--3408, 2010

work page 2010
[44]

Learning discriminative reconstructions for unsupervised outlier removal

Xia, Y., Cao, X., Wen, F., Hua, G., and Sun, J. Learning discriminative reconstructions for unsupervised outlier removal. In ICCV, 2015

work page 2015
[45]

Unsupervised deep embedding for clustering analysis

Xie, J., Girshick, R., and Farhadi, A. Unsupervised deep embedding for clustering analysis. In ICML, 2016

work page 2016
[46]

Towards k -means-friendly spaces: Simultaneous deep learning and clustering

Yang, B., Fu, X., Sidiropoulos, N., and Hong, M. Towards k -means-friendly spaces: Simultaneous deep learning and clustering. ICML, 2017

work page 2017
[47]

Provable self-representation based outlier detection in a union of subspaces

You, C., Robinson, D., and Vidal, R. Provable self-representation based outlier detection in a union of subspaces. In CVPR, 2017

work page 2017
[48]

and Komodakis, N

Zagoruyko, S. and Komodakis, N. Wide residual networks. In BMVC, 2016

work page 2016
[49]

Deconvolutional networks

Zeiler, M., Krishnan, D., Taylor, G., and Fergus, R. Deconvolutional networks. In CVPR, 2010

work page 2010
[50]

Deep structured energy based models for anomaly detection

Zhai, S., Cheng, Y., Lu, W., and Zhang, Z. Deep structured energy based models for anomaly detection. In ICML, 2016

work page 2016
[51]

and Pfaffenroth, R

Zhou, C. and Pfaffenroth, R. Anomaly detection with robust deep autoencoder. In KDD, 2017

work page 2017
[52]

Deep autoencoding Gaussian mixture model for unsupervised anomaly detection

Zong, B., Song, Q., Min, M., Cheng, W., Lumezanu, C., Cho, D., and Chen, H. Deep autoencoding Gaussian mixture model for unsupervised anomaly detection. In ICLR, 2018

work page 2018

[1] [1]

write newline

" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION format.date year duplicate empty "emp...

work page

[2] [2]

On the surprising behavior of distance metrics in high dimensional space

Aggarwal, C., Hinneburg, A., and Keim, D. On the surprising behavior of distance metrics in high dimensional space. In ICDT, 2001

work page 2001

[3] [3]

Distributed computation of persistent homology

Bauer, U., Kerber, M., and Reininghaus, J. Distributed computation of persistent homology. In ALENEX, 2014 a

work page 2014

[4] [4]

Clear and compress: Computing persistent homology in chunks

Bauer, U., Kerber, M., and Reininghaus, J. Clear and compress: Computing persistent homology in chunks. In Topological Methods in Data Analysis and Visualization III, pp.\ 103--117. Springer, 2014 b

work page 2014

[5] [5]

A topological regularizer for classifiers via persistent homology

Chen, C., Ni, X., Bai, Q., and Wang, Y. A topological regularizer for classifiers via persistent homology. In AISTATS, 2019

work page 2019

[6] [6]

R-FCN : Object detection via region-based fully convolutional networks

Dai, J., Li, Y., He, K., and Sun, J. R-FCN : Object detection via region-based fully convolutional networks. In NIPS, 2016

work page 2016

[7] [7]

Dualities in persistent (co)homology

de Silva , V., Morozov, D., and Vejdemo-Johansson, M. Dualities in persistent (co)homology. Inverse Problems, 27 0 (12): 0 124003, 2011

work page 2011

[8] [8]

Deng, J., Dong, W., Socher, R., Li, L.-J., Li, K., and Fei, L. F. Imagenet: A large-scale hierarchical image database. In CVPR, 2009

work page 2009

[9] [9]

SimBa : An efficient tool for approximating Rips -filtration persistence via simplicial batch-collapse

Dey, T., Shi, D., and Wang, Y. SimBa : An efficient tool for approximating Rips -filtration persistence via simplicial batch-collapse. In ESA, 2016

work page 2016

[10] [10]

and Harer, J

Edelsbrunner, H. and Harer, J. L. Computational Topology : An Introduction. American Mathematical Society, 2010

work page 2010

[11] [11]

and El-Yaniv, R

Goland, I. and El-Yaniv, R. Deep anomaly detection using geometric transformations. In NIPS, 2018

work page 2018

[12] [12]

Generating Sequences With Recurrent Neural Networks

Graves, A. Generating sequences with recurrent neural networks. CoRR, 2013. https://arxiv.org/abs/1308.0850

work page internal anchor Pith review Pith/arXiv arXiv 2013

[13] [13]

Deep residual learning for image recognition

He, K., Zhang, X., Ren, S., and Sun, J. Deep residual learning for image recognition. In CVPR, 2016

work page 2016

[14] [14]

and Gimpel, K

Hendrycks, D. and Gimpel, K. A baseline for detecting misclassified and out-of-distribution examples in neural networks. In ICLR, 2017

work page 2017

[15] [15]

Densely connected convolutional networks

Huang, G., Liu, Z., van der Maaten , L., and Weinberger, K. Densely connected convolutional networks. In CVPR, 2017

work page 2017

[16] [16]

and Yamada, M

Iwata, T. and Yamada, M. Multi-view anomaly detection via robust probabilistic latent variable models. In NIPS, 2016

work page 2016

[17] [17]

and Ba, J

Kingma, D. and Ba, J. Adam: A method for stochastic optimization. In ICLR, 2014

work page 2014

[18] [18]

and Welling, M

Kingma, D. and Welling, M. Auto-encoding variational Bayes . In ICLR, 2014

work page 2014

[19] [19]

and Hinton, G

Krizhevsky, A. and Hinton, G. Learning multiple layers of features from tiny images. Technical report, University of Toronto, 2009

work page 2009

[20] [20]

Krizhevsky, A., Sutskever, I., and Hinton, G. E. Imagenet classification with deep convolutional neural networks. In NIPS, 2012

work page 2012

[21] [21]

Training confidence-calibrated classifiers for detecting out-of-distribution samples

Lee, K., Lee, H., Lee, K., and Shin, J. Training confidence-calibrated classifiers for detecting out-of-distribution samples. In ICLR, 2018

work page 2018

[22] [22]

Enhancing the reliability of out-of-distribution image detection in neural networks

Liang, S., Y.Li, and Srikant, R. Enhancing the reliability of out-of-distribution image detection in neural networks. In ICLR, 2018

work page 2018

[23] [23]

SSD: single shot multibox detector

Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C.-Y., and Berg, A. SSD: single shot multibox detector. In ECCV, 2016

work page 2016

[24] [24]

and Frey, B

Makhzani, A. and Frey, B. k -sparse autoencoders. In ICLR, 2014

work page 2014

[25] [25]

Jaitly, J

Makhzani, A., annd N. Jaitly, J. S., and Goodfellow, I. Adversarial autoencoders. In ICLR, 2016

work page 2016

[26] [26]

Automatic differentiation in PyTorch

Paszke, A., Gross, S., Chintala, S., Chanan, G., Yang, E., DeVito, Z., Lin, Z., Demaison, A., Antiga, L., and Lerer, A. Automatic differentiation in PyTorch . In NIPS Autodiff WS, 2017

work page 2017

[27] [27]

A review of novelty detection

Pimentel, M., D.A.Clifton, Clifton, L., and Tarassenko, L. A review of novelty detection. Sig. Proc., 99: 0 215--249, 2014

work page 2014

[28] [28]

Persistent homology for learning densities with bounded support

Pokorny, F., Ek, C., Kjellstr\"om, H., and Kragic, D. Persistent homology for learning densities with bounded support. In NIPS, 2012 a

work page 2012

[29] [29]

Topological constraints and kernel-based density estimation

Pokorny, F., Ek, C., Kjellstr\"om, H., and Kragic, D. Topological constraints and kernel-based density estimation. In NIPS WS on Algebraic Topology and Machine Learning, 2012 b

work page 2012

[30] [30]

Unsupervised representation learning with deep convolutional generative adversarial networks

Radford, A., Metz, L., and Chintala, S. Unsupervised representation learning with deep convolutional generative adversarial networks. In ICLR, 2016

work page 2016

[31] [31]

Faster R-CNN: towards real-time object detection with region proposal networks

Ren, S., He, K., Girshick, R., and Sun, J. Faster R-CNN: towards real-time object detection with region proposal networks. In NIPS, 2015

work page 2015

[32] [32]

Contractive auto-encoders: Explicit inveriance during feature extraction

Rifai, S., Vincent, P., Muller, X., Glorot, X., and Bengio, Y. Contractive auto-encoders: Explicit inveriance during feature extraction. In ICML, 2011

work page 2011

[33] [33]

Deep one-class classification

Ruff, L., Vandermeulen, R., Goernitz, N., Deecke, L., Siddiqui, S., Bindern, A., M\"uller, E., and Kloft, M. Deep one-class classification. In ICML, 2018

work page 2018

[34] [34]

Learning representations by backpropagating errors

Rumelhart, D., Hinton, G., and Williams, R. Learning representations by backpropagating errors. Nature, 323: 0 533--536, 1986

work page 1986

[35] [35]

Adversarially learned one-class classifier for novelty detection

Sabokrou, M., Khalooei, M., Fathy, M., and Adeli, E. Adversarially learned one-class classifier for novelty detection. In CVPR, 2018

work page 2018

[36] [36]

Estimating the support of a highdimensional distribution

Sch\"olkof, B., Platt, J., Shawe-Taylor, J., Smola, A., and Williamson, R. Estimating the support of a highdimensional distribution. Neural computation, 13 0 (7): 0 1443–1471, 2001

work page 2001

[37] [37]

Sequence to sequence learning with neural networks

Sutskever, I., Vinyals, O., and Le, Q. Sequence to sequence learning with neural networks. In NIPS, 2014

work page 2014

[38] [38]

Metric entropy analogues of sum set theory

Tao, T. Metric entropy analogues of sum set theory. Online: https://bit.ly/2zRAKUy, 2014

work page 2014

[39] [39]

Java P lex: A research software package for persistent (co)homology

Tausz, A., Vejdemo-Johansson, M., and Adams, H. Java P lex: A research software package for persistent (co)homology. In ICMS, 2014

work page 2014

[40] [40]

and Duin, R

Tax, D. and Duin, R. Support vector data description. Machine learning, 54 0 (1): 0 45--66, 2004

work page 2004

[41] [41]

and Duin, R

Tax, D. and Duin, R. Growing multi-class classifiers with a reject option. Pattern Recognition Letters, 29: 0 1565--1570, 2008

work page 2008

[42] [42]

Wasserstein auto-encoders

Tolstikhin, I., Bousquet, O., Gelly, S., and Sch\"olkopf, B. Wasserstein auto-encoders. In ICLR, 2018

work page 2018

[43] [43]

Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion

Vincent, P., Larochele, H., Lajoie, I., Bengio, Y., and Manzagol, P.-A. Stacked denoising autoencoders: Learning useful representations in a deep network with a local denoising criterion. JMLR, 11: 0 3371--3408, 2010

work page 2010

[44] [44]

Learning discriminative reconstructions for unsupervised outlier removal

Xia, Y., Cao, X., Wen, F., Hua, G., and Sun, J. Learning discriminative reconstructions for unsupervised outlier removal. In ICCV, 2015

work page 2015

[45] [45]

Unsupervised deep embedding for clustering analysis

Xie, J., Girshick, R., and Farhadi, A. Unsupervised deep embedding for clustering analysis. In ICML, 2016

work page 2016

[46] [46]

Towards k -means-friendly spaces: Simultaneous deep learning and clustering

Yang, B., Fu, X., Sidiropoulos, N., and Hong, M. Towards k -means-friendly spaces: Simultaneous deep learning and clustering. ICML, 2017

work page 2017

[47] [47]

Provable self-representation based outlier detection in a union of subspaces

You, C., Robinson, D., and Vidal, R. Provable self-representation based outlier detection in a union of subspaces. In CVPR, 2017

work page 2017

[48] [48]

and Komodakis, N

Zagoruyko, S. and Komodakis, N. Wide residual networks. In BMVC, 2016

work page 2016

[49] [49]

Deconvolutional networks

Zeiler, M., Krishnan, D., Taylor, G., and Fergus, R. Deconvolutional networks. In CVPR, 2010

work page 2010

[50] [50]

Deep structured energy based models for anomaly detection

Zhai, S., Cheng, Y., Lu, W., and Zhang, Z. Deep structured energy based models for anomaly detection. In ICML, 2016

work page 2016

[51] [51]

and Pfaffenroth, R

Zhou, C. and Pfaffenroth, R. Anomaly detection with robust deep autoencoder. In KDD, 2017

work page 2017

[52] [52]

Deep autoencoding Gaussian mixture model for unsupervised anomaly detection

Zong, B., Song, Q., Min, M., Cheng, W., Lumezanu, C., Cho, D., and Chen, H. Deep autoencoding Gaussian mixture model for unsupervised anomaly detection. In ICLR, 2018

work page 2018