pith. sign in

arxiv: 2107.02287 · v1 · submitted 2021-07-05 · 🌌 astro-ph.GA · cs.CV· cs.LG

Morphological Classification of Galaxies in S-PLUS using an Ensemble of Convolutional Networks

Pith reviewed 2026-05-24 12:35 UTC · model grok-4.3

classification 🌌 astro-ph.GA cs.CVcs.LG
keywords galaxy morphologyconvolutional neural networksensemble learningS-PLUS surveydeep learningGalaxy Zoomorphological classificationelliptical galaxies
0
0 comments X

The pith

An ensemble of four convolutional networks classifies elliptical and spiral galaxies at 99 percent accuracy using S-PLUS images.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper shows that an ensemble of pre-trained convolutional models can classify galaxies into elliptical and spiral types from images alone. It draws training labels from the Galaxy Zoo project and applies the method to the first data release of the S-PLUS survey. The ensemble reaches approximately 99 percent accuracy on a held-out test sample, exceeding the performance of any single model. The goal is to replace subjective visual inspection with a repeatable, automatic procedure that still matches human-level results.

Core claim

The authors build an ensemble of four convolutional neural networks trained on S-PLUS galaxy images labeled by Galaxy Zoo volunteers and obtain an accuracy of approximately 99 percent on the test sample when the networks are pre-trained.

What carries the argument

An ensemble that combines the outputs of four pre-trained convolutional neural networks applied directly to galaxy images.

Load-bearing premise

The Galaxy Zoo visual classifications supply accurate and unbiased labels for training the networks.

What would settle it

A comparison of the ensemble outputs against independent expert classifications on a new sample of S-PLUS galaxies where the experts agree with one another.

read the original abstract

The universe is composed of galaxies that have diverse shapes. Once the structure of a galaxy is determined, it is possible to obtain important information about its formation and evolution. Morphologically classifying galaxies means cataloging them according to their visual appearance and the classification is linked to the physical properties of the galaxy. A morphological classification made through visual inspection is subject to biases introduced by subjective observations made by human volunteers. For this reason, systematic, objective and easily reproducible classification of galaxies has been gaining importance since the astronomer Edwin Hubble created his famous classification method. In this work, we combine accurate visual classifications of the Galaxy Zoo project with \emph {Deep Learning} methods. The goal is to find an efficient technique at human performance level classification, but in a systematic and automatic way, for classification of elliptical and spiral galaxies. For this, a neural network model was created through an Ensemble of four other convolutional models, allowing a greater accuracy in the classification than what would be obtained with any one individual. Details of the individual models and improvements made are also described. The present work is entirely based on the analysis of images (not parameter tables) from DR1 (www.datalab.noao.edu) of the Southern Photometric Local Universe Survey (S-PLUS). In terms of classification, we achieved, with the Ensemble, an accuracy of $\approx 99 \%$ in the test sample (using pre-trained networks).

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 0 minor

Summary. The manuscript describes an ensemble of four pre-trained convolutional neural networks applied to S-PLUS DR1 images for binary morphological classification of galaxies into elliptical and spiral types, using Galaxy Zoo volunteer visual classifications as labels, and reports an accuracy of approximately 99% on a held-out test sample.

Significance. If the reported accuracy is robust, the work would demonstrate a scalable, reproducible method for objective galaxy morphology classification that could be applied to large photometric surveys, addressing the subjectivity of human visual inspection. The ensemble of pre-trained networks is a standard technique, but the result's value depends on verification that the performance reflects morphological signal rather than label properties.

major comments (3)
  1. [Abstract] Abstract: the central claim of ≈99% accuracy on the test sample is presented without any information on the train/test split sizes or ratios, the architectures or fine-tuning details of the four base CNNs, the ensemble aggregation rule, training hyperparameters, or statistical significance of the accuracy figure, preventing assessment of whether the result is reproducible or load-bearing.
  2. [Abstract] Abstract/Methods: the manuscript states that the work is 'entirely based on' Galaxy Zoo visual classifications treated as ground truth, yet reports no volunteer agreement fractions, label uncertainty estimates, or cross-checks against expert visual classifications or independent indicators (e.g., spectroscopic or structural parameters); if label error rates are a few percent due to inter-observer variance or image-quality biases, the headline accuracy cannot be interpreted at face value and may partly reflect fitting to label noise.
  3. [Abstract] The assumption that Galaxy Zoo labels provide accurate and unbiased training/test targets is load-bearing for the 99% accuracy claim, but no quantitative validation of this assumption is supplied, leaving open the possibility that the ensemble performance is inflated by systematic label properties rather than learned morphological features.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for the careful reading and constructive feedback. We agree that the abstract requires expansion for reproducibility and that the reliance on Galaxy Zoo labels merits explicit discussion of their known properties. We will revise the abstract and add a short limitations paragraph in the methods or discussion section. No standing objections apply as all points can be addressed through clarification and added context from the existing analysis.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the central claim of ≈99% accuracy on the test sample is presented without any information on the train/test split sizes or ratios, the architectures or fine-tuning details of the four base CNNs, the ensemble aggregation rule, training hyperparameters, or statistical significance of the accuracy figure, preventing assessment of whether the result is reproducible or load-bearing.

    Authors: We agree the abstract is too terse. The full manuscript contains the requested details (train/test split, the four pre-trained CNN architectures, fine-tuning procedure, ensemble aggregation by probability averaging, hyperparameters, and accuracy with binomial confidence interval). In revision we will move a concise summary of these elements into the abstract so the central claim can be evaluated without reading the methods. revision: yes

  2. Referee: [Abstract] Abstract/Methods: the manuscript states that the work is 'entirely based on' Galaxy Zoo visual classifications treated as ground truth, yet reports no volunteer agreement fractions, label uncertainty estimates, or cross-checks against expert visual classifications or independent indicators (e.g., spectroscopic or structural parameters); if label error rates are a few percent due to inter-observer variance or image-quality biases, the headline accuracy cannot be interpreted at face value and may partly reflect fitting to label noise.

    Authors: The referee correctly notes that the current text does not quantify GZ label reliability. We will add a sentence citing the GZ literature on volunteer agreement rates for clear elliptical/spiral cases and will explicitly state that the reported accuracy is measured against the same volunteer labels used for training. We will also add a brief limitations paragraph acknowledging that residual label noise may contribute to the measured performance. revision: yes

  3. Referee: [Abstract] The assumption that Galaxy Zoo labels provide accurate and unbiased training/test targets is load-bearing for the 99% accuracy claim, but no quantitative validation of this assumption is supplied, leaving open the possibility that the ensemble performance is inflated by systematic label properties rather than learned morphological features.

    Authors: We accept that the manuscript should address this assumption directly rather than leave it implicit. In revision we will insert a short paragraph noting that GZ labels have been validated against expert classifications in the original GZ papers and that our held-out test set is drawn from the same labeling process; we will also state that any systematic label bias would affect both training and test sets equally. This does not change the empirical result but makes the interpretation explicit. revision: yes

Circularity Check

0 steps flagged

No circularity: accuracy evaluated on held-out test set

full rationale

The paper's central result is an empirical accuracy of ≈99% measured on a test sample distinct from the training data. This is a standard supervised learning evaluation against external Galaxy Zoo labels and does not reduce by construction to the training inputs, fitted parameters, or any self-citation chain. No load-bearing step matches the enumerated circularity patterns.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 0 invented entities

Abstract-only review limits identification of all parameters; the core reliance is on external labeled dataset and pre-trained networks.

free parameters (2)
  • Base model selection and pre-training weights
    The paper uses four pre-trained convolutional models but specifics are not in abstract.
  • Ensemble aggregation rule
    Method for combining the four models' outputs to reach the reported accuracy is unspecified.
axioms (1)
  • domain assumption Galaxy Zoo labels serve as reliable ground truth for supervised training
    Abstract explicitly relies on these classifications as the basis for the models.

pith-pipeline@v0.9.0 · 5818 in / 1162 out tokens · 67096 ms · 2026-05-24T12:35:41.865880+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

What do these tags mean?
matches
The paper's claim is directly supported by a theorem in the formal canon.
supports
The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends
The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses
The paper appears to rely on the theorem as machinery.
contradicts
The paper's claim conflicts with a theorem or certificate in the canon.
unclear
Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

77 extracted references · 77 canonical work pages · 1 internal anchor

  1. [1]

    Morphological Classification of Galaxies in S-PLUS using an Ensemble of Convolutional Networks

    INTRODUC ¸ ˜AO Classificac ¸˜ao morfol ´ogica ´e a categorizac ¸˜ao das gal ´axias conforme sua forma. Quando esta classificac ¸ ˜ao ´e baseada na inspec ¸˜ao visual das imagens, elementos subjetivos s ˜ao arXiv:2107.02287v1 [astro-ph.GA] 5 Jul 2021 2 N. M. Cardoso, G. B. O. Schwarz, L. O. Dias, C. R. Bom, L. Sodr´e Jr. e C. Mendes de Oliveira agregados. Em...

  2. [2]

    Para isso, primeiramente ser˜ao apresentados os dados utilizados e como os prepara- mos

    CONJUNTO DE DADOS Para este trabalho queremos desenvolver uma t ´ecnica efi- ciente e automatizada para a classificac ¸˜ao morfol ´ogica de gal´axias usando Deep Learning . Para isso, primeiramente ser˜ao apresentados os dados utilizados e como os prepara- mos. 2.1. Aquisic ¸ ˜ao dos dados A imagem da gal ´axia, com sua respectiva classificac ¸˜ao morfol´ogi...

  3. [3]

    noc ¸˜ao

    M ´ETODOS DE DEEP LEARNING Nesta sec ¸˜ao, explicamos sobre a preparac ¸˜ao dos dados e como fizemos o aumento artificial dos dados para obter me- lhores resultados na avaliac ¸˜ao dos modelos. Em seguida, descrevemos as redes convolucionais utilizadas: VGG, In- ception Resnet, EfficientNet e DenseNet. Introduzimos o conceito de ( Ensemble) e descrevemos as ...

  4. [4]

    Na Sec ¸˜ao 4.1 ser˜ao apresentadas as m´etricas utilizadas, suas ex- press˜oes e o que elas avaliam

    RESULTADOS Existem v ´arias m ´etricas que ajudam a definir a capaci- dade de classificac ¸˜ao de um modelo de deep learning. Na Sec ¸˜ao 4.1 ser˜ao apresentadas as m´etricas utilizadas, suas ex- press˜oes e o que elas avaliam. Na Sec ¸ ˜ao 4.2, ser ´a com- parada a performance do modelo entre o conjunto de trei- namento e de validac ¸˜ao usando metricas qu...

  5. [5]

    DISCUSS ˜AO E CONCLUS ˜AO Em contraste com as abordagens de aprendizagem co- muns, o que fizemos neste trabalho foi construir modelos a partir dos dados de treinamento, escolher entre os melhores modelos e combin´a-los. O resultado principal deste trabalho ´e, ent ˜ao, a combinac ¸˜ao das predic ¸˜oes de v ´arias redes com seus respectivos melhores hiperpa...

  6. [6]

    DISPONIBILIDADE DOS DADOS N´os disponibilizamos publicamente os cat ´alogos de classificac ¸˜ao bem como os modelos de Deep Learning na p´agina https://natanael.net

  7. [7]

    AGRADECIMENTOS Os modelos foram implementados usando v ´arios proje- tos open-source, como a linguagem de programac ¸˜ao Python [59], o Trilogy [60], as bibliotecas de deep learning Tensor- flow [61] e Keras [62] e outras bibliotecas de computac ¸ ˜ao cient´ıfica [63–70]. Este projeto tamb´em fez uso de servic ¸os online, como o SkyServer7 e o S-PLUS Cloud8...

  8. [8]

    E. P. Hubble. Extragalactic nebulae. Astrophysical Journal, 64:321–369, Dec 1926

  9. [9]

    Galaxy Zoo: Morphological Classification and Citizen Science, pages 213–236

    Lucy Fortson, Karen Masters, Robert Nichol, et al. Galaxy Zoo: Morphological Classification and Citizen Science, pages 213–236. 2012

  10. [10]

    Hart, Steven P

    Ross E. Hart, Steven P. Bamford, Kyle W. Willett, et al. Ga- laxy Zoo: comparing the demographics of spiral arm number and a new method for correcting redshift bias. Monthly Noti- ces of the Royal Astronomical Society, 461(4):3663–3682, 07 2016

  11. [11]

    Morphological classification of galaxies using photometric pa- rameters: The concentration index versus the coarseness pa- rameter

    Chisato Yamauchi, Shin-ichi Ichikawa, Mamoru Doi, et al. Morphological classification of galaxies using photometric pa- rameters: The concentration index versus the coarseness pa- rameter. The Astronomical Journal , 130(4):1545–1557, Oct 2005

  12. [12]

    Deep Learning

    Ian Goodfellow, Yoshua Bengio, and Aaron Courville. Deep Learning. MIT Press, 2016

  13. [13]

    Yann LeCun, Yoshua Bengio, and Geoffrey E. Hinton. Deep learning. Nature, 521(7553):436–444, 2015

  14. [14]

    Densecap: Fully convolutional localization networks for dense captio- ning

    Justin Johnson, Andrej Karpathy, and Li Fei-Fei. Densecap: Fully convolutional localization networks for dense captio- ning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 2016

  15. [15]

    Matconvnet: Convolutio- nal neural networks for matlab

    Andrea Vedaldi and Karel Lenc. Matconvnet: Convolutio- nal neural networks for matlab. In Proceedings of the 23rd ACM International Conference on Multimedia, MM ’15, page 689–692, New York, NY , USA, 2015. Association for Com- puting Machinery

  16. [16]

    The southern photometric local universe survey (s-plus): impro- ved seds, morphologies, and redshifts with 12 optical fil- ters

    C Mendes de Oliveira, T Ribeiro, W Schoenell, et al. The southern photometric local universe survey (s-plus): impro- ved seds, morphologies, and redshifts with 12 optical fil- ters. Monthly Notices of the Royal Astronomical Society , 489(1):241–267, Aug 2019

  17. [17]

    Logistic regression in rare events data

    Gary King and Langche Zeng. Logistic regression in rare events data. Political Analysis, 9:137–163, Spring 2001

  18. [18]

    Clash: pre- cise new constraints on the mass profile of the galaxy cluster a2261

    Dan Coe, Keiichi Umetsu, Adi Zitrin, Megan Donahue, Eli- nor Medezinski, Marc Postman, Mauricio Carrasco, Timo An- guita, Margaret J Geller, Kenneth J Rines, et al. Clash: pre- cise new constraints on the mass profile of the galaxy cluster a2261. The Astrophysical Journal, 757(1):22, 2012

  19. [19]

    C. R. Bom, A. Cortesi, G. Lucatelli, et al. Deep learning as- sessment of galaxy morphology in s-plus datarelease 1, 2021

  20. [20]

    Effective training of a neural network character classifier for word re- cognition

    Larry Yaeger, Richard Lyon, and Brandyn Webb. Effective training of a neural network character classifier for word re- cognition. In Proceedings of the 9th International Conference on Neural Information Processing Systems , NIPS’96, page 807–813, Cambridge, MA, USA, 1996. MIT Press

  21. [21]

    Antialiasing

    Tom McReynolds and David Blythe. Antialiasing. In Advan- ced Graphics Programming Using OpenGL , pages 169–184. Elsevier, 2005

  22. [22]

    Alan C. Bovik. Basic gray level image processing. In The Essential Guide to Image Processing , pages 43–68. Elsevier, 2009

  23. [23]

    Simard, Dave Steinkraus, and John C

    Patrice Y . Simard, Dave Steinkraus, and John C. Platt. Best practices for convolutional neural networks applied to visual document analysis. In Proceedings of the Seventh Interna- tional Conference on Document Analysis and Recognition - Volume 2, ICDAR ’03, page 958, USA, 2003. IEEE Computer Society

  24. [24]

    Deep Learning with Python

    Francois Chollet. Deep Learning with Python . Manning Pu- blications Co., USA, 1st edition, 2017

  25. [25]

    Electrocardiogram classification by modified EfficientNet with data augmentation

    Naoki Nonaka and Jun Seita. Electrocardiogram classification by modified EfficientNet with data augmentation. In 2020 Computing in Cardiology Conference (CinC) . Computing in Cardiology, December 2020

  26. [26]

    Classification of protein crystallization images using EfficientNet with data augmentation

    David William Edwards II and Imren Dinc. Classification of protein crystallization images using EfficientNet with data augmentation. In CSBio '20: Proceedings of the Eleventh International Conference on Computational Systems-Biology and Bioinformatics. ACM, November 2020

  27. [27]

    Jude He- manth

    Ansh Mittal, Anu Soorya, Preeti Nagrath, and D. Jude He- manth. Data augmentation based morphological classification of galaxies using deep convolutional neural network. Earth Science Informatics, 13(3):601–617, December 2019

  28. [28]

    Very deep convolu- tional networks for large-scale image recognition, 2015

    Karen Simonyan and Andrew Zisserman. Very deep convolu- tional networks for large-scale image recognition, 2015

  29. [29]

    Berg, and Li Fei-Fei

    Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause, San- jeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C. Berg, and Li Fei-Fei. ImageNet Large Scale Visual Recognition Chal- lenge. International Journal of Computer Vision (IJCV) , 115(3):211–252, 2015

  30. [30]

    Malicious software classification using VGG16 deep neural network’s bottleneck features

    Edmar Rezende, Guilherme Ruppert, Tiago Carvalho, Anto- nio Theophilo, Fabio Ramos, and Paulo de Geus. Malicious software classification using VGG16 deep neural network’s bottleneck features. In Advances in Intelligent Systems and Computing, pages 51–59. Springer International Publishing, 2018

  31. [31]

    VGG16 for plant image classification with transfer learning and data augmen- tation

    Mohamad Aqib Haqmi Abas, Nurlaila Ismail, Ahmad Ih- san Mohd Yassin, and Mohd Nasir Taib. VGG16 for plant image classification with transfer learning and data augmen- tation. International Journal of Engineering & Technology , 7(4.11):90, October 2018

  32. [32]

    Classification of brain tumor by combination of pre-trained vgg16 cnn

    Ouiza Nait Belaid and Malik Loudini. Classification of brain tumor by combination of pre-trained vgg16 cnn. Journal of Information Technology Management, 12(2):13–25, 2020

  33. [33]

    Inception-v4, inception-resnet and the impact of resi- dual connections on learning, 2016

    Christian Szegedy, Sergey Ioffe, Vincent Vanhoucke, and Alex Alemi. Inception-v4, inception-resnet and the impact of resi- dual connections on learning, 2016

  34. [34]

    Going deeper with convo- lutions

    Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Va- nhoucke, and Andrew Rabinovich. Going deeper with convo- lutions. In 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 1–9, 2015

  35. [35]

    Deep residual learning for image recognition

    Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. In 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pages 770–778, 2016

  36. [36]

    Very deep con- volutional neural networks for complex land cover mapping using multispectral remote sensing imagery

    Masoud Mahdianpari, Bahram Salehi, Mohammad Rezaee, Fariba Mohammadimanesh, and Yun Zhang. Very deep con- volutional neural networks for complex land cover mapping using multispectral remote sensing imagery. Remote Sensing, 10(7):1119, July 2018

  37. [37]

    Classification of breast masses on ultrasound shear wave elas- tography using convolutional neural networks

    Tomoyuki Fujioka, Leona Katsuta, Kazunori Kubota, et al. Classification of breast masses on ultrasound shear wave elas- tography using convolutional neural networks. Ultrasonic Imaging, 42(4-5):213–220, June 2020

  38. [38]

    Deep learning for the classification of small-cell and non- small-cell lung cancer

    Mark Kriegsmann, Christian Haag, Cleo-Aron Weis, et al. Deep learning for the classification of small-cell and non- small-cell lung cancer. Cancers, 12(6):1604, June 2020

  39. [39]

    Mingxing Tan and Quoc V . Le. Efficientnet: Rethinking model scaling for convolutional neural networks, 2020

  40. [40]

    Pan Zhang, Ling Yang, and Daoliang Li. EfficientNet-b4- ranger: A novel method for greenhouse cucumber disease re- cognition under natural complex environment.Computers and Electronics in Agriculture, 176:105652, September 2020

  41. [41]

    Weinberger

    Gao Huang, Zhuang Liu, Laurens van der Maaten, and Kilian Q. Weinberger. Densely connected convolutional networks, 2018

  42. [42]

    Protein 18 N

    Zhong Li, Yuele Lin, Arne Elofsson, and Yuhua Yao. Protein 18 N. M. Cardoso, G. B. O. Schwarz, L. O. Dias, C. R. Bom, L. Sodr´e Jr. e C. Mendes de Oliveira contact map prediction based on resnet and densenet. BioMed Research International, 2020, 2020

  43. [43]

    Alzheimer’s disease early detection using a low cost three-dimensional densenet-121 architecture

    Braulio Solano-Rojas, Ricardo Villal ´on-Fonseca, and Gabri- ela Mar´ın-Ravent´os. Alzheimer’s disease early detection using a low cost three-dimensional densenet-121 architecture. In Lecture Notes in Computer Science, pages 3–15. Springer In- ternational Publishing, 2020

  44. [44]

    Classification of breast cancer histopathological images using interleaved densenet with senet (idsnet)

    Xia Li, Xi Shen, Yongxia Zhou, Xiuhui Wang, and Tie-Qiang Li. Classification of breast cancer histopathological images using interleaved densenet with senet (idsnet). PLoS ONE , 15, 2020

  45. [45]

    Densenet-201-based deep neural network with composite learning factor and pre- computation for multiple sclerosis classification

    Shui-Hua Wang and Yu-Dong Zhang. Densenet-201-based deep neural network with composite learning factor and pre- computation for multiple sclerosis classification. ACM Trans. Multimedia Comput. Commun. Appl., 16(2s), June 2020

  46. [46]

    Rolfe, and Geoffrey I

    Yakov Frayman, Bernard F. Rolfe, and Geoffrey I. Webb. Sol- ving regression problems using competitive ensemble models. In Lecture Notes in Computer Science, pages 511–522. Sprin- ger Berlin Heidelberg, 2002

  47. [47]

    L. K. Hansen and P. Salamon. Neural network ensembles. IEEE Transactions on Pattern Analysis and Machine Intelli- gence, 12(10):993–1001, 1990

  48. [48]

    Multi-label classification of fundus images with EfficientNet

    Jing Wang, Liu Yang, Zhanqiang Huo, Weifeng He, and Junwei Luo. Multi-label classification of fundus images with EfficientNet. IEEE Access, 8:212499–212508, 2020

  49. [49]

    Identifying melanoma ima- ges using efficientnet ensemble: Winning solution to the siim- isic melanoma classification challenge, 2020

    Qishen Ha, Bo Liu, and Fuxu Liu. Identifying melanoma ima- ges using efficientnet ensemble: Winning solution to the siim- isic melanoma classification challenge, 2020

  50. [50]

    Meta- learning for anomaly classification with set equivariant networks: Application in the milky way, 2020

    Ademola Oladosu, Tony Xu, Philip Ekfeldt, et al. Meta- learning for anomaly classification with set equivariant networks: Application in the milky way, 2020

  51. [51]

    Cryptographic limitations on learning boolean formulae and finite automata

    Michael Kearns and Leslie Valiant. Cryptographic limitations on learning boolean formulae and finite automata. J. ACM, 41(1):67–95, January 1994

  52. [52]

    Schapire

    Robert E. Schapire. The strength of weak learnability. Ma- chine Learning, 5(2):197–227, June 1990

  53. [53]

    Bagging predictors

    Leo Breiman. Bagging predictors. Machine Learning , 24(2):123–140, August 1996

  54. [54]

    David H. Wolpert. Stacked generalization. Neural Networks, 5(2):241–259, 1992

  55. [55]

    Stacked regressions

    Leo Breiman. Stacked regressions. Machine Learning , 24(1):49–64, July 1996

  56. [56]

    Linearly combining den- sity estimators via stacking

    Padhraic Smyth and David Wolpert. Linearly combining den- sity estimators via stacking. Machine Learning, 36(1/2):59– 83, 1999

  57. [57]

    Training stochastic model recognition algorithms as networks can lead to maximum mutual information esti- mation of parameters

    John Bridle. Training stochastic model recognition algorithms as networks can lead to maximum mutual information esti- mation of parameters. In D. Touretzky, editor, Advances in Neural Information Processing Systems , volume 2. Morgan- Kaufmann, 1990

  58. [58]

    Kingma and Jimmy Ba

    Diederik P. Kingma and Jimmy Ba. Adam: A method for stochastic optimization, 2017

  59. [59]

    Incorporating nesterov momentum into

    Timothy Dozat. Incorporating nesterov momentum into. 2015

  60. [60]

    On the variance of the adaptive learning rate and beyond, 2020

    Liyuan Liu, Haoming Jiang, Pengcheng He, Weizhu Chen, Xi- aodong Liu, Jianfeng Gao, and Jiawei Han. On the variance of the adaptive learning rate and beyond, 2020

  61. [61]

    On the variance of the adaptive learning rate and beyond, 2012

    Geoffrey Hinton, Nitish Srivastava, and Kevin Swersky. On the variance of the adaptive learning rate and beyond, 2012

  62. [62]

    Dropout: A simple way to prevent neural networks from overfitting

    Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. Dropout: A simple way to prevent neural networks from overfitting. Journal of Ma- chine Learning Research, 15(56):1929–1958, 2014

  63. [63]

    Hinton, Nitish Srivastava, Alex Krizhevsky, et al

    Geoffrey E. Hinton, Nitish Srivastava, Alex Krizhevsky, et al. Improving neural networks by preventing co-adaptation of fe- ature detectors, 2012

  64. [64]

    The meaning and use of the area under a receiver operating characteristic (ROC) curve

    J A Hanley and B J McNeil. The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radio- logy, 143(1):29–36, April 1982

  65. [65]

    An introduction to ROC analysis

    Tom Fawcett. An introduction to ROC analysis. Pattern Re- cognition Letters, 27(8):861–874, June 2006

  66. [66]

    Interactively testing remote servers using the python programming language

    Guido van Rossum and Jelke de Boer. Interactively testing remote servers using the python programming language. CWi Quarterly, 4(4):283–303, 1991

  67. [67]

    Dan Coe. Trilogy. https://www.stsci.edu/˜dcoe/ trilogy/Intro.html, 2012

  68. [68]

    Tensor- Flow: Large-scale machine learning on heterogeneous sys- tems, 2015

    Mart ´ın Abadi, Ashish Agarwal, Paul Barham, et al. Tensor- Flow: Large-scale machine learning on heterogeneous sys- tems, 2015. Software available from tensorflow.org

  69. [69]

    Keras.https://keras.io, 2015

    Franc ¸ois Chollet et al. Keras.https://keras.io, 2015

  70. [70]

    A. M. Price-Whelan, B. M. Sip ˝ocz, H. M. G¨unther, et al. The astropy project: Building an open-science project and status of the v2.0 core package. The Astronomical Journal, 156(3):123, August 2018

  71. [71]

    Pedregosa, G

    F. Pedregosa, G. Varoquaux, A. Gramfort, et al. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12:2825–2830, 2011

  72. [72]

    Sch ¨onberger, Juan Nunez- Iglesias, et al

    St ´efan van der Walt, Johannes L. Sch ¨onberger, Juan Nunez- Iglesias, et al. scikit-image: image processing in python. Pe- erJ, 2:e453, June 2014

  73. [73]

    Harris, K

    Charles R. Harris, K. Jarrod Millman, St’efan J. van der Walt, et al. Array programming with NumPy. Nature, 585(7825):357–362, September 2020

  74. [74]

    Oliphant, et al

    Pauli Virtanen, Ralf Gommers, Travis E. Oliphant, et al. SciPy 1.0: Fundamental Algorithms for Scientific Computing in Python. Nature Methods, 17:261–272, 2020

  75. [75]

    Perez and B

    F. Perez and B. E. Granger. Ipython: A system for interac- tive scientific computing. Computing in Science Engineering, 9(3):21–29, 2007

  76. [76]

    Data Structures for Statistical Computing in Python

    Wes McKinney. Data Structures for Statistical Computing in Python. In St ´efan van der Walt and Jarrod Millman, editors, Proceedings of the 9th Python in Science Conference , pages 56 – 61, 2010

  77. [77]

    J. D. Hunter. Matplotlib: A 2d graphics environment. Com- puting in Science Engineering, 9(3):90–95, 2007