Searching for Hot Subdwarf Stars in LAMOST DR1-II. Pure spectroscopic identification method for hot subdwarfs
Pith reviewed 2026-05-25 13:08 UTC · model grok-4.3
The pith
Hierarchical extreme learning machine identifies hot subdwarf stars from LAMOST spectra alone.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The HELM algorithm, trained suitably on spectral data, reliably identifies hot subdwarf stars in LAMOST DR1 from spectroscopy alone, producing a sample of 56 stars whose derived parameters confirm the two helium sequences in the Teff-log(nHe/nH) plane.
What carries the argument
The hierarchical extreme learning machine (HELM) algorithm that classifies objects by operating directly on observed spectroscopy to isolate spectral properties.
If this is right
- HELM works without supplementary photometric data for classification.
- The same trained method applies to searching for other objects with clear spectral features.
- The sample contains five He-rich stars with log(nHe/nH) > -1 and 51 He-poor stars.
- The two helium sequences reported by Edelmann et al. appear in the new data.
Where Pith is reading between the lines
- Later LAMOST data releases could be processed with the same HELM setup to expand the known hot subdwarf population.
- Pure spectral selection may avoid biases that photometric pre-selection introduces in other surveys.
- The algorithm could be retrained on different wavelength ranges or resolution to target rarer subtypes.
Load-bearing premise
The training set must contain spectral examples representative of hot subdwarfs versus other stars present in the LAMOST survey.
What would settle it
Follow-up high-resolution spectroscopy or independent classification showing that a large fraction of the 56 candidates lack the atmospheric parameters of hot subdwarfs.
Figures
read the original abstract
Employing a new machine learning method, named hierarchical extreme learning machine (HELM) algorithm, we identified 56 hot subdwarf stars in the first data release (DR1) of the Large Sky Area Multi-Object Fibre Spectroscopic Telescope (LAMOST) survey. The atmospheric parameters of the stars are obtained by fitting the profiles of hydrogen (H) Balmer lines and helium (He) lines with synthetic spectra calculated from non-Local Thermodynamic Equilibrium (NLTE) model atmospheres. Five He-rich hot subdwarf stars were found in our sample with their log(nHe/nH) > -1 , while 51 stars are He-poor sdB, sdO and sdOB stars. We also confirmed the two He sequences of hot subdwarf stars found by Edelmann et al. (2003) in Teff - log(nHe/nH) diagram. The HELM algorithm works directly on the observed spectroscopy and is able to filter out spectral properties without supplementary photometric data. The results presented in this study demonstrate that the HELM algorithm is a reliable method to search for hot subdwarf stars after a suitable training is performed, and it is also suitable to search for other objects which have obvious features in their spectra or images.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript applies the hierarchical extreme learning machine (HELM) algorithm to LAMOST DR1 spectra to identify 56 hot subdwarf stars without photometric data. Atmospheric parameters are derived by fitting Balmer and helium line profiles with NLTE synthetic spectra, yielding 5 He-rich stars (log(nHe/nH) > -1) and 51 He-poor sdB/sdO/sdOB stars. The work confirms the two helium sequences previously reported by Edelmann et al. (2003) in the Teff-log(nHe/nH) plane and asserts that HELM is reliable for hot subdwarfs (and other objects with clear spectral features) after suitable training.
Significance. If the HELM classifications are shown to be robust, the method supplies a photometry-independent route to enlarge samples of hot subdwarfs in large spectroscopic surveys, enabling statistical studies of their formation channels and atmospheric evolution. The confirmation of the two He sequences is consistent with earlier work but does not constitute a new result.
major comments (2)
- [Abstract / Methods] Abstract and §3 (or equivalent methods section): The central claim that HELM is a reliable classifier after suitable training is unsupported by any reported details on training-set construction (size, selection of known sdB/sdO versus LAMOST contaminants, class balance) or quantitative performance (accuracy, precision, recall, cross-validation scores, or false-positive rate on held-out LAMOST-like spectra). This information is load-bearing for the reliability of the 56 identifications.
- [Abstract / Results] Abstract and results section: No cross-check of the HELM-selected candidates against independent hot-subdwarf catalogs or any estimate of contamination rate is presented. The subsequent NLTE parameter fitting occurs after classification and therefore cannot validate the upstream HELM decisions; if the training distribution differs from the survey in S/N, wavelength coverage, or contaminant mix, the identifications rest on an untested assumption.
minor comments (1)
- [Abstract] Abstract: the phrase 'suitable training' is repeated without elaboration; a one-sentence summary of training-set provenance would improve clarity.
Simulated Author's Rebuttal
We thank the referee for the constructive comments, which highlight areas where the manuscript can be strengthened. We respond to each major comment below and will revise the manuscript to address the concerns.
read point-by-point responses
-
Referee: [Abstract / Methods] Abstract and §3 (or equivalent methods section): The central claim that HELM is a reliable classifier after suitable training is unsupported by any reported details on training-set construction (size, selection of known sdB/sdO versus LAMOST contaminants, class balance) or quantitative performance (accuracy, precision, recall, cross-validation scores, or false-positive rate on held-out LAMOST-like spectra). This information is load-bearing for the reliability of the 56 identifications.
Authors: We agree that the current manuscript does not provide sufficient detail on the HELM training procedure to fully support the reliability claim. In the revised version, we will add a dedicated subsection to the methods describing the training-set construction (including size, selection of known hot subdwarfs and LAMOST contaminants, and class balance) along with quantitative performance metrics such as accuracy, precision, recall, and cross-validation scores. This will directly address the load-bearing nature of this information for the 56 identifications. revision: yes
-
Referee: [Abstract / Results] Abstract and results section: No cross-check of the HELM-selected candidates against independent hot-subdwarf catalogs or any estimate of contamination rate is presented. The subsequent NLTE parameter fitting occurs after classification and therefore cannot validate the upstream HELM decisions; if the training distribution differs from the survey in S/N, wavelength coverage, or contaminant mix, the identifications rest on an untested assumption.
Authors: The referee correctly identifies that no independent cross-check or contamination-rate estimate is included, and that the downstream NLTE fitting cannot validate the HELM classification step. In revision we will perform and report a cross-match against existing hot-subdwarf catalogs to quantify overlap and provide an estimate of contamination. We will also add an explicit discussion of the assumptions regarding training versus survey distributions (S/N, wavelength coverage, and contaminant mix) and any associated limitations. revision: yes
Circularity Check
No significant circularity detected in derivation chain
full rationale
The paper applies the HELM algorithm (described as a new machine learning method) to LAMOST spectra after suitable training to select 56 candidates, then fits atmospheric parameters via NLTE synthetic spectra on those candidates and confirms known He sequences from external literature (Edelmann et al. 2003). No equations, self-definitional relations, fitted inputs renamed as predictions, or load-bearing self-citations appear in the text. The classification step is independent of the downstream parameter fits, and the method is presented as externally applicable rather than reducing to its own outputs by construction.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Beers, Timothy C., Preston, George W., Shectman, Stephen A., et al. 1992, AJ, 103, 267
work page 1992
-
[2]
M., Cassisi, S., D’Antona, F., et al
Brown, T. M., Cassisi, S., D’Antona, F., et al. 2016, ApJ, 822, 44
work page 2016
-
[3]
Bu, Yude., Lei, Zhenxin., Zhao, Gang., et al. 2017, ApJS, 233, 2 (Paper I)
work page 2017
- [4]
-
[5]
Chen, Xuefei., Han, Zhanwen., Deca, Jan., et al. 2013, MNRAS, 434, 186
work page 2013
- [6]
-
[7]
M., Morales-Rueda, L., Marsh, T
Copperwheat, C. M., Morales-Rueda, L., Marsh, T. R., et al. 2011, MNRAS, 415, 1381
work page 2011
-
[8]
Cui, Xiang-Qun., Zhao, Yong-Heng., Chu, Yao-Quan., et al. 2012, RAA, 12, 1197
work page 2012
-
[9]
Dorman, Ben., Rood, Robert T., & O’Connell, Robert W. 1993, ApJ, 419, 596
work page 1993
- [10]
-
[11]
Edelmann, H., Heber, U., Hagen, H.-J., et al. 2003, A&A, 400, 939 20
work page 2003
-
[12]
Gaia Data Release 2. Summary of the contents and survey properties
Eisenstein, Daniel J., Liebert, James., Harris, Hugh C., et al. 2006, ApJS, 167, 40 Gaia Collaboration, Brown, A., Vallenari, A., et al. 2018a, arXiv:1804.09365 Gaia Collaboration, Babusiaux, C., van Leeuwen, F., et al. et al. 2018b, arXiv:1804.09378
work page internal anchor Pith review Pith/arXiv arXiv 2006
- [13]
-
[14]
Geier, S., Frst, F., Ziegerer, E., et al. 2015, Science, 347, 1126
work page 2015
-
[15]
Geier, S., Østensen, R. H., N´emeth, P., et al. 2017, A&A, 600, 50
work page 2017
- [16]
-
[17]
Han, Z., Podsiadlowski, Ph., Maxted, P. F. L., et al. 2002, MNRAS, 336, 449
work page 2002
-
[18]
Han, Z., Podsiadlowski, Ph., Maxted, P. F. L., et al. 2003, MNRAS, 341, 669
work page 2003
-
[19]
Han, Z., Podsiadlowski, Ph., & Lynas-Gray, A. E. 2007, MNRAS, 380, 1098
work page 2007
- [20]
- [21]
-
[22]
Huang, G., Zhu, Q., & Siew, C. 2006, Neurocomputing, 70, 489
work page 2006
- [23]
-
[24]
A brief introductory guide to TLUSTY and SYNSPEC
Hubeny, I., & Lanz, T. 2017, arXiv:1706.01859
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[25]
Kawka, A., Vennes, S., O’Toole, S., et al. 2015, MNRAS, 450, 3514
work page 2015
-
[26]
O., Pelisoli, I., Koester, D., et al
Kepler, S. O., Pelisoli, I., Koester, D., et al. 2015, MNRAS, 446, 4078
work page 2015
-
[27]
O., Pelisoli, I., Koester, D., et al
Kepler, S. O., Pelisoli, I., Koester, D., et al. 2016, MNRAS, 455, 3413
work page 2016
- [28]
- [29]
-
[30]
Lei, Zhenxin., Chen, Xuemei., Zhang, Fenghui., et al. 2015, MNRAS, 449, 2741
work page 2015
-
[31]
Lei, Zhenxin., Zhao, Gang., Zeng, Aihua., et al. 2016, MNRAS, 463, 3449
work page 2016
- [32]
-
[33]
Luo, A.-Li., Zhao, Yong-Heng., Zhao, Gang., et al. 2015, RAA, 15, 1095
work page 2015
-
[34]
Luo, Yang-Ping., N´emeth, P., Liu, Chao., et al. 2016, ApJ, 818, 202
work page 2016
-
[35]
2014, Mathematical Problems in Engineering, 2014, 1
Mao, L., Zhang, L., Liu, X., et al. 2014, Mathematical Problems in Engineering, 2014, 1
work page 2014
-
[36]
Maxted, P. F. L., Heber, U., Marsh, T. R., et al. 2001, MNRAS, 326, 139
work page 2001
-
[37]
2010, Neurocomputing, 73, 1906
Minhas, R., Baradarani, A., Seifzadeh,S., et al. 2010, Neurocomputing, 73, 1906
work page 2010
- [38]
-
[39]
Napiwotzki, R., Karl, C. A., Lisker, T., et al. 2001, Ap&SS, 291, 321 N´emeth, P., Østensen, R., Tremblay, P., et al. 2014, ASPC, 481, 95 N´emeth, P., Kawka, A., & Vennes, S. 2012, MNRAS, 427, 2180 O’Connell, Robert W. 1999, ARA&A, 37, 603 21 Østensen, R. H. 2006, Baltic, 15, 85 Østensen, R. H., Silvotti, R., Charpinet, S., et al. 2010, MNRAS, 409, 1740 P...
work page 2001
-
[40]
Sirko, Edwin., Goodman, Jeremy., Knapp, Gillian R., et al. 2004, AJ, 127, 899
work page 2004
- [41]
- [42]
- [43]
- [44]
-
[45]
Xiong, H., Chen, X., Podsiadlowski, Ph., et al. 2017, A&A, 599, 54
work page 2017
- [46]
-
[47]
York, Donald G., Adelman, J., Anderson, John E., et al. 2000, AJ, 120, 1579
work page 2000
-
[48]
Zhang, Xianfei., & Jeffery, C. S. 2012, MNRAS, 419, 452
work page 2012
-
[49]
Zhang, Xianfei., Hall, Philip D.., Jeffery, C. Simon., et al. 2017, ApJ, 835, 242
work page 2017
-
[50]
Zhao, Gang., Chen, Yu-Qin., Shi, Jian-Rong., et al. 2006, ChJAA, 6, 265
work page 2006
-
[51]
Zhao, Gang., Zhao, Yong-Heng., Chu, Yao-Quan., et al. 2012, RAA, 12, 723 22
work page 2012
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.