pith. sign in

arxiv: 2604.15560 · v3 · submitted 2026-04-16 · 🌌 astro-ph.EP · astro-ph.IM· cs.LG

ExoNet: Calibrated Multimodal Deep Learning for TESS Exoplanet Candidate Vetting using Phase-Folded Light Curves, Stellar Parameters, and Multi-Head Attention

Pith reviewed 2026-05-10 09:18 UTC · model grok-4.3

classification 🌌 astro-ph.EP astro-ph.IMcs.LG
keywords TESS exoplanet vettingmultimodal deep learningphase-folded light curveshabitable zone candidatesKepler training transfermulti-head attentiontemperature scaling calibrationcandidate prioritization
0
0 comments X

The pith

A multimodal neural network trained on Kepler data identifies 1,754 high-confidence TESS planet candidates including six Earth-sized habitable-zone worlds.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

This paper introduces ExoNet, a deep learning system that vets TESS exoplanet candidates by jointly analyzing phase-folded light curves and stellar parameters. The model combines convolutional networks for different light curve views with multi-head attention and a calibrated fusion step, then applies the result to thousands of unconfirmed TESS signals after training on Kepler examples. It reports strong classification performance and produces a shortlist of promising targets. A sympathetic reader would care because TESS has catalogued far more candidates than can be confirmed by hand, so reliable automated ranking could speed up the search for habitable planets.

Core claim

ExoNet achieves a test AUC of 0.9549 and 86.3 percent accuracy when trained on 7,585 labeled Kepler Objects of Interest. The architecture fuses global and local phase-folded light curve views with stellar parameters through 1D CNNs, 8-head multi-head attention, a residual fusion head, and post-hoc temperature scaling. When applied to 4,720 verified unconfirmed TESS planet candidates, it yields 1,754 high-confidence signals, 52 habitable-zone candidates, and six Earth-sized habitable-zone targets below 1.6 Earth radii, with TOI-5728.01 and TOI-6716.01 emerging as the most Earth-like.

What carries the argument

A calibrated late-fusion architecture that processes global and local phase-folded light curve views with 1D CNNs, applies 8-head multi-head attention to temporal feature maps, incorporates stellar parameters, and combines everything in a residual fusion head followed by temperature scaling.

If this is right

  • The 1,754 high-confidence TESS signals can be prioritized for ground-based confirmation observations.
  • The six Earth-sized habitable-zone targets become concrete objects for detailed atmospheric or radius refinement studies.
  • Ablation results show that removing any single modality lowers AUC, so all three data types are needed for peak performance.
  • The open release of code and the candidate catalog allows other groups to reproduce the rankings and test extensions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • The same fusion approach could be tested on other transit surveys that supply both light curves and basic stellar parameters.
  • Temperature scaling provides probability estimates that could be used to rank candidates by expected confirmation yield.
  • Domain adaptation methods might further improve transfer if systematic differences between Kepler and TESS noise become measurable.

Load-bearing premise

The distribution of false-positive signals and noise properties learned from the Kepler training set remains close enough to those in the TESS data for the decision boundary to stay reliable.

What would settle it

Independent follow-up observations that confirm or refute the planetary nature of the six Earth-sized habitable-zone candidates, especially TOI-5728.01 and TOI-6716.01.

Figures

Figures reproduced from arXiv: 2604.15560 by Md.Rashadul Islam.

Figure 1
Figure 1. Figure 1: ExoNet three-stream architecture. The global and local [PITH_FULL_IMAGE:figures/full_fig_p004_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: ExoNet evaluation on the held-out Kepler test set (1,139 samples). [PITH_FULL_IMAGE:figures/full_fig_p005_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: ExoNet training history (80-epoch run, early stopping at epoch 77). [PITH_FULL_IMAGE:figures/full_fig_p006_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Post-hoc Temperature Scaling calibration ( [PITH_FULL_IMAGE:figures/full_fig_p006_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: ExoNet calibrated predictions across 4,720 TESS Planet Candidates. [PITH_FULL_IMAGE:figures/full_fig_p006_5.png] view at source ↗
read the original abstract

The discovery of exoplanets at scale has become one of the defining data science challenges in modern astrophysics. NASA's Transiting Exoplanet Survey Satellite (TESS) had catalogued over 7,800 planet candidates by early 2026, yet confirmation stands at fewer than 720. This paper introduces ExoNet, a multimodal deep learning framework that jointly processes phase-folded global and local light curve views alongside stellar parameter features through a calibrated late-fusion architecture combining 1D Convolutional Neural Networks, 8-head Multi-Head Attention over temporal feature maps, and a residual fusion head with post-hoc Temperature Scaling calibration. Trained on 7,585 labeled Kepler Objects of Interest, ExoNet achieves Test AUC = 0.9549 and 86.3% accuracy. Applied to 4,720 verified unconfirmed TESS Planet Candidates with TOI-TIC cross-identification verified against the NASA Exoplanet Archive, the model yields 1,754 high-confidence signals, 52 habitable-zone candidates, and six Earth-sized habitable-zone targets below 1.6 Earth radii. TOI-5728.01 and TOI-6716.01 emerge as the most Earth-like unconfirmed candidates. Full ablation confirms each modality improves AUC. Code and catalog are openly released.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

3 major / 2 minor

Summary. The paper presents ExoNet, a multimodal deep learning architecture that fuses 1D CNN-processed global and local phase-folded light-curve views with stellar parameters via 8-head multi-head attention and a residual late-fusion head, followed by temperature scaling. Trained on 7,585 labeled Kepler Objects of Interest, the model reports a test AUC of 0.9549 and 86.3% accuracy; when applied to 4,720 verified unconfirmed TESS planet candidates (TOI-TIC cross-matched to the NASA Exoplanet Archive), it identifies 1,754 high-confidence signals, 52 habitable-zone candidates, and six Earth-sized habitable-zone targets below 1.6 Earth radii.

Significance. If the Kepler-to-TESS generalization holds, the work offers a practical, calibrated tool for large-scale TESS candidate vetting and could prioritize follow-up resources on a small number of promising habitable-zone targets. The open release of code and catalog strengthens reproducibility. The significance is limited, however, by the absence of any demonstrated robustness to the substantial differences in cadence, noise properties, and false-positive populations between the two missions.

major comments (3)
  1. [Abstract] Abstract: the central claim that the model yields 1,754 high-confidence TESS signals (including 52 HZ and 6 Earth-sized HZ candidates) rests on the untested assumption that the Kepler-trained decision boundary remains reliable after domain shift. No cross-mission validation set, adversarial alignment, or quantitative comparison of transit-like signal and false-positive distributions between Kepler (30-min cadence) and TESS (2-min cadence) is provided.
  2. [Abstract] Abstract and methods description: the reported test AUC of 0.9549 and 86.3% accuracy are given without any information on the train/test split strategy (e.g., ratios, stratification by planet radius or host-star type), handling of class imbalance, or metrics used to confirm that the held-out Kepler test set is representative of the TESS application domain.
  3. [Application to TESS candidates] Application section: the definition of 'high-confidence signals' and the probability threshold used to select the 1,754 candidates is not stated, nor is any calibration check (beyond temperature scaling) performed on TESS data to confirm that the reported probabilities remain well-calibrated after the survey change.
minor comments (2)
  1. The ablation study is mentioned but no table or quantitative deltas for each modality (global view, local view, stellar parameters, attention) are supplied in the abstract or visible summary.
  2. Clarify the exact cross-identification procedure used to match the 4,720 TESS candidates to TIC IDs and confirm they are absent from the Kepler training set.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their thorough and constructive review. We address each major comment below with point-by-point responses. We have revised the manuscript to supply the requested methodological details and to expand the discussion of domain-shift limitations. These changes improve transparency without altering the core claims or results.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the central claim that the model yields 1,754 high-confidence TESS signals (including 52 HZ and 6 Earth-sized HZ candidates) rests on the untested assumption that the Kepler-trained decision boundary remains reliable after domain shift. No cross-mission validation set, adversarial alignment, or quantitative comparison of transit-like signal and false-positive distributions between Kepler (30-min cadence) and TESS (2-min cadence) is provided.

    Authors: We acknowledge that no explicit cross-mission validation set or adversarial alignment was performed, as a sufficiently large, labeled TESS dataset with confirmed planets and false positives was not available during model development. In the revised manuscript we add a dedicated limitations subsection that (i) qualitatively compares Kepler and TESS light-curve noise properties and false-positive populations using published TESS noise models, (ii) reports the fraction of the 1,754 candidates that overlap with independent TESS vetting catalogs, and (iii) explicitly frames the 1,754 signals as high-priority follow-up targets rather than confirmed planets. A full quantitative domain-adaptation experiment remains outside the present scope but is noted as future work. revision: partial

  2. Referee: [Abstract] Abstract and methods description: the reported test AUC of 0.9549 and 86.3% accuracy are given without any information on the train/test split strategy (e.g., ratios, stratification by planet radius or host-star type), handling of class imbalance, or metrics used to confirm that the held-out Kepler test set is representative of the TESS application domain.

    Authors: We agree these details were omitted. The revised Methods section now states: an 80/20 train/test split was performed with stratification on binned planet radius and host-star spectral type; class imbalance was mitigated with a weighted binary cross-entropy loss whose weights are the inverse of the class frequencies in the training set; and representativeness is demonstrated by a supplementary table comparing the distributions of planet radius, transit depth, stellar Teff, and log g between the Kepler test set and the TESS candidate sample. We also report F1-score, precision-recall AUC, and calibration plots on the held-out Kepler test set. revision: yes

  3. Referee: [Application to TESS candidates] Application section: the definition of 'high-confidence signals' and the probability threshold used to select the 1,754 candidates is not stated, nor is any calibration check (beyond temperature scaling) performed on TESS data to confirm that the reported probabilities remain well-calibrated after the survey change.

    Authors: We will add an explicit statement that high-confidence signals are those with temperature-scaled probability > 0.85, a threshold selected by maximizing F1 on the Kepler validation fold. We further include a new paragraph describing a post-hoc sanity check: the model was applied to the small set of TESS-confirmed planets and known false positives present in the NASA Exoplanet Archive at the time of writing; the resulting probability distributions are reported and show that confirmed planets receive systematically higher scores. We note that a full recalibration on TESS would require a larger labeled TESS sample and is therefore listed as a limitation. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper trains a multimodal neural network (1D CNNs + 8-head attention + late fusion + temperature scaling) on 7,585 externally labeled Kepler KOIs, reports standard held-out test metrics (AUC 0.9549), and then applies the fixed model to 4,720 TESS candidates to produce counts. These TESS outputs are genuine inferences from a model whose parameters were never fitted to TESS data or labels. No self-definitional loops, fitted-input-as-prediction, self-citation load-bearing, uniqueness theorems, or ansatz smuggling appear in the described chain. The derivation remains self-contained against external benchmarks.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 0 invented entities

The central claim rests on the assumption that Kepler-derived labels are sufficiently unbiased and that the TESS noise distribution is close enough to Kepler for transfer to be valid. No new physical entities are postulated.

free parameters (2)
  • Temperature scaling parameter
    Post-hoc scalar fitted on a validation set to adjust output probabilities.
  • Model hyperparameters (learning rate, number of attention heads, fusion weights)
    Chosen during training; exact values not stated in abstract.
axioms (1)
  • domain assumption Kepler and TESS transit signals share the same underlying statistical distribution of true planets versus false positives
    Required for the trained classifier to generalize without additional domain adaptation.

pith-pipeline@v0.9.0 · 5555 in / 1417 out tokens · 43618 ms · 2026-05-10T09:18:55.497158+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. One Transit Is All You Need: Detecting Exoplanets Through Learned Stellar Behaviour with EXOVEIL

    astro-ph.EP 2026-06 unverdicted novelty 7.0

    EXOVEIL detects single-transit exoplanets via a Transformer world model trained on masked Kepler data, recovering 32% of 1000 ppm injections and 100% of tested TESS planets without retraining.

Reference graph

Works this paper leans on

16 extracted references · 16 canonical work pages · cited by 1 Pith paper

  1. [1]

    Transiting Exoplanet Survey Satellite (TESS),

    G. R. Rickeret al., “Transiting Exoplanet Survey Satellite (TESS),”Journal of Astronomical Telescopes, Instruments, and Systems, vol. 1, p. 014003, 2015

  2. [2]

    TESS Planet Candidate Statistics,

    NASA Exoplanet Archive, “TESS Planet Candidate Statistics,”

  3. [3]

    Available: https://exoplanetarchive.ipac.caltech

    [Online]. Available: https://exoplanetarchive.ipac.caltech. edu

  4. [4]

    Identifying exoplanets with deeplearning:Afive-planetresonantchainaroundkepler-80and an eighth planet around kepler-90,

    C. J. Shallue and A. Vanderburg, “Identifying exoplanets with deeplearning:Afive-planetresonantchainaroundkepler-80and an eighth planet around kepler-90,”The Astronomical Journal, vol. 155, no. 2, p. 94, 2018

  5. [5]

    ExoMiner: A highly accurate and ex- plainabledeeplearningclassifierthatvalidates301newexoplan- ets,

    H. Valizadeganet al., “ExoMiner: A highly accurate and ex- plainabledeeplearningclassifierthatvalidates301newexoplan- ets,”The Astrophysical Journal, vol. 926, no. 2, p. 120, 2022

  6. [6]

    ExoMiner++: Extending exominer to TESS and new domains,

    ——, “ExoMiner++: Extending exominer to TESS and new domains,”The Astrophysical Journal, 2025, in press

  7. [7]

    Keplerplanet-detectionmission:Introduc- tion and first results,

    W.J.Boruckiet al.,“Keplerplanet-detectionmission:Introduc- tion and first results,”Science, vol. 327, no. 5968, pp. 977–980, 2010

  8. [8]

    Scientific domain knowledge improves exo- planet transit classification with deep learning,

    M. Ansdellet al., “Scientific domain knowledge improves exo- planet transit classification with deep learning,”The Astrophys- ical Journal Letters, vol. 869, no. 1, p. L7, 2018

  9. [9]

    Identifying exoplanets with deep learning. II. two new super-earths uncovered by a neural network in k2 data,

    A. Dattiloet al., “Identifying exoplanets with deep learning. II. two new super-earths uncovered by a neural network in k2 data,”The Astronomical Journal, vol. 157, no. 5, p. 169, 2019

  10. [10]

    Exoplanet validation with machine learning: 50 new validated kepler planets,

    D. J. Armstronget al., “Exoplanet validation with machine learning: 50 new validated kepler planets,”Monthly Notices of the Royal Astronomical Society, vol. 504, no. 4, pp. 5327–5344, 2021

  11. [11]

    Machine learning classification of TESS planet candidatesusingstellarandtransitparameters,

    C. Huanget al., “Machine learning classification of TESS planet candidatesusingstellarandtransitparameters,”The Astronom- ical Journal, 2025, in press

  12. [12]

    A CNN-BiLSTM-Attention architecture for kepler exoplanet transit vetting,

    X. Chenet al., “A CNN-BiLSTM-Attention architecture for kepler exoplanet transit vetting,”Monthly Notices of the Royal Astronomical Society, 2025, in press

  13. [13]

    The multimodal uni- verse: Enabling large-scale machine learning with 100tb of as- tronomical scientific data,

    The Multimodal Universe Collaboration, “The multimodal uni- verse: Enabling large-scale machine learning with 100tb of as- tronomical scientific data,” inAdvances in Neural Information Processing Systems (NeurIPS), 2024

  14. [14]

    Lightkurve: Kepler and TESS time series analysis in python,

    Lightkurve Collaborationet al., “Lightkurve: Kepler and TESS time series analysis in python,”Astrophysics Source Code Li- brary, p. ascl:1812.013, 2018

  15. [15]

    On cali- bration of modern neural networks,

    C. Guo, G. Pleiss, Y. Sun, and K. Q. Weinberger, “On cali- bration of modern neural networks,”Proceedings of the 34th International Conference on Machine Learning (ICML), pp. 1321–1330, 2017

  16. [16]

    Most 1.6 Earth-Radius planets are not rocky,

    L. A. Rogers, “Most 1.6 Earth-Radius planets are not rocky,” The Astrophysical Journal, vol. 801, no. 1, p. 41, 2015