Independent Recovery of Vanishing Sources on POSS-I Photographic Plates Using Automated Source Detection and Cross-Epoch Matching
Pith reviewed 2026-05-10 18:55 UTC · model grok-4.3
The pith
An independent pipeline recovers most reported vanished sources from POSS-I plates but detects no significant temporal clustering.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The pipeline recovers 8/9 sources in the April 1950 benchmark field and 3/3 in the July 1952 field. It matches 3450 of the 5399 entries in the published Solano et al. catalog (63.9 percent) at a median separation of 0.94 arcsec. A full-footprint search over POSS-I coverage yields 2.85 million candidates after PSF cuts, deduplication, and Pan-STARRS DR1 rejection. Application of day-window tests across the 368 observation nights produces a post-test relative risk of 1.35 that is not statistically significant, and a negative binomial model of nightly counts likewise shows no effect.
What carries the argument
The automated detection pipeline that performs PSF-filtered source detection on POSS-I Red DSS cutouts, applies local astrometric registration refinement, performs cross-epoch matching to POSS-I Blue and POSS-II Red plates, and rejects candidates using Pan-STARRS DR1.
If this is right
- The pipeline reproduces 63.9 percent of the prior catalog entries with sub-arcsecond positional agreement.
- A large, filtered catalog of 2.85 million candidates is generated for further investigation after post-processing.
- No statistically significant excess of candidates appears on any particular calendar day within the 1949-1957 interval.
- The false-positive rate remains low at approximately 0.2 per 10-arcmin field in non-crowded control regions.
Where Pith is reading between the lines
- The recovered candidates could be followed up with targeted high-resolution imaging to check for reappearance or other properties.
- Larger samples or alternative statistical models might be needed to detect any subtle day-of-year pattern if one exists.
- The same cross-epoch rejection approach could be extended to additional historical plate archives to expand the search for vanished sources.
Load-bearing premise
That PSF filtering, local astrometric refinement, and Pan-STARRS DR1 rejection correctly separate true vanished sources from artifacts or misdetections without systematic bias across the full survey footprint and varying plate conditions.
What would settle it
Detection of faint counterparts to most of the 2.85 million candidates in deeper modern surveys such as Gaia or LSST within 3 arcsec would indicate they are persistent rather than vanished.
Figures
read the original abstract
We present an independent pipeline for detecting candidate vanished sources on digitized first-epoch Palomar Observatory Sky Survey (POSS-I) photographic plates. The pipeline detects and PSF-filters sources on POSS-I Red DSS cutouts, applies local astrometric registration refinement, and identifies candidates by cross-epoch matching against POSS-I Blue and POSS-II Red with Pan-STARRS DR1 rejection. On a 20-case benchmark harness, the pipeline recovers 8/9 sources in the April 1950 field and 3/3 in the July 1952 field, with a false positive rate of about 0.2 per 10 arcmin field on random non-crowded controls. A full-footprint sweep over the POSS-I coverage using 30 arcmin patches yields a filtered catalog of 2.85 million candidate vanished sources after post-processing PSF cuts, deduplication, and Pan-STARRS DR1 rejection. Cross-matching against the published 5,399-source Solano et al. (2022) catalog yields 3,450 matches (63.9%) with median separation 0.94 arcsec; among unrecovered catalog entries within our footprint, we find no Pan-STARRS DR1 counterpart within 3 arcsec. Applying Bruehl and Villarroel (2025)-style temporal windows to this catalog over the 368 POSS-I observation nights in the 1949-1957 interval gives a post-test calendar-day relative risk of 1.35 for the +1 day window, but the effect is not statistically significant (95% CI 0.91-2.00; two-sided Fisher p = 0.17) and is sensitive to coding unobserved days as zero-transient days. A negative binomial model of nightly candidate counts with nightly patch coverage as exposure is likewise null (IRR = 1.03, 95% CI 0.89-1.18, p = 0.71). The catalog-level replication is strong; the temporal association remains inconclusive.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper presents an independent automated pipeline for detecting candidate vanished sources on digitized POSS-I Red photographic plates. The method performs source detection and PSF filtering on DSS cutouts, applies local astrometric registration refinement, and identifies candidates via cross-epoch matching to POSS-I Blue and POSS-II Red plates followed by Pan-STARRS DR1 rejection. On a 20-case benchmark, it recovers 8/9 sources in the April 1950 field and 3/3 in the July 1952 field with a reported false-positive rate of ~0.2 per 10 arcmin on non-crowded controls. A full-footprint run over 30-arcmin patches produces a filtered catalog of 2.85 million candidates. This catalog overlaps 63.9% (3,450 matches, median separation 0.94 arcsec) with the Solano et al. (2022) catalog of 5,399 sources. Temporal association tests using Bruehl & Villarroel-style windows over 368 nights yield a non-significant relative risk of 1.35 (p=0.17) and a null negative binomial result (IRR=1.03, p=0.71).
Significance. If the pipeline's purity and completeness hold across the full survey, the 2.85M-candidate catalog would constitute a substantial independent resource for archival transient studies, enabling statistical analyses of source disappearance rates in the 1949-1957 era. The 63.9% replication of the prior Solano catalog is a clear strength, demonstrating consistency between independent detection methods. However, the temporal clustering test remains inconclusive, limiting immediate astrophysical conclusions. The work advances automated archival searches but its impact depends on robust validation of the large catalog.
major comments (2)
- [Benchmark harness and full-footprint sweep] Benchmark harness and false-positive controls: The recovery rates (8/9 and 3/3) and FP rate (~0.2 per 10 arcmin) are derived from only 20 cases on two specific dates using non-crowded random controls. The full 2.85M catalog is generated across the heterogeneous POSS-I footprint (varying plate quality, stellar density, and artifact rates) without reported injection-recovery tests, crowding-stratified FP measurements, or plate-quality stratification. This gap directly affects whether the reported purity can be extrapolated to the full catalog.
- [Catalog generation and cross-matching] Pan-STARRS DR1 rejection and post-processing: The pipeline relies on Pan-STARRS DR1 rejection plus PSF cuts and deduplication to produce the 2.85M catalog, yet no quantitative assessment of completeness, false-negative rate, or systematic bias in rejection across the survey is provided. The note that unrecovered Solano entries lack Pan-STARRS counterparts within 3 arcsec is useful but insufficient without an error budget or simulation to confirm the rejection step does not preferentially remove real transients.
minor comments (3)
- [Methods and abstract] The abstract and methods would benefit from explicit listing of the exact PSF cut thresholds, local astrometric refinement tolerances, and deduplication criteria rather than referring to them generically as 'post-processing cuts'.
- [Benchmark results] Consider adding a table or figure summarizing the 20-case benchmark recoveries, including individual source properties, separations, and reasons for the single non-recovery.
- [Temporal association test] The temporal analysis correctly reports non-significance and sensitivity to day coding, but a brief power calculation for the negative binomial model would help readers interpret the null result.
Simulated Author's Rebuttal
We thank the referee for their constructive feedback, which highlights both the potential utility of our 2.85M-candidate catalog and the need for clearer discussion of validation limits. We address each major comment below with clarifications on our approach and scope. Revisions to the manuscript will incorporate expanded caveats and a dedicated limitations section to better contextualize the benchmark and rejection steps.
read point-by-point responses
-
Referee: Benchmark harness and false-positive controls: The recovery rates (8/9 and 3/3) and FP rate (~0.2 per 10 arcmin) are derived from only 20 cases on two specific dates using non-crowded random controls. The full 2.85M catalog is generated across the heterogeneous POSS-I footprint (varying plate quality, stellar density, and artifact rates) without reported injection-recovery tests, crowding-stratified FP measurements, or plate-quality stratification. This gap directly affects whether the reported purity can be extrapolated to the full catalog.
Authors: We acknowledge that the 20-case benchmark on two fields with non-crowded controls provides only a baseline demonstration rather than a comprehensive purity assessment across the full heterogeneous footprint. Our primary validation metric remains the 63.9% overlap with the independent Solano et al. (2022) catalog, which offers external consistency without relying solely on our internal FP estimates. The reported FP rate serves as a conservative lower bound for non-crowded regions. In the revised manuscript we will add a 'Limitations and Future Work' section explicitly discussing the absence of injection-recovery tests and the potential impact of crowding and plate quality variations, while noting that the catalog is presented as a resource for further statistical studies rather than a purity-guaranteed list. revision: partial
-
Referee: Pan-STARRS DR1 rejection and post-processing: The pipeline relies on Pan-STARRS DR1 rejection plus PSF cuts and deduplication to produce the 2.85M catalog, yet no quantitative assessment of completeness, false-negative rate, or systematic bias in rejection across the survey is provided. The note that unrecovered Solano entries lack Pan-STARRS counterparts within 3 arcsec is useful but insufficient without an error budget or simulation to confirm the rejection step does not preferentially remove real transients.
Authors: The 3-arcsec Pan-STARRS DR1 rejection radius is chosen to accommodate known astrometric uncertainties in the digitized POSS-I plates while removing sources that remain detectable in modern surveys. The observation that unrecovered Solano entries also lack PS counterparts within this radius supports that the step is not selectively discarding real transients from the prior catalog. We agree that without dedicated injection simulations we cannot provide a full error budget or false-negative quantification. The revised manuscript will expand the methods and discussion sections to justify the radius choice, report the fraction of Solano sources rejected by this criterion, and include a paragraph on possible biases, while emphasizing that the approach is intentionally conservative to minimize contamination from persistent sources. revision: partial
Circularity Check
No significant circularity; pipeline and catalog rely on external data and independent processing
full rationale
The derivation chain consists of source detection on POSS-I plates, PSF filtering, local astrometric refinement, cross-epoch matching, and rejection against Pan-STARRS DR1 plus the external Solano et al. (2022) catalog. Benchmark recovery rates and the 2.85M catalog are generated from these steps without fitting parameters to the target outputs or redefining quantities in terms of themselves. The temporal window analysis applies a cited external style to newly produced data and yields a null result; no self-citation chains, ansatz smuggling, or fitted-input-as-prediction reductions are present. The central claims remain self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Sources absent from Pan-STARRS DR1 within the matching radius are either truly vanished or artifacts from the older plates
Reference graph
Works this paper leans on
-
[1]
2025, Scientific Reports, 15, 34125, doi: 10.1038/s41598-025-21620-3
Bruehl, J., & Villarroel, B. 2025, Scientific Reports, 15, 34125, doi: 10.1038/s41598-025-21620-3
-
[2]
Chambers, K. C., Magnier, E. A., Metcalfe, N., et al. 2016, arXiv e-prints
work page 2016
-
[3]
Lasker, B. M., Lattanzi, M. G., McLean, B. J., et al. 2008, The Astronomical Journal, 136, 735, doi: 10.1088/0004-6256/136/2/735
-
[4]
2022, Monthly Notices of the Royal Astronomical Society, 515, 1380, doi: 10.1093/mnras/stac1880
Solano, E., Garc´ ıa-Alvarez, D., Marcos-Arenal, P., et al. 2022, Monthly Notices of the Royal Astronomical Society, 515, 1380, doi: 10.1093/mnras/stac1880
-
[5]
2024, Monthly Notices of the Royal Astronomical Society, 527, 6312, doi: 10.1093/mnras/stad3726
Solano, E., et al. 2024, Monthly Notices of the Royal Astronomical Society, 527, 6312, doi: 10.1093/mnras/stad3726
-
[6]
https: //ui.adsabs.harvard.edu/abs/2021NatSR..1112794V
Villarroel, B., Imaz, I., Joshi, J., et al. 2021, Scientific Reports, 11, 12794, doi: 10.1038/s41598-021-92162-7
-
[7]
Villarroel, B., Solano, E., & Marcy, G. W. 2025, arXiv e-prints
work page 2025
-
[8]
Villarroel, B., Solano, E., Mattsson, L., et al. 2020, The Astronomical Journal, 159, 8, doi: 10.3847/1538-3881/ab570f
-
[9]
Villarroel, B., Streblyanska, A., Bruehl, S., & Geier, S. 2026, arXiv e-prints
work page 2026
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.