Estimating Earthquake Magnitude in Sentinel-1 Imagery via Ranking
Pith reviewed 2026-05-23 23:43 UTC · model grok-4.3
The pith
Pairwise ranking alongside magnitude regression cuts mean absolute error by over 30 percent for earthquake estimation from Sentinel-1 imagery.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
We propose to pose the estimation of earthquake magnitudes as a metric-learning problem, training models to not only estimate earthquake magnitude from Sentinel-1 satellite imagery but to additionally rank pairwise samples. Our experiments show at max a 30%+ improvement in MAE over prior regression-only based methods, particularly transformer-based architectures.
What carries the argument
A combined loss that performs magnitude regression while enforcing pairwise ranking on Sentinel-1 image pairs.
Load-bearing premise
That the pairwise ranking term, rather than other modeling decisions, is what produces the observed accuracy gains in the low-data regime.
What would settle it
Train the exact same architecture and data pipeline once with the ranking term and once without it, then compare the resulting MAE values while holding all other factors fixed.
Figures
read the original abstract
Earthquakes are commonly estimated using physical seismic stations, however, due to the installation requirements and costs of these stations, global coverage quickly becomes impractical. An efficient and lower-cost alternative is to develop machine learning models to globally monitor earth observation data to pinpoint regions impacted by these natural disasters. However, due to the small amount of historically recorded earthquakes, this becomes a low-data regime problem requiring algorithmic improvements to achieve peak performance when learning to regress earthquake magnitude. In this paper, we propose to pose the estimation of earthquake magnitudes as a metric-learning problem, training models to not only estimate earthquake magnitude from Sentinel-1 satellite imagery but to additionally rank pairwise samples. Our experiments show at max a 30%+ improvement in MAE over prior regression-only based methods, particularly transformer-based architectures.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes framing earthquake magnitude estimation from Sentinel-1 imagery as a metric-learning problem that combines standard regression with an additional pairwise ranking objective. It claims this yields up to 30%+ MAE reduction relative to regression-only baselines (especially transformers) in the low-data regime imposed by limited historical earthquake records.
Significance. If the MAE gains are robust and causally attributable to the ranking term rather than other modeling choices, the method would constitute a practical algorithmic advance for low-data remote-sensing applications in disaster monitoring.
major comments (2)
- [Abstract] Abstract: the headline claim of 'at max a 30%+ improvement in MAE' supplies no dataset size, train/test split details, baseline implementations, statistical testing, or ablation isolating the ranking loss; without these the central claim cannot be evaluated.
- [Experiments] The manuscript provides no controlled ablation that holds architecture, data handling, optimization, and regularization fixed while toggling only the presence or weighting of the pairwise ranking objective; therefore the reported improvement cannot be attributed to the ranking component as asserted.
Simulated Author's Rebuttal
We thank the referee for their constructive comments, which highlight opportunities to strengthen the clarity and rigor of our claims. We address each major comment below and will incorporate revisions in the next version of the manuscript.
read point-by-point responses
-
Referee: [Abstract] Abstract: the headline claim of 'at max a 30%+ improvement in MAE' supplies no dataset size, train/test split details, baseline implementations, statistical testing, or ablation isolating the ranking loss; without these the central claim cannot be evaluated.
Authors: We agree that the abstract is concise and omits key experimental details needed to evaluate the headline claim. In the revision we will expand the abstract to report the number of earthquake events and Sentinel-1 images, the train/test split ratios, the specific baseline implementations (including the transformer architectures), any statistical significance testing performed, and an explicit reference to the ablation studies that isolate the ranking objective. These additions will be kept within abstract length constraints while making the central result evaluable. revision: yes
-
Referee: [Experiments] The manuscript provides no controlled ablation that holds architecture, data handling, optimization, and regularization fixed while toggling only the presence or weighting of the pairwise ranking objective; therefore the reported improvement cannot be attributed to the ranking component as asserted.
Authors: The referee correctly identifies that a fully controlled ablation is required to causally attribute gains to the ranking term. While our experiments already compare regression-only versus regression-plus-ranking models across multiple architectures, we acknowledge that other factors were not held identical in every comparison. We will add a dedicated controlled ablation in the revised experiments section that fixes the architecture, data splits, optimizer, learning-rate schedule, and regularization while varying only the ranking-loss weight (including the zero-weight regression-only case). Results from this ablation will be reported to directly support attribution of the observed MAE reductions. revision: yes
Circularity Check
No circularity in derivation; empirical gains reported without self-referential fitting
full rationale
The paper frames magnitude estimation as a combined regression-plus-ranking metric learning task and reports empirical MAE improvements of up to 30% over regression-only baselines. No equations, parameter-fitting procedures, or self-citations are shown that would make any claimed prediction or uniqueness result equivalent to its own inputs by construction. The central claim rests on experimental comparisons rather than any definitional or self-citation reduction, rendering the derivation self-contained.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
https://doi.org/10.5270/esa-c5d3d65, http://dx.doi.org/ 10.5270/ESA-c5d3d65
Copernicus dem (2022). https://doi.org/10.5270/esa-c5d3d65, http://dx.doi.org/ 10.5270/ESA-c5d3d65
-
[2]
Scientific Reports 10(1) (Jan 2020)
Ban, Y., Zhang, P., Nascetti, A., Bevington, A.R., Wulder, M.A.: Near real-time wildfire progression monitoring with sentinel-1 sar time series and deep learning. Scientific Reports 10(1) (Jan 2020). https://doi.org/10.1038/s41598-019-56967-x, http://dx.doi.org/10.1038/s41598-019-56967-x
-
[3]
Bonafilia, D., Tellman, B., Anderson, T., Issenberg, E.: Sen1floods11: A georef- erenced dataset to train and test deep learning flood algorithms for sentinel-1. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops (June 2020)
work page 2020
-
[4]
Cambrin, D.R., Garza, P.: Quakeset: A dataset and low-resource models to monitor earthquakes through sentinel-1. Proceedings of the 21th International Conference on Information Systems for Crisis Response and Management (In Press)
-
[5]
Chandrakumar, C., Prasanna, R., Stephens, M., Tan, M.L., Holden, C., Punchi- hewa, A., Becker, J.S., Jeong, S., Ravishan, D.: Algorithms for detecting p-waves and earthquake magnitude estimation: Initial literature review findings (2023)
work page 2023
-
[6]
In: 2009 IEEE conference on computer vision and pattern recognition
Deng, J., Dong, W., Socher, R., Li, L.J., Li, K., Fei-Fei, L.: Imagenet: A large- scale hierarchical image database. In: 2009 IEEE conference on computer vision and pattern recognition. pp. 248–255. Ieee (2009) 6 D. Cambrin et al
work page 2009
-
[7]
In: International Conference on Learning Representations (2021), https://openreview
Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., Uszkoreit, J., Houlsby, N.: An image is worth 16x16 words: Transformers for image recognition at scale. In: International Conference on Learning Representations (2021), https://openreview. net/forum?id=YicbFdNTTy
work page 2021
-
[8]
Geophysical Journal International 216(1), 332–349 (2019)
Funning, G.J., Garcia, A.: A systematic study of earthquake detectability us- ing sentinel-1 interferometric wide-swath data. Geophysical Journal International 216(1), 332–349 (2019)
work page 2019
-
[9]
Journal of Geoscience Education45(3), 225–228 (May 1997)
Hodder, A.P.W.: The cost of earthquakes. Journal of Geoscience Education45(3), 225–228 (May 1997). https://doi.org/10.5408/1089-9995-45.3.225, http://dx.doi. org/10.5408/1089-9995-45.3.225
-
[10]
In: Proceedings of the IEEE/CVF international conference on computer vision
Howard, A., Sandler, M., Chu, G., Chen, L.C., Chen, B., Tan, M., Wang, W., Zhu, Y., Pang, R., Vasudevan, V., et al.: Searching for mobilenetv3. In: Proceedings of the IEEE/CVF international conference on computer vision. pp. 1314–1324 (2019)
work page 2019
-
[11]
IEEE Geo- science and Remote Sensing Letters19, 1–5 (2022)
Khan, I., Kwon, Y.W.: P-detector: Real-time p-wave detection in a seismic wave- form recorded on a low-cost mems accelerometer using deep learning. IEEE Geo- science and Remote Sensing Letters19, 1–5 (2022)
work page 2022
-
[12]
Bulletin of the Seismological So- ciety of America112(2), 669–679 (2022)
Liu, H., Li, S., Song, J.: Discrimination between earthquake p waves and mi- crotremors via a generative adversarial network. Bulletin of the Seismological So- ciety of America112(2), 669–679 (2022)
work page 2022
-
[13]
Decoupled Weight Decay Regularization
Loshchilov, I., Hutter, F., et al.: Fixing weight decay regularization in adam. arXiv preprint arXiv:1711.05101 5 (2017)
work page internal anchor Pith review Pith/arXiv arXiv 2017
-
[14]
Mehta, S., Rastegari, M.: Separable self-attention for mobile vision transformers (2022)
work page 2022
-
[15]
In: IGARSS 2019 - 2019 IEEE In- ternational Geoscience and Remote Sensing Symposium
Mouche, A., Soulat, F., Potin, P., Martino, L.: Sentinel-1 contribution to tropi- cal cyclones observations at high resolution. In: IGARSS 2019 - 2019 IEEE In- ternational Geoscience and Remote Sensing Symposium. pp. 5472–5475 (2019). https://doi.org/10.1109/IGARSS.2019.8898584
-
[16]
Nature communications11(1), 3952 (2020)
Mousavi, S.M., Ellsworth, W.L., Zhu, W., Chuang, L.Y., Beroza, G.C.: Earth- quake transformer—an attentive deep-learning model for simultaneous earthquake detection and phase picking. Nature communications11(1), 3952 (2020)
work page 2020
-
[17]
Remote Sensing12(19), 3136 (Sep 2020)
Sishodia, R.P., Ray, R.L., Singh, S.K.: Applications of remote sensing in precision agriculture: A review. Remote Sensing12(19), 3136 (Sep 2020). https://doi.org/ 10.3390/rs12193136, http://dx.doi.org/10.3390/rs12193136
-
[18]
In: Proceedings of the 30th interna- tional conference on advances in geographic information systems
Stewart, A.J., Robinson, C., Corley, I.A., Ortiz, A., Ferres, J.M.L., Banerjee, A.: Torchgeo: deep learning with geospatial data. In: Proceedings of the 30th interna- tional conference on advances in geographic information systems. pp. 1–12 (2022)
work page 2022
-
[19]
Remote sensing of environment120, 9–24 (2012)
Torres, R., Snoeij, P., Geudtner, D., Bibby, D., Davidson, M., Attema, E., Potin, P., Rommen, B., Floury, N., Brown, M., et al.: Gmes sentinel-1 mission. Remote sensing of environment120, 9–24 (2012)
work page 2012
-
[20]
Remote Sensing of Environment248, 111965 (Oct 2020)
Wang, L., Marzahn, P., Bernier, M., Ludwig, R.: Sentinel-1 insar measurements of deformation over discontinuous permafrost terrain, northern quebec, canada. Remote Sensing of Environment248, 111965 (Oct 2020). https://doi.org/10.1016/ j.rse.2020.111965, http://dx.doi.org/10.1016/j.rse.2020.111965
-
[21]
In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition
Woo, S., Debnath, S., Hu, R., Chen, X., Liu, Z., Kweon, I.S., Xie, S.: Convnext v2: Co-designing and scaling convnets with masked autoencoders. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. pp. 16133–16142 (2023)
work page 2023
-
[22]
Journal of Geophysical Research: Solid Earth127(3), e2021JB023283 (2022)
Zhu, W., Tai, K.S., Mousavi, S.M., Bailis, P., Beroza, G.C.: An end-to-end earth- quake detection method for joint phase picking and association using deep learning. Journal of Geophysical Research: Solid Earth127(3), e2021JB023283 (2022)
work page 2022
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.