pith. sign in

arxiv: 2606.05731 · v1 · pith:IMAHGRZHnew · submitted 2026-06-04 · 💻 cs.LG

Intercomparison of Machine Learning Algorithms for Remote Sensing-based In-season Crop Mapping

Pith reviewed 2026-06-28 03:18 UTC · model grok-4.3

classification 💻 cs.LG
keywords machine learningremote sensingcrop mappingin-season mappingsupport vector machinesLandsat SentinelCalifornia almondsIowa corn
0
0 comments X

The pith

Support vector machines achieve the highest accuracy for mapping almonds and corn by early June using satellite time series and rotation history in unseen years.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper compares ten machine learning algorithms to map crop types before harvest using satellite imagery and past rotation records. It shows that support vector machines deliver the best performance when tested on years not used in training, reaching mean F1 scores of 0.74 for almonds in California and 0.59 for corn in Iowa by early June. This matters because official crop maps currently arrive only after harvest, limiting timely responses to climate threats. The study uses year-wise cross-validation to account for interannual variability and quantifies uncertainty from phenology and distribution.

Core claim

Harmonized Landsat-Sentinel surface reflectance imagery time series combined with crop rotation history information can be used with support vector machines to map corn in Iowa and almonds in California at 30m resolution accurately by early June in unseen years, outperforming other algorithms across thousands of model configurations evaluated with year-wise cross-validation.

What carries the argument

Year-wise cross-validation of ten machine learning algorithms with hyperparameter tuning on combined satellite reflectance time series and crop rotation history inputs.

If this is right

  • Interannual variation is a large source of uncertainty in the maps.
  • Ensemble approaches or additional ancillary data show potential to improve performance further.
  • The methods can be extended to multiclass maps of all crop types and CONUS-wide application.
  • In-season crop yield forecasting becomes feasible with these approaches.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

  • These mapping techniques could be adapted to other crops and regions facing similar climate risks.
  • Integration with real-time climate data might enable proactive emergency responses.
  • Quantifying uncertainty from phenology could help prioritize areas for ground verification.

Load-bearing premise

That satellite imagery time series and crop rotation history provide enough information to produce accurate in-season crop maps in years not seen during training without additional data or specific adjustments.

What would settle it

Demonstrating that a different algorithm or additional inputs consistently produce higher F1 scores than 0.74 for almonds and 0.59 for corn when tested on multiple new years would falsify the superiority of support vector machines with these inputs.

Figures

Figures reproduced from arXiv: 2606.05731 by August Posch, Auroop R. Ganguly, Forrest M. Hoffman, Jitendra Kumar.

Figure 1
Figure 1. Figure 1: Location of the two study areas within the United States. The California (CA) study area is marked by the red squar [PITH_FULL_IMAGE:figures/full_fig_p003_1.png] view at source ↗
Figure 2
Figure 2. Figure 2: Performance of the best configuration of each algorithm for almonds in California by early June. The best mean F1 [PITH_FULL_IMAGE:figures/full_fig_p011_2.png] view at source ↗
Figure 3
Figure 3. Figure 3: Performance of the best configuration of each algorithm for corn in Iowa by early June. The best mean F1 scores cam [PITH_FULL_IMAGE:figures/full_fig_p012_3.png] view at source ↗
Figure 4
Figure 4. Figure 4: Performance of the best model for each in season cutoff date, such as June, August, or End Of Year. Early [PITH_FULL_IMAGE:figures/full_fig_p013_4.png] view at source ↗
Figure 5
Figure 5. Figure 5: Performance (F1 mean) for selected algorithms trained on 0.1% sample as well as 100% of training data via bagging. [PITH_FULL_IMAGE:figures/full_fig_p014_5.png] view at source ↗
Figure 6
Figure 6. Figure 6: Off-diagonal: Pairwise F1 scores across validation years for the top 38 (within 0.01 mean F1 score of best) Almonds￾California-June models. The following pairs of years have negative correlations: 2018 with 2019, 2018 with 2020, and 2018 with 2021. On-diagonal: Histogram of F1 scores for each validation year for the top 38 Almonds-California-June models. Table VII PEARSON CORRELATION COEFFICIENTS FOR PAIRW… view at source ↗
Figure 7
Figure 7. Figure 7: Crop rotation history of almond pixels. Each column shows a history profile for almonds in a specific year of inter [PITH_FULL_IMAGE:figures/full_fig_p018_7.png] view at source ↗
Figure 8
Figure 8. Figure 8: Performance of K Nearest Neighbors algorithms based on choice of hyperparameters: distance metric, compositing [PITH_FULL_IMAGE:figures/full_fig_p018_8.png] view at source ↗
read the original abstract

In-season crop type mapping is critical for food security in the face of increasingly extreme climate-related threats to crops. Currently, the USDA Cropland Data Layer provides crop type labels at 30m resolution and is available the February after harvest, but no product exists that maps crop types before harvest with satisfactory accuracy that would allow emergency managers to respond to crop threats in near real time. Furthermore, the relative advantages of a wide range of algorithms have not been evaluated in a way that accounts for interannual variability, until this study. Here, Harmonized Landsat-Sentinel surface reflectance imagery time series and crop rotation history information are combined to map corn in Iowa and almonds in California at 30m resolution accurately by early June in unseen years, with robust quantification of uncertainty due to phenology and crop distribution. Thousands of model configurations across ten machine learning algorithms were compared using a year-wise cross-validation and a suite of metrics. Hyperparameter search revealed Support Vector Machines to be the most successful algorithm overall, with a mean F1 score of 0.74 (0.59) across five unseen validation years for almonds by early June in California (corn by early June in Iowa). Interannual variation was a large source of uncertainty, but patterns showed the potential to further improve performance with ensemble approaches or ancillary data. Future work may extend these methods to include multiclass maps of all crop types, CONUS-wide maps, and in-season crop yield forecasting.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Referee Report

1 major / 1 minor

Summary. The manuscript intercompares ten machine learning algorithms for in-season 30 m crop mapping of almonds in California and corn in Iowa. It combines Harmonized Landsat-Sentinel surface reflectance time series with crop rotation history, performs year-wise cross-validation across five unseen years, and reports that Support Vector Machines achieve the highest mean F1 scores (0.74 for almonds, 0.59 for corn) by early June, while identifying interannual variation as a major uncertainty source and suggesting potential for ensembles or ancillary data.

Significance. If the per-year results and uncertainty quantification hold, the study supplies a practical benchmark for temporal generalization in remote-sensing crop classification, directly addressing the latency gap with the USDA Cropland Data Layer. The year-wise hold-out design is a clear methodological strength for assessing real-world applicability.

major comments (1)
  1. [Abstract] Abstract: the headline claim that SVM is 'the most successful algorithm overall' rests only on the highest mean F1 across five years; given the explicit statement that 'interannual variation was a large source of uncertainty,' the manuscript must show either (a) per-year rankings or (b) paired statistical tests confirming that SVM differences are significant and consistent, otherwise the aggregate mean does not support declaring one algorithm superior.
minor comments (1)
  1. The abstract refers to 'early June' without stating the precise day-of-year or phenological window used for each crop; this should be clarified for reproducibility.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for the constructive review. We address the single major comment below.

read point-by-point responses
  1. Referee: [Abstract] Abstract: the headline claim that SVM is 'the most successful algorithm overall' rests only on the highest mean F1 across five years; given the explicit statement that 'interannual variation was a large source of uncertainty,' the manuscript must show either (a) per-year rankings or (b) paired statistical tests confirming that SVM differences are significant and consistent, otherwise the aggregate mean does not support declaring one algorithm superior.

    Authors: We agree that the abstract phrasing overstates the result given the acknowledged interannual variation. The revised manuscript will change the abstract to state that SVMs achieved the highest mean F1 score rather than declaring them 'the most successful algorithm overall.' We will also add a table (or expanded supplementary table) listing per-year F1 scores for the top three algorithms across the five validation years so readers can directly evaluate consistency. This revision directly addresses the request for per-year rankings. revision: yes

Circularity Check

0 steps flagged

No circularity: empirical ML comparison with external year-holdout validation

full rationale

The paper conducts a standard empirical intercomparison of ten ML algorithms for crop mapping, using year-wise cross-validation on five unseen validation years. Hyperparameter search and model selection occur within training folds, with F1 scores and other metrics evaluated on held-out years that are never used for fitting. No equations, self-definitional steps, fitted-input predictions, or load-bearing self-citations appear in the methodology; the reported performance metrics are computed directly on independent test data and do not reduce to the training inputs by construction. This is a self-contained empirical study against external benchmarks.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 0 invented entities

The central claim depends on the adequacy of the chosen satellite inputs and rotation history for generalization, plus the assumption that hyperparameter search across thousands of configurations did not introduce selection bias in the reported best algorithm.

free parameters (1)
  • Hyperparameters across ten algorithms
    Thousands of model configurations were compared via hyperparameter search; each algorithm's optimal settings are fitted values chosen to maximize validation performance.
axioms (1)
  • domain assumption Year-wise cross-validation isolates interannual variability without leakage from spatial or temporal autocorrelation
    Invoked when stating the method accounts for interannual variability and tests on unseen years.

pith-pipeline@v0.9.1-grok · 5801 in / 1216 out tokens · 168782 ms · 2026-06-28T03:18:16.291962+00:00 · methodology

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Reference graph

Works this paper leans on

32 extracted references · 18 canonical work pages · 1 internal anchor

  1. [1]

    Enhancement of river flooding due to global warming,

    H. Alifu, Y. Hirabayashi, Y. Imada et al., "Enhancement of river flooding due to global warming," Sci. Rep., vol. 12, Art. no. 20687, 2022, doi: 10.1038/s41598-022-25182-6

  2. [2]

    Climate- mediated shifts in temperature fluctuations promote extinction risk,

    K. Duffy, T. C. Gouhier, and A. R. Ganguly, "Climate- mediated shifts in temperature fluctuations promote extinction risk," Nat. Clim. Change, vol. 12, pp. 1037- 1044, 2022, doi: 10.1038/s41558-022-01490-7

  3. [3]

    The impacts of climate change on river flood risk at the global scale,

    N. W. Arnell and S. N. Gosling, "The impacts of climate change on river flood risk at the global scale," Clim. 21 > REPLACE THIS LINE WITH YOUR MANUSCRIPT ID NUMBER (DOUBLE -CLICK HERE TO EDIT) < Change, vol. 134, pp. 387-401, 2016, doi: 10.1007/s10584-014-1084-5

  4. [4]

    Climate change impacts on crop yields,

    E. E. Rezaei, H. Webber, S. Asseng et al., "Climate change impacts on crop yields," Nat. Rev. Earth Environ., vol. 4, pp. 831-846, 2023, doi: 10.1038/s43017-023-00491-0

  5. [5]

    Monitoring US agriculture: the USDA- NASS Cropland Data Layer program,

    C. Boryan et al., "Monitoring US agriculture: the USDA- NASS Cropland Data Layer program," Geocarto Int., vol. 26, no. 5, pp. 341-358, 2011, doi: 10.1080/10106049.2011.562309

  6. [6]

    A high-performance and in-season classification system of field-level crop types using time-series Landsat data and machine learning,

    Y. Cai, K. Guan, J. Peng et al., "A high-performance and in-season classification system of field-level crop types using time-series Landsat data and machine learning," Remote Sens. Environ., vol. 210, pp. 35-47, 2018, doi: 10.1016/j.rse.2018.02.045

  7. [7]

    Mapping of crop types and crop sequences with combined time series of Sentinel-1, Sentinel-2 and Landsat 8 data for Germany,

    L. Blickensdörfer, M. Schwieder, D. Pflugmacher et al., "Mapping of crop types and crop sequences with combined time series of Sentinel-1, Sentinel-2 and Landsat 8 data for Germany," Remote Sens. Environ., vol. 269, Art. no. 112831, 2022, doi: 10.1016/j.rse.2021.112831

  8. [8]

    Synergistic use of Sentinel-1 and Sentinel-2 images for in-season crop type classification,

    S. Sharma, D. Ryu, S. K. C. et al., "Synergistic use of Sentinel-1 and Sentinel-2 images for in-season crop type classification," in Proc. IGARSS, 2023, pp. 3498-3501, doi: 10.1109/IGARSS52108.2023.10282334

  9. [9]

    Development of a 10-m resolution maize and soybean map over China,

    H. Li et al., "Development of a 10-m resolution maize and soybean map over China," Remote Sens. Environ., vol. 294, Art. no. 113623, 2023, doi: 10.1016/j.rse.2023.113623

  10. [10]

    Rapid early-season maize mapping without crop labels,

    N. You, J. Dong, J. Li et al., "Rapid early-season maize mapping without crop labels," Remote Sens. Environ., vol. 290, Art. no. 113496, 2023, doi: 10.1016/j.rse.2023.113496

  11. [11]

    Mapping crops within the growing season across the United States,

    V. Konduri et al., "Mapping crops within the growing season across the United States," Remote Sens. Environ., vol. 251, Art. no. 112048, 2020, doi: 10.1016/j.rse.2020.112048

  12. [12]

    Mapcurves: A quantitative method for comparing categorical maps,

    W. W. Hargrove, F. M. Hoffman, and P. F. Hessburg, "Mapcurves: A quantitative method for comparing categorical maps," J. Geographical Syst., vol. 8, no. 2, pp. 187-208, 2006, doi: 10.1007/s10109-006-0025-x

  13. [13]

    Pre- and within-season crop type classification trained with archival land-cover information,

    D. Johnson and R. Mueller, "Pre- and within-season crop type classification trained with archival land-cover information," Remote Sens. Environ., vol. 264, Art. no. 112576, 2021, doi: 10.1016/j.rse.2021.112576

  14. [14]

    In-season wall-to-wall crop-type mapping using ensemble of image-segmentation models,

    S. A. Zaheer et al., "In-season wall-to-wall crop-type mapping using ensemble of image-segmentation models," IEEE Trans. Geosci. Remote Sens., vol. 61, Art. no. 4411311, 2023, doi: 10.1109/TGRS.2023.3335214

  15. [15]

    Automated in-season crop-type data-layer mapping without ground truth for CONUS,

    H. Li et al., "Automated in-season crop-type data-layer mapping without ground truth for CONUS," IEEE Trans. Geosci. Remote Sens., vol. 62, Art. no. 4403214, 2024, doi: 10.1109/TGRS.2024.3361895

  16. [16]

    The Harmonized Landsat and Sentinel-2 surface reflectance data set,

    M. Claverie et al., "The Harmonized Landsat and Sentinel-2 surface reflectance data set," Remote Sens. Environ., vol. 219, pp. 145-161, 2018, doi: 10.1016/j.rse.2018.09.002

  17. [17]

    The Harmonized Landsat and Sentinel-2 Version 2.0 Surface Reflectance Dataset,

    J. Ju, Q. Zhou, B. Freitag, D. P. Roy, H. K. Zhang, M. Sridhar, J. Mandel, S. Arab, G. Schmidt, C. J. Crawford, F. Gascon, P. A. Strobl, J. G. Masek, and C. S. R. Neigh, "The Harmonized Landsat and Sentinel-2 Version 2.0 Surface Reflectance Dataset," Remote Sens. Environ., vol. 324, Art. no. 114723, 2025, doi: 10.1016/j.rse.2025.114723

  18. [18]

    Random forests,

    L. Breiman, "Random forests," Mach. Learn., vol. 45, pp. 5-32, 2001

  19. [19]

    Scikit-learn: Machine learning in Python,

    F. Pedregosa et al., "Scikit-learn: Machine learning in Python," J. Mach. Learn. Res., vol. 12, pp. 2825-2830, 2011

  20. [20]

    Extremely randomized trees,

    P. Geurts, D. Ernst, and L. Wehenkel, "Extremely randomized trees," Mach. Learn., vol. 63, pp. 3-42, 2006

  21. [21]

    The regression analysis of binary sequences,

    D. R. Cox, "The regression analysis of binary sequences," J. Roy. Stat. Soc. B, vol. 20, pp. 215-242, 1958

  22. [22]

    SAGA: A Fast Incremental Gradient Method With Support for Non-Strongly Convex Composite Objectives

    A. Defazio, F. Bach, and S. Lacoste-Julien, "SAGA: A fast incremental gradient method with support for non- strongly convex composite objectives," arXiv:1407.0202, 2014, doi: 10.48550/arXiv.1407.0202

  23. [23]

    Support-vector networks,

    C. Cortes and V. Vapnik, "Support-vector networks," Mach. Learn., vol. 20, pp. 273-297, 1995

  24. [24]

    Learning Bayesian networks: The combination of knowledge and statistical data,

    D. Heckerman, D. Geiger, and D. M. Chickering, "Learning Bayesian networks: The combination of knowledge and statistical data," Mach. Learn., vol. 20, pp. 197-243, 1995

  25. [25]

    Discriminatory analysis, non- parametric discrimination: Consistency properties,

    E. Fix and J. L. Hodges, "Discriminatory analysis, non- parametric discrimination: Consistency properties," USAF School of Aviation Medicine, Tech. Rep. 21-49- 004, 1951

  26. [26]

    Beyond regression: New tools for prediction and analysis in the behavioral sciences,

    P. J. Werbos, "Beyond regression: New tools for prediction and analysis in the behavioral sciences," Ph.D. dissertation, Harvard Univ., Cambridge, MA, USA, 1975

  27. [27]

    Learning representations by back-propagating errors,

    D. E. Rumelhart, G. E. Hinton, and R. J. Williams, "Learning representations by back-propagating errors," Nature, vol. 323, pp. 533-536, 1986

  28. [28]

    On lines and planes of closest fit to systems of points in space,

    K. Pearson, "On lines and planes of closest fit to systems of points in space," Philos. Mag., vol. 2, pp. 559-572, 1901

  29. [29]

    Analysis of a complex of statistical variables into principal components,

    H. Hotelling, "Analysis of a complex of statistical variables into principal components," J. Educ. Psychol., vol. 24, pp. 417-441, 1933

  30. [30]

    Bagging predictors,

    L. Breiman, "Bagging predictors," Mach. Learn., vol. 24, pp. 123-140, 1996. August Posch received a B.A. in mathematics from Bowdoin College, Brunswick, ME , USA in 2018 , and M.S. in data science from Roux Institute and Khoury College of Computer Sciences, Northeastern University, Portland, ME, USA in 2022. He currently works as a Data Scientist and post...

  31. [31]

    Figure S5

    Reflectance scale has a factor of 105, so 1000 on the plot means 10% of the sun’s light was reflected back to the satellite. Figure S5. Green reflectance by year by period. Here, period 0 means days 1 -90, period 1 means days 91-104, period 2 means days 105-118, period 3 means days 119-132, period 4 means days 133-146, and period 5 means days 147-160. Ref...

  32. [32]

    Figure S7

    Reflectance scale has a factor of 10 5, so 1000 on the plot means 10% of the sun’s light was reflected back to the satellite. Figure S7. Near-infrared reflectance by year by period. Here, period 0 means days 1-90, period 1 means days 91- 104, period 2 means days 105-118, period 3 means days 119-132, period 4 means days 133-146, and period 5 means days 147...