A Neural-Mean Vecchia Gaussian Process for Unified Argo Modeling

Jian Cao; Nian Liu

arxiv: 2510.18067 · v2 · submitted 2025-10-20 · 📊 stat.CO · stat.ME

A Neural-Mean Vecchia Gaussian Process for Unified Argo Modeling

Nian Liu , Jian Cao This is my paper

Pith reviewed 2026-05-18 05:49 UTC · model grok-4.3

classification 📊 stat.CO stat.ME

keywords Gaussian process regressionArgo observationsVecchia approximationneural network meanspatio-temporal modelingocean temperature

0 comments

The pith

A neural mean and Vecchia approximation let one Gaussian process model Argo ocean temperature across broad domains without moving windows.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes a unified Gaussian process regression that handles both the mean and covariance for Argo temperature data in one step. A neural network supplies the mean structure while a generic spatio-temporal covariance is scaled with the Vecchia approximation to keep computation feasible for large regions. This data-driven setup matches the predictive accuracy of prior benchmarks that rely on moving-window regression and separate parametric mean fitting, using the same January-March 2007-2016 Argo observations. A sympathetic reader would care because the method removes the need to hand-design local windows and mean forms for each new ocean domain.

Core claim

The authors claim that their Neural-Mean Vecchia Gaussian Process supplies a single, flexible framework for jointly modeling mean and covariance in Argo temperature data, delivering predictive performance comparable to established moving-window methods while scaling to large spatial regions through the Vecchia approximation.

What carries the argument

Neural network mean function combined with a Vecchia-approximated global spatio-temporal covariance inside the Gaussian process, allowing joint estimation without localization or separate mean subtraction.

If this is right

Replaces case-specific choices of mean structures and window sizes with a single data-driven model.
Keeps predictive accuracy while cutting computational cost from cubic to quasi-linear in the number of observations.
Enables consistent large-scale ocean temperature analysis with one covariance structure across wide regions.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same structure could be applied to Argo salinity or other variables collected on the same floats.
It offers a template for unified modeling of other large environmental spatio-temporal datasets that currently rely on local windows.
One could test whether the neural mean automatically learns known ocean features such as seasonal thermocline shifts.

Load-bearing premise

A single neural mean and one global spatio-temporal covariance function are flexible enough to capture ocean temperature structure across broad domains without the localization supplied by moving windows.

What would settle it

On the same Argo temperature data from January to March over 2007-2016, if the new model's out-of-sample prediction errors exceed those of the moving-window benchmarks, the unified approach would fail to deliver on its performance claim.

Figures

Figures reproduced from arXiv: 2510.18067 by Jian Cao, Nian Liu.

read the original abstract

Argo is an international program that collects temperature and salinity observations in the upper two kilometers of the global ocean. Most existing approaches for modeling Argo temperature rely on localized modeling within moving windows, first estimating a prescribed mean structure and then fitting Gaussian processes (GPs) to the mean-subtracted anomalies. Such strategies introduce challenges in designing suitable mean structures and defining local moving windows, often resulting in case-specific modeling choices. In this work, we propose a one-stop Gaussian process regression framework with a flexible mean structure and a generic spatio-temporal covariance function to jointly model Argo temperature data across broad spatial domains. Our fully data-driven approach achieves predictive performance that compares favorably with the established benchmarks that require moving-window regression and separate parametric mean estimation. To ensure scalability over large spatial regions, we employ the Vecchia approximation, which reduces the computational complexity from cubic to quasi-linear in the number of observations while preserving predictive accuracy. Using Argo data from January to March over the years 2007-2016, the same dataset used in prior benchmark studies, we demonstrate that our approach provides a unified and data-driven alternative for large-scale oceanographic analysis.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Neural-mean Vecchia GP unifies Argo modeling without moving windows, but lacks ablations showing the global covariance holds up against localized baselines.

read the letter

The main point is that the authors replace the standard two-step Argo pipeline with a single GP that uses a neural network for the mean and a generic spatio-temporal covariance, then applies Vecchia to keep computation feasible on large datasets. They evaluate on the same 2007-2016 January-March Argo temperature records used in prior benchmarks and report that predictive performance holds up without the usual windowing or parametric mean choices. This is the concrete novelty: a data-driven mean and covariance fit inside one scalable framework for broad ocean domains. It does well at spelling out why moving windows and hand-designed means create extra modeling decisions, and the Vecchia step is a direct way to address the cubic scaling problem that otherwise blocks global fits. The approach is straightforward to understand and targets a real workflow pain point in environmental statistics. The soft spot is the missing check on whether the global covariance actually works once the neural mean is in place. Ocean temperature has sharp regional and depth-dependent structure, so residuals after the mean could still be non-stationary; without an ablation that swaps in localized covariance on identical data splits, it is hard to know how much the unified model truly gains over the benchmarks. The abstract-level claim of favorable performance would be stronger with those numbers and controls shown explicitly. This is for readers already working on scalable GPs for sensor data or oceanographic applications who want fewer case-by-case choices. It deserves a serious referee because the data are public, the method is a reasonable extension, and the application matters enough that referees can verify the empirical details and suggest the needed comparisons.

Referee Report

3 major / 1 minor

Summary. The paper presents a unified Gaussian process model for Argo temperature data using a neural network to model the mean function and a generic spatio-temporal covariance function approximated by the Vecchia method for scalability. It claims this data-driven approach provides predictive performance that compares favorably to benchmark methods relying on moving-window regression and parametric mean estimation, using the same 2007-2016 Argo dataset.

Significance. If validated, this could offer a more streamlined alternative for modeling large-scale ocean data by avoiding the complexities of localized windows and custom mean functions. The neural mean provides flexibility while Vecchia ensures computational feasibility for large datasets.

major comments (3)

[Abstract] Abstract: The central claim that the approach 'achieves predictive performance that compares favorably with the established benchmarks' is asserted without any quantitative metrics (RMSE, CRPS, log predictive density), error bars, tables, or description of the validation protocol (train/test splits, cross-validation scheme, or exact benchmark replication). This omission is load-bearing for the headline empirical result.
[Model and Experiments] Model and Experiments: No ablation is presented that isolates the global spatio-temporal covariance (with neural mean) against an otherwise identical localized baseline on the same data splits. Given known non-stationarity in ocean temperature (thermocline depth, eddy scales varying with latitude and depth), this directly tests whether the single global kernel suffices without moving-window localization.
[Implementation] Implementation: Full derivation of the joint neural-mean + Vecchia likelihood, neural architecture details, hyperparameter optimization procedure, and exact free-parameter count are absent, preventing assessment of whether the model reduces to quantities already fitted in the cited benchmarks or introduces new degrees of freedom.

minor comments (1)

[Abstract] Abstract: The exact subset (January–March only) and how it matches the benchmark studies' data selection is not stated explicitly.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their careful reading and constructive comments. We address each major point below and will revise the manuscript accordingly where appropriate.

read point-by-point responses

Referee: [Abstract] Abstract: The central claim that the approach 'achieves predictive performance that compares favorably with the established benchmarks' is asserted without any quantitative metrics (RMSE, CRPS, log predictive density), error bars, tables, or description of the validation protocol (train/test splits, cross-validation scheme, or exact benchmark replication). This omission is load-bearing for the headline empirical result.

Authors: We agree that the abstract would benefit from explicit quantitative support for the central claim. The full manuscript (Section 4) reports RMSE, CRPS, and log predictive density on the 2007-2016 Argo data using the identical validation protocol and benchmark implementations as the cited prior studies. We will revise the abstract to include the key numerical results (e.g., our RMSE and CRPS values relative to the moving-window benchmarks) and a concise statement of the evaluation protocol. revision: yes
Referee: [Model and Experiments] Model and Experiments: No ablation is presented that isolates the global spatio-temporal covariance (with neural mean) against an otherwise identical localized baseline on the same data splits. Given known non-stationarity in ocean temperature (thermocline depth, eddy scales varying with latitude and depth), this directly tests whether the single global kernel suffices without moving-window localization.

Authors: The manuscript's primary goal is to demonstrate that a single global model with a flexible neural mean and generic spatio-temporal kernel can match or exceed the performance of established localized approaches on the same dataset. Direct comparisons are already made to the moving-window regression benchmarks that incorporate localization. While an explicit ablation isolating only the covariance structure would be informative, it would require re-implementing a localized Vecchia baseline with the identical neural mean, which is computationally intensive. We will add a discussion of non-stationarity and the implications of the global kernel in the revised text, and we are open to including a limited ablation if space and compute permit. revision: partial
Referee: [Implementation] Implementation: Full derivation of the joint neural-mean + Vecchia likelihood, neural architecture details, hyperparameter optimization procedure, and exact free-parameter count are absent, preventing assessment of whether the model reduces to quantities already fitted in the cited benchmarks or introduces new degrees of freedom.

Authors: We acknowledge these implementation details were insufficiently documented. The revised manuscript will include: (i) the explicit joint likelihood derivation combining the neural mean with the Vecchia-approximated covariance, (ii) the neural network architecture (layer widths, activations, and input features), (iii) the hyperparameter optimization procedure (including optimizer, learning rate schedule, and early stopping), and (iv) the total number of free parameters. These elements are distinct from the parametric means in the cited benchmarks because the neural mean introduces additional trainable weights. revision: yes

Circularity Check

0 steps flagged

No significant circularity in the neural-mean Vecchia GP derivation

full rationale

The paper introduces a hybrid model with a neural network mean function, a single global spatio-temporal covariance kernel, and Vecchia approximation for scalable GP regression on Argo data. It fits this unified structure directly to the 2007-2016 temperature observations and reports empirical predictive performance against external benchmarks that use moving-window regression plus separate parametric means. No equation or claim reduces a 'prediction' to a quantity already fitted by construction, nor does any load-bearing step collapse to a self-citation whose validity is internal to the present work. The central performance comparison is therefore an independent empirical test rather than a tautology.

Axiom & Free-Parameter Ledger

2 free parameters · 1 axioms · 0 invented entities

Abstract-only access prevents enumeration of exact neural-network parameters or covariance hyperparameters; the central claim rests on the domain assumption that the Vecchia approximation preserves accuracy and that a global covariance suffices.

free parameters (2)

Neural network parameters for mean function
Weights and biases of the neural mean are fitted to data; count and architecture unspecified in abstract.
Spatio-temporal covariance hyperparameters
Length scales and variance parameters of the generic covariance function are fitted; exact form not given.

axioms (1)

domain assumption Vecchia approximation preserves predictive accuracy for this dataset
Invoked to justify quasi-linear scaling while claiming comparable performance to full GPs.

pith-pipeline@v0.9.0 · 5726 in / 1315 out tokens · 60267 ms · 2026-05-18T05:49:41.825822+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We propose a one-stop Gaussian process regression framework with a flexible mean structure and a generic spatio-temporal covariance function to jointly model Argo temperature data across broad spatial domains... employ the Vecchia approximation, which reduces the computational complexity from cubic to quasi-linear
IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Our fully data-driven approach achieves predictive performance that compares favorably with the established benchmarks that require moving-window regression and separate parametric mean estimation.

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

40 extracted references · 40 canonical work pages

[1]

Argo float data and metadata from Global Data Assembly Centre (Argo GDAC).https://doi.org/10.17882/42182

Argo (2000). Argo float data and metadata from Global Data Assembly Centre (Argo GDAC).https://doi.org/10.17882/42182

work page doi:10.17882/42182 2000
[2]

Bevilacqua, M., Gaetan, C., Mateu, J., and Porcu, E. (2012). Estimating space and space- time covariance functions for large data sets: a weighted composite likelihood approach. Journal of the American Statistical Association, 107(497):268–280

work page 2012
[3]

and Wallin, J

Bolin, D. and Wallin, J. (2020). Multivariate type g mat´ ern stochastic partial differen- tial equation random fields.Journal of the Royal Statistical Society Series B: Statistical Methodology, 82(1):215–239

work page 2020
[4]

P., Davis, R

Bretherton, F. P., Davis, R. E., and Fandry, C. (1976). A technique for objective analysis and design of oceanographic experiments applied to mode-73.Deep Sea Research and Oceanographic Abstracts, 23(7):559–582

work page 1976
[5]

G., and Katzfuss, M

Cao, J., Guinness, J., Genton, M. G., and Katzfuss, M. (2022). Scalable Gaussian-process 24 regression and variable selection using Vecchia approximations.Journal of Machine Learn- ing Research, 23(348):1–30

work page 2022
[6]

T., and Katzfuss, M

Cao, J., Kang, M., Jimenez, F., Sang, H., Schaefer, F. T., and Katzfuss, M. (2023). Vari- ational Sparse Inverse Cholesky Approximation for Latent Gaussian Processes via Double Kullback-Leibler Minimization. InProceedings of the 40th International Conference on Machine Learning, pages 3559–3576. PMLR. ISSN: 2640-3498

work page 2023
[7]

Cao, J., ZHANG, J., SUN, Z., and Katzfuss, M. (2024). Locally Anisotropic Non- stationary Covariance Functions on the Sphere.Journal of Agricultural, Biological and Environmental Statistics, 29(2):212–231

work page 2024
[8]

O., and Gelfand, A

Datta, A., Banerjee, S., Finley, A. O., and Gelfand, A. E. (2016). Hierarchical nearest- neighbor gaussian process models for large geostatistical datasets.Journal of the American Statistical Association, 111(514):800–812

work page 2016
[9]

Du, J., Zhang, H., and Mandrekar, V. S. (2009). Fixed-domain asymptotic properties of tapered maximum likelihood estimators.The Annals of Statistics, 37(6A):3330–3361

work page 2009
[10]

A., Reich, B

Eidsvik, J., Shaby, B. A., Reich, B. J., Wheeler, M., and Niemi, J. (2014). Estimation and prediction in spatial models with block composite likelihoods.Journal of Computa- tional and Graphical Statistics, 23(2):295–315

work page 2014
[11]

O., Datta, A., Cook, B

Finley, A. O., Datta, A., Cook, B. D., Morton, D. C., Andersen, H. E., and Banerjee, S. (2019). Efficient algorithms for bayesian nearest neighbor gaussian processes.Journal of Computational and Graphical Statistics, 28(2):401–414. PMID: 31543693

work page 2019
[12]

G., and Nychka, D

Furrer, R., Genton, M. G., and Nychka, D. (2006). Covariance tapering for interpolation of large spatial datasets.Journal of Computational and Graphical Statistics, 15(3):502– 523. 25

work page 2006
[13]

Gasparin, F., Roemmich, D., Gilson, J., and Cornuelle, B. (2015). Assessment of the upper-ocean observing system in the equatorial pacific: The role of argo in resolving intraseasonal to interannual variability.Journal of Atmospheric and Oceanic Technology, 32(9):1668 – 1688

work page 2015
[14]

Gray, A. R. and Riser, S. C. (2015). A method for multiscale optimal analysis with application to argo data.Journal of Geophysical Research: Oceans, 120(6):4340–4356

work page 2015
[15]

Guinness, J. (2018). Permutation and grouping methods for sharpening gaussian process approximations.Technometrics, 60(4):415–429

work page 2018
[16]

Guinness, J. (2021). Gaussian process learning via Fisher scoring of Vecchia’s approxi- mation.Statistics and Computing, 31(3):25

work page 2021
[17]

J., Kuusela, M., Lee, A

Hu, A. J., Kuusela, M., Lee, A. B., Giglio, D., and Wood, K. M. (2024). Spatiotemporal methods for estimating subsurface ocean thermal response to tropical cyclones.Advances in Statistical Climatology, Meteorology and Oceanography, 10(2):69–93

work page 2024
[18]

Katzfuss, M. (2017). A multi-resolution approximation for massive spatial datasets. Journal of the American Statistical Association, 112(517):201–214

work page 2017
[19]

and Guinness, J

Katzfuss, M. and Guinness, J. (2021). A General Framework for Vecchia Approxima- tions of Gaussian Processes.Statistical Science, 36(1):124–141. Publisher: Institute of Mathematical Statistics

work page 2021
[20]

Katzfuss, M., Guinness, J., and Lawrence, E. (2022). Scaled Vecchia Approximation for Fast Computer-Model Emulation.SIAM/ASA Journal on Uncertainty Quantification, 10(2):537–554

work page 2022
[21]

G., Schervish, M

Kaufman, C. G., Schervish, M. J., and Nychka, D. W. (2008). Covariance tapering for likelihood-based estimation in large spatial data sets.Journal of the American Statistical Association, 103(484):1545–1555. 26

work page 2008
[22]

Korte-Stapff, M., Yarger, D., Stoev, S., and Hsing, T. (2022). A functional regression model for heterogeneous biogeochemical argo data in the southern ocean.https://doi. org/10.48550/arXiv.2211.04012. arXiv:2211.04012

work page doi:10.48550/arxiv.2211.04012 2022
[23]

and Stein, M

Kuusela, M. and Stein, M. L. (2018). Locally stationary spatio-temporal interpolation of argo profiling float data.Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences, 474

work page 2018
[24]

Lindgren, F., Rue, H., and Lindstr¨ om, J. (2011). An explicit link between gaussian fields and gaussian markov random fields: the stochastic partial differential equation approach. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 73(4):423–498

work page 2011
[25]

Neal, R. M. (2012).Bayesian learning for neural networks, volume 118. Springer Science & Business Media

work page 2012
[26]

Nychka, D., Bandyopadhyay, S., Hammerling, D., Lindgren, F., and Sain, S. (2015). A multiresolution gaussian process model for the analysis of large spatial datasets.Journal of Computational and Graphical Statistics, 24(2):579–599

work page 2015
[27]

R., Dunn, J

Ridgway, K. R., Dunn, J. R., and Wilkin, J. L. (2002). Ocean interpolation by four- dimensional weighted least squares—application to the waters around australasia.Journal of Atmospheric and Oceanic Technology, 19(9):1357 – 1375

work page 2002
[28]

Roemmich, D. (1983). Optimal estimation of hydrographic station data and derived fields.Journal of Physical Oceanography, 13(8):1544–1549

work page 1983
[29]

and Gilson, J

Roemmich, D. and Gilson, J. (2009). The 2004–2008 mean and annual cycle of temper- ature, salinity, and steric height in the global ocean from the argo program.Progress in Oceanography, 82(2):81–100

work page 2009
[30]

and Held, L

Rue, H. and Held, L. (2005).Gaussian Markov random fields: theory and applications. Chapman and Hall/CRC. 27

work page 2005
[31]

In: Proc

Salvana, M. L., Cao, J., and Jun, M. (2025). 3d bivariate spatial modelling of argo ocean temperature and salinity profiles.https://doi.org/10.48550/arXiv.2210. 11611. arXiv:2210.11611

work page doi:10.48550/arxiv.2210 2025
[32]

Sch¨ afer, F., Katzfuss, M., and Owhadi, H. (2021). Sparse cholesky factorization by kullback–leibler minimization.SIAM Journal on scientific computing, 43(3):A2019– A2046

work page 2021
[33]

Stein, M. L. (1999).Interpolation of Spatial Data. Springer New York, NY

work page 1999
[34]

L., Chi, Z., and Welty, L

Stein, M. L., Chi, Z., and Welty, L. J. (2004). Approximating Likelihoods for Large Spa- tial Data Sets.Journal of the Royal Statistical Society Series B: Statistical Methodology, 66(2):275–296

work page 2004
[35]

Varin, C., Reid, N., and Firth, D. (2011). An overview of composite likelihood methods. Statistica Sinica, 21(1):5–42

work page 2011
[36]

Vecchia, A. V. (1988). Estimation and Model Identification for Continuous Spatial Processes.Journal of the Royal Statistical Society. Series B (Methodological), 50(2):297–

work page 1988
[37]

Publisher: [Royal Statistical Society, Wiley]

work page
[38]

and Bhadra, A

Yarger, D. and Bhadra, A. (2023). Multivariate confluent hypergeometric covariance functions with simultaneous flexibility over smoothness and tail decay.https://doi.org/ 10.48550/arXiv.2312.05682. arXiv:2312.05682

work page doi:10.48550/arxiv.2312.05682 2023
[39]

Yarger, D., Stoev, S., and Hsing, T. (2022). A functional-data approach to the Argo data.The Annals of Applied Statistics, 16(1):216 – 246

work page 2022
[40]

Yarger, D., Stoev, S., and Hsing, T. (2023). Multivariate mat´ ern models – a spectral approach.https://doi.org/10.48550/arXiv.2309.02584. arXiv:2309.02584. 28

work page doi:10.48550/arxiv.2309.02584 2023

[1] [1]

Argo float data and metadata from Global Data Assembly Centre (Argo GDAC).https://doi.org/10.17882/42182

Argo (2000). Argo float data and metadata from Global Data Assembly Centre (Argo GDAC).https://doi.org/10.17882/42182

work page doi:10.17882/42182 2000

[2] [2]

Bevilacqua, M., Gaetan, C., Mateu, J., and Porcu, E. (2012). Estimating space and space- time covariance functions for large data sets: a weighted composite likelihood approach. Journal of the American Statistical Association, 107(497):268–280

work page 2012

[3] [3]

and Wallin, J

Bolin, D. and Wallin, J. (2020). Multivariate type g mat´ ern stochastic partial differen- tial equation random fields.Journal of the Royal Statistical Society Series B: Statistical Methodology, 82(1):215–239

work page 2020

[4] [4]

P., Davis, R

Bretherton, F. P., Davis, R. E., and Fandry, C. (1976). A technique for objective analysis and design of oceanographic experiments applied to mode-73.Deep Sea Research and Oceanographic Abstracts, 23(7):559–582

work page 1976

[5] [5]

G., and Katzfuss, M

Cao, J., Guinness, J., Genton, M. G., and Katzfuss, M. (2022). Scalable Gaussian-process 24 regression and variable selection using Vecchia approximations.Journal of Machine Learn- ing Research, 23(348):1–30

work page 2022

[6] [6]

T., and Katzfuss, M

Cao, J., Kang, M., Jimenez, F., Sang, H., Schaefer, F. T., and Katzfuss, M. (2023). Vari- ational Sparse Inverse Cholesky Approximation for Latent Gaussian Processes via Double Kullback-Leibler Minimization. InProceedings of the 40th International Conference on Machine Learning, pages 3559–3576. PMLR. ISSN: 2640-3498

work page 2023

[7] [7]

Cao, J., ZHANG, J., SUN, Z., and Katzfuss, M. (2024). Locally Anisotropic Non- stationary Covariance Functions on the Sphere.Journal of Agricultural, Biological and Environmental Statistics, 29(2):212–231

work page 2024

[8] [8]

O., and Gelfand, A

Datta, A., Banerjee, S., Finley, A. O., and Gelfand, A. E. (2016). Hierarchical nearest- neighbor gaussian process models for large geostatistical datasets.Journal of the American Statistical Association, 111(514):800–812

work page 2016

[9] [9]

Du, J., Zhang, H., and Mandrekar, V. S. (2009). Fixed-domain asymptotic properties of tapered maximum likelihood estimators.The Annals of Statistics, 37(6A):3330–3361

work page 2009

[10] [10]

A., Reich, B

Eidsvik, J., Shaby, B. A., Reich, B. J., Wheeler, M., and Niemi, J. (2014). Estimation and prediction in spatial models with block composite likelihoods.Journal of Computa- tional and Graphical Statistics, 23(2):295–315

work page 2014

[11] [11]

O., Datta, A., Cook, B

Finley, A. O., Datta, A., Cook, B. D., Morton, D. C., Andersen, H. E., and Banerjee, S. (2019). Efficient algorithms for bayesian nearest neighbor gaussian processes.Journal of Computational and Graphical Statistics, 28(2):401–414. PMID: 31543693

work page 2019

[12] [12]

G., and Nychka, D

Furrer, R., Genton, M. G., and Nychka, D. (2006). Covariance tapering for interpolation of large spatial datasets.Journal of Computational and Graphical Statistics, 15(3):502– 523. 25

work page 2006

[13] [13]

Gasparin, F., Roemmich, D., Gilson, J., and Cornuelle, B. (2015). Assessment of the upper-ocean observing system in the equatorial pacific: The role of argo in resolving intraseasonal to interannual variability.Journal of Atmospheric and Oceanic Technology, 32(9):1668 – 1688

work page 2015

[14] [14]

Gray, A. R. and Riser, S. C. (2015). A method for multiscale optimal analysis with application to argo data.Journal of Geophysical Research: Oceans, 120(6):4340–4356

work page 2015

[15] [15]

Guinness, J. (2018). Permutation and grouping methods for sharpening gaussian process approximations.Technometrics, 60(4):415–429

work page 2018

[16] [16]

Guinness, J. (2021). Gaussian process learning via Fisher scoring of Vecchia’s approxi- mation.Statistics and Computing, 31(3):25

work page 2021

[17] [17]

J., Kuusela, M., Lee, A

Hu, A. J., Kuusela, M., Lee, A. B., Giglio, D., and Wood, K. M. (2024). Spatiotemporal methods for estimating subsurface ocean thermal response to tropical cyclones.Advances in Statistical Climatology, Meteorology and Oceanography, 10(2):69–93

work page 2024

[18] [18]

Katzfuss, M. (2017). A multi-resolution approximation for massive spatial datasets. Journal of the American Statistical Association, 112(517):201–214

work page 2017

[19] [19]

and Guinness, J

Katzfuss, M. and Guinness, J. (2021). A General Framework for Vecchia Approxima- tions of Gaussian Processes.Statistical Science, 36(1):124–141. Publisher: Institute of Mathematical Statistics

work page 2021

[20] [20]

Katzfuss, M., Guinness, J., and Lawrence, E. (2022). Scaled Vecchia Approximation for Fast Computer-Model Emulation.SIAM/ASA Journal on Uncertainty Quantification, 10(2):537–554

work page 2022

[21] [21]

G., Schervish, M

Kaufman, C. G., Schervish, M. J., and Nychka, D. W. (2008). Covariance tapering for likelihood-based estimation in large spatial data sets.Journal of the American Statistical Association, 103(484):1545–1555. 26

work page 2008

[22] [22]

Korte-Stapff, M., Yarger, D., Stoev, S., and Hsing, T. (2022). A functional regression model for heterogeneous biogeochemical argo data in the southern ocean.https://doi. org/10.48550/arXiv.2211.04012. arXiv:2211.04012

work page doi:10.48550/arxiv.2211.04012 2022

[23] [23]

and Stein, M

Kuusela, M. and Stein, M. L. (2018). Locally stationary spatio-temporal interpolation of argo profiling float data.Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences, 474

work page 2018

[24] [24]

Lindgren, F., Rue, H., and Lindstr¨ om, J. (2011). An explicit link between gaussian fields and gaussian markov random fields: the stochastic partial differential equation approach. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 73(4):423–498

work page 2011

[25] [25]

Neal, R. M. (2012).Bayesian learning for neural networks, volume 118. Springer Science & Business Media

work page 2012

[26] [26]

Nychka, D., Bandyopadhyay, S., Hammerling, D., Lindgren, F., and Sain, S. (2015). A multiresolution gaussian process model for the analysis of large spatial datasets.Journal of Computational and Graphical Statistics, 24(2):579–599

work page 2015

[27] [27]

R., Dunn, J

Ridgway, K. R., Dunn, J. R., and Wilkin, J. L. (2002). Ocean interpolation by four- dimensional weighted least squares—application to the waters around australasia.Journal of Atmospheric and Oceanic Technology, 19(9):1357 – 1375

work page 2002

[28] [28]

Roemmich, D. (1983). Optimal estimation of hydrographic station data and derived fields.Journal of Physical Oceanography, 13(8):1544–1549

work page 1983

[29] [29]

and Gilson, J

Roemmich, D. and Gilson, J. (2009). The 2004–2008 mean and annual cycle of temper- ature, salinity, and steric height in the global ocean from the argo program.Progress in Oceanography, 82(2):81–100

work page 2009

[30] [30]

and Held, L

Rue, H. and Held, L. (2005).Gaussian Markov random fields: theory and applications. Chapman and Hall/CRC. 27

work page 2005

[31] [31]

In: Proc

Salvana, M. L., Cao, J., and Jun, M. (2025). 3d bivariate spatial modelling of argo ocean temperature and salinity profiles.https://doi.org/10.48550/arXiv.2210. 11611. arXiv:2210.11611

work page doi:10.48550/arxiv.2210 2025

[32] [32]

Sch¨ afer, F., Katzfuss, M., and Owhadi, H. (2021). Sparse cholesky factorization by kullback–leibler minimization.SIAM Journal on scientific computing, 43(3):A2019– A2046

work page 2021

[33] [33]

Stein, M. L. (1999).Interpolation of Spatial Data. Springer New York, NY

work page 1999

[34] [34]

L., Chi, Z., and Welty, L

Stein, M. L., Chi, Z., and Welty, L. J. (2004). Approximating Likelihoods for Large Spa- tial Data Sets.Journal of the Royal Statistical Society Series B: Statistical Methodology, 66(2):275–296

work page 2004

[35] [35]

Varin, C., Reid, N., and Firth, D. (2011). An overview of composite likelihood methods. Statistica Sinica, 21(1):5–42

work page 2011

[36] [36]

Vecchia, A. V. (1988). Estimation and Model Identification for Continuous Spatial Processes.Journal of the Royal Statistical Society. Series B (Methodological), 50(2):297–

work page 1988

[37] [37]

Publisher: [Royal Statistical Society, Wiley]

work page

[38] [38]

and Bhadra, A

Yarger, D. and Bhadra, A. (2023). Multivariate confluent hypergeometric covariance functions with simultaneous flexibility over smoothness and tail decay.https://doi.org/ 10.48550/arXiv.2312.05682. arXiv:2312.05682

work page doi:10.48550/arxiv.2312.05682 2023

[39] [39]

Yarger, D., Stoev, S., and Hsing, T. (2022). A functional-data approach to the Argo data.The Annals of Applied Statistics, 16(1):216 – 246

work page 2022

[40] [40]

Yarger, D., Stoev, S., and Hsing, T. (2023). Multivariate mat´ ern models – a spectral approach.https://doi.org/10.48550/arXiv.2309.02584. arXiv:2309.02584. 28

work page doi:10.48550/arxiv.2309.02584 2023