Conjugate Nearest Neighbor Gaussian Process Models for Efficient Statistical Interpolation of Large Spatial Data

Andrew O. Finley; Bruce D. Cook; Shinichiro Shirota; Sudipto Banerjee

arxiv: 1907.10109 · v1 · pith:YFITKEHInew · submitted 2019-07-23 · 📊 stat.ME · stat.AP· stat.CO

Conjugate Nearest Neighbor Gaussian Process Models for Efficient Statistical Interpolation of Large Spatial Data

Shinichiro Shirota , Andrew O. Finley , Bruce D. Cook , Sudipto Banerjee This is my paper

Pith reviewed 2026-05-24 17:00 UTC · model grok-4.3

classification 📊 stat.ME stat.APstat.CO

keywords spatial statisticsGaussian processesnearest-neighbor Gaussian processconjugate Bayesian inferencelarge spatial datastatistical interpolationLiDAR data analysis

0 comments

The pith

Conjugate Bayesian updates make low-rank plus nearest-neighbor Gaussian process models exact and scalable for massive spatial interpolation.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper develops scalable Gaussian process models for very large spatial datasets by pairing a low-rank Gaussian predictive process with a nearest-neighbor Gaussian process residual term. It fits these models through exact conjugate Bayesian computations instead of iterative sampling algorithms. The approach targets rich Bayesian inference while keeping matrix operations feasible for data sets with thousands to millions of locations. Simulation experiments test accuracy for long-range prediction, and the methods are applied to LiDAR data collected over a large forested region in Alaska.

Core claim

Low-rank Gaussian predictive processes combined with sparsity-inducing nearest-neighbor Gaussian processes can be fitted exactly using conjugate Bayesian modeling to avoid expensive iterative algorithms, delivering computationally efficient spatial interpolation that maintains accuracy for long-range prediction in both simulations and real LiDAR data.

What carries the argument

The conjugate nearest-neighbor Gaussian process that pairs a low-rank Gaussian predictive process with an NNGP residual process under exact conjugate Bayesian updates.

If this is right

Exact conjugate updates remove the need for MCMC or variational approximations in model fitting.
Computational cost scales linearly or near-linearly with the number of spatial locations rather than cubically.
The models support routine Bayesian interpolation on remotely sensed data sets of size previously considered intractable.
Simulation results indicate that long-range predictive performance remains stable across different choices of neighbor sets and rank.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The conjugacy structure may transfer to other hierarchical spatial models that admit closed-form updates once the covariance is fixed.
Real-time sequential updating becomes feasible for streaming environmental sensor networks.
The same low-rank plus sparse decomposition could be tested on non-Gaussian likelihoods by embedding it inside a larger conjugate framework.

Load-bearing premise

The combination of the low-rank Gaussian predictive process and the nearest-neighbor Gaussian process residual preserves sufficient accuracy for long-range spatial prediction under conjugate fitting.

What would settle it

A side-by-side comparison on a held-out spatial data set with known long-range dependence where the proposed model's mean squared prediction error exceeds that of a full Gaussian process or an alternative scalable method by more than a small fixed threshold.

Figures

Figures reproduced from arXiv: 1907.10109 by Andrew O. Finley, Bruce D. Cook, Shinichiro Shirota, Sudipto Banerjee.

**Figure 2.** Figure 2: (a) Tanana Inventory Unit (TIU) study area. Point symbols denote location of the Bonanza Creek [PITH_FULL_IMAGE:figures/full_fig_p008_2.png] view at source ↗

**Figure 3.** Figure 3: NNGP (a) and SLGP (b) CRPS search grid results for the BECF. The “optimal” parameter combination, [PITH_FULL_IMAGE:figures/full_fig_p008_3.png] view at source ↗

**Figure 4.** Figure 4: Semivariogram of BCEF non-spatial regression [PITH_FULL_IMAGE:figures/full_fig_p009_4.png] view at source ↗

**Figure 5.** Figure 5: BCEF and TIU pixel-level SLGP model predicted square root of forest canopy height mean and associated [PITH_FULL_IMAGE:figures/full_fig_p010_5.png] view at source ↗

read the original abstract

A key challenge in spatial statistics is the analysis for massive spatially-referenced data sets. Such analyses often proceed from Gaussian process specifications that can produce rich and robust inference, but involve dense covariance matrices that lack computationally exploitable structures. The matrix computations required for fitting such models involve floating point operations in cubic order of the number of spatial locations and dynamic memory storage in quadratic order. Recent developments in spatial statistics offer a variety of massively scalable approaches. Bayesian inference and hierarchical models, in particular, have gained popularity due to their richness and flexibility in accommodating spatial processes. Our current contribution is to provide computationally efficient exact algorithms for spatial interpolation of massive data sets using scalable spatial processes. We combine low-rank Gaussian processes with efficient sparse approximations. Following recent work by [1], we model the low-rank process using a Gaussian predictive process (GPP) and the residual process as a sparsity-inducing nearest-neighbor Gaussian process (NNGP). A key contribution here is to implement these models using exact conjugate Bayesian modeling to avoid expensive iterative algorithms. Through the simulation studies, we evaluate performance of the proposed approach and the robustness of our models, especially for long range prediction. We implement our approaches for remotely sensed light detection and ranging (LiDAR) data collected over the US Forest Service Tanana Inventory Unit (TIU) in a remote portion of Interior Alaska.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

The paper adds exact conjugate Bayesian updates to the GPP plus NNGP spatial model so inference stays closed-form even at large scale.

read the letter

The core move is to take the low-rank Gaussian predictive process for the mean trend and the nearest-neighbor Gaussian process for the residual field, then fit the joint model with conjugate normal-inverse-gamma updates under a Gaussian likelihood. Because the combined covariance remains a low-rank plus sparse structure, the usual multivariate-normal conditioning formulas apply directly and deliver exact posteriors without MCMC or variational approximation. That is the incremental step beyond the earlier GPP and NNGP papers cited in the abstract. The simulation section checks long-range prediction performance, and the authors run the method on a LiDAR dataset covering a large Alaskan inventory unit, which is the right kind of scale test. The construction itself looks clean; conjugacy follows from the model assumptions without hidden inconsistencies. The main open question is how much accuracy is lost for long-range dependence once both the low-rank and nearest-neighbor approximations are active at once. The simulations are meant to address this, but the abstract gives no numerical detail on bias or coverage, so the strength of that evidence is still unclear. Readers who need exact Bayesian spatial interpolation on grids of 10^5–10^6 points will find the approach practical. It is worth sending to referees because the conjugacy claim is verifiable from the model statement and the application domain is concrete.

Referee Report

1 major / 1 minor

Summary. The manuscript proposes combining a low-rank Gaussian predictive process (GPP) with a sparsity-inducing nearest-neighbor Gaussian process (NNGP) residual process, fitted via exact conjugate Bayesian updates under a Gaussian likelihood, to enable scalable exact inference and spatial interpolation for massive datasets. Performance is assessed via simulation studies emphasizing long-range prediction accuracy and robustness, with an application to LiDAR data over the Tanana Inventory Unit in Alaska.

Significance. If the conjugate updates are correctly derived and the combined GPP+NNGP model retains the claimed long-range predictive accuracy without hidden approximation error, the work would provide a practical exact-Bayesian alternative to iterative methods for large spatial data, strengthening the set of scalable tools available for environmental and remote-sensing applications.

major comments (1)

[Abstract] The abstract states that simulation studies evaluate 'robustness of our models, especially for long range prediction,' yet no quantitative metrics, baseline comparisons, or error decompositions are referenced; without these details it is impossible to assess whether the central claim of preserved long-range accuracy holds or whether the NNGP residual compensates for GPP truncation as asserted.

minor comments (1)

[Abstract] The citation '[1]' for the GPP construction should be expanded to a full reference in the bibliography.

Simulated Author's Rebuttal

1 responses · 0 unresolved

We thank the referee for their constructive feedback and recommendation. We address the single major comment below and have made revisions to improve clarity in the abstract while preserving the manuscript's focus on the conjugate GPP-NNGP framework.

read point-by-point responses

Referee: [Abstract] The abstract states that simulation studies evaluate 'robustness of our models, especially for long range prediction,' yet no quantitative metrics, baseline comparisons, or error decompositions are referenced; without these details it is impossible to assess whether the central claim of preserved long-range accuracy holds or whether the NNGP residual compensates for GPP truncation as asserted.

Authors: We agree that the abstract would benefit from greater specificity to allow readers to immediately evaluate the long-range prediction claims. Section 4 of the manuscript reports simulation results with quantitative metrics including RMSE, CRPS, and 95% interval coverage probabilities, along with direct comparisons to full GP and standalone NNGP baselines. These results include error decompositions demonstrating that the NNGP residual process compensates for GPP truncation without degrading long-range accuracy. To address the referee's concern, we have revised the abstract to reference these metrics and comparisons explicitly. The revised abstract now reads in part: 'Through simulation studies, we evaluate performance using RMSE, CRPS, and coverage probabilities, demonstrating robustness especially for long-range prediction relative to full GP and NNGP baselines.' This revision is incorporated in the updated manuscript. revision: yes

Circularity Check

0 steps flagged

No significant circularity

full rationale

The derivation relies on standard conjugacy of multivariate normal likelihoods with Gaussian process priors, which follows directly from the usual updating formulas without any reduction to fitted parameters or self-citations. The combination of GPP and NNGP is presented as a modeling choice whose performance is assessed via independent simulation studies and real-data application; no equation or claim equates a prediction to its own input by construction. The cited prior work on GPP/NNGP is external and does not form a self-referential chain.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Abstract-only review; no explicit free parameters, axioms, or invented entities are described.

pith-pipeline@v0.9.0 · 5782 in / 1008 out tokens · 29236 ms · 2026-05-24T17:00:39.629794+00:00 · methodology

discussion (0)

Reference graph

Works this paper leans on

36 extracted references · 36 canonical work pages · 1 internal anchor

[1]

Smoothed full-scale approximation of gaussian process models for computation of large spatial datasets,

B. Zhang, H. Sang, and J. Z. Huang, “Smoothed full-scale approximation of gaussian process models for computation of large spatial datasets,”Statistica Sinica, 2018, doi:10.5705/ss.202017.0008

work page doi:10.5705/ss.202017.0008 2018
[2]

Geostatistics for large datasets,

Y. Sun, B. Li, and M. Genton, “Geostatistics for large datasets,” inAdvances And Challenges In Space-time Modelling Of Natural Events, J. Montero, E. Porcu, and M. Schlather, Eds. Berlin Heidelberg: Springer- Verlag, 2012, pp. 55–77

work page 2012
[3]

High-dimensional Bayesian geostatis- tics,

S. Banerjee, “High-dimensional Bayesian geostatis- tics,” Bayesian Analysis, vol. 12, pp. 583–614, 2017

work page 2017
[4]

A case study competition among methods for analyzing large spa- tial data,

M. Heaton, A. Datta, A. Finley, R. Furrer, R. Guhaniyogi, F. Gerber, R. B. Gramacy, J. Guiness, D. Hammerling, M. Katzfuss, F. Lindgren, D. Ny- chka, and A. Sun, F. Zammit-Mangion, “A case study competition among methods for analyzing large spa- tial data,”Journal of Agricultural Biological and En- vironmental Statistics, 2018, doi.org/10.1007/s13253- 018-00348-w

work page doi:10.1007/s13253- 2018
[5]

The ICESat-2 laser altimetry mission,

W. Abdalati, H. Zwally, R. Bindschadler, B. Csatho, S. Farrell, H. Fricker, R. Harding, Dand Kwok, M. Lefsky, T. Markus, A. Marshak, T. Neumann, S. Palm, B. Schutz, B. Smith, J. Spinhirne, and C. Webb, “The ICESat-2 laser altimetry mission,” Proceedings of the IEEE, vol. 98, no. 5, pp. 735–751, 2010

work page 2010
[6]

Ice, cloud, and land elevation satellite-2,

ICESat-2, “Ice, cloud, and land elevation satellite-2,” http://icesat.gsfc.nasa.gov/, 2015, accessed: 1-5-2015

work page 2015
[7]

Global ecosystem dynamics investigation Li- DAR,

GEDI, “Global ecosystem dynamics investigation Li- DAR,” http://science.nasa.gov/missions/gedi/, 2014, accessed: 1-5-2015

work page 2014
[8]

NASA Goddard’s LiDAR, Hyper- spectral and Thermal (g-liht) Airborne Imager,

B. Cook, L. Corp, R. Nelson, E. Middleton, D. Mor- ton, J. McCorkel, J. Masek, K. Ranson, V. Ly, and P. Montesano, “NASA Goddard’s LiDAR, Hyper- spectral and Thermal (g-liht) Airborne Imager,”Re- mote Sensing, vol. 5, no. 8, pp. 4045–4066, 2013

work page 2013
[9]

A full scale approximation of covariance functions for large spatial data sets

H. Sang and J. Z. Huang, “A full scale approximation of covariance functions for large spatial data sets.” Journal of the Royal Statistical Society, Series B , vol. 74, pp. 111–132, 2012

work page 2012
[10]

Eﬃcient algorithms for Bayesian nearest neighbor gaussian processes

A. O. Finley, A. Datta, B. C. Cook, D. C. Mor- ton, H. E. Andersen, and S. Banerjee, “Eﬃcient algorithms for Bayesian nearest neighbor gaussian processes.” Journal of Computational and Graphical Statistics, 2018, doi:10.1080/10618600.2018.1537924

work page doi:10.1080/10618600.2018.1537924 2018
[11]

Gaussain predictive process models for large spatial data sets

S. Banerjee, A. E. Gelfand, A. O. Finley, and H. Sang, “Gaussain predictive process models for large spatial data sets.” Journal of the Royal Statistical Society, Series B, vol. 70, pp. 825–848, 2008

work page 2008
[12]

Hierarchical nearest-neighbor Gaussian process models for large geostatistical datasets

A. Datta, S. Banerjee, A. O. Finley, and A. E. Gelfand, “Hierarchical nearest-neighbor Gaussian process models for large geostatistical datasets.”Jour- nal of the American Statistical Association, vol. 111, pp. 800–812, 2016

work page 2016
[13]

Improving the performance of predictive process modeling for large datasets

A. Finley, H. Sang, S. Banerjee, and A. E. Gelfand, “Improving the performance of predictive process modeling for large datasets.”Computational Statistics and Data Analysis, vol. 53, pp. 2873–2884, 2009

work page 2009
[14]

A multi-resolution approximation for massive spatial datasets

M. Katzfuss, “A multi-resolution approximation for massive spatial datasets.” Journal of the American Statistical Association, vol. 112, pp. 201–214, 2017

work page 2017
[15]

Fused Gaussian process for very large spatial data

P. Ma and E. L. Kang, “Fused Gaussian process for very large spatial data.” 2017, arXiv:1702.08797

work page arXiv 2017
[16]

Estimation of model identiﬁcation for continuous spatial processes

A. V. Vecchia, “Estimation of model identiﬁcation for continuous spatial processes.” Journal of the Royal Statistical Society, Series B, vol. 50, pp. 297–312, 1988

work page 1988
[17]

Permutation and grouping methods for sharpening Gaussian process approximations

J. Guinness, “Permutation and grouping methods for sharpening Gaussian process approximations.”Tech- nometrics, 2018, doi:10.1080/00401706.2018.1437476

work page doi:10.1080/00401706.2018.1437476 2018
[18]

Covariance approximation for large multivariate spatial data sets with an application to multiple climate model errors

H. Sang, M. Jun, and J. Z. Huang, “Covariance approximation for large multivariate spatial data sets with an application to multiple climate model errors.” Annals of Applied Statistics, vol. 4, pp. 2519–2548, 2011

work page 2011
[19]

Covariance structure of the Gibbs sampler with applications to the comparisons of estimators and augmentation schemes

J. S. Liu, W. H. Wong, and A. Kong, “Covariance structure of the Gibbs sampler with applications to the comparisons of estimators and augmentation schemes.”Biometrika, vol. 81, pp. 27–40, 1994

work page 1994
[20]

An optimized blas library based on gotoblas2

X. Zhang, “An optimized blas library based on gotoblas2.” https://github.com/xianyi/OpenBLAS/, 2016, accessed 2015-06-01

work page 2016
[21]

Openmp: an indus- try standard api for shared-memory programming,

L. Dagum and R. Menon, “Openmp: an indus- try standard api for shared-memory programming,” Computational Science & Engineering, IEEE, vol. 5, no. 1, pp. 46–55, 1998

work page 1998
[22]

Strictly proper scor- ing rules, prediction, and estimation

T. Gneiting and A. E. Raftery, “Strictly proper scor- ing rules, prediction, and estimation.”Journal of the American Statistical Association, vol. 102, pp. 359– 378, 2007

work page 2007
[23]

FARSITE: Fire Area Simulator - model development and evaluation,

M. A. Finney, “FARSITE: Fire Area Simulator - model development and evaluation,” U.S. Depart- ment of Agriculture, Forest Service, Rocky Mountain Research Station, Tech. Rep. Research Paper RMRS- RP-4, 2004

work page 2004
[24]

Beyond potential vegetation: Combining lidar data 12 and a height-structured model for carbon studies,

G. C. Hurtt, R. Dubayah, J. Drake, P. R. Moor- croft, S. W. Pacala, J. B. Blair, and M. G. Fearon, “Beyond potential vegetation: Combining lidar data 12 and a height-structured model for carbon studies,” Ecological Applications, vol. 14, no. 3, pp. 873–883, 2004

work page 2004
[25]

Guidance on spatial wildland ﬁre analysis: models, tools, and techniques

R. D. Stratton, “Guidance on spatial wildland ﬁre analysis: models, tools, and techniques.” U.S. Depart- ment of Agriculture, Forest Service, Rocky Mountain Research Station, Tech. Rep. General Technical Re- port RMRS-GTR-183, 2006

work page 2006
[26]

A global forest canopy height map from the moderate resolution imaging spectroradiometer and the geoscience laser altimeter system,

M. A. Lefsky, “A global forest canopy height map from the moderate resolution imaging spectroradiometer and the geoscience laser altimeter system,” Geophysical Research Letters , vol. 37, no. 15, 2010, l15401. [Online]. Available: http://dx.doi.org/10.1029/2010GL043622

work page doi:10.1029/2010gl043622 2010
[27]

Water availability predicts forest canopy height at the global scale,

T. Klein, C. Randin, and C. Korner, “Water availability predicts forest canopy height at the global scale,” Ecology Letters, vol. 18, no. 12, pp. 1311–1320, 2015. [Online]. Available: http: //dx.doi.org/10.1111/ele.12525

work page doi:10.1111/ele.12525 2015
[28]

Mapping forest canopy height globally with spaceborne lidar,

M. Simard, N. Pinto, J. B. Fisher, and A. Baccini, “Mapping forest canopy height globally with spaceborne lidar,” Journal of Geophysical Research: Biogeosciences , vol. 116, no. G4, 2011, doi:10.1029/2011JG001708. [Online]. Available: http://dx.doi.org/10.1029/2011JG001708

work page doi:10.1029/2011jg001708 2011
[29]

Forest biomass estimation over regional scales using multisource data,

A. Baccini, M. A. Friedl, C. E. Woodcock, and R. Warbington, “Forest biomass estimation over regional scales using multisource data,” Geophysical Research Letters , vol. 31, no. 10, 2004, doi:10.1029/2004GL019782. [Online]. Available: http://dx.doi.org/10.1029/2004GL019782

work page doi:10.1029/2004gl019782 2004
[30]

Bonanza creek experimental forest,

B. C. LTER, “Bonanza creek experimental forest,” http://www.lter.uaf.edu/research/study-sites-bcef, accessed: 11-5-19

work page
[31]

H. . T. I. G-LiHT: Goddard’s LiDAR, https://gliht. gsfc.nasa.gov/, 2019, accessed: 5-11-19

work page 2019
[32]

High- resolution global maps of 21st-century forest cover change,

M. C. Hansen, P. V. Potapov, R. Moore, M. Hancher, S. A. Turubanova, A. Tyukavina, D. Thau, S. V. Stehman, S. J. Goetz, T. R. Loveland, A. Kommareddy, A. Egorov, L. Chini, C. O. Justice, and J. R. G. Townshend, “High- resolution global maps of 21st-century forest cover change,” Science, vol. 342, no. 6160, pp. 850–853, 2013. [Online]. Available: http: //s...

work page 2013
[33]

Fire history in Alaska,

AICC, “Fire history in Alaska,” http://afsmaps.blm. gov/imf_ﬁrehistory/imf.jsp?site=ﬁrehistory, 2016, accessed: 3-8-16

work page 2016
[34]

Banerjee, B

S. Banerjee, B. P. Carlin, and A. E. Gelfand,Hier- archical Modeling and Analysis for Spatial Data, 2nd ed. Chapman and Hall/CRC, 2014

work page 2014
[35]

Practical bayesian modeling and inference for massive spatial data sets on modest computing environmentsâĂă,

L. Zhang, A. Datta, and S. Banerjee, “Practical bayesian modeling and inference for massive spatial data sets on modest computing environmentsâĂă,” Statistical Analysis and Data Mining: The ASA Data Science Journal, vol. 12, no. 3, pp. 197–209, 2019

work page 2019
[36]

Spatial Factor Models for High-Dimensional and Large Spatial Data: An Application in Forest Variable Mapping

D. Taylor-Rodriguez, A. O. Finley, A. Datta, C. Bab- cock, H. Andersen, B. D. Cook, D. C. Morton, and S. Banerjee, “Spatial factor models for high- dimensional and large spatial data: an application in forest variable mapping.” 2018, arXiv:1801.02078

work page internal anchor Pith review Pith/arXiv arXiv 2018

[1] [1]

Smoothed full-scale approximation of gaussian process models for computation of large spatial datasets,

B. Zhang, H. Sang, and J. Z. Huang, “Smoothed full-scale approximation of gaussian process models for computation of large spatial datasets,”Statistica Sinica, 2018, doi:10.5705/ss.202017.0008

work page doi:10.5705/ss.202017.0008 2018

[2] [2]

Geostatistics for large datasets,

Y. Sun, B. Li, and M. Genton, “Geostatistics for large datasets,” inAdvances And Challenges In Space-time Modelling Of Natural Events, J. Montero, E. Porcu, and M. Schlather, Eds. Berlin Heidelberg: Springer- Verlag, 2012, pp. 55–77

work page 2012

[3] [3]

High-dimensional Bayesian geostatis- tics,

S. Banerjee, “High-dimensional Bayesian geostatis- tics,” Bayesian Analysis, vol. 12, pp. 583–614, 2017

work page 2017

[4] [4]

A case study competition among methods for analyzing large spa- tial data,

M. Heaton, A. Datta, A. Finley, R. Furrer, R. Guhaniyogi, F. Gerber, R. B. Gramacy, J. Guiness, D. Hammerling, M. Katzfuss, F. Lindgren, D. Ny- chka, and A. Sun, F. Zammit-Mangion, “A case study competition among methods for analyzing large spa- tial data,”Journal of Agricultural Biological and En- vironmental Statistics, 2018, doi.org/10.1007/s13253- 018-00348-w

work page doi:10.1007/s13253- 2018

[5] [5]

The ICESat-2 laser altimetry mission,

W. Abdalati, H. Zwally, R. Bindschadler, B. Csatho, S. Farrell, H. Fricker, R. Harding, Dand Kwok, M. Lefsky, T. Markus, A. Marshak, T. Neumann, S. Palm, B. Schutz, B. Smith, J. Spinhirne, and C. Webb, “The ICESat-2 laser altimetry mission,” Proceedings of the IEEE, vol. 98, no. 5, pp. 735–751, 2010

work page 2010

[6] [6]

Ice, cloud, and land elevation satellite-2,

ICESat-2, “Ice, cloud, and land elevation satellite-2,” http://icesat.gsfc.nasa.gov/, 2015, accessed: 1-5-2015

work page 2015

[7] [7]

Global ecosystem dynamics investigation Li- DAR,

GEDI, “Global ecosystem dynamics investigation Li- DAR,” http://science.nasa.gov/missions/gedi/, 2014, accessed: 1-5-2015

work page 2014

[8] [8]

NASA Goddard’s LiDAR, Hyper- spectral and Thermal (g-liht) Airborne Imager,

B. Cook, L. Corp, R. Nelson, E. Middleton, D. Mor- ton, J. McCorkel, J. Masek, K. Ranson, V. Ly, and P. Montesano, “NASA Goddard’s LiDAR, Hyper- spectral and Thermal (g-liht) Airborne Imager,”Re- mote Sensing, vol. 5, no. 8, pp. 4045–4066, 2013

work page 2013

[9] [9]

A full scale approximation of covariance functions for large spatial data sets

H. Sang and J. Z. Huang, “A full scale approximation of covariance functions for large spatial data sets.” Journal of the Royal Statistical Society, Series B , vol. 74, pp. 111–132, 2012

work page 2012

[10] [10]

Eﬃcient algorithms for Bayesian nearest neighbor gaussian processes

A. O. Finley, A. Datta, B. C. Cook, D. C. Mor- ton, H. E. Andersen, and S. Banerjee, “Eﬃcient algorithms for Bayesian nearest neighbor gaussian processes.” Journal of Computational and Graphical Statistics, 2018, doi:10.1080/10618600.2018.1537924

work page doi:10.1080/10618600.2018.1537924 2018

[11] [11]

Gaussain predictive process models for large spatial data sets

S. Banerjee, A. E. Gelfand, A. O. Finley, and H. Sang, “Gaussain predictive process models for large spatial data sets.” Journal of the Royal Statistical Society, Series B, vol. 70, pp. 825–848, 2008

work page 2008

[12] [12]

Hierarchical nearest-neighbor Gaussian process models for large geostatistical datasets

A. Datta, S. Banerjee, A. O. Finley, and A. E. Gelfand, “Hierarchical nearest-neighbor Gaussian process models for large geostatistical datasets.”Jour- nal of the American Statistical Association, vol. 111, pp. 800–812, 2016

work page 2016

[13] [13]

Improving the performance of predictive process modeling for large datasets

A. Finley, H. Sang, S. Banerjee, and A. E. Gelfand, “Improving the performance of predictive process modeling for large datasets.”Computational Statistics and Data Analysis, vol. 53, pp. 2873–2884, 2009

work page 2009

[14] [14]

A multi-resolution approximation for massive spatial datasets

M. Katzfuss, “A multi-resolution approximation for massive spatial datasets.” Journal of the American Statistical Association, vol. 112, pp. 201–214, 2017

work page 2017

[15] [15]

Fused Gaussian process for very large spatial data

P. Ma and E. L. Kang, “Fused Gaussian process for very large spatial data.” 2017, arXiv:1702.08797

work page arXiv 2017

[16] [16]

Estimation of model identiﬁcation for continuous spatial processes

A. V. Vecchia, “Estimation of model identiﬁcation for continuous spatial processes.” Journal of the Royal Statistical Society, Series B, vol. 50, pp. 297–312, 1988

work page 1988

[17] [17]

Permutation and grouping methods for sharpening Gaussian process approximations

J. Guinness, “Permutation and grouping methods for sharpening Gaussian process approximations.”Tech- nometrics, 2018, doi:10.1080/00401706.2018.1437476

work page doi:10.1080/00401706.2018.1437476 2018

[18] [18]

Covariance approximation for large multivariate spatial data sets with an application to multiple climate model errors

H. Sang, M. Jun, and J. Z. Huang, “Covariance approximation for large multivariate spatial data sets with an application to multiple climate model errors.” Annals of Applied Statistics, vol. 4, pp. 2519–2548, 2011

work page 2011

[19] [19]

Covariance structure of the Gibbs sampler with applications to the comparisons of estimators and augmentation schemes

J. S. Liu, W. H. Wong, and A. Kong, “Covariance structure of the Gibbs sampler with applications to the comparisons of estimators and augmentation schemes.”Biometrika, vol. 81, pp. 27–40, 1994

work page 1994

[20] [20]

An optimized blas library based on gotoblas2

X. Zhang, “An optimized blas library based on gotoblas2.” https://github.com/xianyi/OpenBLAS/, 2016, accessed 2015-06-01

work page 2016

[21] [21]

Openmp: an indus- try standard api for shared-memory programming,

L. Dagum and R. Menon, “Openmp: an indus- try standard api for shared-memory programming,” Computational Science & Engineering, IEEE, vol. 5, no. 1, pp. 46–55, 1998

work page 1998

[22] [22]

Strictly proper scor- ing rules, prediction, and estimation

T. Gneiting and A. E. Raftery, “Strictly proper scor- ing rules, prediction, and estimation.”Journal of the American Statistical Association, vol. 102, pp. 359– 378, 2007

work page 2007

[23] [23]

FARSITE: Fire Area Simulator - model development and evaluation,

M. A. Finney, “FARSITE: Fire Area Simulator - model development and evaluation,” U.S. Depart- ment of Agriculture, Forest Service, Rocky Mountain Research Station, Tech. Rep. Research Paper RMRS- RP-4, 2004

work page 2004

[24] [24]

Beyond potential vegetation: Combining lidar data 12 and a height-structured model for carbon studies,

G. C. Hurtt, R. Dubayah, J. Drake, P. R. Moor- croft, S. W. Pacala, J. B. Blair, and M. G. Fearon, “Beyond potential vegetation: Combining lidar data 12 and a height-structured model for carbon studies,” Ecological Applications, vol. 14, no. 3, pp. 873–883, 2004

work page 2004

[25] [25]

Guidance on spatial wildland ﬁre analysis: models, tools, and techniques

R. D. Stratton, “Guidance on spatial wildland ﬁre analysis: models, tools, and techniques.” U.S. Depart- ment of Agriculture, Forest Service, Rocky Mountain Research Station, Tech. Rep. General Technical Re- port RMRS-GTR-183, 2006

work page 2006

[26] [26]

A global forest canopy height map from the moderate resolution imaging spectroradiometer and the geoscience laser altimeter system,

M. A. Lefsky, “A global forest canopy height map from the moderate resolution imaging spectroradiometer and the geoscience laser altimeter system,” Geophysical Research Letters , vol. 37, no. 15, 2010, l15401. [Online]. Available: http://dx.doi.org/10.1029/2010GL043622

work page doi:10.1029/2010gl043622 2010

[27] [27]

Water availability predicts forest canopy height at the global scale,

T. Klein, C. Randin, and C. Korner, “Water availability predicts forest canopy height at the global scale,” Ecology Letters, vol. 18, no. 12, pp. 1311–1320, 2015. [Online]. Available: http: //dx.doi.org/10.1111/ele.12525

work page doi:10.1111/ele.12525 2015

[28] [28]

Mapping forest canopy height globally with spaceborne lidar,

M. Simard, N. Pinto, J. B. Fisher, and A. Baccini, “Mapping forest canopy height globally with spaceborne lidar,” Journal of Geophysical Research: Biogeosciences , vol. 116, no. G4, 2011, doi:10.1029/2011JG001708. [Online]. Available: http://dx.doi.org/10.1029/2011JG001708

work page doi:10.1029/2011jg001708 2011

[29] [29]

Forest biomass estimation over regional scales using multisource data,

A. Baccini, M. A. Friedl, C. E. Woodcock, and R. Warbington, “Forest biomass estimation over regional scales using multisource data,” Geophysical Research Letters , vol. 31, no. 10, 2004, doi:10.1029/2004GL019782. [Online]. Available: http://dx.doi.org/10.1029/2004GL019782

work page doi:10.1029/2004gl019782 2004

[30] [30]

Bonanza creek experimental forest,

B. C. LTER, “Bonanza creek experimental forest,” http://www.lter.uaf.edu/research/study-sites-bcef, accessed: 11-5-19

work page

[31] [31]

H. . T. I. G-LiHT: Goddard’s LiDAR, https://gliht. gsfc.nasa.gov/, 2019, accessed: 5-11-19

work page 2019

[32] [32]

High- resolution global maps of 21st-century forest cover change,

M. C. Hansen, P. V. Potapov, R. Moore, M. Hancher, S. A. Turubanova, A. Tyukavina, D. Thau, S. V. Stehman, S. J. Goetz, T. R. Loveland, A. Kommareddy, A. Egorov, L. Chini, C. O. Justice, and J. R. G. Townshend, “High- resolution global maps of 21st-century forest cover change,” Science, vol. 342, no. 6160, pp. 850–853, 2013. [Online]. Available: http: //s...

work page 2013

[33] [33]

Fire history in Alaska,

AICC, “Fire history in Alaska,” http://afsmaps.blm. gov/imf_ﬁrehistory/imf.jsp?site=ﬁrehistory, 2016, accessed: 3-8-16

work page 2016

[34] [34]

Banerjee, B

S. Banerjee, B. P. Carlin, and A. E. Gelfand,Hier- archical Modeling and Analysis for Spatial Data, 2nd ed. Chapman and Hall/CRC, 2014

work page 2014

[35] [35]

Practical bayesian modeling and inference for massive spatial data sets on modest computing environmentsâĂă,

L. Zhang, A. Datta, and S. Banerjee, “Practical bayesian modeling and inference for massive spatial data sets on modest computing environmentsâĂă,” Statistical Analysis and Data Mining: The ASA Data Science Journal, vol. 12, no. 3, pp. 197–209, 2019

work page 2019

[36] [36]

Spatial Factor Models for High-Dimensional and Large Spatial Data: An Application in Forest Variable Mapping

D. Taylor-Rodriguez, A. O. Finley, A. Datta, C. Bab- cock, H. Andersen, B. D. Cook, D. C. Morton, and S. Banerjee, “Spatial factor models for high- dimensional and large spatial data: an application in forest variable mapping.” 2018, arXiv:1801.02078

work page internal anchor Pith review Pith/arXiv arXiv 2018