Validating remotely sensed biomass estimates with forest inventory data in the western US

Dimitrios Gounaridis; Joseph O. Sexton; Kai Zhu; Neil H. Carter; Panshi Wang; Xiuyu Cao

arxiv: 2506.03120 · v1 · submitted 2025-06-03 · 📊 stat.AP · cs.LG

Validating remotely sensed biomass estimates with forest inventory data in the western US

Xiuyu Cao , Joseph O. Sexton , Panshi Wang , Dimitrios Gounaridis , Neil H. Carter , Kai Zhu This is my paper

Pith reviewed 2026-05-19 10:41 UTC · model grok-4.3

classification 📊 stat.AP cs.LG

keywords aboveground biomassremote sensing validationforest inventoryGEDI LiDARcarbon monitoringwestern United StatesterraPulsespatial aggregation

0 comments

The pith

TerraPulse remote sensing biomass estimates match US Forest Service inventory data with high agreement when aggregated over large areas in Utah, Nevada, and Washington.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper sets out to test whether a commercial aboveground biomass density product derived from GEDI LiDAR and optical data can serve as a reliable replacement or supplement for traditional ground-based forest measurements. It does so by directly comparing the terraPulse dataset against independent reference data from the Forest Inventory and Analysis program, aggregated to 64,000-hectare hexagons and to county boundaries. A reader would care because accurate, spatially complete biomass maps are needed for carbon accounting, climate policy, and ecosystem management, yet most commercial products lack this kind of independent check. The results show strong statistical agreement at both scales, while also flagging systematic differences in non-forest zones and high-biomass forests.

Core claim

Aggregated to 64,000-hectare hexagons across Utah, Nevada, and Washington, terraPulse and FIA biomass estimates agree with R2 = 0.88, RMSE = 26.68 Mg/ha, and correlation r = 0.94; at the county scale the agreement rises to R2 = 0.90, RMSE = 32.62 Mg/ha, slope = 1.07, and r = 0.95. The comparison reveals that terraPulse values tend to be higher than FIA in non-forest areas and lower in dense forests, patterns the authors attribute to FIA sampling limitations and optical saturation effects.

What carries the argument

Direct statistical comparison of two independent biomass datasets after aggregation to identical 64,000-hectare hexagonal tiles and county polygons, which converts point-scale inventory plots into area-representative reference values.

If this is right

The terraPulse dataset can be treated as sufficiently accurate for regional carbon stock assessments in similar western landscapes.
The same FIA-based aggregation method supplies a practical template for validating other GEDI-derived or optical biomass products.
Adjustments for non-forest vegetation and high-biomass saturation would be required before the product is used at finer resolutions.
Operational carbon monitoring programs can now cite this benchmark when adopting commercial remote-sensing layers.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The validation framework could be extended to additional states or continents where FIA-style inventories exist, creating a global consistency check.
Discrepancies in non-forest areas point to a broader need for ground sampling programs that deliberately target low-biomass cover types.
If saturation effects in high-biomass forests prove systematic across multiple sensors, hybrid models that incorporate radar or additional LiDAR metrics may be required.

Load-bearing premise

That the FIA plot network supplies an unbiased reference for biomass across both forest and non-forest land at the chosen aggregation scales.

What would settle it

Repeating the comparison at a finer aggregation scale (for example, 1-km grid cells) and observing a sharp drop in R2 or a large increase in RMSE would indicate that the reported agreement is an artifact of spatial averaging.

read the original abstract

Monitoring aboveground biomass (AGB) and its density (AGBD) at high resolution is essential for carbon accounting and ecosystem management. While NASA's spaceborne Global Ecosystem Dynamics Investigation (GEDI) LiDAR mission provides globally distributed reference measurements for AGBD estimation, the majority of commercial remote sensing products based on GEDI remain without rigorous or independent validation. Here, we present an independent regional validation of an AGBD dataset offered by terraPulse, Inc., based on independent reference data from the US Forest Service Forest Inventory and Analysis (FIA) program. Aggregated to 64,000-hectare hexagons and US counties across the US states of Utah, Nevada, and Washington, we found very strong agreement between terraPulse and FIA estimates. At the hexagon scale, we report R2 = 0.88, RMSE = 26.68 Mg/ha, and a correlation coefficient (r) of 0.94. At the county scale, agreement improves to R2 = 0.90, RMSE =32.62 Mg/ha, slope = 1.07, and r = 0.95. Spatial and statistical analyses indicated that terraPulse AGBD values tended to exceed FIA estimates in non-forest areas, likely due to FIA's limited sampling of non-forest vegetation. The terraPulse AGBD estimates also exhibited lower values in high-biomass forests, likely due to saturation effects in its optical remote-sensing covariates. This study advances operational carbon monitoring by delivering a scalable framework for comprehensive AGBD validation using independent FIA data, as well as a benchmark validation of a new commercial dataset for global biomass monitoring.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

A useful regional validation of terraPulse biomass against FIA, but non-forest sampling assumptions limit how strong the agreement claims can be.

read the letter

The main thing to know about this paper is that it validates the terraPulse AGBD product against FIA data in Utah, Nevada, and Washington. Aggregated to 64,000-hectare hexagons they report R2 of 0.88, RMSE 26.68 Mg/ha, and r of 0.94; at county scale the numbers are R2 0.90, RMSE 32.62 Mg/ha, slope 1.07, and r 0.95. The work applies a standard validation approach to a commercial GEDI-derived map and supplies the first published numbers for this specific product and region.

Referee Report

2 major / 1 minor

Summary. The manuscript presents an independent regional validation of the terraPulse AGBD product (derived from GEDI and optical covariates) against USFS FIA plot data. Aggregating both to 64,000-ha hexagons and to counties across Utah, Nevada, and Washington, the authors report strong agreement (hexagon: R²=0.88, RMSE=26.68 Mg/ha, r=0.94; county: R²=0.90, RMSE=32.62 Mg/ha, slope=1.07, r=0.95) and attribute residuals to FIA under-sampling of non-forest vegetation and to saturation in high-biomass forests. The work supplies a scalable FIA-based validation framework for commercial biomass products.

Significance. If the quantitative agreement holds after addressing reference bias and missing methodological details, the study supplies a practical benchmark for a GEDI-based commercial dataset and demonstrates a reproducible aggregation-based validation approach that can be extended to other regions and products. This directly supports operational carbon accounting needs.

major comments (2)

[Abstract and §3] Abstract and §3 (Results): the headline claim of R²=0.88 / r=0.94 at the hexagon scale treats the FIA-derived mean as an unbiased reference across forest and non-forest classes. The text itself states that terraPulse exceeds FIA in non-forest areas “likely due to FIA’s limited sampling of non-forest vegetation,” yet no stratification by forest-cover fraction within each hexagon or county is performed. Because 64,000-ha hexagons in Utah/Nevada commonly contain substantial non-forest area, the reported metrics may partly reflect this systematic under-sampling rather than true product accuracy; a land-cover-weighted or forest-only comparison is required to substantiate the strength of the validation.
[§2] §2 (Methods): the manuscript provides no information on the number of FIA plots per hexagon or county, the exact aggregation procedure (area-weighted mean, plot weighting, etc.), or how non-forest pixels were masked or included in the comparison. These details are load-bearing for interpreting RMSE and R² values and for assessing whether the validation is robust to sampling density.

minor comments (1)

[Figures] Figure 1 or 2: the spatial maps of residuals would benefit from an explicit legend showing the number of FIA plots contributing to each hexagon to allow readers to judge sampling reliability.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the detailed and constructive review. The comments on potential reference bias from non-forest sampling and the need for greater methodological detail are well taken. We have revised the manuscript to incorporate additional analyses and clarifications as detailed in the point-by-point responses below.

read point-by-point responses

Referee: [Abstract and §3] Abstract and §3 (Results): the headline claim of R²=0.88 / r=0.94 at the hexagon scale treats the FIA-derived mean as an unbiased reference across forest and non-forest classes. The text itself states that terraPulse exceeds FIA in non-forest areas “likely due to FIA’s limited sampling of non-forest vegetation,” yet no stratification by forest-cover fraction within each hexagon or county is performed. Because 64,000-ha hexagons in Utah/Nevada commonly contain substantial non-forest area, the reported metrics may partly reflect this systematic under-sampling rather than true product accuracy; a land-cover-weighted or forest-only comparison is required to substantiate the strength of the validation.

Authors: We agree that the inclusion of mixed forest/non-forest hexagons and counties means the reported metrics partly capture FIA sampling limitations in non-forest vegetation, as already noted in the manuscript text. To strengthen the validation, we have added a new stratified analysis in the revised Results section (§3) and supplementary materials. Using NLCD land-cover data, we stratify hexagons and counties by forest-cover fraction and provide a forest-only subset (masking non-forest pixels in both products). These results show that agreement remains strong (R² > 0.85) in high-forest-cover units, while confirming that residuals in low-forest units are driven by the FIA sampling design rather than terraPulse error. This directly addresses the concern without altering the headline landscape-scale metrics. revision: yes
Referee: [§2] §2 (Methods): the manuscript provides no information on the number of FIA plots per hexagon or county, the exact aggregation procedure (area-weighted mean, plot weighting, etc.), or how non-forest pixels were masked or included in the comparison. These details are load-bearing for interpreting RMSE and R² values and for assessing whether the validation is robust to sampling density.

Authors: We agree these details are necessary for reproducibility and interpretation. In the revised Methods section (§2), we now specify the distribution of FIA plots per hexagon and per county (including minimum, mean, and maximum values across the three states), describe the aggregation procedure as the area-weighted mean of plot-level AGBD values scaled to the spatial unit area, and clarify the handling of non-forest pixels: they were retained in the primary comparison to represent full landscape estimates consistent with the FIA sampling frame, while a supplementary forest-masked comparison is added to isolate forest-specific agreement. These additions make the validation framework fully transparent and allow readers to assess robustness to sampling density. revision: yes

Circularity Check

0 steps flagged

No circularity: direct empirical comparison to independent external reference

full rationale

The paper reports a straightforward validation exercise that aggregates terraPulse AGBD and FIA-derived AGBD to 64,000-ha hexagons and counties, then computes standard agreement statistics (R², RMSE, r, slope). No equations, model derivations, or predictions are present that reduce by construction to fitted parameters or self-citations. The central claims rest on the external FIA dataset as reference; any noted discrepancies (e.g., higher terraPulse values in non-forest areas) are observational findings rather than self-referential steps. This matches the default case of a self-contained empirical study against external benchmarks, warranting a score of 0.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

The central claim rests on the representativeness of FIA plots for the chosen aggregation units and on the assumption that terraPulse and FIA measure the same quantity (aboveground biomass density) despite different sampling designs.

axioms (1)

domain assumption FIA plots provide an unbiased sample of biomass across forest and non-forest classes at hexagon and county scales
Invoked when interpreting residuals in non-forest areas

pith-pipeline@v0.9.0 · 5854 in / 1241 out tokens · 64921 ms · 2026-05-19T10:41:54.844206+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

Aggregated to 64,000-hectare hexagons and US counties... R2 = 0.88, RMSE = 26.68 Mg/ha, r = 0.94

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.