Interpretability of Graph Neural Networks to Assess Effects of Global Change Drivers on Ecological Networks

Colin Fontaine; Elisa Thebault; Emre Anakok; Pierre Barbillon

arxiv: 2503.15107 · v3 · pith:WYYLFCTRnew · submitted 2025-03-19 · 📊 stat.ML · cs.LG

Interpretability of Graph Neural Networks to Assess Effects of Global Change Drivers on Ecological Networks

Emre Anakok , Pierre Barbillon , Colin Fontaine , Elisa Thebault This is my paper

Pith reviewed 2026-05-22 23:44 UTC · model grok-4.3

classification 📊 stat.ML cs.LG

keywords graph neural networksinterpretabilityecological networkspollinationland usesampling effectsSpipoll datasetglobal change

0 comments

The pith

Graph neural network interpretability methods detect interactive effects of land use on plant-pollinator network connectivity.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper tests whether existing interpretability techniques for graph neural networks can identify how environmental covariates interact with specific plant genera to shape pollination network connectivity. A simulation study checks the methods' ability to recover these effects and the influence of debiasing for sampling. The same pipeline is then applied to the Spipoll dataset, showing that land use influences connectivity while sampling corrections modify the estimated effects. A reader would care because large-scale interaction data are now available and machine-learning tools may offer a practical way to link global change drivers to ecological outcomes.

Core claim

An extensive simulation study confirms that GNN interpretability methods can detect the interactive effect between a covariate and a genus of plant on connectivity, and that debiasing techniques influence the estimation of these effects. An application on the Spipoll dataset, with and without accounting for sampling effects, highlights the potential impact of land use on network connectivity and shows that accounting for sampling effects partially alters the estimation of these effects.

What carries the argument

GNN interpretability methods applied to pollination networks to extract effects of environmental covariates on connectivity.

If this is right

Land use modifications can alter plant-pollinator network connectivity as measured by the GNN model.
Accounting for sampling effects partially changes the estimated strength of those land-use effects.
GNNs make it feasible to analyze combined large-scale interaction, climate, and land-use datasets for pollination questions.
Interpretability outputs can isolate genus-specific responses to the same environmental driver.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same pipeline could be tested on networks involving other interaction types such as seed dispersal or herbivory.
Observational ecological datasets collected with uneven sampling effort may require explicit debiasing whenever machine-learning models are used.
If land-use signals remain after sampling correction, targeted field experiments could test whether changing land cover directly alters realized interactions.

Load-bearing premise

Existing GNN interpretability methods can reliably detect and quantify interactive effects between environmental covariates and plant genera on network connectivity.

What would settle it

A controlled simulation in which known interactive effects between a covariate and plant genus are systematically missed or misquantified by the interpretability methods.

read the original abstract

Pollinators play a crucial role for plant reproduction, either in natural ecosystem or in human-modified landscape. Global change drivers,including climate change or land use modifications, can alter the plant-pollinator interactions. To assess the potential influence of global change drivers on pollination, large-scale interactions, climate and land use data are required. While recent machine learning methods, such as graph neural networks (GNNs), allow the analysis of such datasets, interpreting their results can be challenging. We explore existing methods for interpreting GNNs in order to highlight the effects of various environmental covariates on pollination network connectivity. An extensive simulation study is performed to confirm whether these methods can detect the interactive effect between a covariate and a genus of plant on connectivity, and whether the application of debiasing techniques influences the estimation of these effects. An application on the Spipoll dataset, with and without accounting for sampling effects, highlights the potential impact of land use on network connectivity and shows that accounting for sampling effects partially alters the estimation of these effects.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

Applies existing GNN interpretability tools to pollination networks with a simulation check and Spipoll application, but the simulation's ability to validate real-data performance looks limited.

read the letter

The paper uses standard GNN interpretability methods on ecological interaction networks to examine how land use and other drivers affect connectivity. The simulation tests whether these methods recover known interactive effects between covariates and plant genera, and the Spipoll analysis compares results with and without sampling corrections, finding that land use effects on connectivity shift somewhat after correction.

Referee Report

3 major / 2 minor

Summary. The manuscript explores the application of graph neural networks (GNNs) to model large-scale plant-pollinator interaction data, using existing GNN interpretability methods to quantify effects of environmental covariates (e.g., land use) on network connectivity. It reports an extensive simulation study intended to confirm that these methods can recover interactive effects between a covariate and plant genus, examines the influence of debiasing techniques for sampling effects, and presents results from the Spipoll dataset showing land-use impacts on connectivity that are partially altered when sampling effects are accounted for.

Significance. If the simulation study provides rigorous validation that the chosen interpretability methods reliably recover covariate-by-genus interactions under network conditions representative of real ecological data, and if the Spipoll application is supported by appropriate controls, the work could offer a practical demonstration of GNN-based analysis for global-change questions in ecology while highlighting the value of sampling corrections. The explicit comparison of results with and without debiasing is a constructive element.

major comments (3)

[Simulation study] The simulation study (described in the abstract and presumably detailed in the methods/results sections) is presented as the primary evidence that existing GNN interpretability methods can detect and quantify interactive effects. However, no information is supplied on whether the simulated graphs reproduce the extreme sparsity, heavy-tailed degree distributions, genus-specific baseline rates, or land-use-induced covariate correlations observed in the Spipoll data; without such matching, recovery in simulation does not establish reliable performance on the real dataset that underpins the application claims.
[Abstract and methods] The abstract and overall presentation supply no equations, model specifications (e.g., GNN architecture, loss function, or interpretability technique such as GNNExplainer or gradient-based attribution), performance metrics, or validation details. This absence prevents evaluation of whether the stated claims about interactive-effect detection and the alteration of land-use estimates by sampling correction are actually supported by the results.
[Application on Spipoll dataset] The application result that 'accounting for sampling effects partially alters the estimation' of land-use effects on connectivity rests on the unverified assumption that the interpretability pipeline extracts genuine covariate-genus interactions rather than artifacts. No independent check (e.g., comparison to GLM interaction tests or held-out predictive validation of the extracted effects) is described that could falsify the pipeline.

minor comments (2)

[Abstract] The abstract is overly high-level and contains no quantitative results, making it impossible for readers to gauge the magnitude or statistical significance of the reported effects.
[Introduction and methods] Notation for network connectivity, the precise definition of 'interactive effect,' and the debiasing procedure should be introduced with explicit mathematical definitions early in the text.

Simulated Author's Rebuttal

3 responses · 0 unresolved

We thank the referee for their constructive comments, which help clarify how to strengthen the presentation and validation of our work. We respond to each major comment below and commit to revisions that address the concerns raised.

read point-by-point responses

Referee: [Simulation study] The simulation study (described in the abstract and presumably detailed in the methods/results sections) is presented as the primary evidence that existing GNN interpretability methods can detect and quantify interactive effects. However, no information is supplied on whether the simulated graphs reproduce the extreme sparsity, heavy-tailed degree distributions, genus-specific baseline rates, or land-use-induced covariate correlations observed in the Spipoll data; without such matching, recovery in simulation does not establish reliable performance on the real dataset that underpins the application claims.

Authors: We agree that explicit matching between simulated and real network properties is essential to support the transferability of the simulation results. The simulation parameters were selected to approximate key empirical characteristics of the Spipoll data (sparsity, degree distributions, genus-specific connectivities, and covariate structure), but a direct quantitative comparison was not included. We will revise the methods section to add this comparison, including summary statistics and visualizations of simulated versus observed network features, along with justification for any remaining discrepancies. revision: yes
Referee: [Abstract and methods] The abstract and overall presentation supply no equations, model specifications (e.g., GNN architecture, loss function, or interpretability technique such as GNNExplainer or gradient-based attribution), performance metrics, or validation details. This absence prevents evaluation of whether the stated claims about interactive-effect detection and the alteration of land-use estimates by sampling correction are actually supported by the results.

Authors: The full methods section provides the GNN architecture details, training loss, and interpretability procedures (GNNExplainer together with gradient-based attribution), as well as performance metrics from the simulation. The abstract is intentionally concise. To improve accessibility, we will expand the methods with explicit equations for the model and attribution scores and add a brief summary of these specifications to the abstract or a new methods overview paragraph. revision: yes
Referee: [Application on Spipoll dataset] The application result that 'accounting for sampling effects partially alters the estimation' of land-use effects on connectivity rests on the unverified assumption that the interpretability pipeline extracts genuine covariate-genus interactions rather than artifacts. No independent check (e.g., comparison to GLM interaction tests or held-out predictive validation of the extracted effects) is described that could falsify the pipeline.

Authors: The simulation study is intended as the primary validation that the pipeline recovers known interactions. The with/without debiasing comparison on Spipoll data provides an internal consistency check. We acknowledge the value of external corroboration and will add, where feasible, a side-by-side comparison of GNN-derived genus-by-covariate effects against interaction terms from a GLM fitted to the same interaction data. Any limitations arising from the network structure will be discussed explicitly. revision: yes

Circularity Check

0 steps flagged

No circularity: application paper with no derivations or self-referential predictions

full rationale

The manuscript applies pre-existing GNN interpretability techniques to the Spipoll dataset and reports a simulation study whose purpose is validation of those techniques. No equations, fitted parameters renamed as predictions, self-citation load-bearing uniqueness claims, or ansatzes smuggled via prior work appear in the abstract or described structure. The simulation is presented as an external check rather than a quantity forced by the model definition itself. Consequently the derivation chain contains no self-referential reductions.

Axiom & Free-Parameter Ledger

0 free parameters · 0 axioms · 0 invented entities

Only the abstract is available; no free parameters, axioms, or invented entities can be identified from the provided text.

pith-pipeline@v0.9.0 · 5717 in / 997 out tokens · 31234 ms · 2026-05-22T23:44:47.168148+00:00 · methodology

Interpretability of Graph Neural Networks to Assess Effects of Global Change Drivers on Ecological Networks

Core claim

What carries the argument

If this is right

Where Pith is reading between the lines

Load-bearing premise

What would settle it

discussion (0)