FRInGe: Distribution-Space Integrated Gradients with Fisher--Rao Geometry
Pith reviewed 2026-05-08 12:52 UTC · model grok-4.3
The pith
FRInGe defines Integrated Gradients paths in predictive distribution space using Fisher-Rao geodesics instead of straight lines in input space.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
FRInGe defines both the reference and the interpolation schedule in predictive distribution space, using a maximum-entropy predictive distribution as reference and the Fisher-Rao geodesic on the simplex as the path, then pulls this trajectory back to input space via the pullback Fisher metric to obtain attributions by integrating input gradients along the stabilized path.
What carries the argument
The Fisher-Rao geodesic on the probability simplex, pulled back to input space via the pullback Fisher metric to serve as the integration trajectory for gradients.
If this is right
- Attributions become less sensitive to the choice of input-space baseline.
- Calibration-sensitive attribution metrics improve across multiple ImageNet architectures.
- Performance stays competitive on perturbation-based and infidelity measures.
- The geometric construction supplies a principled, non-heuristic schedule for the integration path.
Where Pith is reading between the lines
- This construction could be applied to other gradient-based attribution methods that currently use straight-line paths.
- Models whose predictive distributions are already close to maximum entropy might exhibit smaller gains than poorly calibrated models.
- The pullback mechanism suggests a general route for importing information-geometric objects into input-space explanation techniques.
Load-bearing premise
That choosing the maximum-entropy predictive distribution as reference and interpolating along the Fisher-Rao geodesic in distribution space, then pulling the path back to input space, produces attributions that are meaningfully better and more faithful than those from standard Integrated Gradients.
What would settle it
If FRInGe shows no improvement or a clear drop in MAS scores relative to standard Integrated Gradients when tested on additional model families or datasets, while also failing to match on AUC or infidelity, the advantage of the distribution-space construction would be falsified.
Figures
read the original abstract
Gradient-based attribution methods are model-faithful and scalable, but Integrated Gradients (IG) can be brittle because explanations depend on heuristic baselines, straight-line paths, discretization, and saturation. We propose Fisher--Rao Integrated Gradients (FRInGe), which defines both the reference and interpolation schedule in predictive distribution space. FRInGe replaces input baselines with a maximum-entropy predictive reference and follows a Fisher-Rao geodesic on the probability simplex. The corresponding input-space trajectory is realized through the pullback Fisher metric and stabilized by KL and Euclidean trust regions; attributions are obtained by integrating input gradients along this trajectory. Across six ImageNet architectures, FRInGe most clearly improves calibration-oriented attribution metrics, especially MAS scores, while remaining competitive on perturbation AUC and infidelity.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript proposes Fisher-Rao Integrated Gradients (FRInGe), which relocates the baseline and interpolation path of Integrated Gradients from input space to the space of predictive distributions. The reference is the maximum-entropy distribution on the simplex, the path follows the Fisher-Rao geodesic, and the trajectory is realized in input space via the pullback Fisher metric regularized by KL and Euclidean trust regions. Attributions are obtained by integrating input gradients along this path. Experiments across six ImageNet architectures report the clearest gains on calibration-oriented metrics (especially MAS) while remaining competitive on perturbation AUC and infidelity.
Significance. If the reported improvements hold under fuller verification, the work supplies a geometrically principled alternative to heuristic baselines and straight-line paths in attribution methods. By grounding the construction in the Fisher-Rao metric and maximum-entropy reference, it offers a reproducible, parameter-light route to more stable explanations, particularly for calibration-sensitive downstream tasks. The multi-architecture empirical comparison provides a concrete benchmark that future distribution-space methods can be measured against.
minor comments (3)
- [Abstract] Abstract: the phrase 'stabilized by KL and Euclidean trust regions' is used without indicating the specific radius or weighting parameters; these should be stated explicitly or referenced to the method section so readers can reproduce the exact path.
- The manuscript should clarify whether the reported MAS improvements are accompanied by statistical significance tests across the six architectures or merely point estimates; this affects the strength of the 'most clearly improves' claim.
- Notation: ensure the pullback metric is denoted consistently (e.g., g_FR or similar) and that the distinction between the distribution-space geodesic and its input-space image is made explicit in every equation that uses the integrated gradient formula.
Simulated Author's Rebuttal
We thank the referee for their careful reading and positive assessment of our work on Fisher-Rao Integrated Gradients (FRInGe). The referee summary accurately captures the method's core contributions, including the use of the maximum-entropy reference, Fisher-Rao geodesics, and the pullback construction with trust-region stabilization. We are pleased that the empirical results on calibration metrics across six ImageNet models were viewed as a useful benchmark. No specific major comments were provided in the report.
Circularity Check
No significant circularity detected in derivation chain
full rationale
The paper defines FRInGe by replacing IG baselines with a maximum-entropy predictive distribution on the simplex, interpolating along the Fisher-Rao geodesic, realizing the input trajectory via pullback of the Fisher metric, and integrating gradients along that path (stabilized by KL/Euclidean trust regions). All components draw from established external concepts (Integrated Gradients, Fisher-Rao geometry, KL divergence) without any self-definitional reduction, fitted parameter renamed as prediction, or load-bearing self-citation chain. The central construction is independent of the reported empirical outcomes on MAS/AUC/infidelity, which are presented as observed results rather than deductive necessities. The derivation remains self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
Reference graph
Works this paper leans on
-
[1]
Constructing sensible baselines for integrated gradients.arXiv preprint arXiv:2412.13864, 2024
Jai Bardhan, Cyrin Neeraj, Mihir Rawat, and Subhadip Mitra. Constructing sensible baselines for integrated gradients.arXiv preprint arXiv:2412.13864, 2024
-
[2]
Explanations can be manipulated and geometry is to blame
Ann-Kathrin Dombrowski, Maximillian Alber, Christopher Anders, Marcel Ackermann, Klaus- Robert Müller, and Pan Kessel. Explanations can be manipulated and geometry is to blame. Advances in Neural Information Processing Systems (NeurIPS), 32, 2019
work page 2019
-
[3]
Gabriel Erion, Joseph D Janizek, Pascal Sturmfels, Scott M Lundberg, and Su-In Lee. Improving performance of deep learning models with axiomatic attribution priors and expected gradients. Nature Machine Intelligence, 3(7):620–631, 2021
work page 2021
-
[4]
Guided integrated gradients: An adaptive path method for removing noise
Andrei Kapishnikov, Subhashini Venugopalan, Besim Avci, Ben Wedin, Michael Terry, and Tolga Bolukbasi. Guided integrated gradients: An adaptive path method for removing noise. InConference on Computer Vision and Pattern Recognition (CVPR), pages 5050–5058. IEEE, 2021
work page 2021
-
[5]
A new baseline assumption of integated gradients based on shaply value
Shuyang Liu, Zixuan Chen, Ge Shi, Ji Wang, Changjie Fan, Yu Xiong, Runze Wu Yujing Hu, Ze Ji, and Yang Gao. A new baseline assumption of integated gradients based on shaply value. arXiv preprint arXiv:2310.04821, 2023
-
[6]
Explaining deep neural network models with adversarial gradient integration
Deng Pan, Xin Li, and Dongxiao Zhu. Explaining deep neural network models with adversarial gradient integration. InInternational Joint Conference on Artificial Intelligence (IJCAI), 2021
work page 2021
-
[7]
Geometrically guided saliency maps
Md Mahfuzur Rahman, Noah Lewis, and Sergey Plis. Geometrically guided saliency maps. InICLR 2022 Workshop on PAIR2Struct: Privacy, Accountability, Interpretability, Robustness, Reasoning on Structured Data, 2022
work page 2022
-
[8]
Yuchen Ren, Zhengyu Zhao, Chenhao Lin, Bo Yang, Lu Zhou, Zhe Liu, and Chao Shen. Improving integrated gradient-based transferable adversarial examples by refining the integration path. InConference on Artificial Intelligence (AAAI), volume 39, pages 6731–6739, 2025
work page 2025
-
[9]
Using the path of least resistance to explain deep networks
Sina Salek and Joseph Enguehard. Using the path of least resistance to explain deep networks. arXiv preprint arXiv:2502.12108, 2025
-
[10]
Visualizing the impact of feature attribution baselines.Distill, 2020
Pascal Sturmfels, Scott Lundberg, and Su-In Lee. Visualizing the impact of feature attribution baselines.Distill, 2020. https://distill.pub/2020/attribution-baselines
work page 2020
-
[11]
The many shapley values for model explanation
Mukund Sundararajan and Amir Najmi. The many shapley values for model explanation. In International Conference on Machine Learning (ICML), pages 9269–9278, 2020. 10
work page 2020
-
[12]
Axiomatic attribution for deep networks
Mukund Sundararajan, Ankur Taly, and Qiqi Yan. Axiomatic attribution for deep networks. In International Conference on Machine Learning (ICML), pages 3319–3328, 2017
work page 2017
-
[13]
Maximum entropy baseline for integrated gradients
Hanxiao Tan. Maximum entropy baseline for integrated gradients. In2023 International Joint Conference on Neural Networks (IJCNN), pages 1–8. IEEE, 2023
work page 2023
-
[14]
Weighted integrated gradients for feature attribution.arXiv preprint arXiv:2505.03201, 2025
Kien Tran Duc Tuan, Tam Nguyen Trong, Son Nguyen Hoang, Khoat Than, and Anh Nguyen Duc. Weighted integrated gradients for feature attribution.arXiv preprint arXiv:2505.03201, 2025
-
[15]
Integrated decision gradients: Compute your attributions where the model makes its decision
Chase Walker, Sumit Jha, Kenny Chen, and Rickard Ewetz. Integrated decision gradients: Compute your attributions where the model makes its decision. InConference on Artificial Intelligence (AAAI), volume 38, pages 5289–5297, 2024
work page 2024
-
[16]
Attribution in scale and space
Shawn Xu, Subhashini Venugopalan, and Mukund Sundararajan. Attribution in scale and space. InConference on Computer Vision and Pattern Recognition (CVPR), pages 9680–9689. IEEE, 2020
work page 2020
-
[17]
Manifold integrated gradients: Riemannian geometry for feature attribution
Eslam Zaher, Maciej Trzaskowski, Quan Nguyen, and Fred Roosta. Manifold integrated gradients: Riemannian geometry for feature attribution. InInternational Conference on Machine Learning (ICML), pages 58090–58104, 2024
work page 2024
-
[18]
Yue Zhuo and Zhiqiang Ge. Ig 2: Integrated gradient on iterative gradient path for feature attribution.Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 46(11):7173– 7190, 2024. 11 A FRInGe Implementation Details A.1 FRInGe Pseudocode Algorithm 1FRInGe Require: Input x, target t, logits F ; damping λ, KL budget τ, step cap ηmax, Euclidean...
work page 2024
-
[19]
a Jacobian-vector product (JVP) to computeJ F (x)v,
-
[20]
multiplication by the analytic softmax Fisher matrixS(p(x)),
-
[21]
a vector-Jacobian product (VJP) to map the result back to input space. This matrix-free implementation is the key step that makes the method computationally feasible for image-sized inputs. A.4 Regularized Linear System, Decoupled Smoothing, and Sobolev Preconditioning At each waypoint k, FRInGe computes an update direction vk by solving a regularized lin...
-
[22]
one forward pass to evaluatep(x k),
-
[23]
one backward pass to compute the waypoint-tracking gradientg k =∇ xLk(xk),
-
[24]
Thus, an approximate total cost for FRInGe is CFRInGe ≈T[(1 +K)C fwd + (2 +K)C bwd]
one backward pass for attribution accumulation along the path, 4.K matrix-vector products with the pullback Fisher operator, where each product requires one JVP and one VJP. Thus, an approximate total cost for FRInGe is CFRInGe ≈T[(1 +K)C fwd + (2 +K)C bwd]. AssumingC fwd ≈C bwd, the relative overhead scales as CFRInGe CIG ≈ T N (K+ 1.5). Worst-Case Estim...
-
[25]
Although the blur-change maps already focus on informative image regions, the final attribution map remains largely noisy. This control experiment shows that the benefits of FRInGe do not arise from γprior alone, but from its interaction with the Fisher-aware geometry and the regularized solve. C Experimental Protocol and Reproducibility C.1 Dataset, Mode...
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.