Recognition: unknown
Mapping data sensitivities in global QCD analysis with linear response and influence functions
Pith reviewed 2026-05-07 07:05 UTC · model grok-4.3
The pith
Linear response and influence functions quantify how each data point shapes the central values, uncertainties, and correlations of QCD fits.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
Here we develop a framework based on linear response and influence functions, which are gradient-based sensitivity measures that directly quantify how experimental information propagates to fitted quantities and observables. These quantities cleanly expose how data locally determines the central values and uncertainties of quantum correlation functions, as well as the correlations between them, providing a transparent and general framework for diagnosing information flow in inverse problems in QCD.
What carries the argument
Linear response and influence functions, gradient-based sensitivity measures that quantify how experimental information propagates to fitted quantities and observables.
If this is right
- The method directly quantifies the contribution of each experiment to the central values of the fitted quantum correlation functions.
- It maps how data determines both the uncertainties and the mutual correlations among those functions.
- The framework supplies a general diagnostic for tracing information flow through any high-dimensional inverse problem in QCD.
- Local gradient calculations replace the need for repeated full re-optimizations when testing individual data influences.
Where Pith is reading between the lines
- The same gradient machinery could be used to rank the impact of proposed future experiments on particular observables before data are taken.
- The linear-response view might be combined with existing uncertainty quantification tools to produce more localized error bands.
- Similar sensitivity maps could be applied to other inverse problems in particle physics that rely on large global fits.
Load-bearing premise
Linear approximations around the best-fit point are assumed to accurately capture data sensitivities even in the high-dimensional and potentially nonlinear space of QCD fits.
What would settle it
A direct comparison of the linear predictions against the actual changes obtained by fully refitting the global analysis after removing or perturbing a single data set; large discrepancies in central values or uncertainties would show the approximation fails.
Figures
read the original abstract
Global QCD analyses provide the primary framework for extracting hadron structure from experimental data, yet the mechanisms by which data constrain non-perturbative functions remain difficult to interpret due to the high dimensionality and complexity of these fits. Here we develop a framework based on linear response and influence functions, which are gradient-based sensitivity measures that directly quantify how experimental information propagates to fitted quantities and observables. These quantities cleanly expose how data locally determines the central values and uncertainties of quantum correlation functions, as well as the correlations between them, providing a transparent and general framework for diagnosing information flow in inverse problems in QCD.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript develops a framework based on linear response theory and influence functions to quantify the propagation of experimental data information into the central values, uncertainties, and correlations of parton distribution functions (PDFs) and derived observables in global QCD analyses. The approach defines gradient-based sensitivity measures around the best-fit point to diagnose local data-to-PDF mappings without repeated refits.
Significance. If the linear approximations prove accurate for relevant data variations, the framework would offer a computationally efficient and interpretable tool for mapping information flow in high-dimensional QCD fits. This could aid in identifying which datasets constrain specific PDF features, improving uncertainty quantification, and guiding experimental design, constituting a useful methodological contribution to global analyses.
major comments (2)
- [§3 (framework derivation) and §5 (numerical results)] The central claim that linear response and influence functions 'cleanly expose' data sensitivities assumes the first-order Taylor expansion around the minimum remains accurate. However, global QCD fits involve nonlinear DGLAP evolution, convolution integrals, and flexible parameterizations (typically 20-50 parameters) where the Hessian can be ill-conditioned. No explicit validation—such as comparing influence-function predictions to actual refits after finite data removal or rescaling—is presented to confirm the approximation's validity for typical perturbations.
- [§4.1 (definition of influence functions)] The influence functions are defined via gradients of the existing fit procedure, but the manuscript does not address how regularization choices in the Hessian or tolerance criteria affect the resulting sensitivity measures, which could introduce systematic biases in the reported correlations.
minor comments (2)
- [§2 (background)] Notation for the linear response operator and the influence function could be clarified with an explicit equation relating them to the Hessian and gradient of the chi-squared function.
- [Figures 2-4] Figure captions should explicitly state the specific global fit (e.g., NNPDF or CT18) and data sets used in the demonstrations to allow reproducibility.
Simulated Author's Rebuttal
We thank the referee for their positive assessment of the significance of our work and for the constructive comments, which help strengthen the presentation of the linear response framework. We address each major comment below and indicate the revisions we will implement.
read point-by-point responses
-
Referee: [§3 (framework derivation) and §5 (numerical results)] The central claim that linear response and influence functions 'cleanly expose' data sensitivities assumes the first-order Taylor expansion around the minimum remains accurate. However, global QCD fits involve nonlinear DGLAP evolution, convolution integrals, and flexible parameterizations (typically 20-50 parameters) where the Hessian can be ill-conditioned. No explicit validation—such as comparing influence-function predictions to actual refits after finite data removal or rescaling—is presented to confirm the approximation's validity for typical perturbations.
Authors: We agree that explicit validation of the linear approximation is important for establishing the practical utility of the framework. The influence functions are formally exact to first order at the best-fit minimum, but we acknowledge that the original manuscript did not include direct numerical comparisons against refits. In the revised manuscript we will add a dedicated validation subsection in §5. This will include a limited set of explicit refits after small, controlled data perturbations (e.g., rescaling selected datasets by 5–10 % or removing a small number of points) and direct comparison of the resulting PDF shifts and uncertainty changes against the predictions obtained from the influence functions. We will also add a short discussion of the expected range of validity, noting that larger perturbations will eventually probe nonlinearities while the method remains intended for local sensitivity diagnostics around the minimum. This addition will directly address the concern about the ill-conditioned Hessian and nonlinear evolution. revision: yes
-
Referee: [§4.1 (definition of influence functions)] The influence functions are defined via gradients of the existing fit procedure, but the manuscript does not address how regularization choices in the Hessian or tolerance criteria affect the resulting sensitivity measures, which could introduce systematic biases in the reported correlations.
Authors: The influence functions are constructed from the gradient of the total χ² with respect to the data, evaluated using the inverse Hessian matrix obtained from the original global fit. Consequently, they inherit the same regularization scheme and tolerance criterion that were used to determine the best-fit point and its uncertainties. We will revise the text in §4.1 to make this dependence explicit, stating that the reported data-to-PDF sensitivities and correlations are those of the regularized fit. We will also add a brief paragraph discussing how changes in the tolerance criterion would rescale the overall sensitivity measures uniformly (while preserving relative rankings of datasets), and we will note that any systematic bias from regularization is already present in the baseline fit itself. If space allows, we can include a short numerical illustration of the effect of varying the tolerance parameter on a subset of the sensitivity maps. revision: partial
Circularity Check
No circularity in gradient-based sensitivity framework
full rationale
The paper defines influence functions and linear response measures directly from gradients of the existing global QCD fit procedure (chi^2 minimization with DGLAP evolution and PDF parameterizations). These quantities are constructed to compute data-to-observable sensitivities by design, without any reduction of a claimed prediction back to fitted inputs by construction, self-definition of central results, or load-bearing self-citations. The derivation chain is a standard application of first-order Taylor expansion and statistical influence functions to an inverse problem; it does not rename known results or smuggle ansatze via prior work. The linearity assumption is an explicit modeling choice whose validity is separate from circularity. No load-bearing step equates outputs to inputs.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption The objective function of the global QCD fit is differentiable with respect to both the fit parameters and the input data points.
Reference graph
Works this paper leans on
-
[1]
Collins,Foundations of Perturbative QCD, Vol
J. Collins,Foundations of Perturbative QCD, Vol. 32 (Cambridge University Press, 2011)
2011
- [2]
-
[3]
C. Cocuzza, T. J. Hague, W. Melnitchouk, N. Sato, and A. W. Thomas, arXiv:2602.16589 [hep-ph] (2026)
-
[4]
C. Cocuzza, N. T. Hunt-Smith, W. Melnitchouk, N. Sato, and A. W. Thomas (JAM Collaboration (Spin PDF Analysis Group)), Phys. Rev. D112, 114017 (2025), arXiv:2506.13616 [hep-ph]
-
[5]
T.-J. Houet al., Phys. Rev. D103, 014013 (2021), arXiv:1912.10053 [hep-ph]
- [6]
- [7]
- [8]
-
[9]
M. Cerutti, A. Accardi, I. P. Fernando, S. Li, J. F. Owens, and S. Park, Phys. Rev. D111, 094013 (2025), arXiv:2501.06849 [hep-ph]
-
[10]
T. Anderson, W. Melnitchouk, and N. Sato (JAM Col- laboration (PDF Analysis Group)), Phys. Rev. D112, 094011 (2025), arXiv:2501.00665 [hep-ph]
-
[11]
D. de Florian, R. Sassot, and M. Stratmann, Phys. Rev. D75, 114010 (2007), arXiv:hep-ph/0703242
- [12]
- [13]
-
[14]
A. Bacchetta, V. Bertone, C. Bissolotti, M. Cerutti, M. Radici, S. Rodini, and L. Rossi (MAP (Multi- dimensional Analyses of Partonic distributions)), Phys. Rev. Lett.135, 021904 (2025), arXiv:2502.04166 [hep- ph]
- [15]
- [16]
- [17]
-
[18]
Dotsonet al., arXiv:2504.13289 [hep-ph] (2025)
A. Dotsonet al., arXiv:2504.13289 [hep-ph] (2025)
-
[19]
Z. Panjsheeri, D. Q. Adams, A. Khawaja, S. Pandey, K. Tezgin, and S. Liuti, arXiv:2511.03065 [hep-ph] (2025)
- [20]
- [21]
-
[22]
R. D. Ballet al., Nucl. Phys. B867, 244 (2013), arXiv:1207.1303 [hep-ph]
work page Pith review arXiv 2013
-
[23]
U.-K. Yang and A. Bodek, Eur. Phys. J. C13, 241 (2000), arXiv:hep-ex/9908058
- [24]
- [25]
-
[26]
A. Accardi, L. T. Brady, W. Melnitchouk, J. F. Owens, and N. Sato, Phys. Rev. D93, 114017 (2016), arXiv:1602.03154 [hep-ph]
work page Pith review arXiv 2016
-
[27]
A. Accardi, M. E. Christy, C. E. Keppel, W. Mel- nitchouk, P. Monaghan, J. G. Morf´ ın, and J. F. Owens, Phys. Rev. D81, 034016 (2010), arXiv:0911.2254 [hep- ph]
-
[28]
A. Accardi, W. Melnitchouk, J. F. Owens, M. E. Christy, C. E. Keppel, L. Zhu, and J. G. Morfin, Phys. Rev. D 84, 014008 (2011), arXiv:1102.3686 [hep-ph]
- [29]
- [30]
-
[31]
J. Collins, T. C. Rogers, and N. Sato, Phys. Rev. D105, 076010 (2022), arXiv:2111.01170 [hep-ph]
- [32]
-
[33]
A. Candido, S. Forte, and F. Hekhorn, JHEP11, 129, arXiv:2006.07377 [hep-ph]
-
[34]
U. D’Alesio, C. Flore, and A. Prokudin, Phys. Lett. B 803, 135347 (2020), arXiv:2001.01573 [hep-ph]
-
[35]
L. Gamberg, M. Malda, J. A. Miller, D. Pitonyak, A. Prokudin, and N. Sato (Jefferson Lab Angular Mo- mentum (JAM), Jefferson Lab Angular Momentum), Phys. Rev. D106, 034014 (2022), arXiv:2205.00999 [hep- ph]
-
[36]
J. Pumplin, D. Stump, R. Brock, D. Casey, J. Huston, J. Kalk, H. L. Lai, and W. K. Tung, Phys. Rev. D65, 014013 (2001), arXiv:hep-ph/0101032
- [37]
- [38]
-
[39]
R. D. Ballet al.(NNPDF), JHEP04, 040, arXiv:1410.8849 [hep-ph]
-
[40]
R. D. Ballet al.(NNPDF), Eur. Phys. J. C77, 663 (2017), arXiv:1706.00428 [hep-ph]
work page internal anchor Pith review arXiv 2017
-
[41]
G. P. Salam and J. Rojo, Comput. Phys. Commun.180, 120 (2009), arXiv:0804.3755 [hep-ph]
work page Pith review arXiv 2009
- [42]
-
[43]
T. Carli, D. Clements, A. Cooper-Sarkar, C. Gwenlan, G. P. Salam, F. Siegert, P. Starovoitov, and M. Sutton, Eur. Phys. J. C66, 503 (2010), arXiv:0911.2985 [hep-ph]
-
[44]
M. Stratmann and W. Vogelsang, Phys. Rev. D64, 114007 (2001), arXiv:hep-ph/0107064
-
[45]
B.-T. Wang, T. J. Hobbs, S. Doyle, J. Gao, T.-J. Hou, P. M. Nadolsky, and F. I. Olness, Phys. Rev. D98, 094030 (2018), arXiv:1803.02777 [hep-ph]
work page Pith review arXiv 2018
- [46]
-
[47]
X. Jinget al., Phys. Rev. D108, 034029 (2023), arXiv:2306.03918 [hep-ph]
-
[48]
Deep Inside Convolutional Networks: Visualising Image Classification Models and Saliency Maps
K. Simonyan, A. Vedaldi, and A. Zisserman, arXiv:1312.6034 [cs.CV] (2014)
work page internal anchor Pith review arXiv 2014
-
[49]
R. Giordano, T. Broderick, and M. Jordan, arXiv:1506.04088 [stat.ML] (2015)
-
[50]
Ruelle, Nonlinearity22, 855–870 (2009)
D. Ruelle, Nonlinearity22, 855–870 (2009)
2009
-
[51]
D. Baehrens, T. Schroeter, S. Harmeling, M. Kawanabe, K. Hansen, and K.-R. Mueller, arXiv:0912.1128 [stat.ML] (2009)
- [52]
- [53]
-
[54]
R. Giordano, W. Stephenson, R. Liu, M. I. Jordan, and T. Broderick, A swiss army infinitesimal jackknife (2020), arXiv:1806.00550 [stat.ME]
- [55]
-
[56]
R. P. Feynman, Phys. Rev. Lett.23, 1415 (1969)
1969
-
[57]
J. D. Bjorken, Phys. Rev.179, 1547 (1969)
1969
-
[58]
J. D. Bjorken and E. A. Paschos, Phys. Rev.185, 1975 (1969)
1975
-
[59]
C. G. Callan and D. J. Gross, Phys. Rev. Lett.21, 311 (1968)
1968
-
[60]
The Theory of Deeply Inelastic Scattering
J. Blumlein, Prog. Part. Nucl. Phys.69, 28 (2013), arXiv:1208.6087 [hep-ph]
work page Pith review arXiv 2013
- [61]
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.