Effect Sizes in Marketing Research: Why Cohen's Local f² Belongs in the Toolkit
Pith reviewed 2026-05-10 03:42 UTC · model grok-4.3
The pith
Local Cohen's f-squared measures the unique contribution of individual predictors or blocks in marketing regression models and belongs in the effect-size toolkit.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper claims that the omission of the local f(B)^2 from the editorial's framework is significant because marketing research frequently uses regression models to test whether a focal construct contributes explanatory power beyond competing predictors and controls. It positions the R-squared foundation of this local effect size as particularly advantageous in large-sample settings common to marketing. Furthermore, it proposes extending f-squared-type measures to multilevel models and, more tentatively, to neural networks and other machine-learning approaches.
What carries the argument
Cohen's local f(B)^2, the proportional increase in R-squared obtained by adding a focal predictor block B to a multivariable model that already contains other variables.
If this is right
- Researchers gain a direct way to report the added explanatory power of a focal construct after controls and rivals are accounted for.
- Statistical reporting in regression-based marketing papers can better match the theoretical emphasis on incremental validity rather than global fit alone.
- The same local approach can be applied to multilevel models that handle nested marketing data structures.
- Tentative extensions open the possibility of using analogous incremental measures inside neural networks and other machine-learning models.
Where Pith is reading between the lines
- Widespread use could shift emphasis from overall model fit toward systematic comparison of what each added construct contributes.
- It might help standardize how papers demonstrate that a new variable improves explanation beyond what is already known.
- Testing the measure on both small and very large marketing data sets would clarify when its large-sample advantage appears most clearly.
Load-bearing premise
The local f-squared supplies information on substantive significance that is not already covered by other effect-size measures and that its R-squared foundation is especially useful for large marketing samples.
What would settle it
A direct comparison in published marketing regression studies showing whether local f-squared values produce different conclusions about which predictors matter substantively than alternatives such as standardized coefficients or partial R-squared.
read the original abstract
In an editorial in the Journal of Marketing, Steenkamp et al. (2026) make a valuable and timely intervention by urging marketing scholars to move beyond dichotomous significance testing and to report effect sizes that speak to substantive significance. Their editorial is especially strong in its insistence on exact p-values, richer statistical reporting, and closer alignment between rigor and relevance. Yet, their framework omits the local form of Cohen's f^2, that is f(B)^2 as an effect-size measure for the contribution of an individual predictor or predictor block B within a multivariable model. That omission matters because much of marketing research relies on regression-type models in which the central theoretical question is not merely whether a model fits globally, but whether a focal construct adds meaningful explanatory power beyond competing predictors and controls. This commentary argues that the R-squared foundation of local Cohen's f(B)^2 is a strength, especially in large-sample settings. Moreover, f-squared-type local effect sizes can be extended beyond ordinary least squares to multilevel models and, more tentatively, to neural networks and other machine-learning models.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The manuscript is a commentary on the Steenkamp et al. (2026) editorial in the Journal of Marketing. It argues that the editorial omits the local form of Cohen's f² (denoted f(B)²) as an effect-size measure for the incremental contribution of an individual predictor or block B in multivariable regression models. The paper claims this omission is consequential for marketing research because such models typically ask whether a focal construct adds explanatory power beyond controls, asserts that the R² foundation of local f² is advantageous especially in large-sample settings, and suggests that f²-type measures can be extended to multilevel models and (tentatively) to neural networks and other ML models.
Significance. If the argument holds and is substantiated with evidence, the commentary would usefully supplement the editorial by reminding readers of a standard, R²-based local effect size already available in regression software. It correctly identifies that marketing regressions often focus on incremental explanatory power rather than global fit alone. The paper draws on well-established properties of R² and Cohen's f² without introducing new derivations or data, so its primary value is advocacy and potential extension rather than methodological innovation.
major comments (2)
- [Abstract] Abstract and the paragraph asserting the R² foundation as a strength: the claim that local f² is advantageous in large-sample marketing data because it speaks to substantive significance is not supported by any side-by-side comparison, simulation, or empirical example showing that f(B)² produces materially different predictor rankings, substantive conclusions, or alignment with theory than the incremental R² (or partial R²) that standard software already reports. Since local f² is defined exactly as (R²_full − R²_reduced) / (1 − R²_full), the manuscript must demonstrate an informational gap rather than assert a preference for the rescaled form.
- [Discussion of extensions] Section discussing extensions beyond OLS: the statement that f-squared-type local effect sizes 'can be extended' to neural networks and other machine-learning models is presented without any concrete formula, computational procedure, or reference for how a local incremental contribution would be defined or estimated in non-linear, non-parametric models; the extension therefore remains speculative and does not yet meet the load-bearing standard for the paper's broader recommendation.
minor comments (2)
- [Throughout] Clarify the notation f(B)² versus the global f² throughout to prevent reader confusion, especially when the manuscript contrasts local and global measures.
- [Introduction] Verify the citation details and year for Steenkamp et al. (2026); if the editorial is forthcoming, note its status explicitly.
Simulated Author's Rebuttal
We thank the referee for the constructive and detailed comments on our commentary. We address each major comment point by point below, indicating where revisions have been made to the manuscript.
read point-by-point responses
-
Referee: [Abstract] Abstract and the paragraph asserting the R² foundation as a strength: the claim that local f² is advantageous in large-sample marketing data because it speaks to substantive significance is not supported by any side-by-side comparison, simulation, or empirical example showing that f(B)² produces materially different predictor rankings, substantive conclusions, or alignment with theory than the incremental R² (or partial R²) that standard software already reports. Since local f² is defined exactly as (R²_full − R²_reduced) / (1 − R²_full), the manuscript must demonstrate an informational gap rather than assert a preference for the rescaled form.
Authors: We acknowledge that the manuscript does not contain new simulations or empirical examples comparing local f² directly to incremental or partial R². As a commentary focused on advocacy rather than methodological innovation, the argument centers on the established role of local f² within Cohen's framework as a standardized, bounded measure of incremental contribution. To respond to the concern, we have revised the abstract and the relevant discussion paragraph to emphasize the interpretive standardization (normalization by residual variance) without asserting unshown empirical superiority, and we have added a concise numerical illustration using typical regression output to clarify how the rescaled form aids substantive interpretation in practice. revision: partial
-
Referee: [Discussion of extensions] Section discussing extensions beyond OLS: the statement that f-squared-type local effect sizes 'can be extended' to neural networks and other machine-learning models is presented without any concrete formula, computational procedure, or reference for how a local incremental contribution would be defined or estimated in non-linear, non-parametric models; the extension therefore remains speculative and does not yet meet the load-bearing standard for the paper's broader recommendation.
Authors: We agree that the original phrasing regarding extensions to neural networks and other machine-learning models was tentative and lacked concrete details, rendering it speculative. In the revised manuscript we have removed the specific reference to neural networks and ML models. The discussion of extensions is now confined to multilevel models, for which established pseudo-R² and local effect-size approaches are available in the literature, and we note only in passing that further adaptation to complex ML settings remains an open methodological question. revision: yes
Circularity Check
No circularity: commentary advocates established measure without derivations or self-referential reductions
full rationale
The paper is a commentary arguing that an editorial omits local Cohen's f² as an effect-size measure for individual predictor contributions in regression models. It contains no derivation chain, equations, fitted parameters, or predictions that reduce to the paper's own inputs by construction. References to R² foundations and extensions to multilevel or ML models cite established statistical concepts without self-citation loops or ansatzes smuggled from prior author work. The claim that the omission matters rests on substantive discussion of marketing research needs rather than any mathematical reduction or renaming of known results. This is a self-contained opinion piece on reporting practices with no load-bearing circular steps.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Local Cohen's f^2 is a valid and useful effect size based on changes in R-squared for assessing predictor importance.
Reference graph
Works this paper leans on
-
[1]
Bakeman, R., & McArthur, D. (1999). Determining the power of multiple regression analyses both with and without repeated measures. Behavior Research Methods, Instruments, and Computers, 31(1), 150 –154. https://doi.org/10.3758/BF03207705
-
[2]
Cohen, J. (1988). Statistical power analysis for the behavioral science (2nd ed.). Lawrence Erlbaum
work page 1988
-
[3]
Cohen, J., & Cohen, P. (1983). Applied multiple regression/correlation analysis for the behavioral sciences (2nd ed.). Lawrence Erlbaum. De La Rosa, W., Silverman, J., Sussman, A. B., Rino, G., Dorie, V .,
work page 1983
-
[4]
Hell, M., Giannella, E., & Dillman, L. (2025). Using Expenditure Reframes to Increase Interest in Claiming Government Benefits. Journal of Marketing , 1 –22. https://doi.org/10.1177/00222429251356992
-
[5]
Dedecker, J., Guedj, O., & Taupin, M. L. (2025). Asymptotic confidence interval for R2 in multiple linear regression. Statistics, 59(1), 1 –36. https://doi.org/10.1080/02331888.2024.2428978
-
[6]
Muluk, H. (2018). Hypocrisy and culture: Failing to practice what you preach receives harsher interpersonal reactions in independent (vs. interdependent) cultures. Journal of Experimental Social Psychology , 76, 371 –384. https://doi.org/10.1016/j.jesp.2017.12.009
-
[7]
Fisher, R. A. (1928). The general sampling distribution of the multiple correlation coefficient. Proceedings of the Royal Society of London. Series A , 121(788), 654 –673. https://doi.org/10.1098/rspa.1928.0224
-
[8]
Kelley, K. (2008). Sample size planning for the squared multiple correlation coefficient: Accuracy in parameter estimation via narrow confidence intervals. Multivariate Behavioral Research, 43(4), 524 –555. https://doi.org/10.1080/00273170802490632
-
[9]
Khalilzadeh, J., & Tasci, A. D. A. (2017). Large sample size, significance level, and the effect size: Solutions to perils of using big data for academic research. Tourism Management, 62, 89–96. https://doi.org/10.1016/j.tourman.2017.03.026
-
[10]
Lee, Y.-S. (1971). Some results on the sampling distribution of the multiple correlation coefficient. Journal of the Royal Statistical Society: Series B , 33(1), 117 –130. https://doi.org/10.1111/j.2517-6161.1971.tb00863.x
-
[11]
Lorah, J. (2018). Effect size measures for multilevel models: definition, interpretation, and TIMSS example. Large-Scale Assessments in Education , 6(1), 1 –11. https://doi.org/10.1186/s40536-018-0061-2
-
[12]
Messner, W. (2023). From black box to clear box: A hypothesis testing framework for scalar regression problems using deep artificial neural networks. Applied Soft Computing, 146, 1–14. https://doi.org/10.1016/j.asoc.2023.110729
-
[13]
Messner, W. (2024). Exploring multilevel data with deep learning and XAI: The effect of personal -care advertising spending on subjective happiness. International Business Review, 33(1), 1–22. https://doi.org/10.1016/j.ibusrev.2023.102203
-
[14]
Messner, W., Greene, T., & Matalone, J. (2025). From bytes to biases: Investigating the cultural self -perception of large language models. Journal of Public Policy & Marketing, 44(3), 370–391. https://doi.org/10.1177/07439156251319788
-
[15]
Ogasawara, H. (2006). Asymptotic expansion and conditional robustness for the sample multiple correlation coefficient under nonnormality. Communications in Statistics - Simulation and Computation , 35(1), 177 –199. https://doi.org/10.1080/03610910500416207
-
[16]
Olkin, I., & Pratt, J. W. (1958). Unbiased estimation of certain correlation coefficients. The Annals of Mathematical Statistics, 29(1), 201–211
work page 1958
-
[17]
Mermelstein, R. J. (2012). A practical guide to calculating Cohen's f2, a measure of local effect size, from PROC MIXED. Frontiers in Psychology , 3, 1 –6. https://doi.org/10.3389/fpsyg.2012.00111
-
[18]
Soper, H. E. (1929). The general sampling distribution of the multiple correlation coefficient. Journal of the Royal Statistical Society, 92(3), 445–447. https://doi.org/10.2307/2342239
-
[19]
Slotegraaf, R. J. (2026). Cementing JM’s impact on the marketing ecosystem: Empirical Execution. Journal of Marketing, 90(3), 1 –12. https://doi.org/10.1177/00222429261426047
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.