A tool to determine the degrees of freedom in tree-structured varying coefficient models
Pith reviewed 2026-05-20 00:27 UTC · model grok-4.3
The pith
A formula approximates the degrees of freedom in tree-structured varying coefficient models by accounting for recursive partitioning.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper develops an easy-to-apply formula to approximate the degrees of freedom of a TSVC model. This formula is employed for model selection based on the Bayesian information criterion and compared to the naive solution, setting the DoF to the number of free model parameters, in a simulation study. Results indicated that calculation of the DoF using the proposed formula resulted in more accurate selection results with improved predictive ability.
What carries the argument
The easy-to-apply formula that approximates the effective degrees of freedom introduced by recursive partitioning when building a tree-structured varying coefficient model.
Load-bearing premise
The proposed easy-to-apply formula accurately approximates the effective degrees of freedom introduced by recursive partitioning in TSVC models.
What would settle it
A simulation study that compares held-out predictive performance of TSVC models chosen by BIC using the proposed DoF formula against models chosen by BIC with naive parameter count, or that checks whether the formula's values match cross-validated estimates of effective degrees of freedom.
Figures
read the original abstract
The tree-structured varying coefficient (TSVC) model is a flexible approach for generalized regression, where the linear effects of the covariates are allowed to vary with the values of effect modifiers. Relevant effect modifiers and interactions are identified using recursive partitioning. In TSVC models, analogously to other semi- and nonparametric regression approaches, one needs to account for the cost of data-driven model building when deriving the model degrees of freedom (DoF). To address this issue, we develop an easy-to-apply formula to approximate the DoF of a TSVC model. This formula is employed for model selection based on the Bayesian information criterion (BIC) and compared to the naive solution, setting the DoF to the number of free model parameters, in a simulation study. To illustrate the proposed DoF method, TSVC models using BIC-based selection were fitted to data from the Survey of Health, Ageing, and Retirement in Europe. Results indicated that calculation of the DoF using the proposed formula resulted in more accurate selection results with improved predictive ability.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper develops an easy-to-apply approximation formula for the effective degrees of freedom (DoF) in tree-structured varying coefficient (TSVC) models to account for the complexity introduced by recursive partitioning when performing BIC-based model selection. The formula is compared to the naive DoF (equal to the number of free parameters) in a simulation study, and TSVC models selected via the proposed DoF are fitted to data from the Survey of Health, Ageing, and Retirement in Europe (SHARE), with results indicating improved selection accuracy and predictive performance.
Significance. If the DoF approximation holds, the method would offer a practical correction for model complexity in semi-parametric regressions that incorporate tree-based effect modification, improving BIC-based selection and out-of-sample prediction in applications such as health and social science data analysis. The simulation results and real-data illustration provide concrete evidence of gains over the naive approach.
major comments (2)
- [§3.2, Eq. (8)] §3.2, Eq. (8): the DoF formula conditions on the realized tree structure after partitioning; it is unclear whether (or how) the expression adjusts for the data-driven selection of splits when the number of candidate effect modifiers is moderate to large, which is the regime where the skeptic's concern about understated DoF would be most relevant.
- [Table 3] Table 3, high-dimensional modifier rows: the reported gains in selection accuracy and predictive ability are shown only for the simulation settings described; without additional runs that increase tree depth or the size of the modifier candidate set, it is difficult to confirm that the approximation remains accurate outside the low-dimensional regime where the derivation is most likely to hold.
minor comments (2)
- [Abstract] The abstract states that the proposed DoF yields 'more accurate selection results' but does not define the accuracy metric (e.g., true-positive rate for relevant modifiers or out-of-sample MSE).
- [§2] Notation for the varying-coefficient functions and the partitioning operator could be introduced with a small numerical example in §2 to improve readability for readers unfamiliar with TSVC models.
Simulated Author's Rebuttal
We thank the referee for the constructive and insightful comments on our manuscript. We address each major comment below in a point-by-point manner and indicate the changes we will make to strengthen the paper.
read point-by-point responses
-
Referee: [§3.2, Eq. (8)] §3.2, Eq. (8): the DoF formula conditions on the realized tree structure after partitioning; it is unclear whether (or how) the expression adjusts for the data-driven selection of splits when the number of candidate effect modifiers is moderate to large, which is the regime where the skeptic's concern about understated DoF would be most relevant.
Authors: We agree that Equation (8) provides an approximation conditional on the realized tree structure obtained after recursive partitioning. The formula incorporates the complexity of the fitted tree by adjusting for the number and nature of the selected splits and nodes, rather than relying solely on the count of free parameters. This conditional approach is intentional, as it reflects the effective degrees of freedom of the model once the partitioning has been performed. However, we acknowledge that when the pool of candidate effect modifiers is moderate to large, the preceding data-driven search over possible splits may introduce additional selection effects not fully captured by conditioning on the final tree alone. In the revised manuscript, we will clarify this distinction in Section 3.2, add a discussion of the approximation's scope, and note that the BIC penalty using this DoF still yields improved selection performance in the regimes we examined. revision: partial
-
Referee: [Table 3] Table 3, high-dimensional modifier rows: the reported gains in selection accuracy and predictive ability are shown only for the simulation settings described; without additional runs that increase tree depth or the size of the modifier candidate set, it is difficult to confirm that the approximation remains accurate outside the low-dimensional regime where the derivation is most likely to hold.
Authors: The referee is correct that the current simulation results in Table 3 are limited to the settings described in the paper. To provide stronger evidence of the approximation's behavior, we will conduct additional simulation experiments that increase both tree depth and the number of candidate effect modifiers. The outcomes of these runs will be reported in a revised or supplementary version of Table 3, allowing readers to assess performance in higher-dimensional regimes. revision: yes
Circularity Check
No circularity: DoF approximation presented as independent formula
full rationale
The paper develops an easy-to-apply formula to approximate the effective degrees of freedom in tree-structured varying coefficient models that arise from recursive partitioning. This formula is then used for BIC-based model selection and compared against the naive count of free parameters in simulations and real data. No quoted derivation step reduces the proposed formula to a fitted quantity on the same data, a self-referential definition, or a load-bearing self-citation whose validity depends on the present result. The central claim therefore remains an independent approximation whose accuracy can be checked externally against simulation benchmarks and predictive performance, satisfying the criteria for a self-contained derivation.
Axiom & Free-Parameter Ledger
axioms (1)
- domain assumption Recursive partitioning in TSVC models adds effective degrees of freedom beyond the final parameter count that can be approximated by a simple formula.
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
we develop an easy-to-apply formula to approximate the DoF of a TSVC model... ˆdf_M[s]_MFP(μ) = 2.13 + 2.02s + 1.26p + 0.61ps + 0.16·10^{-3} ps n
-
IndisputableMonolith/Foundation/DimensionForcing.leanalexander_duality_circle_linking unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
MC approximated DoF... search cost... recursive partitioning
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
T. Hothorn and K. Hornik and A. Zeileis , journal=. Unbiased recursive partitioning:. 2006 , doi =
work page 2006
-
[2]
Mixed effects regression trees for clustered data , author=. Stat Probab Lett , volume=. 2011 , doi =
work page 2011
- [3]
-
[4]
Building multivariable fractional polynomial models with selection of variables and transformations: a boostrap investigation , author =. 1999 , journal =
work page 1999
-
[5]
How Biased is the Apparent Error Rate of a Prediction Rule? , journal =. 1986 , author =
work page 1986
-
[6]
On Measuring and Correcting the Effects of Data Mining and Model Selection , author =. 1998 , journal =
work page 1998
-
[7]
L. Wang and H. Zhao and X. Fan , journal =. Degrees of Freedom: Search Cost and Self-Consistency , year =
-
[8]
On Degrees of Freedom of Projection Estimators with Applications to Multivariate Nonparametric Regression , author =. 2020 , journal =
work page 2020
-
[9]
B. Efron , title =. 2004 , pages =. doi:10.1198/016214504000000692 , journal =
-
[10]
Degrees of freedom for nonlinear least squares estimation
N. Hansen and A. Sokol , title =. arXiv: 1402.2997 , year =
work page internal anchor Pith review Pith/arXiv arXiv
-
[11]
Degrees of freedom in lasso problems , author =. Annals of Statistics , year =
-
[12]
H. Zhou and T. Hastie and R. Tibshirani , title =. Annals of Statistics , year =
-
[13]
Degrees of Freedom and Model Search , author =. Statistica Sinica , year =
-
[14]
W. Adler and S. Potapov and B. Lausen , title =. 2011 , journal =. doi:10.1007/s00180-011-0249-1 , pages =
- [15]
-
[16]
C. L. Niedzwiedz and S. V. Katikireddi and J. P. Pell and R. Mitchell , year =. Socioeconomic inequalities in thequality of life of older Europeans in different welfare regimes , journal =
-
[17]
I. T. M. Heyworth and M. L. Hazell and M. F. Linehan and T. L. Frank , title =. 2009 , journal =
work page 2009
-
[18]
N. E. Rothrock and R. D. Hays and K. Spritzer and S. E. Yount and W. Riley and D. Cella , title =. 2010 , journal =
work page 2010
-
[19]
A. Marengoni and S. Angleman and R. Melis and F. Mangialasche and A. Karp and A. Garmen aand B. Meinow and L. Fratiglioni , title =. 2011 , journal =
work page 2011
-
[20]
T. T. Makovski and S. Schmitz and M. P. Zeegers and S. Stranges and M. van der Akker , title =. Ageing Res Rev , doi =. 2019 , volume =
work page 2019
-
[21]
T. T. Makovski and G. Le Coroller and P. Putrik and Y. H. Choi and M. P. Zeegers and S. Stranges and M. Ruiz Castell and L. Huiart and M. van den Akker , title =. 2020 , jorunal =
work page 2020
-
[22]
O. von dem Knesebeck and M. Wahrendorf and M. Hyde and J. Siegrist , title =. 2007 , journal =
work page 2007
-
[23]
M. A. Killingsworth , year =. Experienced well-being rises with income, even above \ 75,000 per year , journal =
-
[24]
C. Rodr\'iguez-Bl\'aquez and O. Ribeiro and A. Ayala and L. Teixeira and L. Araj\'uo and M. J. Forjaz , title =. Int J Environ Res Public Health , year =
- [25]
-
[26]
Demography of Europe: 2024 edition , publisher =
work page 2024
- [27]
-
[28]
Adv Data Anal Classif , volume=
A comparison of five recursive partitioning methods to find person subgroups involved in meaningful treatment -- subgroup interactions , author=. Adv Data Anal Classif , volume=. 2014 , doi =
work page 2014
-
[29]
G. Tutz and M.-R. Oelker , journal=. Modelling clustered heterogeneity:. 2017 , doi =
work page 2017
-
[30]
Robustness of linear mixed-effects models to violations of distributional assumptions , author =. 2020 , journal =
work page 2020
-
[31]
Clustering in linear mixed models with approximate Dirichlet process mixtures using EM algorithm , author=. Stat Model , volume=. 2013 , doi =
work page 2013
- [32]
-
[33]
Data Analysis Using Regression and Multilevel/Hierarchical Models , author =. 2007 , publisher =
work page 2007
-
[34]
Variable selection for generalized linear mixed models by L_1 -penalized estimation , author =. 2014 , volume =. doi:10.1007/s11222-012-9359-z , journal =
- [35]
-
[36]
D. Howel , journal=. Interpreting and evaluating the. 2012 , publisher=
work page 2012
- [37]
-
[38]
International Conference on Machine Learning , pages=
Selective inference for sparse high-order interaction models , author=. International Conference on Machine Learning , pages=. 2017 , organization=
work page 2017
-
[39]
Selective inference after likelihood-or test-based model selection in linear models , author=. Stat Probab Lett , volume=. 2018 , publisher=
work page 2018
-
[40]
Antiviral treatment of COVID-19: which role can clinical parameters play in therapy evaluation? , author =. 2023 , journal =
work page 2023
-
[41]
Spuck, N. and Schmid, M. and Heim, N. and Klarmann-Schulz, U. and Hoerauf, A. and Berger, M. , year =. Statistics & Computing , volume =
-
[42]
2024 , author =. doi:10.6103/SHARE.w9.900 , journal =
- [43]
-
[44]
B\"orsch-Supan, A. and Brandt, M. and Hunkler, C. and Kneip, T. and Korbmacher, J. and Malter, F. and Schaan, B. and Stuck, S. and Zuber, S. , year =. International Journal of Epidemiology , doi =
-
[45]
Hyde, M. and Wiggins, R.D. and Higgs, P. and Blane, D.B. , year =. Aging & Mental Health , volume =
- [46]
-
[47]
Borrat-Besson, C. and Ryser, V. and Goncalves, J. , year =. FORS Working Papers , doi =
-
[48]
Faber, J. and Berger, M. and Wilke, C. and Huebner-Schmid, J. and Schaprian, T. and Santana, M. M. and Grobe-Einsler, M. and et al. , year =. doi:10.1002/ana.26824 , journal =
-
[49]
Fixed and random effects models: making an informed choice , author =. 2018 , journal =
work page 2018
-
[50]
Linear Mixed Models for Longitudinal Data , author =. 2000 , publisher =
work page 2000
-
[51]
M. N. Wright and I. K\"onig , title =. 2019 , journal =
work page 2019
-
[52]
Detecting treatment-subgroup interactions in clustered data with generalized linear mixed-effects model trees , author =. Behav Res Methods , doi =. 2018 , volume =
work page 2018
- [53]
-
[54]
Heagerty, P. and Kurland, B.F. , journal =. Misspecified maximum likelihood estimates and generalised linear mixed models , year =
-
[55]
Grilli, L. and Rampichini, C. , journal =. The role of sample cluster means in multi-level models: A view on endogeneity and measurment error issues , year =
-
[56]
Unbiased regression trees for longitudinal and clustered data , journal =. 2015 , doi =
work page 2015
-
[57]
Tree-based Machine Learning Methods for Survey Research , author =. J Eur Surv Res Assoc , doi =. 2019 , volume =
work page 2019
-
[58]
Recursive Partitioning in Health Sciences , author =. 1999 , publisher =
work page 1999
-
[59]
An Introduction to Recursive Partitioning: Rationale, Application and Characteristics of Classification and Regression Trees, Bagging and Random Forests , author =. 2009 , journal =
work page 2009
- [60]
-
[61]
Frias-Goytia, G. L. and Lojo-Seoane, C. and Mallo, S. C. and Nieto-Vieites, A. and Juncos-Rabad\`an, O. and Pereiro, A.X. , title =. 2024 , journal =
work page 2024
-
[62]
W. D. Fisher , title =. J Am Stat Soc , year =
-
[63]
Pattern Recognition and Neural Networks , author =. 1996 , journal =
work page 1996
-
[64]
Sela, R. J. and Simonoff, J. S. , title =. 2012 , journal =. doi:10.1007/s10994-011-5258-3 , volume =
-
[65]
Exact post-selection inference for sequential regression procedures , author=. J Am Stat Assoc , volume=. 2016 , publisher=
work page 2016
-
[66]
Data Min Knowl Discov , volume=
Decision tree boosted varying coefficient models , author=. Data Min Knowl Discov , volume=. 2022 , publisher=
work page 2022
-
[67]
Varying coefficient regression models: a review and new developments , author=. Int Stat Rev , volume=. 2015 , doi =
work page 2015
-
[68]
The one standard error rule for model selection: does it work? , author=. Stats , volume=. 2021 , doi =
work page 2021
- [69]
-
[70]
Taylor, J. and Tibshirani, R. , year =. Post-selection inference for. Can J Stat , volume =
- [71]
-
[72]
Valid post-selection inference , volume =. Ann Stat , author =. doi:10.1214/12-aos1077 , number =
- [73]
- [74]
- [75]
-
[76]
Burnham, K. P. and Anderson, D. R. , title =. 2002 , adress =
work page 2002
-
[77]
Tree-Values: Selective Inference for Regression Trees , author =. 2022 , journal =
work page 2022
-
[78]
Selective inference for effect modification via the lasso , author =. 2022 , journal =
work page 2022
-
[79]
Statistical methods with varying coefficient models , author =. 2008 , journal =
work page 2008
-
[80]
J Comput Graph Stat , author =
Boosted Varying-Coefficient Regression Models for Product Demand Prediction , volume =. J Comput Graph Stat , author =. 2014 , pages =. doi:10.1080/10618600.2013.778777 , number =
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.