Simultaneous Variable Selection, Clustering, and Smoothing in Function on Scalar Regression
Pith reviewed 2026-05-25 16:38 UTC · model grok-4.3
The pith
A Bayesian prior in function-on-scalar regression selects, clusters, and smooths effects of correlated predictors simultaneously.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The methodology groups effects of highly correlated predictors, performing dimension reduction without dropping relevant predictors from the model, by means of a joint prior that induces simultaneous variable selection, clustering, and smoothing of the functional coefficients.
What carries the argument
The joint prior that simultaneously induces selection, clustering, and smoothing on the functional coefficients.
If this is right
- Dimension reduction occurs through grouping rather than deletion, so all predictors remain available for interpretation.
- The method outperforms existing dimension reduction approaches in function-on-scalar settings according to the simulation results.
- Application to age-specific fertility rates shows the model can be used on real functional response data with correlated scalar predictors.
- The simultaneous handling of selection, clustering, and smoothing removes the need for separate preprocessing steps for multicollinearity.
Where Pith is reading between the lines
- The clustering mechanism could be tested on datasets where predictor correlation structure is known in advance to check whether recovered groups match expected patterns.
- If the prior works without bias, the same construction might reduce reliance on principal-component-style preprocessing in other functional regression problems.
- Extension to settings with mixed scalar and functional predictors would be a natural next test of whether the joint prior generalizes beyond the scalar-predictor case studied here.
Load-bearing premise
The joint prior successfully induces the desired clustering, selection, and smoothing simultaneously without introducing bias or instability in the functional coefficient estimates.
What would settle it
A simulation study in which the estimated clusters fail to align with the known correlation structure among predictors or in which the method produces visibly biased coefficient estimates relative to an oracle model would falsify the central claim.
Figures
read the original abstract
We address the problem of multicollinearity in a function-on-scalar regression model by using a prior which simultaneously selects, clusters, and smooths functional effects. Our methodology groups effects of highly correlated predictors, performing dimension reduction without dropping relevant predictors from the model. We validate our approach via a simulation study, showing superior performance relative to existing dimension reduction approaches in the function-on-scalar literature. We also demonstrate the use of our model on a data set of age specific fertility rates from the United Nations Gender Information database.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a Bayesian joint prior for the functional coefficients in a function-on-scalar regression model. The prior is constructed to simultaneously induce variable selection, clustering of effects from correlated predictors, and smoothing of the functional coefficients. This is positioned as a way to perform dimension reduction while retaining relevant predictors under multicollinearity. The approach is evaluated through a simulation study claiming superior performance relative to existing dimension-reduction methods in the function-on-scalar literature, and is illustrated on age-specific fertility rate data from the United Nations Gender Information database.
Significance. If the central construction holds, the work offers a unified regularization strategy that combines selection, clustering, and smoothing without explicit stepwise procedures. This could be useful in high-dimensional functional regression settings where predictors are correlated. The simulation comparison and real-data application provide concrete evidence of practical behavior, though the strength of the superiority claim depends on the specific metrics and design details in the full methods and results sections.
major comments (2)
- [§3] §3 (prior construction): the claim that the joint prior induces clustering, selection, and smoothing simultaneously without introducing bias in the functional coefficient estimates requires explicit verification that the hyperparameter choices do not create dependence between the three effects; the simulation design should include a sensitivity check on these hyperparameters to confirm the reported performance gains are robust.
- [§4] §4 (simulation study): the superiority claim relative to existing approaches needs to be supported by reporting the exact error metrics (e.g., integrated squared error, prediction MSE) and the number of Monte Carlo replications; without these, it is difficult to assess whether the dimension-reduction benefit is statistically significant or merely descriptive.
minor comments (3)
- [Abstract/Introduction] The abstract and introduction should include a brief statement of the model equation (function-on-scalar regression) and the precise form of the functional coefficients to orient readers before describing the prior.
- [§3] Notation for the clustering component (e.g., how group membership is encoded) should be defined consistently between the prior specification and the MCMC algorithm description.
- [§5] In the real-data application, the number of predictors and the degree of multicollinearity observed in the UN fertility data should be quantified to allow readers to judge the relevance of the multicollinearity-handling claim.
Simulated Author's Rebuttal
Thank you for the constructive feedback and the recommendation for minor revision. We address each major comment below.
read point-by-point responses
-
Referee: [§3] §3 (prior construction): the claim that the joint prior induces clustering, selection, and smoothing simultaneously without introducing bias in the functional coefficient estimates requires explicit verification that the hyperparameter choices do not create dependence between the three effects; the simulation design should include a sensitivity check on these hyperparameters to confirm the reported performance gains are robust.
Authors: The hierarchical structure of the prior is intended to separate the selection, clustering, and smoothing components through distinct hyperparameter controls. To directly address the request for verification, we will add a sensitivity analysis on the hyperparameters in the revised manuscript, confirming robustness of the reported gains. revision: yes
-
Referee: [§4] §4 (simulation study): the superiority claim relative to existing approaches needs to be supported by reporting the exact error metrics (e.g., integrated squared error, prediction MSE) and the number of Monte Carlo replications; without these, it is difficult to assess whether the dimension-reduction benefit is statistically significant or merely descriptive.
Authors: We agree that explicit reporting is needed for full assessment. The revised manuscript will include the precise error metrics (including integrated squared error and prediction MSE) along with the number of Monte Carlo replications performed. revision: yes
Circularity Check
No significant circularity identified
full rationale
The provided abstract and description present a Bayesian prior construction for simultaneous selection, clustering, and smoothing in function-on-scalar regression, with performance validated through simulation studies against existing methods. No equations, fitted parameters, or self-citations are exhibited that reduce any claimed prediction or result to the inputs by construction. The central methodology is described as a novel joint prior inducing the desired properties, with external validation via simulation rather than internal redefinition or load-bearing self-reference. The derivation chain appears self-contained against the stated benchmarks.
Axiom & Free-Parameter Ledger
Lean theorems connected to this paper
-
IndisputableMonolith/Cost/FunctionalEquation.leanwashburn_uniqueness_aczel unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
We address the problem of multicollinearity ... by using a prior which simultaneously selects, clusters, and smooths functional effects ... G∼DP(G0,α) ... π0δ0+(1−π0)G∗
-
IndisputableMonolith/Foundation/RealityFromDistinction.leanreality_from_one_distinction unclear?
unclearRelation between the paper passage and the cited Recognition theorem.
Dirichlet Process priors ... stick breaking construction ... Neal [2000] Gibbs sampler
What do these tags mean?
- matches
- The paper's claim is directly supported by a theorem in the formal canon.
- supports
- The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
- extends
- The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
- uses
- The paper appears to rely on the theorem as machinery.
- contradicts
- The paper's claim conflicts with a theorem or certificate in the canon.
- unclear
- Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.
Reference graph
Works this paper leans on
-
[1]
Unsupervised curve clustering using b-splines
Christophe Abraham, Pierre-Andr \'e Cornillon, ERIC Matzner-L ber, and Nicolas Molinari. Unsupervised curve clustering using b-splines. Scandinavian journal of statistics, 30 0 (3): 0 581--595, 2003
work page 2003
- [2]
-
[3]
Ferguson distributions via p \'o lya urn schemes
David Blackwell, James B MacQueen, et al. Ferguson distributions via p \'o lya urn schemes. The annals of statistics, 1 0 (2): 0 353--355, 1973
work page 1973
-
[4]
Howard D Bondell and Brian J Reich. Simultaneous regression shrinkage, variable selection, and supervised clustering of predictors with oscar. Biometrics, 64 0 (1): 0 115--123, 2008
work page 2008
-
[5]
Variable selection in function-on-scalar regression
Yakuan Chen, Jeff Goldsmith, and R Todd Ogden. Variable selection in function-on-scalar regression. Stat, 5 0 (1): 0 88--101, 2016
work page 2016
-
[6]
S McKay Curtis and Sujit K Ghosh. A bayesian approach to multicollinearity and the simultaneous selection and clustering of predictors in linear regression. Journal of Statistical Theory and Practice, 5 0 (4): 0 715--735, 2011
work page 2011
-
[7]
Bayesian selection and clustering of polymorphisms in functionally related genes
David B Dunson, Amy H Herring, and Stephanie M Engel. Bayesian selection and clustering of polymorphisms in functionally related genes. Journal of the American Statistical Association, 103 0 (482): 0 534--546, 2008
work page 2008
-
[8]
Flexible smoothing with b-splines and penalties
Paul HC Eilers and Brian D Marx. Flexible smoothing with b-splines and penalties. Statistical science, pages 89--102, 1996
work page 1996
-
[9]
Bayesian density estimation and inference using mixtures
Michael D Escobar and Mike West. Bayesian density estimation and inference using mixtures. Journal of the american statistical association, 90 0 (430): 0 577--588, 1995
work page 1995
-
[10]
High-dimensional adaptive function-on-scalar regression
Zhaohu Fan and Matthew Reimherr. High-dimensional adaptive function-on-scalar regression. Econometrics and statistics, 1: 0 167--183, 2017
work page 2017
- [11]
-
[12]
Jeff Goldsmith and Tomoko Kitago. Assessing systematic effects of stroke on motor control by using hierarchical function-on-scalar regression. Journal of the Royal Statistical Society: Series C (Applied Statistics), 65 0 (2): 0 215--236, 2016
work page 2016
-
[13]
Variable selection in the functional linear concurrent model
Jeff Goldsmith and Joseph E Schwartz. Variable selection in the functional linear concurrent model. Statistics in medicine, 36 0 (14): 0 2237--2250, 2017
work page 2017
-
[14]
Functional data clustering: a survey
Julien Jacques and Cristian Preda. Functional data clustering: a survey. Advances in Data Analysis and Classification, 8 0 (3): 0 231--255, 2014
work page 2014
-
[15]
Variable selection in clustering via dirichlet process mixture models
Sinae Kim, Mahlet G Tadesse, and Marina Vannucci. Variable selection in clustering via dirichlet process mixture models. Biometrika, 93 0 (4): 0 877--893, 2006
work page 2006
-
[16]
Dynamic Function-on-Scalars Regression
Daniel R Kowal. Dynamic function-on-scalars regression. arXiv preprint arXiv:1806.01460, 2018
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[17]
Bayesian Function-on-Scalars Regression for High Dimensional Data
Daniel R Kowal and Daniel C Bourgeois. Bayesian function-on-scalars regression for high dimensional data. arXiv preprint arXiv:1808.06689, 2018
work page internal anchor Pith review Pith/arXiv arXiv 2018
-
[18]
Estimating mixture of dirichlet process models
Steven N MacEachern and Peter M \"u ller. Estimating mixture of dirichlet process models. Journal of Computational and Graphical Statistics, 7 0 (2): 0 223--238, 1998
work page 1998
-
[19]
Markov chain sampling methods for dirichlet process mixture models
Radford M Neal. Markov chain sampling methods for dirichlet process mixture models. Journal of computational and graphical statistics, 9 0 (2): 0 249--265, 2000
work page 2000
-
[20]
Predictive performance of dirichlet process shrinkage methods in linear regression
David J Nott. Predictive performance of dirichlet process shrinkage methods in linear regression. Computational Statistics & Data Analysis, 52 0 (7): 0 3658--3669, 2008
work page 2008
-
[21]
Simultaneous variable selection and smoothing for high-dimensional function-on-scalar regression
Alice Parodi and Matthew Reimherr. Simultaneous variable selection and smoothing for high-dimensional function-on-scalar regression. Electron. J. Statist., 12 0 (2): 0 4602--4639, 2018. doi:10.1214/18-EJS1509. URL https://doi.org/10.1214/18-EJS1509
-
[22]
Functional clustering by bayesian wavelet methods
Shubhankar Ray and Bani Mallick. Functional clustering by bayesian wavelet methods. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 68 0 (2): 0 305--332, 2006
work page 2006
-
[23]
A constructive definition of dirichlet priors
Jayaram Sethuraman. A constructive definition of dirichlet priors. Statistica sinica, pages 639--650, 1994
work page 1994
-
[24]
Dhruv B. Sharma, Howard D. Bondell, and Hao Helen Zhang. Consistent group identification and variable selection in regression with correlated predictors. Journal of Computational and Graphical Statistics, 22 0 (2): 0 319--340, 2013
work page 2013
-
[25]
Sparse regression with exact clustering
Yiyuan She. Sparse regression with exact clustering. Electronic Journal of Statistics, 4: 0 1055--1096, 2010
work page 2010
-
[26]
Bayesian variable selection in clustering high-dimensional data
Mahlet G Tadesse, Naijun Sha, and Marina Vannucci. Bayesian variable selection in clustering high-dimensional data. Journal of the American Statistical Association, 100 0 (470): 0 602--617, 2005
work page 2005
-
[27]
Tibshirani and Jonathan Taylor
Ryan J. Tibshirani and Jonathan Taylor. The solution path of the generalized lasso. Ann. Statist., 39 0 (3): 0 1335--1371, 06 2011. doi:10.1214/11-AOS878. URL https://doi.org/10.1214/11-AOS878
-
[28]
Group scad regression analysis for microarray time course gene expression data
Lifeng Wang, Guang Chen, and Hongzhe Li. Group scad regression analysis for microarray time course gene expression data. Bioinformatics, 23 0 (12): 0 1486--1494, 2007
work page 2007
-
[29]
Model selection and estimation in regression with grouped variables
Ming Yuan and Yi Lin. Model selection and estimation in regression with grouped variables. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 68 0 (1): 0 49--67, 2006
work page 2006
-
[30]
Joint Clustering and Registration of Functional Data
Yafeng Zhang and Donatello Telesca. Joint clustering and registration of functional data. arXiv preprint arXiv:1403.7134, 2014
work page internal anchor Pith review Pith/arXiv arXiv 2014
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.