Likelihood-based Inference for Skewed Responses in a Crossover Trial Setup
Pith reviewed 2026-05-24 09:10 UTC · model grok-4.3
The pith
Linear mixed effect models with skew-normal random effects or errors model asymmetric responses in crossover trials via EM algorithm estimation.
A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.
Core claim
The paper claims that a linear mixed effect model with a skew-normal random effect or a skew-normal random error term adequately represents the asymmetric responses arising in crossover trials. Maximum likelihood estimates are obtained through the EM algorithm in both cases, and the resulting fitted models are shown to be applicable to the 3x3 design with gene expression measurements.
What carries the argument
Linear mixed effect model with skew-normal random effect or skew-normal random error, estimated by the EM algorithm for maximum likelihood.
If this is right
- Maximum likelihood estimates remain computable when responses exhibit skewness in crossover designs.
- The method directly handles the 3x3 layout with multiple gene measurements per subject and period.
- Simulations recover the true parameters under both skew-normal random-effect and random-error variants.
- Application to the asthma gene-expression trial demonstrates practical estimation and model use.
Where Pith is reading between the lines
- The same modeling strategy could be tested on other repeated-measures clinical trials that produce skewed outcomes.
- Direct comparison of fit statistics between the skew-normal versions and conventional normal mixed models on the same dataset would quantify the gain from asymmetry.
- Extension to four or more periods or to additional covariates such as baseline gene levels could be examined without altering the core estimation procedure.
Load-bearing premise
The skew-normal distribution captures the observed asymmetry in gene expression responses across periods without needing another asymmetric family or a transformation.
What would settle it
Fitting the proposed skew-normal mixed model to the gene expression data and finding that residuals remain asymmetric or that a standard normal mixed model produces equal or higher likelihood values would undermine the central claim.
Figures
read the original abstract
This work proposes a statistical model for crossover trials with multiple skewed responses measured in each period. A 3 $\times$ 3 crossover trial data where different drug doses were administered to subjects with a history of seasonal asthma rhinitis to grass pollen is used for motivation. In each period, gene expression values for ten genes were measured from each subject. It considers a linear mixed effect model with skew normally distributed random effect or random error term to model the asymmetric responses in the crossover trials. The paper examines cases (i) when a random effect follows a skew-normal distribution, as well as (ii) when a random error follows a skew-normal distribution. The EM algorithm is used in both cases to compute maximum likelihood estimates of parameters. Simulations and crossover data from the gene expression study illustrate the proposed approach. Keywords: Crossover design, Mixed effect models, Skew-normal distribution, EM algorithm.
Editorial analysis
A structured set of objections, weighed in public.
Referee Report
Summary. The paper proposes a linear mixed-effects model for analyzing skewed responses in 3×3 crossover trials, specifically motivated by gene-expression data from an asthma study. It considers two cases: skew-normal random effects or skew-normal errors, derives EM algorithms to obtain maximum likelihood estimates in each case, and illustrates the methods with simulations and real-data analysis of ten genes across periods.
Significance. If the EM derivations are correct and the skew-normal specification adequately captures the observed asymmetry, the work supplies a practical likelihood-based tool for crossover designs with non-normal responses, an area relevant to clinical and genomic studies. The dual-case treatment (random effect vs. error) and the combination of simulation and real-data examples are strengths.
major comments (2)
- [§3.2] §3.2 (EM algorithm for skew-normal random effects): the E-step expressions for the conditional expectations involving the skew-normal latent variables are stated without an explicit derivation or reference to the standard skew-normal stochastic representation; this step is load-bearing for verifying that the subsequent M-step yields true MLEs.
- [Table 4] Table 4 (real-data parameter estimates): the reported standard errors for the skewness parameter under the two model variants are obtained from the observed information matrix, but no check is provided that the matrix is positive definite or that the estimates are interior to the parameter space; this affects the reliability of inference for the gene-expression application.
minor comments (3)
- [Abstract] The abstract uses 'skew normally distributed' twice; consistent hyphenation ('skew-normally') would improve readability.
- [§2.1] Section 2.1 defines the crossover design but does not cite the original trial protocol or the source of the gene-expression measurements; adding this reference would aid reproducibility.
- [§4] In the simulation section the number of Monte Carlo replications is stated as 500, yet the reported coverage probabilities in Table 3 appear to be based on a different count; clarify the exact replication number used for each metric.
Simulated Author's Rebuttal
We thank the referee for the careful review and constructive comments on our manuscript. We address each major comment below and indicate the revisions we plan to make.
read point-by-point responses
-
Referee: [§3.2] §3.2 (EM algorithm for skew-normal random effects): the E-step expressions for the conditional expectations involving the skew-normal latent variables are stated without an explicit derivation or reference to the standard skew-normal stochastic representation; this step is load-bearing for verifying that the subsequent M-step yields true MLEs.
Authors: We agree that an explicit derivation or reference would improve verifiability of the EM algorithm. In the revised manuscript we will add a concise derivation of the relevant conditional expectations in the E-step, drawing on the standard stochastic representation of the skew-normal distribution (Azzalini and Capitanio, 2014). revision: yes
-
Referee: [Table 4] Table 4 (real-data parameter estimates): the reported standard errors for the skewness parameter under the two model variants are obtained from the observed information matrix, but no check is provided that the matrix is positive definite or that the estimates are interior to the parameter space; this affects the reliability of inference for the gene-expression application.
Authors: We acknowledge the importance of these checks for the reported standard errors. In the revision we will add a brief statement confirming that the observed information matrix is positive definite at the reported MLEs and that the skewness-parameter estimates lie in the interior of the parameter space for the genes analyzed. revision: yes
Circularity Check
No significant circularity; model definition precedes estimation
full rationale
The paper defines a linear mixed-effects model with skew-normal random effect or error term, then applies the EM algorithm to obtain MLEs from data. No equation reduces a claimed prediction or fitted quantity to an input by construction, no self-citation chain supplies a load-bearing uniqueness result, and the skew-normal choice is presented as a modeling decision rather than derived from prior self-work. Simulations and real-data analysis are downstream of the model specification, with no renaming of known results or smuggling of ansatzes via citation. The derivation chain is therefore self-contained against external benchmarks.
Axiom & Free-Parameter Ledger
free parameters (2)
- skewness parameter
- variance components and fixed effects
axioms (1)
- domain assumption Responses follow a linear mixed-effects structure whose random terms or errors belong to the skew-normal family
Reference graph
Works this paper leans on
-
[1]
R.B. Arellano-Valle and A. Azzalini. The centred parametrization for the multivariate skew-normal distribution. Journal of Multivariate Analysis , 100(4):816, 2009
work page 2009
-
[2]
R.B. Arellano-Valle and Marc G. Genton. On fundamental skew distri- butions. Journal of Multivariate Analysis , 96(1):93–116, 2005
work page 2005
-
[3]
R.B. Arellano-Valle, H. Bolfarine, and V.H. Lachos. Skew-normal Linear Mixed Models. Journal of Data Science , 3:415–438, 2005
work page 2005
- [4]
-
[5]
A. Azzalini and A. Capitanio. Statistical applications of the multivariate skew normal distribution. Journal of the Royal Statistical Society. Series B: Statistical Methodology , 61(3):579–602, 1999. 25
work page 1999
-
[6]
A. Azzalini and A. Capitanio. The Skew-Normal and Related Families . Cambridge, 2014
work page 2014
-
[7]
A. Azzalini and A. Dalla Valle. The multivariate skew-normal distribu- tion. Biometrika, 83(4):715–726, 1996
work page 1996
-
[8]
M´ arcia D. Branco and Dipak K. Dey. A general class of multivariate skew-elliptical distributions. Journal of Multivariate Analysis , 79(1): 99–113, 2001
work page 2001
-
[9]
Divan Aristo Burger, Robert Schall, and Sean van der Merwe. A robust method for the assessment of average bioequivalence in the presence of outliers and skewness. Pharmaceutical Research, 38(10):1697–1709, 2021
work page 2021
-
[10]
Vernon M. Chinchilli and James D. Esinhart. Design and analy- sis of intra-subject variability in cross-over experiments. Statistics in Medicine, 15:1619–1634, 1996
work page 1996
-
[11]
The Gene Expression Omnibus database
Emily Clough and Tanya Barrett. The Gene Expression Omnibus database. Methods in Molecular Biology , 1418:93–110, 2016
work page 2016
-
[12]
The use of asymmetric distributions in average bioequivalence
Roberto Molina de Souza, Jorge Alberto Achcar, Edson Zangiacomi Martinez, and Josmar Mazucheli. The use of asymmetric distributions in average bioequivalence. Statistics in Medicine , 35(15):2525–2542, 2016
work page 2016
-
[13]
A.P. Dempster, N.M. Laird, and D.B. Rubin. Maximum likelihood from incomplete data via the EM algorithm. Journal of the Royal Statistical Society: Series B (Methodological) , 39:1–38, 1977
work page 1977
-
[14]
Nonlinear mixed-effects models with misspecified random-effects distribution
Reza Drikvandi. Nonlinear mixed-effects models with misspecified random-effects distribution. Pharmaceutical Statistics , 19(3):187–201, 2019
work page 2019
-
[15]
Diagnosing mis- specification of the random-effects distribution in mixed models
Reza Drikvandi, Geert Verbeke, and Geert Molenberghs. Diagnosing mis- specification of the random-effects distribution in mixed models. Biometrics, 73(1):63–71, 2016
work page 2016
-
[16]
Changyong Feng, Hongyue Wang, Naiji Lu, Tian Chen, Hua He, Ying Lu, and Xin M. Tu. Log-transformation and its implications for data analysis. Shanghai Archives of Psychiatry , 26(2):105–109, 2014. 26
work page 2014
-
[17]
Smooth Random Effects Distribution in a Linear Mixed Model
Wendimagegn Ghidey, Emmanuel Lesaffre, and Paul Eilers. Smooth Random Effects Distribution in a Linear Mixed Model. Biometrics, 60 (4):945–953, 2004
work page 2004
-
[18]
J.G. Glosup and M.C. Axelrod. Use of the AIC with the EM Algorithm: A Demonstration of a Probability Model Selection Technique. In Joint Statistical Meeting, 1994
work page 1994
-
[19]
Julie M. Grender and William D. Johnson. Analysis of crossover designs with multivariate response. Statistics in Medicine , 12(1):69–89, 1993
work page 1993
-
[20]
M.J.R. Healy. Multivariate Normal Plotting. Journal of the Royal Sta- tistical Society. Series C (Applied Statistics) , 17(2):157–161, 1968
work page 1968
-
[21]
W.D. Johnson and D.E. Mercante. Analyzing multivariate data in crossover designs using permutation tests. Journal of Biopharmaceu- tical Statistics, 6(3):327–342, 1996
work page 1996
-
[22]
Byron Jones and Michael G. Kenward. Design and Analysis of Cross- Over Trials. Chapman & Hall/CRC, second edition, 2003
work page 2003
-
[23]
V.H. Lachos, Pulak Ghosh, and R.B. Arellano-Valle. Likelihood based inference for skew-normal independent linear mixed models. Statistica Sinica, 20(1):303–322, 2010
work page 2010
-
[24]
Nan M. Laird and James H. Ware. Random-Effects Models for Longi- tudinal Data. Biometrics, 38(4):963–974, 1982
work page 1982
-
[25]
B.R. Leaker, V.A. Malkov, R. Mogg, M.K. Ruddy, G.C. Nicholson, A.J. Tan, C. Tribouley, and G. Chen. The nasal mucosal late allergic reaction to grass pollen involves type 2 inflammation ( IL-5 and IL-13 ), the inflammasome ( IL-1 b ), and complement. Nature, 10(2):408–420, 2016
work page 2016
-
[26]
Frank J. Massey. The Kolmogorov-Smirnov Test for Goodness of Fit. Journal of the American Statistical Association , 46(253):68–78, 1951
work page 1951
-
[27]
Likelihood-based missing data analysis in multivariate crossover trials
Savita Pareek, Kalyan Das, and Siuli Mukhopadhyay. Likelihood-based missing data analysis in multivariate crossover trials. arXiv pre-print , 2021
work page 2021
-
[28]
Nonlinear 27 mixed-effects models with scale mixture of skew-normal distributions
Marcos Antonio Alves Pereira and Cibele Maria Russo. Nonlinear 27 mixed-effects models with scale mixture of skew-normal distributions. Journal of Applied Statistics , 46(9):1602–1620, 2019
work page 2019
-
[29]
Mary Putt and Vernon M. Chinchilli. A mixed effects model for the analysis of repeated measures cross-over studies. Statistics in Medicine , 18(22):3037–3058, 1999
work page 1999
-
[30]
R: A Language and Environment for Statistical Com- puting
R Core Team. R: A Language and Environment for Statistical Com- puting. R Foundation for Statistical Computing, Vienna, Austria, 2022. URL https://www.R-project.org/
work page 2022
-
[31]
Fernanda L. Schumacher, V.H. Lachos, and Larissa A. Matos. Scale mixture of skew-normal linear mixed models with within-subject serial dependence. Statistics in Medicine , 40(7):1790–1810, 2021
work page 2021
-
[32]
Cross-Over Trials in Clinical Research
Stephen Senn. Cross-Over Trials in Clinical Research . John Wiley & Sons, Ltd., 2002
work page 2002
-
[33]
S.S. Shapiro and M.B. Wilk. An Analysis of Variance Test for Normality (Complete Samples). Biometrika, 52(3):591–611, 1965
work page 1965
-
[34]
Gail E. Tudor, Gary G. Koch, and Diane Catellier. Statistical meth- ods for crossover designs in bioenvironmental and public health studies. Handbook of Statistics , 18:571–614, 2000
work page 2000
-
[35]
Linear Mixed Models for Longi- tudinal Data
Geert Verbeke and Geert Molenberghs. Linear Mixed Models for Longi- tudinal Data . Springer-Verlag New York, Inc., 2000
work page 2000
-
[36]
C.F. Jeff Wu. On the convergence properties of the EM algorithm. Annals of Statistics , 11(1):95–103, 1983
work page 1983
-
[37]
Linear Mixed Models with Flexible Distributions of Random Effects for Longitudinal Data
Daowen Zhang and Marie Davidian. Linear Mixed Models with Flexible Distributions of Random Effects for Longitudinal Data. Biometrics, 57: 795–802, 2001. 28 9 Appendix 9.1 Skew-Normal Distribution The following is a brief overview of the skew-normal distribution and the terminology that we have used in our analysis. (i) Univariate skew-normal variate (Azzal...
work page 2001
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.