A Generative Approach to Joint Modeling of Quantitative and Qualitative Responses

Lulu Kang; Wei Chen; Xiaoning Kang; Xinwei Deng

arxiv: 2102.09448 · v1 · submitted 2021-02-18 · 📊 stat.ME · math.ST· stat.TH

A Generative Approach to Joint Modeling of Quantitative and Qualitative Responses

Xiaoning Kang , Lulu Kang , Wei Chen , Xinwei Deng This is my paper

Pith reviewed 2026-05-24 14:12 UTC · model grok-4.3

classification 📊 stat.ME math.STstat.TH

keywords generative modeljoint modelingquantitative qualitative responsespenalized likelihoodasymptotic optimalityclassificationpredictionhigh-dimensional predictors

0 comments

The pith

A generative model of the joint distribution of quantitative responses, qualitative responses, and predictors yields efficient penalized estimation plus asymptotically optimal classification and prediction.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper proposes modeling the full joint distribution of quantitative responses, qualitative responses, and the predictors themselves, rather than conditioning the responses only on the predictors. This generative framing incorporates the dependency structure among predictors as additional information for estimation. The model is fit via a penalized likelihood that remains computationally efficient even with many predictors. Accurate classification of the qualitative response and accurate prediction of the quantitative response follow directly, and the generative construction permits proofs of asymptotic optimality under regularity conditions. The claims are checked in simulations and in material-science and genetics data sets.

Core claim

By modeling the joint distribution of the quantitative response, the qualitative response, and the predictors, the proposed generative approach enables efficient parameter estimation via penalized likelihood. It delivers accurate classification and prediction while allowing the establishment of asymptotic optimality under regularity conditions. The method is demonstrated through simulations and applications in material science and genetics.

What carries the argument

A generative model of the joint distribution of quantitative responses, qualitative responses, and predictors, fitted by penalized likelihood.

If this is right

Parameter estimation remains efficient under the penalized likelihood framework even with high-dimensional predictors.
Classification accuracy for the qualitative response improves by using the joint structure.
Prediction accuracy for the quantitative response improves by using the joint structure.
Asymptotic optimality of both classification and prediction holds under the regularity conditions.
The same procedure applies directly to material-science and genetics data sets with mixed responses.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

The same generative construction could be extended to more than one qualitative response without changing the core estimation strategy.
In settings where predictors exhibit block or graphical dependence, the model might reduce the effective dimensionality without explicit variable selection.
Domains outside material science and genetics that routinely collect mixed quantitative-qualitative outcomes with correlated predictors could adopt the approach with minimal modification.

Load-bearing premise

The dependencies among the predictor variables supply useful additional information that improves joint modeling of the responses, and the invoked regularity conditions hold for the data-generating process.

What would settle it

A data set in which predictors are independent yet the generative model still claims superiority, or a finite-sample regime where classification or prediction error fails to approach the optimal rate under the stated regularity conditions.

read the original abstract

In many scientific areas, data with quantitative and qualitative (QQ) responses are commonly encountered with a large number of predictors. By exploring the association between QQ responses, existing approaches often consider a joint model of QQ responses given the predictor variables. However, the dependency among predictive variables also provides useful information for modeling QQ responses. In this work, we propose a generative approach to model the joint distribution of the QQ responses and predictors. The proposed generative model provides efficient parameter estimation under a penalized likelihood framework. It achieves accurate classification for qualitative response and accurate prediction for quantitative response with efficient computation. Because of the generative approach framework, the asymptotic optimality of classification and prediction of the proposed method can be established under some regularity conditions. The performance of the proposed method is examined through simulations and real case studies in material science and genetics.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

This paper develops a generative joint model for quantitative and qualitative responses that includes the predictors to capture their dependencies.

read the letter

The punchline is that this paper proposes modeling the joint distribution of the quantitative and qualitative responses along with the predictors themselves, using a generative framework with penalized likelihood estimation. This is new in the sense that most joint models for mixed responses condition on the predictors, whereas here the predictor dependencies are explicitly part of the model. That can be useful when predictors are correlated, as is common in genetics or material science data. The paper claims efficient estimation, good accuracy in classification and prediction, and asymptotic optimality under regularity conditions, supported by simulations and two real case studies. The approach does well in framing the problem generatively to leverage additional information from the predictors. The penalized framework is a standard and practical choice for high-dimensional settings. Where it is softer is in the details of how the joint distribution is constructed for the mixed response types. The abstract does not spell out the specific distributions or how the penalty is applied across the different variable types, which makes it hard to evaluate potential issues like model misspecification or computational scalability. The optimality results depend on regularity conditions that may or may not hold in typical data, and without more on the simulation setup it's difficult to judge how robust the performance claims are. This work is for methodologists in statistics who deal with mixed outcome data and want to incorporate predictor structure. It would be of interest to those in applied areas like genetics. It is worth a serious referee because the idea is coherent and the claims are grounded in external conditions rather than internal circularity. The paper shows clear thinking on the modeling choice. I would recommend sending it to peer review.

Referee Report

2 major / 1 minor

Summary. The manuscript proposes a generative model for the joint distribution of quantitative and qualitative (QQ) responses together with a large number of predictors. Parameter estimation proceeds via a penalized likelihood; the induced conditional rules are claimed to deliver accurate classification and prediction with efficient computation. Asymptotic optimality of these rules is asserted under regularity conditions. Performance is assessed via simulations and two real-data examples (material science, genetics).

Significance. If the penalized-likelihood construction and the asymptotic arguments are valid, the generative joint modeling framework supplies a coherent route to exploiting predictor dependence while furnishing theoretical guarantees that are often absent from separate or conditional modeling approaches. The combination of computational efficiency and optimality results would be a useful addition to the literature on mixed-response high-dimensional regression.

major comments (2)

[Abstract / theoretical development] The abstract states that asymptotic optimality of classification and prediction follows from the generative framework under regularity conditions, yet the manuscript supplies neither the explicit statement of those conditions nor the key steps of the proof (e.g., the form of the penalized likelihood or the convergence argument). Without these details the central theoretical claim cannot be evaluated.
[Abstract / simulation section] The abstract asserts that the method achieves accurate classification and prediction, but the provided description contains no information on simulation design, number of replications, error-bar reporting, or the specific baselines against which accuracy is measured. Consequently the numerical support for the accuracy claim cannot be assessed from the given material.

minor comments (1)

[Abstract] Notation for the joint distribution of QQ responses and predictors should be introduced once and used consistently; the current abstract alternates between “QQ responses” and “predictors” without a compact symbol.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the careful review and constructive feedback on our manuscript. We address each major comment below and outline the revisions we will make to strengthen the presentation of the theoretical results and simulation evidence.

read point-by-point responses

Referee: [Abstract / theoretical development] The abstract states that asymptotic optimality of classification and prediction follows from the generative framework under regularity conditions, yet the manuscript supplies neither the explicit statement of those conditions nor the key steps of the proof (e.g., the form of the penalized likelihood or the convergence argument). Without these details the central theoretical claim cannot be evaluated.

Authors: The penalized likelihood is explicitly given in Equation (5) of Section 2.2. The regularity conditions (including the irrepresentable condition, eigenvalue bounds on the covariance, and growth rates on the penalty parameter) are stated in full in Section 3.1 immediately preceding Theorem 3.2, which establishes the oracle property and asymptotic optimality of the induced classification and prediction rules. The proof proceeds by showing that the penalized estimator achieves the same rate as the oracle estimator under these conditions, with the full derivation provided in the supplementary material. To make the central claim more readily evaluable from the abstract and main text, we will add a one-sentence summary of the key conditions to the abstract and insert a short proof outline (two paragraphs) in Section 3 before the theorem statement. revision: yes
Referee: [Abstract / simulation section] The abstract asserts that the method achieves accurate classification and prediction, but the provided description contains no information on simulation design, number of replications, error-bar reporting, or the specific baselines against which accuracy is measured. Consequently the numerical support for the accuracy claim cannot be assessed from the given material.

Authors: Section 4 of the manuscript details the simulation design (high-dimensional regimes with p up to 2000, n ranging from 100 to 500, both independent and correlated predictors), uses 500 independent replications, reports results with standard-error bars, and compares against LDA, separate penalized regressions, and existing joint QQ models. We agree that the abstract is too terse on these points. We will revise the abstract to include a brief clause noting the simulation scale and the set of baselines used. revision: yes

Circularity Check

0 steps flagged

No significant circularity in derivation chain

full rationale

The paper proposes a generative joint model for QQ responses and predictors, with penalized likelihood estimation and asymptotic optimality claims explicitly conditioned on external regularity conditions rather than on quantities defined by the fitted model. No self-definitional reductions, fitted inputs renamed as predictions, or load-bearing self-citation chains appear in the abstract or described structure. The central claims remain independent of the estimation procedure itself and are presented as holding under stated assumptions that do not include the target results by construction.

Axiom & Free-Parameter Ledger

0 free parameters · 1 axioms · 0 invented entities

Only the abstract is available, so the ledger is necessarily incomplete; the main unverified premise is the set of regularity conditions required for the optimality theorem.

axioms (1)

domain assumption Regularity conditions sufficient for asymptotic optimality of classification and prediction
Invoked in the abstract to establish theoretical guarantees.

pith-pipeline@v0.9.0 · 5669 in / 1106 out tokens · 21158 ms · 2026-05-24T14:12:57.587120+00:00 · methodology

discussion (0)

Lean theorems connected to this paper

Citations machine-checked in the Pith Canon. Every link opens the source theorem in the public Lean library.

IndisputableMonolith/Foundation/RealityFromDistinction.lean reality_from_one_distinction unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

We propose a generative approach to model the joint distribution of the QQ responses and predictors... penalized likelihood... LDA classification rule... graphical lasso... asymptotic optimality... under some regularity conditions (C1)–(C7)
IndisputableMonolith/Cost/FunctionalEquation.lean washburn_uniqueness_aczel unclear

?

unclear
Relation between the paper passage and the cited Recognition theorem.

J(x) = ½(x + x⁻¹) − 1, φ-ladder, 8-tick period, c=1, ℏ, G forced from single distinction

What do these tags mean?

matches: The paper's claim is directly supported by a theorem in the formal canon.
supports: The theorem supports part of the paper's argument, but the paper may add assumptions or extra steps.
extends: The paper goes beyond the formal theorem; the theorem is a base layer rather than the whole result.
uses: The paper appears to rely on the theorem as machinery.
contradicts: The paper's claim conflicts with a theorem or certificate in the canon.
unclear: Pith found a possible connection, but the passage is too broad, indirect, or ambiguous to say the theorem truly supports the claim.

Reference graph

Works this paper leans on

49 extracted references · 49 canonical work pages

[1]

Mixed binary-continuous copula regression models with application to adverse birth outcomes

Klein N, Kneib T, Marra G, Radice R, Rokicki S, McGovern ME. Mixed binary-continuous copula regression models with application to adverse birth outcomes. Statistics in Medicine 2019;38(3):413--436

work page 2019
[2]

Regression models for a bivariate discrete and continuous outcome with clustering

Fitzmaurice GM, Laird NM. Regression models for a bivariate discrete and continuous outcome with clustering. Journal of the American statistical Association 1995;90(431):845--852

work page 1995
[3]

Generalized latent trait models

Moustaki I, Knott M. Generalized latent trait models. Psychometrika 2000;65(3):391--411

work page 2000
[4]

Bayesian latent variable models for clustered mixed outcomes

Dunson DB. Bayesian latent variable models for clustered mixed outcomes. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2000;62(2):355--366

work page 2000
[5]

A correlated probit model for joint modeling of clustered binary and continuous responses

Gueorguieva RV, Agresti A. A correlated probit model for joint modeling of clustered binary and continuous responses. Journal of the American Statistical Association 2001;96(455):1102--1112

work page 2001
[6]

Dynamic latent trait models for multidimensional longitudinal data

Dunson DB. Dynamic latent trait models for multidimensional longitudinal data. Journal of the American Statistical Association 2003;98(463):555--563

work page 2003
[7]

QQ models: Joint modeling for quantitative and qualitative quality responses in manufacturing systems

Deng X, Jin R. QQ models: Joint modeling for quantitative and qualitative quality responses in manufacturing systems. Technometrics 2015;57(3):320--331

work page 2015
[8]

Time-varying coefficient models for joint modeling binary and continuous outcomes in longitudinal data

K \"u r \"u m E, Li R, Shiffman S, Yao W. Time-varying coefficient models for joint modeling binary and continuous outcomes in longitudinal data. Statistica Sinica 2016;26(3):979--1000

work page 2016
[9]

A Bayesian hierarchical model for quantitative and qualitative responses

Kang L, Kang X, Deng X, Jin R. A Bayesian hierarchical model for quantitative and qualitative responses. Journal of Quality Technology 2018;50(3):290--308

work page 2018
[10]

Longitudinal joint modelling of binary and continuous outcomes: A comparison of bridge and normal distributions

Amini P, Verbeke G, Zayeri F, Mahjub H, Maroufizadeh S, Moghimbeigi A. Longitudinal joint modelling of binary and continuous outcomes: A comparison of bridge and normal distributions. Epidemiology, Biostatistics and Public Health 2018;15(1)

work page 2018
[11]

Regression models for mixed discrete and continuous responses with potentially missing values

Fitzmaurice GM, Laird NM. Regression models for mixed discrete and continuous responses with potentially missing values. Biometrics 1997;53(1):110--122

work page 1997
[12]

Joint regression analysis of correlated data using Gaussian copulas

Song PXK, Li M, Yuan Y. Joint regression analysis of correlated data using Gaussian copulas. Biometrics 2009;65(1):60--68

work page 2009
[13]

Association models for clustered data with binary and continuous responses

Lin L, Bandyopadhyay D, Lipsitz SR, Sinha D. Association models for clustered data with binary and continuous responses. Biometrics 2010;66(1):287--293

work page 2010
[14]

Selection and estimation for mixed graphical models

Chen S, Witten DM, Shojaie A. Selection and estimation for mixed graphical models. Biometrika 2014;102(1):47--64

work page 2014
[15]

Mixed graphical models via exponential families

Yang E, Baker Y, Ravikumar P, Allen G, Liu Z. Mixed graphical models via exponential families. Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics 2014;33:1042--1050

work page 2014
[16]

A semiparametric Bayesian joint model for multiple mixed-type outcomes: an application to acute myocardial infarction

Guglielmi A, Ieva F, Paganoni AM, Quintana FA. A semiparametric Bayesian joint model for multiple mixed-type outcomes: an application to acute myocardial infarction. Advances in Data Analysis and Classification 2018;12(2):399--423

work page 2018
[17]

Latent variable models for mixed eiscrete and continuous outcomes

Sammel MD, Ryan LM, Legler JM. Latent variable models for mixed eiscrete and continuous outcomes. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 1997;59(3):667--678

work page 1997
[18]

Bayesian latent variable models for mixed discrete outcomes

Dunson DB, Herring AH. Bayesian latent variable models for mixed discrete outcomes. Biostatistics 2005;6(1):11--25

work page 2005
[19]

Hierarchical Bayesian modeling of heterogeneous clusterand subject-level associations between continuous and binary outcomes in dairy production

Bello NM, Steibel JP, Tempelman RJ. Hierarchical Bayesian modeling of heterogeneous clusterand subject-level associations between continuous and binary outcomes in dairy production. Biometrical Journal 2012;54(2):230--248

work page 2012
[20]

Sparse linear discriminant analysis by thresholding for high dimensional data

Shao J, Wang Y, Deng X, Wang S. Sparse linear discriminant analysis by thresholding for high dimensional data. Annals of Statistics 2011;39(2):1241--1265

work page 2011
[21]

On model selection consistency of Lasso

Zhao P, Yu B. On model selection consistency of Lasso. Journal of Machine Learning Research 2006;7(12):2541--2563

work page 2006
[22]

A direct estimation approach to sparse linear discriminant analysis

Cai T, Liu W. A direct estimation approach to sparse linear discriminant analysis. Journal of the American Statistical Association 2012;106:1566--1577

work page 2012
[23]

Model selection and estimation in the Gaussian graphical model

Yuan M, Lin Y. Model selection and estimation in the Gaussian graphical model. Biometrika 2007;94(1):19--35

work page 2007
[24]

Large Gaussian covariance matrix estimation with Markov structures

Deng X, Yuan M. Large Gaussian covariance matrix estimation with Markov structures. Journal of Computational and Graphical Statistics 2009;18(3):640--657

work page 2009
[25]

Regression shrinkage and selection via the lasso

Tibshirani R. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 1996;58(1):267--288

work page 1996
[26]

Sparse inverse covariance estimation with the graphical lasso

Friedman J, Hastie T, Tibshirani R. Sparse inverse covariance estimation with the graphical lasso. Biostatistics 2008;9(3):432--441

work page 2008
[27]

Sparsistency and rates of convergence in large covariance matrix estimation

Lam C, Fan J. Sparsistency and rates of convergence in large covariance matrix estimation. Annals of Statistics 2009;37(6B):4254--4278

work page 2009
[28]

Model Selection in Gaussian Graphical Models: High-Dimensional Consistency of l1-regularized MLE

Raskutti G, Yu B, Wainwright MJ, Ravikumar P. Model Selection in Gaussian Graphical Models: High-Dimensional Consistency of l1-regularized MLE. Advances in Neural Information Processing Systems 2008;21:1329--1336

work page 2008
[29]

Minimax estimation of large precision matrices with bandable Cholesky factor

Liu Y, Ren Z, et al. Minimax estimation of large precision matrices with bandable Cholesky factor. Annals of Statistics 2020;48(4):2428--2454

work page 2020
[30]

Tuning parameter selectors for the smoothly clipped absolute deviation method

Wang H, Li R, Tsai CL. Tuning parameter selectors for the smoothly clipped absolute deviation method. Biometrika 2007;94(3):553--568

work page 2007
[31]

On the adaptive elastic-net with a diverging number of parameters

Zou H, Zhang H. On the adaptive elastic-net with a diverging number of parameters. Annals of Statistics 2009;37(4):1733--1751

work page 2009
[32]

A unified approach to model selection and sparse recovery using regularized least squares

Lv J, Fan Y. A unified approach to model selection and sparse recovery using regularized least squares. Annals of Statistics 2009;37(6A):3498--3528

work page 2009
[33]

Generalized double pareto shrinkage

Armagan A, Dunson DB, Lee J. Generalized double pareto shrinkage. Statistica Sinica 2013;23(1):119--143

work page 2013
[34]

Covariance regularization by thresholding

Bickel PJ, Levina E. Covariance regularization by thresholding. Annals of Statistics 2008;36(6):2577--2604

work page 2008
[35]

Sparse permutation invariant covariance estimation

Rothman AJ, Bickel PJ, Levina E, Zhu J, et al. Sparse permutation invariant covariance estimation. Electronic Journal of Statistics 2008;2:494--515

work page 2008
[36]

Statistics for High-Dimensional Data

B\" u hlmann P, Van De Geer S. Statistics for High-Dimensional Data. Verlag Berlin Heidelberg: Springer; 2011

work page 2011
[37]

Regularization and variable selection via the elastic net

Zou H, Hastie T. Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2005;67(2):301--320

work page 2005
[38]

On variable ordination of Cholesky-based estimation for a sparse covariance matrix

Kang X, Deng X. On variable ordination of Cholesky-based estimation for a sparse covariance matrix. Canadian Journal of Statistics 2020;in press

work page 2020
[39]

Sparse estimation of a covariance matrix

Bien J, Tibshirani RJ. Sparse estimation of a covariance matrix. Biometrika 2011;98(4):807--820

work page 2011
[40]

Penalized classification using Fisher's linear discriminant

Witten DM, Tibshirani R. Penalized classification using Fisher's linear discriminant. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2011;73(5):753--772

work page 2011
[41]

Sparse Discriminant Analysis

Clemmensen L, Hastie T, Witten D, Ersb ll B. Sparse Discriminant Analysis. Technometrics 2011;53(4):406--413

work page 2011
[42]

Observation of unusual topological surface states in half-Heusler compounds LnPtBi (Ln=Lu,Y)

Liu Z, Yang L, Wu SC, Shekhar C, Jiang J, Yang H, et al. Observation of unusual topological surface states in half-Heusler compounds LnPtBi (Ln=Lu,Y). Nature Communications 2016;7(1):1--7

work page 2016
[43]

Commentary: The Materials Project: A materials genome approach to accelerating materials innovation

Jain A, Ong SP, Hautier G, Chen W, Richards WD, Dacek S, et al. Commentary: The Materials Project: A materials genome approach to accelerating materials innovation. Apl Materials 2013;1(1):011002

work page 2013
[44]

Materials design and discovery with high-throughput density functional theory: the open quantum materials database (OQMD)

Saal JE, Kirklin S, Aykol M, Meredig B, Wolverton C. Materials design and discovery with high-throughput density functional theory: the open quantum materials database (OQMD). Jom 2013;65(11):1501--1509

work page 2013
[45]

AFLOW: an automatic framework for high-throughput materials discovery

Curtarolo S, Setyawan W, Hart GL, Jahnatek M, Chepulskii RV, Taylor RH, et al. AFLOW: an automatic framework for high-throughput materials discovery. Computational Materials Science 2012;58:218--226

work page 2012
[46]

Molecular classification of Crohn's disease and ulcerative colitis patients using transcriptional profiles in peripheral blood mononuclear cells

Burczynski ME, Peterson RL, Twine NC, Zuberek KA, Brodeur BJ, Casciotti L, et al. Molecular classification of Crohn's disease and ulcerative colitis patients using transcriptional profiles in peripheral blood mononuclear cells. The Journal of Molecular Diagnostics 2006;8(1):51--61

work page 2006
[47]

Sparse quadratic discriminant analysis for high dimensional data

Li Q, Shao J. Sparse quadratic discriminant analysis for high dimensional data. Statistica Sinica 2015;25:457--473

work page 2015
[48]

Joint analysis of semicontinuous data with latent variables

Wang X, Feng X, Song X. Joint analysis of semicontinuous data with latent variables. Computational Statistics and Data Analysis 2020;p. 107005

work page 2020
[49]

write newline

" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION word.in bbl.in capitalize ":" * " " *...

work page

[1] [1]

Mixed binary-continuous copula regression models with application to adverse birth outcomes

Klein N, Kneib T, Marra G, Radice R, Rokicki S, McGovern ME. Mixed binary-continuous copula regression models with application to adverse birth outcomes. Statistics in Medicine 2019;38(3):413--436

work page 2019

[2] [2]

Regression models for a bivariate discrete and continuous outcome with clustering

Fitzmaurice GM, Laird NM. Regression models for a bivariate discrete and continuous outcome with clustering. Journal of the American statistical Association 1995;90(431):845--852

work page 1995

[3] [3]

Generalized latent trait models

Moustaki I, Knott M. Generalized latent trait models. Psychometrika 2000;65(3):391--411

work page 2000

[4] [4]

Bayesian latent variable models for clustered mixed outcomes

Dunson DB. Bayesian latent variable models for clustered mixed outcomes. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2000;62(2):355--366

work page 2000

[5] [5]

A correlated probit model for joint modeling of clustered binary and continuous responses

Gueorguieva RV, Agresti A. A correlated probit model for joint modeling of clustered binary and continuous responses. Journal of the American Statistical Association 2001;96(455):1102--1112

work page 2001

[6] [6]

Dynamic latent trait models for multidimensional longitudinal data

Dunson DB. Dynamic latent trait models for multidimensional longitudinal data. Journal of the American Statistical Association 2003;98(463):555--563

work page 2003

[7] [7]

QQ models: Joint modeling for quantitative and qualitative quality responses in manufacturing systems

Deng X, Jin R. QQ models: Joint modeling for quantitative and qualitative quality responses in manufacturing systems. Technometrics 2015;57(3):320--331

work page 2015

[8] [8]

Time-varying coefficient models for joint modeling binary and continuous outcomes in longitudinal data

K \"u r \"u m E, Li R, Shiffman S, Yao W. Time-varying coefficient models for joint modeling binary and continuous outcomes in longitudinal data. Statistica Sinica 2016;26(3):979--1000

work page 2016

[9] [9]

A Bayesian hierarchical model for quantitative and qualitative responses

Kang L, Kang X, Deng X, Jin R. A Bayesian hierarchical model for quantitative and qualitative responses. Journal of Quality Technology 2018;50(3):290--308

work page 2018

[10] [10]

Longitudinal joint modelling of binary and continuous outcomes: A comparison of bridge and normal distributions

Amini P, Verbeke G, Zayeri F, Mahjub H, Maroufizadeh S, Moghimbeigi A. Longitudinal joint modelling of binary and continuous outcomes: A comparison of bridge and normal distributions. Epidemiology, Biostatistics and Public Health 2018;15(1)

work page 2018

[11] [11]

Regression models for mixed discrete and continuous responses with potentially missing values

Fitzmaurice GM, Laird NM. Regression models for mixed discrete and continuous responses with potentially missing values. Biometrics 1997;53(1):110--122

work page 1997

[12] [12]

Joint regression analysis of correlated data using Gaussian copulas

Song PXK, Li M, Yuan Y. Joint regression analysis of correlated data using Gaussian copulas. Biometrics 2009;65(1):60--68

work page 2009

[13] [13]

Association models for clustered data with binary and continuous responses

Lin L, Bandyopadhyay D, Lipsitz SR, Sinha D. Association models for clustered data with binary and continuous responses. Biometrics 2010;66(1):287--293

work page 2010

[14] [14]

Selection and estimation for mixed graphical models

Chen S, Witten DM, Shojaie A. Selection and estimation for mixed graphical models. Biometrika 2014;102(1):47--64

work page 2014

[15] [15]

Mixed graphical models via exponential families

Yang E, Baker Y, Ravikumar P, Allen G, Liu Z. Mixed graphical models via exponential families. Proceedings of the Seventeenth International Conference on Artificial Intelligence and Statistics 2014;33:1042--1050

work page 2014

[16] [16]

A semiparametric Bayesian joint model for multiple mixed-type outcomes: an application to acute myocardial infarction

Guglielmi A, Ieva F, Paganoni AM, Quintana FA. A semiparametric Bayesian joint model for multiple mixed-type outcomes: an application to acute myocardial infarction. Advances in Data Analysis and Classification 2018;12(2):399--423

work page 2018

[17] [17]

Latent variable models for mixed eiscrete and continuous outcomes

Sammel MD, Ryan LM, Legler JM. Latent variable models for mixed eiscrete and continuous outcomes. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 1997;59(3):667--678

work page 1997

[18] [18]

Bayesian latent variable models for mixed discrete outcomes

Dunson DB, Herring AH. Bayesian latent variable models for mixed discrete outcomes. Biostatistics 2005;6(1):11--25

work page 2005

[19] [19]

Hierarchical Bayesian modeling of heterogeneous clusterand subject-level associations between continuous and binary outcomes in dairy production

Bello NM, Steibel JP, Tempelman RJ. Hierarchical Bayesian modeling of heterogeneous clusterand subject-level associations between continuous and binary outcomes in dairy production. Biometrical Journal 2012;54(2):230--248

work page 2012

[20] [20]

Sparse linear discriminant analysis by thresholding for high dimensional data

Shao J, Wang Y, Deng X, Wang S. Sparse linear discriminant analysis by thresholding for high dimensional data. Annals of Statistics 2011;39(2):1241--1265

work page 2011

[21] [21]

On model selection consistency of Lasso

Zhao P, Yu B. On model selection consistency of Lasso. Journal of Machine Learning Research 2006;7(12):2541--2563

work page 2006

[22] [22]

A direct estimation approach to sparse linear discriminant analysis

Cai T, Liu W. A direct estimation approach to sparse linear discriminant analysis. Journal of the American Statistical Association 2012;106:1566--1577

work page 2012

[23] [23]

Model selection and estimation in the Gaussian graphical model

Yuan M, Lin Y. Model selection and estimation in the Gaussian graphical model. Biometrika 2007;94(1):19--35

work page 2007

[24] [24]

Large Gaussian covariance matrix estimation with Markov structures

Deng X, Yuan M. Large Gaussian covariance matrix estimation with Markov structures. Journal of Computational and Graphical Statistics 2009;18(3):640--657

work page 2009

[25] [25]

Regression shrinkage and selection via the lasso

Tibshirani R. Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 1996;58(1):267--288

work page 1996

[26] [26]

Sparse inverse covariance estimation with the graphical lasso

Friedman J, Hastie T, Tibshirani R. Sparse inverse covariance estimation with the graphical lasso. Biostatistics 2008;9(3):432--441

work page 2008

[27] [27]

Sparsistency and rates of convergence in large covariance matrix estimation

Lam C, Fan J. Sparsistency and rates of convergence in large covariance matrix estimation. Annals of Statistics 2009;37(6B):4254--4278

work page 2009

[28] [28]

Model Selection in Gaussian Graphical Models: High-Dimensional Consistency of l1-regularized MLE

Raskutti G, Yu B, Wainwright MJ, Ravikumar P. Model Selection in Gaussian Graphical Models: High-Dimensional Consistency of l1-regularized MLE. Advances in Neural Information Processing Systems 2008;21:1329--1336

work page 2008

[29] [29]

Minimax estimation of large precision matrices with bandable Cholesky factor

Liu Y, Ren Z, et al. Minimax estimation of large precision matrices with bandable Cholesky factor. Annals of Statistics 2020;48(4):2428--2454

work page 2020

[30] [30]

Tuning parameter selectors for the smoothly clipped absolute deviation method

Wang H, Li R, Tsai CL. Tuning parameter selectors for the smoothly clipped absolute deviation method. Biometrika 2007;94(3):553--568

work page 2007

[31] [31]

On the adaptive elastic-net with a diverging number of parameters

Zou H, Zhang H. On the adaptive elastic-net with a diverging number of parameters. Annals of Statistics 2009;37(4):1733--1751

work page 2009

[32] [32]

A unified approach to model selection and sparse recovery using regularized least squares

Lv J, Fan Y. A unified approach to model selection and sparse recovery using regularized least squares. Annals of Statistics 2009;37(6A):3498--3528

work page 2009

[33] [33]

Generalized double pareto shrinkage

Armagan A, Dunson DB, Lee J. Generalized double pareto shrinkage. Statistica Sinica 2013;23(1):119--143

work page 2013

[34] [34]

Covariance regularization by thresholding

Bickel PJ, Levina E. Covariance regularization by thresholding. Annals of Statistics 2008;36(6):2577--2604

work page 2008

[35] [35]

Sparse permutation invariant covariance estimation

Rothman AJ, Bickel PJ, Levina E, Zhu J, et al. Sparse permutation invariant covariance estimation. Electronic Journal of Statistics 2008;2:494--515

work page 2008

[36] [36]

Statistics for High-Dimensional Data

B\" u hlmann P, Van De Geer S. Statistics for High-Dimensional Data. Verlag Berlin Heidelberg: Springer; 2011

work page 2011

[37] [37]

Regularization and variable selection via the elastic net

Zou H, Hastie T. Regularization and variable selection via the elastic net. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2005;67(2):301--320

work page 2005

[38] [38]

On variable ordination of Cholesky-based estimation for a sparse covariance matrix

Kang X, Deng X. On variable ordination of Cholesky-based estimation for a sparse covariance matrix. Canadian Journal of Statistics 2020;in press

work page 2020

[39] [39]

Sparse estimation of a covariance matrix

Bien J, Tibshirani RJ. Sparse estimation of a covariance matrix. Biometrika 2011;98(4):807--820

work page 2011

[40] [40]

Penalized classification using Fisher's linear discriminant

Witten DM, Tibshirani R. Penalized classification using Fisher's linear discriminant. Journal of the Royal Statistical Society: Series B (Statistical Methodology) 2011;73(5):753--772

work page 2011

[41] [41]

Sparse Discriminant Analysis

Clemmensen L, Hastie T, Witten D, Ersb ll B. Sparse Discriminant Analysis. Technometrics 2011;53(4):406--413

work page 2011

[42] [42]

Observation of unusual topological surface states in half-Heusler compounds LnPtBi (Ln=Lu,Y)

Liu Z, Yang L, Wu SC, Shekhar C, Jiang J, Yang H, et al. Observation of unusual topological surface states in half-Heusler compounds LnPtBi (Ln=Lu,Y). Nature Communications 2016;7(1):1--7

work page 2016

[43] [43]

Commentary: The Materials Project: A materials genome approach to accelerating materials innovation

Jain A, Ong SP, Hautier G, Chen W, Richards WD, Dacek S, et al. Commentary: The Materials Project: A materials genome approach to accelerating materials innovation. Apl Materials 2013;1(1):011002

work page 2013

[44] [44]

Materials design and discovery with high-throughput density functional theory: the open quantum materials database (OQMD)

Saal JE, Kirklin S, Aykol M, Meredig B, Wolverton C. Materials design and discovery with high-throughput density functional theory: the open quantum materials database (OQMD). Jom 2013;65(11):1501--1509

work page 2013

[45] [45]

AFLOW: an automatic framework for high-throughput materials discovery

Curtarolo S, Setyawan W, Hart GL, Jahnatek M, Chepulskii RV, Taylor RH, et al. AFLOW: an automatic framework for high-throughput materials discovery. Computational Materials Science 2012;58:218--226

work page 2012

[46] [46]

Molecular classification of Crohn's disease and ulcerative colitis patients using transcriptional profiles in peripheral blood mononuclear cells

Burczynski ME, Peterson RL, Twine NC, Zuberek KA, Brodeur BJ, Casciotti L, et al. Molecular classification of Crohn's disease and ulcerative colitis patients using transcriptional profiles in peripheral blood mononuclear cells. The Journal of Molecular Diagnostics 2006;8(1):51--61

work page 2006

[47] [47]

Sparse quadratic discriminant analysis for high dimensional data

Li Q, Shao J. Sparse quadratic discriminant analysis for high dimensional data. Statistica Sinica 2015;25:457--473

work page 2015

[48] [48]

Joint analysis of semicontinuous data with latent variables

Wang X, Feng X, Song X. Joint analysis of semicontinuous data with latent variables. Computational Statistics and Data Analysis 2020;p. 107005

work page 2020

[49] [49]

write newline

" write newline "" before.all 'output.state := FUNCTION n.dashify 't := "" t empty not t #1 #1 substring "-" = t #1 #2 substring "--" = not "--" * t #2 global.max substring 't := t #1 #1 substring "-" = "-" * t #2 global.max substring 't := while if t #1 #1 substring * t #2 global.max substring 't := if while FUNCTION word.in bbl.in capitalize ":" * " " *...

work page