STRUCTURAL EQUATION MODELING, 14(1), 26–47 Copyright © 2007, Lawrence Erlbaum Associates, Inc. Performance of Factor Mixture Models as a Function of Model Size, Covariate Effects, and Class-Specific Parameters Gitta Lubke University of Notre Dame Bengt O. Muthén University of California, Los Angeles Factor mixture models are designed for the analysis of multivariate data obtained from a population consisting of distinct latent classes. A common factor model is assumed to hold within each of the latent classes. Factor mixture modeling involves obtaining estimates of the model parameters, and may also be used to assign subjects to their most likely latent class. This simulation study investigates aspects of model performance such as parameter coverage and correct class membership assignment and focuses on covariate effects, model size, and class-specific versus class-invariant parameters. When fitting true models, parameter coverage is good for most parameters even for the smallest class separation investigated in this study (0.5 SD between 2 classes). The same holds for convergence rates. Correct class assignment is unsatisfactory for the small class separation without covariates, but improves dramatically with increasing separation, covariate effects, or both. Model performance is not influenced by the differences in model size investigated here. Class-specific parameters may improve some aspects of model performance but negatively affect other aspects. Factor mixture models combine latent class analysis and common factor analysis. Factor mixture models are designed for data from possibly heterogenous populations consisting of several latent classes, and are an adequate choice if it is reasonable to assume that observed variables within each class can be modeled using a common factor model. There are two types of latent variables in factor mixture Correspondence should be addressed to Gitta Lubke, Department of Psychology, 118 Haggar Hall, University of Notre Dame, Notre Dame, IN 46556. E-mail: [email protected] PERFORMANCE OF FACTOR MIXTURE MODELS 27 models, the categorical latent class variable indicating the class membership of each subject, and the continuous latent factor(s) representing the common content of the observed variables. It is not observed to which class a subject belongs, hence class membership is latent, and it is not observed how a subject scores on the factors underlying the observed variables, hence factors are latent. Factor mixture modeling can be used to assign subjects to their most likely class and to obtain estimates of the model parameters. Subject assignment can be the primary objective of a study, for instance when the interest is to establish subpopulations representing different types of risk behavior. Detection of different risk behavior classes can serve to investigate the effectiveness of a specific prevention program for members of a high-risk class. On the other hand, a study may aim mainly at the comparison of classes with respect to model parameters such as factor means in growth mixture models, or class-specific effects of covariates. Factor mixture models are used increasingly in empirical research, both for class assignment (e.g., Neuman et al., 1999; Rasmussen et al., 2002) and class comparisons (e.g., Duncan, Duncan, & Stryker, 2000; McArdle & Epstein, 1987; B. O. Muthén & Muthén, 2000; Nagin & Tremblay, 2001). The primary goal of this study is to assess the performance of these two aspects of factor mixture modeling and establish the conditions under which applied researchers can expect factor mixture models to perform properly. It is investigated to what extent covariate effects, model size, or presence of class-specific model parameters such as class-specific factor loadings can influence model performance. Because separation between classes is known to affect model performance, the study is carried out at different levels of separation. Effects of class separation on model performance that are due to covariate effects are distinguished from class separation effects that are due to differences between classes regarding factor means or intercepts of observed variables within class. A large body of research has focused on establishing the correct number of classes and on developing or assessing indexes that may serve to compare alternative models (Bamber & Santen, 2000; Jedidi, Ramaswarmy, DeSarbo, & Wedel, 1996; Nagin, 1999; see also Everitt & Hand, 1981; McLachlan & Peel, 2000, for in-depth discussions, and Lubke & Neale, 2005, for some simulation results). In this study the focus is not on finding the most adequate model for a given data set; rather, the performance of factor mixture models when true models are fitted to artificial data is considered. The objective is to quantify the proportion of subjects that are correctly assigned to their true class, and the coverage of true parameter values as a function of increasing covariate effects and class separation. Because true models are fitted, these proportions and average rates provide an upper bound of what might be expected in an empirical study in which fitted models are simplifications of the true data generating process. The quantification of model performance is done for different factor mixture models under varying conditions. Because models are fitted to artificial data, deviations from true values can be computed. 28 LUBKE AND MUTHÉN The data generating models are the latent profile model (Lazarsfeld & Henry, 1968)1 and factor mixture models with one, two, or three within-class factors and 8 or 12 observed variables. The models differ with respect to their size and complexity. The within-class structure of the latent profile model is relatively simple in that observed variables within class are assumed to be independent conditional on class membership (i.e., local independence), hence all covariances within class are zero. The one-, two-, and three-factor models require the estimation of an increasingly complex latent structure within class. Class separation is measured in terms of the multivariate Mahalanobis distance (MD) between two classes. Although it is evident that increasing class separation will improve correct class assignment, it is not known how far classes have to be separated to assign subjects to their true class with an acceptably high probability, or to recover true model parameters such as factor mean differences between classes. This study includes the evaluation of model performance for smaller class separation than has been investigated so far in the context of factor mixture analysis. Increasing effects of a continuous covariate that predicts class membership are investigated. Class-predicting covariates increase class separation when considering a multivariate distance computed for covariates and observed within-class variables. To distinguish between class separation due to covariate effects and class separation due to factor mean or intercept differences, three multivariate MDs are computed, namely (a) using only the means and covariances of the observed variables within class, (b) using the observed variables and the covariates jointly, or (c) using only the covariates. Larger effects of covariates on class membership result in a larger MD when computed for observed variables and covariates jointly.2 Class-invariant versus specific factor covariances, and measurement invariance versus noninvariance concern the restrictiveness of the within-class model. More restrictive models require fewer parameters to be estimated, which may ease model estimation. However, the less restrictive models have more class-specific parameters, which may be advantageous to distinguish between classes. The focus of this simulation study is showing what an empirical researcher can expect regarding correct class assignment and parameter coverage under a variety of conditions. Correct class assignment is assessed as the proportion of subjects assigned to the true class based on the highest posterior class probability (i.e., modal assignment). Coverage of true parameter values is computed for a given number of replications as the proportion of replications for which the 95% confidence interval of the estimated parameter covers the true parameter. We also evaluate whether entropy (see, e.g., Muthén & Muthén, 2001) is a suitable indicator of correct class 1The latent profile model can be considered as a factor mixture model with zero factors, or with factors with zero variance. 2The MD between two classes that is used in this study equals M = ( - )t -1( - ) . 1 2 ⌺ 1 2 PERFORMANCE OF FACTOR MIXTURE MODELS 29 assignment. Entropy is closely related to average class probabilities, which may provide an indication of how problematic or unproblematic class assignment is in a given analysis. Intuitively, it seems that a researcher can be more confident about the results of, say, a two-class model, if subjects are assigned to either one of the classes with a probability close to unity than if class probabilities are close to .5. Finally, convergence rates are presented for all models. MODEL SPECIFICATION Slightly different specifications of factor mixture models have been proposed in the literature (Arminger, Stein, & Wittenberg, 1999; Dolan & van der Maas, 1998; Jedidi, Jagpal, & DeSarbo, 1997; McLachlan & Peel, 2000; Vermunt & Magidson, 2002; Yung, 1997). Here, we largely follow the model description in Muthén and Shedden (1999; for a less technical account, see Lubke & Muthén, 2005). Considered are factor mixture models with a single categorical latent variable and continuous outcomes within class.3 General Model Description The factor mixture model consists of several different regressions. First, observed continuous variables within class are regressed on the within-class factors and on covariates. Second, the within-class factors are regressed on the latent class variable and on covariates. Third, the class variable is regressed on covariates. Class membership can also function as a predictor of observed outcome variables, but this option is not considered in this article. Regression of observed variables on factors and covariates. Throughout, subscript i, where i = 1, … , I, is used as a subject index. Following commonly used notation we denote the J = 1, … J observed continuous variables as Y, and the scores of subject i on l = 1, … L underlying continuous factors as il. Factors are assumed to be multivariate normally distributed conditional on class membership and covariates. The J × 1 vector contains the regression intercepts, ⌳y is the J × L matrix of factor loadings, and εi are the errors of subject i. Errors are assumed to be normally distributed with zero mean, and are assumed to have zero covariances with factors or other errors. Covariates can be continuous or categorical, and are 3The factor mixture model that can be estimated in the current version of Mplus (L. K. Muthén & Muthén, 2004) is more general in that it allows more class-specific parameters than the model described in the next section, mixed categorical or continuous observed variables within class, and more than a single latent class variable. 30 LUBKE AND MUTHÉN denoted as X. The regression weights of Y on X are contained in ⌫y. The linear regression of Y on and X is expressed as y i = + ⌳ y i + ⌫ yx i + ε i. (1) Regression of factors on class and covariates. The latent class variable is a multinomial variable. For k = 1, … , K latent classes, we have ìï1 if subject i belongs to class k cik = ïí ïïî0 otherwise. (2) The factors are regressed on the class variable and on covariates X, which can be expressed for subject i as ηi = Ac i + ⌫ ηx i + i. (3) The residual factor scores of subject i, that is, the part of the factor that is not explained by class membership or by the covariates, is denoted as i. These are multivariate normally distributed with zero means within each class. ⌫η contains the regression weights of the factors on the covariates. A has dimensions L × K and contains possibly class-specific regression intercepts. If regression intercepts in Equation 1 are estimated, one column of A has to be fixed, which is equivalent to the restriction in multigroup analysis where the factor means of one of the groups have to be fixed in case intercepts are estimated (Sörbom, 1974). Taking expectations, one can see that conditional on X, the other columns of A contain the factor mean differences with respect to the arbitrarily chosen reference class. Regression of the latent class variable on covariates. The third part is a multinomial logistic regression where X predicts the log odds of the probability of belonging to class k as compared to the probability of belonging to an arbitrarily chosen reference class, say the Kth class. The third part of the model is denoted as é P (cik = 1 | x i ) ù ú = λ ck + ⌫ ck x i . ln ê êë P(ciK = 1 | x i ) úû (4) ⌫ck contains the regression weights, and the regression intercept is denoted as λck. Conditional on class membership and covariates X, observed variables Y are multivariate normally distributed. Class membership is considered missing, and an Expectation-Maximization (EM)-type algorithm can be used to estimate the model conditional on covariates X. In the E step, the posterior probabilities of the subjects of belonging to each of the K classes is computed given the parameter estimates. In the M step, estimates of the model parameters are updated given the pos- PERFORMANCE OF FACTOR MIXTURE MODELS 31 terior class probabilities of the subjects. Approaches to estimate mixture models are described in McLachlan and Peel (2000). This general model encompasses several well-known models as submodels. Fixing within-class factor variances to zero results in local independence of observed variables within class, which is the main characteristic of classic latent class models. Fixing the number of classes to 1 results in common factor models. The submodels used in this study are described in more detail in the next section. SIMULATION STUDY DESIGN The study has three parts. In Part 1, the effects of model size and model complexity are investigated. Part 2 focuses on covariate effects and class separation, and Part 3 aims at evaluating the effects of different class-specific parameters. Throughout the simulation, two-class models are considered, K = 2. There are no direct effects of covariates on the observed variables or covariate effects on factor scores, consequently the matrices ⌫y and ⌫η contain zeros. The covariate effects investigated in Part 2 concern the prediction of class membership. All parameter values used for the data generation are shown in the Appendix. Part 1: Model Size and Complexity Four models are compared in Part 1, namely the latent profile model, and one-, two-, and three-factor models. The latent profile model can be represented as a special case of the factor mixture model where residual factor scores (see Equation 3) have zero variance. As a result, the covariance matrix of observed variables Y conditional on class membership equals the covariance matrix of the residuals ε (see Equation 1). Because residuals are assumed to have zero covariances (discussed earlier), the conditional covariance matrix of observed variables is diagonal, and observed variables are independent given class. The latent profile model used for data generation has eight observed variables. The means of the observed variables are class specific, and are used to manipulate class separation. Residual variances are class invariant. The single-, two-, and three-factor models in this simulation differ with respect to the number of observed variables. The single-factor and two-factor models have 8 observed indicators, whereas the three-factor model has 12 indicators. All factor models in Part 1 are measurement invariant; that is, intercepts, loadings, and residual variances are class invariant (Lubke, Dolan, Kelderman, & Mellenbergh, 2003; Meredith, 1993). In both classes, factor variances and covariances (if present) equal unity and 0.5, respectively, which results in class-invariant covariance matrices of observed variables Y. The analysis of normal mixtures with class-invariant covariance matrices may be less prone to problems related to multiple singularities 32 LUBKE AND MUTHÉN of the likelihood. It is known that the likelihood surface of a normal mixture with class-specific covariance matrices may have many singularities (McLachlan & Peel, 2000). This can affect convergence rates and increases the risk of incorrect solutions corresponding to a local instead of the global maximum. Model performance in Part 1 is compared to the analysis data with class-specific covariance matrices covered in Part 3. Data are generated under these four models for two levels of class separation. The MD when computed using the model implied within class mean vectors and covariance matrices equals 1 and 1.5, respectively (MD is explained in more detail in the next section). None of the models in Part 1 has covariate effects. Part 2: Mahalanobis Distance, Class Separation, and Covariate Effects In Part 2, class separation and covariate effects are investigated in more detail. The separation of the two classes is measured by the multivariate MD. The MD between two points, say P1 with coordinates (a1, b1) and P2 with coordinates (a2, b2), where A and B are uncorrelated and have unit variance, is equal to the squared standardized Euclidian distance between the two points. The MD is a distance that not only takes variances but also covariances between variables into account. Everything else held constant, the MD between two points decreases with increasing covariation of the variables involved. Because the MD takes (co)variance(s) into account, factor mean differences, intercept differences, and also the reliability of observed variables can affect this distance. Figure 1 shows the relation among the standardized factor mean difference, reliability, and the MD for a single-factor model with five indicators. Compared to standardized factor mean differences, commonly observed reliability variation (i.e., between 0.4 and 0.8) has only a limited impact on the MD. The MD is largely determined by the standardized factor mean differences between the classes. Note that an MD can be computed using model-implied means and covariances. The MD when based on model-implied means and covariances is denoted as MDi, which does not vary across MC replications for a given model, and therefore characterizes different types of data sets. An MD can also be computed using observed means and covariances. This distance is denoted as MD0. Contrary to MDi, MD0 varies across data sets that are generated under a given model due to random variation of factor scores and residual scores. Hence, MD0 characterizes a single data set, and can be used to predict aspects of model performance that vary across MC replications for a given model (e.g., correct class assignment). Furthermore, MDs can be computed (a) only for the observed within-class variables Y, (b) only for the covariates X, or (c) for Y and X jointly. The data-generating model in Part 2 is a single-factor/two-class model with eight observed variables and four levels of intercept differences between classes. PERFORMANCE OF FACTOR MIXTURE MODELS FIGURE 1 33 Relation between standardized factor mean difference, reliability, and MD. The MDi is computed for the observed variables Y are 0.5, 1.0, 1.5, and 2.0, respectively. A single normally distributed covariate predicts class membership. The four levels of intercept differences are combined with four effect sizes of the covariate, leading to 16 different types of data. The effect sizes of the covariate measured in terms of MDi computed for the covariate alone are 0, .5, 1, and 1.5, respectively. Part 3: Class-Specific Parameters Part 3 focuses on class-specific parameters. Parameter coverage of class-specific parameters is an important aspect of model performance, especially when investigating differences in the factor structure of a given test across latent classes. If intercepts, loadings, or residual variances differ across classes, measurement invariance is violated (Lubke & Dolan, 2003; Lubke et al., 2003; Meredith, 1993; Widaman & Reise, 1997). Class-specific factor covariances do not violate measurement invariance, but detection of this type of class difference can be important conceptually when characterizing the different classes. Note that class-specific factor loadings, residual variances, and factor covariances lead to class-specific covariance matrices of the observed variables Y, which may increase problems related to multiple singularities such as convergence on local maxima or nonconvergence. To minimize this risk, different sets of random starting values are used (see later). The data-generating model in Part 3 is a two-factor/two-class model. Four different types of data are generated: (a) data with class-specific intercepts, (b) data with class-specific loadings, (c) data with class-specific residual variances, and (d) 34 LUBKE AND MUTHÉN data with class-specific factor covariances. In the latter manipulation, the covariance between the two factors is close to zero in one of the classes. Data are generated under these four models with the same two levels of MDi as in Part 1. As in Part 1, no covariates are included. In addition to fitting the true models, we also fit a model with fixed factor loadings to the first type of data (class-specific intercepts). Hence, the fitted factor structure of this model is similar to a simple linear growth mixture model where the factor loadings are also fixed. Fixing parameters may increase model performance. Data Generation and Measures of Model Performance For all models, 100 data sets (i.e., Monte Carlo [MC] replications) are generated using Equations 1 through 4.4 The number of subjects for each of the 100 replications is 300, with prior class probabilities for the two classes equaling 0.5.5 The 100 replications differ with respect to the random variables; that is, the subjects’ scores on the latent class variable, c, the residual factor scores, , the regression residuals, ε, and the covariates X (if included). The model parameters corresponding to the within-class model (i.e., the regression of observed variables on the factors, and factors on the latent class variable, Equations 1 and 3) are fixed for each set of 100 MC replications. The specific parameter values used for the data generation are listed in the Appendix. There are four main measures of model performance in this simulation study. The first measure is coverage of true parameter values, which is derived as follows. For each individual MC replication, 95% confidence intervals are computed using parameter estimates and their maximum likelihood standard errors. Our measure of coverage is the proportion of MC replications for which the confidence interval around the estimated value covers the true value. Throughout, coverage is reported for the different parameter matrices separately. If a parameter matrix contains more than a single parameter, average coverage for that matrix is reported. Second, we consider the proportion of subjects that are assigned to their true class based on their highest posterior class probability. Third, entropy is evaluated as a potential indicator of the quality of class assignment. Entropy is closely related to average class probabilities. Average class probabilities are computed by taking the mean of the highest class probability of each subject across subjects. In our study, entropy and average class probabilities correlated .95 and higher for each of the sets of 100 MC replications, hence only entropy is reported. Fourth, convergence rates are pre- 4As noted by an anonymous reviewer, larger numbers of MC replications would be preferable especially to obtain parameter coverage rates; however, the computation time of fitting mixture models imposes practical limits. 5Sample size effects and different class proportions are investigated in Lubke and Neale (2006). PERFORMANCE OF FACTOR MIXTURE MODELS 35 sented. Some additional indicators of model performance are presented and explained in the Results section. True models are fitted to the data throughout. Fifty sets of random starting values are provided for each fitted model. Ten iterations are computed for each of the 50 sets, and the 10 solutions with the highest likelihood values are then selected and iterated until the convergence criterion is met. Models with and without covariate effects are fitted in Part 2 to data generated with different effect sizes of a single continuous covariate. Because model estimation is carried out conditional on the covariate, and because the covariate only predicts class membership, a model neglecting the covariate effect can also be considered a true model. Fitting a model without covariate effects to data generated with covariates predicting class membership does not lead to biased parameter estimates. In this way, the effect of including covariates on the model performance (e.g., parameter coverage, correct class assignment, average class probabilities, entropy, and convergence) can be assessed. Fitting a model with a covariate effect on class membership should result in better performance because the covariate increases the MD between classes. The analyses are carried out using an extended version of the Runall utility in combination with Mplus 3.12.6 RESULTS In addition to the four main indicators of model performance, the number of Monte Carlo replications for which one of the two fitted classes contained only two or less subjects are reported. In these MC replications, the subjects of two classes of the data-generating model are apparently collapsed into a single class; the second class contains only an extremely small number of subjects. Note that throughout this section, coverage and so on are computed for noncollapsed MC replications. Part 1: Model Size and Complexity The results of Part 1 are shown in Table 1. As explained earlier, parameter coverage is measured as the proportion of MC replications for which the 95% confidence intervals of the estimated parameters cover the true parameters, and is reported for the different parameter matrices separately. Note that 100 MC replications is a relatively small number when computing confidence intervals. Individual coverage rates should therefore be used with some caution. The main focus here is on investigating trends when comparing model performance across different conditions. 6The original Runall utility is available on the Mplus Web site (www.statmodel.com). 36 LUBKE AND MUTHÉN TABLE 1 Part 1: Model Size Coverage Assign Entropy aLRT Collapsed Nonconverged υ Λ ψ α Θ αc 0 F1.d1 F1.d2 F2.d1 F2.d2 F3.d1 F3.d2 .94 .85 .70 .93 .73 .65 .37 1 14 0 .95 .86 .87 .94 .92 .75 .43 1 13 0 .94 .87 .72 .95 .74 .66 .37 1 13 0 .93 .89 .85 .96 .93 .75 .40 .98 6 0 .95 .85 .72 .94 .73 .66 .35 .98 12 0 .96 .89 .84 .94 .92 .75 .42 .99 8 0 LPM.d1 LPM.d2 .91 .96 .96 .87 .70 .45 .19 0 0 .96 .98 .85 .56 1 0 0 Note. F1 = single-factor model; F2 = two-factor model; F3 = three-factor model; LPM = latent profile model; d1 and d2 refer to class separation MD = 1 and MD = 1.5. Parameter coverage is presented for intercepts of observed variables υ, factor loadings Λ, factor variances and covariances ψ, factor mean differences α, residuals Θ and the intercept of the latent class variable αc, which corresponds to the estimation of the class proportions. Assign = proportion of correct class assignment; aLRT = adjusted likelihood ratio test; Collapsed = number of collapsed Monte Carlo (MC) replications as collapsed; Nonconverged = number of nonconverged MC replications. Correct class assignment is computed as the proportion of subjects for whom the highest posterior class probability was equal to the true class probability. In addition, Table 1 also shows the number of nonconverged and collapsed MC replications, entropy, and the adjusted likelihood ratio test (aLRT; Lo, Mendell, & Rubin, 2001). When comparing the four different models, it is obvious from Table 1 that model performance does not change with the increase in model size investigated here. For the three different factor models the parameter coverage, correct assignment, entropy, aLRT, and numbers of nonconverged or collapsed MC replications is almost identical. The latent profile model has slightly better entropy and correct assignment. As expected, class separation influences model performance. Parameter coverage of the factor mean and intercept differences is better for the larger distance models, as are entropy and correct assignment. Regarding the absolute values of model performance, parameter coverage is above 90% for factor loadings, factor variances and covariances, and residual variances even for the smaller class separation. Factor mean differences are covered less well, but approach 90% for larger separation. This result is important for researchers investigating factor mean differences across classes. Correct class assignment is 65% for small and 75% for larger separation. Hence, approximately PERFORMANCE OF FACTOR MIXTURE MODELS 37 one third and one quarter of the subjects are misclassified for the two levels of class separation studied in this part (i.e., factor mean differences of 1 and 1.5 SD, respectively). This classification error rate improves drastically with an increase in separation as shown in Part 2 next. Note that the aLRT performs very good except for the small distance latent profile model, where based on the aLRT the necessity of a second class would only be accepted in 19% of MC replications. Part 2: Mahalanobis Distance, Class Separation, and Covariate Effects Part 2 addresses class separation. Effects of separation due to factor mean differences across classes and effects of separation due to mean differences between classes of a single covariate are studied in a fully crossed design. This is done by combining data that are separated by MDi based on observed variables within class Y equaling 0.5, 1, 1.5, and 2 with covariate effects corresponding to MDi computed for the covariate X of 0, 0.5, 1, and 1.5. Note that the observed variable and covariate MDs are not strictly additive because covariates affecting class membership and observed variables within class are correlated, and an MD depends on the correlation. The data-generating model is a single-factor/two-class model with class-specific intercepts. Results can be compared to the first two columns of Table 1 featuring the same model but with class-invariant intercepts and factor mean differences. Parameter coverage is acceptable even for the smallest class separation without covariate effects. In fact, the coverage rates for class separation MDi = .05 are very similar to results reported in the first column of Table 1. The results are not reported in a separate table because they do not increase significantly with increasing separation or covariate effects, which indicates that they do not depend on the differences in class separation studied here. It is noteworthy that the coverage of the intercepts is slightly better than the coverage for factor mean differences (see also Table 3). The estimation of class proportions (i.e., parameter λc) increases from .33 to .73 with increasing separation without covariates and from .55 to .82 when including the largest covariate effects. Smaller covariate effects result in intermediate values. Correct class assignment and entropy are reported in Table 2. Correct assignment increases monotonically with class separation due to intercept differences and with increasing covariate effects. Entropy increases similarly, although increases in entropy and correct assignment are not related in a monotone way. The results presented in Table 2 are averaged over MC replications and show that intercept differences and covariate effects both contribute to the improvement of correct class assignment and entropy. The following post-hoc analysis was conducted to evaluate whether equal size Y intercept differences and X mean differences as measured by the corresponding MDs are equally important predictors of 38 LUBKE AND MUTHÉN TABLE 2 Part 2: Proportion Correct Class Assignment and Entropy for the Single-Factor/Two-Class Model With Increasing Intercept Differences and Covariate Effects MDx = 0 d1 d2 d3 d4 MDx = .5 MDx = 1 MDx = 1.5 Assign Entropy Assign Entropy Assign Entropy Assign Entropy .58 .74 .85 .92 .41 .33 .54 .73 .61 .76 .86 .92 .39 .38 .56 .75 .68 .80 .88 .93 .45 .45 .62 .77 .75 .84 .90 .94 .55 .55 .69 .82 Note. d1–d4 correspond to MDi computed for ⌼ equaling 0.5, 1, 1.5, and 2, respectively. MDx is the MD computed for ⌾. correct assignment. Because the observed MDs vary over MC replications, it is possible to select matched pairs of equal observed MDy and MDx, and to compare the correct assignment proportion of the corresponding individual MC replications using a paired t test. The t test was not significant, thereby showing no evidence that either intercept differences or covariate mean differences have a stronger influence on correct assignment when holding the MD induced by the two types of class separation constant. The aLRT was not significant for a large proportion of the data sets with the smallest intercept differences and no covariate effect (i.e., .96), indicating that the second class is not necessary to model the data. This result improved dramatically with increasing covariate effects or intercept differences. The aLRT was significant in all MC replications of a given combination of intercept and covariate effects when intercept differences or the covariate effect had an MD of 1, which mirrors the result of the single-factor model with factor mean differences studied in Part 1. The number of collapsed MC replications is 24 for the smallest intercept differences (MDi = 0.5), and decreases to 4 when adding the smallest covariate effect. All other combinations do not result in collapsing of the two true classes into a single class. This is a dramatic difference from the single-factor model in Part 1, which had 14 and 13 collapsed MC replications for MDi equaling 1 and 1.5, respectively. Although not the primary focus of Part 2, the result indicates that class-specific intercepts improve this aspect of model performance (see also the results of Part 3). As in Part 1, there was no nonconvergence. Part 3: Class-Specific Parameters The data-generating model in this part is the two-factor/two-class model. Potential effects of different class-specific parameters on model performance are investigated separately for intercepts, loadings, factor covariances, and residual vari- 39 PERFORMANCE OF FACTOR MIXTURE MODELS TABLE 3 Part 3: Fixed and Model Specific Parameters Λ Fixed Coverage Assign Entropy aLRT Collapsed Nonconverged ψ1 Λ1 Θ1 α ψ ν2 Λ2 Θ2 ψ2 αc d1 d2 .94 .94 Λ Cs ν Cs d1 d2 d1 d2 .74 .88 .88 .84 .91 .77 .91 .95 .94 .95 .95 .93 .96 .95 .95 .88 .94 .96 .76 .91 .93 .95 .96 .74 .32 1 0 0 .96 .85 .54 1 0 0 ψ Cs .88 .70 .34 .94 0 10 .98 .78 .43 1 0 5 .73 .65 .55 .91 0 0 .94 .93 .91 .83 .58 .93 0 0 Θ Cs d1 d2 d1 d2 .94 .95 .78 .86 .94 .96 .94 .93 .93 .91 .64 .86 .92 .93 .77 .85 .89 .92 .88 .87 .67 .33 1 0 0 .94 .97 .77 .40 .99 0 3 .77 .67 .40 .78 0 1 .89 .76 .46 .96 0 0 Note. d1 and d2 refer to class separation MD = 1 and MD = 1.5; CS = Class specific. Parameter coverage is presented for intercepts of observed variables ν, factor loadings Λ, factor variances and covariances ψ, factor mean differences, α, residuals Θ for both classes, and for the intercept of the latent class variable αc, which corresponds to the estimation of the class proportions. Assign = Proportion of correct class assignment; aLRT = adjusted likelihood ratio test; Collapsed = number of collapsed Monte Carlo (MC) replications; Nonconverged = number of nonconverged MC replications. ances. In addition, the effect of having fixed factor loadings is investigated. Results of Part 3 can be compared to the third and fourth columns of Table 1, which show the results of the two-factor/two-class model with class-invariant parameters (invariant except for factor mean differences). Fixing factor loadings has clear beneficial effects on the estimation of αc, the parameter related to class proportions, on correct assignment, and on the number of collapsed MC replications. The coverage of αc increases to .96 for both distances (from .74 and .93), the correct assignment increases to .74 and .85 (from .66 and .75), and there are no collapsed or nonconverged MC replications. Because the factor structure of the two-factor/two-class model studied here is similar to a simple linear growth mixture model, similar results can be expected for the latter. The model with class-specific loadings differs from the corresponding model in Part 1 only with respect to the loadings in the second class. Again, there is an improvement in model performance, although the improvement is (not surprisingly) less dramatic than for the model with fixed loadings. Estimates of factor mean differences have about the same coverage, as do the other within-class parameters. The coverage of αc is improved for the small distance, which is accompanied by a 40 LUBKE AND MUTHÉN slight increase in correct assignment. Entropy is not improved. Again, there are no collapsed MC replications. Note, however, that for the small distance there are 10 nonconverged MC replications, and 5 for the larger separation. Class-specific intercepts have slightly higher coverage rates than factor means. Note that there are no collapsed MC replications. Entropy is higher than for the corresponding model in Part 1, but this does not correspond with an improved correct assignment, at least not for the smaller separation. The aLRT seems to perform slightly worse. Recall that the measures of model performance including the aLRT are computed for the noncollapsed MC replications only. Class-specific factor covariances are recovered well, as the coverage is .88, and .94, respectively, which matches the coverage of the model in Part 1. Although αc is covered slightly better, class assignment remains at .67 and .77. Entropy and aLRT perform as in Part 1;, again, there are no collapsed MC replications. Finally, class-specific residual variances are also recovered well; coverage does not differ from the corresponding Part 1 results. There is no improvement regarding correct assignment, entropy, or the coverage of αc. Importantly, factor mean differences are less well recovered if residual variances are class specific. The aLRT performs worse for the smaller distance. There are no collapsed MC replications and one nonconverged MC replication. DISCUSSION Simulation studies are notorious for their limitations, and this study is no exception. The most important limiting factor 3in this study is the computation time needed to fit large numbers of mixture models. Possible extensions of this study may include (but are not limited to) an increase in the number of MC replications, other types of within-class models, smaller class separation, more than two classes, unequal class proportions, and varying sample sizes. A different study focusing on model selection when fitting correctly and incorrectly specified models has shown that incorrect model choice due to decreasing class separation can at least partially be compensated for with an increase in sample size (Lubke & Neale, 2006). Although this finding may extend to parameter coverage and correct class assignment, it would need further investigation. Unequal class proportions did not have a noticeable effect on model selection in that study; however, this may be different for parameter coverage especially if the number of subjects in the minority class is very small. The choice of scenarios in this study is based on reviewing articles describing analyses of empirical data, which are mainly studies using growth mixture models. Apart from the fact that loadings are usually fixed in growth models, class separation in most of those studies was larger than the separation investigated in this simulation. The general aim of this study is to obtain guidelines PERFORMANCE OF FACTOR MIXTURE MODELS 41 concerning the performance of more general factor mixture models under less favorable circumstances. The very general result across all three parts of this study is that parameter coverage is good even for small class separation, but that correct class assignment is satisfactory only when classes are well separated. The study shows that the coverage of factor mean differences and αc is more sensitive to class separation than other parameters, and decreases if variances (especially residual variances) are allowed to be class specific. Interestingly, the complexity of the within-class model with respect to the factor structure, or the number of observed variables within class, does not seem to greatly influence model performance. For the models evaluated in Part 1, there are virtually no differences between models. In an empirical study, class separation regarding observed variables within class is usually not under the control of the researcher. Although the coverage of within-class parameters is generally good, factor mean or intercept differences, which characterize the difference between classes researchers might be most interested in, are not covered satisfactorily at the smallest distance. In addition, classification error rates when assigning subjects to their correct class are quite high. As shown in Part 2, however, it is possible to improve coverage of factor mean or intercept differences and correct assignment considerably by including class-predicting covariates. Even if class separation with respect to the observed variables within class is very small, small covariate effects can already reduce classification errors. Our post-hoc comparison of class separation due to differences in Y and equal-sized class separation due to covariate effects showed no indication that inclusion of class-prediction covariates cannot fully counterbalance lacking mean differences in Y. It is important to note, though, that both observed variables Y and covariates X are the basis for the model-based clustering of subjects when fitting a factor mixture model with a class predicting covariate. If the interest is in, say, examining potential subtypes of a disorder, including covariates such as gender may lead to clusters of subjects that correspond to a mix of subtype and gender differences. It depends therefore on the particular application whether inclusion of covariates makes sense on a conceptual level. Having class-specific or class-invariant parameters is also usually not under the control of the researcher. Coverage of class-specific parameters and other aspects of model performance are especially important in the context of measurement invariance (Meredith, 1993). Measurement invariance holds if differences between classes can be modeled with factor mean differences and factor variance and covariance differences. In Part 1 we evaluated models that are measurement invariant across classes. The results were contrasted in Part 3 where models with different types of class-specific parameters were fitted. Most important, coverage of class-specific parameters is comparable to the coverage of class-invariant parameters. Considering the fact that none of the models with class-specific parameters resulted in collapsing of the true two classes into a single class, it is obvious that 42 LUBKE AND MUTHÉN measurement-invariant models are more problematic to fit. Up to 14% of the replications in some settings of the invariant models in Part 1 resulted in collapsed classes. In an empirical setting, in which a researcher usually has only a single data set, inclusion of covariates may help overcome this problem. It deserves attention that fixed factor loadings, which are common when fitting growth mixture models, result in an improvement of various aspects of model performance. Parameter coverage of the remaining parameters is .94 and above, and correct assignment is .74 and .85 when classes have a factor mean difference of 1 and 1.5 SD, respectively. Fixing loadings reduces the number of parameters involved in fitting the observed covariance structure. Hence, this model improvement may be related to the finding that class-invariant as opposed to class-specific residual variances increase the coverage of factor mean differences. The improvement of correct assignment when compared to a corresponding model with estimated factor loadings is not accompanied by an increase in entropy (see later). Given the variation found in the quality of class assignment in this study, an interesting question is how to obtain a good measure of the quality of class assignment in practice when there is only a single data set, true class membership is not known, and, as a result, correct assignment proportions cannot be computed. In this study, entropy and average class probabilities were highly correlated. Entropy seemed to vary with correct assignment, although not perfectly so. This study seemed to indicate that entropy values below .60 are generally related to misclassifying approximately 20% or more of the subjects. Entropy values around .80 and above are related to at least 90% correct assignment. Although the relation between entropy and correct assignment is not monotone, these values may serve as an indication of correct class assignment. Further research is needed to investigate cutoff values if the fitted model is a simplification of the true data-generating process, and if the fitted model includes slight model misspecifications. The aLRT (Lo et al., 2001) performed very well for the models investigated in this study. The performance deteriorated when class separation was only 0.5 SD and no covariates were included to improve overall separation. Convergence rates were generally above .95 and often equal to 1, and hence very satisfactory even for small separation. This study reveals that, in general, true factor loadings are recovered very well even for small distances. Other aspects of model performance are less satisfactory at the smaller distances investigated here. The study also shows that fixed factor loadings can result in a considerable improvement of model performance. Therefore, it may be worth evaluating a two-step analysis where loadings are estimated in a first step and fixed to the estimates in a second step. Although two-step procedures may induce other types of problems (for an interesting discussion see Ander- PERFORMANCE OF FACTOR MIXTURE MODELS 43 son & Gerbing, 1988, 1992; Fornell & Yi, 1992a, 1992b), it may be a useful approach in an exploratory context when classes are not well separated and inclusion of covariates is undesirable because the aim is to cluster subjects with respect to observed variables within class. ACKNOWLEDGMENTS The research by Gitta Lubke was supported through a subcontract to grant MH65322 by the National Institute of Mental Health (NIMH). The research of Bengt O. Muthén was supported by NIMH and the National Institute on Drug Abuse under grant No. MH40859, by NIMH under grants No. MH01259, No. MH38725, and No. MH42968, and by the National Institute on Alcohol Abuse and Alcoholism under grant K02 AA 00230. REFERENCES Anderson, J. C., & Gerbing, D. W. (1988). Structural equation modeling in practice: A review and recommended two-step approach. Psychological Bulletin, 103, 411–423. Anderson, J. C., & Gerbing, D. W. (1992). Assumptions and comparative strengths of the two-step approach: Comment on Fornell and Yi. Sociological Methods and Research, 20, 321–333. Arminger, G., Stein, P., & Wittenberg, J. (1999). Mixtures of conditional mean and covariance structure models. Psychometrika, 64, 475–494. Bamber, D., & Santen, v. J. P. H. (2000). How to assess a model’s testability and identifiability. Journal of Mathematical Psychology, 44, 20–40. Dolan, C. V., & van der Maas, H. L. J. (1998). Fitting multivariate normal finite mixtures subject to structural equation modeling. Psychometrika, 63, 227–253. Duncan, S. C., Duncan, T. E., & Strycker, L. A. (2000). Risk and protective factors influencing adolescent problem behavior: A multivariate latent growth curve analysis. Annals of Behavioral Medicine, 22, 103–109. Everitt, B. S., & Hand, D. J. (1981). Finite mixture distributions. London: Chapman & Hall. Fornell, C., & Yi, Y. (1992a). Assumptions of the two-step approach to latent variable modeling. Sociological Methods and Research, 20, 291–320. Fornell, C., & Yi, Y. (1992b). Assumptions of the two-step approach: Reply to Anderson and Gerbing. Sociological Methods and Research, 20, 334–339. Jedidi, K., Jagpal, H. S., & DeSarbo, W. S. (1997). STEMM: A general finite mixture structural equation model. Journal of Classification, 14, 23–50. Jedidi, K., Ramaswarmy, V., DeSarbo, W. S., & Wedel, M. (1996). On estimating finite mixtures of multivariate regression and simultaneous equation models. Structural Equation Modeling, 3, 266–289. Lazarsfeld, P. F., & Henry, N. W. (1968). Latent structure analysis. Boston: Houghton-Mifflin. Lo, Y., Mendell, N., & Rubin, D. B. (2001). Testing the number of components in a normal mixture. Biometrika, 88, 767–778. 44 LUBKE AND MUTHÉN Lubke, G. H., & Dolan, C. V. (2003). Can unequal residual variances across subpopulations mask differences in residual means in the common factor model? Structural Equation Modeling, 10, 175–192. Lubke, G. H., Dolan, C. V., Kelderman, H., & Mellenbergh, G. J. (2003). On the relationship between sources of within- and between-group differences and measurement invariance in the context of the common factor model. Intelligence, 173, 1–24. Lubke, G. H., & Muthén, B. O. (2005). Investigating population heterogeneity with factor mixture models. Psychological Methods, 10, 21–39. Lubke, G. H., & Neale, M. C. (2006). Distinguishing between latent classes and continuous factors: Resolution by maximum likelihood. Manuscript in press in Multivariate Behavioral Research. McArdle, J. J., & Epstein, D. (1987). Latent growth curves within developmental structural equation models. Child Development, 58, 110–133. McLachlan, G. J., & Peel, D. (2000). Finite mixture models. New York: Wiley. Meredith, W. (1993). Measurement invariance, factor analysis, and factorial invariance. Psychometrika, 58, 525–543. Muthén, B. O., & Muthén, L. K. (2000). Integrating person-centered and variable-centered analysis: Growth mixture modeling with latent trajectory classes: Alcoholism. Clinical and Experimental Research, 24, 882–891. Muthén, B. O., & Shedden, K. (1999). Finite mixture modeling with mixture outcomes using the EM algorithm. Biometrics, 55, 463–469. Muthén, L. K., & Muthén, B. O. (2001). Mplus user’s guide. Los Angeles: Muthén & Muthén. Muthén, L. K., & Muthén, B. O. (2004). Mplus version 3.01 [Computer program]. Los Angeles: Muthén & Muthén. Nagin, D. S. (1999). Analyzing developmental trajectories: A semi-parametric, group-based approach. Psychological Methods, 4, 139–157. Nagin, D. S., & Tremblay, R. E. (2001). Analyzing developmental trajectories of distinct but related behaviors: A group-based approach. Psychological Methods, 6, 18–34. Neuman, R. J., Todd, R. D., Heath, A. C., Reich, W., Hudziak, J. J., Bucholz, K. K., et al. (1999). The evaluation of ADHD typology in three constrasting samples: A latent class approach. Journal of the American Academy of Child and Adolescent Psychiatry, 38, 25–33. Rasmussen, E. R., Neuman, R. J., Heath, A. C., Levy, F., Hay, D. A., & Todd, R. D. (2002). Replication of the latent class structure of Attention-Deficit/Hyperactivity Disorder (ADHD) subtypes in a sample of Australian twins. Journal of Child Psychology and Psychiatry, 43, 1018–1028. Sörbom, D. (1974). A general method for studying differences in factor means and factor structure between groups. British Journal of Mathematical and Statistical Psychology, 27, 229–239. Vermunt, J. K., & Magidson, J. (2002). Latent class cluster analysis. In J. A. Hagenaars & A. L. McCutcheon (Eds.), Applied latent class analysis (pp. 89–106). Cambridge, UK: Cambridge University Press. Widaman, K. F., & Reise, S. P. (1997). Exploring the measurement invariance of psychological instruments: Applications in the substance use domain. In K. J. Bryant, M. Windle, & S. G. West (Eds.), The science of prevention: Methodological advances from alcohol and substance abuse research (pp. 281–324). Washington, DC: American Psychological Association. Yung, Y. F. (1997). Finite mixtures in confirmatory factor analysis models. Psychometrika, 62, 297–330. APPENDIX Here the parameter values used for the data generation are listed for each model. The notation corresponds to Equations 1 through 4. PERFORMANCE OF FACTOR MIXTURE MODELS 45 Part 1 Latent profile model. Class-invariant parameters: residual variances, diagonal of the matrix Θ ε = [.5.5.5.5.5.5.5.5]¢ Class-specific parameters: regression intercepts ν = [00000000]¢ MD = 1, regression intercepts class 2 ν = [.25 -.17.4 -.5.25 -.17.4 -.5]¢ MD = 1.5, regression intercepts class 2 ν = [.45 -.35.6 -.67.45 -.35.6 -.67]¢ Single-factor model. Class-invariant parameters: factor loadings Λ y = [1.8.8.8.8.8.8.8]¢ factor variance Ψ = 1 regression intercepts ν = [00000000]¢ residual variances, diagonal of the matrix Θ ε = [.25.15.15.15.15.15.15.15.15]¢ Class-specific parameters: factor mean class 1, first column of A = [0] MD = 1, factor mean class 2, second column of A = [1.05] MD = 1.5, factor mean class 2, second column of A = [1.55] Two-factor model. Class-invariant parameters: é 1 .8 .8 .8 0 0 0 0 ù ¢ ú factor loadings Λ y = ê ê 0 0 0 0 1 .8 .8 .8ú ë û é 1 .5ù ú factor covariance matrix Ψ = ê ê.5 1 ú ë û regression intercepts ν = [00000000]¢ residual variances, diagonal of the matrix Θ ε = [.25.15.15.15.25.15.15.15]¢ Class-specific parameters: factor means class 1, first column of A = [00]¢ MD = 1, factor mean class 2, second column of A = [.945.945]¢ MD = 1.5, factor mean class 2, second column of A = [1.351.35]¢ Three-factor model. Class-invariant parameters: é 1 .8 .8 .8 0 0 0 0 0 0 0 0 ù ¢ ê ú factor loadings Λ y = êê 0 0 0 0 1 .8 .8 .8 0 0 0 0 úú ê 0 0 0 0 0 0 0 0 1 .8 .8 .8ú êë úû 46 LUBKE AND MUTHÉN é 1 .5 .5ù ê ú factor covariance matrix Ψ = êê.5 1 .5úú êê.5 .5 1 úú ë û regression intercepts ν = [000000000000]¢ residual variances, diagonal of the matrix Θ ε = [.25.15.15.15.25.15.15.15.25 .15.15.15]¢ Class-specific parameters: factor means class 1, first column of A = [000]¢ MD = 1, factor mean class 2, second column of A = [.86.86.86]¢ MD = 1.5, factor mean class 2, second column of A = [1.261.261.26]¢ Part 2 Data for Part 2 are generated using the same parameter values as listed in Part 1 for the single-factor model, with the following exceptions. Single-factor model with class-specific intercepts and covariate effects. regression intercepts class 1 ν = [00000000]¢ regression intercepts class 2, MDy = 0.5, ν = [.5 -.1.15 -.075.5 -.1.15 -.075]¢ regression intercepts class 2, MDy = 1.0, ν = [.15 -.12.2 -.3.15 -.12.2 -.3]¢ regression intercepts class 2, MDy = 1.5, ν = [.2 -.2.36 -.4.2 -.2.36 -.4]¢ regression intercepts class 2, MDy = 2.0, ν = [.3 -.26.5 -.5.3 -.26.5 -.5]¢ covariate effect, MDx = 0.5, γ c = [.5] covariate effect, MDx = 1, γ c = [1.2] covariate effect, MDx = 1.5, γ c = [1.95] Part 3 Data for Part 3 are generated using the same parameter values as listed in Part 1 for the two-factor model, with the following exceptions. Two-factor model with class-specific loadings. é 1 .8 .6 .8 0 0 0 0 ù ¢ ú factor loadings class 1 Λ y = ê ê 0 0 0 0 1 .6 .8 .6ú ë û é 1 .9 .6 .9 0 0 0 0 ù ¢ ú factor loadings class 2 Λ y = ê ê 0 0 0 0 1 .6 .9 .9ú ë û PERFORMANCE OF FACTOR MIXTURE MODELS 47 Two-factor model with class-specific intercepts. regression intercepts class 1 ν = [00000000]¢ regression intercepts class 2, MD = 1, ν = [.1.1.34 -.2.1.1.34 -.2]¢ regression intercepts class 2, MD = 1.5, ν = [.2.34.41-.35.2.34.41-.35]¢ Two-factor model with class-specific factor variances and covariances. é 1 .5ù ú factor covariance matrix class 1 Ψ = ê ê.5 1 ú ë û é 1 .15ù ú factor covariance matrix class 1 Ψ = ê ê.15 1 ú ë û Two-factor model with class-specific residual variances. residual variances class 1, diagonal of the matrix Θ ε = [.25.15.15.15.25.15.15.15]¢ residual variances class 2, diagonal of the matrix Θ ε = [.3.3.1.1.3.3.1.1]¢
© Copyright 2026 Paperzz