Causal Inference in Latent Class Analysisby Stephanie T. Lanza, Donna L. Coffman, Shu Xu

Structural Equation Modeling: A Multidisciplinary Journal


Inferring Multiple Causality: The Limitations of Path Analysis

P. S. Petraitis, A. E. Dunham, P. H. Niewiarowski

The challenge of causal inference

Olaf Dammann, Alan Leviton

The Severe 5%: A Latent Class Analysis of the Externalizing Behavior Spectrum in the United States

Michael G. Vaughn, Matt DeLisi, Tracy Gunter, Qiang Fu, Kevin M. Beaver, Brian E. Perron, Matthew O. Howard


This article was downloaded by: [Wilfrid Laurier University]

On: 10 September 2013, At: 13:24

Publisher: Routledge

Informa Ltd Registered in England and Wales Registered Number: 1072954 Registered office: Mortimer House, 37-41 Mortimer Street, London W1T 3JH, UK

Structural Equation Modeling: A

Multidisciplinary Journal

Publication details, including instructions for authors and subscription information:

Causal Inference in Latent Class Analysis

Stephanie T. Lanza a , Donna L. Coffman a & Shu Xu b a The Pennsylvania State University b New York University

Published online: 22 Jul 2013.

To cite this article: Stephanie T. Lanza , Donna L. Coffman & Shu Xu (2013) Causal Inference in

Latent Class Analysis, Structural Equation Modeling: A Multidisciplinary Journal, 20:3, 361-383, DOI: 10.1080/10705511.2013.797816

To link to this article:


Taylor & Francis makes every effort to ensure the accuracy of all the information (the “Content”) contained in the publications on our platform. However, Taylor & Francis, our agents, and our licensors make no representations or warranties whatsoever as to the accuracy, completeness, or suitability for any purpose of the Content. Any opinions and views expressed in this publication are the opinions and views of the authors, and are not the views of or endorsed by Taylor & Francis. The accuracy of the Content should not be relied upon and should be independently verified with primary sources of information. Taylor and Francis shall not be liable for any losses, actions, claims, proceedings, demands, costs, expenses, damages, and other liabilities whatsoever or howsoever caused arising directly or indirectly in connection with, in relation to or arising out of the use of the Content.

This article may be used for research, teaching, and private study purposes. Any substantial or systematic reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in any form to anyone is expressly forbidden. Terms &

Conditions of access and use can be found at

Structural Equation Modeling, 20:361–383, 2013

Copyright © Taylor & Francis Group, LLC

ISSN: 1070-5511 print/1532-8007 online

DOI: 10.1080/10705511.2013.797816

Causal Inference in Latent Class Analysis

Stephanie T. Lanza,1 Donna L. Coffman,1 and Shu Xu2 1The Pennsylvania State University 2New York University

The integration of modern methods for causal inference with latent class analysis (LCA) allows social, behavioral, and health researchers to address important questions about the determinants of latent class membership. In this article, 2 propensity score techniques, matching and inverse propensity weighting, are demonstrated for conducting causal inference in LCA. The different causal questions that can be addressed with these techniques are carefully delineated. An empirical analysis based on data from the National Longitudinal Survey of Youth 1979 is presented, where college enrollment is examined as the exposure (i.e., treatment) variable and its causal effect on adult substance use latent class membership is estimated. A step-by-step procedure for conducting causal inference in LCA, including multiple imputation of missing data on the confounders, exposure variable, and multivariate outcome, is included. Sample syntax for carrying out the analysis using

SAS and R is given in an appendix.

Keywords: average causal effect, causal inference, latent class analysis, propensity scores

Latent class analysis (LCA), a technique for identifying underlying subgroups (i.e., latent classes) in a population, is a statistical method that is now widely accessible to and frequently used by social, behavioral, and health researchers. With this technique, a model with a userspecified number of latent classes is fit to a data set, yielding a vector of latent class membership probabilities and a matrix of class-specific probabilities of each response to the set of observed variables used to measure the latent variable. Recent software advances, including PROC LCA (Lanza, Dziak, Huang, Wagner, & Collins, 2013) and Mplus (Muthén & Muthén, 1998–2010), make conducting LCA and its extensions straightforward. In addition, finite mixture models more complex than LCA are becoming widely adopted by applied researchers. These models include latent transition analysis (Collins & Lanza, 2010), associative latent transition analysis (Bray, Lanza, & Collins, 2010), growth mixture modeling (Muthén & Shedden, 1999; Nagin, 2005), and finite mixture regression (Wedel & DeSarbo, 2002). All of these models share the characteristic that underlying heterogeneity is explained by a latent grouping variable, but that individuals’ actual group membership cannot be known with certainty. Rather, each individual has a (typically nonzero) probability of membership in each latent class.

Correspondence should be addressed to Stephanie T. Lanza, The Methodology Center, The Pennsylvania State

University, 204 E. Calder Way, Suite 400, State College, PA 16801. E-mail: 361

D ow nl oa de d by [W ilf rid

L au rie r U niv ers ity ] a t 1 3:2 4 1 0 S ep tem be r 2 01 3 362 LANZA, COFFMAN, XU

A well-understood extension of finite mixture models that holds great practical importance is the ability to include observed covariates, which serve as predictors of latent class membership.

This extension is important in that it allows scientists to better understand the composition of each subgroup. For example, researchers could use LCA to identify latent classes of substance use behavior in adolescence. In this case, the identification of factors that are significantly related to increased odds of membership in classes characterized by high-risk behavior would allow for preventive interventions to be targeted toward individuals with high levels on those factors.