Skip to main content
Article
Penalized regression procedures for variable selection in the potential outcomes framework
Technical Report, Department of Statistics, Penn State University (2013)
  • Debashis Ghosh, Penn State University
  • Yeying Zhu, University of Waterloo
  • Donna L Coffman, Penn State University
Abstract

A recent topic of much interest in causal inference is model selection. In this article, we describe a framework in which to consider penalized regression approaches to variable selection for causal effects. The framework leads to a simple `impute, then select' class of procedures that is agnostic to the type of imputation algorithm as well as penalized regression used. It also clarifies how model selection involves a multivariate regression model, and that these methods can be applied for identifying subgroups in which treatment effects are homogeneous. Analogies and links with the literature on machine learning methods, missing data and imputation are drawn. A shared LASSO and difference LASSO algorithm are defined, along with their multiple imputation analogues. The procedures are illustrated using a well-known right heart catheterization dataset.

Keywords
  • Average causal effect; counterfactual,
  • imputed data; L1 penalty; qualitative interaction; treatment heterogeneity.
Publication Date
2013
Citation Information
Debashis Ghosh, Yeying Zhu and Donna L Coffman. "Penalized regression procedures for variable selection in the potential outcomes framework" Technical Report, Department of Statistics, Penn State University (2013)
Available at: http://works.bepress.com/debashis_ghosh/59/