To determine if candidate cancer biomarkers have utility in a clinical setting, validation using immunohistochemical methods is typically done. Most analyses of such data have not incorporated the multivariate nature of the staining proﬁles. In this article, we consider modelling such data using recently developed ideas from the machine learning community. In particular, we consider the joint goals of feature selection and classiﬁcation. We develop esti- mation procedures for the analysis of immunohistochemical proﬁles using the least absolute selection and shrinkage operator. These lead to novel and ﬂexible models and algorithms for the analysis of compositional data. The techniques are illustrated using data from a cancer biomarker study.
Available at: http://works.bepress.com/debashis_ghosh/30/