Discrete nonparametric algorithms for outlier detection with genomic data
In high-throughput studies involving genetic data such as from gene expression microarrays, differential expression analysis between two or more experimental conditions has been a very common analytical task. Much of the resulting literature on multiple comparisons has paid relatively little attention to the choice of test statistic. In this article, we focus on the issue of choice of test statistic based on a special pattern of differential expression. The approach here is based on recasting multiple comparisons procedures for assessing outlying expression values. A major complication is that the resulting p-values are discrete; some theoretical properties of sequential testing procedures in this context are explored. We propose the use of q-value estimation procedures in this setting. Data from a gene expression profiling experiment in prostate cancer are used to illustrate the methodology.
Debashis Ghosh. "Discrete nonparametric algorithms for outlier detection with genomic data" Journal of Biopharmaceutical Statistics, to appear (2010).
Available at: http://works.bepress.com/debashis_ghosh/40