Analyzing data with clumping at zero: An example demonstration
Journal of Clinical Epidemiology (2000)
This article demonstrates two approaches to analyzing the relationship of multiple covariates to an outcome that has a high proportion of zero values. One approach is to categorize the continuous outcome (including the zero category) and then fit a proportional odds model. The other is to use logistic regression to model the probability of a zero response and ordinary least squares linear regression to model the non-zero continuous responses. Both approaches are demonstrated using outcome data from the Springfield Elder Project on hours of care received. A crude linear model including both zero and non-zero values was also fitted for comparison. We conclude that the choice of approach depends on the data. If the proportional odds assumption is valid, the proportional odds model appears to be the method of choice; otherwise, the combination of logistic regression and a linear model is preferable.
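The second approach, a logistic model for the zero responses combined with a linear model for the non-zero responses, can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration and is not taken from the article: the data are simulated, there is a single covariate, and the helper names (`fit_logistic`, `fit_ols`, `predict_mean`) and the "true" coefficients are hypothetical. It uses only the Python standard library, and it does not cover the proportional odds model.

```python
import math
import random


def sigmoid(t):
    return 1.0 / (1.0 + math.exp(-t))


def fit_logistic(x, z, iters=25):
    """Newton-Raphson fit of P(z = 1) = sigmoid(a + b*x), one covariate."""
    a, b = 0.0, 0.0
    for _ in range(iters):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for xi, zi in zip(x, z):
            p = sigmoid(a + b * xi)
            w = p * (1.0 - p)
            g0 += zi - p            # score for the intercept
            g1 += (zi - p) * xi     # score for the slope
            h00 += w                # entries of the information matrix
            h01 += w * xi
            h11 += w * xi * xi
        det = h00 * h11 - h01 * h01
        # Solve the 2x2 system (information matrix) * step = score.
        a += (h11 * g0 - h01 * g1) / det
        b += (h00 * g1 - h01 * g0) / det
    return a, b


def fit_ols(x, y):
    """Closed-form simple linear regression; returns (intercept, slope)."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    return my - slope * mx, slope


def predict_mean(x_new, zero_fit, ols_fit):
    """Overall expected outcome: P(non-zero) * E[outcome | non-zero]."""
    a, b = zero_fit
    c, d = ols_fit
    return (1.0 - sigmoid(a + b * x_new)) * (c + d * x_new)


# Simulated data (hypothetical values, NOT the Springfield Elder Project data).
random.seed(0)
TRUE_ZERO = (0.5, -2.0)    # assumed logit coefficients for P(outcome == 0)
TRUE_LINEAR = (5.0, 1.0)   # assumed coefficients for the non-zero outcomes
xs, ys = [], []
for _ in range(2000):
    x = random.uniform(-2.0, 2.0)
    if random.random() < sigmoid(TRUE_ZERO[0] + TRUE_ZERO[1] * x):
        y = 0.0
    else:
        y = TRUE_LINEAR[0] + TRUE_LINEAR[1] * x + random.gauss(0.0, 1.0)
    xs.append(x)
    ys.append(y)

# Part 1: logistic regression on the zero indicator, using all observations.
zero_flags = [1 if y == 0.0 else 0 for y in ys]
zero_fit = fit_logistic(xs, zero_flags)

# Part 2: ordinary least squares on the non-zero responses only.
nonzero = [(x, y) for x, y in zip(xs, ys) if y != 0.0]
ols_fit = fit_ols([x for x, _ in nonzero], [y for _, y in nonzero])

print("zero-part (intercept, slope):", zero_fit)
print("non-zero part (intercept, slope):", ols_fit)
print("predicted mean outcome at x = 0:", predict_mean(0.0, zero_fit, ols_fit))
```

The two parts are fitted separately, so each set of coefficients has its own interpretation: the first describes who has any non-zero outcome, the second describes how large the outcome is among those who do. In practice one would likely fit both parts with a statistics package and several covariates, as the article does.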
Publication Date: October 2000
Citation Information: Chang BH, Pocock S. Analyzing data with clumping at zero. An example demonstration. J Clin Epidemiol. 2000 Oct;53(10):1036-43. PubMed PMID: 11027937.