Skip to main content
Article
Analyzing data with clumping at zero: An example demonstration
Journal of Clinical Epidemiology (2000)
  • Bei-Hung Chang, Boston University
  • Stuart Pocock, New England Research Institutes
Abstract
This article demonstrates the use of two approaches to analyzing the relationship of multiple covariates to an outcome which has a high proportion of zero values. One approach is to categorize the continuous outcome (including the zero category) and then fit a proportional odds model. Another approach is to use logistic regression to model the probability of a zero response and ordinary least squares linear regression to model the non-zero continuous responses. The use of these two approaches was demonstrated using outcomes data on hours of care received from the Springfield Elder Project. A crude linear model including both zero and non-zero values was also used for comparison. We conclude that the choice of approaches for analysis depends on the data. If the proportional odds assumption is valid, then it appears to be the method of choice; otherwise, the combination of logistic regression and a linear model is preferable.
Publication Date
October, 2000
DOI
10.1016/S0895-4356(00)00223-7
Citation Information
Chang BH, Pocock S. Analyzing data with clumping at zero. An example demonstration. J Clin Epidemiol. 2000 Oct;53(10):1036-43. PubMed PMID: 11027937.