Skip to main content
Unpublished Paper
Approximate Principal Direction Trees
(2012)
  • Mark McCartin-Lim
  • Andrew McGregor, University of Massachusetts - Amherst
  • Rui Wang
Abstract

We introduce a new spatial data structure for high dimensional data called the approximate principal direction tree (APD tree) that adapts to the intrinsic dimension of the data. Our algorithm ensures vector-quantization accuracy similar to that of computationally-expensive PCA trees with similar time-complexity to that of lower- accuracy RP trees.

APD trees use a small number of power- method iterations to find splitting planes for recursively partitioning the data. As such they provide a natural trade-o between the running-time and accuracy achieved by RP and PCA trees. Our theoretical results establish a) strong performance guarantees regardless of the convergence rate of the power- method and b) that O(log d) iterations suffice to establish the guarantee of PCA trees when the intrinsic dimension is d. We demonstrate this trade-off and the efficacy of our data structure on both the CPU and GPU.

Disciplines
Publication Date
June 18, 2012
Comments
This is the pre-published version harvested from arXiv.
Citation Information
Mark McCartin-Lim, Andrew McGregor and Rui Wang. "Approximate Principal Direction Trees" (2012)
Available at: http://works.bepress.com/andrew_mcgregor/1/