Skip to main content
Article
Boosting Computational Effectiveness in Big Spatial Flow Data Analysis with Intelligent Data Reduction
International Journal of Geo-Information
  • Ran Tao, University of South Florida
  • Zhaoya Gong, University of Birmingham
  • Qiwei Ma, University of Birmingham
  • Jean-Claude Thill, University of North Carolina
Document Type
Article
Publication Date
1-1-2020
Keywords
  • big flow data,
  • head/tail breaks,
  • geocomputation,
  • network analysis,
  • data reduction
Digital Object Identifier (DOI)
https://doi.org/10.3390/ijgi9050299
Disciplines
Abstract

One of the enduring issues of spatial origin-destination (OD) flow data analysis is the computational inefficiency or even the impossibility to handle large datasets. Despite the recent advancements in high performance computing (HPC) and the ready availability of powerful computing infrastructure, we argue that the best solutions are based on a thorough understanding of the fundamental properties of the data. This paper focuses on overcoming the computational challenge through data reduction that intelligently takes advantage of the heavy-tailed distributional property of most flow datasets. We specifically propose the classification technique of head/tail breaks to this end. We test this approach with representative algorithms from three common method families, namely flowAMOEBA from flow clustering, Louvain from network community detection, and PageRank from network centrality algorithms. A variety of flow datasets are adopted for the experiments, including inter-city travel flows, cellphone call flows, and synthetic flows. We propose a standard evaluation framework to evaluate the applicability of not only the selected three algorithms, but any given method in a systematic way. The results prove that head/tail breaks can significantly improve the computational capability and efficiency of flow data analyses while preserving result quality, on condition that the analysis emphasizes the “head” part of the dataset or the flows with high absolute values. We recommend considering this easy-to-implement data reduction technique before analyzing a large flow dataset.

Rights Information
Creative Commons Attribution 4.0
Citation / Publisher Attribution

International Journal of Geo-Information, v. 9, issue 5, art. 299

Citation Information
Ran Tao, Zhaoya Gong, Qiwei Ma and Jean-Claude Thill. "Boosting Computational Effectiveness in Big Spatial Flow Data Analysis with Intelligent Data Reduction" International Journal of Geo-Information Vol. 9 Iss. 5 (2020)
Available at: http://works.bepress.com/ran-tao/9/