Skip to main content
Other
Exploiting redundancy to boost performance in a RAID-10 style cluster-based file system
School of Computing: Faculty Publications
  • Yifeng Zhu, University of Maine
  • Hong Jiang, University of Nebraska-Lincoln
  • Xiao Qin, New Mexico Institute of Mining and Technology
  • Dan Feng, Huazhong University of Science and Technology, Wuhan, China
  • David Swanson, University of Nebraska-Lincoln
Date of this Version
1-1-2006
Disciplines
Comments

Published in Cluster Computing 9 (2006), pp. 433–447; doi: 10.1007/s10586-006-0011-6 Copyright © 2006 Springer Science+Business Media, LLC. Used by permission.

Abstract

While aggregating the throughput of existing disks on cluster nodes is a cost-effective approach to alleviate the I/O bottleneck in cluster computing, this approach suffers from potential performance degradations due to contentions for shared resources on the same node between storage data processing and user task computation. This paper proposes to judiciously utilize the storage redundancy in the form of mirroring existed in a RAID-10 style file system to alleviate this performance degradation. More specifically, a heuristic scheduling algorithm is developed, motivated from the observations of a simple cluster configuration, to spatially schedule write operations on the nodes with less load among each mirroring pair. The duplication of modified data to the mirroring nodes is performed asynchronously in the background. The read performance is improved by two techniques: doubling the degree of parallelism and hot-spot skipping. A synthetic benchmark is used to evaluate these algorithms in a real cluster environment and the proposed algorithms are shown to be very effective in performance enhancement.

Citation Information
Yifeng Zhu, Hong Jiang, Xiao Qin, Dan Feng, et al.. "Exploiting redundancy to boost performance in a RAID-10 style cluster-based file system" (2006)
Available at: http://works.bepress.com/yifeng_zhu/11/