Abstract
An improved range data distribution method is proposed, which is suitable for both the homogeneous and heterogeneous database cluster with consideration of full use of different computing resources of nodes. In order to avoid the problem of load imbalance caused by the hot accessing, an online migrating algorithm is presented during the parallel processing. The experimental results show that the improved range partition method and the rebalancing strategy of online migrating algorithm not only significantly improve the throughput of database cluster but also keep the balanced state well. At the same time, the cluster system has achieved better scalability.
Supported by the National High-Tech Research and Development Plan of China under Grant(863) No.2007AA01Z153, also Supported by the Natural Science Foundation of Zhejiang Province No.Y1080102 and the School Scientific Research Founds of ZJUT No.X1038109.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Nguyen, K.Q., Thompson, T., Bryan, G.: An enhanced hybrid range partitioning strategy for parallel database systems. In: Proceedings of the Eighth International Workshop on Database and Expert System Applications, pp. 289–294. IEEE Computer Society, Los Alamitos (1997)
Hababeh, I.O., Ramachandran, M., Bowring, N.: A high-performance computing method for data allocation in distributed database systems. The Journal of Supercomputing 39, 3–18 (2007)
Wang, J., Tsutaya, Y., Segawa, N., et al.: Approaches to balancing data load of shared-nothing clusters and their performance comparison. In: Proceedings of the 9th International Conference on Parallel and Distributed Systems, pp. 293–299. IEEE Computer Society, Los Alamitos (2002)
Perez, J.M., Garcia, F., Carretero, J., Calderon, A., Sanchez, L.M.: Data allocation and load balancing for heterogeneous cluster storage systems. In: Proceedings of the 3rd IEEE/ACM International Sympo-sium on Cluster Computing and the Grid (CCGRID 2003), pp. 718–723 (2003)
Hirano, Y., Satoh, T., Inoue, U., Teranaka, K.: Load balancing algorithms for parallel database processing on shared memory multiprocessors. In: Proceedings of 1st Parallel and Distributed Information Systems, pp. 210–217 (1991)
De Giusti, A.E., Naiouf, M.R., De Giusti, L.C., Chichizola, F.: Dynamic load balancing in parallel processing on non-homogeneous clusters. Journal of Computer Science and Technology 5(4), 272–278 (2005)
Rahm, E., Marek, R.: Analysis of dynamic load balancing strategies for parallel shared nothing database systems. In: Proceedings of 19th Conference on VLDB, pp. 182–193 (1993)
Scheuermann, P., Weikum, G., Zabback, P.: Data partitioning and load balancing in parallel disk systems. The VLDB Journal (7), 48–66 (1998)
Dewitt, D., Gray, J.: Parallel database system: the future of high performance database systems. Communication of ACM 33(6) (1992)
Beynon, M.D., Kurc, T., Catalyurek, U., Chang, C., Sussman, A., Saltz, J.: Distributed processing of very large datasets with DataCutter. Parallel Computing 27(11), 1457–1478 (2001)
Zhu, F., Sun, X., Salzberg, B., Hvasshovd, S.-O.: Supporting load balancing and efficient reorganization during system scaling. In: Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium(IPDPS 2005) (April 2005)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Gong, W., Yang, L., Huang, D., Chen, L. (2009). New Balanced Data Allocating and Online Migrating Algorithms in Database Cluster. In: Li, Q., Feng, L., Pei, J., Wang, S.X., Zhou, X., Zhu, QM. (eds) Advances in Data and Web Management. APWeb WAIM 2009 2009. Lecture Notes in Computer Science, vol 5446. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-00672-2_46
Download citation
DOI: https://doi.org/10.1007/978-3-642-00672-2_46
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-00671-5
Online ISBN: 978-3-642-00672-2
eBook Packages: Computer ScienceComputer Science (R0)