Abstract
In this paper, the fuzzy association algorithm based on Load Classifier is proposed to study the fuzzy association rules of numerical data flow. A method of dynamic partitioning of data blocks by load classifier for data stream is proposed, and the membership function of design optimization is proposed. The FP-Growth algorithm is used to realize the parallelization processing of fuzzy association rules. First, based on the load balancing classifier, variable window is proposed to divide the original data stream. Second, the continuous data preprocessing is performed and is converted into fuzzy interval data by the improved membership function. Finally through simulation experiments of the Load Classifier, compared with the four algorithms, the data processing time is similar after convergence, and the data processing time of SDBA (Spark Dynamic Block Adjustment Spark) is lower than 25 ms.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Sinthuja, M., Puviarasan, N., Aruna, P.: Mining frequent itemsets using proposed top-down approach based on linear prefix tree (TD-LP-Growth). In: Smys, S., Bestak, R., Chen, J.Z., Kotuliak, I. (eds.) International Conference on Computer Networks and Communication Technologies. Lecture Notes on Data Engineering and Communications Technologies, vol. 15, pp. 23–32. Springer, Singapore (2019). https://doi.org/10.1007/978-981-10-8681-6_4
Imielienskin, T., Swami, A., Agrawal, R.: Mining association rules between set of items in large databases. ACM Sigmod Rec. 22(2), 207–216 (1993)
Han, J., Pei, J., Yin, Y.: Mining frequent patterns without candidate generation. In: Proceedings of the ACM SIGMOD International Conference on Management of Data, pp. 1–12. ACM (2000)
Zheng, H., He, J., Zhang, Y.C., Shi, Y.: A fuzzy decision tree approach based on data distribution construction. In: Proceedings of the Australasian Computer Science Week Multiconference. ACM (2017). https://doi.org/10.1145/3014812.3014817
El-Hajj, M., Zaïane, O.R.: COFI approach for mining frequent itemsets revisited. In: Proceedings of the 9th ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery (DMKD 2004), pp. 70–75. ACM, New York (2004)
Borgelt, C., Yang, X., Nogales-Cadenas, R., Carmona-Saez, P., Pascual-Montano, A.: Finding closed frequent item sets by intersecting transactions. In: Proceedings of the 14th International Conference on Extending Database Technology (EDBT/ICDT 2011), pp. 367–376. ACM, New York (2011)
Kim, M.-S., Kim, S.-W., Shin, M.: Optimization of subsequence matching under time warping in time-series databases. In: Proceedings of the 2005 ACM Symposium on Applied Computing (SAC 2005), pp. 581–586. ACM, New York (2005)
Thanh Lam, H., Calders, T.: Mining top-k frequent items in a data stream with flexible sliding windows. In: Proceedings of the 16th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2010), pp. 283–292. ACM, New York (2010)
Giannella, C., Han, J., Pei, J., et al.: Mining frequent patterns in data streams at multiple time granularities. Next Gener. Data Min. 212, 191–212 (2003)
Ramkumar, T., Srinivasan, R., Hariharan, S.: Synthesizing global association rules from different data sources based on desired interestingness metrics. Int. J. Inf. Technol. Decis. Making 13(03), 473–495 (2014)
Leung, C.K.-S., Carmichael, C.L., Hao, B.: Efficient mining of frequent patterns from uncertain data. In: IEEE ICDM Workshops, pp. 489–494 (2007)
Leung, C.K.-S., Mateo, M.A.F., Brajczuk, D.A.: A tree-based approach for frequent pattern mining from uncertain data. In: Washio, T., Suzuki, E., Ting, K.M., Inokuchi, A. (eds.) PAKDD 2008. LNCS (LNAI), vol. 5012, pp. 653–661. Springer, Heidelberg (2008). https://doi.org/10.1007/978-3-540-68125-0_61
Li, H., Wang, Y., Zhang, D., Zhang, M., Chang, E.Y.: PFP: parallel fp-growth for query recommendation. In: Proceedings of the 2008 ACM Conference on Recommender Systems (RecSys 2008), pp. 107–114. ACM, New York (2008)
Delgado, M., Marin, N., Sanchez, D., Vila, M.: Fuzzy association rules: general model and applications. IEEE Trans. Fuzzy Syst. 11(2), 214–225 (2003)
Koh, Y.S.: Rare association rule mining and knowledge discovery: Technologies for infrequent and critical event detection: volume 3 (2009)
Padillo, F., Luna, J.M., Ventura, S.: An evolutionary algorithm for mining rare association rules: a big data approach. In: 2017 IEEE Congress on Evolutionary Computation (CEC), pp. 2007–2014 (2017)
Shakeri, O., Pedram, M.M., Kelarestaghi, M.: A fuzzy constrained stream sequential pattern mining algorithm. In: 7th International Symposium on Telecommunications, pp. 20–24 (2014)
Chen, C., et al.: Finding active membership functions for genetic-fuzzy data mining. Int. J. Inf. Technol. Decis. Making 14(06), 1215–1242 (2015)
Grahne, G., Zhu, J.: Mining frequent itemsets from secondary memory. In: Fourth IEEE International Conference on Data Mining (ICDM 2004), Brighton, UK, pp. 91–98 (2004)
Zhou, L., Zhong, Z., Chang, J., et al.: Balanced parallel FP-growth with MapReduce. In: 2010 IEEE Youth Conference on Information, Computing and Telecommunications, Beijing, pp. 243–246 (2010)
Lin, J.C.W., Zhang, Y., Fournier-Viger, P., Hong, T.P.: Efficiently updating the discovered multiple fuzzy frequent itemsets with transaction insertion. Int. J. Fuzzy Syst. 20(8), 2440–2457 (2018)
Lin, J.C., et al.: Mining weighted frequent itemsets without candidate generation in uncertain databases. Int. J. Inf. Technol. Decis. Making 16(06), 1549–1579 (2017)
Funding
This work was supported in part by the National Natural Science Foundation of P. R. China (No. 61672296, No. 61602261, and No. 61762071), the Major Natural Science Research Projects in Colleges and Universities of Jiangsu Province (No. 18KJA520008), and the Postgraduate Research and Practice Innovation Program of Jiangsu Province (No. SJKY19_0761, No. SJKY19_0759).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Chen, J., Zheng, H., Li, P., Zhang, Z., Li, H., Liu, W. (2020). Fuzzy Association Rule Mining Algorithm Based on Load Classifier. In: He, J., et al. Data Science. ICDS 2019. Communications in Computer and Information Science, vol 1179. Springer, Singapore. https://doi.org/10.1007/978-981-15-2810-1_18
Download citation
DOI: https://doi.org/10.1007/978-981-15-2810-1_18
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-15-2809-5
Online ISBN: 978-981-15-2810-1
eBook Packages: Computer ScienceComputer Science (R0)