Abstract
Efficient algorithms have been developed for mining frequent patterns in traditional data where the content of each transaction is definitely known. It is a core technique used in many mining tasks like sequential pattern mining, correlative mining etc. As we know, fuzzy logic provides a mathematical framework that is compatible with poorly quantitative yet qualitatively significant data. Genetic algorithm (GA) is one of the optimization algorithms, which is invented to mimic some of the processes observed in natural evolution. It is a stochastic search technique based on the mechanism of natural selection and natural genetics. That is a general one, capable of being applied to an extremely wide range of problems. In this paper, we have fuzzified our original dataset and have applied various frequent pattern mining techniques on it. Then the result of a particular frequent pattern mining technique that is frequent pattern (FP) growth is taken into consideration in which we apply the concept of GA. Here, the frequent patterns observed are considered as the set of initial population. For the selection criteria, we consider the mean squared residue score rather using the threshold value. It was observed that out of the three fuzzy based frequent mining techniques and the GA based fuzzy FP growth technique the later finds the best individual frequent patterns. Also, the run time of the algorithm and the number of frequent patterns generated is far better than the rest of the techniques used. To extend our findings we have also compared the results obtained by the GA based fuzzy FP growth with an usual approach on a normalized dataset and then applied the concept of FP growth to find the frequent patterns followed by GA. Then by analyzing the result we found that GA based fuzzy FP growth stills yields the best individual frequent patterns.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Zhu, C., Zhang, X., Sun, J., Huang, B.: Algorithm for mining sequential pattern in time series data. In: International Conference on Communications and Mobile Computing, pp. 258–262 (January 2009)
Kriegel, H., Kroger, P., Zimek, A.: Clustering high-dimensional data: A survey on subspace clustering, pattern-based clustering, and correlation clustering. ACM Transactions on Knowledge Discovery from Data (TKDD) 3(1), 1–58 (2009)
Koh, Y., Rountree, N., O’Keefe, R.: Mining interesting imperfectly sporadic rules. Knowledge and Information Systems 14(2), 179–196 (2008)
Zheng, Z., Kohavi, R., Mason, L.: Real world performance of association rule algorithms. In: Proc. of ACM International Conference on Knowledge Discovery and Data Mining (SIGKDD), pp. 401–406 (2001)
Han, J., Cheng, H., Xin, D., Ya, X.: Frequent Pattern Mining: Current Status and Future Directions. Data Mining and Knowledge Discovery 15(1), 55–86 (2007)
Creighton, C., Hanash, S.: Mining gene expression databases for association rules. Journal of Bioinformatics 19, 79–86 (2003)
Zhang, A., Teo, B., Ooi, B., Tan, K.L.: Mining deterministic biclusters in gene expression data. In: Proc. of 4th Symposium on Bioinformatics and Bioengineering, pp. 283–292 (2004)
Pasquier, N., Bastide, Y., Taouil, R., Lakhal, L.: Discovering frequent closed itemsets for association rules. In: Beeri, C., Bruneman, P. (eds.) ICDT 1999. LNCS, vol. 1540, pp. 398–416. Springer, Heidelberg (1998)
Pei, J., Han, J., Mao, R.: CLOSET: An efficient algorithm for mining frequent closed item sets. In: Proc. of ACM SIGMOD International Workshop on Data Mining and Knowledge Discovery (DMKD), pp. 21–30 (2000)
Zaki, M.J., Hsiao, C.: CHARM: An efficient algorithm for closed association rule mining. In: Proc. of SIAM International Conf. on Data Mining (SDM), pp. 457–473 (2002)
Agrawal, R., Srikant, R.: Fast algorithms for mining association rules. In: Proc. of the 20th International Conference on Very Large Databases, pp. 478–499 (1994)
Zhang, S., Wu, X., Zhang, C., Lu, J.: Computing the minimum-support for mining frequent patterns. Knowledge and Information Systems 15, 233–257 (2008)
Zaki, M., Ogihara, M.: Theoretical foundations of association rules. In: Proc. of the 3rd ACM SIGMOD Workshop on Research Issues in Data Mining and Knowledge Discovery, pp. 85–93 (1998)
Silberschatz, A., Tuzhilin, A.: What makes patterns interesting in knowledge discovery systems. In: Proc. of IEEE Transactions on Knowledge and Data Engineering, pp. 970–974 (1996)
Xu, Y., Yu, Y., Liu, G., Lu, H.: From path tree to frequent patterns: a framework for Mining Frequent Patterns. In: Proc. of IEEE International Conf. on Data Mining (ICDM 2002), pp. 514–521 (2002)
Cohen, E., Datar, M., Fujiwara, S., Gionis, A., Indyk, P., Motwani, R., Ullman, J.D., Yang, C.: Finding interesting associations without support pruning. In: Proc. of IEEE Transactions on Knowledge and Data Engineering, pp. 64–78 (2001)
Roddick, J.F., Rice, S.: What’s interesting about cricket? – on thresholds and anticipation in discovered rules. In: Proc. of SIGKDD Explorations, pp. 1–5 (2001)
Hipp, J., Guntzer, U.: Is pushing constraints deeply into the mining algorithms really what we want? In: Proc. of SIGKDD Explorations, pp. 50–55 (2002)
Wang, K., He, Y., Han, J.: Pushing support constraints into association rules Mining. IEEE Transactions on Knowledge and Data Engineering, 642–658 (2003)
Han, J., Pei, J., Yin, Y., Mao, R.: Mining frequent patterns without candidate generations: a frequent pattern tree approach. Data Mining and knowledge Discovery 8(1), 53–87 (2004)
Chan, K.Y., Zhu, H.L., Lau, C.C., Ling, S.H.: Gene Signature Selection for Cancer Prediction Using an Integrated Approach of Genetic Algorithm and Support Vector Machine. In: Proc. of IEEE Congress on Evolutionary Computation(CEC 2008), pp. 217–224 (2008)
Chakraborty, A., Maka, H.: Biclustering of Gene Expression Data Using Genetic Algorithm. Proc. of IEEE, 765–770 (2005)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Mishra, D., Mishra, S., Satapathy, S.K., Patnaik, S. (2012). Genetic Algorithm Based Fuzzy Frequent Pattern Mining from Gene Expression Data. In: Patnaik, S., Yang, YM. (eds) Soft Computing Techniques in Vision Science. Studies in Computational Intelligence, vol 395. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25507-6_1
Download citation
DOI: https://doi.org/10.1007/978-3-642-25507-6_1
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25506-9
Online ISBN: 978-3-642-25507-6
eBook Packages: EngineeringEngineering (R0)