Genetic Algorithm Based Fuzzy Frequent Pattern Mining from Gene Expression Data

Soft Computing Techniques in Vision Science

Part of the book series: Studies in Computational Intelligence ((SCI,volume 395))

Abstract

Efficient algorithms have been developed for mining frequent patterns in traditional data, where the content of each transaction is known with certainty. Frequent pattern mining is a core technique used in many mining tasks, such as sequential pattern mining and correlation mining. Fuzzy logic provides a mathematical framework suited to data that are poorly quantified yet qualitatively significant. The genetic algorithm (GA) is an optimization algorithm designed to mimic some of the processes observed in natural evolution. It is a stochastic search technique based on the mechanisms of natural selection and natural genetics, and it is general enough to be applied to an extremely wide range of problems. In this paper, we fuzzify our original dataset and apply various frequent pattern mining techniques to it. We then take the result of one particular technique, frequent pattern (FP) growth, and apply GA to it. The frequent patterns found by FP growth form the initial population. As the selection criterion, we use the mean squared residue score rather than a threshold value. Of the three fuzzy-based frequent pattern mining techniques and the GA-based fuzzy FP growth technique, the latter finds the best individual frequent patterns. Its run time and the number of frequent patterns it generates are also far better than those of the other techniques. To extend our findings, we also compared the results of the GA-based fuzzy FP growth with a conventional approach in which the dataset is normalized, FP growth is applied to find the frequent patterns, and GA is then applied. Analysis of the results shows that the GA-based fuzzy FP growth still yields the best individual frequent patterns.
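As an illustration of the selection criterion mentioned in the abstract, the following is a minimal sketch of a mean squared residue score used to rank candidate patterns. It assumes the widely used Cheng and Church definition of the score; the abstract does not state which formulation the chapter uses. The representation of a candidate as a pair of row and column indices, the function names `mean_squared_residue` and `select_fittest`, and the toy matrix are all hypothetical and chosen only for this example.

```python
def mean_squared_residue(matrix, rows, cols):
    """Mean squared residue of the submatrix given by `rows` and `cols`.

    Assumption: this follows the Cheng and Church definition, where the
    residue of a cell is value - row_mean - column_mean + overall_mean.
    """
    sub = [[matrix[i][j] for j in cols] for i in rows]
    n_rows, n_cols = len(sub), len(sub[0])
    row_means = [sum(row) / n_cols for row in sub]
    col_means = [sum(sub[i][j] for i in range(n_rows)) / n_rows
                 for j in range(n_cols)]
    overall_mean = sum(row_means) / n_rows
    total = 0.0
    for i in range(n_rows):
        for j in range(n_cols):
            residue = sub[i][j] - row_means[i] - col_means[j] + overall_mean
            total += residue * residue
    return total / (n_rows * n_cols)


def select_fittest(matrix, population, keep):
    """Keep the `keep` candidates with the lowest score.

    Assumption: each candidate is a (rows, cols) pair of index tuples,
    and a lower score means a more coherent pattern.
    """
    ranked = sorted(
        population,
        key=lambda candidate: mean_squared_residue(matrix, candidate[0], candidate[1]),
    )
    return ranked[:keep]


# Hypothetical toy expression matrix: rows are genes, columns are conditions.
expression = [
    [1.0, 2.0, 3.0],
    [2.0, 3.0, 4.0],
    [4.0, 5.0, 6.0],
    [9.0, 1.0, 5.0],
]
population = [((0, 3), (0, 1, 2)), ((0, 1), (0, 1, 2))]
best = select_fittest(expression, population, keep=1)
```

In this toy example the candidate covering rows 0 and 1 follows a purely additive pattern, so its score is zero and it is ranked ahead of the candidate that includes the irregular row 3. In a full GA loop, a step like this would sit alongside crossover and mutation, which are omitted here.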

Corresponding author

Correspondence to Debahuti Mishra.

Copyright information

© 2012 Springer-Verlag Berlin Heidelberg

About this chapter

Cite this chapter

Mishra, D., Mishra, S., Satapathy, S.K., Patnaik, S. (2012). Genetic Algorithm Based Fuzzy Frequent Pattern Mining from Gene Expression Data. In: Patnaik, S., Yang, YM. (eds) Soft Computing Techniques in Vision Science. Studies in Computational Intelligence, vol 395. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25507-6_1

  • DOI: https://doi.org/10.1007/978-3-642-25507-6_1

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-25506-9

  • Online ISBN: 978-3-642-25507-6

  • eBook Packages: Engineering, Engineering (R0)
