Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Association rule mining with mostly associated sequential patterns

Published: 01 April 2015 Publication History

Abstract

Extraction of interesting patterns from data in the form of mostly associated sequential patterns.Speeding up the finding interesting patterns in data.Providing a tool for visual exploration of patterns extracted from data.Ability to be used for searching patterns in big data.The proposed algorithm can be extended to find different type of patterns such as the weakest associated patterns. In this paper, we address the problem of mining structured data to find potentially useful patterns by association rule mining. Different than the traditional find-all-then-prune approach, a heuristic method is proposed to extract mostly associated patterns (MASPs). This approach utilizes a maximally-association constraint to generate patterns without searching the entire lattice of item combinations. This approach does not require a pruning process. The proposed approach requires less computational resources in terms of time and memory requirements while generating a long sequence of patterns that have the highest co-occurrence. Furthermore, k-item patterns can be obtained thanks to the sub-lattice property of the MASPs. In addition, the algorithm produces a tree of the detected patterns; this tree can assist decision makers for visual analysis of data. The outcome of the algorithm implemented is illustrated using traffic accident data. The proposed approach has a potential to be utilized in big data analytics.

References

[1]
Agrawal, A., Srikant R. (1994). "Fast algorithms for mining association rules." In Proceedings of the 20th VLDB conference. Santiago, Chile.
[2]
M.Z. Ashrafi, D. Taniar, K. Smith, A new approach of eliminating redundant association rules, database and expert systems applications, Lecture Notes in Computer Science (2004) 465-474.
[3]
M.F. Ashrafi, D. Taniar, K. Smith, Redundant association rules reduction techniques, AI 2005: Advances in artificial intelligence, Lecture Notes in Computer Science (2005) 254-263.
[4]
Bayardo, R. Frequent itemset mining dataset repository. <http://www.cs.rpi.edu/~zaki/Workshops/FIMI/data/> (accessed October 2014).
[5]
Burdick, D., Calimlim, M., Gehrke J. (2001). "MAFIA: A maximal frequent itemset algorithm for transactional databases." In Proceedings of the 17th international conference on data engineering. Heidelberg, Germany (pp. 443-452).
[6]
K. Buza, Feedback prediction for blogs, in: Data analysis, machine learning and knowledge discovery, Springer International Publishing, 2014, pp. 145-152.
[7]
J. Cheng, Y. Ke, W. Ng, Effective elimination of redundant association rules, Data Mining and Knowledge Discovery, 16 (2008) 221-249.
[8]
C.-H. Chen, G.-C. Lan, Z.-P. Hong, Y.-K. Lin, Mining high coherent association rules with consideration of support measure, Expert Systems with Applications, 40 (2013) 6531-6537.
[9]
C.M. Chen, P. Tsai, Mining interesting association rules from customer databases and transaction databases, Information Systems, 29 (2004) 685-696.
[10]
A. Das, W. Ng, Y. Woon, Rapid association rule mining, CIKM'01, ACM, Atlanta, Georgia, USA, 2001.
[11]
Z.-H. Deng, Fast mining top-rank-k frequent patterns by using node-lists, Expert Systems with Applications, 41 (2014) 1763-1768.
[12]
Denwattana, N., Getta, J. R. (2001). A parameterised algorithm for mining association rules, In The 12th Australasian dataset conference (pp. 45-51).
[13]
Do, T. D., Hui, S. C., Fong A. (2003). Mining frequent itemsets with category-based constraints." In 6th International conference discovery science, lecture notes in computer science. Sapporo, Japan (pp. 76-86).
[14]
Donepudi, H. (2013). "Detection of interesting traffic accident patterns by association rule mining" (thesis). Baton Rouge, LA, USA: Computer Science and Engineering, Louisiana State University.
[15]
F. Guillet, H.J. Hamilton, Springer, 2007.
[16]
J. Han, y. Fu, Mining multiple-level association rules in large databases, IEEE Transactions on Knowledge and Data Engineering, 11 (1999) 798-805.
[17]
J. Han, J. Pei, Y. Yin, R. MAO, Mining frequent patterns without candidate generation: A frequent-pattern tree approach, Data Mining and Knowledge Discovery, 8 (2004) 53-87.
[18]
Heravi, M. J., Zaïane, O. R. (2010). A study on interestingness measures for associative classifiers. In 2010 ACM symposium on applied computing (SAC). Sierre, Switzerland.
[19]
F. Herrera, C.J. Carmona, P. González, M.J. Jesus, An overview on subgroup discovery: Foundations and applications, Knowledge and Information Systems, 29 (2011) 495-525.
[20]
T. Hong, C. Horng, C. Wu, S. Wang, An improved data mining approach using predictive itemsets, Expert Systems with Applications, 36 (2009) 72-80.
[21]
Z. Jin, R. Wang, H. Huang, Y. Hu, Efficient interesting association rule mining based on causal criterion using feature selection, Journal of Information & Computational Science, 11 (2014) 4393-4403.
[22]
A. Király, A. Laiho, J. Abonyi, A. Gyenesei, Novel techniques and an efficient algorithm for closed pattern mining, Expert Systems with Applications, 41 (2014) 5105-5114.
[23]
W. Klbsgen, Explora: a multipattern and multistrategy discovery assistant, Advances in Knowledge Discovery and Data Mining (1996) 249-271.
[24]
Klemettinen, M., Heikki, M., Ronkainen, P., Toivonen, H. Verkamo, I. (1994). Finding interesting rules from large sets of discovered association rules. CIKM'94 Proceedings of the third international conference on Information and knowledge management. Gaithersburg, MD, USA (pp. 401-407).
[25]
S. Kotsiantis, D. Kanellopoulos, Association rules mining: A recent overview, GESTS International Transactions on Computer Science and Engineering, 32 (2006) 71-82.
[26]
D. Lo, S.C. Khoo, L. Wong, Non-redundant sequential rules - Theory and algorithm, Information Systems, 34 (2009) 438-453.
[27]
R. Marukatat, Structure-based rule selection framework for association rule mining of traffic accident data, in: Computational Intelligence and Security, Springer, 2006, pp. 231-239.
[28]
E.R. Omiecinski, Alternative interest measures for mining associations in databases, IEEE Transactions on Knowledge and Data Engineering, 15 (2003) 57-69.
[29]
Pasquier, N., Bastide, Y., Taouil, R. Lakhal, L. (1998). Discovering frequent closed itemsets for association rules. In ICDT'99, International conference on database theory, lecture notes in computer science (pp. 398-416).
[30]
Gwangbum. Pyun, Unil Yun, Mining top-k frequent patterns with combination reducing techniques, Applied Intelligence, 41 (2014) 76-98.
[31]
A.Y. Rodríguez-González, J.F. Martínez-Trinidad, J.A. Carrasco-Ochoa, Mining frequent patterns and association rules using similarities, Expert Systems with Applications, 40 (2013) 6823-6836.
[32]
Sigal. Sahar, Interestingness measures - On determining what is, in: Data mining and knowledge discovery handbook, Springer, 2010, pp. 603-612.
[33]
B. Strack, J.P. DeShazo, C. Gennings, J.L. Olmo, S. Ventura, K.J. Cios, Impact of HbA1c measurement on hospital readmission rates: Analysis of 70,000 clinical database patient records, BioMed Research International, 2014 (2014).
[34]
Tan, P. N., Kumar, V., Srivastava J. (2002). Selecting the right interestingness measure for association patterns. In: KDD'02 Proceedings of the eighth ACM SIGKDD international conference on knowledge discovery and data mining. New York.
[35]
Team, R. (2008). Development core, R: A language and environment for, Vienna: R foundation for statistical computing.
[36]
F.-C. Tseng, Mining frequent itemsets in large databases: The hierarchical partitioning approach, Expert Systems with Applications, 40 (2013) 1654-1661.
[37]
B. Vo, F. Coenen, B. Le, A new method for mining frequent weighted itemsets based on WIT-trees, Expert Systems with Applications, 40 (2013) 1256-1264.
[38]
G.I. Webb, X. Yu, Discovering interesting association rules by clustering, in: Advances in artificial intelligence, Springer, 2004, pp. 1055-1061.
[39]
G. Williams, Rattle: A data mining GUI for R, The R Journal, 1 (2009) 45-55.
[40]
M. Wojciechowski, M. Zakrzewicz, Dataset filtering techniques in constraint-based frequent pattern mining, in: ESF exploratory workshop on pattern detection and discovery, lecture notes in computer science, Springer, 2002, pp. 77-91.
[41]
Y. Xu, Y. Li, G. Shaw, Reliable representations for association rules, Data & Knowledge Engineering, 70 (2011) 555-575.

Cited By

View all
  1. Association rule mining with mostly associated sequential patterns

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image Expert Systems with Applications: An International Journal
      Expert Systems with Applications: An International Journal  Volume 42, Issue 5
      April 2015
      563 pages

      Publisher

      Pergamon Press, Inc.

      United States

      Publication History

      Published: 01 April 2015

      Author Tags

      1. Association rule mining
      2. Big data
      3. Data mining
      4. Interesting rules
      5. Knowledge discovery
      6. Pattern recognition

      Qualifiers

      • Research-article

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • Downloads (Last 12 months)0
      • Downloads (Last 6 weeks)0
      Reflects downloads up to 18 Feb 2025

      Other Metrics

      Citations

      Cited By

      View all
      • (2024)Sequence aware recommenders for fashion E-commerceElectronic Commerce Research10.1007/s10660-022-09627-824:4(2733-2753)Online publication date: 1-Dec-2024
      • (2023)A comprehensive review of visualization methods for association rule miningExpert Systems with Applications: An International Journal10.1016/j.eswa.2023.120901233:COnline publication date: 15-Dec-2023
      • (2019)CD-CARSExpert Systems with Applications: An International Journal10.1016/j.eswa.2019.06.020135:C(388-409)Online publication date: 30-Nov-2019
      • (2019)Significance-based discriminative sequential pattern miningExpert Systems with Applications: An International Journal10.1016/j.eswa.2018.12.046122:C(54-64)Online publication date: 15-May-2019
      • (2018)OOIMASPExpert Systems with Applications: An International Journal10.1016/j.eswa.2017.10.01593:C(62-71)Online publication date: 1-Mar-2018
      • (2016)WARM Based Data Pre-fetching and Cache Replacement Strategies for Location Dependent Information System in Wireless EnvironmentWireless Personal Communications: An International Journal10.1007/s11277-016-3425-390:4(1811-1842)Online publication date: 1-Oct-2016
      • (2016)A sparse memory allocation data structure for sequential and parallel association rule miningThe Journal of Supercomputing10.1007/s11227-015-1566-x72:2(347-370)Online publication date: 1-Feb-2016
      • (2016)SPSRGCluster Computing10.1007/s10586-016-0633-219:4(1703-1721)Online publication date: 1-Dec-2016
      • (2015)UTARMInternational Journal of Knowledge Engineering and Data Mining10.1504/IJKEDM.2015.0712933:2(208-237)Online publication date: 1-Aug-2015

      View Options

      View options

      Figures

      Tables

      Media

      Share

      Share

      Share this Publication link

      Share on social media