Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

A novel approach for mining maximal frequent patterns

Published: 01 May 2017 Publication History

Abstract

An N-list structure is used to compress the dataset for mining Maximal Frequent Patterns.A pruning technique is then proposed and used in INLA-MFP algorithm to improve the runtime and memory usage.Experiments were conducted to show that INLA-MFP outperforms well-known algorithms. Mining maximal frequent patterns (MFPs) is an approach that limits the number of frequent patterns (FPs) to help intelligent systems operate efficiently. Many approaches have been proposed for mining MFPs, but the complexity of the problem is enormous. Therefore, the run time and memory usage are still large. Recently, the N-list structure has been proposed and verified to be very effective for mining FPs, frequent closed patterns, and top-rank-k FPs. Therefore, this paper uses the N-list structure for mining MFPs. A pruning technique is also proposed to prune branches to reduce the search space. This technique is applied to an algorithm called INLA-MFP (improved N-list-based algorithm for mining maximal frequent patterns) for mining MFPs. Experiments were conducted to evaluate the effectiveness of the proposed algorithm. The experimental results show that INLA-MFP outperforms two state-of-the-art algorithms for mining MFPs.

References

[1]
R.C. Agarwal, C.C. Aggarwal, V.V.V. Prasad, Depth first generation of long patterns, in: KDD00, 2000, pp. 108-118.
[2]
R. Agrawal, T. Imielinski, A.N. Swami, Mining association rules between sets of items in large databases, in: SIGMOD93, 1993, pp. 207-216.
[3]
R. Agrawal, R. Srikant, Fast algorithms for mining association rules, in: VLDB'94, 1994, pp. 487-499.
[4]
D. Burdick, M. Calimlim, J. Flannick, J. Gehrke, T. Yiu, MAFIA: A Maximal Frequent Itemset Algorithm, IEEE IEEE Transactions on Knowledge and Data Engineering, 17 (2005) 1490-1504.
[5]
Z.H. Deng, X. Xu, Fast mining erasable itemsets using NC_sets, Expert Systems with Applications, 39 (2012) 4453-4463.
[6]
Z.H. Deng, Fast mining top-rank-k frequent patterns by using Node-lists, Expert Systems with Applications, 41 (2014) 1763-1768.
[7]
Z.H. Deng, DiffNodesets: An efficient structure for fast mining frequent itemsets, Applied Soft Computing, 41 (2016) 214-223.
[8]
Z.H. Deng, S.L. Lv, Fast mining frequent itemsets using Nodesets, Expert Systems with Applications, 41 (2014) 4505-4512.
[9]
Z.H. Deng, S.L. Lv, PrePost+: An efficient N-lists-based algorithm for mining frequent itemsets via children-parent equivalence pruning, Expert Systems with Applications, 42 (2015) 5424-5432.
[10]
Z.H. Deng, Z. Wang, J.J. Jiang, A new algorithm for fast mining frequent itemsets using N-lists, SCIENCE CHINA Information Sciences, 55 (2012) 2008-2030.
[11]
J. Dong, M. Han, BitTableFI: An efficient mining frequent itemsets algorithm, Knowledge-Based Systems, 20 (2007) 329-335.
[12]
W. Fan, K. Zhang, H. Cheng, J. Gao, X. Yan, J. Han, Direct mining of discriminative and essential frequent patterns via model-based search tree, in: SIGKDD08, 2008, pp. 230-238.
[13]
K. Gouda, M.J. Zaki, GenMax: An efficient algorithm for mining maximal frequent itemsets, Data Mining and Knowledge Discovery, 11 (2005) 223-242.
[14]
G. Grahne, J. Zhu, Fast algorithms for frequent itemset mining using FP-trees, IEEE Transactions on Knowledge and Data Engineering, 17 (2005) 1347-1362.
[15]
J. Han, J. Pei, Y. Yin, Mining frequent patterns without candidate generation, in: SIGMOD00, 2000, pp. 1-12.
[16]
Q. Huynh, T. Le, B. Vo, B. Le, An efficient and effective algorithm for mining top-rank-k frequent patterns, Expert Systems with Applications, 42 (2015) 156-164.
[17]
H.S. Le, A novel kernel fuzzy clustering algorithm for Geo-Demographic Analysis, Information Sciences, 317 (2015) 202-223.
[18]
T. Le, B. Vo, An N-list-based algorithm for mining frequent closed patterns, Expert Systems with Applications, 42 (2015) 6648-6657.
[19]
T. Le, B. Vo, F. Coenen, An efficient algorithm for mining erasable itemsets using the difference of NC-Sets, in: IEEE SMC'13, 2013, pp. 2270-2274.
[20]
B. Liu, W. Hsu, Y. Ma, Integrating classification and association rule mining, in: SIGKDD98, 1998, pp. 80-86.
[21]
X.B. Liu, K. Zhai, W. Pedrycz, An improved association rules mining method, Expert Systems with Applications, 39 (2012) 1362-1374.
[22]
T.T.L. Nguyen, N.T. Nguyen, Updating mined class association rules for record insertion, Applied Intelligence, 42 (2015) 707-721.
[23]
G. Pyun, U. Yun, Mining top-k frequent patterns with combination reducing techniques, Applied Intelligence, 41 (2014) 76-98.
[24]
B. Vo, T. Le, F. Coenen, T.P. Hong, Mining frequent itemsets using the N-list and subsume concepts, International Journal of Machine Learning and Cybernetics, 7 (2016) 253-265.
[25]
H. Xiao, F. Jing, K. Bettina, M.T. Son, C. Plant, Relevant overlaping subspace clusters on categorical data, in: KDD14, 2014, pp. 213-222.
[26]
M.J. Zaki, C.J. Hsiao, Efficient algorithms for mining closed itemsets and their lattice structure, IEEE Transactions on Knowledge and Data Engineering, 17 (2005) 462-478.

Cited By

View all
  • (2024)Co-occurrence Order-preserving Pattern Mining with Keypoint Alignment for Time SeriesACM Transactions on Management Information Systems10.1145/365845015:2(1-27)Online publication date: 13-Apr-2024
  • (2024)X-FSPMiner: A Novel Algorithm for Frequent Similar Pattern MiningACM Transactions on Knowledge Discovery from Data10.1145/364382018:5(1-26)Online publication date: 30-Jan-2024
  • (2024)MMC: efficient and effective closed high-utility itemset miningThe Journal of Supercomputing10.1007/s11227-024-06224-480:13(18900-18918)Online publication date: 1-Sep-2024
  • Show More Cited By
  1. A novel approach for mining maximal frequent patterns

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image Expert Systems with Applications: An International Journal
    Expert Systems with Applications: An International Journal  Volume 73, Issue C
    May 2017
    98 pages

    Publisher

    Pergamon Press, Inc.

    United States

    Publication History

    Published: 01 May 2017

    Author Tags

    1. Data mining
    2. Maximal frequent patterns
    3. N-list structure
    4. Pattern mining
    5. Pruning technique

    Qualifiers

    • Research-article

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)0
    • Downloads (Last 6 weeks)0
    Reflects downloads up to 19 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Co-occurrence Order-preserving Pattern Mining with Keypoint Alignment for Time SeriesACM Transactions on Management Information Systems10.1145/365845015:2(1-27)Online publication date: 13-Apr-2024
    • (2024)X-FSPMiner: A Novel Algorithm for Frequent Similar Pattern MiningACM Transactions on Knowledge Discovery from Data10.1145/364382018:5(1-26)Online publication date: 30-Jan-2024
    • (2024)MMC: efficient and effective closed high-utility itemset miningThe Journal of Supercomputing10.1007/s11227-024-06224-480:13(18900-18918)Online publication date: 1-Sep-2024
    • (2024)nMITP-Miner: An Efficient Method for Mining Frequent Maximal Inter-transaction PatternsComputational Collective Intelligence10.1007/978-3-031-70816-9_27(346-358)Online publication date: 9-Sep-2024
    • (2023)A novel approach to discover frequent weighted subgraphs using the average measureApplied Intelligence10.1007/s10489-023-04501-y53:16(19491-19504)Online publication date: 8-Mar-2023
    • (2022)OWSP-Miner: Self-adaptive One-off Weak-gap Strong Pattern MiningACM Transactions on Management Information Systems10.1145/347624713:3(1-23)Online publication date: 4-Feb-2022
    • (2022)Septic shock prediction and knowledge discovery through temporal pattern miningArtificial Intelligence in Medicine10.1016/j.artmed.2022.102406132:COnline publication date: 1-Oct-2022
    • (2022)An efficient and scalable approach for mining subgraphs in a single large graphApplied Intelligence10.1007/s10489-022-03164-552:15(17881-17895)Online publication date: 1-Dec-2022
    • (2021)MaxRI: A method for discovering maximal rare itemsets2021 4th International Conference on Data Science and Information Technology10.1145/3478905.3478972(334-341)Online publication date: 23-Jul-2021
    • (2021)GrAFCI+ A fast generator-based algorithm for mining frequent closed itemsetsKnowledge and Information Systems10.1007/s10115-021-01575-363:7(1873-1908)Online publication date: 1-Jul-2021
    • Show More Cited By

    View Options

    View options

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media