Nothing Special   »   [go: up one dir, main page]

skip to main content
research-article

Mining colossal patterns with length constraints

Published: 01 December 2021 Publication History

Abstract

Mining of colossal patterns is used to mine patterns in databases with many attributes and values, but the number of instances in each database is small. Although many efficient approaches for extracting colossal patterns have been proposed, they cannot be applied to colossal pattern mining with constraints. In this paper, we solve the challenge of extracting colossal patterns with length constraints. Firstly, we describe the problems of min-length constraint and max-length constraint and combine them with length constraints. After that, we evolve a proposal for efficiently truncating candidates in the mining process and another one for fast checking of candidates. Based on these properties, we offer the mining algorithm of Length Constraints for Colossal Pattern (LCCP) to extract colossal patterns with length constraints. Experiments are also conducted to show the effectiveness of the proposed LCCP algorithm with a comparison to some other ones.

References

[1]
Telikani A, Gandomi A, and Shahbahrami A A survey of evolutionary computation for association rule mining Inf Sci 2020 524 318-352
[2]
Shao Y, Liu B, Wang S, and Li G Software defect prediction based on correlation weighted class association rule mining Knowl Based Syst 2020 196 105742
[3]
Alibasa M, Calvo R, and Yacef K Sequential pattern mining suggests wellbeing supportive behaviors IEEE Access 2019 7 130133-130143
[4]
Huynh B, Trinh C, Huynh H, Van T, Vo B, and Snásel V An efficient approach for mining sequential patterns using multiple threads on very large databases Eng Appl of AI 2018 74 242-251
[5]
Fournier-Viger P, Yang Y, Yang P, Lin J, Yun U (2020) TKE: Mining Top-K frequent Episodes, in IEA/AIE 2020: 832–845
[6]
Smedt J, Deeva G, and Weerdt J Mining behavioral sequence constraints for classification IEEE Trans Knowl Data Eng 2020 32 6 1130-1142
[7]
Zou H Clustering algorithm and its application in data mining Wirel Pers Commun 2020 110 1 21-30
[8]
Astrova I, Koschel A, and Lee S Using market basket analysis to find semantic duplicates in ontology ICCSA 2020 4 197-211
[9]
Hagen M, Stein B (2018) Weblog analysis, in Encyclopedia of Social Network Analysis and Mining. 2nd Ed.
[10]
Littmann M, Goldberg T, Seitz S, Bodén M, and Rost B Detailed prediction of protein sub-nuclear localization BMC Bioinformatics 2019 20 1 205:1-205:15
[11]
Dessouky M, Taha E, Dessouky M, Eltholth A, Hassan E, and El-Samie F Non-parametric spectral estimation techniques for DNA sequence analysis and exon region prediction Comput Electric Eng 2019 73 334-348
[12]
Kumar D, Sharma D (2019) Deep learning in gene expression modeling, in Handbook of Deep Learning Applications, pp. 363–383
[13]
Bachman J, Gyori B, and Sorger P FamPlex: a resource for entity recognition and relationship resolution of human protein families and complexes in biomedical text mining BMC Bioinform 2018 19 1 248:1-248:14
[14]
Deng N, Chen X, Li D, and Xiong C Frequent patterns mining in DNA sequence IEEE Access 2019 7 108400-108410
[15]
Lin J, Yang L, Fournier-Viger P, and Hong T Mining of skyline patterns by considering both frequent and utility constraints Eng Appl AI 2019 77 229-238
[16]
Sohrabi M and Barforoush A Efficient colossal pattern mining in high dimensional datasets Knowl-Based Syst 2012 33 41-52
[17]
Zhu F, Yan X, Han J, Yu P, Cheng H (2007) Mining colossal frequent patterns by core pattern fusion. ICDE’07, pp. 706–715
[18]
Dabbiru M and Shashi M An efficient approach to colossal pattern mining Int J Comput Sci Network Security 2010 6 304-312
[19]
Prasanna K, Seetha M (2015) A doubleton pattern mining approach for discovering colossal patterns from biological dataset. Int J Comput Appl 119(21)
[20]
Prasanna K and Seetha M Efficient and accurate discovery of colossal pattern sequences from biological datasets: A doubleton pattern mining strategy (DPMine) IMCIP 2015 54 412-421
[21]
Nguyen T, Vo B, and Snásel V Efficient algorithms for mining colossal patterns in high dimensional databases Knowl-Based Syst 2017 122 75-89
[22]
Van T, Vo B, and Le B Mining sequential patterns with itemset constraints Knowl Inf Syst 2018 57 2 311-330
[23]
Le T, Nguyen A, Huynh B, Vo B, and Pedrycz W Mining constrained inter-sequence patterns: a novel approach to cope with item constraints Appl Intell 2018 48 5 1327-1343
[24]
Nguyen D, Nguyen L, Vo B, and Pedrycz W Efficient mining of class association rules with the itemset constraint Knowl-Based Syst 2016 103 73-88
[25]
Bessiere C, Lazaar N, Lebbah Y, M. M. (2018) Users constraints in itemset mining, CoRR abs/1801.00345
[26]
Nguyen D, Nguyen L, Vo B, and Hong T A novel method for constrained class association rule mining Inf Sci 2015 320 107-125
[27]
Vo B, Le T, Pedrycz W, Nguyen G, and Baik S Mining erasable itemsets with subset and superset itemset constraints Expert Syst Appl 2017 69 50-61
[28]
Nguyen T, Bay V, Huynh B, Snasel V, Nguyen L (2017) Constraint-based method for mining colossal patterns in high dimensional databases, in Information Systems Architecture and Technology - ISAT, Advances in Intelligent Systems and Computing, pp. 195–204
[29]
Zulkurnain N (2012) DisClose : discovering colossal closed itemsets from high dimensional datasets via a compact row-tree. University of Manchester
[30]
Vanahalli M and Patil N An efficient parallel row enumerated algorithm for mining frequent colossal closed itemsets from high dimensional datasets Inf Sci 2019 496 343-362
[31]
Zaki FM and Zulkurnain N RARE: mining colossal closed itemset in high dimensional data Knowl-Based Syst 2018 161 1-11
[32]
Vanahalli M and Patil N Distributed mining of significant frequent colossal closed itemsets from long biological dataset ISDA 2018 1 891-902
[33]
Hosseininasab A, Hoeve W-J, and Ciré A Constraint-based sequential pattern mining with decision diagrams AAAI 2019 33 1495-1502
[34]
Abeysinghe R and Cui L Query constraint based mining of association rules for exploratory analysis of clinical datasets in the National Sleep Research Resource BMC Med Inf Decis Making 2018 18 S-2 89-100
[35]
Belaid M, Bessiere C, Lazaar N (2019) Constraint programming for association rules. SDM:127–135
[36]
Van T, Yoshitaka A, and Le B Mining web access patterns with super-pattern constraint Appl Intell 2018 48 11 3902-3914
[37]
Song W, Cai K, Zhang M, and Yuen C Codes with run-length and GC-content constraints for DNA-based data storage IEEE Commun Lett 2018 22 10 2004-2007
[38]
Singh K and Bhaskar Biswas B Efficient algorithm for mining high utility pattern considering length constraints Int J Data Warehous Min 2019 15 3 1-27
[39]
Wu Y, Fan J, Li Y, Guo L, and Wu X NetDAP: (δ, γ) - approximate pattern matching with length constraints Appl Intell 2020 50 11 4094-4116

Index Terms

  1. Mining colossal patterns with length constraints
        Index terms have been assigned to the content through auto-classification.

        Recommendations

        Comments

        Please enable JavaScript to view thecomments powered by Disqus.

        Information & Contributors

        Information

        Published In

        cover image Applied Intelligence
        Applied Intelligence  Volume 51, Issue 12
        Dec 2021
        516 pages

        Publisher

        Kluwer Academic Publishers

        United States

        Publication History

        Published: 01 December 2021
        Accepted: 13 March 2021

        Author Tags

        1. Colossal pattern
        2. Data mining
        3. High-dimensional database
        4. Length constraints

        Qualifiers

        • Research-article

        Contributors

        Other Metrics

        Bibliometrics & Citations

        Bibliometrics

        Article Metrics

        • 0
          Total Citations
        • 0
          Total Downloads
        • Downloads (Last 12 months)0
        • Downloads (Last 6 weeks)0
        Reflects downloads up to 20 Dec 2024

        Other Metrics

        Citations

        View Options

        View options

        Media

        Figures

        Other

        Tables

        Share

        Share

        Share this Publication link

        Share on social media