Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/1835804.1835839acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

UP-Growth: an efficient algorithm for high utility itemset mining

Published: 25 July 2010 Publication History

Abstract

Mining high utility itemsets from a transactional database refers to the discovery of itemsets with high utility like profits. Although a number of relevant approaches have been proposed in recent years, they incur the problem of producing a large number of candidate itemsets for high utility itemsets. Such a large number of candidate itemsets degrades the mining performance in terms of execution time and space requirement. The situation may become worse when the database contains lots of long transactions or long high utility itemsets. In this paper, we propose an efficient algorithm, namely UP-Growth (Utility Pattern Growth), for mining high utility itemsets with a set of techniques for pruning candidate itemsets. The information of high utility itemsets is maintained in a special data structure named UP-Tree (Utility Pattern Tree) such that the candidate itemsets can be generated efficiently with only two scans of the database. The performance of UP-Growth was evaluated in comparison with the state-of-the-art algorithms on different types of datasets. The experimental results show that UP-Growth not only reduces the number of candidates effectively but also outperforms other algorithms substantially in terms of execution time, especially when the database contains lots of long transactions.

Supplementary Material

JPG File (kdd2010_wu_uge_01.jpg)
MOV File (kdd2010_wu_uge_01.mov)

References

[1]
R. Agrawal and R. Srikant. Fast algorithms for mining association rules. In Proc. of the 20th Int'l Conf. on Very Large Data Bases, pp. 487--499, 1994.
[2]
C. F. Ahmed, S. K. Tanbeer, B.-S. Jeong, and Y.-K. Lee. Efficient tree structures for high utility pattern mining in incremental databases. In IEEE Transactions on Knowledge and Data Engineering, Vol. 21, Issue 12, pp. 1708--1721, 2009.
[3]
R. Chan, Q. Yang, and Y. Shen. Mining high utility itemsets. In Proc. of Third IEEE Int'l Conf. on Data Mining, pp. 19--26, Nov., 2003.
[4]
A. Erwin, R. P. Gopalan, and N. R. Achuthan. Efficient mining of high utility itemsets from large datasets. In Proc. of PAKDD 2008, LNAI 5012, pp. 554--561.
[5]
J. Han, J. Pei, and Y. Yin. Mining frequent patterns without candidate generation. In Proc. of the ACM-SIGMOD Int'l Conf. on Management of Data, pp. 1--12, 2000.
[6]
Y.-C. Li, J.-S. Yeh, and C.-C. Chang. Isolated items discarding strategy for discovering high utility itemsets. In Data & Knowledge Engineering, Vol. 64, Issue 1, pp. 198--217, Jan., 2008.
[7]
Y. Liu, W. Liao, and A. Choudhary. A fast high utility itemsets mining algorithm. In Proc. of the Utility-Based Data Mining Workshop, 2005.
[8]
B.-E. Shie, V. S. Tseng, and P. S. Yu. Online mining of temporal maximal utility itemsets from data streams. In Proc. of the 25th Annual ACM Symposium on Applied Computing, Switzerland, Mar., 2010.
[9]
H. Yao, H. J. Hamilton, L. Geng, A unified framework for utility-based measures for mining itemsets. In Proc. of ACM SIGKDD 2nd Workshop on Utility-Based Data Mining, pp. 28--37, USA, Aug., 2006.
[10]
S.-J. Yen and Y.-S. Lee. Mining high utility quantitative association rules. In Proc. of 9th Int'l Conf. on Data Warehousing and Knowledge Discovery, Lecture Notes in Computer Science 4654, pp. 283--292, Sep., 2007.
[11]
Frequent itemset mining implementations repository, http://fimi.cs.helsinki.fi/

Cited By

View all
  • (2024)Deep Learning Framework with Convolutional Sequential Semantic Embedding for Mining High-Utility Itemsets and Top-N RecommendationsJournal of information and communication convergence engineering10.56977/jicce.2024.22.1.4422:1(44-55)Online publication date: 31-Mar-2024
  • (2024)AN EFFICIENT ALGORITHM FOR MINING HIGH UTILITY ITEMSETSVinh University Journal of Science10.56824/vujs.2023a14753:2A(56-72)Online publication date: 20-Jun-2024
  • (2024)Mining Top-k High On-shelf Utility Itemsets Using Novel Threshold Raising StrategiesACM Transactions on Knowledge Discovery from Data10.1145/364511518:5(1-23)Online publication date: 8-Feb-2024
  • Show More Cited By

Index Terms

  1. UP-Growth: an efficient algorithm for high utility itemset mining

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    KDD '10: Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
    July 2010
    1240 pages
    ISBN:9781450300551
    DOI:10.1145/1835804
    Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

    Sponsors

    Publisher

    Association for Computing Machinery

    New York, NY, United States

    Publication History

    Published: 25 July 2010

    Permissions

    Request permissions for this article.

    Check for updates

    Author Tags

    1. candidate pruning
    2. frequent itemset
    3. high utility itemset
    4. utility mining

    Qualifiers

    • Research-article

    Conference

    KDD '10
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)39
    • Downloads (Last 6 weeks)4
    Reflects downloads up to 24 Nov 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (2024)Deep Learning Framework with Convolutional Sequential Semantic Embedding for Mining High-Utility Itemsets and Top-N RecommendationsJournal of information and communication convergence engineering10.56977/jicce.2024.22.1.4422:1(44-55)Online publication date: 31-Mar-2024
    • (2024)AN EFFICIENT ALGORITHM FOR MINING HIGH UTILITY ITEMSETSVinh University Journal of Science10.56824/vujs.2023a14753:2A(56-72)Online publication date: 20-Jun-2024
    • (2024)Mining Top-k High On-shelf Utility Itemsets Using Novel Threshold Raising StrategiesACM Transactions on Knowledge Discovery from Data10.1145/364511518:5(1-23)Online publication date: 8-Feb-2024
    • (2024)Toward Correlated Sequential RulesIEEE Transactions on Artificial Intelligence10.1109/TAI.2024.34293065:10(5340-5351)Online publication date: Oct-2024
    • (2024)Incremental Top-k High Utility Pattern Mining and Analyzing Over the Entire Accumulated Dynamic DatabaseIEEE Access10.1109/ACCESS.2024.340656212(77605-77620)Online publication date: 2024
    • (2024)Incremental high average-utility itemset mining: survey and challengesScientific Reports10.1038/s41598-024-60279-014:1Online publication date: 30-Apr-2024
    • (2024)Improved adaptive-phase fuzzy high utility pattern mining algorithm based on tree-list structure for intelligent decision systemsScientific Reports10.1038/s41598-023-50375-y14:1Online publication date: 10-Jan-2024
    • (2024)IPHM: Incremental Periodic High-Utility Mining Algorithm in Dynamic and Evolving Data EnvironmentsHeliyon10.1016/j.heliyon.2024.e37761(e37761)Online publication date: Sep-2024
    • (2024)MLC-miner: Efficiently discovering multi-level closed high utility patterns from quantitative hierarchical transaction databasesExpert Systems with Applications10.1016/j.eswa.2024.124383254(124383)Online publication date: Nov-2024
    • (2024)An efficient method for mining High-Utility itemsets from unstable negative profit databasesExpert Systems with Applications10.1016/j.eswa.2023.121489237(121489)Online publication date: Mar-2024
    • Show More Cited By

    View Options

    Login options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media