Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/2449360.2449367dlproceedingsArticle/Chapter ViewAbstractPublication PagesausdmConference Proceedingsconference-collections
research-article
Free access

Distributed association rule mining with minimum communication overhead

Published: 01 December 2009 Publication History

Abstract

In distributed association rule mining algorithm, one of the major and challenging hindrances is to reduce the communication overhead. Data sites are required to exchange lot of information in the data mining process which may generates massive communication overhead. In this paper we propose an association rule mining algorithm which minimizes the communication overhead among the participating data sites. Instead of transmitting all itemsets and their counts, we propose to transmit a binary vector and count of only frequently large itemsets. Message Passing Interface (MPI) technique is exploited to avoid broadcasting among data sites. Performance study shows that the proposed algorithm performs better than two other well known algorithms known as Fast Distributed Algorithm for Mining Association Rules (FDM) and Count Distribution (CD) in terms of communication overhead.

References

[1]
R. Agrawal and J. C. Shafer (1996): Parallel Mining of Association Rules", Knowledge and Data Engineering, IEEE Transactions on Volume 8, Issue 6, pp. 962--969.
[2]
W. Cheung, J. Han, V. T. Ng, A. W. Fu, Y. Fu (1996): A fast distributed algorithm for mining association rules, 4th International Conference on Parallel and Distributed Information Systems, 18--20 pp. 31--42.
[3]
M. Kantarcioglu, C. Clifton (2004): Privacy-preserving distributed mining of association rules on horizontally partitioned data", Knowledge and Data Engineering IEEE Transaction Volume 16, Issue 9, pp. 1026--1037.
[4]
R. Agrawal and R. Srikant (1994): Fast algorithms for mining association rules, Proceedings of the 20th International Conference on Very Large Data Bases. Santiago, Chile: VLDB, pp. 487--499.
[5]
R. Agrawal and J. Shafer (1996): Parallel Mining of Association Rules: Design, Implementation and Experience, Research Report RJ 10004, IBM Almaden Research Center, San Jose, Calif.
[6]
Argonne National Laboratory (MPI): MPI web pages at Argonne National Laboratory URL: http://www-unix.mcs.anl.gov/mpi.
[7]
Q. Ding, Q. Ding, W. Perrizo (2008): PARM---An Efficient Algorithm to Mine Association Rules From Spatial Data, IEEE transaction on systems, man, and cybernetics -- part B: cybernetics, vol. 38, no. 6.
[8]
K. Sun, F. Bai (2008): Mining Weighted Association Rules without Preassigned Weights, IEEE transactions on knowledge and data engineering, vol. 20, no. 4.
[9]
F. Tao, F. Murtagh, M. Farid (2003): Weighted Association Rule Mining using Weighted Support and Significance Framework, SIGKDD,
[10]
S. J. Yen, Y. S. Lee (2007): Mining High Utility Quantitative Association Rules, Springer Berlin. pp. 283--292.
[11]
J. Han, M. Kamber (2006): Data Mining Concepts and Techniques, Second Edition, Elsevier Inc. ISBN: 13:978-1-55860-901-3.
[12]
P. N. Tan, M. Steinbach, V. Kumar (2006): "Introduction to Data Mining", 1st Edition, ISBN/ISSN: 0321321367, 2006.

Cited By

View all
  • (2016)Appraisal of homogeneous techniques in Distributed Data MiningProceedings of the International Conference on Informatics and Analytics10.1145/2980258.2980266(1-6)Online publication date: 25-Aug-2016
  • (2015)A distributed frequent itemset mining algorithm using Spark for Big Data analyticsCluster Computing10.1007/s10586-015-0477-118:4(1493-1501)Online publication date: 1-Dec-2015

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image DL Hosted proceedings
AusDM '09: Proceedings of the Eighth Australasian Data Mining Conference - Volume 101
December 2009
198 pages
ISBN:9781920682828

Publisher

Australian Computer Society, Inc.

Australia

Publication History

Published: 01 December 2009

Author Tags

  1. MPI
  2. allgather
  3. allreduce
  4. association rule mining
  5. data mining

Qualifiers

  • Research-article

Acceptance Rates

Overall Acceptance Rate 98 of 232 submissions, 42%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)38
  • Downloads (Last 6 weeks)11
Reflects downloads up to 08 Feb 2025

Other Metrics

Citations

Cited By

View all
  • (2016)Appraisal of homogeneous techniques in Distributed Data MiningProceedings of the International Conference on Informatics and Analytics10.1145/2980258.2980266(1-6)Online publication date: 25-Aug-2016
  • (2015)A distributed frequent itemset mining algorithm using Spark for Big Data analyticsCluster Computing10.1007/s10586-015-0477-118:4(1493-1501)Online publication date: 1-Dec-2015

View Options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Login options

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media