Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3357254.3357271acmotherconferencesArticle/Chapter ViewAbstractPublication PagesaiprConference Proceedingsconference-collections
research-article

A lock-free algorithm of tree-based reduction for large scale clustering on GPGPU

Published: 16 August 2019 Publication History

Abstract

Recently, the art of concurrency and parallelism has been advanced rapidly. However, conventional techniques still suffer of the drawback of lock contention. To name a few, atomic instruction has relatively low scalability as the number of iterations are increasing. This causes a serious slowdown when programmer cope with large-scale data mining processing such as clustering billions of data with numerous iterations. This paper proposes a Lock-free technique of tree-based reduction for large scale clustering on GPGPU. Proposal method is divided into two steps: fine reduction and coarse reduction. In the first reduction step, the clustering program obtain K * N intermediate array where K is the number of clusters and N is the number of blocks. In the following step, new mean value is calculated over N blocks. By doing this, the clustering program can evade using atomic instruction which causes lock contention in coping with large scale clusters. In experiment, the performance of native GPU kernel with atomic instruction, Thrust template libraries and proposal method is compared and evaluated.

References

[1]
Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., Uthurasamy, R.: Advances in Knowledge Discovery and Data Mining. AAAI/MIT Press (1996)
[2]
Fukunaga, K., Narendra, P.M.: A branch and bound algorithm for computing k-nearest neighbors. IEEE Trans. Comput. (1975)
[3]
Cheeseman, P., Stutz, J.: Bayesian classification (autoclass): Theory and results. In Fayyad, U.M., Piatetsky-Shapiro, G., Smyth, P., Uthurusamy, R., eds.: Advances in Knowledge Discovery and Data Mining, AAAI/MIT Press (1996)
[4]
Smyth, P., Ghil, M., Ide, K., Roden, J., Fraser, A.: Detecting atmospheric regimes using cross-validated clustering. In Pregibon, D., Uthurusamy, R., eds.: Proceedings Third International Conference on Knowledge Discovery and Data Mining, Newport Beach, CA, AAAI Press (1997)
[5]
Gersho, A., Gray, R.M.: Vector quantization and signal compression. Kluwer Academic Publishers (1992)
[6]
Shaw, C.T., King, G.P.: Using cluster analysis to classify time series. Physica D 58 (1992)
[7]
Dhillon, I.S., Modha, D.S., Spangler, W.S.: Visualizing class structure of multidimensional data. In Weisberg, S., ed.: Proceedings of the 30th Symposium on the Interface: Computing Science and Statistics, Minneapolis, MN. (1998)
[8]
Dhillon, I.S., Modha, D.S., Spangler, W.S.: Visualizing class structure of highdimensional data with applications. Submitted for publication (1999)
[9]
Broder, A.Z., Glassman, S.C., Manasse, M.S., Zweig, G.: Syntactic clustering of the web. Technical Report 1997-015, Digital Systems Research Center (1997)
[10]
Pawel Gajer, Michael T. Goodrich, Stephen G. Kobourov: A multi-dimensional approach to force-directed layouts of large graphs. Comput. Geom. 29(1): 3--18 (2004)
[11]
Agrawal, R., Shafer, J.C.: Parallel mining of association rules: Design, implementation, and experience. IEEE Trans. Knowledge and Data Eng. 8 (1996)
[12]
Chattratichat, J., Darlington, J., Ghanem, M., Guo, Y., Huning, H., Kohler, M., Sutiwaraphun, J., To, H.W., Yang, D.: Large scale data mining: Challenges and responses. In Pregibon, D., Uthurusamy, R., eds.: Proceedings Third International Conference on Knowledge Discovery and Data Mining, Newport Beach, CA, AAAI Press (1997)
[13]
Cheung, D.W., Xiao, Y.: Effect of data distribution in parallel mining of associations. Data Mining and Knowledge Discovery (1999) to appear.
[14]
Han, E.H., Karypis, G., Kumar, V.: Scalable parallel data mining for association rules. In: SIGMOD Record: Proceedings of the 1997 ACM-SIGMOD Conference on Management of Data, Tucson, AZ, USA. (1997)
[15]
D. Judd, P. K. McKinley, and A. K. Jain, Large-Scale Parallel Data Clustering, in Proceedings of the International Conference on Pattern Recognition (ICPR1996) Volume IV, 1996.
[16]
I. S. Dhillon and D. S. Modha, Data-Clustering Algorithm on Distributed Memory Multiprocessors, in Revised Papers from
[17]
Large-Scale Parallel Data Mining, Workshop on Large-Scale Parallel KDD Systems, SIGKDD: Springer-Verlag, 2000.
[18]
Clemens Lutz, Sebastian Breß, Tilmann Rabl, Steffen Zeuch, Volker Markl, "Efficient and Scalable k-Means on GPUs", Datenbank-Spektrum 18(3): 157--169 (2018)
[19]
A Study of the Behavior of Synchronization Methods in Commonly Used Languages and Systems. IPDPS 2013: 1309--1320
[20]
Mario Zechner, Michael Granitzer: Accelerating K-Means on the Graphics Processor via CUDA. INTENSIVE 2009:7--15
[21]
Yunlong Xu, Lan Gao, Rui Wang, Zhongzhi Luan, Weiguo Wu, Depei Qian: Lock-based synchronization for GPU architectures. Conf Computing Frontiers2016:205--213
[22]
Yunlong Xu, Rui Wang, Nilanjan Goswami, Tao Li, Depei Qian: Software Transactional Memory for GPU Architectures. Computer Architecture Letters 13(1): 49--52 (2014)
[23]
Jade Alglave, Mark Batty, Alastair F. Donaldson, Ganesh Gopalakrishnan, Jeroen Ketema, Daniel Poetzl, Tyler Sorensen, John Wickerson: GPU Concurrency: Weak Behaviours and Programming Assumptions. ASPLOS 2015: 577--591
[24]
Nathan R. Tallent, John M. Mellor-Crummey, Allan Porterfield: Analyzing lock contention in multithreaded applications. PPOPP 2010: 269--280
[25]
Pan, X., Klaftenegger, D. & Jonsson, B. (2015). Forecasting Lock Contention Before Adopting Another Lock Algorithm.
[26]
Daniel Cederman, Bapi Chatterjee, Nhan Nguyen Dang, Yiannis Nikolakopoulos, Marina Papatriantafilou, Philippas Tsigas: A Study of the Behavior of Synchronization Methods in Commonly Used Languages and Systems. IPDPS 2013: 1309--1320.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
AIPR '19: Proceedings of the 2nd International Conference on Artificial Intelligence and Pattern Recognition
August 2019
198 pages
ISBN:9781450372299
DOI:10.1145/3357254
  • Conference Chairs:
  • Li Ma,
  • Xu Huang
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 August 2019

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. CUDA
  2. GPGPU
  3. k-means
  4. large scale unsupervised clustering
  5. lock-free algorithm
  6. tree-based reduction

Qualifiers

  • Research-article

Conference

AIPR 2019

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 50
    Total Downloads
  • Downloads (Last 12 months)4
  • Downloads (Last 6 weeks)1
Reflects downloads up to 20 Nov 2024

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media