Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3458744.3473365acmotherconferencesArticle/Chapter ViewAbstractPublication PagesicppConference Proceedingsconference-collections
research-article

Towards Faster Execution of Ensemble ML Bootstrap Based Techniques

Published: 23 September 2021 Publication History

Abstract

Algorithms for ensemble methods (EM) based on bootstrap aggregation often perform copious amount of redundant computations (RC) thus limiting their practicality. Given this constraint, we propose a framework that views these algorithms as a collection of computational units (cu), a tightly coupled set of both mathematical operations and data. This view facilitates a reduction in RC (RRC), thereby allowing for faster execution plans. Inspired by the floor tiling approach in VLSI, we look to engineer solutions for RRC while possibly reconfiguring the underlying computing system’s compiler technology stack. We start by showing that under the assumption that the computational system has unbounded but finite memory (i.e., the memory is large enough to hold all intermediate values) and that each   has a uniform cost, our approach reduces to a well-studied directed bandwidth problem for the directed acyclic graphs (DAGs). Next, we consider a more realistic scenario where the computing system has limited memory and concurrent execution while still assuming a uniform cost. Using a new notion of (r, s) set cover of a DAG (nodes representing computational units and edges representing their interdependencies) we formulate the problem of reducing redundant computational steps in EM as a variation of a directed bandwidth problem. We show that the graph’s minimum bandwidth is closely related to memory requirements for studying RRC. Finally, our preliminary experimental results are supportive of the proposed approach for RRC and promising that it can be applied to a broader set of algorithms in decision sciences.

References

[1]
G. M. Amdahl. 1967. Validity of the Single Processor Approach to Achieving Large Scale Computing Capabilities. In Proceedings of the April 18-20, 1967, Spring Joint Computer Conference(AFIPS ’67). Association for Computing Machinery, New York, USA, 483–485.
[2]
M. R. Garey, R. L. Graham, D. S. Johnson, and D. E. Knuth. 1978. Complexity Results for Bandwidth Minimization. SIAM J. Appl. Math. 34, 3 (1978), 477–495.
[3]
V. B. Gavirangaswamy. 2018. High Performance Computing Techniques for Analyzing Risky Decision Making. PhD Dissertation, Western Michigan University. (2018).
[4]
V. B. Gavirangaswamy and A. Gupta. 2017. CPU-GPU implementation of ensemble clustering algorithm for increased performance. In 2017 Int. Conf. on Intelligent Communication and Computational Techniques (ICCT). 262–271.
[5]
V. B. Gavirangaswamy, A. Gupta, and A. Gupta. 2016. A parallel implementation of reinforced learning model used in analyzing risky decision making. In 2016 Int. Conf. on High Performance Computing Simulation (HPCS). 940–946.
[6]
V. B Gavirangaswamy and H. M. Saleh. 2021. FTiPSim- Floor Tile Planning Simulator. https://github.com/vinaybabug/ftip_simulator.
[7]
M. I. Gordon, W. Thies, and S. Amarasinghe. 2006. Exploiting Coarse-Grained Task, Data, and Pipeline Parallelism in Stream Programs. SIGOPS Oper. Syst. Rev. 40, 5 (2006), 151–162.
[8]
E. M. Gurari and I. H. Sudborough. 1984. Improved dynamic programming algorithms for bandwidth minimization and the mincut linear arrangement problem. Journal of Algorithms 5, 4 (1984), 531–546.
[9]
P. Jain, L. Kanesh, W. Lochet, S. Saurabh, and R. Sharma. 2019. Exact and Approximate Digraph Bandwidth. In 39th IARCS Annual Conference on Foundations of Software Technology and Theoretical Computer Science (FSTTCS 2019)(Leibniz International Proceedings in Informatics (LIPIcs), Vol. 150), Arkadev Chattopadhyay and Paul Gastin (Eds.). Schloss Dagstuhl–Leibniz-Zentrum fuer Informatik, 18:1–18:15.
[10]
Y-K. Kwok and I. Ahmad. 1999. Static Scheduling Algorithms for Allocating Directed Task Graphs to Multiprocessors. ACM Comput. Surv. 31, 4 (1999), 406–471.
[11]
G. Moore. 1975. Progress In Digital Integrated Electronics. In IEEE International Electron Devices Meeting. 11 – 13.
[12]
J. B Saxe. 1980. Dynamic-programming algorithms for recognizing small-bandwidth graphs in polynomial time. SIAM Journal on Algebraic Discrete Methods 1, 4 (1980), 363–369.
[13]
I. Song, W. Yoon, E. Jang, and S. Choi. 2011. Task Scheduling Algorithm with Minimal Redundant Duplications in Homogeneous Multiprocessor System. In Grid and Distributed Computing, Tai-hoon Kim, Hojjat Adeli, Hyun-seob Cho, Osvaldo Gervasi, Stephen S. Yau, Byeong-Ho Kang, and Javier García Villalba (Eds.). Springer Berlin Heidelberg, Berlin, 238–245.
[14]
L. Zhao, Y. Ren, and K. Sakurai. 2013. Reliable workflow scheduling with less resource redundancy. Parallel Comput. 39, 10 (2013), 567–585.
Index terms have been assigned to the content through auto-classification.

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICPP Workshops '21: 50th International Conference on Parallel Processing Workshop
August 2021
314 pages
ISBN:9781450384414
DOI:10.1145/3458744
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 23 September 2021

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. DAG
  2. automatic redundancy reduction
  3. compiler
  4. dependence analysis
  5. machine learning
  6. parallel processing
  7. theory of computation

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Funding Sources

  • National Institute of General Medical Sciences of the National Institutes of Health
  • Career Enhancement Grant through the Univ. of Rhode Island

Conference

ICPP 2021

Acceptance Rates

Overall Acceptance Rate 91 of 313 submissions, 29%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • 0
    Total Citations
  • 45
    Total Downloads
  • Downloads (Last 12 months)2
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Feb 2025

Other Metrics

Citations

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

HTML Format

View this article in HTML Format.

HTML Format

Figures

Tables

Media

Share

Share

Share this Publication link

Share on social media