Research Article | Open Access
DOI: 10.1145/3694715.3695958

SilvanForge: A Schedule-Guided Retargetable Compiler for Decision Tree Inference

Published: 15 November 2024

Abstract

The proliferation of machine learning, together with the rapid evolution of the hardware ecosystem, has led to a surge in demand for model inference on a variety of hardware. Decision-tree-based models are the most popular models for tabular data. This paper is motivated by the challenges of running inference for these models at peak performance on CPU and GPU targets. Existing solutions are neither portable nor able to achieve the best possible performance on the specific hardware they target, because they do not explore and customize optimization configurations for the target processor and the model being used.
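
As context for the optimization problem, the sketch below shows the computation a decision tree compiler must make fast. It is illustrative Python, not SilvanForge code; the Node layout, field names, and predict_* helpers are assumptions for exposition only.

```python
# Illustrative sketch of decision-tree-ensemble inference (NOT SilvanForge
# code). Each tree is walked root-to-leaf per input row, and leaf values are
# summed across the ensemble, as in gradient boosting.
from dataclasses import dataclass
from typing import Optional

@dataclass
class Node:
    feature: int = -1              # index of the feature compared at this node
    threshold: float = 0.0         # split threshold
    left: Optional["Node"] = None  # None at leaves
    right: Optional["Node"] = None
    value: float = 0.0             # prediction stored at leaf nodes

def predict_row(trees: list[Node], row: list[float]) -> float:
    total = 0.0
    for root in trees:
        node = root
        while node.left is not None:               # descend until a leaf
            if row[node.feature] < node.threshold:
                node = node.left
            else:
                node = node.right
        total += node.value                        # accumulate leaf value
    return total

def predict_batch(trees: list[Node], batch: list[list[float]]) -> list[float]:
    # The nested loops over rows and trees are exactly what an optimizing
    # compiler reorders, tiles, and parallelizes for CPUs and GPUs.
    return [predict_row(trees, row) for row in batch]

# A single depth-1 tree: x[0] < 0.5 ? 1.0 : 2.0
tree = Node(feature=0, threshold=0.5, left=Node(value=1.0), right=Node(value=2.0))
print(predict_batch([tree], [[0.2], [0.9]]))       # [1.0, 2.0]
```
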
We present SilvanForge, a schedule-guided, retargetable compiler for decision-tree-based models that searches over several optimization choices and automatically generates high-performance inference routines for CPUs and GPUs. SilvanForge has two core components. The first is a scheduling language that encapsulates the optimization space, together with techniques to efficiently explore this space. The second is an optimizing, retargetable compiler that can generate code for any specified schedule. SilvanForge's ability to use different data layouts, loop structures, and caching strategies enables it to achieve portable performance across a range of targets.
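
The abstract does not show SilvanForge's actual scheduling language, so the following is a hypothetical, Halide/TVM-style rendering of the kinds of choices it describes (data layouts, loop structure, caching, parallelization). Every name here (Schedule, tile, reorder, layout, cache, parallelize) is invented for illustration and is not SilvanForge's API.

```python
# Hypothetical schedule sketch: a schedule is data the compiler consumes, so
# the same model can be lowered differently for each target.
class Schedule:
    def __init__(self):
        self.directives = []

    def _add(self, *d):
        self.directives.append(d)
        return self

    # Split the batch loop into tiles so each GPU thread block (or CPU
    # thread) handles one tile of rows.
    def tile(self, loop, size):         return self._add("tile", loop, size)
    # Choose the iteration order over (rows, trees), which controls reuse.
    def reorder(self, *loops):          return self._add("reorder", *loops)
    # Pick an in-memory tree layout, e.g. dense array vs. pointer-based.
    def layout(self, kind):             return self._add("layout", kind)
    # Stage hot tree levels or input rows in shared memory / cache.
    def cache(self, what, where):       return self._add("cache", what, where)
    # Bind a loop to a hardware axis (GPU block/thread, CPU thread).
    def parallelize(self, loop, axis):  return self._add("parallel", loop, axis)

# Example: one plausible GPU schedule for a large batch of rows.
s = (Schedule()
     .tile("rows", 256)
     .reorder("row_tile", "trees", "rows_in_tile")
     .layout("dense_array")
     .cache("tree_tops", "shared_memory")
     .parallelize("row_tile", "gpu_block")
     .parallelize("rows_in_tile", "gpu_thread"))
print(s.directives)
```
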
We evaluate SilvanForge on several hundred decision tree models across different batch sizes and target architectures. We find that our schedule exploration strategy quickly finds near-optimal schedules. In terms of performance, SilvanForge-generated code is an order of magnitude faster than XGBoost, about 2--4× faster on average than RAPIDS FIL and Tahoe, and 2.5--3× faster than Hummingbird across several batch sizes. While most systems target only NVIDIA GPUs, SilvanForge achieves competitive performance on AMD GPUs as well. On CPUs, SilvanForge outperforms Treebeard by up to 5× by exploiting additional sources of parallelism at small batch sizes.
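
The abstract reports that schedule exploration quickly finds near-optimal schedules but does not describe the strategy itself. As baseline intuition, the sketch below shows the generic compile-benchmark-select loop that any such explorer refines; compile_and_run is a hypothetical stand-in for "generate code for this schedule and time it on the target", and the real system would prune this space far more cleverly than brute force.

```python
# Generic auto-tuning loop implied by schedule exploration (illustrative, not
# SilvanForge's actual strategy): enumerate candidate configurations,
# benchmark each, and keep the fastest.
import itertools

def explore(candidate_axes: dict, compile_and_run) -> tuple:
    best_cfg, best_time = None, float("inf")
    keys = list(candidate_axes)
    for values in itertools.product(*candidate_axes.values()):
        cfg = dict(zip(keys, values))
        elapsed = compile_and_run(cfg)     # seconds per batch on the target
        if elapsed < best_time:
            best_cfg, best_time = cfg, elapsed
    return best_cfg, best_time

# Usage with a toy cost model in place of real compilation + benchmarking.
axes = {"tile_size": [64, 128, 256], "layout": ["dense", "sparse"],
        "unroll_trees": [1, 4, 8]}
fake_cost = lambda cfg: 1.0 / (cfg["tile_size"] * cfg["unroll_trees"])
print(explore(axes, fake_cost))
```
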



      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      SOSP '24: Proceedings of the ACM SIGOPS 30th Symposium on Operating Systems Principles
      November 2024
      765 pages
      ISBN:9798400712517
      DOI:10.1145/3694715
      This work is licensed under a Creative Commons Attribution International 4.0 License.

      Sponsors

      In-Cooperation

      • USENIX

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 15 November 2024

      Check for updates

      Badges

      Author Tags

      1. optimizing compiler
      2. decision tree ensemble
      3. decision tree inference
      4. parallelization
      5. machine learning
      6. GPU

      Qualifiers

      • Research-article

      Conference

      SOSP '24
      Sponsor:

      Acceptance Rates

      SOSP '24 Paper Acceptance Rate 43 of 245 submissions, 18%;
      Overall Acceptance Rate 174 of 961 submissions, 18%

      Upcoming Conference

      SOSP '25
      ACM SIGOPS 31st Symposium on Operating Systems Principles
      October 13 - 16, 2025
      Seoul , Republic of Korea
