- research-article, January 2025
SHIELD: Security-Aware Scheduling for Real-Time DAGs on Heterogeneous Systems
ACM Transactions on Cyber-Physical Systems (TCPS), Volume 9, Issue 1, Article No.: 2, Pages 1–29, https://doi.org/10.1145/3702236
Many control applications in real-time cyber-physical systems are represented as Directed Acyclic Graphs (DAGs) due to complex interactions among their functional components, and executed on distributed heterogeneous platforms. Data communication between ...
- research-article, October 2023
TMDS: Temperature-aware Makespan Minimizing DAG Scheduler for Heterogeneous Distributed Systems
ACM Transactions on Design Automation of Electronic Systems (TODAES), Volume 28, Issue 6, Article No.: 99, Pages 1–22, https://doi.org/10.1145/3616869
To meet application-specific performance demands, recent embedded platforms often involve the use of intricate micro-architectural designs and very small feature sizes leading to complex chips with multi-million gates. Such ultra-high gate densities often ...
- abstract, May 2022
OpenCL Command-buffer Extension: Design and Implementation
IWOCL '22: Proceedings of the 10th International Workshop on OpenCL, Article No.: 1, Pages 1–2, https://doi.org/10.1145/3529538.3529979
OpenCL allows a programmer to offload a sequence of commands to a heterogeneous accelerator, such as a GPU. For embedded devices the overhead of building a command sequence can be expensive, and many applications require the same pipeline of commands to ...
- research-article, January 2022
Performance-Effective DAG Scheduling for Heterogeneous Distributed Systems
ICDCN '22: Proceedings of the 23rd International Conference on Distributed Computing and Networking, Pages 234–235, https://doi.org/10.1145/3491003.3491298
The problem of scheduling Directed Acyclic Graphs (DAGs) in order to minimize schedule length (also known as makespan), is known to be a challenging as well as computationally hard problem. Therefore, researchers have endeavored towards the design of ...
- research-article, September 2021
HMDS: A Makespan Minimizing DAG Scheduler for Heterogeneous Distributed Systems
ACM Transactions on Embedded Computing Systems (TECS), Volume 20, Issue 5s, Article No.: 106, Pages 1–26, https://doi.org/10.1145/3477037
The problem of scheduling Directed Acyclic Graphs in order to minimize makespan (schedule length), is known to be a challenging and computationally hard problem. Therefore, researchers have endeavored towards the design of various heuristic solution ...
- research-article, August 2020
Energy-aware strategies for reliability-oriented real-time task allocation on heterogeneous platforms
ICPP '20: Proceedings of the 49th International Conference on Parallel Processing, Article No.: 16, Pages 1–11, https://doi.org/10.1145/3404397.3404419
Low energy consumption and high reliability are widely identified as increasingly relevant issues in real-time systems on heterogeneous platforms. In this paper, we propose a multi-criteria optimization strategy to minimize the expected energy ...
- research-article, June 2019
Program analysis for process migration
SOAP 2019: Proceedings of the 8th ACM SIGPLAN International Workshop on State Of the Art in Program Analysis, Pages 32–37, https://doi.org/10.1145/3315568.3329969
Today's computer systems have become increasingly heterogeneous. Data centers integrate accelerators, CPUs with heterogeneous cores and with various ISAs which exhibit different performance and power characteristics. Mobile phones, following a similar ...
- research-article, November 2018
An adaptive offline implementation selector for heterogeneous parallel platforms
International Journal of High Performance Computing Applications (SAGE-HPCA), Volume 32, Issue 6, Pages 854–863, https://doi.org/10.1177/1094342017698746
Heterogeneous parallel platforms, comprising multiple processing units and architectures, have become a cornerstone in improving the overall performance and energy efficiency of scientific and engineering applications. Nevertheless, taking full ...
- research-article, June 2017
Optimal Checkpointing Period with Replicated Execution on Heterogeneous Platforms
FTXS '17: Proceedings of the 2017 Workshop on Fault-Tolerance for HPC at Extreme Scale, Pages 9–16, https://doi.org/10.1145/3086157.3086165
In this paper, we design and analyze strategies to replicate the execution of an application on two different platforms subject to failures, using checkpointing on a shared stable storage. We derive the optimal pattern size $W$ for a periodic ...
- poster, May 2016
Accelerating the 3D euler atmospheric solver through heterogeneous CPU-GPU platforms
CF '16: Proceedings of the ACM International Conference on Computing Frontiers, Pages 353–356, https://doi.org/10.1145/2903150.2903480
In climate change studies, the atmospheric model is an essential component for building a high-resolution climate simulation system. While the accuracy of atmospheric simulations has long been limited by the computational capabilities of CPU platforms, ...
- Article, August 2015
The Loop-of-Stencil-Reduce Paradigm
- Marco Aldinucci,
- Marco Danelutto,
- Maurizio Drocco,
- Peter Kilpatrick,
- Guilherme Peretti Pezzi,
- Massimo Torquati
TRUSTCOM-BIGDATASE-ISPA '15: Proceedings of the 2015 IEEE Trustcom/BigDataSE/ISPA - Volume 03, Pages 172–177, https://doi.org/10.1109/Trustcom.2015.628
In this paper we advocate the Loop-of-stencil-reduce pattern as a way to simplify the parallel programming of heterogeneous platforms (multicore+GPUs). Loop-of-Stencil-reduce is general enough to subsume map, reduce, map-reduce, stencil, stencil-reduce, ...
- Article, August 2015
AKI: Automatic Kernel Identification and Annotation Tool Based on C++ Attributes
TRUSTCOM-BIGDATASE-ISPA '15: Proceedings of the 2015 IEEE Trustcom/BigDataSE/ISPA - Volume 03, Pages 148–153, https://doi.org/10.1109/Trustcom.2015.624
Massively parallel architectures are mainly based on a parallel heterogeneous setup, they are composed by different computing devices that speedup specific code region named kernel. These kernels are executed offline in the corresponding devices. ...
- research-article, May 2015
Improving application performance by efficiently utilizing heterogeneous many-core platforms
CCGRID '15: Proceedings of the 15th IEEE/ACM International Symposium on Cluster, Cloud, and Grid Computing, Pages 709–712, https://doi.org/10.1109/CCGrid.2015.44
Heterogeneous platforms integrating different types of processing units (such as multi-core CPUs and GPUs) are in high demand in high performance computing. Existing studies have shown that using heterogeneous platforms can improve application ...
- research-article, March 2015
HeteroVisor: Exploiting Resource Heterogeneity to Enhance the Elasticity of Cloud Platforms
VEE '15: Proceedings of the 11th ACM SIGPLAN/SIGOPS International Conference on Virtual Execution Environments, Pages 79–92, https://doi.org/10.1145/2731186.2731191
This paper presents HeteroVisor, a heterogeneity-aware hypervisor, that exploits resource heterogeneity to enhance the elasticity of cloud systems. Introducing the notion of 'elasticity' (E) states, HeteroVisor permits applications to manage their ...
Also Published in:
ACM SIGPLAN Notices: Volume 50 Issue 7
- research-article, June 2014
Improving performance by matching imbalanced workloads with heterogeneous platforms
ICS '14: Proceedings of the 28th ACM international conference on Supercomputing, Pages 241–250, https://doi.org/10.1145/2597652.2597675
Although GPUs are considered ideal to accelerate massively data-parallel applications, there are still exceptions to this rule. For example, imbalanced applications cannot be efficiently processed by GPUs: despite the massive data parallelism, a varied ...
- Article, August 2012
Hierarchical partitioning algorithm for scientific computing on highly heterogeneous CPU + GPU clusters
Euro-Par'12: Proceedings of the 18th international conference on Parallel Processing, Pages 489–501, https://doi.org/10.1007/978-3-642-32820-6_49
Hierarchical level of heterogeneity exists in many modern high performance clusters in the form of heterogeneity between computing nodes, and within a node with the addition of specialized accelerators, such as GPUs. To achieve high performance of ...
- Article, July 2012
ScalaPipe: A Streaming Application Generator
SAAHPC '12: Proceedings of the 2012 Symposium on Application Accelerators in High Performance Computing, Pages 44–53, https://doi.org/10.1109/SAAHPC.2012.14
ScalaPipe is a streaming application generator for heterogeneous platforms. By using a collection of domain-specific languages (DSLs) embedded in the Scala programming language, ScalaPipe allows creation of streaming applications that can run on a ...