Export Citations
Save this search
Please login to be able to save your searches and receive alerts for new content matching your search criteria.
- research-articleFebruary 2024
APPy: Annotated Parallelism for Python on GPUs
CC 2024: Proceedings of the 33rd ACM SIGPLAN International Conference on Compiler ConstructionPages 113–125https://doi.org/10.1145/3640537.3641575GPUs are increasingly being used used to speed up Python applications in the scientific computing and machine learning domains. Currently, the two common approaches to leveraging GPU acceleration in Python are 1) create a custom native GPU kernel, and ...
- research-articleJanuary 2024
ISP Agent: A Generalized In-storage-processing Workload Offloading Framework by Providing Multiple Optimization Opportunities
- Seokwon Kang,
- Jongbin Kim,
- Gyeongyong Lee,
- Jeongmyung Lee,
- Jiwon Seo,
- Hyungsoo Jung,
- Yong Ho Song,
- Yongjun Park
ACM Transactions on Architecture and Code Optimization (TACO), Volume 21, Issue 1Article No.: 11, Pages 1–24https://doi.org/10.1145/3632951As solid-state drives (SSDs) with sufficient computing power have recently become the dominant devices in modern computer systems, in-storage processing (ISP), which processes data within the storage without transferring it to the host memory, is being ...
- ArticleAugust 2023
Hierarchical Management of Extreme-Scale Task-Based Applications
AbstractThe scale and heterogeneity of exascale systems increment the complexity of programming applications exploiting them. Task-based approaches with support for nested tasks are a good-fitting model for them because of the flexibility lying in the ...
- demonstrationMay 2023
Interaction-Oriented Programming: Intelligent, Meaning-Based Multiagent Systems
AAMAS '23: Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent SystemsPages 3041–3043Interaction-Oriented Programming (IOP) is an approach for engineering decentralized multiagent systems based in the idea of modeling interaction meaning. Modeling meaning enables agents to make flexible decentralized decisions. IOP addresses the key ...
- research-articleMay 2023
Mandrake: Multiagent Systems as a Basis for Programming Fault-Tolerant Decentralized Applications
AAMAS '23: Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent SystemsPages 1218–1220We define a decentralized software application as one that consists of autonomous agents that communicate through asynchronous messaging. Constructing a decentralized application involves designing agents as independent local computations that coordinate ...
-
- research-articleMay 2023
Kiko: Programming Agents to Enact Interaction Models
AAMAS '23: Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent SystemsPages 1154–1163Realizing a multiagent system involves implementing member agents who interact based on a protocol while making decisions in a decentralized manner. Current programming models for agents offer poor abstractions for decision making and fail to adequately ...
- research-articleMay 2023
Logical Time for Reactive Software
CPS-IoT Week '23: Proceedings of Cyber-Physical Systems and Internet of Things Week 2023Pages 313–318https://doi.org/10.1145/3576914.3587494Timing is an essential feature of reactive software. It is not just a performance metric, but rather forms a core part of the semantics of programs. This paper argues for a notion of logical time that serves as an engineering model to complement a ...
- research-articleFebruary 2023
Harmonic CUDA: Asynchronous Programming on GPUs
PMAM'23: Proceedings of the 14th International Workshop on Programming Models and Applications for Multicores and ManycoresPages 39–49https://doi.org/10.1145/3582514.3582517We introduce Harmonic CUDA, a dataflow programming model for GPUs that allows programmers to describe algorithms as a dependency graph of producers and consumers where data flows continuously through the graph for the duration of the kernel. This ...
- research-articleJanuary 2023
Unified Programming Models for Heterogeneous High-Performance Computers
Journal of Computer Science and Technology (JCST), Volume 38, Issue 1Pages 211–218https://doi.org/10.1007/s11390-023-2888-4AbstractUnified programming models can effectively improve program portability on various heterogeneous high-performance computers. Existing unified programming models put a lot of effort to code portability but are still far from achieving good ...
- research-articleDecember 2023
Mint: An Accelerator for Mining Temporal Motifs
- Nishil Talati,
- Haojie Ye,
- Sanketh Vedula,
- Kuan-Yu Chen,
- Yuhan Chen,
- Daniel Liu,
- Yichao Yuan,
- David Blaauw,
- Alex Bronstein,
- Trevor Mudge,
- Ronald Dreslinski
MICRO '22: Proceedings of the 55th Annual IEEE/ACM International Symposium on MicroarchitecturePages 1270–1287https://doi.org/10.1109/MICRO56248.2022.00089A variety of complex systems, including social and communication networks, financial markets, biology, and neuroscience are modeled using temporal graphs that contain a set of nodes and directed timestamped edges. Temporal motifs in temporal graphs ...
- ArticleMay 2023
Programming Heterogeneous Architectures Using Hierarchical Tasks
- Mathieu Faverge,
- Nathalie Furmento,
- Abdou Guermouche,
- Gwenolé Lucas,
- Raymond Namyst,
- Samuel Thibault,
- Pierre-André Wacrenier
AbstractTask-based systems have gained popularity as they promise to exploit the computational power of complex heterogeneous systems. A common programming model is the so-called Sequential Task Flow (STF) model, which, unfortunately, has the intrinsic ...
- research-articleAugust 2022
Performance portability study of epistasis detection using SYCL on NVIDIA GPU
BCB '22: Proceedings of the 13th ACM International Conference on Bioinformatics, Computational Biology and Health InformaticsArticle No.: 69, Pages 1–8https://doi.org/10.1145/3535508.3545591We describe the experience of converting a CUDA implementation of a high-order epistasis detection algorithm to SYCL. The goals are for our work to be useful to application and compiler developers with a detailed description of migration paths between ...
- abstractMay 2022
Using interoperability mode in SYCL 2020
IWOCL '22: Proceedings of the 10th International Workshop on OpenCLArticle No.: 21, Page 1https://doi.org/10.1145/3529538.3529997SYCL is a programming standard targeting hardware platforms with a host connected to various heterogeneous accelerators. Both the host and accelerator parts of the computation are expressed in a single-source modern C++ program. While the previous ...
- research-articleFebruary 2022
HeteroFlow: An Accelerator Programming Model with Decoupled Data Placement for Software-Defined FPGAs
FPGA '22: Proceedings of the 2022 ACM/SIGDA International Symposium on Field-Programmable Gate ArraysPages 78–88https://doi.org/10.1145/3490422.3502369To achieve high performance with FPGA-equipped heterogeneous compute systems, it is crucial to co-optimize data placement and compute scheduling to maximize data reuse and bandwidth utilization for both on- and off-chip memory accesses. However, ...
- research-articleSeptember 2021
A new method for deriving priority from dual hesitant fuzzy preference relations
International Journal of Intelligent Systems (IJIS), Volume 36, Issue 11Pages 6613–6644https://doi.org/10.1002/int.22562AbstractDual hesitant fuzzy elements (DHFEs) are suitable to express hesitant possible preferred and nonpreferred judgments of decision makers. Preference relation is an important tool in decision making that only needs the decision makers to compare a ...
- tutorialJuly 2021
Interaction-Oriented Programming: An Application Semantics Approach for Engineering Decentralized Applications
PODC'21: Proceedings of the 2021 ACM Symposium on Principles of Distributed ComputingPages 575–576https://doi.org/10.1145/3465084.3467486Interaction-Oriented Programming (IOP) refers to multiagent concepts, languages, and programming models for engineering applications that are characterized by interactions between autonomous parties. Such applications arise in domains such as e-commerce,...
- research-articleJune 2021
Automatic mapping and code optimization for OpenCL kernels on FT-matrix architecture (WIP paper)
LCTES 2021: Proceedings of the 22nd ACM SIGPLAN/SIGBED International Conference on Languages, Compilers, and Tools for Embedded SystemsPages 37–41https://doi.org/10.1145/3461648.3463845FT-Matrix is a typical vector-SIMD architecture that refines the cooperation between scalar and vector units. This approach is widely used in digital signal processing, high-performance computing, and artificial intelligence, among other fields. FT-...
- research-articleJanuary 2021
A Group Decision Making Method with Interval-Valued Intuitionistic Fuzzy Preference Relations and Its Application in the Selection of Cloud Computing Vendors for SMEs
To solve the problem of choosing the appropriate cloud computing vendors in small and medium-sized enterprises (SMEs), this paper boils it down to a group decision making (GDM) problem. To facilitate the judgment, this paper uses preference relation as ...
- research-articleNovember 2020
Lazy Build of Acceleration Structures with Traversal Shaders
SA '20: SIGGRAPH Asia 2020 Technical CommunicationsArticle No.: 11, Pages 1–4https://doi.org/10.1145/3410700.3425430Modern ray tracing APIs allow developers to easily build acceleration structures (AS) with various optimization techniques. However, the visibility-driven on-demand build can not be implemented with the current APIs due to the lack of flexibility ...
- research-articleMarch 2021
X-Centric: A Survey on Compute-, Memory- and Application-Centric Computer Architectures
MEMSYS '20: Proceedings of the International Symposium on Memory SystemsPages 178–193https://doi.org/10.1145/3422575.3422792Big Data and machine learning constitute the multifaceted challenge of computer engineering in the past decade. The meaningful processing of vast amounts of unstructured data from a myriad of sensors and devices is a complicated endeavor already. ...