- research-article, June 2024
Apache Arrow DataFusion: A Fast, Embeddable, Modular Analytic Query Engine
SIGMOD/PODS '24: Companion of the 2024 International Conference on Management of Data, Pages 5–17
https://doi.org/10.1145/3626246.3653368
Apache Arrow DataFusion is a fast, embeddable, and extensible query engine written in Rust that uses Apache Arrow as its memory model. In this paper we describe the technologies on which it is built, and how it fits in long-term database implementation ...
- research-article, September 2024
One Automaton to Rule Them All: Beyond Multiple Regular Expressions Execution
CGO '24: Proceedings of the 2024 IEEE/ACM International Symposium on Code Generation and Optimization, Pages 193–206
https://doi.org/10.1109/CGO57630.2024.10444810
Regular Expressions (REs) matching is crucial to identify strings exhibiting certain morphological properties in a data stream, resulting paramount in contexts such as deep packet inspection in computer security and genome analysis in bioinformatics. Yet,...
- research-article, September 2023
BlockPilot: A Proposer-Validator Parallel Execution Framework for Blockchain
ICPP '23: Proceedings of the 52nd International Conference on Parallel Processing, Pages 193–202
https://doi.org/10.1145/3605573.3605621
Traditional blockchain systems still struggle with limited throughput, particularly those compatible with EVM, which are crucial in many blockchain applications. One of the main reasons arises from serial execution, which doesn’t exploit parallelism in ...
- research-article, February 2023
Block-STM: Scaling Blockchain Execution by Turning Ordering Curse to a Performance Blessing
- Rati Gelashvili,
- Alexander Spiegelman,
- Zhuolun Xiang,
- George Danezis,
- Zekun Li,
- Dahlia Malkhi,
- Yu Xia,
- Runtian Zhou
PPoPP '23: Proceedings of the 28th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming, Pages 232–244
https://doi.org/10.1145/3572848.3577524
Block-STM is a parallel execution engine for smart contracts, built around the principles of Software Transactional Memory. Transactions are grouped in blocks, and every execution of the block must yield the same deterministic outcome. Block-STM further ...
- research-article, July 2022
Utilizing parallelism in smart contracts on decentralized blockchains by taming application-inherent conflicts
ICSE '22: Proceedings of the 44th International Conference on Software Engineering, Pages 2315–2326
https://doi.org/10.1145/3510003.3510086
Traditional public blockchain systems typically had very limited transaction throughput because of the bottleneck of the consensus protocol itself. With recent advances in consensus technology, the performance limit has been greatly lifted, typically to ...
- short-paper, December 2021
Efficient parallel execution of block transactions in blockchain
Middleware '21: Proceedings of the 22nd International Middleware Conference: Doctoral Symposium, Pages 8–11
https://doi.org/10.1145/3491087.3493676
Miners and validators in current blockchains serially execute block transactions. Such serial execution cannot efficiently utilize modern multi-core resources, consequently hampering system throughput. We propose three approaches to improve blockchain ...
- research-article, September 2021
Excluding Parallel Execution to Improve Global Fixed Priority Response Time Analysis
ACM Transactions on Embedded Computing Systems (TECS), Volume 20, Issue 5s, Article No.: 104, Pages 1–24
https://doi.org/10.1145/3477035
Response Time Analysis (RTA) is an effective method for testing the schedulability of real-time tasks on multiprocessor platforms. Existing RTAs for global fixed priority scheduling calculate the upper bound of the worst case response time of each task. ...
- poster, April 2021
Efficient and robust data integrity verification scheme for high-performance storage devices
SAC '21: Proceedings of the 36th Annual ACM Symposium on Applied Computing, Pages 1199–1202
https://doi.org/10.1145/3412841.3442113
Most of the data generated on high-performance computing systems are transferred to storage in remote systems for various purposes such as backup. To detect data corruption caused by network or storage failures during data transfer, the receiver system ...
- research-article, November 2019
Adaptive parallel execution of deep neural networks on heterogeneous edge devices
SEC '19: Proceedings of the 4th ACM/IEEE Symposium on Edge Computing, Pages 195–208
https://doi.org/10.1145/3318216.3363312
New applications such as smart homes, smart cities, and autonomous vehicles are driving an increased interest in deploying machine learning on edge devices. Unfortunately, deploying deep neural networks (DNNs) on resource-constrained devices presents ...
- research-article, October 2018
Virtual machine design for parallel dynamic programming languages
Proceedings of the ACM on Programming Languages (PACMPL), Volume 2, Issue OOPSLA, Article No.: 109, Pages 1–25
https://doi.org/10.1145/3276479
To leverage the benefits of modern hardware, dynamic languages must support parallelism, and parallelism requires a virtual machine (VM) capable of parallel execution, that is, a parallel VM. However, unrestricted concurrency and the dynamism of dynamic ...
- research-article, October 2016
Dependent partitioning
OOPSLA 2016: Proceedings of the 2016 ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications, Pages 344–358
https://doi.org/10.1145/2983990.2984016
A key problem in parallel programming is how data is partitioned: divided into subsets that can be operated on in parallel and, in distributed memory machines, spread across multiple address spaces.
We present a dependent partitioning framework that ...
Also Published in:
ACM SIGPLAN Notices: Volume 51 Issue 10
- article, August 2016
Supporting efficient execution of continuous space agent-based simulation on GPU
Concurrency and Computation: Practice & Experience (CCOMP), Volume 28, Issue 12, Pages 3313–3332
https://doi.org/10.1002/cpe.3808
Using agent-based simulation (ABS) to analyze complex adaptive systems has gained growing popularity over the past decades. One of the fundamental issues in ABS is to increase the execution speed. In this paper, we identify two common modules that widely exist ...
- research-article, June 2016
Running Simultaneous Kepler Sessions for the Parallelization of Parametric Scans and Optimization Studies Applied to Complex Workflows
- Michał Owsiak,
- Marcin Płóciennik,
- Bartek Palak,
- Tomasz Zok,
- Cedric Reux,
- Luc Di Gallo,
- Denis Kalupin,
- Thomas Johnson,
- Mireille Schneider
Procedia Computer Science (PROCS), Volume 80, Issue C, Pages 690–699
https://doi.org/10.1016/j.procs.2016.05.362
In this paper we present an approach taken to run multiple Kepler sessions at the same time. This kind of execution is one of the requirements for the Integrated Tokamak Modelling platform developed by the Nuclear Fusion community within the context of ...
- research-article, April 2015
Armadillo: A Compilation Chain for Privacy Preserving Applications
SCC '15: Proceedings of the 3rd International Workshop on Security in Cloud Computing, Pages 13–19
https://doi.org/10.1145/2732516.2732520
In this work we present Armadillo, a compilation chain used for compiling applications written in a high-level language (C++) to work on encrypted data. The back-end of the compilation chain is based on homomorphic encryption. The tool-chain further ...
- Article, August 2014
Scientific Workflow Partitioning in Multisite Cloud
Revised Selected Papers, Part I, of the Euro-Par 2014 International Workshops on Parallel Processing - Volume 8805, Pages 105–116
https://doi.org/10.1007/978-3-319-14325-5_10
Scientific workflows allow scientists to conduct experiments that manipulate data with multiple computational activities using Scientific Workflow Management Systems (SWfMSs). As the scale of the data increases, SWfMSs need to support workflow execution ...
- research-article, April 2014
Excalibur: an autonomic cloud architecture for executing parallel applications
- Alessandro Ferreira Leite,
- Tainá Raiol,
- Claude Tadonki,
- Maria Emilia M. T. Walter,
- Christine Eisenbeis,
- Alba Cristina Magalhães Alves de Melo
CloudDP '14: Proceedings of the Fourth International Workshop on Cloud Data and Platforms, Article No.: 2, Pages 1–6
https://doi.org/10.1145/2592784.2592786
IaaS providers often allow the users to specify many requirements for their applications. However, users without advanced technical knowledge usually do not provide a good specification of the cloud environment, leading to low performance and/or high ...
- research-article, November 2013
PICCO: a general-purpose compiler for private distributed computation
CCS '13: Proceedings of the 2013 ACM SIGSAC conference on Computer & communications security, Pages 813–826
https://doi.org/10.1145/2508859.2516752
Secure computation on private data has been an active area of research for many years and has received a renewed interest with the emergence of cloud computing. In recent years, substantial progress has been made with respect to the efficiency of the ...
- Article, September 2013
Space Efficient Formats for Structure of Sparse Matrices Based on Tree Structures
SYNASC '13: Proceedings of the 2013 15th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing, Pages 344–351
https://doi.org/10.1109/SYNASC.2013.52
Very large sparse matrices are often processed on massively parallel computer systems with distributed memory architectures consisting of tens or hundreds of thousands of processor cores. The problem occurs when we want or need to load/store these ...
- Article, September 2013
The Study of Impact of Matrix-Processor Mapping on the Parallel Sparse Matrix-Vector Multiplication
SYNASC '13: Proceedings of the 2013 15th International Symposium on Symbolic and Numeric Algorithms for Scientific Computing, Pages 321–328
https://doi.org/10.1109/SYNASC.2013.49
Sparse matrix-vector multiplication (SpMV for short) is one of the most common subroutines in numerical linear algebra. The parallelization of this task looks easy and straightforward, but it is not optimal in the general case. This paper discusses some ...
- research-article, May 2013
Scaling analytics applications with OpenCL for loosely coupled heterogeneous clusters
CF '13: Proceedings of the ACM International Conference on Computing Frontiers, Article No.: 35, Pages 1–10
https://doi.org/10.1145/2482767.2482812
OpenCL is an open standard for heterogeneous parallel programming, exploiting multi-core CPUs, GPUs, or other accelerators as parallel computing resources. Recent work has extended the OpenCL parallel programming model for distributed heterogeneous ...