Segura et al., 2021 - Google Patents
Energy-efficient stream compaction through filtering and coalescing accesses in gpgpu memory partitionsSegura et al., 2021
View PDF- Document ID
- 6154003844937861400
- Author
- Segura A
- Arnau J
- González A
- Publication year
- Publication venue
- IEEE Transactions on Computers
External Links
Snippet
Graph-based applications are essential in emerging domains such as data analytics or machine learning. Data gathering in a knowledge-based society requires great data processing efficiency. High-throughput GPGPU architectures are key to enable efficient …
- 238000005056 compaction 0 title abstract description 63
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F12/00—Accessing, addressing or allocating within memory systems or architectures
- G06F12/02—Addressing or allocation; Relocation
- G06F12/08—Addressing or allocation; Relocation in hierarchically structured memory systems, e.g. virtual memory systems
- G06F12/0802—Addressing of a memory level in which the access to the desired data or data block requires associative addressing means, e.g. caches
- G06F12/0806—Multiuser, multiprocessor or multiprocessing cache systems
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/50—Computer-aided design
- G06F17/5009—Computer-aided design using simulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3466—Performance evaluation by tracing or monitoring
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3409—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30286—Information retrieval; Database structures therefor; File system structures therefor in structured data stores
- G06F17/30587—Details of specialised database models
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3457—Performance evaluation by simulation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3442—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment for planning or managing the needed capacity
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/36—Preventing errors by testing or debugging software
- G06F11/362—Software debugging
- G06F11/3636—Software debugging by tracing the execution of the program
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2201/00—Indexing scheme relating to error detection, to error correction, and to monitoring
- G06F2201/885—Monitoring specific for caches
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F8/00—Arrangements for software engineering
- G06F8/40—Transformations of program code
- G06F8/41—Compilation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2217/00—Indexing scheme relating to computer aided design [CAD]
- G06F2217/78—Power analysis and optimization
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/76—Architectures of general purpose stored programme computers
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Yan et al. | Alleviating irregularity in graph analytics acceleration: A hardware/software co-design approach | |
Dai et al. | FPGP: Graph processing framework on FPGA a case study of breadth-first search | |
Ozdal et al. | Energy efficient architecture for graph analytics accelerators | |
Wang et al. | Melia: A mapreduce framework on opencl-based fpgas | |
Bai et al. | Efficient data loader for fast sampling-based GNN training on large graphs | |
Segura et al. | SCU: a GPU stream compaction unit for graph processing | |
Zuckerman et al. | Cohmeleon: Learning-based orchestration of accelerator coherence in heterogeneous SoCs | |
Bhatotia | Incremental parallel and distributed systems | |
Adiletta et al. | Characterizing the Scalability of Graph Convolutional Networks on Intel® PIUMA | |
Segura et al. | Energy-efficient stream compaction through filtering and coalescing accesses in gpgpu memory partitions | |
Jaros et al. | Use of multiple GPUs on shared memory multiprocessors for ultrasound propagation simulations | |
Che et al. | Programming GPGPU graph applications with linear algebra building blocks | |
Lan et al. | Improving performance of GPU specific OpenCL program on CPUs | |
Addisie et al. | Collaborative accelerators for streamlining MapReduce on scale-up machines with incremental data aggregation | |
Parashar | Big data challenges in simulation-based science. | |
Wang et al. | Hardware-accelerated hypergraph processing with chain-driven scheduling | |
Papadopoulos et al. | Customization methodology for implementation of streaming aggregation in embedded systems | |
Hoxha et al. | High performance heterogeneous multicore architectures: A study | |
Zhang et al. | Architectural Implications in Graph Processing of Accelerator with Gardenia Benchmark Suite | |
Kim et al. | Analyzing gcn aggregation on gpu | |
Yin et al. | DeltaGNN: Accelerating graph neural networks on dynamic graphs with delta updating | |
Abdullin et al. | Distributed analysis of the BMC kind: Making it fit the tornado supercomputer | |
Zhang et al. | Rethinking Design Paradigm of Graph Processing System with a CXL-like Memory Semantic Fabric | |
Wang et al. | Energy cost evaluation of parallel algorithms for multiprocessor systems | |
Hoang et al. | BatchGNN: Efficient CPU-Based Distributed GNN Training on Very Large Graphs |