Cited By
- Zhou N, Zhou H and Hoppe D (2023). Containerization for High Performance Computing Systems: Survey and Prospects, IEEE Transactions on Software Engineering, 49:4, (2722-2740), Online publication date: 1-Apr-2023.
- Zhang K, Su H and Dou Y (2021). Multilevel parallelism optimization of stencil computations on SIMDlized NUMA architectures, The Journal of Supercomputing, 77:11, (13584-13600), Online publication date: 1-Nov-2021.
- Williams B, Eker A, Chiu K and Ponomarev D High-Performance PDES on Manycore Clusters Proceedings of the 2021 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation, (153-164)
- Memeti S, Pllana S, Binotto A, Kołodziej J and Brandic I (2019). Using meta-heuristics and machine learning for software optimization of parallel computing systems, Computing, 101:8, (893-936), Online publication date: 1-Aug-2019.
- Hayashi A, Paul S and Sarkar V GPUIterator: bridging the gap between Chapel and GPU platforms Proceedings of the ACM SIGPLAN 6th on Chapel Implementers and Users Workshop, (2-11)
- Eckstein J, Goldberg N and Kagawa A Rule-enhanced penalized regression by column generation using rectangular maximum agreement Proceedings of the 34th International Conference on Machine Learning - Volume 70, (1059-1067)
- Salles T and Hardiman L (2016). Badlands, Computers & Geosciences, 91:C, (77-89), Online publication date: 1-Jun-2016.
- Achour S and Rinard M (2015). Approximate computation with outlier detection in Topaz, ACM SIGPLAN Notices, 50:10, (711-730), Online publication date: 18-Dec-2015.
- Hayashi A, Zhao J, Ferguson M and Sarkar V LLVM-based communication optimizations for PGAS programs Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in HPC, (1-11)
- Achour S and Rinard M Approximate computation with outlier detection in Topaz Proceedings of the 2015 ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications, (711-730)
- Peretti Pezzi G, Vaissié E, Viala Y, Caromel D and Gourbesville P (2015). Parallel profiling of water distribution networks using the Clément formula, Applied Mathematics and Computation, 267:C, (83-95), Online publication date: 15-Sep-2015.
- Grelck C Single Assignment C (SAC) Central European Functional Programming School, (207-282)
- Solomonik E, Matthews D, Hammond J, Stanton J and Demmel J (2014). A massively parallel tensor contraction framework for coupled-cluster computations, Journal of Parallel and Distributed Computing, 74:12, (3176-3190), Online publication date: 1-Dec-2014.
- Chan Y, Wellings A, Gray I and Audsley N On the Locality of Java 8 Streams in Real-Time Big Data Applications Proceedings of the 12th International Workshop on Java Technologies for Real-time and Embedded Systems, (20-28)
- Sun Y, Lifflander J and Kalé L PICS Proceedings of the 4th International Workshop on Runtime and Operating Systems for Supercomputers, (1-8)
- Das D, Chen D, Martins A, Schneider N and Smith N (2014). Frame-semantic parsing, Computational Linguistics, 40:1, (9-56), Online publication date: 1-Mar-2014.
- Dennis J Compiling Fresh Breeze Codelets Proceedings of Programming Models and Applications on Multicores and Manycores, (51-60)
- Guo Z, Mi J, Xiong S and Grant P (2014). Phase field study of the tip operating state of a freely growing dendrite against convection using a novel parallel multigrid approach, Journal of Computational Physics, 257:PA, (278-297), Online publication date: 15-Jan-2014.
- Lotz J, Naumann U, Sagebaum M and Schanen M Discrete adjoints of PETSc through dco/c++ and adjoint MPI Proceedings of the 19th international conference on Parallel Processing, (497-507)
- Zhang Z, Zhang S, Wang Y, Jiang Y and Wang H (2013). Use of parallel deterministic dynamic programming and hierarchical adaptive genetic algorithm for reservoir operation optimization, Computers and Industrial Engineering, 65:2, (310-321), Online publication date: 1-Jun-2013.
- Wu Y, Li T, Sun L and Chen J (2013). Parallelization of a hydrological model using the message passing interface, Environmental Modelling & Software, 43, (124-132), Online publication date: 1-May-2013.
- Lin J, Zha L and Xu Z (2013). Consolidated cluster systems for data centers in the cloud age, Frontiers of Computer Science: Selected Publications from Chinese Universities, 7:1, (1-19), Online publication date: 1-Feb-2013.
- Esteban F, Díaz D, Hernández P, Caballero J, Dorado G and Gálvez S (2013). Direct approaches to exploit many-core architecture in bioinformatics, Future Generation Computer Systems, 29:1, (15-26), Online publication date: 1-Jan-2013.
- Zhu F, Gonzalez D, Carpenter T, Atkinson M and Wardlaw J (2012). Parallel perfusion imaging processing using GPGPU, Computer Methods and Programs in Biomedicine, 108:3, (1012-1021), Online publication date: 1-Dec-2012.
- Neves M, Ferreto T and De Rose C Scheduling mapreduce jobs in HPC clusters Proceedings of the 18th international conference on Parallel Processing, (179-190)
- Feng M, Gupta R and Neamtiu I (2012). Effective parallelization of loops in the presence of I/O operations, ACM SIGPLAN Notices, 47:6, (487-498), Online publication date: 6-Aug-2012.
- Bordner J and Norman M Enzo-P / Cello Proceedings of the Extreme Scaling Workshop, (1-11)
- Wang J, Ponomarev D and Abu-Ghazaleh N Performance Analysis of a Multithreaded PDES Simulator on Multicore Clusters Proceedings of the 2012 ACM/IEEE/SCS 26th Workshop on Principles of Advanced and Distributed Simulation, (93-95)
- Stewart R, Trinder P and Maier P Supervised Workpools for Reliable Massively Parallel Computing Proceedings of the 2012 Conference on Trends in Functional Programming - Volume 7829, (247-262)
- Feng M, Gupta R and Neamtiu I Effective parallelization of loops in the presence of I/O operations Proceedings of the 33rd ACM SIGPLAN Conference on Programming Language Design and Implementation, (487-498)
- Das D and Smith N Graph-based lexicon expansion with sparsity-inducing penalties Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, (677-687)
- Brendel P, Dłotko P, Mrozek M and Żelazna N Homology computations via acyclic subspace Proceedings of the 4th international conference on Computational Topology in Image Context, (117-127)
- Shi L, Zhao Y and Tang J (2012). Batch Mode Active Learning for Networked Data, ACM Transactions on Intelligent Systems and Technology, 3:2, (1-25), Online publication date: 1-Feb-2012.
- Feng M, Lin C and Gupta R (2012). PLDS, ACM Transactions on Architecture and Code Optimization, 8:4, (1-21), Online publication date: 1-Jan-2012.
- Pacini E, Ribero M, Mateos C, Mirasso A and Garino C Simulation on cloud computing infrastructures of parametric studies of nonlinear solids problems Proceedings of the Second international conference on Advances in New Technologies, Interactive Interfaces and Communicability, (58-70)
- Pedemonte M, Nesmachnow S and Cancela H (2011). A survey on parallel ant colony optimization, Applied Soft Computing, 11:8, (5181-5197), Online publication date: 1-Dec-2011.
- Dwornik M and Pięta A Parallel implementation of stochastic inversion of seismic tomography data Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part II, (353-360)
- Feng M, Gupta R and Hu Y (2011). SpiceC, ACM SIGPLAN Notices, 46:8, (69-80), Online publication date: 7-Sep-2011.
- Solomonik E and Demmel J Communication-optimal parallel 2.5D matrix multiplication and LU factorization algorithms Proceedings of the 17th international conference on Parallel processing - Volume Part II, (90-109)
- Georgiev K, Ostromsky T and Zlatev Z New parallel implementation of an air pollution computer model --- performance study on an IBM blue gene/p computer Proceedings of the 8th international conference on Large-Scale Scientific Computing, (283-290)
- Bailey D and Borwein J (2011). High-precision numerical integration, Journal of Symbolic Computation, 46:7, (741-754), Online publication date: 1-Jul-2011.
- Grelck C Single assignment C (SAC) high productivity meets high performance Proceedings of the 4th Summer School conference on Central European Functional Programming School, (207-278)
- Schnorr L, Legrand A and Vincent J Multi-scale analysis of large distributed computing systems Proceedings of the third international workshop on Large-scale system and application performance, (27-34)
- Abdennadher N, Belgacem M, Couturier R, Laiymani D, Miquée S, Niinimaki M and Sauget M Gridification of a radiotherapy dose computation application with the xtremweb-CH environment Proceedings of the 6th international conference on Advances in grid and pervasive computing, (188-197)
- Gray I and Audsley N (2011). Targeting complex embedded architectures by combining the multicore communications API (mcapi) with compile-time virtualisation, ACM SIGPLAN Notices, 46:5, (51-60), Online publication date: 11-Apr-2011.
- Gray I and Audsley N Targeting complex embedded architectures by combining the multicore communications API (mcapi) with compile-time virtualisation Proceedings of the 2011 SIGPLAN/SIGBED conference on Languages, compilers and tools for embedded systems, (51-60)
- Feng M, Gupta R and Hu Y SpiceC Proceedings of the 16th ACM symposium on Principles and practice of parallel programming, (69-80)
- Kulkarni A, Liu Y and Smith S (2010). Task types for pervasive atomicity, ACM SIGPLAN Notices, 45:10, (671-690), Online publication date: 17-Oct-2010.
- Kulkarni A, Liu Y and Smith S Task types for pervasive atomicity Proceedings of the ACM international conference on Object oriented programming systems languages and applications, (671-690)
- Kristensen M and Vinter B Numerical Python for scalable architectures Proceedings of the Fourth Conference on Partitioned Global Address Space Programming Model, (1-9)
- Schanen M, Förster M and Naumann U Second-order algorithmic differentiation by source transformation of MPI code Proceedings of the 17th European MPI users' group meeting conference on Recent advances in the message passing interface, (257-264)
- Zhang Q, Zhang Y, Yu H and Huang X Efficient partial-duplicate detection based on sequence matching Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval, (675-682)
- Gray I and Audsley N Supporting islands of coherency for highly-parallel embedded architectures using compile-time virtualisation Proceedings of the 13th International Workshop on Software & Compilers for Embedded Systems, (1-10)
- Jardak C, Riihijärvi J, Oldewurtel F and Mähönen P Parallel processing of data from very large-scale wireless sensor networks Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, (787-794)
- Yang L, Yu L, Tang J, Wang L, Zhao J and Li X Enabling multi-core based monitoring and fault tolerance in C++/Java Proceedings of the 3rd International Workshop on Multicore Software Engineering, (32-39)
- Juve G and Deelman E (2010). Scientific workflows and clouds, XRDS: Crossroads, The ACM Magazine for Students, 16:3, (14-18), Online publication date: 1-Mar-2010.
- Grelck C, Julku J and Penczek F S-Net for multi-memory multicores Proceedings of the 5th ACM SIGPLAN workshop on Declarative aspects of multicore programming, (25-34)
- Dorta T, Jiménez J, Martín J, Bidarte U and Astarloa A (2010). Reconfigurable multiprocessor systems, International Journal of Reconfigurable Computing, 2010, (1-10), Online publication date: 1-Jan-2010.
- Vömel C (2010). ScaLAPACK's MRRR algorithm, ACM Transactions on Mathematical Software, 37:1, (1-35), Online publication date: 1-Jan-2010.
- Agullo E, Hadri B, Ltaief H and Dongarra J Comparative study of one-sided factorizations with multiple software packages on multi-core hardware Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, (1-12)
- Kashef R and Kamel M (2009). Enhanced bisecting k-means clustering using intermediate cooperation, Pattern Recognition, 42:11, (2557-2569), Online publication date: 1-Nov-2009.
- Lin Y and Hung L (2009). New families of computation-efficient parallel prefix algorithms, WSEAS Transactions on Computers, 8:10, (1651-1660), Online publication date: 1-Oct-2009.
- Hatcher P, Reno M, Antoniu G and Bouge L (2009). Cluster Computing with Java, Computing in Science and Engineering, 7:2, (34-39), Online publication date: 1-Mar-2009.
- Suresh S, Mani V, Omkar S, Kim H and Sundararajan N (2009). A new load distribution strategy for linear network with communication delays, Mathematics and Computers in Simulation, 79:5, (1488-1501), Online publication date: 1-Jan-2009.
- Zhou K, Hou Q, Wang R and Guo B Real-time KD-tree construction on graphics hardware ACM SIGGRAPH Asia 2008 papers, (1-11)
- Alberto A, Benet J, Arias E, Cebrian D, Rojo T and Cuartero F (2008). A high performance tool for the simulation of the dynamic pantograph-catenary interaction, Mathematics and Computers in Simulation, 79:3, (652-667), Online publication date: 1-Dec-2008.
- Levesque M, Ichikawa K, Date S and Haga J Bringing flexibility to virtual screening for enzymatic inhibitors on the grid Proceedings of the 2008 9th IEEE/ACM International Conference on Grid Computing, (201-208)
- Vidal A, Garcia V, Alonso P and Bernabeu M (2008). Parallel computation of the eigenvalues of symmetric Toeplitz matrices through iterative methods, Journal of Parallel and Distributed Computing, 68:8, (1113-1121), Online publication date: 1-Aug-2008.
- Collette S, Cucu L and Goossens J (2008). Integrating job parallelism in real-time scheduling theory, Information Processing Letters, 106:5, (180-187), Online publication date: 1-May-2008.
- Hung L and Lin Y (2008). Parallel prefix algorithms on the multicomputer, WSEAS Transactions on Computer Research, 3:4, (213-223), Online publication date: 1-Apr-2008.
- Frattolillo F (2008). Supporting data management on cluster grids, Future Generation Computer Systems, 24:2, (166-176), Online publication date: 1-Feb-2008.
- Evans G, Keith J and Kroese D Parallel cross-entropy optimization Proceedings of the 39th conference on Winter simulation: 40 years! The best is yet to come, (2196-2202)
- Sánchez A, Pérez M, Gueant P, Montes J, Herrero P and Cortes T (2007). Improving GridFTP transfers by means of a multiagent parallel file system, Multiagent and Grid Systems, 3:4, (441-451), Online publication date: 1-Dec-2007.
- Ananthanarayanan R and Modha D Anatomy of a cortical simulator Proceedings of the 2007 ACM/IEEE conference on Supercomputing, (1-12)
- Nakashole N, Suleman H and Pedzai C Dynamic role allocation for small search engine clusters Proceedings of the 2007 annual research conference of the South African institute of computer scientists and information technologists on IT research in developing countries, (91-96)
- Ramadan O A parallel FDTD algorithm for the solution of Maxwell's equations with nearly PML absorbing boundary conditions Proceedings of the 2007 international conference on Computational science and its applications - Volume Part III, (421-430)
- Meza J, Oliva R, Hough P and Williams P (2007). OPT++, ACM Transactions on Mathematical Software, 33:2, (12-es), Online publication date: 1-Jun-2007.
- Intrigila B, Melatti I, Tofani A and Macchiarelli G (2007). Computational models of myocardial endomysial collagen arrangement, Computer Methods and Programs in Biomedicine, 86:3, (232-244), Online publication date: 1-Jun-2007.
- Zhu W, Niu Y and Gao G (2007). Performance portability on EARTH: a case study across several parallel architectures, Cluster Computing, 10:2, (115-126), Online publication date: 1-Jun-2007.
- Knottenbelt W and Bradley J Tackling large state spaces in performance modelling Proceedings of the 7th international conference on Formal methods for performance evaluation, (318-370)
- Bounanos S, Fleury M, Nicolas S and Vickers A (2007). Regular Paper, International Journal of High Performance Computing Applications, 21:2, (222-245), Online publication date: 1-May-2007.
- Peng H, Wang M and Lai C (2007). Design of parallel algorithms for fractal video compression, International Journal of Computer Mathematics, 84:2, (193-202), Online publication date: 1-Feb-2007.
- Kim J, Lee J and Yeom H Experimental Assessment of the Practicality of a Fault-Tolerant System Proceedings of the 33rd conference on Current Trends in Theory and Practice of Computer Science, (878-887)
- Crane J, Crawford F and Nelson S (2006). Grid enabled magnetic resonance scanners for near real-time medical image processing, Journal of Parallel and Distributed Computing, 66:12, (1524-1533), Online publication date: 1-Dec-2006.
- Wang Z, Ghoniem N, Swaminarayan S and LeSar R (2006). A parallel algorithm for 3D dislocation dynamics, Journal of Computational Physics, 219:2, (608-621), Online publication date: 1-Dec-2006.
- Gnanendran K and Sundarraj R (2006). Alternative model representations and computing capacity, Decision Support Systems, 42:3, (1413-1430), Online publication date: 1-Dec-2006.
- Akaydin O and Ramadan O 3-D parallel implementation of the wave-equation PML algorithm using distributed system Proceedings of the 21st international conference on Computer and Information Sciences, (834-843)
- Sánchez A, Pérez M, Gueant P, Montes J and Herrero P A parallel data storage interface to GridFTP Proceedings of the 2006 Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, DOA, GADA, and ODBASE - Volume Part II, (1203-1212)
- Liao S Parallelizing user-defined and implicit reductions globally on multiprocessors Proceedings of the 11th Asia-Pacific conference on Advances in Computer Systems Architecture, (189-202)
- Yang L, Xu L, Lin M and Quinn J A parallel GNFS algorithm with the biorthogonal block lanczos method for integer factorization Proceedings of the Third international conference on Autonomic and Trusted Computing, (428-438)
- Ostromsky T and Zlatev Z Parallel and GRID implementation of a large scale air pollution model Proceedings of the 6th international conference on Numerical methods and applications, (475-482)
- Lin Y Computation-efficient parallel prefix Proceedings of the 6th WSEAS International Conference on Applied Informatics and Communications, (280-285)
- Yang L, Xu L, Lin M and Quinn J A parallel GNFS algorithm based on a reliable look-ahead block lanczos method for integer factorization Proceedings of the 2006 international conference on Embedded and Ubiquitous Computing, (110-120)
- Wooding K and Williams H Doubly-Focused enumeration of pseudosquares and pseudocubes Proceedings of the 7th international conference on Algorithmic Number Theory, (208-221)
- Lv X, Zhao Y, Huang X, Xia G and Wang Z (2006). An efficient parallel/unstructured-multigrid preconditioned implicit method for simulating 3D unsteady compressible flows with moving objects, Journal of Computational Physics, 215:2, (661-690), Online publication date: 1-Jul-2006.
- Abu-Khzam F, Langston M, Shanbhag P and Symons C (2006). Scalable Parallel Algorithms for FPT Problems, Algorithmica, 45:3, (269-284), Online publication date: 1-Jul-2006.
- Cera M, Pezzi G, Pilla M, Maillard N and Navaux P Scheduling dynamically spawned processes in MPI-2 Proceedings of the 12th international conference on Job scheduling strategies for parallel processing, (33-46)
- Flores-Becerra G, Garcia V and Vidal A Efficient parallel algorithm for constructing a unit triangular matrix with prescribed singular values Proceedings of the 7th international conference on High performance computing for computational science, (349-362)
- Alves J, Borges C and Filho A Distributed security constrained optimal power flow integrated to a DSM based energy management system for real time power systems security control Proceedings of the 7th international conference on High performance computing for computational science, (131-144)
- Ivanov L (2006). A modern course on parallel and distributed processing, Journal of Computing Sciences in Colleges, 21:6, (29-38), Online publication date: 1-Jun-2006.
- Jiang Z, Lin D, Xu L and Lin L Integrating grid with cryptographic computing Proceedings of the Second international conference on Information Security Practice and Experience, (321-331)
- Nakata K, Yamashita M, Fujisawa K and Kojima M (2006). A parallel primal-dual interior-point method for semidefinite programs using positive definite matrix completion, Parallel Computing, 32:1, (24-43), Online publication date: 1-Jan-2006.
- Lin Y and Su C (2005). Faster optimal parallel prefix circuits, Journal of Parallel and Distributed Computing, 65:12, (1585-1595), Online publication date: 1-Dec-2005.
- Allcock W, Bresnahan J, Kettimuthu R, Link M, Dumitrescu C, Raicu I and Foster I The Globus Striped GridFTP Framework and Server Proceedings of the 2005 ACM/IEEE conference on Supercomputing
- Mavriplis D, Aftosmis M and Berger M High Resolution Aerospace Applications using the NASA Columbia Supercomputer Proceedings of the 2005 ACM/IEEE conference on Supercomputing
- Li F, Chen G, Kandemir M and Karakoy M Exploiting last idle periods of links for network power management Proceedings of the 5th ACM international conference on Embedded software, (134-137)
- Rokicki J, Krause M and Wichulski M Parallelization of numerical CFD codes in cluster and grid environments Proceedings of the 6th international conference on Parallel Processing and Applied Mathematics, (825-830)
- Dalcín L, Paz R and Storti M (2005). MPI for Python, Journal of Parallel and Distributed Computing, 65:9, (1108-1115), Online publication date: 1-Sep-2005.
- Krivulin N, Guster D and Hall C Parallel implementation of a random search procedure Proceedings of the 5th WSEAS international conference on Simulation, modelling and optimization, (367-372)
- Tai C, Zhao Y and Liew K (2005). Parallel computation of unsteady incompressible viscous flows around moving rigid bodies using an immersed object method with overlapping grids, Journal of Computational Physics, 207:1, (151-172), Online publication date: 20-Jul-2005.
- Zhang Y, Wong D and Zheng W (2005). User-level checkpoint and recovery for LAM/MPI, ACM SIGOPS Operating Systems Review, 39:3, (72-81), Online publication date: 1-Jul-2005.
- Stanescu D, Kim D and Woyczynski W (2005). Numerical study of interacting particles approximation for integro-differential equations, Journal of Computational Physics, 206:2, (706-726), Online publication date: 1-Jul-2005.
- Numrich R (2005). Parallel numerical algorithms based on tensor notation and Co-Array Fortran syntax, Parallel Computing, 31:6, (588-607), Online publication date: 1-Jun-2005.
- Bahi J, Contassot-Vivier S and Couturier R (2005). Evaluation of the asynchronous iterative algorithms in the context of distant heterogeneous clusters, Parallel Computing, 31:5, (439-461), Online publication date: 1-May-2005.
- Kengue J, Valtchev P and Djamegni C A parallel algorithm for lattice construction Proceedings of the Third international conference on Formal Concept Analysis, (249-264)
- O'Cearbhaill E and O'Mahony M (2005). Parallel implementation of a transportation network model, Journal of Parallel and Distributed Computing, 65:1, (1-14), Online publication date: 1-Jan-2005.
- Becerra G and Maciá A Parallel global and local convergent algorithms for solving the inverse additive singular value problem Proceedings of the 4th WSEAS International Conference on Systems Theory and Scientific Computation, (1-6)
- Dean J and Ghemawat S MapReduce Proceedings of the 6th conference on Symposium on Operating Systems Design & Implementation - Volume 6, (10-10)
- Hammond L, Carlstrom B, Wong V, Hertzberg B, Chen M, Kozyrakis C and Olukotun K (2004). Programming with transactional coherence and consistency (TCC), ACM SIGOPS Operating Systems Review, 38:5, (1-13), Online publication date: 1-Dec-2004.
- Hammond L, Carlstrom B, Wong V, Hertzberg B, Chen M, Kozyrakis C and Olukotun K (2004). Programming with transactional coherence and consistency (TCC), ACM SIGARCH Computer Architecture News, 32:5, (1-13), Online publication date: 1-Dec-2004.
- Hammond L, Carlstrom B, Wong V, Hertzberg B, Chen M, Kozyrakis C and Olukotun K (2004). Programming with transactional coherence and consistency (TCC), ACM SIGPLAN Notices, 39:11, (1-13), Online publication date: 1-Nov-2004.
- Hammond L, Carlstrom B, Wong V, Hertzberg B, Chen M, Kozyrakis C and Olukotun K Programming with transactional coherence and consistency (TCC) Proceedings of the 11th international conference on Architectural support for programming languages and operating systems, (1-13)
- Ostromsky T, Dimov I and Zlatev Z Parallel implementation and one year experiments with the danish eulerian model Proceedings of the Third international conference on Numerical Analysis and its Applications, (440-447)
- Arnal J, Migallón H, Migallón V and Penadés J Parallel newton iterative methods based on incomplete LU factorizations for solving nonlinear systems Proceedings of the 6th international conference on High Performance Computing for Computational Science, (716-729)
- Arias E and Hernández V Numerical integration of the differential riccati equation Proceedings of the 6th international conference on High Performance Computing for Computational Science, (671-684)
- Peinado J and Vidal A Three parallel algorithms for solving nonlinear systems and optimization problems Proceedings of the 6th international conference on High Performance Computing for Computational Science, (657-670)
- García V, Vidal V, Verdú G, Garayoa J and Miró R Parallel resolution of the two-group time dependent neutron diffusion equation with public domain ODE codes Proceedings of the 6th international conference on High Performance Computing for Computational Science, (368-381)
- Huang W and Tafti D (2004). A Parallel Adaptive Mesh Refinement Algorithm for Solving Nonlinear Dynamical Systems, International Journal of High Performance Computing Applications, 18:2, (171-181), Online publication date: 1-May-2004.
- Martin D, Wilsey P, Hoekstra R, Keiter E, Hutchinson S, Russo T and Waters L Scheduling Optimization on the Simbus Backplane Proceedings of the 37th annual symposium on Simulation
- Zomaya A, Patterson D and Olariu S (2004). Sequential and Parallel Meta-Heuristics for Solving the Single Row Routing Problem, Cluster Computing, 7:2, (123-139), Online publication date: 1-Apr-2004.
- Oliveira C and Pardalos P (2004). Randomized parallel algorithms for the multidimensional assignment problem, Applied Numerical Mathematics, 49:1, (117-133), Online publication date: 1-Apr-2004.
- Welsh M and Mainland G Programming sensor networks using abstract regions Proceedings of the 1st conference on Symposium on Networked Systems Design and Implementation - Volume 1, (3-3)
- Grimshaw A, Humphrey M and Natrajan A (2004). A philosophical and technical comparison of Legion and Globus, IBM Journal of Research and Development, 48:2, (233-254), Online publication date: 1-Mar-2004.
- Øian E, Garrido I, Chaib M, Fladmark G and Espedal M (2004). Modeling fractured and faulted regions, Computing and Visualization in Science, 6:2-3, (123-129), Online publication date: 1-Mar-2004.
- Fernández J, Carazo J and García I (2004). Three-dimensional reconstruction of cellular structures by electron microscope tomography and parallel computing, Journal of Parallel and Distributed Computing, 64:2, (285-300), Online publication date: 1-Feb-2004.
- References Grid resource management, (507-566)
- Wolf F and Mohr B Specifying performance properties of parallel applications using compound events On-line monitoring systems and computer tool interoperability, (91-110)
- Gorlatch S (2004). Send-receive considered harmful, ACM Transactions on Programming Languages and Systems, 26:1, (47-56), Online publication date: 1-Jan-2004.
- Li H, Rilling J and Goswami D (2004). Granularity-Driven Dynamic Predicate Slicing Algorithms for Message Passing Systems, Automated Software Engineering, 11:1, (63-89), Online publication date: 1-Jan-2004.
- Lin Y and Hsiao J (2004). A new approach to constructing optimal parallel prefix circuits with small depth, Journal of Parallel and Distributed Computing, 64:1, (97-107), Online publication date: 1-Jan-2004.
- Kaiser T (2003). A methodology for creating large modules, ACM SIGPLAN Fortran Forum, 22:3, (11-24), Online publication date: 1-Dec-2003.
- Tai C and Zhao Y (2003). Parallel unsteady incompressible viscous flow computations using an unstructured multigrid method, Journal of Computational Physics, 192:1, (277-311), Online publication date: 20-Nov-2003.
- Michailidis P and Margaritis K (2003). Performance evaluation of load balancing strategies for approximate string matching application on an MPI cluster of heterogeneous workstations, Future Generation Computer Systems, 19:7, (1075-1104), Online publication date: 1-Oct-2003.
- Govett M, Hart L, Henderson T, Middlecoff J and Schaffer D (2003). The scalable modeling system, Parallel Computing, 29:8, (995-1020), Online publication date: 1-Aug-2003.
- Choi E and Cicci D (2003). Analysis of GPS static positioning problems, Applied Mathematics and Computation, 140:1, (37-51), Online publication date: 30-Jul-2003.
- Gerndt A, Reimersdahl T, Kuhlen T, Bischof C, Hörschler I, Meinke M and Schröder W (2003). Large-Scale CFD Data Handling in a VR-Based Otorhinolaryngological CAS-System using a Linux-Cluster, The Journal of Supercomputing, 25:2, (143-154), Online publication date: 1-Jun-2003.
- Shires D and Mohan R (2003). Optimization and Performance of a Fortran 90 MPI-Based Unstructured Code on Large-Scale Parallel Systems, The Journal of Supercomputing, 25:2, (131-141), Online publication date: 1-Jun-2003.
- Huang S, Aubanel E and Bhavsar V Mesh partitioners for computational grids Proceedings of the 2003 international conference on Computational science and its applications: PartIII, (60-68)
- Martin D, Wilsey P, Hoekstra R, Keiter E, Hutchinson S, Russo T and Waters L Redesigning the WARPED Simulation Kernel for Analysis and Application Development Proceedings of the 36th annual symposium on Simulation
- Lin Y, Hsu Y and Liu C (2003). Constructing H4, a Fast Depth-Size Optimal Parallel Prefix Circuit, The Journal of Supercomputing, 24:3, (279-304), Online publication date: 1-Mar-2003.
- Georgiou S, Koukouvinos C and Seberry J Hadamard matrices, orthogonal designs and construction algorithms DESIGNS 2002, (133-205)
- Gorlatch S SAT Patterns and skeletons for parallel and distributed computing, (29-63)
- Padua D and Hoeflinger J Supercomputers Encyclopedia of Computer Science, (1710-1718)
- Quinn M and Miller R Parallel processing Encyclopedia of Computer Science, (1349-1365)
- Song Z and Roussopoulos N (2002). Using Hilbert curve in image storing and retrieving, Information Systems, 27:8, (523-536), Online publication date: 1-Dec-2002.
- Gong L, Sun X and Watson E (2002). Performance Modeling and Prediction of Nondedicated Network Computing, IEEE Transactions on Computers, 51:9, (1041-1055), Online publication date: 1-Sep-2002.
- Koulopoulos D, Papoutsis K, Goulas G and Housos E (2002). PLEIADES, Software—Practice & Experience, 32:11, (1035-1049), Online publication date: 1-Sep-2002.
- Duato J, Yalamanchili S and Ni L (2002). Interconnection Networks, 10.5555/2821578, Online publication date: 6-Aug-2002.
- Bergen D and Miller B (2002). Building an MPI cluster, XRDS: Crossroads, The ACM Magazine for Students, 8:5, (2-2), Online publication date: 1-Aug-2002.
- Taylor V, Wu X, Geisler J and Stevens R Using Kernel Couplings to Predict Parallel Application Performance Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing
- Dingle N, Harrison P and Knottenbelt W Response time densities in generalised stochastic petri net models Proceedings of the 3rd international workshop on Software and performance, (46-54)
- Peinado J and Vidal A A parallel Newton-GMRES algorithm for solving large scale nonlinear systems Proceedings of the 5th international conference on High performance computing for computational science, (328-342)
- Fernández J, Cañas A, Díaz A, González J, Ortega J and Prieto A Performance of message-passing MATLAB toolboxes Proceedings of the 5th international conference on High performance computing for computational science, (228-242)
- Hendriks E BProc Proceedings of the 16th international conference on Supercomputing, (129-136)
- Harrison P and Knottenbelt W Passage time distributions in large Markov chains Proceedings of the 2002 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, (77-85)
- Lampoudi S and Beazley D SWILL Proceedings of the FREENIX Track: 2002 USENIX Annual Technical Conference, (19-27)
- Harrison P and Knottenbelt W (2002). Passage time distributions in large Markov chains, ACM SIGMETRICS Performance Evaluation Review, 30:1, (77-85), Online publication date: 1-Jun-2002.
- Stankovic N and Zhang K (2002). A Distributed Parallel Programming Framework, IEEE Transactions on Software Engineering, 28:5, (478-493), Online publication date: 1-May-2002.
- Németh Z and Sunderam V A Comparison of Conventional Distributed Computing Environments and Computational Grids Proceedings of the International Conference on Computational Science-Part II, (729-738)
- Dwivedula M, Hariri S and Parashar M A Software Design Model for Parallel Applications on Heterogeneous Systems Proceedings of the 16th International Parallel and Distributed Processing Symposium
- Silva J and Hughes A Monte Carlo Simulation of HIV-1 Evolution in Response to Selection by Antibodies Proceedings of the 16th International Parallel and Distributed Processing Symposium
- Goswami D, Singh A and Preiss B (2002). From Design Patterns to Parallel Architectural Skeletons, Journal of Parallel and Distributed Computing, 62:4, (669-695), Online publication date: 1-Apr-2002.
- Owczarz W and Zlatev Z (2002). Parallel matrix computations in air pollution modelling, Parallel Computing, 28:2, (355-368), Online publication date: 1-Feb-2002.
- Benner P, Byers R, Mayo R, Quintana-Ortí E and Hernández V (2002). Parallel Algorithms for LQ Optimal Control of Discrete-Time Periodic Linear Systems, Journal of Parallel and Distributed Computing, 62:2, (306-325), Online publication date: 1-Feb-2002.
- Zlatev Z Massive data set issues in air pollution modelling Handbook of massive data sets, (1169-1220)
- Evans D and Saman M Cluster computing with message-passing interface Highly parallel computations, (3-42)
- Gau C and Stadtherr M Parallel interval-Newton using message passing Proceedings of the 2001 ACM/IEEE conference on Supercomputing, (23-23)
- Getov V, von Laszewski G, Philippsen M and Foster I (2001). Multiparadigm communications in Java for grid computing, Communications of the ACM, 44:10, (118-125), Online publication date: 1-Oct-2001.
- Ostromsky T, Owczarz W and Zlatev Z Computational challenges in large-scale air pollution modelling Proceedings of the 15th international conference on Supercomputing, (407-418)
- Mezher D A graphical tool for driving the parallel computation of pseudospectra Proceedings of the 15th international conference on Supercomputing, (270-276)
- Demaine E, Foster I, Kesselman C and Snir M (2001). Generalized Communicators in the Message Passing Interface, IEEE Transactions on Parallel and Distributed Systems, 12:6, (610-616), Online publication date: 1-Jun-2001.
- Chakrabarti D and Banerjee P (2001). Static Single Assignment Form for Message-Passing Programs, International Journal of Parallel Programming, 29:2, (139-184), Online publication date: 1-Apr-2001.
- González P, Cabaleiro J and Pena T (2001). Parallel Computation of Wavelet Transforms Using the Lifting Scheme, The Journal of Supercomputing, 18:2, (141-152), Online publication date: 1-Feb-2001.
- Owczarz W and Zlatev Z Running a large air pollution model on an IBM SMP computer Practical parallel computing, (57-77)
- Schumacher M (2001). Objective coordination in multi-agent system engineering, 10.5555/1791542, Online publication date: 1-Jan-2001.
- Gan B, Liu L, Jain S, Turner S, Cai W and Hsu W Manufacturing supply chain management Proceedings of the 32nd conference on Winter simulation, (1245-1251)
- Jian L, Yingjun L, Xiaoxing M, Min C, Xianping T, Guanqun Z and Jianzhong L (2000). A hierarchical framework for parallel seismic applications, Communications of the ACM, 43:10, (55-59), Online publication date: 1-Oct-2000.
- Kim V, Banerjee P and De K Fine-Grained Parallel VLSI Synthesis for Commercial CAD on a Network of Workstations Proceedings of the 2000 International Conference on Parallel Processing
- Hereld M, Judson I and Stevens R (2000). Tutorial, IEEE Computer Graphics and Applications, 20:4, (22-28), Online publication date: 1-Jul-2000.
- Roerdink J and Meijster A (2000). The Watershed Transform: Definitions, Algorithms and Parallelization Strategies, Fundamenta Informaticae, 41:1,2, (187-228), Online publication date: 1-Apr-2000.
- Nyland L, Prins J, Goldberg A and Mills P (2000). A Design Methodology for Data-Parallel Applications, IEEE Transactions on Software Engineering, 26:4, (293-314), Online publication date: 1-Apr-2000.
- Gorlatch S (2000). Toward Formally-Based Design of Message Passing Programs, IEEE Transactions on Software Engineering, 26:3, (276-288), Online publication date: 1-Mar-2000.
- Benner P, Castillo M, Quintana-Ortí E and Hernández V (2000). Parallel Partial Stabilizing Algorithms for Large Linear Control Systems, The Journal of Supercomputing, 15:2, (193-206), Online publication date: 1-Feb-2000.
- Shang J, Wagner M, Pan Y and Blake D (2000). Strategies for Adopting FVTD on Multicomputers, Computing in Science and Engineering, 2:1, (10-21), Online publication date: 1-Jan-2000.
- Kipfer P and Slusallek P Transparent distributed processing for rendering Proceedings of the 1999 IEEE symposium on Parallel visualization and graphics, (39-46)
- Andersen P (1999). The Texas Tech tornado cluster, XRDS: Crossroads, The ACM Magazine for Students, 6:1, (28-32), Online publication date: 1-Sep-1999.
- Zaki O, Lusk E, Gropp W and Swider D (1999). Toward Scalable Performance Visualization with Jumpshot, International Journal of High Performance Computing Applications, 13:3, (277-288), Online publication date: 1-Aug-1999.
- Darema F (1999). New Software Technologies for the Development and Runtime Support of Complex Applications, International Journal of High Performance Computing Applications, 13:3, (180-190), Online publication date: 1-Aug-1999.
- Shah M and Kant T (1999). Finite element analysis of fibre reinforced polymer shells using higher-order shear deformation theories on parallel distributed memory machines, International Journal of Computer Applications in Technology, 12:2-5, (206-210), Online publication date: 1-Jul-1999.
- Kaper H, Wiebel E and Tipei S (1999). Data Sonification and Sound Visualization, Computing in Science and Engineering, 1:4, (48-58), Online publication date: 1-Jul-1999.
- Prabhakaran P and Banerjee P (1999). Parallel Algorithms for Force Directed Scheduling of Flattened and Hierarchical Signal Flow Graphs, IEEE Transactions on Computers, 48:7, (762-768), Online publication date: 1-Jul-1999.
- Goudreau M, Lang K, Rao S, Suel T and Tsantilas T (1999). Portable and Efficient Parallel Computing Using the BSP Model, IEEE Transactions on Computers, 48:7, (670-689), Online publication date: 1-Jul-1999.
- Baptist L and Cormen T Multidimensional, multiprocessor, out-of-core FFTs with distributed memory and parallel disks (extended abstract) Proceedings of the eleventh annual ACM symposium on Parallel algorithms and architectures, (242-250)
- Lawrence R, Almasi G and Rushmeier H (1999). A Scalable Parallel Algorithm for Self-Organizing Maps with Applications to Sparse Data Mining Problems, Data Mining and Knowledge Discovery, 3:2, (171-195), Online publication date: 1-Jun-1999.
- Burge L and George K JMAS Proceedings of the 5th conference on USENIX Conference on Object-Oriented Technologies & Systems - Volume 5, (9-9)
- Sharma G, Radhakrishnan R, Rajasekaran U, Abu-Ghazaleh N and Wilsey P Time Warp simulation on clumps Proceedings of the thirteenth workshop on Parallel and distributed simulation, (174-181)
- Baratloo A, Dasgupta P, Karamcheti V and Kedem Z Metacomputing with MILAN Proceedings of the Eighth Heterogeneous Computing Workshop
- Karavanic K and Miller B Improving online performance diagnosis by the use of historical performance data Proceedings of the 1999 ACM/IEEE conference on Supercomputing, (42-es)
- Rao D, Thondugulam N, Radhakrishnan R and Wilsey P Unsynchronized parallel discrete event simulation Proceedings of the 30th conference on Winter simulation, (1563-1570)
- Grimshaw A, Ferrari A, Lindahl G and Holcomb K (1998). Metasystems, Communications of the ACM, 41:11, (46-55), Online publication date: 1-Nov-1998.
- Wei X and Ju J (1998). SFT, ACM SIGOPS Operating Systems Review, 32:4, (70-76), Online publication date: 1-Oct-1998.
- Díaz A and Kaltofen E FOXBOX Proceedings of the 1998 international symposium on Symbolic and algebraic computation, (30-37)
- Cohen J, Fraigniaud P, König J and Raspaud A (1998). Optimized Broadcasting and Multicasting Protocols in Cut-Through Routed Networks, IEEE Transactions on Parallel and Distributed Systems, 9:8, (788-802), Online publication date: 1-Aug-1998.
- Coddington P and Ko S Techniques for empirical testing of parallel random number generators Proceedings of the 12th international conference on Supercomputing, (282-288)
- Kimura T and Takemiya H Local area metacomputing for multidisciplinary problems Proceedings of the 12th international conference on Supercomputing, (149-156)
- Chetlur M, Abu-Ghazaleh N, Radhakrishnan R and Wilsey P (1998). Optimizing communication in time-warp simulators, ACM SIGSIM Simulation Digest, 28:1, (64-71), Online publication date: 1-Jul-1998.
- Chetlur M, Abu-Ghazaleh N, Radhakrishnan R and Wilsey P Optimizing communication in time-warp simulators Proceedings of the twelfth workshop on Parallel and distributed simulation, (64-71)
- Kim V and Banerjee P Parallel algorithms for power estimation Proceedings of the 35th annual Design Automation Conference, (672-677)
- Hung C, YarKhan A, Wong K, von Laven S and Coleman T Parallel implementation of an integrated edge-preserving smoothing algorithm in clusters of workstations Proceedings of the 36th annual ACM Southeast Conference, (20-22)
- Davis T Generating computer animations with frame coherence in a distributed computing environment Proceedings of the 36th annual ACM Southeast Conference, (1-7)
- Kale L (1998). Programming Languages for CSE, IEEE Computational Science & Engineering, 5:2, (18-26), Online publication date: 1-Apr-1998.
- Kurtz B, Kim C and Alsabbagh J (1998). Parallel computing in the undergraduate curriculum, ACM SIGCSE Bulletin, 30:1, (212-216), Online publication date: 1-Mar-1998.
- Kurtz B, Kim C and Alsabbagh J Parallel computing in the undergraduate curriculum Proceedings of the twenty-ninth SIGCSE technical symposium on Computer science education, (212-216)
- Yang C, Souter A and Pollock L (1998). All-du-path coverage for parallel programs, ACM SIGSOFT Software Engineering Notes, 23:2, (153-162), Online publication date: 1-Mar-1998.
- Yang C, Souter A and Pollock L All-du-path coverage for parallel programs Proceedings of the 1998 ACM SIGSOFT international symposium on Software testing and analysis, (153-162)
- Cormen T, Wegmann J and Nicol D Multiprocessor out-of-core FFTs with distributed memory and parallel disks (extended abstract) Proceedings of the fifth workshop on I/O in parallel and distributed systems, (68-78)
- Hawick K and James H Distributed high-performance computation for remote sensing Proceedings of the 1997 ACM/IEEE conference on Supercomputing, (1-13)
- Tobis M, Schafer C, Foster I, Jacob R and Anderson J FOAM Proceedings of the 1997 ACM/IEEE conference on Supercomputing, (1-15)
- Blackston D and Suel T Highly portable and efficient implementations of parallel adaptive N-body methods Proceedings of the 1997 ACM/IEEE conference on Supercomputing, (1-20)
- Krishnaswamy D and Banerjee P Exploiting task and data parallelism in parallel Hough and Radon transforms Proceedings of the International Conference on Parallel Processing
- Cordsen J, Pohl H and Schröder-Preikschat W Performance considerations in software multicasts Proceedings of the 11th international conference on Supercomputing, (213-220)
- Smith D and El-Rewini H Static Scheduling of MPI Tasks Proceedings of the 2nd IEEE Symposium on Computers and Communications (ISCC '97)
- Krishnaswamy D, Banerjee P, Rudnick E and Patel J (1997). Asynchronous parallel algorithms for test set partitioned fault simulation, ACM SIGSIM Simulation Digest, 27:1, (30-37), Online publication date: 1-Jul-1997.
- Subhlok J and Yang B (1997). A new model for integrated nested task and data parallel programming, ACM SIGPLAN Notices, 32:7, (1-12), Online publication date: 1-Jul-1997.
- Subhlok J and Yang B A new model for integrated nested task and data parallel programming Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming, (1-12)
- Lu P Implementing optimized distributed data sharing using scoped behaviour and a class library Proceedings of the 3rd conference on USENIX Conference on Object-Oriented Technologies (COOTS) - Volume 3, (11-11)
- Krishnaswamy D, Banerjee P, Rudnick E and Patel J Asynchronous parallel algorithms for test set partitioned fault simulation Proceedings of the eleventh workshop on Parallel and distributed simulation, (30-37)
- Barth D and Fraigniaud P Approximation algorithms for structured communication problems Proceedings of the ninth annual ACM symposium on Parallel algorithms and architectures, (180-188)
- Xu H, Gui Y and Ni L (1997). Optimal Software Multicast in Wormhole-Routed Multistage Networks, IEEE Transactions on Parallel and Distributed Systems, 8:6, (597-607), Online publication date: 1-Jun-1997.
- Moreira J and Naik V (1997). Dynamic resource management on distributed systems using reconfigurable applications, IBM Journal of Research and Development, 41:3, (303-330), Online publication date: 1-May-1997.
- Stankovic N and Zhang K Visual parallel programming with Visper Proceedings of the High-Performance Computing on the Information Superhighway, HPC-Asia '97
- Krishnaswamy D, Rudnick E, Patel J and Banerjee P SPITFIRE Proceedings of the 15th IEEE VLSI Test Symposium
- Melin E, Raffin B, Rebeuf X and Virot B SCL-chan Proceedings of the 1997 Workshop on High-Level Programming Models and Supportive Environments (HIPS '97)
- Cohen J, Fraigniaud P, König J and Raspaud A Broadcasting and Multicasting in Cut-through Routed Networks Proceedings of the 11th International Symposium on Parallel Processing, (734-738)
- Levine D, Facello M, Hallstrom P, Reeder G, Walenz B and Stevens F (1997). Stalk, IEEE Computational Science & Engineering, 4:2, (55-65), Online publication date: 1-Apr-1997.
- Dillon E, Dos Santos C and Guyard J (1997). Teaching an engineering approach for network computing, ACM SIGCSE Bulletin, 29:1, (229-232), Online publication date: 1-Mar-1997.
- Dillon E, Dos Santos C and Guyard J Teaching an engineering approach for network computing Proceedings of the twenty-eighth SIGCSE technical symposium on Computer science education, (229-232)
- Taylor V, Chen J, Disz T, Papka M and Stevens R (1996). Interactive Virtual Reality in Simulations, IEEE Computational Science & Engineering, 3:4, (46-54), Online publication date: 1-Dec-1996.
- Foster I, Geisler J, Kesselman C and Tuecke S Multimethod communication for high-performance metacomputing applications Proceedings of the 1996 ACM/IEEE conference on Supercomputing, (41-es)
- Ogawa H and Matsuoka S OMPI Proceedings of the 1996 ACM/IEEE conference on Supercomputing, (37-es)
- Foster I, Kohr D, Krishnaiyer R and Choudhary A Double standards Proceedings of the 1996 ACM/IEEE conference on Supercomputing, (36-es)
- Bokhari S (1996). Multiphase Complete Exchange on Paragon, SP2, and CS-2, IEEE Parallel & Distributed Technology: Systems & Technology, 4:3, (45-59), Online publication date: 1-Sep-1996.
- Sundaresan N and Gannon D (1996). Coir, Journal of Parallel and Distributed Computing, 37:1, (98-105), Online publication date: 25-Aug-1996.
- Foster I, Kesselman C and Tuecke S (1996). The Nexus Approach to Integrating Multithreading and Communication, Journal of Parallel and Distributed Computing, 37:1, (70-82), Online publication date: 25-Aug-1996.
- Foster I, Geisler J, Nickless B, Smith W and Tuecke S Software infrastructure for the I-WAY high-performance distributed computing experiment Proceedings of the 5th IEEE International Symposium on High Performance Distributed Computing
- Grimshaw A and Wulf W Legion-a view from 50,000 feet Proceedings of the 5th IEEE International Symposium on High Performance Distributed Computing
- Dongarra J, Otto S, Snir M and Walker D (1996). A message passing standard for MPP and workstations, Communications of the ACM, 39:7, (84-90), Online publication date: 1-Jul-1996.
- Taylor V, Chen J, Huang M, Canfield T and Stevens R (1996). Identifying and Reducing Critical Lag in Finite Element Simulations, IEEE Computer Graphics and Applications, 16:4, (67-71), Online publication date: 1-Jul-1996.
- Goudreau M, Lang K, Rao S, Suel T and Tsantilas T Towards efficiency and portability Proceedings of the eighth annual ACM symposium on Parallel Algorithms and Architectures, (1-12)
- Kwang K and Arakawa M (1996). Benchmark Evaluation of the IBM SP2 for Parallel Signal Processing, IEEE Transactions on Parallel and Distributed Systems, 7:5, (522-536), Online publication date: 1-May-1996.
- Beckman P and Gannon D Tulip Proceedings of the 10th International Parallel Processing Symposium, (532-536)
- Agarwal R, Balle S, Gustavson F, Joshi M and Palkar P (1995). A three-dimensional approach to parallel matrix multiplication, IBM Journal of Research and Development, 39:5, (575-582), Online publication date: 1-Sep-1995.
- Xu H, Gui Y and Ni L Optimal software multicast in wormhole-routed multistage networks Proceedings of the 1994 ACM/IEEE conference on Supercomputing, (703-712)
Index Terms
- Using MPI: portable parallel programming with the message-passing interface
Recommendations
MPI + MPI: a new hybrid approach to parallel programming with MPI plus shared memory
Hybrid parallel programming with the message passing interface (MPI) for internode communication in conjunction with a shared-memory programming model to manage intranode parallelism has become a dominant approach to scalable parallel programming. While ...
MT-MPI: multithreaded MPI for many-core environments
ICS '14: Proceedings of the 28th ACM international conference on Supercomputing
Many-core architectures, such as the Intel Xeon Phi, provide dozens of cores and hundreds of hardware threads. To utilize such architectures, application programmers are increasingly looking at hybrid programming models, where multiple threads interact ...