14th ICS 2000: Santa Fe, New Mexico, USA
- John Reynders, Alexander V. Veidenbaum:
Proceedings of the 14th international conference on Supercomputing, ICS 2000, Santa Fe, NM, USA, May 8-11, 2000. ACM 2000, ISBN 1-58113-270-0
Java Compilation and Performance
- Pedro V. Artigas, Manish Gupta, Samuel P. Midkiff, José E. Moreira:
Automatic loop transformations and parallelization for Java. 1-10
- Renato Ferreira, Gagan Agrawal, Joel H. Saltz:
Compiling object-oriented data intensive applications. 11-21
- Tao Li, Lizy Kurian John, Narayanan Vijaykrishnan, Anand Sivasubramaniam, Jyotsna Sabarinathan, Anupama Murthy:
Using complete system simulation to characterize SPECjvm98 benchmarks. 22-33
Interconnection Networks/Network Processors
- José Flich, Manuel P. Malumbres, Pedro López, José Duato:
Performance evaluation of a new routing strategy for irregular networks with source routing. 34-43
- Valentin Puente, Cruz Izu, José A. Gregorio, Ramón Beivide, J. M. Prellezo, Fernando Vallejo:
Improving parallel system performance by changing the arrangement of the network links. 44-53
- Patrick Crowley, Marc E. Fiuczynski, Jean-Loup Baer, Brian N. Bershad:
Characterizing processor architectures for programmable network interfaces. 54-65
Sparse Compilation Techniques
- Hao Yu, Lawrence Rauchwerger:
Adaptive reduction parallelization techniques. 66-77
- Eladio Gutiérrez, Oscar G. Plata, Emilio L. Zapata:
A compiler method for the parallel execution of irregular reductions in scalable shared memory multiprocessors. 78-87
- Nikolay Mateev, Keshav Pingali, Paul Stodghill, Vladimir Kotlyar:
Next-generation generic programming and its application to sparse matrix computations. 88-99
MP Scheduling, Load Balancing, Memory Management
- Yanyong Zhang, Anand Sivasubramaniam, José E. Moreira, Hubertus Franke:
A simulation-based study of scheduling mechanisms for a dynamic cluster environment. 100-109
- Karen D. Devine, Bruce Hendrickson, Erik G. Boman, Matthew St. John, Courtenay T. Vaughan:
Design of dynamic load-balancing tools for parallel applications. 110-118
- Dimitrios S. Nikolopoulos, Theodore S. Papatheodorou, Constantine D. Polychronopoulos, Jesús Labarta, Eduard Ayguadé:
A case for user-level dynamic page migration. 119-130
Compilation I
- Ken Kennedy:
Fast greedy weighted fusion. 131-140
- Nawaaz Ahmed, Nikolay Mateev, Keshav Pingali:
Synthesizing transformations for locality enhancement of imperfectly-nested loop nests. 141-152
- Vivek Sarkar:
Optimized unrolling of nested loops. 153-166
Memory Hierarchy
- Chengqiang Zhang, Sally A. McKee:
Hardware-only stream prefetching and dynamic access ordering. 167-175
- Chia-Lin Yang, Alvin R. Lebeck:
Push vs. pull: data movement for linked data structures. 176-186
- Cheol Ho Park, JaeWoong Chung, Byeong Hag Seong, Yangwoo Roh, Daeyeon Park:
Boosting superpage utilization with the shadow memory and the partial-subblock TLB. 187-195
Micro-Architecture
- Toshinori Sato, Itsujiro Arita:
Table size reduction for data value predictors by exploiting narrow width values. 196-205
- Srinivas Mantripragada, Alexandru Nicolau:
Using profiling to reduce branch misprediction costs on a dynamically scheduled processor. 206-214
Applications
- Dragan Mirkovic, Rishad Mahasoom, S. Lennart Johnsson:
An adaptive software library for fast Fourier transforms. 215-224
- Yun He, Chris H. Q. Ding:
Using accurate arithmetics to improve numerical reproducibility and stability in parallel applications. 225-234
Performance Evaluation and Modeling
- Patrick H. Worley:
Performance evaluation of the IBM SP and the Compaq AlphaServer SC. 235-244
- Jeffrey S. Vetter:
Performance analysis of distributed applications using automatic classification of communication inefficiencies. 245-254
- Mark M. Mathis, Nancy M. Amato, Marvin L. Adams:
A general performance model for parallel sweeps on orthogonal grids for particle transport calculations. 255-263
MP Potpourri
- Marius Pirvu, Laxmi N. Bhuyan:
Hardware spatial forwarding for widely shared data. 264-273
- Xiaohui Shen, Wei-keng Liao, Alok N. Choudhary, Gokhan Memik, Mahmut T. Kandemir, Sachin More, George K. Thiruvathukal, Arti Singh:
A novel application development environment for large-scale scientific computations. 274-283
- Junpei Niwa, Takashi Matsumoto, Kei Hiraki:
Comparative study of page-based and segment-based software DSM through compiler optimization. 284-295
Compilation II
- Suhyun Kim, Soo-Mook Moon, Jinpyo Park, Kemal Ebcioglu:
Unroll-based register coalescing. 296-305
- Gary M. Zoppetti, Gagan Agrawal, Lori L. Pollock, José Nelson Amaral, Xinan Tang, Guang R. Gao:
Automatic compiler techniques for thread coarsening for multithreaded architectures. 306-315
- Somnath Ghosh, Margaret Martonosi, Sharad Malik:
Automated cache optimizations using CME driven diagnosis. 316-326
Instruction-Level Parallelism
- Ramon Canal, Antonio González:
A low-complexity issue logic. 327-335
- Michael Gschwind, Kemal Ebcioglu, Erik R. Altman, Sumedh W. Sathaye:
Binary translation and architecture convergence issues for IBM System/390. 336-347