Nothing Special   »   [go: up one dir, main page]

skip to main content
10.5555/602770.602886acmconferencesArticle/Chapter ViewAbstractPublication PagesscConference Proceedingsconference-collections
Article
Free access

Optimal software multicast in wormhole-routed multistage networks

Published: 14 November 1994 Publication History

Abstract

Multistage interconnection networks are a popular class of interconnection architecture for constructing scalable parallel computers (SPCs). The focus of this paper is on wormhole routed multistage networks supporting turnaround routing. Existing machines characterized by such a system model include the IBM SP-1, TMC CM-5, and Meiko CS-2.Efficient collective communication among processor nodes is critical to the performance of SPCs. A system-level multicast service, in which the same message is delivered from a source node to an arbitrary number of destination nodes, is fundamental in supporting collective communication primitives including the application-level broadcast, reduction, and barrier synchronization. This paper addresses how to efficiently implement multicast services in wormhole-routed multistage networks, in the absence of hardware multicast support, by exploiting the properties of the switching technology. An optimal multicast algorithm is proposed. The results of implementations on a 64-node SP-1 show that the proposed algorithm significantly outperforms the application-level broadcast primitives provided by currently existing collective communication libraries including the public domain MPI.

References

[1]
Message Passing Interface Forum, "MPI: A Message-Passing Interface Standard," tech. rep., University of Tennessee, Mar. 1994.
[2]
H. Xu, P. K. McKinley, and L. M. Ni, "Efficient implementation of barrier synchronization in wormhole-routed hypercube multicomputers," Journal of Parallel and Distributed Computing, vol. 16, pp. 172 - 184, October 1992.
[3]
H. Xu, E. T. Kalns, P. K. McKinley, and L. M. Ni, "ComPaSS: A communication package for scalable software design," accepted to appear in Journal of Parallel and Distributed Computing, 1994.
[4]
P. K. McKinley, H. Xu, A. H. Esfahanian, and L. M. Ni, "Unicast-based multicast communication in wormhole-routed direct networks," in Proceedings of the 1992 International Conference on Parallel Processing, vol. II, pp. 10-19, Aug. 1992.
[5]
A. Bar-Noy, J. Bruck, C.-T. Ho, S. Kipnis, and B. Schieber, "Computing global combine operations in the multi-port postal model," in Proceedings of the fifth IEEE symposium on parallel and distributed processing, pp. 336-343, Dec. 1993.
[6]
D. F. Robinson, D. Judd, P. K. McKinley, and B. H. C. Cheng, "Efficient collective data distribution in all-port wormhole-routed hypercubes," in Proceedings of Supercomputing '93, pp. 792-801, Nov. 1993.
[7]
C.-T. Ho and M.-Y. Kao, "Optimal broadcast on hypercubes with wormhole and E-cube routings," in Proceedings of the 1993 International Conference on Parallel and Distributed Systems, pp. 694-697, 1992.
[8]
W. D. Gropp, E. Lusk, and A. Skjellum, Using MPI: Portable Parallel Programming with the Message-Passing Interface. MIT Press, 1994.
[9]
W. J. Dally and C. L. Seitz, "The torus routing chip," Journal of Distributed Computing, vol. 1, no. 3, pp. 187-196, 1986.
[10]
C. E. Leiserson et al., "The network architecture of the Connection Machine CM-5," in Proceedings of the ACM Symposium on Parallel Algorithms and Architectures, (San Diego, CA.), pp. 272-285, Association for Computing Machinery, 1992.
[11]
Meiko Limited, Waltham, MA., Computing Surface: CS-2 Communications Networks, 1993.
[12]
C. B. Stunkel et al., "Architecture and implementation of Vulcan," in Proc. of the 8th International Parallel Processing Symposium, pp. 268-274, Apr. 1994.
[13]
L. M. Ni and P. K. McKinley, "A survey of wormhole routing techniques in direct networks," IEEE Computer, vol. 26, pp. 62 - 76, Feb. 1993.
[14]
C. E. Leiserson, "Fat-trees: Universal networks for hardware-efficient supercomputing," IEEE Transactions on Computers, vol. C-34, pp. 892-901, Oct. 1985.
[15]
L. M. Ni, Y. Gui, and S. Q. He, "Performance evaluation of multistage wormhole networks with turnaround routing," Tech. Rep. MSU-CPS-ACS-96, Michigan State University, July 1994.
[16]
H. Xu, Y.-D. Gui, and L. M. Ni, "Optimal software mulitcast in wormhole-routed multistage networks," Tech. Rep. MSU-CPS-ACS-91, Michigan State University, Mar. 1994.
[17]
W. Gropp, E. Lusk, and S. Pieper, "Users Guide for the ANL IBM SP-1 DRAFT," Tech. Rep. ANL/MCS-TM-00, Argonne National Laboratory, Feb. 1994.
[18]
W. Gropp and B. Smith, "Users manual for the Chameleon parallel programming tools," Tech. Rep. ANL-93/23, Argonne National Laboratory, June 1993.
[19]
H. Franke, "MPI-F: An MPI implementation for IBM SP-1," Feb. 1994. Available on anonymous ftp from info.mcs.anl.gov.

Cited By

View all
  • (1999)Asynchronous Tree-Based Multicasting in Wormhole-Switched MINsIEEE Transactions on Parallel and Distributed Systems10.1109/71.80957410:11(1159-1178)Online publication date: 1-Nov-1999
  • (1998)Efficient Broadcast and Multicast on Multistage Interconnection Networks Using Multiport EncodingIEEE Transactions on Parallel and Distributed Systems10.1109/71.7305299:10(1004-1028)Online publication date: 1-Oct-1998
  • (1997)A Reliable Hardware Barrier Synchronization SchemeProceedings of the 11th International Symposium on Parallel Processing10.5555/645607.661666(274-280)Online publication date: 1-Apr-1997
  • Show More Cited By
  1. Optimal software multicast in wormhole-routed multistage networks

    Recommendations

    Comments

    Please enable JavaScript to view thecomments powered by Disqus.

    Information & Contributors

    Information

    Published In

    cover image ACM Conferences
    Supercomputing '94: Proceedings of the 1994 ACM/IEEE conference on Supercomputing
    November 1994
    840 pages
    ISBN:0818666056

    Sponsors

    Publisher

    IEEE Computer Society Press

    Washington, DC, United States

    Publication History

    Published: 14 November 1994

    Check for updates

    Qualifiers

    • Article

    Conference

    SC '94
    Sponsor:

    Acceptance Rates

    Overall Acceptance Rate 1,516 of 6,373 submissions, 24%

    Contributors

    Other Metrics

    Bibliometrics & Citations

    Bibliometrics

    Article Metrics

    • Downloads (Last 12 months)18
    • Downloads (Last 6 weeks)7
    Reflects downloads up to 02 Oct 2024

    Other Metrics

    Citations

    Cited By

    View all
    • (1999)Asynchronous Tree-Based Multicasting in Wormhole-Switched MINsIEEE Transactions on Parallel and Distributed Systems10.1109/71.80957410:11(1159-1178)Online publication date: 1-Nov-1999
    • (1998)Efficient Broadcast and Multicast on Multistage Interconnection Networks Using Multiport EncodingIEEE Transactions on Parallel and Distributed Systems10.1109/71.7305299:10(1004-1028)Online publication date: 1-Oct-1998
    • (1997)A Reliable Hardware Barrier Synchronization SchemeProceedings of the 11th International Symposium on Parallel Processing10.5555/645607.661666(274-280)Online publication date: 1-Apr-1997
    • (1997)Architecture-Dependent Tuning of the Parameterized Communication Model for Optimal MulticastingProceedings of the 11th International Symposium on Parallel Processing10.5555/645607.661656(578-582)Online publication date: 1-Apr-1997
    • (1997)Sufficient Conditions for Optimal Multicast CommunicationProceedings of the international Conference on Parallel Processing10.5555/645533.656509Online publication date: 11-Aug-1997
    • (1997)Performance Analysis of Multistage Interconnection Networks using a Multicast AlgorithmProceedings of the High-Performance Computing on the Information Superhighway, HPC-Asia '9710.5555/523549.822921Online publication date: 28-Apr-1997
    • (1997)Implementing multidestination worms in switch-based parallel systemsACM SIGARCH Computer Architecture News10.1145/384286.26412925:2(50-61)Online publication date: 1-May-1997
    • (1997)Implementing multidestination worms in switch-based parallel systemsProceedings of the 24th annual international symposium on Computer architecture10.1145/264107.264129(50-61)Online publication date: 1-Jun-1997
    • (1997)Performance Evaluation of Switch-Based Wormhole NetworksIEEE Transactions on Parallel and Distributed Systems10.1109/71.5982748:5(462-474)Online publication date: 1-May-1997
    • (1996)A Class of Interconnection Networks for MulticastingProceedings of the 10th International Parallel Processing Symposium10.5555/645606.661164(796-802)Online publication date: 15-Apr-1996

    View Options

    View options

    PDF

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Get Access

    Login options

    Media

    Figures

    Other

    Tables

    Share

    Share

    Share this Publication link

    Share on social media