Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/2903150.2903168acmconferencesArticle/Chapter ViewAbstractPublication PagescfConference Proceedingsconference-collections
research-article

P-Socket: optimizing a communication library for a PCIe-based intra-rack interconnect

Published: 16 May 2016 Publication History

Abstract

Data centers require efficient, low-cost, flexible interconnects to manage the rapidly growing internal traffic generated by an increasingly diverse set of applications. To meet these requirements, data center networks are increasingly employing alternatives such as RapidIO, Freedom, and PCIe, which require fewer physical devices and/or have simpler protocols than more traditional interconnects. These networks offer raw high performance communication capabilities, but simply using them for conventional TCP/IP-based communication fails to realize the potential performance of the physical network. Here we analyze causes for this performance loss for the TCP/IP protocol over one such fabric, PCIe, and we explore a hardware/software solution that mitigates overheads and exploits PCIe's advanced features. The result is P-Socket, an efficient library that enables legacy socket applications to run without modification. Our experiments show that P-Socket achieves an end-to-end latency of 1.2μs and effective bandwidth of up to 2.87GB/s (out of a theoretical peak of 3.05GB/s).

References

[1]
P. Balaji, S. Narravula, K. Vaidyanathan, S. Krishnamoorthy, J. Wu, and D. K. Panda. Sockets Direct Procotol over InfiniBand in clusters: Is it beneficial? In Proc. IEEE International Symposium on Performance Analysis of Systems and Software, pages 28--35, Mar. 2004.
[2]
P. Balaji, H. V. Shah, and D. K. Panda. Sockets vs RDMA interface over 10-Gigabit networks: An in-depth analysis of the memory traffic bottleneck. In Proc. Workshop on Remote Direct Memory Access (RDMA): Applications, Implementations, and Technologies (RAIT), Sept. 2004.
[3]
P. Balaji, P. Shivam, P. Wyckoff, and D. Panda. High performance user level sockets over Gigabit Ethernet. In Proc. IEEE International Conference on Cluster Computing, pages 179--186, Sept. 2002.
[4]
N. J. Boden, D. Cohen, R. E. Felderman, A. E. Kulawik, C. L. Seizovic, and W.-K. Su. Myrinet: A Gigabit-per-Second local area network. Micro, 15(1):29--36, Dec. 1995.
[5]
P. Buonadonna, A. Geweke, and D. Culler. An implementation and analysis of the Virtual Interface Architecture. In Proc. ACM/IEEE Conference on Supercomputing, pages 1--15, Nov. 1998.
[6]
L. Chisvin. PCIe ready for datacenter role. http://www.eetimes.com/AUTHOR.ASP?SECTION_ID=36&DOC_ID=1319539, Sept. 2013.
[7]
J. Chu and V. Kashyap. Transmission of IP over InfiniBand (IPoIB). http://www.hjp.at/doc/rfc/rfc4391.html, Apr. 2006.
[8]
I. Corp. Intel 82599 10 Gigabit Ethernet Controller: Product brief. http://www.intel.com/content/www/us/en/ethernet-controllers/82599-10-gbe-controller-brief.html, Aug. 2009.
[9]
Dolphin Corp. Supersockets for linux: Overview. http://www.dolphinics.com/download/WHITEPAPERS/Dolphin_Express_IX_SuperSockets_for_Linux.pdf, Aug. 2013.
[10]
C. Dubnicki, A. Bilas, Y. Chen, S. Damianakis, and K. Li. VMMC-2: Efficient support for reliable, connection-oriented communication. In Proc. IEEE Hot Interconnects V, Aug. 1997.
[11]
W. Feng, P. Balaji, C. Baron, L. N. Bhuyan, and D. K. Panda. Performance characterization of a 10-Gigabit Ethernet TOE. In Proc. High Performance Interconnects, pages 58--63, Aug. 2005.
[12]
P. Grun. Introduction to InfiniBand™ for end users. https://cw.infinibandta.org/document/dl/7268, Apr. 2010.
[13]
S. Hefty. RSOCKETS: RDMA for dummies. In Proc. Open Fabrics Developer Workshop, Apr. 2013.
[14]
Y. Hoskote, B. A. Bloechel, G. E. Dermer, V. Erraguntla, D. Finan, J. Howard, D. Klowden, S. G. Naendra, G. Ruhl, J. W. Tschanz, S. Vangal, V. Veeramachaneni, H. Wilson, J. Wu, and N. Borkar. A TCP offload accelerator for 10 Gb/s Ethernet in 90-nm CMOS. Solid-State Circuits, 38(11):1866--1875, Feb. 2003.
[15]
R. Hou, T. Jiang, L. Zhang, P. Qi, J. Dong, H. Wang, X. Gu, and S. Zhang. Cost effective data center servers. In Proc. IEEE International Symposium on High Performance Computer Architecture, pages 179--187, Feb. 2013.
[16]
J. Hurwitz and W. Feng. End-to-end performance of 10-Gigabit Ethernet on commodity systems. IEEE Micro, 24(1):10--12, Jan.-Feb. 2004.
[17]
R. Jones. Care and feeding of Netperf 2.6.X. http://www.netperf.org/svn/netperf2/tags/netperf-2.6.0/doc/netperf.html, 2012.
[18]
V. Krishnan. Towards an integrated IO and clustering solution using PCI Express. In Proc. IEEE International Conference on Cluster Computing, pages 259--266, Sept. 2007.
[19]
M. Lauria, S. Pakin, and A. A. Chien. Efficient layering for high speed communication: Fast Messages 2.x. In Proc. IEEE High Performance Parallel and Distributed Computing, pages 10--20, July 1998.
[20]
R. Merritt. RapidIO nudges ARM into servers. http://www.eetimes.com/document.asp?doc_id=1318957, July 2013.
[21]
S. Pakin, M. Lauria, and A. Chien. High performance messaging on workstations: Illinois Fast Message (FM) for Myrinet. In Proc. ACM/IEEE Conference on High Performance Networking and Computing (Supercomputing), page 55, Dec. 1995.
[22]
F. Petrini, W. Feng, A. Hoisie, S. Coll, and E. Frachtenberg. The Quadrics network: High-performance clustering technology. Micro, 22(1):46--57, Nov. 2002.
[23]
PLX Technology, Inc. Expresslane PEX 8648-AA AB, and BB 48-lane/12-port PCI Express Gen 2 switch data book. PEX8648-SIL-PB-1.0, http://www.plxtech.com/products/expresslane/pex8648, Apr. 2009.
[24]
PLX Technology, Inc. Expresslane PEX 8619-BA 16-lane, 16-port PCI Express Gen 2 switch with DMA data book. PEX8619-SIL-PB-1.4, http://www.plxtech.com/products/expresslane/pex8619, Apr. 2010.
[25]
K. Pulo. Fun with LD_PRELOAD. https://nf.nci.org.au/training/talks/lca2009.pdf, Jan. 2009.
[26]
A. Rao. AMD | SeaMicro technology overview. http://www.seamicro.com/sites/default/files/SM_TO01_64_v2.7.pdf, Oct. 2012.
[27]
R. Recio. The coming decade of data center networking discontinuities. In Proc. IEEE International Conference on Computing, Networking and Communications (keynote), Feb. 2012.
[28]
J. Regula. Integrating rack level connectivity into a PCI Express switch. In Proc. Hot Chips: A Symposium on High Performance Chips, pages 259--266, Aug. 2013.
[29]
S. H. Rodrigues, T. E. Anderson, and D. E. Culler. High-performance local area communication with Fast Sockets. In Proc. USENIX Technical Conference, pages 257--274, Jan. 1997.
[30]
P. Shivam, P. Wyckoff, and D. K. Panda. EMP: Zero-copy OS-bypass NIC-driven Gigabit Ethernet Message Passing. In Proc. IEEE International Conference on Supercomputing, pages 49--49, Nov. 2001.
[31]
Tektronix. Tektronix pci express logic protocol analyzer. http://www.tek.com/datasheet/tla7sa00-series, 2013.
[32]
H. Tezuka, A. Hori, and Y. Ishikawa. PM: A high-performances communication library for multi-user parallel environments. Technical Report TR-96-015, Tsukuba Research Center, 1996.
[33]
TOP500 Supercomputer Site. Interconnect Family/Myrinet. http://www.top500.org/statistics/details/connfam/2, 2013.
[34]
T. von Eicken, V. Avula, A. Basu, and V. Buch. Low-latency communication over ATM networks using Active Messages. Micro, 15(1):46--53, Dec. 1995.
[35]
T. von Eicken, A. Basu, V. Buch, and W. Vogels. U-Net: A user-level network interface for parallel and distributed computing. In Proc. ACM Symposium on Operating System Principles, pages 40--53, Dec. 1995.

Cited By

View all
  • (2022)An ultra-low latency and compatible PCIe interconnect for rack-scale communicationProceedings of the 18th International Conference on emerging Networking EXperiments and Technologies10.1145/3555050.3569128(232-244)Online publication date: 30-Nov-2022
  • (2019)Compatibility enhancement and performance measurement for socket interface with PCIe interconnectionsHuman-centric Computing and Information Sciences10.1186/s13673-019-0170-09:1(1-18)Online publication date: 1-Dec-2019

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
CF '16: Proceedings of the ACM International Conference on Computing Frontiers
May 2016
487 pages
ISBN:9781450341288
DOI:10.1145/2903150
  • General Chairs:
  • Gianluca Palermo,
  • John Feo,
  • Program Chairs:
  • Antonino Tumeo,
  • Hubertus Franke
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 16 May 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. PCIe
  2. data-center servers
  3. rack interconnects
  4. sockets

Qualifiers

  • Research-article

Conference

CF'16
Sponsor:
CF'16: Computing Frontiers Conference
May 16 - 19, 2016
Como, Italy

Acceptance Rates

Overall Acceptance Rate 24 of 66 submissions, 36%

Upcoming Conference

CF '25

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)12
  • Downloads (Last 6 weeks)3
Reflects downloads up to 16 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2022)An ultra-low latency and compatible PCIe interconnect for rack-scale communicationProceedings of the 18th International Conference on emerging Networking EXperiments and Technologies10.1145/3555050.3569128(232-244)Online publication date: 30-Nov-2022
  • (2019)Compatibility enhancement and performance measurement for socket interface with PCIe interconnectionsHuman-centric Computing and Information Sciences10.1186/s13673-019-0170-09:1(1-18)Online publication date: 1-Dec-2019

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media