Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3583781.3590276acmconferencesArticle/Chapter ViewAbstractPublication PagesglsvlsiConference Proceedingsconference-collections
research-article
Open access

Optimize the TX Architecture of RDMA NIC for Performance Isolation in the Cloud Environment

Published: 05 June 2023 Publication History

Abstract

Remote Direct Memory Access (RDMA) is a promising technology for achieving low latency and high bandwidth access to remote memory. However, performance interference exists when multiple tenants share an RDMA Network Interface Card (RNIC) in the cloud environment. Although some initial studies have investigated the root cause and possible solutions to RDMA performance interference, there is no research to analyze and solve the performance interference from the RNIC architecture. Compared with the existing software approach, optimizing RNIC architecture can introduce less performance and CPU overhead. This paper addresses performance isolation by modeling, analyzing, and optimizing the transmit-side (TX) RNIC architecture. First, we introduce a baseline TX RNIC architecture to explain the existing performance interference. Then, we propose separate caching and slicing execution to avoid the bandwidth-sensitive tenants affecting latency-sensitive tenants. Later, we add isolated backpressure and adaptive Weighted Round-robin scheduling to ensure the bandwidth-sensitive tenants share the bandwidth equally. Our experiments show that these optimizations achieve near-optimal performance isolation.

References

[1]
fpgasystems. 2023. fpga-network-stack. https://github.com/fpgasystems/fpga-network-stack.
[2]
Yixiao Gao, Qiang Li, Lingbo Tang, Yongqing Xi, Pengcheng Zhang, Wenwen Peng, Bo Li, Yaohui Wu, Shaozong Liu, Lei Yan, et al. 2021. When Cloud Storage Meets RDMA. In NSDI. 519--533.
[3]
Chuanxiong Guo, Haitao Wu, Zhong Deng, Gaurav Soni, Jianxi Ye, Jitu Padhye, and Marina Lipshteyn. 2016. RDMA over commodity ethernet at scale. In Proceedings of the 2016 ACM SIGCOMM Conference. 202--215.
[4]
Anuj Kalia, Michael Kaminsky, and David G Andersen. 2014. Using RDMA efficiently for key-value services. In Proceedings of the 2014 ACM Conference on SIGCOMM. 295--306.
[5]
Ning Kang, Zhan Wang, Fan Yang, Xiaoxiao Ma, Zhenlong Ma, Guojun Yuan, and Guangming Tan. 2022. csRNA: Connection-Scalable RDMA NIC Architecture in Datacenter Environment. In 2022 IEEE 40th International Conference on Computer Design (ICCD). IEEE, 398--406.
[6]
NVIDIA Mellanox. 2023 a. Quality of Service. https://docs.nvidia.com/networking/pages/viewpage.action"pageId=37849129
[7]
NVIDIA Mellanox. 2023 b. Single Root IO Virtualization (SR-IOV). https://docs.nvidia.com/networking/pages/viewpage.action?pageId=12013542
[8]
Vishal Shrivastav. 2019. Fast, scalable, and programmable packet scheduler in hardware. In Proceedings of the ACM Special Interest Group on Data Communication. 367--379.
[9]
David Sidler, Zeke Wang, Monica Chiosa, Amit Kulkarni, and Gustavo Alonso. 2020. StRoM: smart remote memory. In Proceedings of the Fifteenth European Conference on Computer Systems. 1--16.
[10]
AMD Xilinx. 2023 a. AXI Verification IP. https://docs.xilinx.com/r/en-US/pg267-axi-vip
[11]
AMD Xilinx. 2023 b. ERNIC Core Overview. https://docs.xilinx.com/r/en-US/pg332-ernic/Xilinx-Embedded-RDMA-Enabled-NIC-v4.0-LogiCORE-IP-Product-Guide
[12]
Kong Xinhao, Chen Jingrong, Bai Wei, Xu Yechen, Elhaddad Mahmoud, Raindel Shachar, Padhye Jitendra, Lebeck Alvin R., and Zhuo Danyang. 2023. Understanding RDMA Microarchitecture Resources for Performance Isolation. In 20th USENIX Symposium on Networked Systems Design and Implementation (NSDI 23). USENIX Association, Boston, MA. https://www.usenix.org/conference/nsdi23/presentation/kong
[13]
Yiwen Zhang, Yue Tan, Brent Stephens, and Mosharaf Chowdhury. 2022. Justitia: Software multi-tenancy in hardware kernel-bypass networks. In USENIX NSDI.

Index Terms

  1. Optimize the TX Architecture of RDMA NIC for Performance Isolation in the Cloud Environment

      Recommendations

      Comments

      Please enable JavaScript to view thecomments powered by Disqus.

      Information & Contributors

      Information

      Published In

      cover image ACM Conferences
      GLSVLSI '23: Proceedings of the Great Lakes Symposium on VLSI 2023
      June 2023
      731 pages
      ISBN:9798400701252
      DOI:10.1145/3583781
      This work is licensed under a Creative Commons Attribution International 4.0 License.

      Sponsors

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      Published: 05 June 2023

      Check for updates

      Author Tags

      1. architecture design
      2. network interface card (nic)
      3. performance isolation
      4. remote direct memory access (rdma)

      Qualifiers

      • Research-article

      Funding Sources

      • Youth Innovation Promotion Association CAS
      • The Strategic Priority Research Program of the Chinese Academy of Sciences
      • The National Natural Science Foundation of China

      Conference

      GLSVLSI '23
      Sponsor:
      GLSVLSI '23: Great Lakes Symposium on VLSI 2023
      June 5 - 7, 2023
      TN, Knoxville, USA

      Acceptance Rates

      Overall Acceptance Rate 312 of 1,156 submissions, 27%

      Contributors

      Other Metrics

      Bibliometrics & Citations

      Bibliometrics

      Article Metrics

      • 0
        Total Citations
      • 743
        Total Downloads
      • Downloads (Last 12 months)560
      • Downloads (Last 6 weeks)120
      Reflects downloads up to 28 Nov 2024

      Other Metrics

      Citations

      View Options

      View options

      PDF

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader

      Login options

      Media

      Figures

      Other

      Tables

      Share

      Share

      Share this Publication link

      Share on social media