DOI: 10.1145/2486001.2486005

zUpdate: updating data center networks with zero loss

Published: 27 August 2013

Abstract

Datacenter networks (DCNs) are constantly evolving due to various updates such as switch upgrades and VM migrations. Each update must be carefully planned and executed to avoid disrupting the many mission-critical, interactive applications hosted in DCNs. The key challenge is the inherent difficulty of synchronizing changes across many devices, which may result in unforeseen transient link load spikes or even congestion. We present one primitive, zUpdate, that performs congestion-free network updates under asynchronous switch and traffic matrix changes. We formulate the update problem using a network model and apply the model to a variety of representative update scenarios in DCNs. We develop novel techniques to handle several practical challenges in realizing zUpdate, implement a zUpdate prototype on OpenFlow switches, and deploy it on a testbed that resembles a real DCN topology. Our results, from both real-world experiments and large-scale trace-driven simulations, show that zUpdate can effectively perform congestion-free updates in production DCNs.
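
The abstract does not spell out the network model, so the following is only a rough illustration of why unsynchronized changes can cause transient congestion, not the formulation from the paper. It assumes that, while an update is in progress, each flow independently places either its old or its new load on a link, so the worst-case transient load on a link is the sum over flows of the larger of the two. All names here (worst_case_link_loads, congested_links, the flow and link labels, the capacities) are hypothetical.

```python
from collections import defaultdict


def worst_case_link_loads(old_routes, new_routes):
    """Upper-bound the transient load on every link.

    old_routes / new_routes map flow -> {link: load}.
    Assumption (illustrative only): during the update each flow sits
    on either its old or its new route, independently of other flows.
    """
    loads = defaultdict(float)
    for flow in set(old_routes) | set(new_routes):
        old = old_routes.get(flow, {})
        new = new_routes.get(flow, {})
        for link in set(old) | set(new):
            # The flow contributes at most the larger of its two loads.
            loads[link] += max(old.get(link, 0.0), new.get(link, 0.0))
    return dict(loads)


def congested_links(old_routes, new_routes, capacity):
    """Return the links whose worst-case transient load exceeds capacity."""
    loads = worst_case_link_loads(old_routes, new_routes)
    return sorted(link for link, load in loads.items() if load > capacity[link])


# Two flows swap links. Both end states fit, but the transition may not.
old_routes = {"f1": {"A": 6.0}, "f2": {"B": 6.0}}
new_routes = {"f1": {"B": 6.0}, "f2": {"A": 6.0}}
capacity = {"A": 10.0, "B": 10.0}

print(congested_links(old_routes, new_routes, capacity))  # ['A', 'B']
```

In this toy case each link carries 6 units before and after the update, yet if one flow moves before the other, a link may briefly carry 12 units against a capacity of 10. A congestion-free update plan would have to route through intermediate states for which this check returns an empty list.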



      Published In

      SIGCOMM '13: Proceedings of the ACM SIGCOMM 2013 conference on SIGCOMM
      August 2013
      580 pages
      ISBN: 9781450320566
      DOI: 10.1145/2486001

      ACM SIGCOMM Computer Communication Review, Volume 43, Issue 4
      October 2013
      595 pages
      ISSN: 0146-4833
      DOI: 10.1145/2534169
      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Author Tags

      1. congestion
      2. data center network
      3. network update

      Qualifiers

      • Research-article

      Conference

      SIGCOMM'13: ACM SIGCOMM 2013 Conference
      August 12 - 16, 2013
      Hong Kong, China

      Acceptance Rates

      SIGCOMM '13 Paper Acceptance Rate 38 of 246 submissions, 15%;
      Overall Acceptance Rate 462 of 3,389 submissions, 14%
