poster

Don't drop, detour!

Published: 27 August 2013

Abstract

Today's data centers must support a range of workloads with different demands. While existing approaches handle routine traffic smoothly, ephemeral but intense hotspots cause excessive packet loss and severely degrade performance. This loss occurs even though the congestion is typically highly localized, with spare buffer capacity available at nearby switches.
We argue that switches should share buffer capacity to effectively handle this spot congestion without the latency or monetary hit of deploying large buffers at individual switches. We present detour-induced buffer sharing (DIBS), a mechanism that achieves a near lossless network without requiring additional buffers. Using DIBS, a congested switch detours packets randomly to neighboring switches to avoid dropping the packets. We implement DIBS in hardware, on software routers in a testbed, and in simulation, and we demonstrate that it reduces the 99th percentile of query completion time by 85%, with very little impact on background traffic.
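
The detour decision described above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes, purely for illustration, a hypothetical `Switch` class with one fixed-size queue per output port, that every port leads to a neighboring switch, that a full queue is the trigger for detouring, and that a packet is dropped only when no other port has room.

```python
import random
from collections import deque


class Switch:
    """Hypothetical switch model: one bounded FIFO queue per output port."""

    def __init__(self, num_ports, buffer_per_port, rng=None):
        self.queues = [deque() for _ in range(num_ports)]
        self.capacity = buffer_per_port  # assumed per-port buffer size, in packets
        self.rng = rng if rng is not None else random.Random()

    def has_room(self, port):
        return len(self.queues[port]) < self.capacity

    def enqueue(self, packet, out_port):
        """Queue a packet; return the port actually used, or None if dropped."""
        if self.has_room(out_port):
            # Normal case: the intended output port still has buffer space.
            self.queues[out_port].append(packet)
            return out_port
        # Congested case: detour to a randomly chosen other port with space
        # (assumption: each such port connects to a neighboring switch).
        candidates = [p for p in range(len(self.queues))
                      if p != out_port and self.has_room(p)]
        if not candidates:
            return None  # every buffer on this switch is full, so drop
        detour_port = self.rng.choice(candidates)
        self.queues[detour_port].append(packet)
        return detour_port


if __name__ == "__main__":
    sw = Switch(num_ports=4, buffer_per_port=2, rng=random.Random(1))
    for i in range(10):
        print(i, sw.enqueue(f"pkt{i}", out_port=0))
```

In this sketch a burst aimed at port 0 spills into the unused buffers of the other ports before anything is lost, which is the buffer-sharing effect the abstract describes. A detoured packet would still have to be routed back toward its destination by the neighbor, which the sketch does not model.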

Published In

ACM SIGCOMM Computer Communication Review, Volume 43, Issue 4
October 2013
595 pages
ISSN:0146-4833
DOI:10.1145/2534169

SIGCOMM '13: Proceedings of the ACM SIGCOMM 2013 conference on SIGCOMM
August 2013
580 pages
ISBN:9781450320566
DOI:10.1145/2486001

Permission to make digital or hard copies of part or all of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for third-party components of this work must be honored. For all other uses, contact the Owner/Author.

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 27 August 2013
Published in SIGCOMM-CCR Volume 43, Issue 4

Author Tags

  1. buffers
  2. data center
  3. packet loss

Qualifiers

  • Poster

Cited By

  • (2021) Highly Available Service Access Through Proactive Events Execution in LTE NFV. IEEE Transactions on Network and Service Management 18:3 (2531-2544). DOI: 10.1109/TNSM.2021.3103160. Online publication date: Sep-2021
  • (2018) FUSO. IEEE/ACM Transactions on Networking (TON) 26:3 (1376-1389). DOI: 10.1109/TNET.2018.2830414. Online publication date: 1-Jun-2018
  • (2017) PBUF: Sharing Buffer to Mitigate Flooding Attacks. 2017 IEEE 23rd International Conference on Parallel and Distributed Systems (ICPADS) (392-399). DOI: 10.1109/ICPADS.2017.00059. Online publication date: Dec-2017
  • (2015) SA-TCP: A novel approach to mitigate TCP Incast in data center networks. 2015 International Conference on Computing and Network Communications (CoCoNet) (420-426). DOI: 10.1109/CoCoNet.2015.7411220. Online publication date: Dec-2015
  • (2014) SA-TCP: A Novel Approach to Mitigate TCP Incast in Data Center Networks. 2014 Second International Conference on Advanced Cloud and Big Data (163-167). DOI: 10.1109/CBD.2014.55. Online publication date: Nov-2014
