Nothing Special   »   [go: up one dir, main page]

skip to main content
10.1145/3563647.3563648acmconferencesArticle/Chapter ViewAbstractPublication PagescommConference Proceedingsconference-collections
research-article

NetGVT: offloading global virtual time computation to programmable switches

Published: 19 October 2022 Publication History

Abstract

Distributed discrete-event simulation is an essential method for analyzing large-scale models, including weather forecast and network simulations. A distributed simulation often requires synchronizing state among the different parts of the model according to a global virtual time (GVT). However, existing approaches require multiple round-trip times to a server to compute a new GVT value. In this paper, we propose NetGVT, a system that computes GVT using programmable switches, thereby avoiding the round-trip latency of a server-based solution. In particular, our design is concerned with two main constraints of the switch programming model: the limited number of arithmetic and logic operations and the limited memory available on the device. We aggregate computations and unroll them across different pipeline stages in a hierarchical manner to address the former. Then, we adopt compression mechanisms to store a short representation of virtual clocks in the on-chip registers to tackle the memory limitations. We implemented a prototype of NetGVT and evaluated its performance with a synthetic lock-step simulation in a Tofino switch. Our results demonstrate that NetGVT outperforms techniques that do not rely on in-network computing by 40% in terms of distributed simulations completion time.

Supplementary Material

ZIP File (p16-parizotto.zip)
Supplemental material.

References

[1]
2022. NetGVT. https://github.com/RicardoParizotto/p4app-NetGVT.
[2]
Maleen Abeydeera and Daniel Sanchez. 2020. Chronos: Efficient speculative parallelism for accelerators. In Proceedings of the Twenty-Fifth International Conference on Architectural Support for Programming Languages and Operating Systems. 1247--1262.
[3]
Theophilus A. Benson. 2019. In-Network Compute: Considered Armed and Dangerous. In Proceedings of the Workshop on Hot Topics in Operating Systems (Bertinoro, Italy) (HotOS '19). ACM, New York, NY, USA, 216--224.
[4]
Pat Bosshart, Dan Daly, Glen Gibb, Martin Izzard, Nick McKeown, Jennifer Rexford, Cole Schlesinger, Dan Talayco, Amin Vahdat, George Varghese, and David Walker. 2014. P4: Programming Protocol-independent Packet Processors. SIGCOMM Comput. Commun. Rev. (2014).
[5]
Pat Bosshart, Glen Gibb, Hun-Seok Kim, George Varghese, Nick McKeown, Martin Izzard, Fernando Mujica, and Mark Horowitz. 2013. Forwarding metamorphosis: Fast programmable match-action processing in hardware for SDN. ACM SIGCOMM Computer Communication Review 43, 4 (2013), 99--110.
[6]
Joachim Breitner and Chris Smith. 2017. Lock-step simulation is child's play (experience report). Proceedings of the ACM on Programming Languages 1, ICFP (2017), 1--15.
[7]
Alisson V Brito, Harald Bucher, Helder Oliveira, Luis Feliphe S Costa, Oliver Sander, Elmar UK Melcher, and Juergen Becker. 2015. A distributed simulation platform using HLA for complex embedded systems design. In 2015 IEEE/ACM 19th International Symposium on Distributed Simulation and Real Time Applications (DS-RT). IEEE, 195--202.
[8]
K. Mani Chandy and Jayadev Misra. 1979. Distributed simulation: A case study in design and verification of distributed programs. IEEE Transactions on Software Engineering 5 (1979), 440--452.
[9]
Huynh Tu Dang, Pietro Bressana, Han Wang, Ki Suh Lee, Noa Zilberman, Hakim Weatherspoon, Marco Canini, Fernando Pedone, and Robert Soulé. 2020. P4xos: Consensus as a network service. IEEE/ACM Transactions on Networking (2020).
[10]
Ali Eker, Barry Williams, Kenneth Chiu, and Dmitry Ponomarev. 2019. Controlled asynchronous GVT: accelerating parallel discrete event simulation on many-core clusters. In Proceedings of the 48th International Conference on Parallel Processing. 1--10.
[11]
Richard M. Fujimoto and Maria Hybinette. 1997. Computing Global Virtual Time in Shared-Memory Multiprocessors. ACM Trans. Model. Comput. Simul. (1997), 22 pages.
[12]
Richard M Fujimoto, Asad Waqar Malik, A Park, et al. 2010. Parallel and distributed simulation in the cloud. SCS M&S Magazine 3 (2010), 1--10.
[13]
Yilong Geng, Shiyu Liu, Zi Yin, Ashish Naik, Balaji Prabhakar, Mendel Rosenblum, and Amin Vahdat. 2018. Exploiting a natural network effect for scalable, fine-grained clock synchronization. In 15th USENIX Symposium on Networked Systems Design and Implementation (NSDI 18). 81--94.
[14]
Elsa Gonsiorowski, Christopher Carothers, and Carl Tropper. 2012. Modeling large scale circuits using massively parallel discrete-event simulation. In 2012 IEEE 20th International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems. IEEE, 127--133.
[15]
Jo Erskine Hannay and Tom van den Berg. 2017. The NATO MSG-136 reference architecture for M&S as a service. In Proc. NATO Modelling and Simulation Group Symp. on M&S Technologies and Standards for Enabling Alliance Interoperability and Pervasive M&S Applications (STO-MP-MSG-149).
[16]
Frederik Hauser, Marco Häberle, Daniel Merling, Steffen Lindner, Vladimir Gurevich, Florian Zeiger, Reinhard Frank, and Michael Menth. 2021. A Survey on Data Plane Programming with P4: Fundamentals, Advances, and Applied Research. arXiv preprint arXiv:2101.10632 (2021).
[17]
Mary Hogan, Shir Landau-Feibish, Mina Tahmasbi Arashloo, Jennifer Rexford, and David Walker. 2022. Modular Switch Programming Under Resource Constraints. In 19th USENIX Symposium on Networked Systems Design and Implementation (NSDI 22). 193--207.
[18]
Zsolt István, David Sidler, Gustavo Alonso, and Marko Vukolic. 2016. Consensus in a box: Inexpensive coordination in hardware. In 13th USENIX Symposium on Networked Systems Design and Implementation (NSDI 16). 425--438.
[19]
David R Jefferson. 1985. Virtual time. ACM Transactions on Programming Languages and Systems (TOPLAS) (1985).
[20]
Theo Jepsen, Leandro Pacheco de Sousa, Masoud Moshref, Fernando Pedone, and Robert Soulé. 2018. Infinite Resources for Optimistic Concurrency Control. In Proceedings of the 2018 Morning Workshop on In-Network Computing (NetCompute '18).
[21]
Xin Jin, Xiaozhou Li, Haoyu Zhang, Nate Foster, Jeongkeun Lee, Robert Soulé, Changhoon Kim, and Ion Stoica. 2018. Netchain: Scale-free sub-rtt coordination. In 15th {USENIX} Symposium on Networked Systems Design and Implementation ({NSDI} 18). 35--49.
[22]
Xin Jin, Xiaozhou Li, Haoyu Zhang, Robert Soulé, Jeongkeun Lee, Nate Foster, Changhoon Kim, and Ion Stoica. 2017. NetCache: Balancing Key-Value Stores with Fast In-Network Caching. In Proceedings of the 26th Symposium on Operating Systems Principles (SOSP '17). ACM.
[23]
Edimar Roque Martello Junior, Acacia Terra, Ricardo Parizotto, and Braulio Mello. 2020. Closing the Gap Between Lookahead and Checkpointing to Provide Hybrid Synchronization. In Anais do XLVII Seminário Integrado de Software e Hardware. SBC, 104--115.
[24]
Pravein Govindan Kannan, Raj Joshi, and Mun Choon Chan. 2019. Precise Time-Synchronization in the Data-Plane Using Programmable Switching ASICs. In Proceedings of the 2019 ACM Symposium on SDN Research (San Jose, CA, USA) (SOSR '19). Association for Computing Machinery, New York, NY, USA, 8--20.
[25]
Daehyeok Kim, Ankush Jain, Zaoxing Liu, George Amvrosiadis, Damian Hazen, Bradley Settlemyer, and Vyas Sekar. 2020. Unleashing In-network Computing on Scientific Workloads. arXiv preprint arXiv:2009.02457 (2020).
[26]
Daehyeok Kim, Jacob Nelson, Dan RK Ports, Vyas Sekar, and Srinivasan Seshan. 2021. RedPlane: enabling fault-tolerant stateful in-switch applications. In Proceedings of the 2021 ACM SIGCOMM 2021 Conference. 223--244.
[27]
Marios Kogias and Edouard Bugnion. 2020. HovercRaft: Achieving Scalability and Fault-Tolerance for Microsecond-Scale Datacenter Services. In Proceedings of the Fifteenth European Conference on Computer Systems (Heraklion, Greece) (EuroSys '20). Association for Computing Machinery, New York, NY, USA, Article 25, 17 pages.
[28]
Carson Kuzniar, Miguel Neves, Vladimir Gurevich, and Israat Haque. 2022. IoT Device Fingerprinting on Commodity Switches. In NOMS 2022-2022 IEEE/IFIP Network Operations and Management Symposium. 1--9.
[29]
Ki Suh Lee, Han Wang, Vishal Shrivastav, and Hakim Weatherspoon. 2016. Globally synchronized time via datacenter networks. In Proceedings of the 2016 ACM SIGCOMM Conference. 454--467.
[30]
Hejing Li, Jialin Li, and Antoine Kaufmann. 2022. SimBricks: end-to-end network system evaluation with modular simulation. In Proceedings of the ACM SIGCOMM 2022 Conference. 380--396.
[31]
Jialin Li, Ellis Michael, and Dan RK Ports. 2017. Eris: Coordination-free consistent transactions using in-network concurrency control. In Proceedings of the 26th Symposium on Operating Systems Principles.
[32]
Friedemann Mattern. 1989. Virtual Time and Global States of Distributed Systems. In Proc. Workshop on Parallel and Distributed Algorithms, Cosnard M. et al. (Ed.).
[33]
F. Mattern. 1993. Efficient Algorithms for Distributed Snapshots and Global Virtual Time Approximation. J. Parallel and Distrib. Comput. 18, 4 (1993), 423--434.
[34]
Paul Mazzurana. 2021. Weather Research and Forecasting Model Workload Evaluation on IBM Cloud. https://www.ibm.com/cloud/blog/weather-research-and-forecasting-model-workload-evaluation-on-ibm-cloud
[35]
Alian Mohammad, Umur Darbaz, Gabor Dozsa, Stephan Diestelhorst, Daehoon Kim, and Nam Sung Kim. 2017. dist-gem5: Distributed simulation of computer clusters. In 2017 IEEE International Symposium on Performance Analysis of Systems and Software (ISPASS). IEEE, 153--162.
[36]
Eiji Morinaga, Eiji Arai, and Hidefumi Wakamatsu. 2012. A Basic Study on Highly Distributed Production Scheduling. In IFIP International Conference on Advances in Production Management Systems. Springer, 638--645.
[37]
Ranjit Noronha and Nael B Abu-Ghazaleh. 2002. Using programmable NICs for time-warp optimization. In Proceedings 16th International Parallel and Distributed Processing Symposium. IEEE, 8--pp.
[38]
Joshua Pelkey and George Riley. 2011. Distributed simulation with MPI in ns-3. In Proceedings of the 4th International ICST Conference on Simulation Tools and Techniques. 410--414.
[39]
Marcelo Pizzutti and Alberto E Schaeffer-Filho. 2019. Adaptive Multi-path Routing based on Hybrid Data and Control Plane Operation. In IEEE INFOCOM 2019-IEEE Conference on Computer Communications.
[40]
Amedeo Sapio, Ibrahim Abdelaziz, Abdulla Aldilaijan, Marco Canini, and Panos Kalnis. 2017. In-Network Computation is a Dumb Idea Whose Time Has Come. In Proceedings of the 16th ACM Workshop on Hot Topics in Networks (HotNets-XVI).
[41]
Juan L Sarli, Horacio P Leone, and Ma De los Milagros Gutiérrez. 2016. Ontology-based semantic model of supply chains for modeling and simulation in distributed environment. In 2016 Winter Simulation Conference (WSC). IEEE, 1182--1193.
[42]
Anirudh Sivaraman, Mihai Budiu, Alvin Cheung, Changhoon Kim, Steve Licking, George Varghese, Hari Balakrishnan, Mohammad Alizadeh, and Nick McKeown. 2015. Packet transactions: A programming model for data-plane algorithms at hardware speed. CoRR, vol. abs/1512.05023 (2015).
[43]
Haoyu Song. 2013. Protocol-Oblivious Forwarding: Unleash the Power of SDN through a Future-Proof Forwarding Plane. In Proceedings of the Second ACM SIGCOMM Workshop on Hot Topics in Software Defined Networking (Hong Kong, China) (HotSDN '13). Association for Computing Machinery, New York, NY, USA, 127--132.
[44]
Yuta Tokusashi, Huynh Tu Dang, Fernando Pedone, Robert Soulé, and Noa Zilberman. 2019. The case for in-network computing on demand. In Proceedings of the Fourteenth EuroSys Conference 2019. 1--16.
[45]
Barry Williams, Ali Eker, Kenneth Chiu, and Dmitry Ponomarev. 2021. High-performance pdes on manycore clusters. In Proceedings of the 2021 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation. 153--164.
[46]
Dingming Wu, Ang Chen, TS Eugene Ng, Guohui Wang, and Haiyong Wang. 2019. Accelerated service chaining on a single switch ASIC. In Proceedings of the 18th ACM Workshop on Hot Topics in Networks. 141--149.
[47]
Zhuolong Yu, Yiwen Zhang, Vladimir Braverman, Mosharaf Chowdhury, and Xin Jin. 2020. NetLock: Fast, Centralized Lock Management Using Programmable Switches. In SIGCOMM.

Cited By

View all
  • (2024)P4Hauler: An Accelerator-Aware In-Network Load Balancer for Applications Performance BoostingIEEE Transactions on Cloud Computing10.1109/TCC.2024.338965812:2(697-711)Online publication date: Apr-2024

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image ACM Conferences
SOSR '22: Proceedings of the Symposium on SDN Research
October 2022
101 pages
ISBN:9781450398923
DOI:10.1145/3563647
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 19 October 2022

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. data plane
  2. global virtual time
  3. in-network compute

Qualifiers

  • Research-article

Funding Sources

  • FAPERGS
  • FAPESP
  • CNPq
  • Coordenacao de Aperfeicoamento de Pessoal de Nivel Superior - Brasil (CAPES)

Conference

SOSR '22
Sponsor:
SOSR '22: The ACM SIGCOMM Symposium on SDN Research
October 19 - 20, 2022
Virtual Event

Acceptance Rates

Overall Acceptance Rate 7 of 43 submissions, 16%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)61
  • Downloads (Last 6 weeks)6
Reflects downloads up to 30 Sep 2024

Other Metrics

Citations

Cited By

View all
  • (2024)P4Hauler: An Accelerator-Aware In-Network Load Balancer for Applications Performance BoostingIEEE Transactions on Cloud Computing10.1109/TCC.2024.338965812:2(697-711)Online publication date: Apr-2024

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media