Abstract
The use of modern system area networking technologies [9,3] to construct tightly integrated clusters of workstations exposes two weaknesses of current operating systems. First, the low latency of current networks is often hidden from the application due to the high cost of interrupt handling. Second, network event handling during high load may result in serious performance degradation because all processor time is used for network event handling resulting in application starvation. This paper concerns the problems related to providing efficient and stable network event handling for clusters of workstations and network servers. By stable we mean that the throughput and response time of the system does not suffer when the workload offered to the system is increased beyond the maximum capacity of the system.
The main part of this work was carried out as a Ph.D. student at the Department of Computer Science, University of Copenhagen
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
G. Banga and P. Druschel. Measuring the capacity of a Web server. In USENIX Symposium on Internet Technologies and Systems Proceedings, pages 61–71, 1997.
G. Banga, P. Druschel, and J. C. Mogul. Resource containers: A new facility for resource management in server systems. In Proceedings of the Third Symposium on Operating Systems Design and Implementation, pages 45–58, February 1999.
N. J. Boden, D. Cohen, R. E. Felderman, A. E. Kulawik, C. L. Seitz, J. N. Seizovic, and Wen-King Su. Myrinet: A gigabit-per-second Local Area Network. IEEE Micro, 15(1):29–36, February 1995.
J. C. Brustoloni, E. Gabber, A. Silberschatz, and A. Singh. Signaled receiver processing. In Proceedings of the 2000 USENIX Annual Technical Conference, 2000.
S. Damianakis, Y. Chen, and E. Felten. Reducing waiting costs in user-level communication. In Proceedings of the 11th International Parallel Processing Symposium (IPPS-97), pages 381–387. IEEE Computer Society Press, April 1–5 1997.
Dolphin Interconnect Solutions. PCI-SCI cluster adapter specification, May 1996. Version 1.2. See also http://www.dolphinics.no.
P. Druschel and G. Banga. Lazy receiver processing (LRP): A network subsystem architecture for server systems. In The Second Symposium on Operating Systems Design and Implementation Proceedings, pages 261–276, October 1996.
J. S. Hansen, P. T. Koch, and E. Jul. A stream protocol implementation for an SCI-based cluster of workstations. In Proceedings of the 1999 Workshop on Cluster-Based Computing, pages 16–20, Rhodes, Greece, June 1999. ACM.
IEEE. IEEE Standard for Scalable Coherent Interface (SCI). IEEE, 1992. Standard 1596–1992.
Intel Corporation. Pentium Pro Family Developer’s Manual. Volume 3: Operating Systems Writer’s Guide. Order Number 242691.
K. G. Langendoen, J. Romein, R. A. F. Bhoedjang, and H. E. Bal. Integrating polling, interrupts, and thread management. In Proceedings of the 6th Symposium on the Frontiers of Massively Parallel Computation, pages 13–22. IEEE, 1996.
O. Maquelin, G. R. Gao, H. H. J. Hum, K. Theobald, and X. Tian. Polling watchdog: Combining polling and interrupts for efficient message handling. In Proceedings of the 23rd Annual International Symposium on Computer Architecure, pages 179–190, 1996.
J. C. Mogul and K. K. Ramakrishnan. Eliminating Receive Livelock in an Interrupt-Driven Kernel. ACM Transactions on Computer Systems, 15(3):217–252, August 1997.
D. M. Mosberger and T. Jin. httperf—a tool for measuring web server performance. In Proceedings of the 1998 Workshop on Internet Server Performance. ACM, 1998.
D. Perkovic and P. J. Keleher. Responsiveness without interrupts. In Proceedings of the 1999 International Conference on Supercomputing, June 1999.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2001 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Hansen, J.S., Jul, E. (2001). Prioritizing Network Event Handling in Clusters of Workstations. In: Sakellariou, R., Gurd, J., Freeman, L., Keane, J. (eds) Euro-Par 2001 Parallel Processing. Euro-Par 2001. Lecture Notes in Computer Science, vol 2150. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-44681-8_101
Download citation
DOI: https://doi.org/10.1007/3-540-44681-8_101
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-42495-6
Online ISBN: 978-3-540-44681-1
eBook Packages: Springer Book Archive