Nothing Special   »   [go: up one dir, main page]

skip to main content
article

Grid environment for on-line application monitoring and performance analysis

Published: 01 December 2004 Publication History

Abstract

This paper presents an application monitoring infrastructure developed within the CrossGrid project. The software is aimed at enabling performance measurements for the application developer and in this way facilitating the development of applications in the Grid environment. The application monitoring infrastructure is composed of a distributed monitoring system, the OCM-G, and a performance analysis tool called G-PM. The OCM-G is an on-line, grid-enabled, monitoring system, while G-PM is an advanced graphical tool which allows to evaluate and present the results of performance monitoring, to support optimization of the application execution. G-PM supports build-in standard metrics and user-defined metrics expressed in the Performance Measurement Specification Language (PMSL). Communication between the G-PM and the OCM-G is performed according to a well-defined protocol, OMIS (On-line Monitoring Interface Specification). In this paper, the architecture and features of the OCM-G and G-PM are described as well as an example of use of the monitoring infrastructure to visualize the status and communication in the application, to evaluate the performance, including discovering the reason of the performance flaw.

References

[1]
{1} B. Bali¿, M. Bubak, T. Szepieniec, R. Wismüller and M. Radecki, OCM-G - Grid Application Monitoring System: Towards the First Prototype, Proc. Cracow GridWorkshop 2002, Krakow, December 2002.
[2]
{2} B. Balis, M. Bubak, W. Funika, T. Szepieniec, R. Wismüller and M. Radecki, in: Monitoring Grid Applications with Grid-enabled OMIS Monitor, F. Fernadez Rivera, M. Bubak, A. Gomez Tato and R. Doallo, eds, Proc. First European Across Grids Conference, Santiago de Compostela, Spain, February 2003. LNCS 2970, Springer, 2004.
[3]
{3} M. Bubak, W. Funika, B. Balis and R. Wismüller, Concept For Grid Application Monitoring, in Proceedings of the PPAM 2001 Conference, vol. 2328 of Lecture Notes in Computer Science, Naleczow, Poland, September 2001. Springer, pp. 307- 314.
[4]
{4} M. Bubak, W. Funika and R. Wismüller, The CrossGrid Performance Analysis Tool for Interactive Grid Applications, Proc. EuroPVM/MPI 2002, Linz, September 2002.
[5]
{5} M. Bubak, W. Funika, B. Bali¿ and R. Wismüller, On-line OCM-based Tool Support for Parallel Applications, in: Annual Review of Scalable Computing, (vol. 3), (chapter 2), Y. Chung and Kwong, eds, World Scientific Publishing Co. and Singapore University Press, Singapore, 2001, pp. 32-62.
[6]
{6} The CrossGrid Project (IST-2001-32243): http://www.eucrossgrid.org.
[7]
{7} CrossGrid biomedical application's Web page, http://www.eucrossgrid.org/biomedical.htm.
[8]
{8} The DataGrid Project: http://www.eu-datagrid.org.
[9]
{9} I. Foster, C. Kesselman, G. Tsudik and S. Tuecke, A Security Architecture for Computational Grids, in: Proc. 5th ACM Conference on Computer and Communications Security Conference, 1998, pp. 83-92.
[10]
{10} The GrADS Project: http://hipersoft.cs.rice.edu/grads.
[11]
{11} The GridLab Project: http://www.gridlab.org.
[12]
{12} P. Kacsuk, Parallel Program Development and Execution in the Grid, Proc. PARELEC 2002, International conference on parallel computing in electrical engineering, Warsaw, 2002, pp. 131-138.
[13]
{13} T. Ludwig, R. Wismüller, V. Sunderam and A. Bode, OMIS - On-line Monitoring Interface Specification (Version 2.0). Shaker Verlag, Aachen, (vol. 9), LRR-TUM Research Report Series, 1997, http://wwwbode.in.tum.de/omis/.
[14]
{14} R-GMA: A Grid Information and Monitoring System, http://www.gridpp.ac.uk/abstracts/AllHands_RGMA.pdf.
[15]
{15} N. Podhorszki, Z. Balaton and G. Gombas, Monitoring Message-Passing Parallel Applications in the Grid with GRM and Mercury Monitor, in: Proc. 2nd European Across Grids Conference, Nicosia, CY, To appear in Lecture Notes in Computer Science, Springer Verlag, 28-30 Jan. 2004.
[16]
{16} N. Podhorszki and P. Kacsuk, Monitoring Message Passing Applications in the Grid with GRM and R-GMA Proceedings of EuroPVM/MPI'2003, Venice, Italy, 2003. Springer 2003.
[17]
{17} J.S. Vetter and D.A. Reed, Real-time Monitoring, Adaptive Control and Interactive Steering of Computational Grids, The International Journal of High Performance Computing Applications 14 (2000), 357-366.
[18]
{18} R. Wismüller, M. Bubak, W. Funika, T. Arod' and M. Kurdziel, Support for User-Defined Metrics in the Online Performance Analysis Tool G-PM, in: Proc. 2nd European Across Grids Conference, Nicosia, CY, 28-30 Jan. 2004, LNCS 3165, Springer Verlag.
[19]
{19} M. Gerndt, Performance Tools for the Grid: State of the Art and Future. APART White Paper. Research Report Series, Vol. 30. LRR, Technische Universität München. Shaker Verlag, 2004.
[20]
{20} S. Zanikolas and R. Sakellariou, A Taxonomy of Grid Monitoring Systems, Journal of Future Generation Computer Systems , to appear.
[21]
{21} European Center for Parallelism of Barcelona. Paraver. Web page, http://www.cepba.upc.es/paraver/.
[22]
{22} University of Illinois. Pablo Performance Analysis Environment: Data Analysis. Web page, http://www-pablo.cs.uiuc. edu/Project/Pablo/PabloDataAnalysis.htm
[23]
{23} B.P. Miller et al., The Paradyn Parallel Performance Measurement Tools, IEEE Computer 28(11) (Nov. 1995), 37-46, http://www.cs.wisc.edu/paradyn/papers/overview.ps.gz.
[24]
{24} J.R. Hollingsworth, B.P. Miller, M.J.R. Goncalves, Z. Xu, O. Naim and L. Zheng, MDL: A Language and Compiler for Dynamic Program Instrumentation, in Proc. International Conference on Parallel Architectures and Compilation Techniques, San Francisco, CA, USA, Nov. 1997, ftp://grilled.cs.wisc.edu/technical_papers/mdl.ps.gz.
[25]
{25} F. Wolf and B. Mohr, EARL - A Programmable and Extensible Toolkit for Analyzing Event Traces of Message Passing Programs, in: Proc. of the 7th International Conference on High- Performance Computing and Networking (HPCN 99), A. Hoekstra and B. Hertzberger, eds, Amsterdam, The Netherlands, 1999, pp. 503-512.
[26]
{26} F. Wolf and B. Mohr, Automatic Performance Analysis of MPI Applications Based on Event Traces, in: Euro-Par 2000 Parallel Processing, 6th International Euro-Par Conference, A. Bode, T. Ludwig, W. Karl and R. Wismller, eds, volume 1900 of Lecture Notes in Computer Science, Munich, Germany, Aug. 2000, pp. 123-132, Springer-Verlag.
[27]
{27} T. Fahringer, M. Gerndt, G. Riley and J.L. Träff, Knowledge Specification for Automatic Performance Analysis. Technical report, ESPRIT IV Working Group on Automatic Performance Analysis, Nov. 1999, Web page, http://www.fz-juelich.de/apart-1/reports/wp2-asl.ps.gz.
[28]
{28} T. Fahringer and C. Seragiotto, Modeling and Detecting Performance Problems for Distributed and Parallel Programs with JavaPSL, in: 9th IEEE High-Performance Networking and Computing Conference, SC'2001, Denver, CO, Nov. 2001.
[29]
{29} R. Wolski, N. Spring and J. Hayes, The Network Weather Service: A Distributed Resource Performance Forecasting Service for Metacomputing, Future Generation Computer Systems 15 (1999), 757-768.
[30]
{30} X. Zhang, J. Freschl and J. Schopf, Performance Study of Monitoring and Information Services for Distributed Systems, Proceedings of HPDC, August 2003. Web page, http://www-unix.mcs.anl.gov/schopf/Pubs/xuehaijeff-hpdc2003.pdf.
[31]
{31} W. Allcock, J. Bester, J. Bresnahan, I. Foster, J. Gawor, J.A. Insley, J.M. Link and M.E. Papka, GridMapper: A Tool for Visualization of the Behavior of Large-Scale Distributed Systems , Proceedings of High Performance Distributed Computing 11 (HPDC-11), Edinburgh, Scotland, 2002. Web page, http://www-unix.mcs.anl.gov/fl/publications/hpdc11- gridmapper.pdf.
[32]
{32} W. Funika, M. Bubak, M. Smetek and R. Wismüller, Monitoring System for Distributed Java Applications, in: Proc. International Conference on Computational Science 2004, Krakow, 6-9 June 2004, Part III, pp. 472-479, LNCS 3038 Springer, 2004.
[33]
{33} B. Balis, M. Bubak and M. Wegiel, Adaptation of Legacy Software to Grid Services, in: Proc. International Conference on Computational Science 2004, Krakow, 6-9 June 2004, Part III, LNCS 3038, Springer, 2004, pp. 26-33.

Cited By

View all
  • (2009)Monitoring of SLA parameters within VO for the SOA paradigmProceedings of the 8th international conference on Parallel processing and applied mathematics: Part II10.5555/1893586.1893602(115-124)Online publication date: 13-Sep-2009
  • (2009)Real-Time Performance Support for Complex Grid ApplicationsEuro-Par 2008 Workshops - Parallel Processing10.1007/978-3-642-00955-6_38(329-338)Online publication date: 9-Apr-2009
  • (2007)Service level agreement metrics for real-time application on the gridProceedings of the 7th international conference on Parallel processing and applied mathematics10.5555/1786194.1786287(798-806)Online publication date: 9-Sep-2007
  • Show More Cited By

Recommendations

Comments

Please enable JavaScript to view thecomments powered by Disqus.

Information & Contributors

Information

Published In

cover image Scientific Programming
Scientific Programming  Volume 12, Issue 4
AxGrids 2004
December 2004
70 pages

Publisher

IOS Press

Netherlands

Publication History

Published: 01 December 2004

Qualifiers

  • Article

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)0
  • Downloads (Last 6 weeks)0
Reflects downloads up to 16 Nov 2024

Other Metrics

Citations

Cited By

View all
  • (2009)Monitoring of SLA parameters within VO for the SOA paradigmProceedings of the 8th international conference on Parallel processing and applied mathematics: Part II10.5555/1893586.1893602(115-124)Online publication date: 13-Sep-2009
  • (2009)Real-Time Performance Support for Complex Grid ApplicationsEuro-Par 2008 Workshops - Parallel Processing10.1007/978-3-642-00955-6_38(329-338)Online publication date: 9-Apr-2009
  • (2007)Service level agreement metrics for real-time application on the gridProceedings of the 7th international conference on Parallel processing and applied mathematics10.5555/1786194.1786287(798-806)Online publication date: 9-Sep-2007
  • (2005)Towards distributed monitoring and performance analysis services in the k-wfgrid projectProceedings of the 6th international conference on Parallel Processing and Applied Mathematics10.1007/11752578_20(156-163)Online publication date: 11-Sep-2005

View Options

View options

Login options

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media