JP2003345620A

JP2003345620A - Process monitoring method for multi-node cluster system

Info

Publication number: JP2003345620A
Application number: JP2002150973A
Authority: JP
Inventors: Kazuya Kamimura; 和也上村
Original assignee: Hitachi Software Engineering Co Ltd
Current assignee: Hitachi Software Engineering Co Ltd
Priority date: 2002-05-24
Filing date: 2002-05-24
Publication date: 2003-12-05

Abstract

PROBLEM TO BE SOLVED: To constantly track the movement (state transitions) of cluster groups and to change the monitoring settings automatically in response.

SOLUTION: Monitored servers 2 to 4, including a standby server 4, are equipped with monitoring processes 6, 7, and 14 that detect abnormalities in the cluster groups, and with check processes 15, 16, and 17 that acquire check information B indicating whether a cluster group has started or stopped. A monitoring setting change process 11 on a monitoring server 1 receives the check information B from the monitored servers 2, 3, and 4 and compares it, for each of those servers, with a monitoring setting table 13 in a memory 12. When a discrepancy is found, it judges that a cluster group has switched between the monitored servers 2 and 3 and the standby server 4, and changes the monitoring settings that a monitoring process 5 uses to monitor those servers.

COPYRIGHT: (C)2004,JPO

Description

DETAILED DESCRIPTION OF THE INVENTION

[0001]

[Technical field of the invention] The present invention relates to a method for monitoring server processes in a multi-node cluster system.

[0002]

[Prior art] A known method for monitoring server processes is the one described in JP-A-2001-117789. It includes a program monitoring condition setting unit and lets program monitoring conditions be set through an external input means such as a graphical user interface or a definition file. First, the name of the program to be monitored can be set as an arbitrary character string. Second, the names of all or some of the processes that make up the program can be set by executable file name or command line name. Third, for each process name specified in the second setting, thresholds such as the lower and upper limits of the process count judged to indicate normal operation can be set. This enables wide-ranging process monitoring based on process counts.
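The three-part monitoring condition described above (program name, process names, and count thresholds) can be pictured with a minimal sketch. The class names, field names, and sample process names below are assumptions made for illustration; they do not come from JP-A-2001-117789.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ProcessRule:
    """Hypothetical rule for one process name (executable or command line name)."""
    process_name: str
    min_count: int  # lower limit of the count judged normal
    max_count: int  # upper limit of the count judged normal


@dataclass(frozen=True)
class ProgramCondition:
    """Hypothetical monitoring condition for one program."""
    program_name: str  # arbitrary label chosen by the operator
    rules: tuple       # ProcessRule entries for all or some of its processes


def violations(condition, running_names):
    """Return the process names whose count lies outside [min_count, max_count]."""
    bad = []
    for rule in condition.rules:
        count = running_names.count(rule.process_name)
        if not rule.min_count <= count <= rule.max_count:
            bad.append(rule.process_name)
    return bad


condition = ProgramCondition(
    "web-frontend",
    (ProcessRule("httpd", 2, 8), ProcessRule("logger", 1, 1)),
)
print(violations(condition, ["httpd", "httpd", "logger"]))   # []
print(violations(condition, ["httpd", "logger", "logger"]))  # ['httpd', 'logger']
```

An empty result corresponds to a program judged to be operating normally; any listed name is a process whose count fell outside its configured range.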

[0003]

[Problem to be solved by the invention] In recent years, to improve reliability and reduce cost, multi-node cluster systems have appeared in which a single standby server is shared by multiple systems, so that whichever system undergoes a process switchover to standby, it switches to the same standby server. With current process monitoring methods, however, it is difficult to monitor the processes on the standby server of such a multi-node cluster system at all times.

[0004] This is explained below with reference to FIG. 7.

[0005] The explanation uses as an example the three-node cluster system shown in FIG. 7, which consists of one monitoring server 1, two monitored servers 2 and 3, and one standby server 4.

[0006] The monitoring server 1 has a monitoring-server-side monitoring process (hereinafter simply "monitoring process") 5 that is always running, and uses this monitoring process 5 to monitor the monitored servers 2 and 3. The monitored server 2 runs a cluster group 8, a package (program) made up of one or more processes 10, and has a monitored-side monitoring process (hereinafter simply "monitoring process") 6 that monitors this cluster group 8 by monitoring its processes 10. The monitoring result of monitoring process 6 is sent to monitoring process 5 of the monitoring server 1 as monitored process information A. Likewise, the monitored server 3 runs a cluster group 9, a package (program) made up of one or more processes 10, and has a monitored-side monitoring process (hereinafter simply "monitoring process") 7 that monitors this cluster group 9 by monitoring its processes 10. The monitoring result of monitoring process 7 is also sent to monitoring process 5 of the monitoring server 1 as monitored process information A.

[0007] In monitoring process 5, monitoring settings, such as the process items to monitor and the range of process counts within which a cluster group is judged normal, are defined for each of the cluster groups 8 and 9 of the monitored servers 2 and 3. Monitoring process 5 monitors the state of cluster groups 8 and 9 by comparing the monitored process information A from the monitored servers 2 and 3 with the corresponding monitoring settings.

[0008] When an abnormality arises in the running cluster group 8, a failure occurs on the monitored server 2, and that server can no longer run cluster group 8, a switchover is performed so that cluster group 8 runs on the standby server 4; that is, a standby server switchover takes place. In that case the monitoring server 1 could conceivably monitor the standby server 4, but to do so, monitoring process 5 of the monitoring server 1 must be given a monitoring setting for cluster group 8 on the standby server 4.

[0009] To achieve this, one could provide a monitoring process on the standby server 4 as well and have it report its monitoring result to monitoring process 5 of the monitoring server 1. However, while the system is in its normal state, no cluster group is running on the standby server 4. In that situation, given the monitored process information that the standby server's monitoring process reports for this state and the monitoring setting for the standby server, monitoring process 5 of the monitoring server 1 would judge that the standby server 4 has failed.
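The false alarm described in this paragraph follows directly from comparing an idle server against a static setting. A minimal sketch, assuming a setting is a mapping from process name to an allowed count range; the function name and the process name `app_worker` are hypothetical:

```python
def judge_server(setting, reported_counts):
    """Return "normal" if every monitored process count lies in its range."""
    for process_name, (low, high) in setting.items():
        if not low <= reported_counts.get(process_name, 0) <= high:
            return "failure"
    return "normal"


# Hypothetical static setting: the standby is expected to run cluster group 8.
SETTING_FOR_GROUP_8 = {"app_worker": (1, 4)}

# Normal operation: nothing from cluster group 8 runs on the idle standby,
# so the static setting reports a failure even though the standby is healthy.
print(judge_server(SETTING_FOR_GROUP_8, {}))                 # failure

# After a switchover the same setting judges the standby correctly.
print(judge_server(SETTING_FOR_GROUP_8, {"app_worker": 2}))  # normal
```

The setting is only meaningful while the cluster group is actually running on the server, which is why it has to be applied and removed as the group moves.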

[0010] Consequently, when cluster group 8 is switched over from the monitored server 2 to the standby server, the setting that monitoring process 5 on the monitoring server 1 holds for the standby server 4 must be changed (updated) to the monitoring setting that corresponds to cluster group 8. The system configured as illustrated, however, cannot automatically and accurately change to the appropriate monitoring setting in response to a standby server switchover. For this reason, the standby server 4 is currently not monitored.

[0011] As a result, after a standby server switchover, the state of cluster group 8 running on the standby server 4 cannot be monitored, and even if an abnormality occurs in it, there is no way of knowing.

[0012] An object of the present invention is to solve this problem and to provide a process monitoring method that constantly tracks the movement (state transitions) of cluster groups, automatically changes the monitoring settings accordingly, and can thereby handle standby switchover in a multi-node cluster system.

[0013]

[Means for solving the problem] To achieve the above object, the present invention provides a process monitoring method for a multi-node cluster system consisting of a monitoring server, monitored servers whose cluster groups are monitored by the monitoring server, and a standby server common to the monitored servers. In this method, each monitored server and the standby server uses an always-running check process to check the operating system's process management table at fixed time intervals, and notifies a monitoring setting change process on the monitoring server of the check result, which indicates whether cluster groups have started or stopped. Based on the notified check results, the monitoring setting change process recognizes a switchover of a cluster group between a monitored server and the standby server. Based on this recognition, process monitoring is performed on whichever monitored servers and standby server have cluster groups running.

[0014] The monitoring server is provided with a monitoring setting table that shows the correspondence between the running cluster groups and the monitored servers and standby server, together with the monitoring setting that corresponds to each running cluster group. For each monitored server and for the standby server, the monitoring setting change process compares the check result notified by the check process with the monitoring setting table, recognizes a switchover of a cluster group between a monitored server and the standby server, and, on recognizing the switchover, updates the monitoring setting table.

[0015] Further, when the monitoring setting change process recognizes a cluster group switchover, the monitoring setting is changed for the monitored server or standby server on which the cluster group started or stopped.

[0016] These monitoring settings are prepared on the monitoring server in advance for each cluster group and for each combination of cluster groups. A monitored server on which a cluster group has started, or the standby server on which one or more cluster groups have started, is monitored by selecting and applying the corresponding monitoring setting.

[0017]

[Embodiments of the invention] Embodiments of the present invention are described in detail below with reference to the drawings. FIGS. 1 to 4 are system diagrams showing one embodiment of the process monitoring method for a multi-node cluster system according to the present invention, in which 11 is a monitoring setting change process, 12 is a memory, 13 is a monitoring setting table, 14 is a monitoring process, and 15 to 17 are check processes. Parts corresponding to those in FIG. 7 carry the same reference numerals. As in FIG. 7, a three-node cluster system is used as the example.

[0018] FIG. 1 shows process monitoring during normal operation.

[0019] In FIG. 1, the monitored servers 2 and 3 are provided with check processes 15 and 16, respectively, and the standby server 4 is provided with a monitoring process 14 and a check process 17 so that it too functions as a monitored server. The monitoring server 1 is provided with a monitoring setting change process 11 and a memory 12, and the memory 12 stores, in readable and writable form, a monitoring setting table 13 covering each of the monitored servers 2, 3, and 4.

[0020] Monitoring processes 6 and 7 on the monitored servers 2 and 3, where cluster groups 8 and 9 run, acquire the monitored process information A for their cluster groups 8 and 9 and report it to monitoring process 5 of the monitoring server 1. Monitoring process 5 holds a monitoring setting for each of cluster groups 8 and 9 (monitoring settings a and b, respectively) and a monitoring setting for the combination of cluster groups 8 and 9 (monitoring setting c). Monitoring setting a for cluster group 8 is selected and applied to the monitored server 2, and monitoring setting b for cluster group 9 is selected and applied to the monitored server 3. For each of the monitored servers 2 and 3, monitoring process 5 compares the reported monitored process information A with the corresponding monitoring setting and thereby monitors the running state of cluster groups 8 and 9. If, for example, monitoring process 5 recognizes from the monitored process information A of the monitored server 2 and monitoring setting a that the number of processes in cluster group 8 has gone outside the specified range, or that a process has terminated abnormally, it judges the monitored server 2 to be abnormal and executes the configured action (for example, sounding a patrol lamp or sending a warning message to the administrator).

[0021] The operation so far is almost the same as in the conventional system. In this embodiment, however, configuring the monitored servers 2, 3, and 4 and the monitoring server 1 as illustrated brings the standby server 4 in as a monitored server, allows the movement (state transitions) of cluster groups 8 and 9 to be monitored at all times, and lets monitoring process 5 automatically apply the correct monitoring setting to the affected monitored servers 2, 3, and 4 as the groups move. To make this possible, check processes 15 and 16 are added to the monitored servers 2 and 3; the standby server 4 is given monitoring process 14 and check process 17 so that it is configured as a monitored server; and the monitoring server 1 gains the monitoring setting change process 11 and the memory 12 holding the monitoring setting table 13.

[0022] The monitoring setting change process 11 of the monitoring server 1 and the check processes 15, 16, and 17 of the monitored servers 2, 3, and 4 are always running, and the processing shown in FIG. 5 is carried out between the monitoring setting change process 11 and the check processes 15, 16, and 17.

[0023] Specifically, each of check processes 15, 16, and 17 checks the process management table of the operating system of its monitored server 2, 3, or 4 at fixed time intervals (step 100) and reports the result, as running cluster group information B, to the monitoring setting change process 11 on the monitoring server 1 (step 101). The monitoring setting change process 11 compares this running cluster group information B with the contents of the monitoring setting table 13 in the memory 12 and detects whether cluster groups 8 and 9 have moved (that is, whether a state change (transition) has occurred, for example through a standby server switchover).
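Steps 100 and 101 can be sketched as a small polling loop. This is an illustration under stated assumptions, not the patented implementation: the process table is passed in as a plain list of process names rather than read from a real operating system, a cluster group counts as running when at least one of its processes is present, and the names `GROUP_PROCESSES`, `build_info_b`, and `check_loop` are hypothetical.

```python
import time

# Hypothetical mapping from cluster group number to the process names it consists of.
GROUP_PROCESSES = {8: {"db_main", "db_writer"}, 9: {"web_main"}}


def running_cluster_groups(process_table):
    """Step 100: derive the running cluster groups from a process table snapshot."""
    present = set(process_table)
    return sorted(g for g, names in GROUP_PROCESSES.items() if present & names)


def build_info_b(server_id, process_table):
    """Build the running cluster group information B for one server."""
    return {"server": server_id, "running_groups": running_cluster_groups(process_table)}


def check_loop(server_id, read_process_table, notify, interval_s, cycles):
    """Check and notify at a fixed interval (steps 100 and 101)."""
    for _ in range(cycles):
        notify(build_info_b(server_id, read_process_table()))
        time.sleep(interval_s)


received = []
check_loop(4, lambda: ["init", "db_main"], received.append, interval_s=0, cycles=2)
print(received[-1])  # {'server': 4, 'running_groups': [8]}
```

The `cycles` argument exists only so the demonstration terminates; the check process in the text runs continuously.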

[0024] The monitoring setting table 13 records which monitored server each of cluster groups 8 and 9 is running on and the monitoring setting that monitoring process 5 applies to that monitored server at that time. In the state shown in the figure, the table indicates the following. Cluster group 8 is running on the monitored server 2, and monitoring process 5 has selected and applied monitoring setting a of cluster group 8 to the monitored server 2. Cluster group 9 is running on the monitored server 3, and monitoring process 5 has selected and applied monitoring setting b of cluster group 9 to the monitored server 3. On the monitored server (standby server) 4, no standby server switchover has occurred, so no cluster group is running, and monitoring process 5 has therefore not selected or applied any monitoring setting for the monitored server 4. Accordingly, monitoring process 5 does not monitor the standby server 4.
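The normal-operation contents of the monitoring setting table described here can be written down as a small data structure. The layout below (a dictionary keyed by server number) is an assumption chosen for illustration; the text specifies only what information the table holds, not how it is stored.

```python
# Monitoring setting table in the normal state: server number -> running
# cluster groups and the monitoring setting currently applied (None = not monitored).
MONITORING_SETTING_TABLE = {
    2: {"running_groups": frozenset({8}), "setting": "a"},
    3: {"running_groups": frozenset({9}), "setting": "b"},
    4: {"running_groups": frozenset(), "setting": None},  # idle standby server
}


def monitored_servers(table):
    """Return the servers that currently have a monitoring setting applied."""
    return sorted(server for server, entry in table.items() if entry["setting"] is not None)


print(monitored_servers(MONITORING_SETTING_TABLE))  # [2, 3]
```

Server 4 is absent from the result because no setting is applied to it while it is idle, which matches the statement that monitoring process 5 does not monitor the standby server in this state.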

[0025] When running cluster group information B is reported by check processes 15, 16, and 17 of the monitored servers 2, 3, and 4, the monitoring setting change process 11 compares, for each monitored server, the received running cluster group information B with that server's entry in the monitoring setting table 13. In this case the running cluster group information B from each of the monitored servers 2, 3, and 4 matches the contents of the monitoring setting table 13, so each time the operation shown in FIG. 5 is carried out, the processing consisting of steps 200 and 201 in FIG. 6 is performed.

[0026] Suppose that, in the state shown in FIG. 1, a failure now occurs on the monitored server 2. A standby server switchover then occurs for the monitored server 2: cluster group 8 switches to the monitored server 4, which is the standby server, and starts there. FIG. 2 shows the system in this state, and the processing in this case is explained with reference to FIG. 6.

[0027] In FIGS. 2 and 6, as explained above, the monitoring setting change process 11 on the monitoring server 1 receives running cluster group information B from each of the monitored servers 2, 3, and 4 (step 200). As long as no standby server switchover occurs (step 201), it repeats steps 200 and 201 each time the processing of FIG. 5 runs at the fixed interval.

[0028] However, when a standby server switchover occurs only for the monitored server 2 as described above, check process 17 on the monitored server 4 finds the running cluster group 8 in the process management table of that server's operating system (step 100 in FIG. 5) and reports running cluster group information B to the monitoring setting change process 11 on the monitoring server 1 (step 101 in FIG. 5). At this point the monitoring setting table 13 held in the memory 12 of the monitoring server 1 has not been updated and still holds the contents shown in FIG. 1. On receiving the running cluster group information B from check process 17 of the monitored server 4 (step 200), the monitoring setting change process 11 compares it with the entry for the monitored server 4 in the monitoring setting table 13 of FIG. 1 (step 201). This information B indicates that only cluster group 8 has started on the monitored server 4, whereas the table entry for the monitored server 4 indicates that no cluster group is running. From this comparison (step 201), the monitoring setting change process 11 recognizes that the monitored server 4 has moved from having no running cluster group to having only cluster group 8 running (step 202), and the monitoring setting in monitoring process 5 is changed (step 205). In this change, monitoring process 5 selects and applies to the monitored server 4 the monitoring setting a that was used to monitor the monitored server 2 during normal operation. Monitoring process 5 thereby starts monitoring the monitored server 4.

[0029] After that, the monitoring setting change process 11 updates the monitoring setting table 13 in the memory 12 (step 206). This update changes the contents of the table to show that only cluster group 8 is running on the monitored server 4 and that the monitoring setting applied by monitoring process 5 to the monitored server 4 is monitoring setting a.

[0030] Meanwhile, on the monitored server 2, once cluster group 8 has switched over to the standby server, check process 15 on the monitored server 2 likewise reports to the monitoring setting change process 11 on the monitoring server 1 running cluster group information B indicating that no cluster group is running any longer (step 101 in FIG. 5). On receiving this notification (step 200), the monitoring setting change process 11 compares the information B with the entry for the monitored server 2 in the monitoring setting table 13 in the memory 12 (step 201), recognizes that no cluster group is running there (step 202), and has monitoring process 5 cancel monitoring setting a for the monitored server 2 and stop monitoring that server (step 203). The monitored server 2 is thereby excluded from monitoring.

[0031] From the running cluster group information B sent by check process 15 of the monitored server 2 and by check process 17 of the monitored server 4, the monitoring setting change process 11 can recognize the standby server switchover of cluster group 8 from the monitored server 2. On the basis of this recognition, monitoring process 5 can select and apply monitoring setting a to the monitored server 4 as described above.

[0032] Along with ending the monitoring of the monitored server 2 (step 203) and selecting and applying monitoring setting a to the monitored server 4 (step 205), the monitoring setting change process 11 updates the monitoring setting table 13 (step 206). This changes the table of FIG. 1 to match the new state of the system: cluster group 8 is recorded as running on the monitored server 4, and the monitoring setting applied by monitoring process 5 to the monitored server 4 is recorded as monitoring setting a, giving the contents shown in FIG. 2.
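The handling of one notification in steps 200 to 206 can be sketched as a single function that compares the report with the table, changes the setting, and then updates the table. The table layout, the function name `handle_info_b`, and the returned action strings are assumptions for illustration, and this sketch covers only servers running zero or one cluster group.

```python
# Hypothetical mapping from a single cluster group to its monitoring setting.
SINGLE_GROUP_SETTING = {8: "a", 9: "b"}


def normal_state_table():
    """Monitoring setting table for the normal state (before any switchover)."""
    return {
        2: {"running_groups": frozenset({8}), "setting": "a"},
        3: {"running_groups": frozenset({9}), "setting": "b"},
        4: {"running_groups": frozenset(), "setting": None},
    }


def handle_info_b(table, server, running_groups):
    """Process one notification of information B and return the action taken."""
    reported = frozenset(running_groups)              # step 200: receive B
    if reported == table[server]["running_groups"]:   # step 201: compare with table
        return "no change"
    # Step 202: a state transition has been recognized.
    if not reported:
        setting, action = None, "monitoring stopped"  # step 203: cancel the setting
    elif len(reported) == 1:
        (group,) = reported
        setting = SINGLE_GROUP_SETTING[group]         # step 205: select and apply
        action = f"setting {setting} applied"
    else:
        raise ValueError("combinations of groups are outside this sketch")
    table[server] = {"running_groups": reported, "setting": setting}  # step 206
    return action


table = normal_state_table()
print(handle_info_b(table, 4, [8]))  # setting a applied
print(handle_info_b(table, 2, []))   # monitoring stopped
print(handle_info_b(table, 3, [9]))  # no change
```

After the three calls the table holds the post-switchover state: cluster group 8 on server 4 with setting a, and no setting for server 2.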

[0033] When this processing is complete, monitoring process 5 of the monitoring server 1 monitors the monitored servers 3 and 4 in the same manner as above, and check processes 15, 16, and 17 of the monitored servers 2, 3, and 4 repeat the operation shown in FIG. 5 at fixed intervals.

[0034] In the state of FIG. 2 described above, another monitored server, here the monitored server 3, may also fail, so that cluster group 9 of the monitored server 3 switches over to the monitored server 4, the standby server. FIG. 3 shows this state, and the processing for it is explained below with reference to FIGS. 3 and 6.

[0035] In FIGS. 3 and 6, as explained above, the monitoring setting change process 11 on the monitoring server 1 receives running cluster group information B from each of the monitored servers 2, 3, and 4 (step 200). As long as no standby server switchover occurs (step 201), the system stays in the state shown in FIG. 2 and steps 200 and 201 are repeated each time the processing of FIG. 5 runs at the fixed interval.

[0036] In this state, when a standby server switchover occurs for the monitored server 3 as described above, check process 17 on the monitored server 4 finds the running cluster groups 8 and 9 in the process management table of that server's operating system (step 100 in FIG. 5), executes a combination check, and reports to the monitoring setting change process 11 on the monitoring server 1 running cluster group information B that reflects this combination check (step 101 in FIG. 5).

[0037] The combination check is called when two or more cluster groups are running. It checks each of the running cluster groups so that the monitoring setting corresponding to that combination of cluster groups (here, monitoring setting c) can be selected from the monitoring settings predefined on the monitoring server 1.
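One way to picture the selection that the combination check enables is a lookup keyed by the set of running cluster groups, so that the order in which groups are reported does not matter. This is a sketch under assumptions: the names `SINGLE_GROUP_SETTING`, `COMBINATION_SETTING`, and `select_setting` are hypothetical, and the text does not say how the predefined settings are stored.

```python
# Hypothetical predefined settings on the monitoring server.
SINGLE_GROUP_SETTING = {8: "a", 9: "b"}
COMBINATION_SETTING = {frozenset({8, 9}): "c"}


def select_setting(running_groups):
    """Pick the monitoring setting for the cluster groups running on one server."""
    groups = frozenset(running_groups)
    if not groups:
        return None  # nothing running: no setting is applied
    if len(groups) == 1:
        return SINGLE_GROUP_SETTING[next(iter(groups))]
    # Two or more groups: the combination decides the setting.
    return COMBINATION_SETTING[groups]


print(select_setting([8]))     # a
print(select_setting([9, 8]))  # c
print(select_setting([]))      # None
```

Using a `frozenset` as the key makes `[8, 9]` and `[9, 8]` resolve to the same predefined setting.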

[0038] At this point, monitoring setting table 13, managed in memory 12 of monitoring server 1, has not yet been changed and still holds the contents shown in FIG. 2. When monitoring setting change process 11 receives running cluster group information B from check process 17 of monitored server 4 (step 200), it compares that information with the entry for monitored server 4 in monitoring setting table 13 of FIG. 2 (step 201). Information B from check process 17 indicates that cluster groups 8 and 9 are running on monitored server 4, whereas the table entry for monitored server 4 indicates that only cluster group 8 is running. From this comparison, monitoring setting change process 11 recognizes that monitored server 4 has transitioned from a state in which only cluster group 8 is running to a state in which cluster groups 8 and 9 are running (step 202). Monitoring process 5 then executes the monitoring setting change process: the combination of two or more cluster groups is confirmed (step 204; here, the combination of the two cluster groups 8 and 9), and monitoring process 5 selects and applies monitoring setting c, which corresponds to that combination, for monitored server 4 (step 205). Monitoring process 5 thereby continues monitoring monitored server 4.
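The comparison in steps 200 to 202 amounts to a set comparison between the reported running groups and the stored table entry. The following sketch assumes a simple in-memory table mapping server numbers to sets of running groups; the function name `detect_transition` and the table layout are hypothetical.

```python
def detect_transition(table, server, reported_groups):
    """Compare information B with the stored table entry (step 201).

    Returns (previous, current) as frozensets when the set of running
    cluster groups changed (step 202), or None when nothing changed.
    """
    previous = frozenset(table.get(server, ()))
    current = frozenset(reported_groups)
    if previous == current:
        return None
    return previous, current

# Table contents corresponding to FIG. 2: group 9 on server 3, group 8 on server 4.
table_fig2 = {3: {9}, 4: {8}}

result = detect_transition(table_fig2, 4, [8, 9])
previous, current = result
print(sorted(previous), "->", sorted(current))  # [8] -> [8, 9]
```

An unchanged report returns `None`, which corresponds to the case where steps 200 and 201 simply repeat.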

[0039] Monitoring setting change process 11 then updates monitoring setting table 13 in memory 12 (step 203). This update changes the contents of monitoring setting table 13 to record that cluster groups 8 and 9 are running on monitored server 4 and that the monitoring setting used by monitoring process 5 for monitored server 4 is monitoring setting c.

[0040] Meanwhile, on monitored server 3, when cluster group 9 is switched over to the standby server, check process 16 on monitored server 3 likewise notifies monitoring setting change process 11 on monitoring server 1 of running cluster group information B indicating that cluster group 9 is no longer running there (step 101 in FIG. 5). On receiving this notification (step 200), monitoring setting change process 11 compares information B with the entry for monitored server 3 in monitoring setting table 13 in memory 12 (step 201) and recognizes that no cluster group is running on that server any longer (step 202). It then has monitoring process 5 release monitoring setting b for monitored server 3 and end monitoring of monitored server 3 (step 203). Monitored server 3 is thereby also excluded from monitoring.

[0041] In this case too, monitoring setting change process 11 can recognize the standby-server switchover of cluster group 9 from monitored server 3 by using both running cluster group information B from check process 17 of monitored server 4 and running cluster group information B from check process 16 of monitored server 3. Based on this recognition, monitoring process 5 may select and apply monitoring setting c for monitored server 4 as described above.
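Recognizing a switchover from two reports can be sketched as matching a group that disappeared from one server with the same group appearing on another. The function name `infer_moves` and the data layout are assumptions made for illustration; the specification states only that both notifications are used together.

```python
def infer_moves(table, reports):
    """Return (group, source_server, destination_server) for each cluster group
    that left one server and appeared on another, given the stored table and
    the latest reports from all servers."""
    lost = {}    # group -> server it disappeared from
    gained = {}  # group -> server it appeared on
    for server, groups in reports.items():
        before = set(table.get(server, ()))
        after = set(groups)
        for g in before - after:
            lost[g] = server
        for g in after - before:
            gained[g] = server
    return sorted((g, lost[g], gained[g]) for g in lost.keys() & gained.keys())

# Stored table as in FIG. 2, then reports from check processes 16 and 17.
table_fig2 = {3: {9}, 4: {8}}
reports = {3: [], 4: [8, 9]}
moves = infer_moves(table_fig2, reports)
print(moves)  # [(9, 3, 4)]
```

The same function also covers switchback: with a stored table of `{4: {8, 9}}` and reports `{2: [8], 4: [9]}`, it yields `[(8, 4, 2)]`.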

[0042] Along with the monitoring end process for monitored server 3 (step 203) and the selection and application of monitoring setting c for monitored server 4 (step 205), monitoring setting change process 11 updates monitoring setting table 13 (step 206). This update changes monitoring setting table 13 of FIG. 2 to match the new state of the system: cluster group 9 is also recorded as running on monitored server 4, and the monitoring setting used by monitoring process 5 for monitored server 4 is recorded as monitoring setting c, giving the contents shown in FIG. 3.
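The branching among the monitoring end process (step 203), the setting selection (steps 204 and 205), and the table update (step 206) can be sketched as one function applied to each incoming report. The names `PRESET_SETTINGS` and `apply_report`, the table layout, and the assumption that standby server 4 uses setting a while it runs only cluster group 8 are illustrative and are not taken from the specification.

```python
# Hypothetical preset settings keyed by the set of running cluster groups.
PRESET_SETTINGS = {frozenset({8}): "a", frozenset({9}): "b", frozenset({8, 9}): "c"}

def apply_report(table, server, reported_groups):
    """Update `table` in place for one server and return the action taken.

    `table` maps server -> (frozenset of running groups, setting name).
    """
    current = frozenset(reported_groups)
    previous = table.get(server, (frozenset(), None))[0]
    if current == previous:
        return "unchanged"                 # steps 200 and 201 simply repeat
    if not current:
        table.pop(server, None)            # step 203: end monitoring, then update table
        return "monitoring ended"
    setting = PRESET_SETTINGS[current]     # step 204 (combination) and step 205 (select)
    table[server] = (current, setting)     # step 206: update the table
    return f"setting {setting}"

# Table as in FIG. 2 (setting a for server 4 is an assumption).
table = {3: (frozenset({9}), "b"), 4: (frozenset({8}), "a")}
print(apply_report(table, 4, [8, 9]))  # setting c
print(apply_report(table, 3, []))      # monitoring ended
```

After both reports are applied, the table holds a single entry for server 4 with groups 8 and 9 and setting c, which matches the contents described for FIG. 3.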

[0043] Next, the operation is described for the case in which the failed monitored server recovers and a cluster group that was running on the standby server returns (switches back) to its original monitored server.

[0044] After the system has entered the state shown in FIG. 3, cluster group 8, running on monitored server 4 (the standby server), may return to its original monitored server 2 (hereinafter referred to as standby-server switchback). FIG. 4 shows this state, and the corresponding processing is described below with reference to FIGS. 4 and 6.

[0045] In FIGS. 4 and 6, as described above, monitoring setting change process 11 on monitoring server 1 receives running cluster group information B from each of monitored servers 2, 3, and 4 (step 200). As long as no state transition of cluster groups 8 and 9 occurs (step 201), the system remains in the state shown in FIG. 3, and steps 200 and 201 are repeated each time the processing of FIG. 5 runs at its regular interval.

[0046] In this state, when a standby-server switchback occurs in which cluster group 8 moves from monitored server 4 to monitored server 2 as described above, check process 17 on monitored server 4 finds only cluster group 9 running in the process management table of the operating system of monitored server 4 (step 100 in FIG. 5) and notifies monitoring setting change process 11 on monitoring server 1 of running cluster group information B to that effect (step 101 in FIG. 5). At this point, monitoring setting table 13 in memory 12 of monitoring server 1 has not yet been changed and still holds the contents shown in FIG. 3. When monitoring setting change process 11 receives information B from check process 17 of monitored server 4 (step 200), it compares that information with the entry for monitored server 4 in monitoring setting table 13 of FIG. 3 (step 201). Information B indicates that only cluster group 9 is running on monitored server 4, whereas the table entry for monitored server 4 indicates that cluster groups 8 and 9 are running. From this comparison, monitoring setting change process 11 recognizes that monitored server 4 has transitioned from a state in which cluster groups 8 and 9 are running to a state in which only cluster group 9 is running (step 202). Monitoring process 5 then executes the monitoring setting change process, selecting and applying for monitored server 4 monitoring setting b, which was used to monitor monitored server 3 during normal operation. Monitoring process 5 thereby continues monitoring monitored server 4 with monitoring setting b (step 205).

[0047] Monitoring setting change process 11 then updates monitoring setting table 13 in memory 12 (step 206). This update changes the contents of monitoring setting table 13 to record that only cluster group 9 is running on monitored server 4 and that the monitoring setting used by monitoring process 5 for monitored server 4 is monitoring setting b.

[0048] Meanwhile, on monitored server 2, when cluster group 8 switches back, check process 15 on monitored server 2 likewise notifies monitoring setting change process 11 on monitoring server 1 of running cluster group information B indicating that cluster group 8 is now running there (step 101 in FIG. 5). On receiving this notification (step 200), monitoring setting change process 11 compares information B with the entry for monitored server 2 in monitoring setting table 13 of FIG. 3 in memory 12 (step 201), recognizes that cluster group 8 is now running on that server (step 202), and has monitoring process 5 execute the monitoring setting change process (step 205). In this change process, monitoring process 5 selects and applies monitoring setting a, which is used to monitor monitored server 2 during normal operation. Monitoring process 5 thereby starts monitoring monitored server 2.

[0049] In this case too, monitoring setting change process 11 can recognize the standby-server switchback of cluster group 8 from monitored server 4 to monitored server 2 by using both running cluster group information B from check process 17 of monitored server 4 and running cluster group information B from check process 15 of monitored server 2. Based on this recognition, monitoring process 5 can select and apply monitoring setting b for monitored server 4 and monitoring setting a for monitored server 2, as described above.

[0050] Along with the monitoring start process for monitored server 2 (step 205) and the monitoring setting change for monitored server 4 (step 205), monitoring setting change process 11 updates monitoring setting table 13 (step 206). This update changes the contents of monitoring setting table 13 of FIG. 3 to match the new state of the system: cluster group 9 is recorded as running on monitored server 4 and cluster group 8 as running on monitored server 2, with monitoring setting b as the setting used by monitoring process 5 for monitored server 4 and monitoring setting a as the setting for monitored server 2, giving the contents shown in FIG. 4.

[0051] The same applies when, in the state shown in FIG. 4, cluster group 9 running on monitored server 4 switches back to monitored server 3. In that case, monitored server 4 changes from a state in which cluster group 9 is running to a state in which no cluster group is running, so monitoring of monitored server 4 by monitoring process 5 is released and the contents of monitoring setting table 13 in memory 12 become those shown in FIG. 1.
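The four states of FIGS. 1 to 4 can be summarized by deriving the table contents from where each cluster group is currently placed. The sketch below is illustrative: `table_for`, `PRESET_SETTINGS`, and the reduced table layout (server mapped to setting name only) are hypothetical, and the use of setting a for standby server 4 while it runs only cluster group 8 is an assumption made by analogy with the switchback case.

```python
from collections import defaultdict

# Hypothetical preset settings keyed by the set of running cluster groups.
PRESET_SETTINGS = {frozenset({8}): "a", frozenset({9}): "b", frozenset({8, 9}): "c"}

def table_for(placement):
    """Build the monitoring table implied by a placement of cluster groups.

    `placement` maps cluster group -> server on which it is running.
    Servers with no running group do not appear in the result.
    """
    by_server = defaultdict(set)
    for group, server in placement.items():
        by_server[server].add(group)
    return {s: PRESET_SETTINGS[frozenset(g)] for s, g in sorted(by_server.items())}

states = {
    "FIG. 1": {8: 2, 9: 3},  # normal operation
    "FIG. 2": {8: 4, 9: 3},  # group 8 switched over to standby server 4
    "FIG. 3": {8: 4, 9: 4},  # group 9 switched over as well
    "FIG. 4": {8: 2, 9: 4},  # group 8 switched back to server 2
}
for name, placement in states.items():
    print(name, table_for(placement))
```

Switching cluster group 9 back to server 3 from the FIG. 4 placement reproduces the FIG. 1 placement, so the derived table again contains only servers 2 and 3.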

[0052] As described above, in this embodiment, monitoring setting change process 11 on monitoring server 1 and monitoring setting table 13 in memory 12 make it possible to treat standby server 4 as a monitored server as well and to monitor continuously the movement (transitions) of cluster groups across these monitored servers. In response to that movement, the appropriate monitoring setting for each monitored server can be selected and applied accurately and automatically, so cluster groups are monitored correctly even when a standby-server switchover occurs.

[0053] The embodiment above describes a system with two monitored servers, one standby server, and one monitoring server, but the present invention is not limited to this configuration and naturally applies to systems with any number of each kind of server. In that case, of course, monitoring settings are prepared in advance not only for the cluster group on each monitored server but also for all, or some, of the combinations of these cluster groups that can actually occur, and are made selectable by the monitoring process on the monitoring server.
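For a larger system, the combinations for which monitoring settings would need to be prepared can be enumerated mechanically. The sketch below is a hedged illustration: the group names `G1` to `G3`, the function `feasible_combinations`, and the optional `max_per_server` limit (standing in for whatever makes a combination feasible in practice) are all hypothetical.

```python
from itertools import combinations

def feasible_combinations(groups, max_per_server=None):
    """List every non-empty combination of cluster groups, smallest first.

    `max_per_server` optionally caps how many groups may run on one server,
    as one simple way to exclude combinations that cannot occur.
    """
    groups = sorted(groups)
    limit = len(groups) if max_per_server is None else min(max_per_server, len(groups))
    return [frozenset(c) for r in range(1, limit + 1) for c in combinations(groups, r)]

combos = feasible_combinations(["G1", "G2", "G3"])
print(len(combos))  # 7

limited = feasible_combinations(["G1", "G2", "G3"], max_per_server=2)
print(len(limited))  # 6
```

Because the number of combinations grows as 2^n - 1 for n cluster groups, restricting the list to combinations that can actually occur keeps the set of prepared settings manageable.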

[0054] In the embodiment above, monitoring process 5 on monitoring server 1 selects and applies the monitoring setting for a monitored server (step 205 in FIG. 6) or performs the monitoring end process (step 203 in FIG. 6) based on the recognition of cluster group movement (step 202 in FIG. 6) that monitoring setting change process 11 derives from running cluster group information B from monitored servers 2, 3, and 4. Alternatively, when monitoring setting change process 11 recognizes cluster group movement from information B (step 202 in FIG. 6), it may first update monitoring setting table 13 in memory 12, after which monitoring process 5 consults monitoring setting table 13 and then selects and applies the monitoring setting or performs the monitoring end process. In this case, monitoring process 5 can recognize cluster group movement (standby-server switchover or switchback) from monitoring target process information A received from at least one of the monitored servers. Having recognized the movement, monitoring process 5 consults monitoring setting table 13 as updated by monitoring setting change process 11 and can then select and apply the monitoring setting or perform the monitoring end process.
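The alternative ordering described here, in which the table is updated first and the monitoring process then reads it, can be sketched with two separate functions sharing one table. All names (`change_process_update`, `monitoring_process_sync`, `PRESET_SETTINGS`) and the reduced table layout are hypothetical, and the detection of movement via information A is not modeled; the sketch only shows the order of operations.

```python
# Hypothetical preset settings keyed by the set of running cluster groups.
PRESET_SETTINGS = {frozenset({8}): "a", frozenset({9}): "b", frozenset({8, 9}): "c"}

def change_process_update(table, server, reported_groups):
    """Setting change process: on recognizing movement, update the table first."""
    groups = frozenset(reported_groups)
    if groups:
        table[server] = PRESET_SETTINGS[groups]
    else:
        table.pop(server, None)

def monitoring_process_sync(table, active, server):
    """Monitoring process: after noticing movement, consult the table and apply it."""
    setting = table.get(server)
    if setting is None:
        active.pop(server, None)   # monitoring end process
    else:
        active[server] = setting   # select and apply the setting
    return setting

table = {3: "b", 4: "a"}   # shared table, as in FIG. 2 (setting a for server 4 is assumed)
active = dict(table)       # settings currently applied by the monitoring process

change_process_update(table, 4, [8, 9])
change_process_update(table, 3, [])
monitoring_process_sync(table, active, 4)
monitoring_process_sync(table, active, 3)
print(active)  # {4: 'c'}
```

Separating the two functions keeps the table as the single source of truth, so the monitoring process never has to interpret information B itself.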

[0055]

[Effects of the Invention] As described above, the present invention makes it possible, in a multi-node cluster system, to monitor processes on the standby server as well. Monitoring settings for the monitored servers, including the standby server, are changed correctly and automatically as cluster groups move, so appropriate process monitoring is maintained at all times.

[0056] In addition, by creating monitoring settings for the combinations of cluster groups that may run simultaneously on the standby server, monitoring settings can be applied and changed for combinations of cluster groups regardless of the number of servers making up the cluster system, which improves versatility and convenience.

[Brief description of the drawings]

FIG. 1 is a system diagram showing the process monitoring configuration in the normal state in an embodiment of the process monitoring method for a multi-node cluster system according to the present invention.

FIG. 2 is a system diagram showing the process monitoring configuration after a standby-server switchover caused by a failure in only one monitored server in the state shown in FIG. 1.

FIG. 3 is a system diagram showing the process monitoring configuration after a further standby-server switchover occurring on another monitored server in the state shown in FIG. 2.

FIG. 4 is a system diagram showing the process monitoring configuration when a cluster group on the standby server has undergone standby-server switchback from the state shown in FIG. 3.

FIG. 5 is a flowchart showing a specific example of the processing of check process 7 on a monitored server in FIGS. 1 to 4.

FIG. 6 is a flowchart showing a specific example of the monitoring setting change processing in the embodiment shown in FIGS. 1 to 4.

FIG. 7 is a system diagram showing an example of a conventional process monitoring method for a multi-node cluster system.

[Explanation of symbols]

1: monitoring server
2, 3: monitored servers
4: monitored (standby) server
5: monitoring process on the monitoring server
6, 7: monitoring processes on the monitored servers
8, 9: cluster groups
10: monitored process
11: monitoring setting change process
12: memory
13: monitoring setting table
14: monitoring process on the monitored server
15 to 17: check processes

Claims

[Claims]

1. A process monitoring method for a multi-node cluster system comprising a monitoring server, monitored servers whose cluster groups are monitored by the monitoring server, and a standby server common to the monitored servers, wherein each of the monitored servers and the standby server checks the process management table of its operating system at regular time intervals by means of a constantly running check process and notifies a monitoring setting change process on the monitoring server of the check result, which indicates the start or stop of a cluster group; the monitoring setting change process receives the notification of the check result and recognizes switching of a cluster group between a monitored server and the standby server; and, based on the recognition by the monitoring setting change process, the monitoring server performs process monitoring of the monitored server and the standby server on which a cluster group is running.

2. The process monitoring method for a multi-node cluster system according to claim 1, wherein the monitoring server is provided with a monitoring setting table that stores the correspondence between running cluster groups and the monitored servers and standby server, together with the monitoring setting corresponding to each running cluster group; for each of the monitored servers and the standby server, the monitoring setting change process compares the monitoring setting table with the check result notified from the check process, thereby recognizing switching of a cluster group between a monitored server and the standby server; and the monitoring setting table is changed upon recognition of the switching.

3. The process monitoring method for a multi-node cluster system according to claim 1 or 2, wherein, upon recognition of the switching of a cluster group by the monitoring setting change process, monitoring setting change processing is performed for the monitored server or the standby server on which a cluster group has started or stopped.

4. The process monitoring method for a multi-node cluster system according to claim 3, wherein the monitoring settings are provided in the monitoring server in advance for each cluster group and for each combination of cluster groups, and the standby server on which cluster groups have started is monitored by selecting and setting the corresponding monitoring setting.

5. A multi-node cluster system comprising a monitoring server, monitored servers whose cluster groups are monitored by the monitoring server, and a standby server common to the monitored servers, wherein the system executes the process monitoring method for a multi-node cluster system according to any one of the preceding claims.