JP2001101149A

JP2001101149A - Distributed parallel data processor, recording medium recording distributed parallel data processing program and distributed parallel data processing system

Info

Publication number: JP2001101149A
Application number: JP27954499A
Authority: JP
Inventors: Hidenori Kawai; 英紀河合; Toru Akamine; 亨赤峯
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 1999-09-30
Filing date: 1999-09-30
Publication date: 2001-04-13

Abstract

PROBLEM TO BE SOLVED: To maximize the throughput of the whole system by uniforming the loads of respective computers in the parallel data processing for mainly executing reading processing from a secondary storage device and to prevent the sharp decline of performance by uniforming a subrogated load by remaining computers even when a part of computers are stopped. SOLUTION: Large-scale data are divided, distributed and stored in respective computers, redundantly storing divided data so as not to have the copies of the divided data overlapped to execute processing for the same divided data by different computers. Processing time including reading time from the secondary storage device is measured, a distribution ration is calculated so that the number of requests for the processing of a computer having long processing time is reduced and the number of requests for the processing of a computer having short processing time is increased to subrogate the processing. When a part of the computers is stopped, the distribution ratio of the stopped computer is fixed to 0 and the distribution ratio is recalculated, so that subrogated processing is prevented from being concentrated to a part of the computers by distributing the loads.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、利用者からの処理
要求を複数の計算機が独立に処理し、各計算機の処理結
果を統合して利用者に返答する分散並列型データ処理装
置及び分散並列型データ処理プログラムを記録した記録
媒体並びに分散並列型データ処理システム（以下、単に
分散並列型データ処理装置と記載する。）に関し、特に
各計算機が持つ二次記憶装置の入出力処理の時間を基に
各計算機の負荷を均一化する分散並列型データ処理装置
に関する。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a distributed parallel data processing device and a distributed parallel data processing system, in which a plurality of computers independently process a processing request from a user, and process results of the respective computers are integrated and returned to the user. Storage medium storing a distributed data processing program and a distributed parallel data processing system (hereinafter, simply referred to as a distributed parallel data processing device), particularly based on the time of input / output processing of a secondary storage device of each computer. The present invention also relates to a distributed parallel data processing device for equalizing the load on each computer.

【０００２】[0002]

【従来の技術】従来、計算機によって大規模なデータに
対する検索や集計、加工などの処理を行う場合、二次記
憶装置からのデータの読み込みや書き込みなどの入出力
処理がボトルネックになる。このため、データを分割し
て複数の二次記憶装置に分散格納し、並列に入出力処理
を行う方法が一般に行われている。2. Description of the Related Art Conventionally, when a computer performs processing such as search, aggregation, and processing of large-scale data, input / output processing such as reading and writing of data from a secondary storage device becomes a bottleneck. For this reason, a method of dividing data, distributing and storing the divided data in a plurality of secondary storage devices, and performing input / output processing in parallel is generally performed.

【０００３】複数の独立した計算機から構成される分散
並列型データ処理装置においては、検索や集計、加工な
ど、利用者からの処理要求に対し、個々の計算機の二次
記憶装置に格納されている分割データをそれぞれ並列か
つ独立に読み出すことによって、処理の高速化を行うこ
とができる。[0003] In a distributed parallel data processing device composed of a plurality of independent computers, processing requests from a user, such as retrieval, aggregation, and processing, are stored in a secondary storage device of each computer. The processing can be speeded up by reading the divided data in parallel and independently.

【０００４】大規模なデータを対象として並列に処理を
行うアプリケーションとしては、膨大な文書群から、利
用者が指定したキーワードを含む文書の一覧を返す全文
検索システムや、データベース管理システム上のオンラ
イントランザクション処理サーバや、音声・動画データ
を供給するマルチメディアストリームサーバなどがあ
る。[0004] Examples of an application that performs parallel processing on large-scale data include a full-text search system that returns a list of documents including a keyword specified by a user from a huge document group, and an online transaction on a database management system. There are a processing server and a multimedia stream server that supplies audio / video data.

【０００５】例えば、大規模なデータベースから、指定
された条件を満たすレコードを抽出する検索処理では、
データベースを分割して分散格納することによって処理
を容易に並列化できる。この場合の処理の流れは以下の
ようになる。利用者は、受付役の計算機に条件式を送信
する。受付役の計算機は条件式から、処理対象となる分
割データを指定した部分処理を作成し、該当分割データ
を持つすべての計算機に送信する。各計算機は自分の持
っている分割データを独立に読み込んで、条件を満たす
レコードを部分結果として受付役の計算機に送信する。
受付役の計算機はすべての部分結果を収集、統合、編集
し、利用者に返答する。For example, in a search process for extracting a record satisfying specified conditions from a large-scale database,
Processing can be easily parallelized by dividing and storing the database in a distributed manner. The processing flow in this case is as follows. The user sends the conditional expression to the computer of the receiving role. The computer as the receiving unit creates a partial process specifying the divided data to be processed from the conditional expression, and transmits the partial process to all the computers having the corresponding divided data. Each computer reads its own divided data independently, and transmits a record that satisfies the condition to the accepting computer as a partial result.
The accepting computer collects, integrates, edits all partial results, and replies to the user.

【０００６】このように、元の文書群が大規模であって
も、各計算機が独立、並列に処理を行うことができるた
め、高速な応答が可能となる。さらに、あらかじめ各条
件について、該当するレコードをインデックス化した補
助ファイルを作成しておくことによって、不要なデータ
の読み込みを省略し、検索処理をさらに高速化すること
が可能である。As described above, even if the original document group is large-scale, each computer can perform processing independently and in parallel, so that high-speed response is possible. Furthermore, by creating an auxiliary file in which records corresponding to each condition are indexed in advance, unnecessary data reading can be omitted, and the search processing can be further speeded up.

【０００７】このような分散並列型データ処理装置で
は、各計算機間の負荷が不均一であると、常に処理中の
忙しい計算機が存在する一方で、処理待ちの遊休時間を
過ごす暇な計算機が出現して全体の処理効率を低下させ
てしまう問題がある。In such a distributed parallel type data processing apparatus, if the load among the computers is not uniform, there are always busy computers during the processing, and free computers that spend idle time waiting for the processing appear. Therefore, there is a problem that the overall processing efficiency is reduced.

【０００８】そのため、各計算機の負荷を均一化させる
ために、古くから数多くの提案がなされている。For this reason, many proposals have been made since ancient times to equalize the load on each computer.

【０００９】例えば第1の従来技術として、特許番号第2
638065号に示される特許掲載公報に記載された特許のよ
うに、CPUの利用率に着目した負荷分散方法が挙げられ
る。この方法によれば、まずあらかじめCPUが実行可能
な全てのタスクについて、CPU動作とディスクI/O動作の
割合を求めておく。次にシステムの動作中に各CPUの利
用率を測定し、利用率の高いCPUにはディスクI/O動作の
割合の高いタスクを依頼し、利用率の低いCPUにはCPU動
作の割合が高いタスクを依頼することによって、負荷を
均一化している。For example, as a first prior art, Patent No. 2
As in the patent described in Japanese Patent Publication No. 638065, there is a load distribution method that focuses on CPU utilization. According to this method, first, for all the tasks that can be executed by the CPU, the ratio between the CPU operation and the disk I / O operation is obtained in advance. Next, during the operation of the system, measure the utilization of each CPU, request a task with a high percentage of disk I / O operation to a CPU with a high utilization, and a high percentage of CPU operation to a CPU with a low utilization By requesting tasks, the load is equalized.

【００１０】その他に第2の従来技術として、例えば特
開平6-271227号公報に示された発明のように、分散格納
する分割データ量を均等化することによって負荷分散を
行う方法が挙げられる。この方法によれば、データベー
スを各計算機に分散格納する際に、各項目に対するレコ
ード数が均等となるように分割しておく。そのため、検
索処理を行う場合、各計算機が自分の所有している分割
データを全て読み込んで条件に該当しているレコードを
抽出するため、各計算機の負荷を均一化できる。As another second conventional technique, there is a method of distributing load by equalizing the amount of divided data to be distributed and stored, as in the invention disclosed in Japanese Patent Application Laid-Open No. 6-271227, for example. According to this method, when the database is distributed and stored in each computer, the database is divided so that the number of records for each item is equal. Therefore, when performing a search process, each computer reads all the divided data owned by itself and extracts records that meet the conditions, so that the load on each computer can be equalized.

【００１１】また、分散並列型データ処理装置において
は、各分割データの複製を作成し冗長格納することによ
って、故障などの理由により一部の計算機が停止した状
態でも、複製の分割データを所有する他の計算機が処理
を肩代わりすることによって、サービスを無停止で継続
できることも大きな特徴である。In the distributed parallel data processing apparatus, a copy of each divided data is created and redundantly stored, so that the duplicated data is owned even when some computers are stopped due to a failure or the like. Another major feature is that services can be continued without interruption by other computers taking over the processing.

【００１２】例えば第3の従来技術として、分散並列型
データ処理装置で無停止サービスを実現させる方法につ
いて、ビデオストリームサーバに関する例が1998年3
月、アイビーエムジャーナル・オブ・リサーチ・アン
ド・ディベロプメント、第42巻、第2号、187〜189頁(IB
M J. Res.Develop., Vol.42, No.2, p187-189, March,1
998)に記載されている。この文献によれば、あらかじめ
個々の計算機に格納されている分割データをさらに細か
く分割し、細分割データの複製を異なる計算機に冗長格
納させておく。これにより、ある計算機または二次記憶
装置に障害が発生しても、複製の細分割データを持つ計
算機が処理を肩代わりすることができるため、サービス
を無停止で継続することができる。For example, as a third prior art, a method of realizing a non-stop service in a distributed parallel data processing device is described in an example related to a video stream server in March 1998.
Moon, IBM Journal of Research and Development, Vol. 42, No. 2, pp. 187-189 (IB
M J. Res.Develop., Vol.42, No.2, p187-189, March, 1
998). According to this document, the divided data stored in advance in each computer is further finely divided, and a copy of the subdivided data is redundantly stored in a different computer. As a result, even if a failure occurs in a certain computer or a secondary storage device, the computer having the duplicate subdivided data can take over the processing, so that the service can be continued without interruption.

【００１３】またこの時、1台の計算機に格納されてい
た原本の分割データは、細分割して複数の計算機に分散
して格納されているため、肩代わり処理による負荷の増
加も複数の計算機の間で分散させることが可能である。
特に、ビデオストリームサーバでは、ファイルを先頭か
ら最後まで順番に読み出す処理が中心であるため、均等
な容量で細分割していれば、肩代わり処理による負荷も
均一である。At this time, since the original divided data stored in one computer is subdivided and stored in a plurality of computers, the increase in the load due to the shoulder replacement processing increases the load of the plurality of computers. It is possible to distribute among them.
In particular, in a video stream server, the process of reading a file in order from the beginning to the end is the center. Therefore, if the file is subdivided with an equal capacity, the load due to the shoulder replacement process is also uniform.

【００１４】[0014]

【発明が解決しようとする課題】しかしながら、上述し
た従来の分散並列型データ処理装置において、大規模デ
ータに対する検索や集計などの処理を行う場合、下記に
挙げる問題点が存在する。However, in the above-mentioned conventional distributed parallel data processing apparatus, the following problems exist when processing such as retrieval or totalization of large-scale data is performed.

【００１５】第1の問題点は、大規模データに対する処
理においては、第1の従来技術のように、CPUの利用率に
従って負荷分散制御を行っても、各計算機の処理時間は
必ずしも均一化されないことである。The first problem is that, in the processing of large-scale data, the processing time of each computer is not necessarily equalized even if the load distribution control is performed according to the CPU utilization rate as in the first related art. That is.

【００１６】その理由は、大規模データに対する処理時
間の大半はCPU処理時間よりもむしろ、二次記憶装置か
らの読み出し時間に依存しているからである。The reason is that most of the processing time for large-scale data depends on the reading time from the secondary storage device rather than the CPU processing time.

【００１７】第2の問題点は、検索処理の高速化のため
にデータをインデックス化している場合などでは、第2
の従来技術のように分散格納するデータ量を均等化して
も、必ずしも各計算機の処理時間が均一化されないこと
である。The second problem is that when data is indexed in order to speed up the search processing, the second problem occurs.
Even if the amount of data to be distributed and stored is equalized as in the related art, the processing time of each computer is not necessarily equalized.

【００１８】その理由は、データがインデックス化され
ている場合、条件に該当するデータの個数によって読み
出すデータ量が異なるため、各計算機の処理時間にばら
つきが生じてしまうからである。The reason is that when the data is indexed, the amount of data to be read differs depending on the number of data that satisfies the condition, so that the processing time of each computer varies.

【００１９】また仮に、インデックス中の各条件に該当
するレコードの個数が各計算機で等しくなるように分散
格納しようとすると、同一のレコードが異なるインデッ
クスに登録される場合が生ずる。そのため、複合条件の
処理を行う際に各計算機間で大量の部分結果をやり取り
する必要があり、読み出すデータ量は均一化されても、
通信処理が増大し、逆に並列化の効率が大幅に減少して
しまう。If the number of records corresponding to each condition in the index is distributed and stored so as to be equal in each computer, the same record may be registered in a different index. Therefore, when performing complex conditions, it is necessary to exchange a large amount of partial results between computers, and even if the amount of data to be read is made uniform,
The communication processing increases, and conversely, the efficiency of parallelization greatly decreases.

【００２０】第3の問題点は、各データに対するアクセ
ス頻度の分布が変化してしまうと、第3の従来技術のよ
うにデータを分散格納するだけでは対応が困難なことで
ある。その理由は、アクセス頻度の分布が変化すると、
第3の従来技術では、データの分割および分散格納方法
を変える必要があるからである。A third problem is that it is difficult to cope with a change in the distribution of access frequencies to each data only by distributing and storing the data as in the third related art. The reason is that if the distribution of access frequency changes,
This is because in the third conventional technique, it is necessary to change the method of dividing and distributing data.

【００２１】第4の問題点は、各計算機が異なるデータ
量を読み出して処理を行う場合において、計算機や二次
記憶装置に障害が発生すると、第3の従来技術のように
読み出し量を一定にしておく負荷分散方法では、肩代わ
り負荷に不均衡が存在するため性能が低下してしまうこ
とである。その理由は、第2の問題点と同じである。The fourth problem is that, when each computer reads out a different amount of data and performs processing, if a failure occurs in the computer or the secondary storage device, the reading amount is made constant as in the third related art. In the load distribution method, the performance is degraded due to the imbalance in the substitute load. The reason is the same as the second problem.

【００２２】本発明の主たる目的は、インデックス化さ
れた大規模データに対する検索や集計などの処理を行う
場合でも、正常時と障害発生時の両方で、常に負荷分散
を行うことができる分散並列型データ処理装置を提供す
ることにある。そのために、下記の4つの目的を実現さ
せる。A main object of the present invention is to provide a distributed parallel type that can always perform load distribution both in normal times and when a failure occurs, even when performing processing such as searching and totalizing indexed large-scale data. A data processing device is provided. For that purpose, the following four objectives are realized.

【００２３】第1の目的は、各計算機の処理時間が二次
記憶装置からの入出力処理に依存している場合でも、各
計算機間の負荷を均一化し、システム全体のスループッ
トを最大化させることである。A first object is to equalize the load between the computers and maximize the throughput of the entire system even when the processing time of each computer depends on the input / output processing from the secondary storage device. It is.

【００２４】第2の目的は、各計算機で読み出すデータ
量が異なる処理であっても、各計算機の全体の処理時間
を均一化し、システム全体のスループットを最大化させ
ることである。A second object is to equalize the overall processing time of each computer and maximize the throughput of the entire system, even if the processing involves different amounts of data to be read by each computer.

【００２５】第3の目的は、各データに対するアクセス
頻度の分布が変化しても、自動的に各計算機の全体の処
理時間を均一化させることである。A third object is to automatically equalize the entire processing time of each computer even if the distribution of the access frequency to each data changes.

【００２６】第4の目的は、一部の計算機または二次記
憶装置が停止した場合においても、その肩代わり負荷を
均一化することによって性能低下を防ぐことである。A fourth object of the present invention is to prevent a decrease in performance by equalizing the load even when some computers or secondary storage devices are stopped.

【００２７】[0027]

【課題を解決するための手段】本発明の第1の分散並列
型データ処理装置は、大規模データを分割して各計算機
に分散格納し、さらにその分割データの複製を作成し
て、重複しないように冗長格納し、同じ分割データを対
象とした部分処理を異なる複数の計算機で行うことがで
きる。そのため、ある計算機の負荷が大きくなったり、
ある計算機が停止した場合、その計算機が所有している
分割データと同一の分割データを所有する他の計算機
が、処理を肩代わりできる。A first distributed parallel data processing apparatus according to the present invention divides large-scale data, stores the divided data in each computer, and creates a copy of the divided data to prevent duplication. In this way, the data is redundantly stored, and partial processing on the same divided data can be performed by a plurality of different computers. Therefore, the load on a certain computer increases,
When a computer stops, another computer that owns the same divided data as that of the computer can take over the processing.

【００２８】また、各分割データに対して、二次記憶装
置の入出力処理も含めた処理時間を測定する負荷監視手
段と、負荷監視手段が測定した各分割データに対する処
理時間を基に、処理時間の長い計算機への部分処理の依
頼回数を減らし、処理時間の短い計算機への部分処理の
依頼回数を増やすように分配比率を計算する分配比率計
算手段と、分配比率計算手段によって求められた分配比
率に従って依頼先の計算機を決定する負荷分散手段を有
する。A load monitoring means for measuring the processing time of each divided data including the input / output processing of the secondary storage device, and a processing time based on the processing time for each divided data measured by the load monitoring means. Distribution ratio calculation means for calculating a distribution ratio so as to reduce the number of requests for partial processing to a computer with a long time and increase the number of requests for partial processing to a computer with a short processing time, and distribution obtained by the distribution ratio calculation means It has a load balancing means for determining the requested computer according to the ratio.

【００２９】このような構成を採用し、二次記憶装置の
入出力処理も含めた処理時間を負荷分散の指標として用
い、全ての計算機における処理時間を均一化する分配比
率を求めることにより、本発明の第1および第2の目的を
達成することができる。By adopting such a configuration, the processing time including the input / output processing of the secondary storage device is used as an index of load distribution, and the distribution ratio for equalizing the processing time in all the computers is obtained. The first and second objects of the invention can be achieved.

【００３０】また、本発明の第1の分散並列型データ処
理装置が有する分配比率計算手段は、一定周期毎に起動
して負荷を均一化する分配比率を計算するように構成し
てもよい。さらに、本発明の第2の分散並列型データ処
理装置は、本発明の第1の分散並列型データ処理装置の
構成に加え、各計算機の負荷の偏りを検出して分配比率
計算手段を起動させる計算時間制御手段を有する。Further, the distribution ratio calculating means of the first distributed parallel data processing apparatus of the present invention may be configured to start at regular intervals and calculate a distribution ratio for equalizing the load. Further, the second distributed parallel data processing device of the present invention, in addition to the configuration of the first distributed parallel data processing device of the present invention, detects a bias in the load of each computer and activates the distribution ratio calculating means. It has calculation time control means.

【００３１】このような構成を採用し、定期的あるいは
各計算機の負荷に偏りが生じた時点で、負荷を均一化す
る分配比率を再計算するため、本発明の第3の目的を達
成することができる。また、本発明の第1の分散並列型
データ処理装置は、停止した計算機を検出する障害検出
手段と、各計算機に依頼した処理を保持しておく依頼処
理管理手段を有する。The third object of the present invention is achieved by adopting such a configuration and recalculating the distribution ratio for equalizing the load periodically or when the load on each computer becomes uneven. Can be. Further, the first distributed parallel data processing apparatus of the present invention has a failure detecting means for detecting a stopped computer, and a request processing managing means for holding processing requested to each computer.

【００３２】このような構成を採用し、一部の計算機が
停止した時に停止した計算機への分配比率を0に固定し
て分配比率の再計算を行い、停止した計算機を除く構成
で負荷を分散させることによって、本発明の第4の目的
を達成することができる。By adopting such a configuration, when some computers stop, the distribution ratio to the stopped computers is fixed to 0, the distribution ratio is recalculated, and the load is distributed by the configuration excluding the stopped computers. By doing so, the fourth object of the present invention can be achieved.

【００３３】[0033]

【発明の実施の形態】次に、本発明の実施の形態につい
て図面を参照して詳細に説明する。［第１の実施の形態］図1を参照すると、本発明の第1の
実施の形態は、外部通信網3で接続されたn台の独立な計
算機1_1〜1_n（nは1以上の整数。nを計算機台数とす
る。）と、計算機1_1〜1_nが独立に所有する二次記憶装
置2_1〜2_nから構成される。Next, embodiments of the present invention will be described in detail with reference to the drawings. [First Embodiment] Referring to FIG. 1, in a first embodiment of the present invention, n independent computers 1_1 to 1_n connected by an external communication network 3 (n is an integer of 1 or more. n is the number of computers.), and the secondary storage devices 2_1 to 2_n independently owned by the computers 1_1 to 1_n.

【００３４】各二次記憶装置は、大規模データをm×n個
（mは1以上の整数。mを粒度と呼ぶことにする。）に分
割した分割データをm個づつ分散格納している。また、
各分割データのコピーをr個（rは1以上の整数。rを冗長
度と呼ぶことにする。）作成し、同じ二次記憶装置に重
複しないように冗長格納しておく。ここでは便宜上、大
規模データをm×n個に分割して作成されたデータを原本
分割データと呼び、原本分割データをコピーして作成さ
れた分割データを、複製分割データと呼ぶこととする。Each secondary storage device stores m divided data obtained by dividing large-scale data into m × n pieces (m is an integer of 1 or more; m is referred to as a granularity). . Also,
R copies of each divided data (r is an integer of 1 or more; r is referred to as redundancy) are created and redundantly stored in the same secondary storage device so as not to be duplicated. Here, for convenience, data created by dividing large-scale data into m × n pieces is referred to as original divided data, and divided data created by copying the original divided data is referred to as duplicate divided data.

【００３５】図2を用いて分割データの分散および冗長
格納方法について説明する。まず、大規模データをmラn
個に分割し、原本分割データD11〜Dnmを作成する。次
に、原本分割データD11〜D1mを二次記憶装置2_1に格納
し、原本分割データD21〜D2mを二次記憶装置2_2に格納
する。同様に、原本分割データDi1〜Dim（iは1以上n以
下の整数）を二次記憶装置2_iに分散格納する。A method of distributing and redundantly storing divided data will be described with reference to FIG. First, large-scale data
Then, the original data D11 to Dnm are created. Next, the original divided data D11 to D1m are stored in the secondary storage device 2_1, and the original divided data D21 to D2m are stored in the secondary storage device 2_2. Similarly, the original divided data Di1 to Dim (i is an integer of 1 or more and n or less) are distributed and stored in the secondary storage device 2_i.

【００３６】次に、分割データD11〜Dnmの複製を作成
し、複製分割データD11を二次記憶装置2_2に格納し、複
製分割データD12を二次記憶装置2_3に格納し、同様に複
製分割データD1j（jは1以上m以下の整数）を二次記憶装
置2_j+1に格納する。さらに、複製分割データD21を二次
記憶装置2_3に格納し、複製分割データD22を二次記憶装
置2_4に格納し、以下同様に、複製分割データD2jを二次
記憶装置2_j+2に格納する。同様にして、すべての複製
分割データDijを二次記憶装置2_(i+j)に冗長格納する。
ただし、添え字(i+j)がnを越えた場合、(i+j)をnで割っ
た余りを添え字とし、格納先の二次記憶装置を循環させ
るものとする。Next, a duplicate of the divided data D11 to Dnm is created, the duplicate divided data D11 is stored in the secondary storage device 2_2, and the duplicate divided data D12 is stored in the secondary storage device 2_3. D1j (j is an integer from 1 to m) is stored in the secondary storage device 2_j + 1. Further, the copy division data D21 is stored in the secondary storage device 2_3, the copy division data D22 is stored in the secondary storage device 2_4, and similarly, the copy division data D2j is stored in the secondary storage device 2_j + 2. Similarly, all the duplicate divided data Dij are redundantly stored in the secondary storage device 2_ (i + j).
However, when the subscript (i + j) exceeds n, the remainder obtained by dividing (i + j) by n is used as the subscript, and the storage destination secondary storage device is circulated.

【００３７】冗長度rが2の場合、複製分割データDijを
さらに一台づつずらして、二次記憶装置2_(i+j+1)にも
格納する。この場合も、(i+j+1)がnを越えた場合、(i+j
+1)をnで割った余りを添え字とし、循環して格納してい
く。When the redundancy r is 2, the duplicate divided data Dij is further shifted one by one and stored in the secondary storage device 2_ (i + j + 1). Also in this case, if (i + j + 1) exceeds n, (i + j + 1)
The remainder obtained by dividing +1) by n is used as a subscript and stored cyclically.

【００３８】一般的に、冗長度rの場合、複製されたr個
の分割データDijを二次記憶装置2_(i+j+s)（sは0以上r-
1以下の整数）にそれぞれ格納することによって、同一
の二次記憶装置に重複しないように分割データを冗長格
納できる。ただし、(i+j+s)がnを越えた場合、(i+j+s)
をnで割った余りを添え字とし、循環して格納するもの
とする。冗長度が大きい程、負荷を均一化しやすくな
り、同時に停止可能な計算機の数も増加する。In general, in the case of the redundancy r, the copied r pieces of divided data Dij are stored in the secondary storage device 2_ (i + j + s) (s is 0 to r−
By storing them in the same secondary storage device, respectively, the divided data can be redundantly stored so as not to be duplicated. However, if (i + j + s) exceeds n, (i + j + s)
The remainder obtained by dividing by n is used as a subscript and stored in a cyclic manner. The greater the redundancy, the easier the load becomes, and the more computers that can be stopped at the same time.

【００３９】以上のような分割データの配置によって、
ある計算機が停止した場合でも、別の計算機が処理を肩
代わりすることができる。なお、分散および冗長格納方
法は、同一の二次記憶装置に同一の分割データが重複し
ない配置を実現できるのであれば、ここで説明した方法
に限定されない。With the arrangement of the divided data as described above,
Even if one computer stops, another computer can take over the processing. Note that the distribution and redundant storage method is not limited to the method described here as long as the same divided data can be arranged in the same secondary storage device without overlapping.

【００４０】図1を参照すると、計算機1_i（iは1以上n
以下の整数）は、大きく受付処理部11、負荷管理部12、
データ処理部13の3つの制御部を含む。受付処理部11
は、利用者からの処理要求を受け付けて結果を応答する
制御部であり、要求受付手段111、要求分配手段112、分
割データ管理手段113、部分結果統合手段114、障害検出
手段115、依頼処理管理手段116を備えている。負荷管理
部12は、各計算機の処理時間を測定し負荷を均一化する
制御部であり、負荷監視手段121、分配比率計算手段12
2、負荷分散手段123を備えている。データ処理部13は、
受付処理部から依頼された部分処理要求を実行する制御
部であり、要求処理手段131を備えている。Referring to FIG. 1, a computer 1_i (i is 1 or more and n
The following integers) are roughly the reception processing unit 11, the load management unit 12,
It includes three control units of the data processing unit 13. Reception processing unit 11
Is a control unit that receives a processing request from a user and responds with a result. The request receiving unit 111, the request distribution unit 112, the divided data management unit 113, the partial result integration unit 114, the failure detection unit 115, the request processing management Means 116 are provided. The load management unit 12 is a control unit that measures the processing time of each computer and equalizes the load, and includes a load monitoring unit 121, a distribution ratio calculation unit 12
2. It has a load balancing means 123. The data processing unit 13
It is a control unit that executes a partial processing request requested by the reception processing unit, and includes a request processing unit 131.

【００４１】要求受付手段111は、外部通信網3を介して
利用者から処理要求を受け付け、m×n個の分割データを
対象とした部分処理を作成する。The request receiving means 111 receives a processing request from a user via the external communication network 3 and creates a partial process for m × n divided data.

【００４２】要求分配手段112は、負荷分散手段121によ
って指定された計算機内の要求処理手段131に、外部通
信網3を介して部分処理を依頼する。分割データ管理手
段113は、m×n個の原本分割データと、r×m×n個の複製
分割データが、それぞれどの二次記憶装置に格納されて
いるかを分割データ管理表によって管理する。例えば、
図2のように分割データが配置されている場合、分割デ
ータ管理手段113が作成する分割データ管理表は図3のよ
うになる。図3を見ると、分割データD11を所有する計算
機は、1_1,1_2,1_3であることがわかる。部分結果統合
手段114は、各計算機の要求処理手段131からm×n個の分
割データに対する部分処理の結果を収集し、統合、編集
して外部通信網3を通じて利用者に応答する。障害検出
手段115は、故障などの理由により停止した計算機の検
出を行う。依頼処理管理手段116は、各計算機に依頼し
た部分処理要求のうち、未処理の部分処理を図4に示す
ような依頼処理管理表として保存しておく。図4を見る
と、異なる処理要求Q1、Q2について、計算機1_1には分
割データD14,D1mを対象とした部分処理(Q1,D14),(Q1,D1
m)と、分割データD11を対象とした部分処理(Q2,D11)を
依頼中で結果が未返答であることがわかる。The request distribution unit 112 requests the request processing unit 131 in the computer designated by the load distribution unit 121 to perform a partial process via the external communication network 3. The divided data management unit 113 manages which secondary storage device stores the m × n original divided data and the r × m × n duplicate divided data by using a divided data management table. For example,
When divided data is arranged as shown in FIG. 2, the divided data management table created by the divided data management unit 113 is as shown in FIG. FIG. 3 shows that the computers that own the divided data D11 are 1_1, 1_2, and 1_3. The partial result integrating unit 114 collects the results of the partial processing on the m × n divided data from the request processing unit 131 of each computer, integrates and edits the results, and responds to the user via the external communication network 3. The failure detection means 115 detects a computer that has stopped due to a failure or the like. The request processing management means 116 stores the unprocessed partial processing among the partial processing requests requested by the computers as a request processing management table as shown in FIG. Referring to FIG. 4, for the different processing requests Q1 and Q2, the computer 1_1 performs partial processing (Q1, D14), (Q1, D1) on the divided data D14, D1m.
m), it can be seen that the partial processing (Q2, D11) for the divided data D11 has been requested and the result has not been replied.

【００４３】負荷監視手段121は、各分割データに対す
る部分処理について、二次記憶装置からの入出力処理も
含めた平均処理時間を測定する。分配比率計算手段122
は、負荷分散手段121が測定した各分割データに対する
部分処理の平均処理時間を基に、各計算機の処理時間が
均一になるような部分処理の分配比率を計算し、図5に
示すような分配比率表を作成する。図5を見ると、分割
データD11に対する部分処理は、計算機1_1, 1_2, 1_3に
それぞれ40%, 20%, 30%の比率の回数で依頼することが
わかる。負荷分散手段123は、分配比率計算手段122が計
算した分配比率を基に部分処理の分配先を決定し、要求
分配手段112に通知する。The load monitoring means 121 measures the average processing time of the partial processing for each divided data, including the input / output processing from the secondary storage device. Distribution ratio calculation means 122
Calculates the distribution ratio of the partial processing that makes the processing time of each computer uniform based on the average processing time of the partial processing for each divided data measured by the load distribution unit 121, and calculates the distribution ratio as shown in FIG. Create a ratio table. Referring to FIG. 5, it can be seen that the partial processing for the divided data D11 is requested to the computers 1_1, 1_2, and 1_3 at a ratio of 40%, 20%, and 30%, respectively. The load distribution unit 123 determines the distribution destination of the partial process based on the distribution ratio calculated by the distribution ratio calculation unit 122, and notifies the request distribution unit 112.

【００４４】要求処理手段131は、要求分配手段112から
依頼された部分処理を実行し、部分処理結果を部分結果
統合手段114に返答する。The request processing means 131 executes the partial processing requested by the request distribution means 112 and returns a result of the partial processing to the partial result integrating means 114.

【００４５】次に、図1から図6を参照して本実施の形態
の動作について詳細に説明する。利用者は、検索や集計
などの処理要求を、外部通信網3を通じて任意の計算機1
_a（aは1以上n以下の整数）の要求受付手段111に送信す
る（図6のステップS11）。要求受付手段111は、受け付
けた処理要求について、処理対象となる分割データを指
定したm×n個の部分処理を作成し、負荷分散手段123に
渡す（図6のステップS12）。負荷分散手段123は、すべ
ての部分処理について、分配比率計算手段122によって
あらかじめ作成された分配比率表を参照して依頼先の計
算機を決定し、要求分配手段112に渡す。要求分配手段1
12は、外部通信網3を通じて、指定された依頼先の計算
機の要求処理手段131に部分処理を依頼する（図6のステ
ップS13、S14）。Next, the operation of this embodiment will be described in detail with reference to FIGS. The user sends a processing request such as a search or tally to any computer 1 through the external communication network 3.
_a (a is an integer of 1 or more and n or less) is transmitted to the request receiving means 111 (step S11 in FIG. 6). The request accepting unit 111 creates m × n partial processes that specify the divided data to be processed with respect to the accepted processing request, and passes them to the load distribution unit 123 (Step S12 in FIG. 6). The load distribution unit 123 refers to the distribution ratio table created in advance by the distribution ratio calculation unit 122 to determine the requested computer for all of the partial processes, and passes it to the request distribution unit 112. Request distribution means 1
12 requests the request processing means 131 of the designated request destination computer for partial processing via the external communication network 3 (steps S13 and S14 in FIG. 6).

【００４６】同時に、依頼処理管理手段116が、依頼し
た部分処理を依頼処理管理表に登録する。At the same time, the request processing management means 116 registers the requested partial processing in the request processing management table.

【００４７】依頼先の計算機の要求処理手段131は、依
頼された部分処理を実行する。依頼先の計算機の要求処
理手段131が別の部分処理を処理中である場合は、待ち
行列につないでおき、順次処理を行っていく。あるい
は、要求処理手段131が複数の部分処理を同時に並行に
処理してもよい。The request processing means 131 of the requested computer executes the requested partial processing. If the request processing unit 131 of the requested computer is processing another partial process, the request is connected to a queue and the process is performed sequentially. Alternatively, the request processing unit 131 may simultaneously process a plurality of partial processes in parallel.

【００４８】次に、部分処理を依頼された計算機は、要
求処理手段131によって得られた部分処理の結果を、外
部通信網3を通じて依頼元の計算機1_a内の部分結果統合
手段114に返答する（図6のステップS15）。また、該部
分処理に要した処理時間を、依頼元の計算機1_a内の負
荷監視手段121に送信する。同時に、計算機1_a内の依頼
処理管理手段116が、返答された部分処理を依頼処理管
理表から削除する。計算機1_aの部分結果統合手段114
は、部分処理の結果を受け付けて保持しておき、m×n個
の分割データに対する部分結果がすべてそろった時点で
統合・編集し、外部通信網3を介して利用者に応答する
（図6のステップS16、S17）。Next, the computer requested to perform the partial processing returns the result of the partial processing obtained by the request processing means 131 to the partial result integrating means 114 in the requesting computer 1_a through the external communication network 3 ( Step S15 in FIG. 6). In addition, the processing time required for the partial processing is transmitted to the load monitoring unit 121 in the computer 1_a of the request source. At the same time, the request processing management means 116 in the computer 1_a deletes the replied partial processing from the request processing management table. Partial result integration means 114 of computer 1_a
Accepts and holds the results of the partial processing, integrates and edits all partial results for the m × n divided data, and responds to the user via the external communication network 3 (FIG. 6). Steps S16 and S17).

【００４９】ここでは、利用者からの処理要求が返答さ
れるまでの流れを、逐次処理的に説明したが、図6の各
ステップを非同期で並列に行うことによって、各計算機
の遊休時間を削減でき、より並列化の効率を向上させる
ことが可能である。Here, the flow up to the response of the processing request from the user has been described in a sequential manner, but the steps in FIG. 6 are performed asynchronously and in parallel to reduce the idle time of each computer. It is possible to further improve the efficiency of parallelization.

【００５０】次に、分配比率計算手段122が、負荷を均
一化させるために部分処理の分配比率を計算する方法に
ついて図7を用いて説明する。Next, a method in which the distribution ratio calculating means 122 calculates the distribution ratio of the partial processing in order to equalize the load will be described with reference to FIG.

【００５１】任意の分割データDij（iは1以上n以下の整
数。jは1以上m以下の整数）に対する部分処理の平均処
理時間をT(Dij)と記述することにする。また、分割デー
タDijについて、計算機1_k（kは1以上n以下の整数）に
部分処理を依頼する回数の割合をd(Dij,1_k)と記述する
ことにする。ここで、ある計算機1_kに格納されている
分割データが、D11、D12、D13のみであったとすると、
計算機1_kの平均処理時間E(1_k)は、近似的に E(1_k)=T(D11)ラd(D11,1_k)+T(D12)ラd(D12,1_k)+T(D13)ラ
d(D13,1_k) と計算できる。The average processing time of partial processing for arbitrary divided data Dij (i is an integer of 1 or more and n or less; j is an integer of 1 or more and m or less) is described as T (Dij). Further, for the divided data Dij, a ratio of the number of times of requesting the computer 1_k (k is an integer of 1 or more and n or less) to perform a partial process is described as d (Dij, 1_k). Here, if the divided data stored in a certain computer 1_k is only D11, D12, and D13,
The average processing time E (1_k) of the computer 1_k is approximately E (1_k) = T (D11) d (D11,1_k) + T (D12) d d (D12,1_k) + T (D13)
It can be calculated as d (D13,1_k).

【００５２】分配比率計算手段122は、負荷監視手段121
が測定した個々の分割データに対する部分処理の処理時
間と、分割データ管理手段113が保持する分割データ管
理表を基に、一定期間ごとに上記の計算方法によって、
各計算機の平均処理時間を計算する（図7のステップS2
1）。次に、平均処理時間が最大の計算機1_e（eは1以上
n以下の整数）を特定し（図7のステップS23）、計算機1
_eに格納されているのと同一の分割データを所有する計
算機のうち、平均処理時間が最小の計算機1_fを特定す
る（図7のステップS24）。さらに、計算機1_eと1_fが共
通して所有する分割データの分配比率について、計算機
1_eに依頼する回数の比率を下げ、計算機1_fに依頼する
比率を上げる（図7のステップS25）。再び各計算機の平
均処理時間を計算し（図7のステップS21）、平均処理時
間が均一になっていれば終了し、均一でなければ分配比
率の変更を繰り返す（図7のステップS22）。これによっ
て、各計算機の平均処理時間を均一化できる分配比率を
計算することが可能である。The distribution ratio calculating means 122 includes a load monitoring means 121
Based on the processing time of the partial processing for each divided data measured by the above and the divided data management table held by the divided data management means 113, the above-described calculation method is performed at regular intervals,
Calculate the average processing time of each computer (Step S2 in FIG. 7)
1). Next, the computer 1_e with the largest average processing time (e is 1 or more
(an integer less than or equal to n) (Step S23 in FIG. 7), and Calculator 1
Among the computers having the same divided data stored in _e, the computer 1_f having the shortest average processing time is specified (step S24 in FIG. 7). Further, regarding the distribution ratio of the divided data shared by the computers 1_e and 1_f, the computer
The ratio of the number of requests to 1_e is reduced, and the ratio of requests to the computer 1_f is increased (step S25 in FIG. 7). The average processing time of each computer is calculated again (step S21 in FIG. 7). If the average processing time is uniform, the process ends. If not, the change of the distribution ratio is repeated (step S22 in FIG. 7). As a result, it is possible to calculate a distribution ratio that can make the average processing time of each computer uniform.

【００５３】ここでは、図7のステップS22で、平均処理
時間が均一になるまで繰り返すとしたが、各計算機の処
理時間の平均値の変化が定常状態になった時点で計算を
打ち切っても良い。また、あらかじめ繰り返し回数を決
めておき途中で計算を打ち切るなどの方法も考えられ
る。Here, in step S22 in FIG. 7, the processing is repeated until the average processing time becomes uniform. However, the calculation may be terminated when the average value of the processing time of each computer changes to a steady state. . A method of determining the number of repetitions in advance and terminating the calculation in the middle is also conceivable.

【００５４】次に、図8を参照して、一部の計算機また
は二次記憶装置に故障などの障害が発生した場合の動作
について説明する。Next, with reference to FIG. 8, an operation when a failure such as a failure occurs in some computers or secondary storage devices will be described.

【００５５】ある瞬間に、故障などの予期しない理由に
よって計算機1_h（hは1以上n以下の整数）が停止してし
まったとする。障害検出手段115は、計算機1_hが停止し
たことを検出して分配比率計算手段122に知らせ、割り
込み処理を起動する（図8のステップS31）。障害検出の
方法としては、各計算機と定期的に外部通信網3を介し
て小さなデータ交換を行い、データ交換が不能になった
時にその計算機に障害が起きたものと判断する方法があ
る。あるいは、ハードウェアで専用の故障診断機構を取
り入れる方法もある。At a certain moment, it is assumed that the computer 1_h (h is an integer of 1 or more and n or less) is stopped for an unexpected reason such as a failure. The fault detecting means 115 detects that the computer 1_h has stopped, notifies the distribution ratio calculating means 122, and starts an interrupt process (step S31 in FIG. 8). As a method of failure detection, there is a method in which small data exchange is periodically performed with each computer via the external communication network 3, and when the data exchange becomes impossible, it is determined that the computer has failed. Alternatively, there is a method of incorporating a dedicated failure diagnosis mechanism in hardware.

【００５６】障害時の割り込み処理を起動された分配比
率計算手段122は、計算機1_hへの分配比率を0に固定し
て、図7に示す方法に従って分配比率を再計算する（図8
のステップS32およびS33）。次に、依頼処理管理手段11
6が保持している依頼処理管理表を参照して、計算機1_h
に依頼後結果が未返答な部分処理を負荷分散手段123に
渡し、停止していない計算機に再分配する（図8のステ
ップS34）。The distribution ratio calculating means 122, which has started the interruption processing at the time of failure, fixes the distribution ratio to the computer 1_h to 0 and recalculates the distribution ratio according to the method shown in FIG. 7 (FIG. 8).
Steps S32 and S33). Next, the request processing management means 11
6 referring to the request processing management table held by computer 1_h
Then, the partial process that has not returned the result after the request is passed to the load distribution unit 123, and is redistributed to computers that have not stopped (step S34 in FIG. 8).

【００５７】これ以降、入力される処理要求についての
部分処理は、計算機1_hへの分配比率が0になっているの
で、計算機1_hに依頼されることはない。計算機1_hが回
復した場合は、計算機1_hへの分配比率を適当に与えて
から分配比率計算手段122によって分配比率を再々計算
することによって、計算機1_hを含んだ構成で負荷の均
一化を行うことができる。Thereafter, the partial processing for the input processing request is not requested to the computer 1_h because the distribution ratio to the computer 1_h is 0. When the computer 1_h recovers, by appropriately giving the distribution ratio to the computer 1_h and re-calculating the distribution ratio by the distribution ratio calculating means 122, it is possible to equalize the load in the configuration including the computer 1_h. it can.

【００５８】次に、本実施の形態の効果について説明す
る。Next, the effect of the present embodiment will be described.

【００５９】本実施の形態では、複数の異なる計算機間
で同一な分割データを所有し、部分処理の分配比率を変
化させることによって、平均処理時間の長い計算機への
部分処理の依頼回数を減らし、平均処理時間の短い計算
機への部分処理の依頼回数を増やしている。そのため、
各計算機で読み出すデータ量が異なる処理であっても、
各計算機の全体の処理時間を均一化することが可能であ
り、システム全体のスループットを最大化できる。In the present embodiment, the same divided data is owned by a plurality of different computers and the distribution ratio of the partial processing is changed, so that the number of requests for the partial processing to the computer having a long average processing time is reduced. The number of requests for partial processing to a computer with a short average processing time is increasing. for that reason,
Even if the amount of data read by each computer is different,
The entire processing time of each computer can be made uniform, and the throughput of the entire system can be maximized.

【００６０】本実施の形態の第2の効果は、分配比率の
計算には各分割データに対する部分処理の平均処理時間
を用いているために、二次記憶装置の入出力処理の時間
を反映した負荷の均一化が可能なことである。The second effect of the present embodiment is that the calculation of the distribution ratio uses the average processing time of the partial processing for each divided data, so that the time of the input / output processing of the secondary storage device is reflected. That is, the load can be equalized.

【００６１】本実施の形態の第3の効果は、一定期間毎
に分配比率計算手段が各計算機の処理時間を均一化する
ように分配比率を計算するために、ある時点で各データ
へのアクセス頻度の分布が変化しても、柔軟に対応でき
ることである。The third effect of the present embodiment is that the distribution ratio calculation means calculates the distribution ratio so as to equalize the processing time of each computer at regular time intervals. It is possible to flexibly cope with a change in the frequency distribution.

【００６２】本実施の形態の第4の効果は、障害発生時
において停止した計算機を除く構成で分配比率の再計算
を行い、停止した計算機から未返答の部分処理を他の計
算機に再分配するため、予期しない理由による障害が発
生した場合でも無停止でサービスを継続でき、しかも、
停止した計算機の肩代わり負荷も均一化して障害発生時
の性能低下を防ぐことができることである。The fourth effect of the present embodiment is that the distribution ratio is recalculated in a configuration other than the computer stopped when a failure occurs, and the unprocessed partial processes are redistributed from the stopped computer to other computers. Therefore, even if an unexpected failure occurs, the service can be continued without interruption,
Another object of the present invention is to make the load on the shoulder of the stopped computer uniform, thereby preventing performance degradation when a failure occurs.

【００６３】［第２の実施の形態］次に、本発明の第2
の実施の形態について図面を参照して詳細に説明する。[Second Embodiment] Next, a second embodiment of the present invention will be described.
An embodiment will be described in detail with reference to the drawings.

【００６４】図17を参照すると、本発明の第2の実施の
形態は、計算機1_1〜1_n（nは1以上n以下の整数）が有
する負荷管理部12が、負荷監視手段121、分配比率計算
手段122、負荷分散手段123の構成に加え、計算時間制御
手段124を有する点で異なる。Referring to FIG. 17, according to the second embodiment of the present invention, the load management unit 12 of each of the computers 1_1 to 1_n (n is an integer of 1 or more and n or less) includes a load monitoring unit 121 and a distribution ratio calculation. The difference is that a calculation time control unit 124 is provided in addition to the configuration of the unit 122 and the load distribution unit 123.

【００６５】計算時間制御手段124は、各計算機に依頼
後未返答の部分処理の数を計数し、その偏りが大きくな
ると、分配比率計算手段122を起動する。本実施の形態
の動作を図面を参照して詳細に説明する。The calculation time control means 124 counts the number of unprocessed partial processes after requesting each computer, and activates the distribution ratio calculation means 122 when the deviation becomes large. The operation of the present embodiment will be described in detail with reference to the drawings.

【００６６】二次記憶装置2_1〜2_nへの分割データの格
納方法および、計算機1_1〜1_nの受付処理部11が含む要
求受付手段111、要求分配手段112、分割データ管理手段
113、部分結果統合手段114、障害検出手段115、依頼処
理管理手段116、およびデータ処理部13が含む要求処理
手段131、および負荷管理部12内の負荷監視手段121、負
荷分散手段123の動作は、第1の実施の形態の各手段の動
作と同一のため、説明は省略する。The method of storing the divided data in the secondary storage devices 2_1 to 2_n, and the request receiving unit 111, the request distribution unit 112, and the divided data management unit included in the reception processing unit 11 of each of the computers 1_1 to 1_n
113, the partial result integration means 114, the failure detection means 115, the request processing management means 116, the request processing means 131 included in the data processing unit 13, the load monitoring means 121 in the load management unit 12, and the operations of the load distribution means 123 Since the operation is the same as that of each means of the first embodiment, the description is omitted.

【００６７】第1の実施の形態では、分配比率計算手段1
22は、あらかじめ指定された一定の時間間隔で起動さ
れ、各計算機の負荷を均一化する分配比率を計算してい
た。本実施の形態では、各計算機に依頼後未返答の部分
処理の数の偏りが大きくなると、分配比率計算手段122
を起動して分配比率を再計算する。In the first embodiment, the distribution ratio calculating means 1
22 is started at a predetermined time interval specified in advance, and calculates a distribution ratio for equalizing the load of each computer. In the present embodiment, when the deviation in the number of partial processes that have not been replied to each computer after being requested increases, the distribution ratio calculation means 122
And recalculate the distribution ratio.

【００６８】図18を参照すると、計算時間制御手段124
は、負荷分散手段123が部分処理の依頼先を決定するた
びに起動され、依頼処理管理手段116が保持している依
頼処理管理表を参照して、各計算機に依頼後未返答の部
分処理の数を計数する（図18のS41）。次に、計算時間
制御手段124は、各計算機が持っている未返答の部分処
理の偏りVを計算する（図18のS42）。ここで偏りVと
は、全計算機の未返答の部分処理の数うち、最大値Umax
と最小値Uminの比Umax/Uminと定義する。この偏りV=Uma
x/Uminが、あらかじめ定めておいた閾値を越えると、分
配比率計算手段122を起動し、分配比率を再計算して補
正する。Referring to FIG. 18, calculation time control means 124
Is started each time the load distribution unit 123 determines the request destination of the partial process, and refers to the request process management table held by the request process management unit 116 to execute The number is counted (S41 in FIG. 18). Next, the calculation time control means 124 calculates the bias V of the unprocessed partial process of each computer (S42 in FIG. 18). Here, the bias V is the maximum value Umax of the number of unprocessed partial processes of all computers.
And the ratio Umax / Umin of the minimum value Umin. This bias V = Uma
When x / Umin exceeds a predetermined threshold value, the distribution ratio calculation means 122 is activated, and the distribution ratio is recalculated and corrected.

【００６９】なお、ここでは偏りVを未返答の部分処理
数の最大値と最小値の比としたが、他にも、平均値と最
大値の比や、各計算機における分散などを用いてもよ
い。Here, the bias V is defined as the ratio between the maximum value and the minimum value of the number of unprocessed partial processes, but other than that, the ratio between the average value and the maximum value, the variance in each computer, etc. may be used. Good.

【００７０】次に、本実施の形態の効果について説明す
る。Next, the effect of the present embodiment will be described.

【００７１】本実施の形態では、計算時間制御手段124
が各計算機に依頼後未返答の部分処理の数の偏りが大き
くなると、分配比率計算手段122を起動して、分配比率
を再計算する。これにより、各分割データに対する平均
処理時間が突然変わり、負荷の均衡が破れても、柔軟に
対応することができる。In the present embodiment, the calculation time control means 124
If the deviation of the number of unprocessed partial processes after requesting each computer increases, the distribution ratio calculation unit 122 is activated to recalculate the distribution ratio. Thus, even if the average processing time for each divided data suddenly changes and the load balance is broken, it is possible to flexibly cope with the load.

【００７２】［第３の実施の形態］次に、本発明の第3
の実施の形態について図面を参照して詳細に説明する。
図20を参照すると、本発明の第3の実施の形態は、本発
明の第1および第2の実施の形態と同様に、計算機1_1〜1
_n、二次記憶装置2_1〜2_n、外部通信網3を備え、さら
に、制御プログラムを記録した記録媒体4を備える。こ
の記録媒体4は磁気ディスク、半導体メモリ、CD-ROMそ
の他の記録媒体であってよい。[Third Embodiment] Next, a third embodiment of the present invention will be described.
An embodiment will be described in detail with reference to the drawings.
Referring to FIG. 20, a third embodiment of the present invention is similar to the first and second embodiments of the present invention, in which computers 1_1 to 1
_n, secondary storage devices 2_1 to 2_n, an external communication network 3, and a recording medium 4 on which a control program is recorded. The recording medium 4 may be a magnetic disk, a semiconductor memory, a CD-ROM, or another recording medium.

【００７３】制御プログラムは記録媒体4から計算機1_1
に読み込まれ、さらに外部通信網3を通じて計算機1_2〜
1_nに読み込まれる。また、制御プログラムは計算機1_1
〜1_nの動作を制御し、二次記憶装置2_1〜2_nに分割デ
ータを分散冗長格納する。計算機1_1〜1_nは、制御プロ
グラムの制御により、第1および第2の実施の形態におけ
る計算機1_1〜1_nによる処理と同一の処理を実行する。The control program is transmitted from the recording medium 4 to the computer 1_1.
To the computer 1_2 through the external communication network 3.
Read in 1_n. The control program is computer 1_1
To 1_n, and the divided data is distributed and redundantly stored in the secondary storage devices 2_1 to 2_n. The computers 1_1 to 1_n execute the same processing as the processing by the computers 1_1 to 1_n in the first and second embodiments under the control of the control program.

【００７４】[0074]

【実施例】次に、具体的な実施例を用いて本発明の第一
の実施の形態の動作を説明する。ここでは、第一の実施
の形態を用いた構成例として、大規模な文書群からキー
ワードなどの条件に一致した文書群を応答する全文検索
システムを挙げる。Next, the operation of the first embodiment of the present invention will be described with reference to specific examples. Here, as a configuration example using the first embodiment, a full-text search system that responds to a document group that matches a condition such as a keyword from a large-scale document group will be described.

【００７５】図9に本実施の形態を用いて構成した全文
検索システムを示す。図9を参照すると本実施例は、そ
れぞれローカルな二次記憶装置を所有した計算機とし
て、汎用のパーソナル・コンピュータP1_1,P1_2,P1_3
と、それをつなぐ外部通信網としてEthernetから構成さ
れている。ここでは、計算機の台数n=3の場合について
説明する。FIG. 9 shows a full-text search system configured using this embodiment. Referring to FIG. 9, in the present embodiment, general-purpose personal computers P1_1, P1_2, P1_3 are used as computers each having a local secondary storage device.
And Ethernet as an external communication network connecting them. Here, a case where the number of computers is n = 3 will be described.

【００７６】パーソナル・コンピュータ1_1〜1_3は受付
処理部11、負荷管理部12、データ処理部13の3つの制御
部として機能する中央演算装置と、二次記憶装置として
ハードディスクを備えている。また、受付処理部11は要
求受付手段111、要求分配手段112、分割データ管理手段
113、部分結果統合手段114、障害検出手段115、依頼処
理管理手段116の機能を備え、負荷管理部12は負荷監視
手段121、分配比率計算手段122、負荷分散手段123の機
能を備え、データ処理部13は要求処理手段131の機能を
備えている。Each of the personal computers 1_1 to 1_3 includes a central processing unit functioning as three control units of a reception processing unit 11, a load management unit 12, and a data processing unit 13, and a hard disk as a secondary storage device. The reception processing unit 11 includes a request reception unit 111, a request distribution unit 112, a divided data management unit.
113, a partial result integrating unit 114, a failure detecting unit 115, and a request processing managing unit 116, and the load managing unit 12 has functions of a load monitoring unit 121, a distribution ratio calculating unit 122, and a load distributing unit 123. The unit 13 has the function of the request processing unit 131.

【００７７】図10を用いて、ハードディスクへの分割デ
ータの分散および冗長格納方法について、粒度m=2、冗
長度r=1の場合について説明する。また、この例では、
検索対象文書として、6万件の大規模文書群があるもの
とする。検索対象文書にはそれぞれID00001〜ID60000の
文書番号を付与してある。Referring to FIG. 10, a description will be given of a method of distributing and redundantly storing divided data on a hard disk in a case where the granularity is m = 2 and the degree of redundancy is r = 1. Also, in this example,
It is assumed that there are 60,000 large document groups as documents to be searched. Documents to be searched are assigned document numbers of ID00001 to ID60000, respectively.

【００７８】まず、6万件の大規模文書群を、10000件ず
つmラn=2ラ3=6個に分割する。ここでは、分割文書群D11に
はID00001〜ID10000の文書が含まれ、分割文書群D12に
はID10001〜ID20000の文書が含まれている。以下同様に
分割文書群Dij（iは1以上3以下の整数。jは1以上2以下
の整数。）にはID((k-1)ラ10000+1)〜ID(kラ10000)(ただ
しk=(i-1)*2+j)の文書が含まれている。ここでは、文書
数を基準に分割したが、他にも文書容量を基準にして分
割を行うなどの方法がある。First, a large document group of 60,000 documents is divided into m × n = 2 × 3 = 6 by 10,000 documents. Here, the divided document group D11 includes documents of ID00001 to ID10000, and the divided document group D12 includes documents of ID10001 to ID20000. Similarly, the divided document group Dij (i is an integer of 1 or more and 3 or less; j is an integer of 1 or more and 2 or less) includes ID ((k−1) × 10000 + 1) to ID (k × 10000) (where k = (i-1) * 2 + j). Here, the document is divided based on the number of documents, but there are other methods such as dividing the document based on the document capacity.

【００７９】この、分割文書群D11〜D32を分散格納し、
各分割文書群に対して線形探索による検索を行ってもよ
いが、通常は検索速度を向上させるために、あらかじめ
各文書群からインデックスを作成しておくことが一般的
である。ここで、インデックスとは、文書群中に出現す
る全ての単語について、それが出現する文書番号の一覧
を作成したファイルのことである。このインデックスフ
ァイルを用いることによって、検索時にすべての文書群
を読み出すことなく、指定された単語が出現する文書群
の一覧を得ることができる。The divided document groups D11 to D32 are distributed and stored.
A search by a linear search may be performed on each divided document group. However, it is general to create an index from each document group in advance in order to improve the search speed. Here, the index is a file that creates a list of document numbers in which all words appearing in the document group appear. By using this index file, a list of documents in which the specified word appears can be obtained without reading out all the documents in the search.

【００８０】図10では、分割文書群D11から分割インデ
ックスX11が作成され、分割文書群D12から分割インデッ
クスX12が作成されている。以下、同様に分割文書群Dij
（iは1以上3以下の整数。jは1以上2以下の整数。）か
ら、分割インデックスXijが作成される。ここで作成し
た分割インデックスを、原本分割インデックスとする。In FIG. 10, a division index X11 is created from the division document group D11, and a division index X12 is created from the division document group D12. Hereinafter, similarly, the divided document group Dij
(I is an integer of 1 or more and 3 or less; j is an integer of 1 or more and 2 or less.), A division index Xij is created. The division index created here is an original division index.

【００８１】ここで、原本分割インデックスXi1〜Xim
（iは1以上3以下の整数）がハードディスク2_iに格納さ
れる。具体的には、原本分割インデックスX11,X12をハ
ードディスクP2_1に格納し、原本分割インデックスX21,
X22をハードディスクP2_2に格納し、原本分割インデッ
クスX31,X32をハードディスクP2_3に格納する。さら
に、各分割インデックスのコピーをr=1個づつ作成し、
複製分割インデックスDij（iは1以上3以下の整数。jは1
以上2以下の整数。）を循環させながらハードディスクP
2_(i+j)に冗長格納する。具体的には、複製分割インデ
ックスX11はハードディスクP2_2に、複製分割インデッ
クスX12はハードディスクP2_3に、複製分割インデック
スX21はハードディスクP2_3に格納されている。さら
に、複製分割インデックスX22とX31はハードディスクP2
_4ではなく、添え字を3で割った余りとしてハードディ
スクP2_1に循環して格納し、複製分割インデックスX32
は同様にハードディスクP2_2に循環して格納している。Here, the original division indexes Xi1 to Xim
(I is an integer of 1 or more and 3 or less) is stored in the hard disk 2_i. Specifically, the original division indexes X11 and X12 are stored in the hard disk P2_1, and the original division indexes X21 and X21 are stored.
X22 is stored in the hard disk P2_2, and the original document division indexes X31 and X32 are stored in the hard disk P2_3. In addition, r = 1 copies of each split index are created,
Replication division index Dij (i is an integer from 1 to 3; j is 1
An integer between 2 and 2. Hard disk P while circulating)
Store redundantly in 2_ (i + j). Specifically, the replication division index X11 is stored in the hard disk P2_2, the replication division index X12 is stored in the hard disk P2_3, and the replication division index X21 is stored in the hard disk P2_3. Furthermore, the replication division indexes X22 and X31 are stored in the hard disk P2
Instead of _4, it is stored as a remainder of the subscript divided by 3 in the hard disk P2_1 in a circular manner, and the replication division index X32
Is also stored in a circular manner on the hard disk P2_2.

【００８２】この時、パーソナル・コンピュータ1_1〜1
_3の中央演算装置が含む分割データ管理手段は、図11に
示すような分割データ管理表を作成する。At this time, the personal computers 1_1 to 1_1
The divided data management means included in the central processing unit of _3 creates a divided data management table as shown in FIG.

【００８３】また、パーソナル・コンピュータ1_1〜1_3
の中央演算装置が有する分配比率計算手段によって、分
割インデックスへの部分処理の分配比率が図12(a)のよ
うに求まっているものとする。Also, the personal computers 1_1 to 1_3
It is assumed that the distribution ratio of the partial process to the divided index is determined as shown in FIG. 12A by the distribution ratio calculating means of the central processing unit.

【００８４】次に、正常時の動作について説明する。あ
る時点で、Ethernetを通じて利用者から検索処理の要求
が、パーソナル・コンピュータP1_3に送信されたとす
る。パーソナル・コンピュータP1_3の中央演算装置が有
する要求受付手段は、この検索要求を受け取って分割イ
ンデックスX11〜X32を検索対象とする6つの部分処理を
作成する。Next, a normal operation will be described. At a certain point, it is assumed that a search processing request is transmitted from the user to the personal computer P1_3 via Ethernet. The request accepting means of the central processing unit of the personal computer P1_3 receives this search request and creates six partial processes for which the divided indexes X11 to X32 are to be searched.

【００８５】次に、パーソナル・コンピュータP1_3の中
央演算装置が有する負荷分散手段が、6つの部分処理の
依頼先を決定する。負荷分散手段が部分処理の依頼先を
決定する方法の一例を図12(b)を用いて示す。図12(b)を
参照すると、負荷分散手段は、あらかじめ分割インデッ
クスX11〜X32に対し、それぞれ長さ100の配列を用意
し、分配比率計算手段が求めた分配比率に従って塗り分
けを行っておく。例えば、分割インデックスX11は、パ
ーソナル・コンピュータP1_1とP1_2にそれぞれ40％, 60
%の比率で部分処理を依頼するため、長さ100の配列の初
めの40要素にはパーソナル・コンピュータP1_1を示す番
号を代入し、残りの60要素にはパーソナル・コンピュー
タP1_2を示す番号を代入しておく。部分処理の依頼先を
決定する場合、まず1から100までの乱数xを発生し、配
列のx番目の要素を参照すれば、分配比率に従って依頼
先のパーソナル・コンピュータを決定できる。Next, the load distribution means of the central processing unit of the personal computer P1_3 determines the request destinations of the six partial processes. An example of a method in which the load distribution means determines the request destination of the partial processing will be described with reference to FIG. Referring to FIG. 12 (b), the load distribution unit prepares an array having a length of 100 for each of the division indices X11 to X32 in advance, and paints them in accordance with the distribution ratio calculated by the distribution ratio calculation unit. For example, the split index X11 is assigned to the personal computers P1_1 and P1_2 by 40% and 60%, respectively.
In order to request partial processing at a ratio of%, the number indicating the personal computer P1_1 is substituted into the first 40 elements of the array of length 100, and the number indicating the personal computer P1_2 is substituted into the remaining 60 elements. Keep it. When determining the request destination of the partial processing, first, a random number x from 1 to 100 is generated, and by referring to the x-th element of the array, the request destination personal computer can be determined according to the distribution ratio.

【００８６】なお、負荷分散手段が分配比率に従って部
分処理の依頼先を決定する方法としては、あらかじめ分
配比率を満たすように各パーソナル・コンピュータへの
依頼順序を定めておき、それに従う方法などがあり、こ
こに挙げた方法に限定されない。As a method for the load distribution means to determine the request destination of the partial processing according to the distribution ratio, there is a method in which the order of requests to the personal computers is determined in advance so as to satisfy the distribution ratio, and the order is followed. However, the method is not limited to the method described here.

【００８７】仮に、パーソナル・コンピュータ1_3の中
央演算装置が有する負荷分散手段が、検索条件Q1につい
て分割インデックスX22を検索対象とする部分処理(Q1,X
22)を、パーソナル・コンピュータ1_2に依頼することに
決定したとする。依頼先が決定された部分処理(Q1, X2
2)は、パーソナル・コンピュータP1_3の中央演算装置が
有する要求分配手段によって、Ethernetを通じて指定さ
れたパーソナル・コンピュータP1_2の中央演算装置が有
する要求処理手段に依頼される。同時に、パーソナル・
コンピュータP1_3の中央演算装置が有する依頼処理管理
手段が、部分処理(Q1,X22)をどのパーソナル・コンピュ
ータに依頼したかを依頼処理管理表に記録する。ある時
点での、依頼処理管理表の一例を図13に示す。Suppose that the load distribution means of the central processing unit of the personal computer 1_3 performs partial processing (Q1, X
22) to personal computer 1_2. Partial processing (Q1, X2
2) is requested by the request processing means of the central processing unit of the personal computer P1_2 designated via Ethernet by the request distribution means of the central processing unit of the personal computer P1_3. At the same time, personal
The request processing management means of the central processing unit of the computer P1_3 records in the request processing management table which personal computer requested the partial processing (Q1, X22). FIG. 13 shows an example of the request processing management table at a certain time.

【００８８】部分処理(Q1,X22)を依頼されたパーソナル
・コンピュータP1_2は、中央演算装置が有する要求処理
手段によって検索対象の分割インデックスX22を検索し
て、検索条件Q1を満たす文書群の一覧を得る。これを部
分結果とし、Ethernetを介して依頼元であるパーソナル
・コンピュータP1_3の中央演算装置が有する部分結果統
合手段に送信する。また、部分処理(Q1,X22)を実行する
のに要した処理時間を、依頼元であるパーソナル・コン
ピュータP1_3の中央演算装置が有する負荷監視手段に送
信する。同時にパーソナル・コンピュータP1_3の中央演
算装置が有する依頼処理管理手段は、図13の依頼処理管
理表から、部分処理(Q1, X22)を削除する。The personal computer P1_2 requested for the partial processing (Q1, X22) searches the search target divided index X22 by the request processing means of the central processing unit, and obtains a list of the document group satisfying the search condition Q1. obtain. This is used as a partial result and transmitted to the partial result integrating means of the central processing unit of the personal computer P1_3, which is the request source, via Ethernet. Further, the processing time required to execute the partial processing (Q1, X22) is transmitted to the load monitoring means of the central processing unit of the personal computer P1_3, which is the request source. At the same time, the request processing management means of the central processing unit of the personal computer P1_3 deletes the partial processing (Q1, X22) from the request processing management table of FIG.

【００８９】一方、パーソナル・コンピュータP1_3の中
央演算装置が有する部分結果統合手段は、検索条件Q1に
ついて分割インデックスX11〜X32を対象としたすべての
部分処理の結果がそろった時点で、結果を統合し、キー
ワードの出現頻度などを基準にして文書の順序を並べ替
え、利用者に応答する。On the other hand, the partial result integrating means of the central processing unit of the personal computer P1_3 integrates the results when all the partial processing results for the search indexes Q11 to X32 have been completed for the search condition Q1. , The order of the documents is rearranged based on the frequency of appearance of the keywords, and the response is made to the user.

【００９０】ここで、パーソナル・コンピュータP1_3の
中央演算装置が有する分配比率計算手段が、負荷を均一
化させるために部分処理の分配比率を計算する方法につ
いて、具体例を示して説明する。Here, a method of calculating the distribution ratio of the partial processing in order to equalize the load by the distribution ratio calculating means of the central processing unit of the personal computer P1_3 will be described with a specific example.

【００９１】ある一定期間において、パーソナル・コン
ピュータP1_3の中央演算装置が有する負荷監視手段が測
定した、各分割インデックスに対する平均処理時間が図
14のようになったとする。また、この時点での分配比率
が図12(a)に示す値だった場合、パーソナル・コンピュ
ータP1_1〜P1_3の平均処理時間は図15のようにして求め
ることができる。例えば、パーソナル・コンピュータP1
_1の平均処理時間E(P1_1)は、 E(P1_1)=T(X11)ラd(X11,P1_1)+T(X12)ラd(X12,P1_1)+T(X2
2)ラd(X22,P1_1)+T(X31)ラd(X31,P1_1) = 100×0.4＋200×0.8＋100×0.5＋400×0.5 = 450 として計算している。During a certain period, the average processing time for each divided index measured by the load monitoring means of the central processing unit of the personal computer P1_3 is shown in FIG.
Let's say it looks like 14. If the distribution ratio at this time has the value shown in FIG. 12A, the average processing time of the personal computers P1_1 to P1_3 can be obtained as shown in FIG. For example, the personal computer P1
_1, the average processing time E (P1_1) is E (P1_1) = T (X11) × d (X11, P1_1) + T (X12) × d (X12, P1_1) + T (X2
2) La (d22, P1_1) + T (X31) Lad (X31, P1_1) = 100 × 0.4 + 200 × 0.8 + 100 × 0.5 + 400 × 0.5 = 450

【００９２】図15を見ると、パーソナル・コンピュータ
1_3の平均処理時間が最も長いことがわかる。また、図1
1に示す分割データ管理表を参照すると、パーソナル・
コンピュータP1_3と同一の分割インデックスを所有し、
最も平均処理時間が短いのはパーソナル・コンピュータ
P1_2であることがわかる。Referring to FIG. 15, a personal computer
It can be seen that the average processing time of 1_3 is the longest. Figure 1
Referring to the split data management table shown in Fig. 1,
Owns the same split index as computer P1_3,
Personal computers have the shortest average processing time
It turns out that it is P1_2.

【００９３】そこで、パーソナル・コンピュータP1_3と
P1_2が共通して所有している分割インデックスX21につ
いて、パーソナル・コンピュータP1_3に分配する比率を
下げ、パーソナル・コンピュータP1_2に分配する比率を
上げる。ここでは、新たにP1_2:P1_3 = 50:50にしたと
する。Therefore, the personal computers P1_3 and
For the divided index X21 commonly owned by P1_2, the ratio of distribution to the personal computer P1_3 is reduced, and the ratio of distribution to the personal computer P1_2 is increased. Here, it is assumed that P1_2: P1_3 = 50: 50 is newly set.

【００９４】新しい分配比率を用いて、各パーソナル・
コンピュータP1_1〜P1_3の平均処理時間を再計算する
と、それぞれ450, 350, 400となる。今度はパーソナル
・コンピュータP1_1の平均処理時間が最大であり、平均
処理時間が最小のパーソナル・コンピュータP1_2と、分
割インデックスX14を共通に所有している。[0094] Using the new distribution ratio, each personal
When the average processing time of the computers P1_1 to P1_3 is recalculated, they are 450, 350, and 400, respectively. This time, the average processing time of the personal computer P1_1 is the longest, and the personal computer P1_2 having the minimum average processing time and the division index X14 are commonly owned.

【００９５】そこで、新たに分割インデックスX14に対
し、パーソナル・コンピュータP1_1への分配比率を下
げ、パーソナル・コンピュータP1_2への分配比率を上げ
る。例えば、新たにP1_2:P1_1 = 100:0とすると、各パ
ーソナル・コンピュータ1_1〜1_3の平均処理時間はすべ
て400になり、以後の平均処理時間を均一化できる。Therefore, the distribution ratio of the divided index X14 to the personal computer P1_1 is lowered and the distribution ratio to the personal computer P1_2 is newly raised. For example, when P1_2: P1_1 = 100: 0 is newly set, the average processing time of each of the personal computers 1_1 to 1_3 becomes 400, and the average processing time thereafter can be made uniform.

【００９６】なお、ここではヒューリスティックに分配
比率を変化させる方法を説明したが、分配比率を変化さ
せる単位を決めておき、単位ごとに徐々に分配比率を変
化させ、再計算を繰り返しながら自動的に収束させる方
法や、負荷が均一化された時の平均処理時間を目標値と
して、最大処理時間が目標値に一致するように分配比率
を変化させることを繰り返して自動的に分配比率を計算
するなどの方法もある。Here, the method of heuristically changing the distribution ratio has been described. However, a unit for changing the distribution ratio is determined, and the distribution ratio is gradually changed for each unit. Automatically calculate the distribution ratio by repeatedly changing the distribution ratio so that the maximum processing time matches the target value with the average processing time when the load is equalized as the target value, etc. There is also a method.

【００９７】次に、一部のパーソナル・コンピュータが
停止した場合の動作について、具体例を用いて説明す
る。Next, the operation when some personal computers are stopped will be described with reference to a specific example.

【００９８】通常、パーソナル・コンピュータP1_1〜P1
_3の中央演算装置が含んでいる障害検出手段は、お互い
に定期的に微少データをやりとりし、相手が正常に動作
しているか確認している。ある時点で、パーソナル・コ
ンピュータP1_1に予期しない障害が発生し停止してしま
うと、パーソナル・コンピュータP1_1からの返答がなく
なるため、パーソナル・コンピュータP1_2、P1_3の中央
演算装置が有する障害検出手段は、障害を検出できる。
この時、パーソナル・コンピュータP1_2、P1_3の両方と
も、障害時の割り込み処理を行うが、ここでは、パーソ
ナル・コンピュータP1_3に注目して説明する。Normally, personal computers P1_1 to P1
The fault detection means included in the central processing unit of # 3 periodically exchanges minute data with each other to check whether the other party is operating normally. At some point, if the personal computer P1_1 is stopped due to an unexpected failure, the response from the personal computer P1_1 is lost, so the failure detection means of the central processing unit of the personal computers P1_2, P1_3 is Can be detected.
At this time, both the personal computers P1_2 and P1_3 perform interrupt processing at the time of failure, but the following description focuses on the personal computer P1_3.

【００９９】パーソナル・コンピュータP1_3の中央演算
装置が有する障害検出手段は、パーソナル・コンピュー
タP1_1が停止したと判断すると、パーソナル・コンピュ
ータP1_3の中央演算装置が有する分配比率計算手段に知
らせる。分配比率計算手段は、パーソナル・コンピュー
タP1_1への部分処理の分配比率を0にする。すると、分
配比率は図16のようになる。When the failure detecting means of the central processing unit of the personal computer P1_3 determines that the personal computer P1_1 has stopped, it notifies the distribution ratio calculating means of the central processing unit of the personal computer P1_3. The distribution ratio calculation means sets the distribution ratio of the partial processing to the personal computer P1_1 to zero. Then, the distribution ratio becomes as shown in FIG.

【０１００】このままでは、パーソナル・コンピュータ
P1_2とパーソナル・コンピュータP1_3の処理時間に偏り
ができてしまうため、分配比率計算手段は分配比率の再
計算を行う。再計算の方法は、パーソナル・コンピュー
タ1_1への部分処理の分配比率を0に固定して行うことの
他は、前述の正常時の計算方法と同じである。As it is, a personal computer
Since the processing time between P1_2 and the personal computer P1_3 is biased, the distribution ratio calculating means recalculates the distribution ratio. The recalculation method is the same as the normal calculation method described above, except that the distribution ratio of the partial processing to the personal computer 1_1 is fixed to 0.

【０１０１】各分割インデックスの平均処理時間が図14
のようになっている場合、図16の分配比率では、パーソ
ナル・コンピュータP1_2, P1_3の平均処理時間E(P1_2),
E(P1_3)は、それぞれ以下のように計算できる。 E(P1_2)= T(X21)ラd(X21,P1_2)+T(X22)ラd(X22,P1_2)+T(X
11)ラd(X11,P1_2)+T(X32)ラd(X32,P1_2) = 300×0.5＋100×1＋100×1＋100×0.1 = 360 E(P1_3)= T(X31)ラd(X31,P1_2)+T(X32)ラd(X32,P1_2)+T(X
12)ラd(X12,P1_2)+T(X21)ラd(X21,P1_2) = 400×1＋100×0.9＋200×1＋300×0.5 = 840The average processing time of each divided index is shown in FIG.
In the distribution ratio of FIG. 16, the average processing time E (P1_2) of the personal computers P1_2 and P1_3 is
E (P1_3) can be calculated as follows. E (P1_2) = T (X21) La d (X21, P1_2) + T (X22) La d (X22, P1_2) + T (X
11) LA d (X11, P1_2) + T (X32) LA d (X32, P1_2) = 300 × 0.5 + 100 × 1 + 100 × 1 + 100 × 0.1 = 360 E (P1_3) = T (X31) LA d (X31, P1_2) + T (X32) d (X32, P1_2) + T (X
12) LA d (X12, P1_2) + T (X21) LA d (X21, P1_2) = 400 × 1 + 100 × 0.9 + 200 × 1 + 300 × 0.5 = 840

【０１０２】そこで、パーソナル・コンピュータP1_2、
P1_3が共通に所有している分割インデックスX21につい
て、パーソナル・コンピュータP1_2に分配する比率を上
げてパーソナル・コンピュータP1_3に分配する比率を下
げる。新たにP1_2:P1_3 = 100:0にすると、E(P1_2) = 5
10、E(P1_3) = 690となる。これでもまだ偏りは残って
いるが、さらにパーソナル・コンピュータP1_2、P1_3が
共通に所有している分割インデックスX32について、分
配比率を新たにP1_3:P1_2 = 0:100に変更すると、E(P1_
2) = E(P1_3) = 600となり、障害発生時の肩代わり負荷
も、均一化できる。Therefore, the personal computer P1_2,
For the divided index X21 commonly owned by P1_3, the ratio of distribution to the personal computer P1_2 is increased and the ratio of distribution to the personal computer P1_3 is decreased. When P1_2: P1_3 = 100: 0 newly, E (P1_2) = 5
10, E (P1_3) = 690. Although the bias still remains, even if the distribution ratio is newly changed to P1_3: P1_2 = 0: 100 for the split index X32 commonly owned by the personal computers P1_2 and P1_3, E (P1_
2) = E (P1_3) = 600, and the load on the shoulder when a failure occurs can be equalized.

【０１０３】次に、パーソナル・コンピュータP1_3の中
央演算装置は、パーソナル・コンピュータP1_1に依頼
後、結果が未返答の部分処理を、依頼処理管理手段が作
成した依頼処理管理表を参照して負荷分散手段に渡し、
新しい分配比率に従って再分配を行う。Next, after requesting the personal computer P1_1, the central processing unit of the personal computer P1_3 distributes the partial process for which the result has not been returned with reference to the request processing management table created by the request processing management means. Hand over the means,
Redistribute according to the new distribution ratio.

【０１０４】次に第2の実施例を、図面を参照して説明
する。かかる実施例は、第2の実施の形態に対応するも
のである。Next, a second embodiment will be described with reference to the drawings. This example corresponds to the second embodiment.

【０１０５】本実施例は、第1の実施例と構成を同じと
するが、パーソナル・コンピュータP1_1〜P1_3が、分配
比率計算手段の起動のタイミングを決定する起動時間制
御手段としても機能する点で第1の実施例と異なる。The present embodiment has the same configuration as that of the first embodiment, except that the personal computers P1_1 to P1_3 also function as start-up time control means for determining the start timing of the distribution ratio calculation means. This is different from the first embodiment.

【０１０６】分割データの格納方法、正常時の動作、分
配比率の計算方法および一部のパーソナル・コンピュー
タが停止した状態の動作は、第1の実施例と同一である
ため、ここでは省略する。The method of storing the divided data, the operation at the time of normal operation, the method of calculating the distribution ratio, and the operation in a state where some personal computers are stopped are the same as those in the first embodiment, and therefore will not be described here.

【０１０７】パーソナル・コンピュータP1_1の中央演算
装置が含む起動時間制御手段が、分配比率計算手段を起
動させるタイミングを決定する方法について説明する。
ここでは、未返答の部分処理の偏りVを、最大値Umaxと
最小値Uminの比Umax/Uminと定義し、分配比率計算手段
を起動させる条件として、偏りVの閾値を3とする。A method of determining the timing at which the start-up time control means included in the central processing unit of the personal computer P1_1 starts the distribution ratio calculation means will be described.
Here, the bias V of the unprocessed partial process is defined as the ratio Umax / Umin of the maximum value Umax and the minimum value Umin, and the threshold value of the bias V is set to 3 as a condition for activating the distribution ratio calculation means.

【０１０８】ある時点において、パーソナル・コンピュ
ータP1_1の中央演算装置が含む計算時間制御手段が、各
計算機へ依頼後未返答の部分処理の数を計数したとこ
ろ、図19(a)のようになったとする。この時、未返答の
部分処理の数の最大値Umaxが7、最小値Uminが4であるの
で、偏りV=Umax/Umin=1.75である。これは、閾値3より
も小さいため、負荷は均一化されているものと判断し、
分配比率計算手段は起動せずに続行する。At a certain point in time, the calculation time control means included in the central processing unit of the personal computer P1_1 counts the number of unprocessed partial processes after requesting each computer, and as shown in FIG. 19 (a). I do. At this time, since the maximum value Umax of the number of unanswered partial processes is 7 and the minimum value Umin is 4, the bias V = Umax / Umin = 1.75. Since this is smaller than the threshold value 3, it is determined that the load is equalized,
The distribution ratio calculation means continues without being activated.

【０１０９】次に、別の時点で、パーソナル・コンピュ
ータP1_1の中央演算装置が含む計算時間制御手段が、各
計算機へ依頼後未返答の部分処理の数を計数したとこ
ろ、図19(b)のようになったとする。この時、未返答の
部分処理の数の最大値Umaxが10、最小値Uminが2である
ので、偏りV=Umax/Umin=5である。これは、閾値3よりも
大きいため、各分割データに対する平均処理時間の変化
などから、負荷の均衡が崩れてしまったと判断し、パー
ソナル・コンピュータP1_1の中央演算装置が含む分配比
率計算手段を起動し、分配比率を再計算して補正する。Next, at another time, the calculation time control means included in the central processing unit of the personal computer P1_1 counts the number of unprocessed partial processes after requesting each computer. Let's say that. At this time, since the maximum value Umax of the number of unanswered partial processes is 10 and the minimum value Umin is 2, the bias V = Umax / Umin = 5. Since this is larger than the threshold value 3, it is determined that the load balance has been lost due to a change in the average processing time for each of the divided data, and the distribution ratio calculation means included in the central processing unit of the personal computer P1_1 is started. , The distribution ratio is recalculated and corrected.

【０１１０】[0110]

【発明の効果】第1の効果は、二次記憶装置からの入出
力処理を反映した負荷の均一化が可能なことである。そ
の理由は、二次記憶装置からの入出力処理を含む部分処
理の平均処理時間を用いて、処理時間を均一化させる分
配比率を求めているからである。The first effect is that it is possible to equalize the load reflecting the input / output processing from the secondary storage device. The reason is that the distribution ratio for making the processing time uniform is obtained by using the average processing time of the partial processing including the input / output processing from the secondary storage device.

【０１１１】第2の効果は、各計算機で読み出すデータ
量や処理量が異なる処理であっても、各計算機の処理時
間を均一化し、システム全体のスループットを最大化で
きることである。その理由は、異なる計算機間で複数の
同一な分割データを分散および冗長格納しておき、平均
処理時間の長い計算機への部分処理の依頼回数を減ら
し、平均処理時間の短い計算機への部分処理の依頼回数
を増やすことによって処理を肩代わりさせ、各計算機の
全体の処理時間を均一化させるためである。The second effect is that, even if the amount of data read and the amount of processing read by each computer are different, the processing time of each computer can be made uniform and the throughput of the entire system can be maximized. The reason is that a plurality of identical divided data are distributed and redundantly stored between different computers, the number of requests for partial processing to a computer with a long average processing time is reduced, and the partial processing to a computer with a short average processing time is reduced. By increasing the number of requests, the processing can be replaced, and the overall processing time of each computer is made uniform.

【０１１２】第3の効果は、各データに対するアクセス
頻度の分布が変化しても、柔軟に各計算機の全体の処理
時間を均一化できることである。その理由は、定期的あ
るいは各計算機の負荷に偏りが生じた時点で、負荷を均
一化する分配比率を再計算するためである。A third effect is that the entire processing time of each computer can be flexibly equalized even if the distribution of the access frequency to each data changes. The reason is that the distribution ratio for equalizing the load is recalculated periodically or when the load of each computer becomes uneven.

【０１１３】第4の効果は、障害や保守などの理由で一
部の計算機が停止した状態でもサービスの継続が可能で
あり、しかも大幅な性能低下を防ぐことである。その理
由は、障害発生時において停止した計算機を除く構成で
分配比率の再計算を行い、停止した計算機から未返答の
部分処理を他の計算機に再分配するためである。The fourth effect is that the service can be continued even when some computers are stopped due to a failure or maintenance, and a significant performance drop is prevented. The reason is that the distribution ratio is recalculated in a configuration other than the computer stopped when a failure occurs, and the unprocessed partial process is redistributed from the stopped computer to another computer.

[Brief description of the drawings]

【図１】本発明の第一の実施の形態の構成概要図FIG. 1 is a schematic configuration diagram of a first embodiment of the present invention.

【図２】分割データの作成方法と分散および冗長格納方
法を示す概要図FIG. 2 is a schematic diagram showing a method of creating divided data and a method of distributing and storing redundant data.

【図３】分割データ管理手段が作成する分割データ管理
表の例FIG. 3 is an example of a divided data management table created by a divided data management unit.

【図４】依頼処理管理手段が作成する依頼処理管理表の
例FIG. 4 is an example of a request processing management table created by a request processing management unit.

【図５】分配比率計算手段が計算した分配比率の例FIG. 5 is an example of a distribution ratio calculated by a distribution ratio calculation unit.

【図６】本発明の第一の実施の形態の動作を示すフロー
チャートFIG. 6 is a flowchart showing the operation of the first embodiment of the present invention.

【図７】分配比率計算手段が分配比率を計算する手順を
示すフローチャートFIG. 7 is a flowchart showing a procedure in which a distribution ratio calculation unit calculates a distribution ratio.

【図８】本発明の第一の実施の形態において、障害が発
生時の動作を示すフローチャートFIG. 8 is a flowchart showing an operation when a failure occurs in the first embodiment of the present invention;

【図９】本発明の第一の実施例の構成概要図FIG. 9 is a schematic configuration diagram of the first embodiment of the present invention.

【図１０】本発明の第一の実施例における分割データの
格納方法を示す概要図FIG. 10 is a schematic diagram showing a method for storing divided data according to the first embodiment of the present invention;

【図１１】本発明の第一の実施例における分割データ管
理表FIG. 11 is a divided data management table according to the first embodiment of the present invention.

【図１２】本発明の第一の実施例における分配比率と、
分配方法の例FIG. 12 shows a distribution ratio according to the first embodiment of the present invention;
Examples of distribution methods

【図１３】本発明の第一の実施例における依頼処理管理
表FIG. 13 is a request processing management table according to the first embodiment of the present invention.

【図１４】本発明の第一の実施例において、負荷監視手
段が測定した各分割インデックスの処理時間FIG. 14 shows the processing time of each divided index measured by the load monitoring means in the first embodiment of the present invention.

【図１５】本発明の第一の実施例における各計算機の処
理時間の計算方法の説明図FIG. 15 is an explanatory diagram of a calculation method of a processing time of each computer in the first embodiment of the present invention.

【図１６】本発明の第一の実施例における障害発生時の
分配比率再計算を説明する図FIG. 16 is a diagram for explaining recalculation of a distribution ratio when a failure occurs in the first embodiment of the present invention.

【図１７】本発明の第二の実施の形態の構成概要図FIG. 17 is a schematic configuration diagram of a second embodiment of the present invention.

【図１８】本発明の第二の実施の形態の動作を示すフロ
ーチャートFIG. 18 is a flowchart showing the operation of the second embodiment of the present invention.

【図１９】本発明の第二の実施例における依頼処理管理
表FIG. 19 is a request processing management table according to the second embodiment of the present invention.

【図２０】本発明の第三の実施の形態の構成概要図FIG. 20 is a schematic configuration diagram of a third embodiment of the present invention.

[Explanation of symbols]

1_1〜1_n 計算機 2_1〜2_n 二次記憶装置 3 外部通信網 4 記録媒体 11 受付処理部 12 負荷管理部 13 データ処理部 111 要求受付手段 112 求分配手段 113 分割データ管理手段 114 部分結果統合手段 115 障害検出手段 116 依頼処理管理手段 121 負荷監視手段 122 分配比率計算手段 123 負荷分散手段 124 計算時間制御手段 131 要求処理手段 D11〜Dnm 分割データ X11〜X32 分割インデックス 1_1 ~ 1_n Computer 2_1 ~ 2_n Secondary storage device 3 External communication network 4 Recording medium 11 Reception processing unit 12 Load management unit 13 Data processing unit 111 Request reception unit 112 Request distribution unit 113 Divided data management unit 114 Partial result integration unit 115 Failure Detection means 116 Request processing management means 121 Load monitoring means 122 Distribution ratio calculation means 123 Load distribution means 124 Calculation time control means 131 Request processing means D11-Dnm Divided data X11-X32 Divided index

Claims

[Claims]

A divided data obtained by dividing data to be searched is distributed and stored in a secondary storage device provided in a plurality of computers, and a duplicate of the divided data is created so as not to be duplicated. A distributed parallel data processing apparatus, wherein a computer that performs partial processing on each divided data is determined in consideration of one or both of a load state of a computer and a failure state of the computer. .

2. A distributed parallel data processing apparatus for performing a search process on a secondary storage device in which divided data obtained by dividing data to be searched is stored in a distributed manner, wherein a processing time for the divided data is measured. A distribution ratio capable of equalizing the load of the computer that manages the secondary storage device is determined based on the distribution ratio, and a computer that performs partial processing on each divided data is determined based on the distribution ratio thus determined. A distributed / parallel data processing apparatus for transmitting the partial processing request to a corresponding computer.

3. A distributed parallel data processing device for performing a search process on a secondary storage device in which divided data obtained by dividing data to be searched is stored in a distributed manner, wherein a processing request from a user is received. Request accepting means for creating a plurality of partial processes specifying designated divided data, request distribution means for transmitting the partial processes to designated computers, divided data managing means for holding storage locations of the respective divided data, Collects, integrates,
A partial result integrating means for editing and responding to the user; a load monitoring means for measuring a processing time for each divided data; a load on each computer in advance based on the processing time for each divided data measured by the load monitoring means. Distribution ratio calculation means for obtaining a distribution ratio capable of equalizing, load distribution means for determining a computer for distributing a partial process for each divided data based on the calculated distribution ratio and designating the request distribution means, Request processing means for performing a partial process on the divided data stored in the secondary storage device.

4. A distributed parallel data processing device for performing a search process on a secondary storage device in which divided data obtained by dividing data to be searched is stored in a distributed manner. Request accepting means for creating a plurality of partial processes specifying designated divided data, request distribution means for transmitting the partial processes to designated computers, divided data managing means for holding storage locations of the respective divided data, Collects, integrates,
The partial result integration means that edits and responds to the user, the failure detection means that automatically detects a stopped computer, and the partial processing that has been requested to other computers are retained. Request processing management means for redistributing the partial processing of the reply, load monitoring means for measuring the processing time for each divided data, and stopping when a failure occurs, based on the processing time for each divided data measured by the load monitoring means. A distribution ratio calculating means for performing recalculation with the distribution ratio to the computer set to 0, and a computer for distributing the partial processing for each divided data based on the distribution ratio calculated in advance, and designating the request distribution means A distributed parallel data processing device comprising: a load distribution unit; and a request processing unit that performs a partial process on the divided data stored in the secondary storage device. Place.

5. A distributed parallel data processing device for performing a search process on a secondary storage device in which divided data obtained by dividing data to be searched is stored in a distributed manner, wherein a processing request from a user is received. Request accepting means for creating a plurality of partial processes specifying designated divided data, request distribution means for transmitting the partial processes to designated computers, divided data managing means for holding storage locations of the respective divided data, Collects, integrates,
A partial result integrating means for editing and responding to the user; a load monitoring means for measuring a processing time for each divided data; a load on each computer in advance based on the processing time for each divided data measured by the load monitoring means. Distribution ratio calculation means for obtaining a distribution ratio capable of equalizing, load distribution means for determining a computer for distributing a partial process for each divided data based on a distribution ratio calculated in advance, and designating the computer as a request distribution means, When the deviation in the number of unprocessed partial processes after requesting each computer increases, a calculation time control unit that activates the distribution ratio calculation unit, and a request to perform a partial process on the divided data stored in the secondary storage device And a processing means.

6. A distributed parallel data processing device for performing a search process on a secondary storage device in which divided data obtained by dividing data to be searched is stored in a distributed manner, wherein a processing request from a user is received. Request accepting means for creating a plurality of partial processes specifying designated divided data, request distribution means for transmitting the partial processes to designated computers, divided data managing means for holding storage locations of the respective divided data, Collects, integrates,
The partial result integration means that edits and responds to the user, the failure detection means that automatically detects a stopped computer, and the partial processing that has been requested to other computers are retained. Request processing management means for redistributing the partial processing of the reply, load monitoring means for measuring the processing time for each divided data, and stopping when a failure occurs, based on the processing time for each divided data measured by the load monitoring means. A distribution ratio calculation unit that performs recalculation with the distribution ratio to the computer set to 0, and a computer that determines the computer that distributes the partial processing for each divided data based on the distribution ratio calculated in advance and specifies the load to the request distribution unit Distribution means, calculation time control means for activating the distribution ratio calculation means when the deviation of the number of partial processes not responded to each computer after the request becomes large, and the secondary storage device Distributed Parallel data processing apparatus characterized by comprising: a request processing means for performing partial processing on the pay has been divided data.

7. A recording medium which records a distributed parallel data processing program for performing a search process using a computer on a secondary storage device in which divided data obtained by dividing data to be searched is distributed and stored. A request receiving function for receiving a processing request from a user to a computer and creating a plurality of partial processes specifying a target divided data; a request distribution function for transmitting a partial process to a designated computer; A divided data management function for holding a data storage destination; collecting and integrating results of partial processing on the divided data;
A partial result integration function for editing and responding to the user, a load monitoring function for measuring a processing time for each divided data, and a processing time for each divided data measured by the load monitoring function for each computer in advance. A distribution ratio calculation function for obtaining a distribution ratio capable of equalizing the load, and a load distribution function for determining a computer for distributing a partial process for each divided data based on the calculated distribution ratio and specifying the computer for the request distribution function. And a request processing function of performing a partial process on the divided data stored in the secondary storage device, wherein the recording medium stores a distributed parallel data processing program.

8. A recording medium recording a distributed parallel data processing program for performing a search process using a computer on a secondary storage device in which divided data obtained by dividing data to be searched is distributed and stored. A request receiving function for receiving a processing request from a user to a computer and creating a plurality of partial processes specifying a target divided data; a request distribution function for transmitting a partial process to a designated computer; A divided data management function for holding a data storage destination; collecting and integrating results of partial processing on the divided data;
The partial result integration function that edits and responds to the user, the failure detection function that automatically detects a stopped computer, and the partial processing that has been requested to other computers are retained and the A request processing management function for redistributing the partial processing of the response, a load monitoring function for measuring the processing time for each divided data, and a stop when a failure occurs based on the processing time for each divided data measured by the load monitoring function. A distribution ratio calculation function that performs recalculation by setting the distribution ratio to the calculated computer to 0, and a computer that distributes partial processing for each divided data based on the distribution ratio calculated in advance, and designates it to the request distribution function And a load processing function for performing a partial processing on the divided data stored in the secondary storage device. A recording medium on which a type data processing program is recorded.

9. A recording medium storing a distributed parallel data processing program for performing a search process using a computer on a secondary storage device in which divided data obtained by dividing data to be searched is distributed and stored. A request receiving function for receiving a processing request from a user to a computer and creating a plurality of partial processes specifying a target divided data; a request distribution function for transmitting a partial process to a designated computer; A divided data management function for holding a data storage destination; collecting and integrating results of partial processing on the divided data;
A partial result integration function for editing and responding to the user, a load monitoring function for measuring a processing time for each divided data, and a processing time for each divided data measured by the load monitoring function for each computer in advance. A distribution ratio calculation function that calculates the distribution ratio that can equalize the load, and a load distribution function that determines the computer that distributes the partial processing for each divided data based on the distribution ratio calculated in advance and specifies it as the request distribution function When the deviation of the number of unprocessed partial processes after requesting each computer increases, a calculation time control function for activating a distribution ratio calculation function and a partial process for the divided data stored in the secondary storage device are performed. And a request processing function, wherein the recording medium records a distributed parallel data processing program.

10. A recording medium recording a distributed parallel data processing program for performing a search process using a computer on a secondary storage device in which divided data obtained by dividing data to be searched is distributed and stored. A request receiving function for receiving a processing request from a user to a computer and creating a plurality of partial processes specifying a target divided data; a request distribution function for transmitting a partial process to a designated computer; A divided data management function for holding a data storage destination; collecting and integrating results of partial processing on the divided data;
The partial result integration function that edits and responds to the user, the failure detection function that automatically detects a stopped computer, and the partial processing that has been requested to other computers are retained and the A request processing management function for redistributing the partial processing of the response, a load monitoring function for measuring the processing time for each divided data, and a stop when a failure occurs based on the processing time for each divided data measured by the load monitoring function. A distribution ratio calculation function that performs recalculation with the distribution ratio to the calculated computer set to 0, and a computer that distributes partial processing for each divided data based on the distribution ratio calculated in advance and specifies it to the request distribution means A load distribution function, a calculation time control function for activating the distribution ratio calculation means when the deviation of the number of partial processes that have not been replied to each computer after being requested increases, Distributed Parallel data processing program recorded recording medium to a request processing function, characterized in that to realize the performing partial processing on has been divided data stored in the device.

11. A computer comprising a plurality of computers connected by an external communication network, each of which has a local secondary storage device, divides large-scale data to be searched and stores the divided data in the secondary storage device in a distributed manner. A distributed parallel data system for performing processing on the divided data stored in the secondary storage device in parallel on a non-shared type cluster system in which a copy of the divided data is redundantly stored so as not to be duplicated in the same secondary storage device A request receiving means for receiving a processing request from a user and creating a plurality of partial processes specifying a target divided data; a request distribution means for transmitting the partial process to a designated computer; A divided data management unit that holds a storage location of each divided data, and collects, integrates, and collects results of partial processing on the divided data.
A partial result integrating means for editing and responding to the user; a load monitoring means for measuring a processing time for each divided data; and a processing time for each computer in advance based on the processing time for each divided data measured by the load monitoring means. Distribution ratio calculation means for obtaining a distribution ratio capable of equalizing the load; load distribution means for determining a computer for distributing a partial process for each divided data based on a distribution ratio calculated in advance, and designating the computer as a request distribution means; Request processing means for performing a partial process on the divided data stored in the secondary storage device.

12. A computer comprising a plurality of computers connected by an external communication network, each computer having a local secondary storage device, dividing large-scale data of interest and storing the divided data in the secondary storage device in a distributed manner. A distributed parallel data processing system that performs processing on divided data stored in a secondary storage device in parallel on a non-shared type cluster system in which copies of the divided data are redundantly stored so as not to be duplicated in the same secondary storage device Request receiving means for receiving a processing request from a user and creating a plurality of partial processes specifying target divided data; request distributing means for transmitting the partial process to a designated computer; Division data management means for holding a storage location of data; collecting and integrating results of partial processing on the division data;
The partial result integration means that edits and responds to the user, the failure detection means that automatically detects a stopped computer, and the partial processing that has been requested to other computers are retained. Request processing management means for redistributing the partial processing of the reply, load monitoring means for measuring the processing time for each divided data, and stopping when a failure occurs based on the processing time for each divided data measured by the load monitoring means. Distribution ratio calculating means for performing recalculation with the distribution ratio to the calculated computer set to 0, and a computer for distributing the partial processing for each divided data based on the distribution ratio calculated in advance, and designating the request distribution means A distributed parallel data processing system comprising: a load distribution unit; and a request processing unit for performing a partial process on the divided data stored in the secondary storage device. Stem.

13. A computer comprising a plurality of computers connected by an external communication network, each of which has a local secondary storage device, divides large-scale data of interest and stores the divided data in the secondary storage device in a distributed manner. A distributed parallel data processing system that performs processing on divided data stored in a secondary storage device in parallel on a non-shared type cluster system in which copies of the divided data are redundantly stored so as not to be duplicated in the same secondary storage device Request receiving means for receiving a processing request from a user and creating a plurality of partial processes specifying target divided data; request distributing means for transmitting the partial process to a designated computer; Division data management means for holding a storage location of data; collecting and integrating results of partial processing on the division data;
A partial result integrating means for editing and responding to the user; a load monitoring means for measuring a processing time for each divided data; and a processing time for each computer based on the processing time for each divided data measured by the load monitoring means. Distribution ratio calculation means for obtaining a distribution ratio capable of equalizing the load; load distribution means for determining a computer for distributing a partial process for each divided data based on a distribution ratio calculated in advance, and designating the computer as a request distribution means; When the deviation in the number of unprocessed partial processes after requesting each computer increases, a calculation time control unit that activates the distribution ratio calculation unit and a request to perform a partial process on the divided data stored in the secondary storage device A distributed parallel data processing system comprising: processing means.

14. A computer comprising a plurality of computers connected by an external communication network, each of which has a local secondary storage device, divides large-scale data of interest, and stores the divided data in the secondary storage device in a distributed manner. A distributed parallel data processing system that performs processing on divided data stored in a secondary storage device in parallel on a non-shared type cluster system in which copies of the divided data are redundantly stored so as not to be duplicated in the same secondary storage device Request receiving means for receiving a processing request from a user and creating a plurality of partial processes specifying target divided data; request distributing means for transmitting the partial process to a designated computer; Division data management means for holding a storage location of data; collecting and integrating results of partial processing on the division data;
The partial result integration means that edits and responds to the user, the failure detection means that automatically detects a stopped computer, and the partial processing that has been requested to other computers are retained. Request processing management means for redistributing partial processing of a reply, load monitoring means for measuring the processing time for each divided data, and stopping when a failure occurs based on the processing time for each divided data measured by the load monitoring means Distribution ratio calculating means for performing recalculation with the distribution ratio to the calculated computer set to 0, and a computer for distributing the partial processing for each divided data based on the distribution ratio calculated in advance, and designating the request distribution means Load distribution means, calculation time control means for activating the distribution ratio calculation means when the number of unprocessed partial processes after requesting each computer becomes large, and storage in the secondary storage device. Distributed Parallel data processing system characterized in that it and a request processing means for performing partial processing on the divided data.