JP4490068B2

JP4490068B2 - Data storage system using network

Info

Publication number: JP4490068B2
Application number: JP2003329942A
Authority: JP
Inventors: 義博矢野
Original assignee: Dai Nippon Printing Co Ltd
Current assignee: Dai Nippon Printing Co Ltd
Priority date: 2003-09-22
Filing date: 2003-09-22
Publication date: 2010-06-23
Anticipated expiration: 2023-09-22
Also published as: JP2005099911A

Description

本発明は、ネットワークを利用したデータ保管システムに関する。特に、本発明は、バックアップの目的で、貴重なデジタルデータの複製を、ネットワーク上の複数の箇所に分散して保管することができるデータ保管システムに関する。 The present invention relates to a data storage system using a network. In particular, the present invention relates to a data storage system capable of distributing and storing valuable digital data copies in a plurality of locations on a network for the purpose of backup.

データを作成したコンピュータ等から、ネットワークで接続されている他のファイルサーバ等に、作成したデータのバックアップデータを転送し、貴重なデータの保管を行うことは広く行われている。このネットワークを、たとえば、インターネットのような世界的規模の広域ネットワークにまで拡張すれば、インターネットにアクセスすることができる環境さえあれば、世界中のどこからでもデータの保管を行うことができ、保管されていたデータを世界中のどこからでも取り出すことができる。しかしながら、利用するネットワークの規模を拡大すればするほど、利用者の利便性は向上するが、逆にセキュリティは低下することになる。保管対象となるデータは、通常、個人個人のプライベートな情報を含んでおり、データを預けた本人あるいは本人の委託を受けた代理人だけが取り出せるよう、十分なセキュリティを確保しておく必要がある。このように、ネットワークを利用した従来のデータアーカイブシステムには、どこからでもデータの預け入れや取り出しができるように利便性を向上させると、セキュリティが低下するという問題が生じていた。 It is widely performed to transfer the backup data of the created data from the computer or the like that created the data to another file server or the like connected via a network and store valuable data. If this network is extended to a global wide area network such as the Internet, data can be stored and stored from anywhere in the world as long as there is an environment that can access the Internet. You can retrieve your data from anywhere in the world. However, as the scale of the network to be used is increased, the convenience for the user is improved, but on the contrary, the security is lowered. Data to be stored usually contains private information of individuals, and it is necessary to ensure sufficient security so that only the person who deposited the data or the agent entrusted by the person can retrieve it. . As described above, in the conventional data archiving system using a network, there is a problem that security is lowered when convenience is improved so that data can be deposited and retrieved from anywhere.

そこで、下記の特許文献１には、このような問題を解決するために、データを預けた本人あるいはデータへのアクセスを許可された特定の者だけが、任意の場所から預け入れたデータに安全にアクセスでき、しかもデータを保管するサーバ側に特別な装置やソフトウエアの用意を必要としない分散型データアーカイブ装置が開示されている。この文献に開示された装置では、保管対象データが複数の分割ファイルに分割され、ネットワーク上の複数の格納場所に分散して保存されることになる。このような保存方法を採れば、各格納場所に保存された個々の分割ファイル自身についてのセキュリティは低くても、元の保管対象ファイルの分割方法および個々の分割ファイルの格納場所が認識できない限り、元の保管対象ファイルを復元することは困難であり、十分なセキュリティを確保することが可能になる。
ＰＣＴ国際公開第ＷＯ０１／４６８０８号公報 Therefore, in Patent Document 1 below, in order to solve such a problem, only the person who has deposited the data or a specific person authorized to access the data can safely access the data deposited from any place. There has been disclosed a distributed data archive device that can be accessed and does not require any special device or software on the server side for storing data. In the apparatus disclosed in this document, storage target data is divided into a plurality of divided files and distributed and stored in a plurality of storage locations on the network. If such a storage method is adopted, even if the security of each individual divided file stored in each storage location is low, unless the original division method of the file to be stored and the storage location of each individual divided file can be recognized, It is difficult to restore the original file to be stored, and sufficient security can be ensured.
PCT International Publication No. WO01 / 46808

デジタルデータの重要度は、そこに含まれている情報の経済的あるいは社会的意義によって千差万別であり、データの種類によっては、これまで以上に、バックアップの点における信頼性およびセキュリティの点における信頼性が求められている。そこで、本発明の第１の目的は、より高い信頼性を確保することが可能なネットワークを利用したデータ保管システムを提供することにある。 The importance of digital data varies greatly depending on the economic or social significance of the information contained in it, and depending on the type of data, it is more reliable and secure in terms of backup. There is a need for reliability. Accordingly, a first object of the present invention is to provide a data storage system using a network that can ensure higher reliability.

また、ネットワークを利用してデータをバックアップするシステムでは、バックアップに必要とされる時間に、ネットワーク上のデータ伝送時間が含まれることになる。このため、従来のシステムでは、大量のデータを短時間にバックアップすることが困難になるおそれがある。そこで、本発明の第２の目的は、より短時間でデータ保管作業を完了することが可能なネットワークを利用したデータ保管システムを提供することにある。 Further, in a system that backs up data using a network, the time required for backup includes the data transmission time on the network. For this reason, in a conventional system, it may be difficult to back up a large amount of data in a short time. Accordingly, a second object of the present invention is to provide a data storage system using a network that can complete a data storage operation in a shorter time.

(1) 本発明の第１の態様は、ネットワークを介してアクセス可能な複数の格納場所に、保管対象データを分散して保管する機能と、保管されていたデータを必要に応じて取り出す機能と、を有するネットワークを利用したデータ保管システムにおいて、
データの保管時に、保管対象データを複数の分割ファイルに分割してネットワークを介して複数の格納場所に分散して格納する預入処理を行う預入処理手段と、この預入処理の方法を示す管理データを作成して、これを直接的にもしくはオンライン送信で間接的に所定の記録媒体に対して出力して格納させる管理データ出力手段と、データの取出時に、管理データを入力する管理データ入力手段と、入力された管理データに基づいて、分散して格納されていた複数の分割ファイルを取り出して元の保管対象データに復元する引出処理を行う引出処理手段と、を有する分散型データアーカイブ装置を複数設け、
第ｎ番目（ｎは１以上の整数）の分散型データアーカイブ装置の預入処理により特定の格納場所に格納される特定の分割ファイルを、第（ｎ＋１）番目の分散型データアーカイブ装置についての保管対象データとして取り扱い、特定の分割ファイルを更に複数の分割ファイルに分割した上で、特定の格納場所とは異なる複数の格納場所に分散して格納する処理を行うようにしたものである。 (1) The first aspect of the present invention is a function for distributing and storing data to be stored in a plurality of storage locations accessible via a network, and a function for retrieving stored data as needed. In a data storage system using a network having
When storing data, deposit processing means for performing deposit processing to divide the storage target data into a plurality of divided files and distribute and store the data in a plurality of storage locations via a network, and management data indicating the method of the deposit processing Management data output means for creating and outputting this data directly or indirectly to a predetermined recording medium by online transmission, and management data input means for inputting management data at the time of data retrieval; A plurality of distributed data archiving apparatuses are provided, each having a withdrawal processing unit for performing a withdrawal process for taking out a plurality of divided files stored in a distributed manner based on the input management data and restoring them to the original storage target data ,
A specific divided file stored in a specific storage location by deposit processing of the nth (n is an integer equal to or greater than 1) distributed data archive device is stored as the (n + 1) th distributed data archive device. The data is handled as data, and a specific divided file is further divided into a plurality of divided files, and then distributed and stored in a plurality of storage locations different from the specific storage location.

(2) 本発明の第２の態様は、上述の第１の態様に係るネットワークを利用したデータ保管システムにおいて、
第１段目の分散型データアーカイブ装置〜第Ｎ段目（Ｎは２以上の整数）の分散型データアーカイブ装置までの全Ｎ段の分散型データアーカイブ装置を用意し、
第ｉ段目（ｉ＝１〜Ｎ−１）の分散型データアーカイブ装置の預入処理により特定の格納場所に格納される特定の分割ファイルを、第（ｉ＋１）段目の分散型データアーカイブ装置についての保管対象データとして取り扱い、第（ｉ＋１）段目の分散型データアーカイブ装置において、前記特定の分割ファイルを更に複数の分割ファイルに分割した上で、前記特定の格納場所とは異なる複数の格納場所に分散して格納する処理が行われるように構成し、
第１段目の分散型データアーカイブ装置に対してオリジナル保管対象データを保管したときに、第１段目の分散型データアーカイブ装置で作成された第１段目の管理データ〜第Ｎ段目の分散型データアーカイブ装置で作成された第Ｎ段目の管理データまでの全Ｎ組の管理データが、第１段目の分散型データアーカイブ装置によって、直接的にもしくはオンライン送信で間接的に所定の記録媒体に対して出力されるようにしたものである。 (2) According to a second aspect of the present invention, in the data storage system using the network according to the first aspect described above,
Prepare all N- stage distributed data archive devices from the first- stage distributed data archive device to the N -th (N is an integer of 2 or more) distributed data archive device ,
A specific divided file stored in a specific storage location by the depositing process of the i-th stage (i = 1 to N−1) distributed data archive apparatus is assigned to the (i + 1) -th stage distributed data archive apparatus. In the distributed data archive device in the (i + 1) -th stage, the specific divided file is further divided into a plurality of divided files, and a plurality of storage locations different from the specific storage location Configured to be distributed and stored in
When the original storage target data is stored in the first- stage distributed data archive device, the first- stage management data to the Nth- stage data created by the first- stage distributed data archive device are stored. All N sets of management data up to the Nth stage management data created by the distributed data archiving apparatus are directly or indirectly transmitted online by the first stage distributed data archive apparatus. This is output to a recording medium .

(3) 本発明の第３の態様は、上述の第１または第２の態様に係るネットワークを利用したデータ保管システムにおいて、
第ｎ番目（ｎは１以上の整数）の分散型データアーカイブ装置の預入処理により分割ファイルの格納先となる格納場所自身を、第（ｎ＋１）番目の分散型データアーカイブ装置によって構成するようにしたものである。 (3) According to a third aspect of the present invention, in the data storage system using the network according to the first or second aspect described above,
The storage location itself as the storage location of the divided file by the deposit processing of the nth (n is an integer of 1 or more) distributed data archive device is configured by the (n + 1) th distributed data archive device. Is.

(4) 本発明の第４の態様は、上述の第１〜第３の態様に係るネットワークを利用したデータ保管システムにおいて、
各分散型データアーカイブ装置の預入処理手段が、保管対象データを所定の分割方法に基づいて複数の分割ファイルに分割する分割処理部と、ネットワークを介して個々の分割ファイルを異なる格納場所に格納する格納処理部と、を有し、
各分散型データアーカイブ装置の引出処理手段が、入力された管理データから個々の分割ファイルの格納場所を認識し、この認識した各格納場所からネットワークを介して個々の分割ファイルを取り出す取出処理部と、入力された管理データから分割方法を認識し、この認識した分割方法に基づいて、取り出した個々の分割ファイルを統合して元の保管対象データを生成する統合処理部と、を有し、
各分散型データアーカイブ装置の管理データ出力手段が、分割方法を示す情報および個々の分割ファイルの格納場所を示す情報を含む管理データを作成する機能を有するようにしたものである。 (4) According to a fourth aspect of the present invention, in the data storage system using the network according to the first to third aspects described above,
The depositing processing means of each distributed data archive device stores a division processing unit that divides storage target data into a plurality of divided files based on a predetermined division method, and stores each divided file in different storage locations via a network. A storage processing unit,
A retrieval processing unit of each distributed data archive device recognizes the storage location of each divided file from the input management data, and takes out each divided file from the recognized storage location via the network; An integrated processing unit that recognizes the division method from the input management data, and based on the recognized division method, integrates the extracted individual divided files to generate the original storage target data,
The management data output means of each distributed data archive device has a function of creating management data including information indicating the division method and information indicating the storage location of each divided file.

(5) 本発明の第５の態様は、上述の第４の態様に係るネットワークを利用したデータ保管システムにおいて、
預入処理手段内に、保管対象データを分割する前もしくは後に、保管対象データもしくは分割ファイルに対する暗号化処理を実行する暗号化処理部が設けられ、管理データ作成手段が、この暗号化処理の方法を示す情報を管理データに付加する機能を有しており、引出処理手段内に、入力された管理データから暗号化処理の方法を認識し、この認識した暗号化処理の方法に基づいて復号化処理を実行する復号化処理部が設けられ、復号化処理を経て元の保管対象データの復元が行われる分散型データアーカイブ装置を用いるようにしたものである。 (5) According to a fifth aspect of the present invention, in the data storage system using the network according to the fourth aspect described above,
In the deposit processing means, before or after dividing the storage target data, an encryption processing unit for performing encryption processing on the storage target data or the divided file is provided, and the management data creation means determines the encryption processing method. A function of adding information to the management data, and the encryption processing method is recognized from the input management data in the extraction processing means, and the decryption processing is performed based on the recognized encryption processing method. Is used, and a distributed data archiving apparatus that restores the original storage target data through the decryption process is used.

(6) 本発明の第６の態様は、上述の第４または第５の態様に係るネットワークを利用したデータ保管システムにおいて、
預入処理手段内に、保管対象データもしくは分割ファイルに対して、ダミーデータを付加するダミーデータ付加処理を実行するダミーデータ付加部が設けられ、管理データ作成手段が、このダミーデータ付加処理の方法を示す情報を管理データに付加する機能を有しており、引出処理手段内に、入力された管理データからダミーデータ付加処理の方法を認識し、この認識したダミーデータ付加処理の方法に基づいてダミーデータを除去するダミーデータ除去処理を実行するダミーデータ除去処理部が設けられ、ダミーデータ除去処理を経て元の保管対象データの復元が行われる分散型データアーカイブ装置を用いるようにしたものである。 (6) According to a sixth aspect of the present invention, in the data storage system using the network according to the fourth or fifth aspect described above,
In the deposit processing means, a dummy data adding unit for executing dummy data adding processing for adding dummy data to the storage target data or the divided file is provided, and the management data creating means determines the method of the dummy data adding processing. A function for adding the information shown to the management data. The drawer processing means recognizes the dummy data addition processing method from the inputted management data, and performs dummy processing based on the recognized dummy data addition processing method. A distributed data archiving apparatus is provided that includes a dummy data removal processing unit that executes dummy data removal processing for removing data and restores original storage target data through dummy data removal processing.

(7) 本発明の第７の態様は、ネットワークを介してアクセス可能な複数の格納場所に、保管対象データを分散して保管する機能と、保管されていたデータを必要に応じて取り出す機能と、を有するネットワークを利用したデータ保管システムにおいて、
データの保管時に、保管対象データを複数の分割ファイルに分割してネットワークを介して複数の格納場所に分散して格納する預入処理を行う預入処理手段と、この預入処理の方法を示す管理データを作成して、これを直接的にもしくはオンライン送信で間接的に所定の記録媒体に対して出力して格納させる管理データ出力手段と、データの取出時に、管理データを入力する管理データ入力手段と、入力された管理データに基づいて、分散して格納されていた複数の分割ファイルを取り出して元の保管対象データに復元する引出処理を行う引出処理手段と、を有する複数の分散型データアーカイブ装置と、
オリジナル保管対象データを、複数の部分保管対象データに分割し、個々の部分保管対象データを、複数の分散型データアーカイブ装置に対して、それぞれの保管対象データとして与える処理と、複数の分散型データアーカイブ装置によって引き出された各部分保管対象データを統合して、元のオリジナル保管対象データを生成する処理と、を行う機能を有するデータ分割／統合装置と、
を設けるようにしたものである。 (7) The seventh aspect of the present invention is a function for distributing and storing data to be stored in a plurality of storage locations accessible via a network, and a function for retrieving stored data as needed. In a data storage system using a network having
When storing data, deposit processing means for performing deposit processing to divide the storage target data into a plurality of divided files and distribute and store the data in a plurality of storage locations via a network, and management data indicating the method of the deposit processing Management data output means for creating and outputting this data directly or indirectly to a predetermined recording medium by online transmission, and management data input means for inputting management data at the time of data retrieval; A plurality of distributed data archiving devices, each having a withdrawal processing unit for performing a withdrawal process for taking out a plurality of divided files stored in a distributed manner based on the input management data and restoring them to the original storage target data; ,
The original storage target data is divided into a plurality of partial storage target data, and each partial storage target data is provided as a storage target data to a plurality of distributed data archive devices, and a plurality of distributed data A data dividing / integrating device having a function of integrating the partial storage target data extracted by the archive device and generating the original original storage target data;
Is provided.

(8) 本発明の第８の態様は、上述の第７の態様に係るネットワークを利用したデータ保管システムにおいて、
データ分割／統合装置が、データの保管時には、個々の分散型データアーカイブ装置から出力された管理データの集合体を直接的にもしくはオンライン送信で間接的に所定の記録媒体に対して出力して格納させる機能を有し、データの取出時には、管理データの集合体を入力して、これを個々の分散型データアーカイブ装置に与える機能を有するようにしたものである。 (8) According to an eighth aspect of the present invention, in the data storage system using the network according to the seventh aspect described above,
When storing data, the data division / integration device outputs and stores a collection of management data output from each distributed data archive device directly or indirectly to a predetermined recording medium by online transmission. And a function of inputting a collection of management data and giving it to each distributed data archive device at the time of data extraction.

本発明に係るネットワークを利用したデータ保管システムによれば、複数の分散型データアーカイブ装置をカスケード接続することにより、より高い信頼性を確保することが可能になる。また、複数の分散型データアーカイブ装置を並列接続することにより、短時間でデータ保管作業を完了することが可能になる。 According to the data storage system using the network according to the present invention, it is possible to ensure higher reliability by cascading a plurality of distributed data archive devices. Further, by connecting a plurality of distributed data archive devices in parallel, it is possible to complete the data storage operation in a short time.

以下、本発明を図示する実施形態に基づいて説明する。 Hereinafter, the present invention will be described based on the illustrated embodiments.

＜＜＜ §１．分散型データアーカイブ装置の構成および動作＞＞＞
本発明の特徴は、前掲の特許文献１に開示されている分散型データアーカイブ装置を複数設けることにより、高い信頼性を確保したデータ保管を可能にし、また、短時間でデータ保管作業を完了することを可能にする点にある。そこで、まず、この分散型データアーカイブ装置の構成および動作を簡単に説明しておく。 <<< §1. Configuration and operation of distributed data archive device >>
The feature of the present invention is that by providing a plurality of distributed data archive devices disclosed in the above-mentioned Patent Document 1, it is possible to store data with high reliability and complete the data storage operation in a short time. The point is to make it possible. First, the configuration and operation of this distributed data archive device will be briefly described.

図１は、本発明で利用される分散型データアーカイブ装置１００の基本構成とその動作を説明するためのブロック図である。この分散型データアーカイブ装置１００は、ネットワーク２００（この例では、インターネット）を介してアクセス可能な複数の格納場所３１０〜３５０に、保管対象データＤを分散して保管する機能と、保管されていたデータを必要に応じて取り出す機能と、を有している。 FIG. 1 is a block diagram for explaining the basic configuration and operation of a distributed data archive device 100 used in the present invention. The distributed data archiving apparatus 100 is stored with a function for distributing and storing the storage target data D in a plurality of storage locations 310 to 350 accessible via the network 200 (in this example, the Internet). And a function of retrieving data as needed.

保管対象データＤを、この分散型データアーカイブ装置１００に預け入れると、当該データＤは、複数の分割ファイルに分割された上で、個々の分割ファイルが、ネットワーク２００を経由して、格納場所３１０〜３５０へそれぞれ分散して格納され、いわゆる預かり証として、管理データＭが発行されることになる。一方、預け入れたデータが必要になった場合には、管理データＭを分散型データアーカイブ装置１００に与えると、分散して格納されていた各分割ファイルが集められ、元の保管対象データＤに統合された上で、これを引き出すことができる。 When the storage target data D is deposited in the distributed data archive device 100, the data D is divided into a plurality of divided files, and the individual divided files are stored in the storage locations 310 to 310 via the network 200. Management data M is issued as a so-called deposit receipt. On the other hand, when the deposited data is required, when the management data M is given to the distributed data archive device 100, the divided files stored in a distributed manner are collected and integrated into the original storage target data D. This can then be pulled out.

図２は、このような預入処理の概念を示すブロック図である。図示の例では、保管対象データＤが３つの分割ファイルＤ１，Ｄ２，Ｄ３に分割され、それぞれ格納場所３２０，３４０，３１０に分散して格納されている。引出処理を行う際には、各格納場所３２０，３４０，３１０から分割ファイルＤ１，Ｄ２，Ｄ３を取り出した上で、これらを統合して、元の保管対象データＤが復元される。 FIG. 2 is a block diagram showing the concept of such deposit processing. In the example shown in the figure, the storage target data D is divided into three divided files D1, D2, and D3, and is distributed and stored in the storage locations 320, 340, and 310, respectively. When the extraction process is performed, the divided files D1, D2, and D3 are taken out from the storage locations 320, 340, and 310, and are integrated to restore the original storage target data D.

各格納場所３１０〜３５０は、ネットワーク２００を介してアクセス可能なコンピュータによって構成される。図示の例では、説明の便宜上、５つの格納場所３１０〜３５０のみが設けられている例が示されているが、実際には、より多数の格納場所が設けられることになる。個々の格納場所は、デジタルデータを書き込むことができ、必要に応じてこれを読み出すことが可能な環境を提供することができれば、具体的にはどのようなハードウエアによって構成してもかまわない。ただ、実用上は、データサーバ内のハードディスク装置によって構成される記録領域の一部によって、個々の格納場所を構成するのが現実的である。 Each of the storage locations 310 to 350 is configured by a computer accessible via the network 200. In the example shown in the figure, an example in which only five storage locations 310 to 350 are provided is shown for convenience of explanation, but in reality, a larger number of storage locations are provided. Each storage location may be configured by any hardware as long as it can provide an environment in which digital data can be written and read out as needed. However, practically, it is realistic to configure each storage location by a part of the recording area constituted by the hard disk device in the data server.

ここに示す実施形態では、ネットワーク２００としてインターネットを用いているので、各格納場所３１０〜３５０は、世界中に分散配置された任意のデータサーバによって構成することができる。したがって、実際には、たとえば、格納場所３１０は、東京に設置された特定のデータサーバ内の１つの記録領域によって構成され、格納場所３２０は、ロンドンに設置された特定のデータサーバ内の１つの記録領域によって構成され、…というような形態を採ることが可能である。もちろん、各データサーバは、同一の事業体が管理するデータサーバである必要はなく、別な事業者が提供するレンタルサーバであってもかまわない。 In the embodiment shown here, since the Internet is used as the network 200, each of the storage locations 310 to 350 can be configured by any data server distributed in the world. Therefore, in practice, for example, the storage location 310 is constituted by one recording area in a specific data server installed in Tokyo, and the storage location 320 is one in a specific data server installed in London. It is constituted by a recording area and can take a form such as. Of course, each data server does not need to be a data server managed by the same business entity, and may be a rental server provided by another business operator.

分散型データアーカイブ装置１００は、図１に示すとおり、管理データ出力手段１１０、預入処理手段１２０、引出処理手段１３０、管理データ入力手段１４０という各構成要素からなる。もっとも、これら各構成要素は、分散型データアーカイブ装置１００の機能に着目して、それぞれを機能上の構成要素として捉えたものであり、実際には、これら各構成要素の機能は、分散型データアーカイブ装置１００を構成するコンピュータに組み込まれたソフトウエアプログラムによって実現されることになる。 As shown in FIG. 1, the distributed data archiving apparatus 100 is composed of management data output means 110, deposit processing means 120, withdrawal processing means 130, and management data input means 140. However, each of these components focuses on the function of the distributed data archive device 100 and is regarded as a functional component. In reality, the function of each of these components is distributed data. This is realized by a software program incorporated in a computer constituting the archive device 100.

預入処理手段１２０は、データの保管時に、保管対象データＤを複数の分割ファイルに分割し、ネットワーク２００を介して複数の格納場所（図示の例の場合は、各格納場所３１０〜３５０のうちのいくつか）に分散して格納する預入処理を行う構成要素である。図２に示す例では、保管対象データＤが３つの分割ファイルＤ１，Ｄ２，Ｄ３に分割され、それぞれ格納場所３２０，３４０，３１０に分散して格納されている。ここに示す実施形態の場合、預入処理手段１２０は、図１に示すとおり、分割処理部１２１、暗号化処理部１２２、ダミーデータ付加部１２３、格納処理部１２４という４つの部分から構成されている。これら各部で行われる処理内容については後述する。 The deposit processing means 120 divides the storage target data D into a plurality of divided files when storing the data, and stores a plurality of storage locations (in the illustrated example, among the storage locations 310 to 350 via the network 200). It is a component that performs deposit processing to be distributed and stored in some). In the example shown in FIG. 2, the storage target data D is divided into three divided files D1, D2, and D3, and is distributed and stored in storage locations 320, 340, and 310, respectively. In the embodiment shown here, as shown in FIG. 1, the deposit processing means 120 is composed of four parts: a split processing unit 121, an encryption processing unit 122, a dummy data adding unit 123, and a storage processing unit 124. . Details of processing performed in each of these units will be described later.

一方、管理データ出力手段１１０は、預入処理手段１２０で行われた預入処理の方法を示す管理データＭを作成して、これを保管対象データＤについての預かり証として出力する機能を有する。具体的には、預入処理の方法を示す情報には、保管対象データＤの分割方法を示す情報と、個々の分割ファイルＤ１，Ｄ２，Ｄ３の格納場所を示す情報とが含まれている。 On the other hand, the management data output means 110 has a function of creating management data M indicating the method of deposit processing performed by the deposit processing means 120 and outputting this as a deposit receipt for the storage target data D. Specifically, the information indicating the deposit processing method includes information indicating the division method of the storage target data D and information indicating the storage locations of the individual divided files D1, D2, and D3.

以上述べた管理データ出力手段１１０と預入処理手段１２０は、保管対象データＤを、この分散型データアーカイブ装置１００に預け入れる際に利用される。ユーザは、保管対象データＤを預け入れるのと引き換えに、管理データＭをいわゆる預かり証として受領することになる。もちろん、この管理データＭは、デジタルデータであるから、たとえば、ＩＣカードなどの記録媒体に記録された状態で分散型データアーカイブ装置１００側から提供されたり、あるいは、オンラインで分散型データアーカイブ装置１００側から送信されたりして発行されることになる。 The management data output means 110 and the deposit processing means 120 described above are used when depositing the storage target data D into the distributed data archive device 100. The user receives the management data M as a so-called deposit card in exchange for depositing the storage target data D. Of course, since the management data M is digital data, for example, the management data M is provided from the distributed data archive device 100 in a state of being recorded on a recording medium such as an IC card, or online, the distributed data archive device 100. It is issued from the side.

これに対して、引出処理手段１３０と管理データ入力手段１４０は、過去に預け入れた保管対象データＤを引き出す際に利用される。まず、管理データ入力手段１４０は、預かり証として機能する管理データＭを、分散型データアーカイブ装置１００内に入力する機能をもった構成要素である。ユーザが、預け入れ時に発行された管理データＭを入力する作業を行うと、この管理データＭは、管理データ入力手段１４０によって、分散型データアーカイブ装置１００内に取り込まれることになる。管理データＭがＩＣカードなどの記録媒体に記録されていた場合には、管理データ入力手段１４０は、このような媒体から管理データＭを読み出す機能を果たし、管理データＭがオンラインで送られてきた場合には、管理データ入力手段１４０は、これを受信する機能を果たす。 On the other hand, the withdrawal processing unit 130 and the management data input unit 140 are used when retrieving the storage target data D deposited in the past. First, the management data input means 140 is a component having a function of inputting management data M that functions as a deposit receipt into the distributed data archive device 100. When the user performs an operation of inputting the management data M issued at the time of deposit, the management data M is taken into the distributed data archive device 100 by the management data input means 140. When the management data M is recorded on a recording medium such as an IC card, the management data input means 140 functions to read the management data M from such a medium, and the management data M is sent online. In this case, the management data input means 140 has a function of receiving it.

引出処理手段１３０は、入力された管理データＭに基づいて、分散して格納されていた複数の分割ファイルを取り出して元の保管対象データＤに復元する引出処理を行う構成要素である。たとえば、図２に示すような方法で分割および格納が行われた場合には、管理データＭには、元の保管対象データＤが３つの分割ファイルＤ１，Ｄ２，Ｄ３に分割されたこと、および各分割ファイルＤ１，Ｄ２，Ｄ３がそれぞれ格納場所３２０，３４０，３１０に格納されたこと、が情報として記録されているので、この情報に基づいて、各格納場所３２０，３４０，３１０から分割ファイルＤ１，Ｄ２，Ｄ３を取り出した上で、これらを統合して、元の保管対象データＤを復元する処理が実行される。ここに示す実施形態の場合、引出処理手段１３０は、図１に示すとおり、統合処理部１３１、復号化処理部１３２、ダミーデータ除去部１３３、取出処理部１３４という４つの部分から構成されている。これら各部で行われる処理内容については後述する。 The extraction processing unit 130 is a component that performs extraction processing for extracting a plurality of divided files stored in a distributed manner and restoring them to the original storage target data D based on the input management data M. For example, when division and storage are performed by the method shown in FIG. 2, the management data M includes the original storage target data D divided into three divided files D1, D2, and D3, and Since the fact that each of the divided files D1, D2, D3 has been stored in the storage locations 320, 340, 310 is recorded as information, based on this information, the divided files D1 from the storage locations 320, 340, 310 are recorded. , D2 and D3 are extracted and then integrated to restore the original storage target data D. In the case of the embodiment shown here, the extraction processing means 130 is composed of four parts: an integration processing unit 131, a decoding processing unit 132, a dummy data removal unit 133, and an extraction processing unit 134, as shown in FIG. . Details of processing performed in each of these units will be described later.

続いて、預入処理手段１２０および引出処理手段１３０を構成する個々の構成要素を説明する。ここで、預入処理手段１２０を構成する分割処理部１２１、暗号化処理部１２２、ダミーデータ付加部１２３、格納処理部１２４と、引出処理手段１３０を構成する統合処理部１３１、復号化処理部１３２、ダミーデータ除去部１３３、取出処理部１３４とは、それぞれ対応する構成要素であり、これら各対応する構成要素は、互いに逆の作業を実施する構成要素になる。 Next, individual components constituting the deposit processing unit 120 and the withdrawal processing unit 130 will be described. Here, the division processing unit 121, the encryption processing unit 122, the dummy data addition unit 123, the storage processing unit 124, and the integration processing unit 131 and the decryption processing unit 132 that constitute the withdrawal processing unit 130. The dummy data removing unit 133 and the extraction processing unit 134 are respectively corresponding components, and these corresponding components are components that perform operations opposite to each other.

まず、分割処理部１２１は、保管対象データＤを所定の分割方法に基づいて複数の分割ファイルに分割する分割処理を行う機能を果たす。たとえば、図２に示す例では、保管対象データＤを３つに分割し、３つの分割ファイルＤ１，Ｄ２，Ｄ３を作成する例が示されている。もっとも、分割処理部１２１で実行される分割方法は、図２に示すような方法に限定されるものではなく、どのような分割方法を実行してもかまわない。たとえば、４つの分割ファイルＤ１〜Ｄ４に分割することも可能であるし、個々の分割ファイルのデータ長も任意に設定することが可能である。図２では、元の保管対象データＤを３等分している例が示されているが、同じ３つの分割ファイルＤ１，Ｄ２，Ｄ３に分割する方法であっても、たとえば、保管対象データＤの先頭から１０Ｍバイトまでを第１の分割ファイルＤ１とし、次の２５Ｍバイトまでを第２の分割ファイルＤ２とし、残りを第３の分割ファイルＤ３とする、というような分割方法を決めることもできる。あるいは、保管対象データＤの先頭から８Ｍバイトまでを第１の分割ファイルＤ１とし、残りのデータのうちの奇数バイトを抜き出したものを第２の分割ファイルＤ２とし、偶数バイトを抜き出したものを第３の分割ファイルＤ３とする、というような分割方法を決めることもできる。 First, the division processing unit 121 performs a function of performing a division process of dividing the storage target data D into a plurality of divided files based on a predetermined division method. For example, the example shown in FIG. 2 shows an example in which the storage target data D is divided into three and three divided files D1, D2, and D3 are created. However, the division method executed by the division processing unit 121 is not limited to the method shown in FIG. 2, and any division method may be executed. For example, it can be divided into four divided files D1 to D4, and the data length of each divided file can be arbitrarily set. FIG. 2 shows an example in which the original storage target data D is divided into three equal parts, but even if the method is divided into the same three divided files D1, D2, D3, for example, the storage target data D It is also possible to determine a dividing method in which the first divided file D1 is set up to 10 Mbytes from the beginning of the file, the second divided file D2 is set up to the next 25 Mbytes, and the remaining divided file is set as the third divided file D3. . Alternatively, from the beginning of the storage target data D, the first divided file D1 is defined as the first divided file D1, the odd-numbered bytes extracted from the remaining data are defined as the second divided file D2, and the even-numbered bytes are extracted as the first divided file D2. It is also possible to determine a division method such as three division files D3.

このように、本発明にいう「分割」とは、必ずしも羊羹を切り分けるような分割のみを意味するわけではなく、元の保管対象データＤから所定の一部を離散的に抜き出して集めた集合体によって１つの分割ファイルを構成するような方法も含むものである。実用上は、分割処理部１１０には、複数通りの分割方法を行う機能をもたせておき（乱数を用いてバイト数を指定するような方法を採れば、実質的に無限通りの分割方法を定義できる）、所定のアルゴリズムに基づいて、あるいは全くランダムに、特定の分割方法を選択して実行できるようにしておくのが好ましい。 As described above, the “division” in the present invention does not necessarily mean only the division that cuts out the sheep, but is an aggregate obtained by collecting a predetermined part of the original storage target data D in a discrete manner. This includes a method of forming one divided file by In practice, the division processing unit 110 has a function of performing a plurality of division methods (if a method of specifying the number of bytes using a random number is employed, a substantially infinite number of division methods are defined. It is preferable to select and execute a specific division method based on a predetermined algorithm or entirely at random.

分割処理部１２１において、保管対象データＤに対して施した分割方法を示す情報は、管理データ出力手段１１０へと伝達され、管理データＭに盛り込まれる。すなわち、管理データＭには、保管対象データＤに対して、どのようなアルゴリズムに基づく分割処理が行われ、合計いくつの分割ファイルに分割され、個々の分割ファイルの長さがそれぞれ何バイトになったか、という情報が書き込まれることになる。 Information indicating the division method applied to the storage target data D in the division processing unit 121 is transmitted to the management data output unit 110 and included in the management data M. That is, the management data M is subjected to division processing based on what kind of algorithm is performed on the storage target data D, and is divided into a total number of divided files, and the length of each divided file becomes each byte. Is written.

一方、統合処理部１３１は、管理データＭに盛り込まれている上述したような情報に基づいて、過去に実施された分割方法を認識し、この認識した分割方法に基づいて、各格納場所から取り出された個々の分割ファイルを統合し、元の保管対象データＤを生成する統合処理を行う機能を果たす。 On the other hand, the integration processing unit 131 recognizes the division method implemented in the past based on the above-described information included in the management data M, and extracts from each storage location based on the recognized division method. A function of performing an integration process for integrating the individual divided files and generating the original storage target data D is achieved.

暗号化処理部１２２および復号化処理部１３２は、セキュリティをより強固にするための暗号化および復号化を行う構成要素である。これらの構成要素は、本発明を実施する上で必須の構成要素ではないが、実用上は、これらの構成要素を設けるようにするのが好ましい。 The encryption processing unit 122 and the decryption processing unit 132 are components that perform encryption and decryption for further enhancing security. These components are not indispensable components for carrying out the present invention, but it is preferable to provide these components for practical use.

暗号化処理部１２２は、保管対象データＤを分割する前もしくは後に、保管対象データＤもしくは分割ファイルに対する暗号化処理を実行する構成要素である。暗号化処理部１２２で施された暗号化処理の方法を示す情報は、管理データ出力手段１１０へと伝達され、管理データＭに盛り込まれる。一方、復号化処理部１３２は、管理データＭに盛り込まれている暗号化処理の方法を認識し、この認識した暗号化処理の方法に基づいて復号化処理を実行する機能を有する構成要素であり、この復号化処理を経て元の保管対象データＤの復元が行われることになる。 The encryption processing unit 122 is a component that executes encryption processing for the storage target data D or the divided file before or after the storage target data D is divided. Information indicating the encryption processing method performed by the encryption processing unit 122 is transmitted to the management data output unit 110 and included in the management data M. On the other hand, the decryption processing unit 132 is a component having a function of recognizing the encryption processing method included in the management data M and executing the decryption processing based on the recognized encryption processing method. The original storage target data D is restored through this decryption process.

上述したように、暗号化処理は、保管対象データＤを分割する前に施してもよいし、分割した後に施してもよい。図３(a) は、分割前に暗号化を施す概念図である。この例では、元の保管対象データＤに対して、まず暗号化処理を施すことにより暗号化データＤｃを生成し、この暗号化データＤｃに対して分割処理を施すことにより、分割暗号化ファイルＤｃ１，Ｄｃ２，Ｄｃ３を生成している。したがって、各格納場所には、これら分割暗号化ファイルＤｃ１，Ｄｃ２，Ｄｃ３が格納されることになる。 As described above, the encryption process may be performed before the storage target data D is divided or may be performed after the division. FIG. 3A is a conceptual diagram in which encryption is performed before division. In this example, the encrypted data Dc is first generated by performing an encryption process on the original storage target data D, and the divided encrypted file Dc1 is generated by performing a division process on the encrypted data Dc. , Dc2, Dc3 are generated. Therefore, these divided encrypted files Dc1, Dc2, and Dc3 are stored in the respective storage locations.

一方、図３(b) は、分割後に暗号化を施す概念図である。この例では、元の保管対象データＤをまず分割して、分割ファイルＤ１，Ｄ２，Ｄ３を生成した上で、これら各分割ファイルＤ１，Ｄ２，Ｄ３のそれぞれに対して暗号化処理を施し、暗号化分割ファイルＤ１ｃ，Ｄ２ｃ，Ｄ３ｃを生成している。したがって、各格納場所には、これら暗号化分割ファイルＤ１ｃ，Ｄ２ｃ，Ｄ３ｃが格納されることになる。 On the other hand, FIG. 3B is a conceptual diagram in which encryption is performed after division. In this example, the original storage target data D is first divided to generate divided files D1, D2, and D3, and then encryption processing is performed on each of the divided files D1, D2, and D3, and the encryption is performed. Divided files D1c, D2c, and D3c are generated. Accordingly, the encrypted divided files D1c, D2c, and D3c are stored in each storage location.

いずれにしても、各格納場所に格納されているファイルそれ自身は、意味のあるデータにはなっていないので、正規の権原をもたない者によってダウンロードされたとしても、不正に利用される可能性が極めて低く、セキュリティの向上を図ることができる。 In any case, the file itself stored in each storage location is not meaningful data, so it can be used illegally even if it is downloaded by a person who does not have the proper title. Therefore, security can be improved.

暗号化とは別な手法として、本来のデータに、ダミーデータを付加する手法も有効である。図４は、このダミーデータを付加する手法の概念を説明するための図である。たとえば、図４(a) に示すように、本来のデータに、これとは無関係なダミーデータ（ハッチング部分）を付加する処理を行うと、本来は不要な余分なデータが混入することになり、正しいデータの利用を行うことができなくなる。特に、図４(b) に示すように、本来のデータを複数カ所で分断する形式で、ダミーデータを挿入するような方法を採れば、本来のデータとして利用することはほとんど不可能になる。もちろん、このダミーデータの付加処理は、保管対象データＤを分割する前に施してもよいし、分割した後の個々の分割ファイルに対して施してもよい。 As a method different from encryption, a method of adding dummy data to the original data is also effective. FIG. 4 is a diagram for explaining the concept of the method of adding the dummy data. For example, as shown in FIG. 4 (a), when dummy data (hatched portion) that is not related to the original data is added, unnecessary data that is originally unnecessary is mixed. The correct data cannot be used. In particular, as shown in FIG. 4 (b), if a method of inserting dummy data in a format in which the original data is divided at a plurality of locations, it is almost impossible to use the original data. Of course, this dummy data addition processing may be performed before the storage target data D is divided, or may be performed on each divided file after the division.

ダミーデータ付加部１２３およびダミーデータ除去部１３３は、このような原理に基づいて、保管対象データＤを預け入れる際にダミーデータを付加し、引き出す際にこれを除去する処理を行う構成要素である。これらの構成要素は、本発明を実施する上で必須の構成要素ではないが、実用上は、これらの構成要素を設けるようにすれば、セキュリティを更に向上させることができる。 Based on such a principle, the dummy data adding unit 123 and the dummy data removing unit 133 are components that perform a process of adding dummy data when depositing the storage target data D and removing it when extracting the data to be stored. These components are not indispensable components for carrying out the present invention. However, in practice, if these components are provided, security can be further improved.

より具体的に説明すれば、ダミーデータ付加部１２３は、保管対象データＤもしくは分割ファイルに対して、ダミーデータを付加するダミーデータ付加処理を実行する構成要素である。ダミーデータ付加部１２３で施されたダミーデータ付加処理の方法を示す情報は、管理データ出力手段１１０へと伝達され、管理データＭに盛り込まれる。一方、ダミーデータ除去部１３３は、管理データＭに盛り込まれているダミーデータ付加処理の方法を認識し、この認識したダミーデータ付加処理の方法に基づいて、ダミーデータ除去処理を実行する機能を有する構成要素であり、このダミーデータ除去処理を経て元の保管対象データＤの復元が行われることになる。 More specifically, the dummy data adding unit 123 is a component that executes a dummy data adding process for adding dummy data to the storage target data D or the divided file. Information indicating the method of the dummy data adding process performed by the dummy data adding unit 123 is transmitted to the management data output means 110 and included in the management data M. On the other hand, the dummy data removal unit 133 has a function of recognizing the dummy data addition process method included in the management data M and executing the dummy data removal process based on the recognized dummy data addition process method. It is a component, and the original storage target data D is restored through this dummy data removal process.

格納処理部１２４は、ネットワーク２００を介して個々の分割ファイルを異なる格納場所に格納する機能をもった構成要素である。どの分割ファイルをどの格納場所に格納するかは、格納処理部１２４に組み込まれた所定のアルゴリズムにより決定される。このアルゴリズムは任意のものでかまわない。たとえば、格納処理部１２４内に、利用可能な格納場所のリストを予め用意しておき、その中から乱数を利用して全くランダムな格納場所を、個々の分割ファイルごとに選択するような方式を採ってもよい。 The storage processing unit 124 is a component having a function of storing individual divided files in different storage locations via the network 200. Which division file is stored in which storage location is determined by a predetermined algorithm incorporated in the storage processing unit 124. This algorithm may be arbitrary. For example, a list of available storage locations is prepared in advance in the storage processing unit 124, and a completely random storage location is selected for each divided file from among them using a random number. May be taken.

個々の分割ファイルの格納場所を示す情報は、管理データ出力手段１１０へと伝達され、管理データＭに盛り込まれる。ネットワーク２００としてインターネットを利用する場合、格納場所を示す情報として、ＵＲＬを用いると便利である。一方、取出処理部１３４は、管理データＭに盛り込まれている個々の分割ファイルの格納場所を示す情報を認識し、この認識した各格納場所からネットワーク２００を介して個々の分割ファイルを取り出す機能をもった構成要素である。取り出された各分割ファイルは、統合処理部１３１において統合される（このとき、必要に応じて、復号化処理部１３２における復号化処理やダミーデータ除去部１３３におけるダミーデータ除去処理が実行される）。 Information indicating the storage location of each divided file is transmitted to the management data output means 110 and included in the management data M. When the Internet is used as the network 200, it is convenient to use a URL as information indicating the storage location. On the other hand, the extraction processing unit 134 has a function of recognizing information indicating the storage location of each divided file included in the management data M and extracting each divided file from the recognized storage location via the network 200. It is a component with. The extracted divided files are integrated in the integration processing unit 131 (at this time, decoding processing in the decoding processing unit 132 and dummy data removal processing in the dummy data removal unit 133 are executed as necessary). .

この分散型データアーカイブ装置１００の利点は、管理データＭがない限り、元の保管対象データＤの引き出しを行うことができないため、十分なセキュリティが確保できるという点である。すなわち、管理データＭをもたない不正な利用者が、不正な手段によって、保管対象データＤを引き出そうとしても、まず、第１に、復元に必要な個々の分割ファイルが、どこの格納場所に格納されているかを認識することができないため、必要な分割ファイルをすべて揃えることは不可能である。そもそも、特定の保管対象データＤを復元するために必要な分割ファイルが、全部でいくつ存在するのかも不明であり、もちろん、これら分割ファイルが格納されている個々の格納場所を知るすべはない。そして、第２に、万一、必要な分割ファイルの入手に成功したとしても、管理データがない限り、各分割ファイルがどのような分割方法によって生成されたのかを知ることができないのであるから、これらを統合して元の保管対象データＤに復元することは困難である。特に、暗号化やダミーデータの付加が行われていれば、復元はより困難になる。 The advantage of this distributed data archive device 100 is that unless the management data M is present, the original storage target data D cannot be pulled out, so that sufficient security can be ensured. That is, even if an unauthorized user who does not have the management data M tries to extract the storage target data D by unauthorized means, first, where each individual divided file necessary for restoration is stored Therefore, it is impossible to recognize all the necessary divided files. In the first place, it is unclear how many divided files are necessary to restore the specific storage target data D. Of course, there is no way to know the individual storage locations where these divided files are stored. Secondly, even if the necessary divided files are successfully obtained, it is impossible to know what division method each divided file was generated unless there is management data. It is difficult to integrate these and restore to the original storage target data D. In particular, restoration becomes more difficult if encryption or dummy data is added.

結局、各格納場所に格納されている個々の分割ファイルが、誰でも何時でもダウンロード可能な状態におかれているとしても、これらの分割ファイルをダウンロードして、意味のあるデータに復元して利用することが可能なのは、管理データを有する正規のユーザのみということになる。 After all, even if the individual divided files stored in each storage location are in a state where anyone can download them at any time, they can be downloaded and restored to meaningful data for use. Only authorized users with management data can do so.

＜＜＜ §２．分散型データアーカイブ装置をカスケード接続する形態＞＞＞
さて、§１では、本発明で利用される分散型データアーカイブ装置１００の構成および動作を説明した。図５は、この分散型データアーカイブ装置１００によるデータの保管原理を示すブロック図である。図示のとおり、ユーザが、バックアップしておきたいと考える保管対象データＤを分散型データアーカイブ装置１００に預ける操作を行うと、この保管対象データＤは、たとえば、３つの分割ファイルＤ１，Ｄ２，Ｄ３に分割され（必要に応じて暗号化やダミーデータの付加が行われ）、ネットワーク２００を介して、所定の格納場所（この例では、格納場所３０１，３０２，３０３）に格納され、管理データＭが発行される。 <<< §2. Form of cascade connection of distributed data archive devices >>
In §1, the configuration and operation of the distributed data archive device 100 used in the present invention have been described. FIG. 5 is a block diagram showing the principle of data storage by the distributed data archive device 100. As illustrated, when the user performs an operation of depositing the storage target data D that the user wants to back up in the distributed data archive device 100, the storage target data D is, for example, three divided files D1, D2, and D3. (Encrypted or added with dummy data as necessary) and stored in a predetermined storage location (in this example, storage locations 301, 302, 303) via the network 200, and management data M Is issued.

本発明の第１の特徴は、このような分散型データアーカイブ装置１００を複数用意し、この複数の分散型データアーカイブ装置をカスケード接続することにより、より高い信頼性を確保する点にある。図６は、この第１の特徴を備えた実施形態を示すブロック図である。この例では、２台の分散型データアーカイブ装置１００Ａ，１００Ｂが用意されており、両者はカスケード式に接続されている。 The first feature of the present invention is that a plurality of distributed data archive devices 100 are prepared, and a plurality of distributed data archive devices are cascaded to ensure higher reliability. FIG. 6 is a block diagram showing an embodiment having the first feature. In this example, two distributed data archive apparatuses 100A and 100B are prepared, and both are connected in cascade.

ここでカスケード式の接続とは、第１の分散型データアーカイブ装置１００Ａによって作成された分割ファイルが、第２の分散型データアーカイブ装置１００Ｂにとっての保管対象データとなるように、直列的な接続がなされていることを意味する。別言すれば、データの預入処理を行う際には、第１の分散型データアーカイブ装置１００Ａの出力データである分割ファイルＤ３が、第２の分散型データアーカイブ装置１００Ｂの入力データとなり、逆に、データの引出処理を行う際には、第２の分散型データアーカイブ装置１００Ｂの出力データである分割ファイルＤ３が、第１の分散型データアーカイブ装置１００Ａの入力データとなる関係にある。 Here, the cascade connection is a serial connection so that the divided file created by the first distributed data archive device 100A becomes storage target data for the second distributed data archive device 100B. It means being done. In other words, when performing the data depositing process, the divided file D3 which is the output data of the first distributed data archive device 100A becomes the input data of the second distributed data archive device 100B. When the data extraction process is performed, the divided file D3, which is the output data of the second distributed data archive device 100B, has a relationship to become the input data of the first distributed data archive device 100A.

この図６に示すデータ保管システムを用いて、保管対象データＤを預け入れる作業は、次のようになる。まず、ユーザは、保管対象データＤを、第１の分散型データアーカイブ装置１００Ａに預け入れる作業を行う。これは、図５に示す従来の分散型データアーカイブ装置１００にデータを預け入れる作業と全く同じである。第１の分散型データアーカイブ装置１００Ａでは、この保管対象データＤが、たとえば、３つの分割ファイルＤ１，Ｄ２，Ｄ３に分割され（必要に応じて暗号化やダミーデータの付加が行われ）、ネットワーク２００を介して、所定の格納場所（この例では、格納場所３０１，３０２，３０３）に格納される。このとき、ユーザには、預かり証として機能する管理データＭが発行される。 The operation of depositing the storage target data D using the data storage system shown in FIG. 6 is as follows. First, the user performs an operation of depositing the storage target data D in the first distributed data archive device 100A. This is exactly the same as the work of depositing data in the conventional distributed data archive device 100 shown in FIG. In the first distributed data archive device 100A, the storage target data D is divided into, for example, three divided files D1, D2, and D3 (encryption and dummy data are added as necessary), and the network 200, the data is stored in a predetermined storage location (in this example, storage locations 301, 302, and 303). At this time, the management data M that functions as a receipt is issued to the user.

このシステムの特徴は、こうして特定の格納場所に格納された分割ファイルについて、更に、ネットワーク２００を介した分散保管処理が行われる点にある。すなわち、図６に示す例の場合、格納場所３０３に格納された分割ファイルＤ３が、第２の分散型データアーカイブ装置１００Ｂによって、更に別な格納場所３０４，３０５，３０６に分散して格納されている。この第２の分散型データアーカイブ装置１００Ｂによる分割ファイルＤ３の預け入れ処理は、ユーザが意識せずに自動的に行われる処理であるため、ユーザから見た作業は、図５に示すシステムについての作業と全く同様である。 This system is characterized in that distributed storage processing is further performed via the network 200 for the divided files stored in the specific storage location. That is, in the example shown in FIG. 6, the divided file D3 stored in the storage location 303 is distributed and stored in further storage locations 304, 305, and 306 by the second distributed data archive device 100B. Yes. Since the depositing process of the divided file D3 by the second distributed data archive device 100B is a process automatically performed without the user being aware of it, the work seen from the user is the work on the system shown in FIG. Is exactly the same.

ここで、第１の分散型データアーカイブ装置１００Ａに着目すると、格納場所３０３に格納された分割ファイルＤ３は、分散格納された保存先のデータということになるが、第２の分散型データアーカイブ装置１００Ｂに着目すると、格納場所３０３に格納された分割ファイルＤ３は、保管対象データという位置づけになる。したがって、第２の分散型データアーカイブ装置１００Ｂは、この分割ファイルＤ３を保管対象データとして取り扱い、これを更に複数の分割ファイルＤ３１，Ｄ３２，Ｄ３３に分割した上で、ネットワーク２００を介して、異なる格納場所３０４，３０５，３０６に格納する処理を実行し、この分割ファイルＤ３についての預かり証として機能する管理データＭ３を発行する。この管理データＭ３は、図示の例では、格納場所３０３内に、分割ファイルＤ３とともに格納されている。 Here, focusing on the first distributed data archive device 100A, the divided file D3 stored in the storage location 303 is the storage destination data stored in a distributed manner, but the second distributed data archive device. Focusing on 100B, the divided file D3 stored in the storage location 303 is positioned as storage target data. Therefore, the second distributed data archiving apparatus 100B handles the divided file D3 as the storage target data, further divides it into a plurality of divided files D31, D32, and D33, and stores them separately via the network 200. The processing stored in the locations 304, 305, and 306 is executed, and management data M3 that functions as a receipt for the divided file D3 is issued. In the illustrated example, the management data M3 is stored in the storage location 303 together with the divided file D3.

このようなデータ保管システムに預け入れた保管対象データＤを引き出す作業は、格納場所３０３内に、分割ファイルＤ３が支障なく格納されている限り、図５に示す従来のシステムにおける引出作業と全く同様である。すなわち、ユーザが、預かり証として機能する管理データＭを第１の分散型データアーカイブ装置１００Ａに与えて、データの引出要求を行うと、第１の分散型データアーカイブ装置１００Ａの機能により、格納場所３０１，３０２，３０３から各分割ファイルＤ１，Ｄ２，Ｄ３が取り出され、統合処理により元の保管対象データＤが復元され、ユーザに提供される。 The operation of extracting the storage target data D deposited in the data storage system is exactly the same as the extraction operation in the conventional system shown in FIG. 5 as long as the divided file D3 is stored in the storage location 303 without any trouble. is there. That is, when the user gives management data M that functions as a deposit receipt to the first distributed data archive device 100A and makes a data withdrawal request, the storage location is stored by the function of the first distributed data archive device 100A. The divided files D1, D2, D3 are extracted from 301, 302, 303, and the original storage target data D is restored by integration processing and provided to the user.

この実施形態のメリットは、万一、格納場所３０３に格納されていた分割ファイルＤ３が破損したような場合にも、第２の分散型データアーカイブ装置１００Ｂの機能により、これを復元することができる点にある。すなわち、第１の分散型データアーカイブ装置１００Ａから、格納場所３０３内の分割ファイルＤ３を読み出す指示が与えられているのに、分割ファイルＤ３の読出しが正常に行われない場合（読出しが全く不可能な場合だけでなく、エラーチェックコードの検証によりエラーが発生するような場合も含む）、第２の分散型データアーカイブ装置１００Ｂに対して、管理データＭ３を与えて、分割ファイルＤ３の引き出しを要求する処理が自動的に行われるようにしておけば、第２の分散型データアーカイブ装置１００Ｂが、格納場所３０４，３０５，３０６に格納されていた分割ファイルＤ３１，Ｄ３２，Ｄ３３から、分割ファイルＤ３を復元することができる。もちろん、このような復元処理は、ユーザが意識せずに自動的に行われる処理であるので、ユーザから見た引き出し処理の作業は、図５に示すシステムについての作業と全く同様である。 The advantage of this embodiment is that even if the divided file D3 stored in the storage location 303 is damaged, it can be restored by the function of the second distributed data archive device 100B. In the point. That is, when the first distributed data archiving apparatus 100A gives an instruction to read the divided file D3 in the storage location 303, but the reading of the divided file D3 is not performed normally (reading is not possible at all). In addition to a case where an error occurs due to verification of the error check code, the management data M3 is given to the second distributed data archive device 100B to request the withdrawal of the divided file D3. If the processing to be performed is automatically performed, the second distributed data archive device 100B obtains the divided file D3 from the divided files D31, D32, D33 stored in the storage locations 304, 305, 306. Can be restored. Of course, such a restoration process is a process that is automatically performed without the user's awareness, and therefore the extraction process seen from the user is exactly the same as the work for the system shown in FIG.

このように、分割ファイルＤ３の情報は、格納場所３０３に格納されるとともに、格納場所３０４，３０５，３０６に分散して格納されることになるので、冗長性は増すことになるが、データのバックアップという見地での信頼性は向上することになる。 As described above, the information of the divided file D3 is stored in the storage location 303 and distributedly stored in the storage locations 304, 305, and 306. Therefore, the redundancy is increased, but the data Reliability in terms of backup will be improved.

なお、図では、格納場所３０３に格納された分割データＤ３についてのみ、第２の分散型データアーカイブ装置１００Ｂによってバックアップを行う例を示したが、もちろん、実用上は、格納場所３０１，３０２に格納された分割データＤ１，Ｄ２についても、同様に、分散型データアーカイブ装置を用いてバックアップを行うようにすれば、より信頼性を高めることが可能である。この場合、第２の分散型データアーカイブ装置１００Ｂを用いて、分割データＤ１，Ｄ２についてのバックアップも行うようにしてもよいし、別個の分散型データアーカイブ装置を必要に応じて設けるようにしてもよい。 In the figure, an example is shown in which only the divided data D3 stored in the storage location 303 is backed up by the second distributed data archive device 100B, but of course, it is practically stored in the storage locations 301 and 302. Similarly, if the divided data D1 and D2 are backed up using a distributed data archive device, the reliability can be further improved. In this case, the second distributed data archive device 100B may be used to back up the divided data D1 and D2, or a separate distributed data archive device may be provided as necessary. Good.

また、図では、分散型データアーカイブ装置を２段階カスケード接続する例を示したが、同様に、３段階以上のカスケード接続を行うことも可能である。たとえば、図に示す格納場所３０４に格納されている分割ファイルＤ３１についてのバックアップを行うために、第３の分散型データアーカイブ装置１００Ｃを設け、分割ファイルＤ３１を、更に、３つの分割ファイルＤ３１１，Ｄ３１２，Ｄ３１３に分割し、これらをネットワーク２００を介して、それぞれ別の格納場所に分散して格納することも可能である。もちろん、４段階以上のカスケード接続を行うこともできる。 In the figure, an example in which the distributed data archiving apparatus is cascade-connected in two stages is shown. Similarly, cascade connection in three or more stages is also possible. For example, in order to back up the divided file D31 stored in the storage location 304 shown in the figure, a third distributed data archive device 100C is provided, and the divided file D31 is further divided into three divided files D311, D312. , D 313, and these can be distributed and stored in separate storage locations via the network 200. Of course, cascade connection of four or more stages can be performed.

要するに、この§２で述べる実施形態の要点は、第ｎ番目（ｎは１以上の整数）の分散型データアーカイブ装置の預入処理により特定の格納場所に格納される特定の分割ファイルを、第（ｎ＋１）番目の分散型データアーカイブ装置についての保管対象データとして取り扱い、この特定の分割ファイルを更に複数の分割ファイルに分割した上で、上記特定の格納場所とは異なる複数の格納場所に分散して格納する処理を行う、という点にあり、カスケード接続する段数は何段でもかまわない。 In short, the main point of the embodiment described in this §2 is that the specific divided file stored in the specific storage location by the deposit processing of the nth (n is an integer of 1 or more) distributed data archive device is the (( Treated as storage target data for the (n + 1) th distributed data archiving apparatus, further dividing this specific divided file into a plurality of divided files and then distributing them to a plurality of storage locations different from the specific storage location. The number of stages to be cascade-connected is not limited.

なお、各分散型データアーカイブ装置によって発行された管理データは、１つの格納場所のみに保管しておくのではなく、別な格納場所にもバックアップして保管しておくようにするのが好ましい。たとえば、図６に示す例の場合、第２の分散型データアーカイブ装置１００Ｂによって発行された管理データＭ３（分割ファイルＤ３についての管理データ）は、格納場所３０３に保管されているが、万一、保管場所３０３を提供するデータサーバが完全に動作不能となってしまった場合、分割ファイルＤ３とともに、管理データＭ３も失われてしまうことになるので、第２の分散型データアーカイブ装置１００Ｂが機能していても、分割ファイルＤ３の復元を行うことができなくなってしまう。このような場合、管理データＭ３が別な格納場所にも保管されていれば、この別な格納場所に保管されてきた管理データＭ３を用いて、分割ファイルＤ３の復元が可能になる。 The management data issued by each distributed data archiving apparatus is preferably stored not only in one storage location but also in another storage location. For example, in the case of the example shown in FIG. 6, the management data M3 issued by the second distributed data archive device 100B (management data for the divided file D3) is stored in the storage location 303. When the data server that provides the storage location 303 becomes completely inoperable, the management data M3 is also lost along with the divided file D3, so that the second distributed data archive device 100B functions. However, the divided file D3 cannot be restored. In such a case, if the management data M3 is stored in another storage location, the divided file D3 can be restored using the management data M3 stored in the other storage location.

また、この管理データＭ３のコピーを、格納場所３０３から第１の分散型データアーカイブ装置１００Ａへも送信するようにし、管理データＭとともにユーザに預かり証として提供しておくと好都合である。すなわち、図６に示す例の場合、ユーザには、管理データＭと管理データＭ３とが、オリジナル保管対象データＤについての預かり証として提供されることになる。そうすれば、万一、保管場所３０３を提供するデータサーバが完全に動作不能となってしまった場合でも、ユーザから提供された管理データＭ３に基づいて、分割ファイルＤ３の復元が可能になる。もちろん、この分割ファイルＤ３の復元は、第１の分散型データアーカイブ装置１００Ａにおいて行うことも可能である。 Further, it is convenient to send a copy of the management data M3 from the storage location 303 to the first distributed data archive device 100A, and provide it to the user together with the management data M as a receipt. That is, in the case of the example shown in FIG. 6, the management data M and the management data M3 are provided to the user as a custody for the original storage target data D. Then, even if the data server that provides the storage location 303 becomes completely inoperable, the divided file D3 can be restored based on the management data M3 provided by the user. Of course, the restoration of the divided file D3 can also be performed in the first distributed data archive device 100A.

３段階以上の分散型データアーカイブ装置をカスケード接続した場合も同様に、各段階の分散型データアーカイブ装置で発行された管理データをすべてユーザに提供するようにしておけばよい。すなわち、一般論として、第１番目の分散型データアーカイブ装置〜第Ｎ番目（Ｎは２以上の整数）の分散型データアーカイブ装置までの全Ｎ台の分散型データアーカイブ装置をカスケード式に用いることにより構成されたデータ保管システムの場合、ユーザがオリジナル保管対象データＤを保管したときに、第１番目の分散型データアーカイブ装置で作成された第１番目の管理データ〜第Ｎ番目の分散型データアーカイブ装置で作成された第Ｎ番目の管理データまでの全Ｎ組の管理データを、オリジナル保管対象データＤについての預かり証として出力するようにしておけばよい。 Similarly, in a case where three or more stages of distributed data archive devices are cascade-connected, all management data issued by each stage of distributed data archive devices may be provided to the user. That is, in general, all N distributed data archive devices from the first distributed data archive device to the Nth (N is an integer equal to or greater than 2) distributed data archive device are used in a cascade manner. In the case of the data storage system configured by the above, when the user stores the original storage target data D, the first management data to the Nth distributed data created by the first distributed data archive device All N sets of management data up to the Nth management data created by the archive device may be output as a deposit certificate for the original storage target data D.

なお、図６に示す例では、格納場所３０３と第２の分散型データアーカイブ装置１００Ｂとを別個の構成要素として示したが、両者は必ずしも別個独立した構成にする必要はなく、格納場所３０３自身を第２の分散型データアーカイブ装置１００Ｂによって構成するようにしてもかまわない。すなわち、一般論として、第ｎ番目（ｎは１以上の整数）の分散型データアーカイブ装置の預入処理により分割ファイルの格納先となる格納場所自身を、第（ｎ＋１）番目の分散型データアーカイブ装置によって構成することも可能である。 In the example shown in FIG. 6, the storage location 303 and the second distributed data archive device 100B are shown as separate components. However, the storage location 303 itself is not necessarily required to have a separate and independent configuration. May be configured by the second distributed data archive device 100B. That is, as a general rule, the storage location itself that becomes the storage location of the divided file by the deposit processing of the nth (n is an integer of 1 or more) distributed data archive device is the (n + 1) th distributed data archive device. It is also possible to configure by.

＜＜＜ §３．分散型データアーカイブ装置を並列接続する形態＞＞＞
本発明の第２の特徴は、複数の分散型データアーカイブ装置を並列接続することにより、短時間でデータ保管作業を完了できるようにする点にある。図７は、この第２の特徴を備えた実施形態を示すブロック図である。この例では、３台の分散型データアーカイブ装置１００Ａ，１００Ｂ，１００Ｃが用意されており、互いに並列接続されている。また、これら３台の分散型データアーカイブ装置の前段には、データ分割／統合装置４００が設けられている。このデータ分割／統合装置４００は、オリジナル保管対象データＤを、複数の部分保管対象データに分割し、個々の部分保管対象データを、複数の分散型データアーカイブ装置１００Ａ，１００Ｂ，１００Ｃに対して、それぞれの保管対象データとして与える処理と、これら複数の分散型データアーカイブ装置１００Ａ，１００Ｂ，１００Ｃによって引き出された各部分保管対象データを統合して、元のオリジナル保管対象データＤを生成する処理と、を行う機能を有している。 <<< §3. Form to connect distributed data archive devices in parallel >>>
A second feature of the present invention is that a data storage operation can be completed in a short time by connecting a plurality of distributed data archive devices in parallel. FIG. 7 is a block diagram showing an embodiment having the second feature. In this example, three distributed data archive devices 100A, 100B, and 100C are prepared and connected in parallel to each other. In addition, a data division / integration device 400 is provided in the preceding stage of these three distributed data archive devices. The data division / integration device 400 divides the original storage target data D into a plurality of partial storage target data, and the individual partial storage target data is divided into a plurality of distributed data archive devices 100A, 100B, and 100C. A process to be provided as each storage target data, a process of integrating the partial storage target data extracted by the plurality of distributed data archive devices 100A, 100B, and 100C to generate the original original storage target data D; It has a function to perform.

なお、データ分割／統合装置４００が、データの保管を行う時には、個々の分散型データアーカイブ装置１００Ａ，１００Ｂ，１００Ｃから出力された管理データＭ１，Ｍ２，Ｍ３の集合体を、オリジナル保管対象データＤについての預かり証として出力する処理が行われる。また、データ分割／統合装置４００が、データの取り出しを行う時には、この管理データＭ１，Ｍ２，Ｍ３の集合体を入力して、これを個々の分散型データアーカイブ装置１００Ａ，１００Ｂ，１００Ｃに与える処理が行われる。 When the data division / integration device 400 stores data, the aggregate of the management data M1, M2, M3 output from the individual distributed data archive devices 100A, 100B, 100C is used as the original storage target data D. A process of outputting as a receipt for is performed. Further, when the data division / integration device 400 takes out the data, a process of inputting the aggregate of the management data M1, M2, and M3 and giving it to the individual distributed data archive devices 100A, 100B, and 100C Is done.

この図７に示すデータ保管システムを用いて、オリジナル保管対象データＤを預け入れる作業は、次のようになる。まず、ユーザは、オリジナル保管対象データＤを、データ分割／統合装置４００に対して預け入れる作業を行う。すると、データ分割／統合装置４００において、このオリジナル保管対象データＤを、たとえば３つに分割して、３つの部分保管対象データＤ１，Ｄ２，Ｄ３を生成する処理が行われる。そして、各部分保管対象データＤ１，Ｄ２，Ｄ３は、それぞれ分散型データアーカイブ装置１００Ａ，１００Ｂ，１００Ｃに、保管対象データとして与えられる。 The operation of depositing the original storage target data D using the data storage system shown in FIG. 7 is as follows. First, the user performs an operation of depositing the original storage target data D into the data division / integration device 400. Then, the data division / integration device 400 performs processing for dividing the original storage target data D into, for example, three parts and generating three partial storage target data D1, D2, and D3. The partial storage target data D1, D2, and D3 are given as storage target data to the distributed data archive devices 100A, 100B, and 100C, respectively.

この後、各分散型データアーカイブ装置１００Ａ，１００Ｂ，１００Ｃにおいて行われる処理は、§１で述べたとおりである。たとえば、分散型データアーカイブ装置１００Ａでは、部分保管対象データＤ１が３つの分割データＤ１１，Ｄ１２，Ｄ１３に分割され、ネットワーク２００を介して格納場所３０１，３０２，３０３に格納される。同様に、分散型データアーカイブ装置１００Ｂでは、部分保管対象データＤ２が３つの分割データＤ２１，Ｄ２２，Ｄ２３に分割され、ネットワーク２００を介して格納場所３０４，３０５，３０６に格納され、分散型データアーカイブ装置１００Ｃでは、部分保管対象データＤ３が３つの分割データＤ３１，Ｄ３２，Ｄ３３に分割され、ネットワーク２００を介して格納場所３０７，３０８，３０９に格納される。 Thereafter, the processing performed in each of the distributed data archive devices 100A, 100B, and 100C is as described in §1. For example, in the distributed data archive device 100A, the partial storage target data D1 is divided into three divided data D11, D12, and D13, and stored in the storage locations 301, 302, and 303 via the network 200. Similarly, in the distributed data archive device 100B, the partial storage target data D2 is divided into three divided data D21, D22, and D23 and stored in the storage locations 304, 305, and 306 via the network 200. In the device 100C, the partial storage target data D3 is divided into three divided data D31, D32, and D33 and stored in the storage locations 307, 308, and 309 via the network 200.

このような作業の結果、各分散型データアーカイブ装置１００Ａ，１００Ｂ，１００Ｃからは、それぞれ管理データＭ１，Ｍ２，Ｍ３が出力されることになる。データ分割／統合装置４００は、これら管理データＭ１，Ｍ２，Ｍ３の集合体を、ユーザに対して、オリジナル保管対象データＤについての預かり証として出力する。 As a result of such work, management data M1, M2, and M3 are output from the respective distributed data archive devices 100A, 100B, and 100C. The data division / integration apparatus 400 outputs the collection of the management data M1, M2, and M3 as a deposit certificate for the original storage target data D to the user.

一方、この図７に示すデータ保管システムを用いて、オリジナル保管対象データＤを引き出す作業は、次のようになる。まず、ユーザは、データ分割／統合装置４００に対して、管理データＭ１，Ｍ２，Ｍ３の集合体を与えて、オリジナル保管対象データＤの引出要求を行う。すると、データ分割／統合装置４００は、各管理データＭ１，Ｍ２，Ｍ３を、それぞれ分散型データアーカイブ装置１００Ａ，１００Ｂ，１００Ｃに与えて、各部分保管対象データＤ１，Ｄ２，Ｄ３の引出要求を行う。こうして、各分散型データアーカイブ装置１００Ａ，１００Ｂ，１００Ｃから、各部分保管対象データＤ１，Ｄ２，Ｄ３が出力されてきたら、データ分割／統合装置４００において、これを統合する処理を行い、最終的に復元されたオリジナル保管対象データＤがユーザに提供される。 On the other hand, the operation of extracting the original storage target data D using the data storage system shown in FIG. 7 is as follows. First, the user gives a collection of management data M1, M2, and M3 to the data division / integration device 400 and makes a request for drawing out the original storage target data D. Then, the data division / integration device 400 gives the management data M1, M2, and M3 to the distributed data archive devices 100A, 100B, and 100C, respectively, and requests the partial storage target data D1, D2, and D3 to be withdrawn. . When the partial storage target data D1, D2, and D3 are output from the respective distributed data archive devices 100A, 100B, and 100C in this way, the data dividing / integrating device 400 performs a process of integrating them, and finally. The restored original storage target data D is provided to the user.

この図７に示すデータ保管システムのメリットは、オリジナル保管対象データＤの預入処理および引出処理に要する時間を短縮できる点にある。各分散型データアーカイブ装置１００Ａ，１００Ｂ，１００Ｃは、いずれもネットワーク２００を介して、分割ファイルを所定の格納場所に格納する処理を行うので、ネットワーク２００上のデータ伝送に時間がかかると、それだけ処理に時間を要することになる。特に、ネットワーク２００としてインターネットを用い、各格納場所として、ルータを介したトポロジカルな距離が離れた場所に設置されたサーバ装置を用いている場合には、データ伝送に時間がかかることが多い。また、§２で述べたように、分散型データアーカイブ装置を複数段カスケード接続したケースでは、最終段の分散型データアーカイブ装置までデータの伝送が完了するまでに、更に長い時間が必要になる。 The advantage of the data storage system shown in FIG. 7 is that the time required for depositing and withdrawing original storage target data D can be shortened. Each of the distributed data archiving apparatuses 100A, 100B, and 100C performs a process of storing the divided file in a predetermined storage location via the network 200. Therefore, if it takes time to transmit data on the network 200, the process is performed accordingly. It will take time. In particular, when the Internet is used as the network 200 and a server device installed at a location that is a topological distance away via a router is used as each storage location, data transmission often takes time. Further, as described in §2, in the case where the distributed data archiving apparatuses are cascaded in a plurality of stages, a longer time is required until the data transmission to the final distributed data archiving apparatus is completed.

一方、ユーザの立場からは、データバックアップの処理時間は、できるだけ短い方が好ましい。特に、ビジネスの現場では、大容量のデータのバックアップ作業を、夜中の特定の時間帯に行うような運用が採られることが多い。たとえば、午前２時〜５時の３時間の間に、必要なデータのバックアップ作業を行う必要があるような場合、図５に示すような従来のデータ保管システムでは、大容量のデータのバックアップを指定時間内に完了することができない事態も生じることになる。 On the other hand, from the user's standpoint, the data backup processing time is preferably as short as possible. In particular, in business sites, operations are often performed in which a large volume of data is backed up at a specific time during the night. For example, when it is necessary to back up necessary data for 3 hours from 2 am to 5 am, a conventional data storage system as shown in FIG. There will also be situations where it cannot be completed within the specified time.

図７に示すように、複数台の分散型データアーカイブ装置を並列接続すれば、これら複数台の分散型データアーカイブ装置に作業を分担させることができ、上述のような問題を解決することができる。すなわち、図７に示す例のように、３台の分散型データアーカイブ装置を並列接続すれば、１台の分散型データアーカイブ装置で処理した場合に要する時間のほぼ１／３の時間で、同じ処理を実行することが可能になる。 As shown in FIG. 7, if a plurality of distributed data archiving apparatuses are connected in parallel, the plurality of distributed data archiving apparatuses can share work, and the above-described problems can be solved. . That is, as in the example shown in FIG. 7, if three distributed data archive devices are connected in parallel, the time required for processing by one distributed data archive device is approximately 1/3 of the time required. It becomes possible to execute processing.

なお、データ分割／統合装置４００において行われる分割処理は、複数台の分散型データアーカイブ装置に作業分担させるための処理であるので、図１に示す分割処理部１２１が実行する分割処理とは目的が異なる。すなわち、分割処理部１２１が行う分割処理は、セキュリティ確保のため、できるだけ第三者に分割方法を予想されることがないような方法を採る必要があったが、データ分割／統合装置４００で行う分割処理には、そのような必要性はない。したがって、データ分割／統合装置４００では、単純な分割処理（たとえば、羊羹を切るように、３等分するなど）を行うようにすれば足りる。 The division process performed in the data division / integration apparatus 400 is a process for sharing work among a plurality of distributed data archive apparatuses. Therefore, the division process executed by the division processing unit 121 shown in FIG. Is different. That is, the division processing performed by the division processing unit 121 needs to adopt a method in which a third party is not expected to have a division method as much as possible to ensure security. There is no need for such a division process. Therefore, the data division / integration apparatus 400 only needs to perform a simple division process (for example, to divide the sheep into three equal parts so as to cut it).

本発明で利用される分散型データアーカイブ装置１００の基本構成とその動作を説明するためのブロック図である。It is a block diagram for demonstrating the basic composition and the operation | movement of the distributed data archive apparatus 100 utilized by this invention. 図１に示す分散型データアーカイブ装置１００による預入処理の概念を示すブロック図である。It is a block diagram which shows the concept of the deposit process by the distributed data archive apparatus 100 shown in FIG. 本発明に係る暗号化処理の手順を示す概念図である。It is a conceptual diagram which shows the procedure of the encryption process which concerns on this invention. 本発明に係るダミーデータ付加処理を示す概念図である。It is a conceptual diagram which shows the dummy data addition process which concerns on this invention. 本発明で利用される分散型データアーカイブ装置１００によるデータの保管原理を示すブロック図である。It is a block diagram which shows the data storage principle by the distributed data archive apparatus 100 utilized by this invention. 本発明の第１の実施形態に係るネットワークを利用したデータ保管システムの構成を示すブロック図である。It is a block diagram which shows the structure of the data storage system using the network which concerns on the 1st Embodiment of this invention. 本発明の第２の実施形態に係るネットワークを利用したデータ保管システムの構成を示すブロック図である。It is a block diagram which shows the structure of the data storage system using the network which concerns on the 2nd Embodiment of this invention.

Explanation of symbols

１００，１００Ａ，１００Ｂ，１００Ｃ…分散型データアーカイブ装置
１１０…管理データ出力手段
１２０…預入処理手段
１２１…分割処理部
１２２…暗号化処理部
１２３…ダミーデータ付加部
１２４…格納処理部
１３０…引出処理手段
１３１…統合処理部
１３２…復号化処理部
１３３…ダミーデータ除去部
１３４…取出処理部
１４０…管理データ入力手段
２００…ネットワーク（インターネット）
３０１〜３５０…格納場所（データサーバ装置）
４００…データ分割／統合装置
Ｄ…オリジナル保管対象データ
Ｄ１，Ｄ２，Ｄ３…分割ファイル（保管対象データ）
Ｄ１ｃ，Ｄ２ｃ，Ｄ３ｃ…暗号化分割ファイル
Ｄｃ…暗号化データ
Ｄｃ１，Ｄｃ２，Ｄｃ３…分割暗号化ファイル
Ｄ１１〜Ｄ３３…分割ファイル
Ｍ，Ｍ１〜Ｍ３…管理データ 100, 100A, 100B, 100C ... distributed data archive device 110 ... management data output means 120 ... deposit processing means 121 ... split processing section 122 ... encryption processing section 123 ... dummy data addition section 124 ... storage processing section 130 ... extraction process Means 131 ... Integration processing unit 132 ... Decryption processing unit 133 ... Dummy data removal unit 134 ... Extraction processing unit 140 ... Management data input means 200 ... Network (Internet)
301-350 ... Storage location (data server device)
400: Data division / integration device D: Original storage target data D1, D2, D3: Split file (storage target data)
D1c, D2c, D3c ... Encrypted divided file Dc ... Encrypted data Dc1, Dc2, Dc3 ... Divided encrypted files D11-D33 ... Divided files M, M1-M3 ... Management data

Claims

A data storage system having a function of distributing and storing data to be stored in a plurality of storage locations accessible via a network, and a function of retrieving stored data as needed,
Depositing processing means for performing deposit processing for storing data to be divided into a plurality of divided files and distributing the data to a plurality of storage locations via the network when storing data, and management data indicating the method of the deposit processing Management data output means for generating and storing the data directly or indirectly via online transmission to a predetermined recording medium, and management data input means for inputting the management data at the time of data retrieval And a withdrawal processing means for performing a withdrawal process for taking out a plurality of divided files stored in a distributed manner and restoring them to the original storage target data based on the inputted management data With multiple
A specific divided file stored in a specific storage location by deposit processing of the nth (n is an integer equal to or greater than 1) distributed data archive device is stored as the (n + 1) th distributed data archive device. It has a function of handling as data and further dividing the specific divided file into a plurality of divided files, and performing a process of storing in a plurality of storage locations different from the specific storage location. A data storage system using a network.

The data storage system according to claim 1,
A total of N stages of distributed data archiving apparatuses from the first stage of the distributed data archiving apparatus to the Nth stage (N is an integer of 2 or more) of the distributed data archiving apparatus ;
A specific divided file stored in a specific storage location by the depositing process of the i-th stage (i = 1 to N−1) distributed data archive apparatus is assigned to the (i + 1) -th stage distributed data archive apparatus. In the distributed data archive device at the (i + 1) -th stage, the specific divided file is further divided into a plurality of divided files, and a plurality of storages different from the specific storage location are stored. It is configured to be distributed and stored in places,
When the original storage target data is stored in the first- stage distributed data archive device, the first- stage management data to the N-th data created by the first- stage distributed data archive device. the stage of the distributed all N sets of management data in the data archive device up to the N-th stage of the management data created, the first stage of the distributed data archive device, indirectly by directly or online transmission A data storage system using a network characterized in that the data is output to a predetermined recording medium .

The data storage system according to claim 1 or 2,
The storage location itself as the storage location of the divided file by the deposit processing of the nth (n is an integer of 1 or more) distributed data archive device is configured by the (n + 1) th distributed data archive device. Data storage system using the network.

In the data storage system according to claims 1 to 3,
The depositing processing means of each distributed data archive device stores a division processing unit that divides storage target data into a plurality of divided files based on a predetermined division method, and stores each divided file in different storage locations via a network. A storage processing unit,
A retrieval processing unit of each distributed data archive device recognizes the storage location of each divided file from the input management data, and takes out each divided file from the recognized storage location via the network; An integrated processing unit that recognizes the division method from the input management data, and based on the recognized division method, integrates the extracted individual divided files to generate the original storage target data,
Data using a network, wherein the management data output means of each distributed data archive device has a function of creating management data including information indicating the division method and information indicating the storage location of each divided file Storage system.

The data storage system according to claim 4,
In the deposit processing means, before or after dividing the storage target data, an encryption processing unit for performing encryption processing on the storage target data or the divided file is provided, and the management data creation means determines the encryption processing method. A function of adding information to the management data, and the encryption processing method is recognized from the input management data in the extraction processing means, and the decryption processing is performed based on the recognized encryption processing method. A data storage system using a network, comprising a distributed data archiving apparatus that includes a decryption processing unit that performs the above-described processing and restores original storage target data through the decryption process.

The data storage system according to claim 4 or 5,
In the deposit processing means, a dummy data adding unit for executing dummy data adding processing for adding dummy data to the storage target data or the divided file is provided, and the management data creating means determines the method of the dummy data adding processing. A function for adding the information shown to the management data. The drawer processing means recognizes the dummy data addition processing method from the inputted management data, and performs dummy processing based on the recognized dummy data addition processing method. A network having a distributed data archive device provided with a dummy data removal processing unit for performing dummy data removal processing for removing data, wherein the original data to be stored is restored through the dummy data removal processing Data storage system using

A data storage system having a function of distributing and storing data to be stored in a plurality of storage locations accessible via a network, and a function of retrieving stored data as needed,
Depositing processing means for performing deposit processing for storing data to be divided into a plurality of divided files and distributing the data to a plurality of storage locations via the network when storing data, and management data indicating the method of the deposit processing Management data output means for generating and storing the data directly or indirectly via online transmission to a predetermined recording medium, and management data input means for inputting the management data at the time of data retrieval And a plurality of distributed data having a drawer processing unit that performs a drawer process of taking out a plurality of divided files stored in a distributed manner and restoring them to the original storage target data based on the inputted management data An archive device;
The original storage target data is divided into a plurality of partial storage target data, and each partial storage target data is given to each of the plurality of distributed data archive devices as the storage target data, and the plurality of distributions A data dividing / integrating device having a function of integrating the partial storage target data extracted by the type data archive device and generating the original original storage target data;
A data storage system using a network characterized by comprising:

The data storage system according to claim 7,
When storing data, the data division / integration device outputs and stores a collection of management data output from each distributed data archive device directly or indirectly to a predetermined recording medium by online transmission. it has a make function, when extraction of the data, enter the collection of the management data, data storage system using a network and having a function of providing it to the individual distributed data archive device.