WO2014083608A1 - Computer, computer system, and data management method - Google Patents
- Publication number
- WO2014083608A1 (PCT/JP2012/080591)
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- data
- recognition
- unit
- structure data
- unstructured
- Prior art date
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/958—Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
Definitions
- The present invention relates to a computer, a system, and a method for executing recognition processing on unstructured data stored in a storage device and generating, in the storage device, metadata that includes the result of the recognition processing.
- Automating information extraction from unstructured data is required by many businesses that handle large amounts of data.
- To extract information from unstructured data, techniques such as image recognition, speech recognition, and document structure recognition are required.
- Furthermore, a mechanism for linking a large-scale storage device with a recognition system is also important.
- As an example of a mechanism for linking a storage device with a recognition system, a method has been disclosed in which video data and audio data are processed individually and object data and metadata are stored in a database in association with each other (see, for example, Patent Document 1).
- However, the system disclosed in Patent Document 1 is dedicated to video data and audio data, and it is difficult to configure it to work together with a storage device that stores data of other data types, such as documents, and to recognize documents as well.
- In addition, a mechanism for linking a storage device with a recognition system is generally complicated, because many items must be considered, such as a database for storing recognition results, a function for notifying that recognition of data has been completed, throughput when a large amount of data is input simultaneously, and coordination among multiple recognition systems.
- The present invention has been made in view of these points, and an object thereof is to provide an apparatus, a system, and a method capable of flexibly linking a storage device with an arbitrary recognition system.
- A typical example of the invention disclosed in the present application is as follows. That is, it is a computer that manages unstructured data having no fixed data structure and structure data having a fixed data structure, the computer comprising a processor, a memory connected to the processor, a storage device connected to the processor, and an I/O interface connected to the processor, and further comprising: at least one recognition unit that executes, on the unstructured data, recognition processing of a predetermined data type using a predetermined dictionary; and a structure data generation unit that generates the structure data including the result of the recognition processing executed by the recognition unit, identification information of the recognition unit, and identification information of the dictionary used by the recognition unit.
- According to the present invention, by generating structure data that includes the result of recognition processing on unstructured data, identification information of the recognition processing, and identification information of the dictionary used for the recognition processing, various kinds of control using the results of recognition processing on unstructured data become possible, such as coordination with a search system, simultaneous operation of multiple recognition systems, suppression of unnecessary recognition processing, and integration of recognition results output from a plurality of recognition systems.
- The drawings referred to in the embodiments include, among others, an explanatory diagram showing an example of a structured recognition result in the first embodiment, a flowchart explaining the structure data association processing in the first embodiment, an explanatory diagram showing an example of structure data reflecting a plurality of structured recognition results in the first embodiment, a flowchart explaining the recognition function registration processing in the first embodiment, and a block diagram explaining the configuration of the unstructured data storage device in the second embodiment.
- In the first embodiment, an example of a storage device that stores unstructured data including images and audio is described.
- FIG. 1 is an explanatory diagram showing a configuration example of a computer system according to the first embodiment of the present invention.
- the computer system includes a storage server 31, a management server 32, a video server 33, and an audio server 34.
- the storage server 31, the management server 32, the video server 33, and the audio server 34 are connected to each other via the relay device 38.
- the computer system may include a terminal used by a user or the like.
- When the storage server 31, the management server 32, the video server 33, and the audio server 34 are not distinguished from one another, they are also simply referred to as servers.
- the storage server 31 of this embodiment includes a CPU 35, a memory 36, a communication device 37, and a storage device 39.
- The storage device 39 may be, for example, an HDD (Hard Disk Drive) or an SSD (Solid State Drive).
- the storage server 31 may be connected to an external storage apparatus having a control unit, an I / O interface, and a plurality of storage devices.
- The management server 32, the video server 33, and the audio server 34 of the present embodiment have the same hardware configuration.
- the management server 32, the video server 33, and the audio server 34 include a CPU 35, a memory 36, and a communication device 37.
- the CPU 35 executes a program stored in memory 36.
- the functions of the server can be realized by the CPU 35 executing the program.
- the memory 36 stores a program executed by the CPU 35 and various information necessary for executing the program.
- the communication device 37 is a device for communicating with other servers.
- the communication device 37 may be a network interface, for example.
- The programs executed by the CPU 35 exchange data with one another by communicating with other servers using the communication device 37.
- the relay device 38 receives data from an arbitrary device and relays data transmission / reception between devices by transmitting the received data to other devices.
- the relay device 38 includes a CPU (not shown), a memory (not shown), and a communication device (not shown).
- the storage server 31 is a computer that stores various data.
- the memory 36 of the storage server 31 stores programs for realizing the data receiving unit 2, the storage unit 3, the data reference unit 4, and the structural data reference unit 5. Further, the storage device 39 of the storage server 31 stores unstructured data 50, structured data 51, and related information 52.
- the data receiving unit 2 receives data stored in the storage server 31 from a user or the like.
- the storage unit 3 stores the received data in the storage device 39.
- the data reference unit 4 returns the unstructured data 50 stored in the storage device 39 as a response in accordance with an instruction from the user or the like.
- the structure data reference unit 5 returns the structure data 51 stored in the storage device 39 as a response in accordance with an instruction from the user or the like.
- the unstructured data 50 is data whose structure is not defined and cannot be easily managed by the database.
- the structure data 51 is data in which a structure is defined, and is in a format that can be easily managed in a database.
- the structural data 51 corresponds to the metadata of the unstructured data 50.
- the related information 52 is information for managing the correspondence relationship between the non-structure data 50 and the structure data 51.
- the management server 32 is a computer that manages data stored in the storage server 31.
- The memory 36 of the management server 32 stores programs for realizing the crawling processing unit 6, the data distribution unit 7, the audio filter unit 8, the audio recognition unit 9, the audio post-processing unit 10, the video filter unit 11, the video recognition unit 12, the video post-processing unit 13, the recognition result receiving unit 14, the structure data association processing unit 15, the data distribution management unit 16, and the recognition function registration unit 17.
- the crawling processing unit 6 extracts the unstructured data 50 to be processed from the unstructured data 50 stored in the storage device 39.
- the data distribution unit 7 transmits the extracted unstructured data 50 to a predetermined recognition function unit or device.
- the voice filter unit 8 determines whether or not to execute voice data recognition processing on the unstructured data 50.
- the voice recognition unit 9 performs voice data recognition processing on the unstructured data 50. As a result, the recognition result of the voice data is output.
- the speech post-processing unit 10 converts the recognition result of the speech data output from the speech recognition unit 9 into data in a format that can be added to the structure data 51.
- the video filter unit 11 determines whether to perform video data recognition processing on the unstructured data 50.
- the video recognition unit 12 executes video data recognition processing on the unstructured data 50. Thereby, the recognition result of the video data is output.
- the video post-processing unit 13 converts the recognition result of the video data output from the video recognition unit 12 into data in a format that can be added to the structure data 51.
- the recognition result receiving unit 14 receives and temporarily holds the recognition results output from the audio post-processing unit 10 and the video post-processing unit 13.
- the structural data association processing unit 15 reflects the recognition result for the non-structural data 50 in the structural data 51 currently stored.
- the data distribution management unit 16 manages information for determining a recognition function unit to which the data distribution unit 7 distributes data.
- the recognition function registration unit 17 executes processing for newly adding a recognition function unit.
- the video server 33 is a computer that executes video data recognition processing.
- the memory 36 of the video server 33 stores programs for realizing the video dictionary unit 19 and the video recognition processing unit 42.
- the video dictionary unit 19 manages a dictionary used for video data recognition processing.
- the video recognition processing unit 42 executes video data recognition processing. Note that the video data recognition process may be performed using a known technique, and a description thereof will be omitted.
- the voice server 34 is a computer that executes voice data recognition processing.
- the memory 36 of the voice server 34 stores a program for realizing the voice dictionary unit 18 and the voice recognition processing unit 43.
- the voice dictionary unit 18 manages a dictionary used for voice data recognition processing.
- The voice recognition processing unit 43 executes voice data recognition processing.
- Since the voice data recognition processing may be performed using a known technique, a description thereof will be omitted.
- FIG. 2 is an explanatory diagram showing an example of the related information 52 in the first embodiment of the present invention.
- the related information 52 stores information for managing the unstructured data 50 and the structured data 51 associated with the unstructured data 50 in an integrated manner.
- the related information 52 includes a URL 61, an unstructured data path 62, a structured data path 63, and an update time 64.
- the URL 61 stores a URL (Uniform Resource Locator) used when accessing the unstructured data 50 or the structured data 51 stored in the storage server 31.
- the unstructured data path 62 stores the path name of the storage area in which the unstructured data 50 is stored.
- the structure data path 63 stores the path name of the storage area in which the structure data 51 is stored.
- By holding the related information 52, the storage server 31 can manage one URL, the unstructured data 50, and the structure data 51 in association with one another.
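- As a concrete illustration, the related information 52 might be held roughly as in the following Python sketch; the field names (url, unstructured_data_path, structure_data_path, update_time) mirror columns 61 to 64, and the dataclass layout is an assumption for illustration rather than the patent's actual implementation.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional

@dataclass
class RelatedInfoEntry:
    """One row of the related information 52 (URL 61, paths 62/63, update time 64)."""
    url: str                            # URL 61 used to access the data
    unstructured_data_path: str         # path 62 where the unstructured data 50 is stored
    structure_data_path: Optional[str]  # path 63 where the structure data 51 is stored (may be blank)
    update_time: datetime               # update time 64

# The storage server 31 can look up both paths from a single URL.
related_info: dict[str, RelatedInfoEntry] = {}

def register(url: str, data_path: str, structure_path: Optional[str] = None) -> None:
    """Add or update an entry, recording the time the data was stored."""
    related_info[url] = RelatedInfoEntry(url, data_path, structure_path, datetime.now())

register("http://server/wav/20120401.wav", "/data/wav/20120401.wav")
```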
- the processing of this system is divided into seven processes: data storage processing, data reference processing, structural data reference processing, data crawling processing, data recognition processing, structural data association processing, and recognition function registration processing.
- a predetermined recognition process is performed on the stored unstructured data 50.
- the storage server 31 and the management server 32 cooperate with each other to generate structure data using the recognition processing result.
- the management server 32 reflects the newly generated structure data in the structure data 51 that has a corresponding relationship with the non-structure data 50.
- FIG. 3 is a flowchart for explaining data storage processing according to the first embodiment of the present invention.
- FIG. 4 is an explanatory diagram showing an example of the structure data in the first embodiment of the present invention.
- When the storage server 31 receives unstructured data from an external device such as an external PC or server, the storage server 31 starts the data storage processing.
- the data receiving unit 2 receives unstructured data transmitted from the external device via the relay device 38 (step S101).
- the data reception unit 2 receives unstructured data transmitted using, for example, HTTP (HyperText Transfer Protocol).
- the present invention is not limited to the type of unstructured data, and the data receiving unit 2 can receive arbitrary files (unstructured data) such as documents, images, sounds, and moving images.
- the data receiving unit 2 generates a URL for accessing the received unstructured data (step S102).
- For example, a method of using the URL specified in HTTP as it is can be considered.
- Alternatively, the data receiving unit 2 may generate the URL itself; for example, a URL such as "http://server/wav/20120401.wav" is generated.
- the storage unit 3 stores the received unstructured data in the storage device 39 (step S103), and updates the related information 52 (step S104). Thereafter, the storage server 31 ends the process. Specifically, the following processing is executed.
- the storage unit 3 adds a new entry to the related information 52, and stores the URL generated in step S102 in the URL 61 of the entry.
- The storage unit 3 stores, in the unstructured data path 62 of the added entry, the path name where the received unstructured data is stored, and stores, as the update time 64, the time when the unstructured data was stored.
- The structure data path 63 remains blank at this point, because structure data is usually not provided when the unstructured data is stored.
- Note that the data receiving unit 2 can also accept arbitrary structure data together with unstructured data.
- For example, structure data including information such as the owner of the unstructured data, as shown in FIG. 4, may be attached to the unstructured data.
- In this case, the storage unit 3 stores the unstructured data and the structure data in the storage device 39, respectively.
- The storage unit 3 also stores, in the structure data path 63 of the added entry, the path name where the structure data is stored.
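- The following sketch illustrates steps S101 to S104 (receiving data, generating a URL, storing the file, and updating the related information 52). The storage root, server name, and the plain dictionary standing in for the related information 52 are illustrative assumptions, not the patent's implementation.

```python
import os
from datetime import datetime

STORAGE_ROOT = "storage"            # assumed storage location on the storage device 39
BASE_URL = "http://server"          # assumed server name used to build URLs
related_info: dict[str, dict] = {}  # minimal stand-in for the related information 52

def store_unstructured_data(relative_name: str, payload: bytes, structure_xml=None) -> str:
    """Data storage processing: steps S101 to S104 in a simplified form."""
    # Step S102: generate a URL for accessing the received unstructured data.
    url = f"{BASE_URL}/{relative_name}"

    # Step S103: store the unstructured data (and the accompanying structure data, if any).
    data_path = os.path.join(STORAGE_ROOT, relative_name)
    os.makedirs(os.path.dirname(data_path), exist_ok=True)
    with open(data_path, "wb") as f:
        f.write(payload)
    structure_path = None
    if structure_xml is not None:
        structure_path = data_path + ".xml"
        with open(structure_path, "w", encoding="utf-8") as f:
            f.write(structure_xml)

    # Step S104: add an entry to the related information 52; the structure data path may stay blank.
    related_info[url] = {"unstructured_data_path": data_path,
                         "structure_data_path": structure_path,
                         "update_time": datetime.now()}
    return url

url = store_unstructured_data("wav/20120401.wav", b"...")
```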
- the storage unit 3 stores the unstructured data 50 in association with the URL. Therefore, the following data reference process and structural data reference process are possible.
- the data reference unit 4 searches the entry corresponding to the specified URL with reference to the URL 61 of the related information 52 based on the URL specified by the user. Further, the data reference unit 4 refers to the unstructured data path 62 of the retrieved entry, acquires the unstructured data 50, and returns the acquired unstructured data 50 to the user.
- the structure data reference unit 5 searches the entry corresponding to the specified URL with reference to the URL 61 of the related information 52 based on the URL specified by the user. Further, the structure data reference unit 5 refers to the structure data path 63 of the retrieved entry, acquires the structure data 51, and returns the acquired structure data 51 to the user.
- the system can be configured to return the unstructured data 50 or the structured data 51 acquired based on the requested URL to the user using HTTP.
- For example, when the unstructured data 50 is returned to the user using HTTP, the data reference unit 4 can be configured to return an HTTP header to which the content type (data type) of the unstructured data 50 is added, together with the unstructured data 50.
- Further, when only the HTTP header is requested, the data reference unit 4 may return only the content type without returning the entire unstructured data 50.
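- A minimal sketch of the data reference and structure data reference processes is shown below; it re-declares the related_info dictionary used in the storage sketch so that it is self-contained, and it omits the HTTP layer that a real implementation would provide.

```python
import mimetypes

related_info: dict[str, dict] = {}  # assumed registry populated by the data storage processing

def refer_data(url: str) -> bytes:
    """Data reference process: return the unstructured data 50 registered under the URL."""
    entry = related_info[url]
    with open(entry["unstructured_data_path"], "rb") as f:
        return f.read()

def refer_structure_data(url: str) -> str:
    """Structure data reference process: return the structure data 51 registered under the URL."""
    entry = related_info[url]
    with open(entry["structure_data_path"], "r", encoding="utf-8") as f:
        return f.read()

def content_type(url: str) -> str:
    """Return only the content type, e.g. for a HEAD-style request on the URL."""
    guessed, _ = mimetypes.guess_type(related_info[url]["unstructured_data_path"])
    return guessed or "application/octet-stream"
```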
- FIG. 5 is a flowchart for explaining data crawling processing according to the first embodiment of the present invention.
- The management server 32 repeatedly executes the data crawling processing. For example, the management server 32 executes the data crawling processing periodically or when receiving an instruction from a user or the like.
- the crawling processing unit 6 inquires of the storage unit 3 of the storage server 31 and acquires a list of URLs 61 of the related information 52 (step S201). That is, the unstructured data 50 to be processed is extracted.
- For example, the crawling processing unit 6 makes an inquiry that includes a target time.
- In this case, the storage unit 3 refers to the update times 64 stored in the related information 52, lists only the URLs 61 of the latest data (data updated after the target time), and transmits the list of URLs 61 to the crawling processing unit 6.
- The crawling processing unit 6 temporarily holds the latest update time 64 among the listed URLs 61, and in the next inquiry asks only for URLs 61 whose update time is later than that time.
- Note that the list of URLs 61 may become large.
- In that case, the storage unit 3 may list only a predetermined number of URLs 61 in order from the oldest update time 64. As described later, since the data crawling processing is repeatedly executed after waiting for a predetermined time, it is not necessary to list all target URLs 61 at one time.
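- The crawling loop described above might look roughly as follows; the held last_crawled timestamp, the batch limit, and the wait interval are illustrative assumptions used to show incremental crawling over the related information 52.

```python
import time
from datetime import datetime

related_info: dict[str, dict] = {}  # assumed registry populated by the data storage processing

last_crawled = datetime.min   # latest update time 64 seen so far
BATCH_LIMIT = 100             # list at most this many URLs per inquiry
CRAWL_INTERVAL_SEC = 60       # wait time between crawls (step S203)

def list_new_urls(since: datetime, limit: int) -> list[str]:
    """Inquiry to the storage unit 3: URLs 61 updated after 'since', oldest first."""
    candidates = [(e["update_time"], url) for url, e in related_info.items()
                  if e["update_time"] > since]
    candidates.sort()
    return [url for _, url in candidates[:limit]]

def crawl_forever(distribute):
    """Steps S201 to S203: extract newly stored data and hand the URL list to the data distribution unit 7."""
    global last_crawled
    while True:
        urls = list_new_urls(last_crawled, BATCH_LIMIT)   # step S201
        if urls:
            last_crawled = max(related_info[u]["update_time"] for u in urls)
            distribute(urls)                              # step S202
        time.sleep(CRAWL_INTERVAL_SEC)                    # step S203
```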
- the data distribution unit 7 distributes the list of URLs 61 acquired by the crawling processing unit 6 to a predetermined recognition function unit (step S202).
- the recognition function unit is a function unit that executes recognition processing, and includes a filter unit, a recognition unit, a dictionary unit, and a post-processing unit.
- the filter unit determines whether or not the unstructured data 50 is a recognition target based on the URL 61.
- the recognition unit acquires the unstructured data 50 from the storage server 31 based on the URL 61, and executes recognition processing on the acquired unstructured data 50 using the dictionary data held by the dictionary unit.
- The post-processing unit generates structure data using the recognition result. That is, the post-processing unit corresponds to a functional unit (structure data generation unit) that generates structure data. Specifically, the post-processing unit converts the recognition result, which indicates the contents of the unstructured data 50, into data having a fixed structure, and generates structure data by assigning to that data an ID unique to the recognition processing and the ID of the dictionary that was used.
- In this embodiment, the recognition result is converted into XML-format data, but the present invention is not limited to this; it suffices if the result can be converted into a data format having at least a fixed structure.
- The voice recognition function unit that performs voice recognition processing includes the voice filter unit 8, the voice recognition unit 9, the voice recognition processing unit 43, the voice dictionary unit 18, and the voice post-processing unit 10. Similarly, the video recognition function unit that performs video recognition processing includes the video filter unit 11, the video recognition unit 12, the video recognition processing unit 42, the video dictionary unit 19, and the video post-processing unit 13.
- In this embodiment, a publish/subscribe model is used as the message model for distributing the URLs 61.
- The audio filter unit 8 and the video filter unit 11, to which messages are to be distributed, are registered in advance in the data distribution management unit 16 as subscriber information.
- The data distribution unit 7 distributes the list of URLs 61 as a message to the audio filter unit 8 and the video filter unit 11 based on the subscriber information registered in the data distribution management unit 16.
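- A minimal publish/subscribe sketch of the data distribution unit 7 and the data distribution management unit 16 follows; the subscriber registry and the callback signature are illustrative assumptions.

```python
from typing import Callable

# Data distribution management unit 16: subscriber information registered in advance.
subscribers: list[Callable[[list[str]], None]] = []

def register_subscriber(filter_unit: Callable[[list[str]], None]) -> None:
    """Register a filter unit (e.g. the audio or video filter unit) as a message subscriber."""
    subscribers.append(filter_unit)

def distribute(urls: list[str]) -> None:
    """Data distribution unit 7: publish the same URL list to every registered subscriber."""
    for filter_unit in subscribers:
        filter_unit(urls)

register_subscriber(lambda urls: print("audio filter received", urls))
register_subscriber(lambda urls: print("video filter received", urls))
distribute(["http://server/wav/20120401.wav"])
```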
- The crawling processing unit 6 then waits for a predetermined time (step S203), and returns to step S201 to execute the same processing.
- the URL 61 associated with the unstructured data 50 stored in the storage device 39 is notified to each recognition function unit by the data crawling process described above. In addition, by this processing, the URL 61 can be repeatedly delivered every time new unstructured data is stored in the storage server 31.
- FIG. 6 is a flowchart for explaining data recognition processing in the first embodiment of the present invention.
- FIG. 7 is an explanatory diagram showing an example of the structure data reflecting the structured recognition result in the first embodiment of the present invention.
- FIGS. 8 and 9 are explanatory diagrams illustrating examples of structured recognition results according to the first embodiment of the present invention.
- Each recognition function unit starts processing upon receiving the list of URLs 61.
- the voice recognition function unit and the video recognition function unit will be described as examples.
- The audio filter unit 8 and the video filter unit 11 receive the list of URLs 61 transmitted from the data distribution unit 7 (step S301).
- Since the list of URLs 61 is distributed using the publish/subscribe model, each filter unit receives the same list of URLs 61. Thereby, a plurality of recognition processes, such as the voice recognition processing and the video recognition processing, can be executed on, for example, moving image data.
- The audio filter unit 8 and the video filter unit 11 select one URL 61 included in the list of URLs 61, and execute the following processing on the selected URL 61.
- The audio filter unit 8 and the video filter unit 11 determine whether the unstructured data 50 corresponding to the selected URL 61 is a recognition target based on the type of the unstructured data 50 (step S302).
- For example, the audio filter unit 8 and the video filter unit 11 can determine the content type (data type) of the unstructured data 50 based on the extension in the URL 61.
- In this case, the audio filter unit 8 determines unstructured data 50 whose URL 61 ends with ".wav" or ".mpg" to be a recognition target, and the video filter unit 11 determines unstructured data 50 whose URL 61 ends with ".mpg" to be a recognition target.
- Alternatively, the audio filter unit 8 and the video filter unit 11 may acquire the content type of the unstructured data 50 by executing the data reference process based on the URL 61, and determine whether the unstructured data 50 is a recognition target based on the acquired content type.
- As yet another alternative, the audio filter unit 8 and the video filter unit 11 may execute the data reference process based on the URL 61 to acquire the unstructured data 50 itself, and determine whether the unstructured data 50 is a recognition target based on the result of analyzing the acquired unstructured data 50.
- As a method of analyzing the acquired unstructured data 50, for example, a method of determining the content type by analyzing the head of the acquired unstructured data 50 can be considered.
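- The extension-based and header-based checks described above could be sketched as follows; the extension lists mirror the ".wav"/".mpg" example in the text, and the header check (a RIFF/WAVE signature) is only an illustrative assumption.

```python
AUDIO_EXTENSIONS = (".wav", ".mpg")   # example from the text: audio filter unit 8
VIDEO_EXTENSIONS = (".mpg",)          # example from the text: video filter unit 11

def is_audio_target(url: str) -> bool:
    """Step S302 in the audio filter unit 8: decide from the URL extension."""
    return url.lower().endswith(AUDIO_EXTENSIONS)

def is_video_target(url: str) -> bool:
    """Step S302 in the video filter unit 11: decide from the URL extension."""
    return url.lower().endswith(VIDEO_EXTENSIONS)

def looks_like_wav(head: bytes) -> bool:
    """Alternative check: analyze the head of the acquired unstructured data 50
    (here, a RIFF/WAVE signature) instead of relying on the URL."""
    return head[:4] == b"RIFF" and head[8:12] == b"WAVE"

print(is_audio_target("http://server/wav/20120401.wav"))   # True
print(is_video_target("http://server/wav/20120401.wav"))   # False
```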
- When it is determined in step S302 that the unstructured data 50 corresponding to the URL 61 is not a recognition target, the recognition function unit ends the processing.
- When it is determined in step S302 that the unstructured data 50 corresponding to the URL 61 is a recognition target, the audio filter unit 8 and the video filter unit 11 acquire the unstructured data 50 corresponding to the URL 61 (step S303). This can be realized by the structure data reference process described above.
- The audio filter unit 8 and the video filter unit 11 analyze the content of the acquired data and determine whether the unstructured data 50 has already been recognized (step S304).
- FIG. 7 shows the structure data after the structure data association processing described later has been executed on the structure data shown in FIG. 4. Comparing FIG. 4 and FIG. 7, it can be seen that a "metainfo" tag has been added.
- A structured recognition result is added to the metainfo tag portion.
- Therefore, one possible method is for the filter unit to detect the aforementioned tag.
- However, since the above-described tag may have been added by another recognition process, detecting the tag alone is not sufficient for a correct determination.
- In this embodiment, an ID unique to the recognition process is given in the processor_url tag inside the metainfo tag.
- A method may therefore be considered in which the filter unit determines, based on this ID, whether the data has already been recognized; that is, when the structure data 51 includes the ID unique to the corresponding recognition process, the filter unit determines that the unstructured data 50 has already been recognized.
- In addition, a method of recording the time when the recognition process was completed in the processed tag inside the metainfo tag can be considered.
- In this case, the filter unit determines that the unstructured data 50 is a recognition target only when the completion time of the recognition process is earlier than the update time of the recognition function unit.
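- A sketch of how a filter unit might check for a prior recognition follows; the metainfo, processor_url, and processed element names come from the text, while the exact nesting, the ISO date format, and the surrounding document element are assumptions.

```python
import xml.etree.ElementTree as ET
from datetime import datetime

def already_recognized(structure_xml: str, my_processor_url: str,
                       my_update_time: datetime) -> bool:
    """Return True if this recognition function unit has already processed the data
    and the result is newer than the unit's last update (so no re-recognition is needed)."""
    root = ET.fromstring(structure_xml)
    for metainfo in root.iter("metainfo"):
        processor = metainfo.findtext("processor_url")
        processed = metainfo.findtext("processed")
        if processor != my_processor_url:
            continue  # this metainfo tag was added by another recognition process
        if processed is None:
            return True
        # Re-recognize only if the result predates the last update of this recognition unit.
        return datetime.fromisoformat(processed) >= my_update_time
    return False

xml = ("<doc><metainfo><processor_url>http://sound.hitachi.com/tvnews</processor_url>"
       "<processed>2012-04-01T12:00:00</processed></metainfo></doc>")
print(already_recognized(xml, "http://sound.hitachi.com/tvnews", datetime(2012, 1, 1)))  # True
```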
- If it is determined in step S304 that the unstructured data 50 has already been recognized, the recognition function unit ends the processing.
- When it is determined in step S304 that the unstructured data 50 has not been recognized, the voice recognition unit 9 and the video recognition unit 12 execute recognition processing on the unstructured data 50 corresponding to the URL 61 (step S305).
- the voice recognition unit 9 executes voice recognition processing on the unstructured data 50 in cooperation with the voice recognition processing unit 43 and the voice dictionary unit 18.
- the video recognition unit 12 executes video recognition processing on the unstructured data 50 in cooperation with the video recognition processing unit 42 and the video dictionary unit 19.
- In the voice recognition processing, voice data is received, and the words included in the voice data, the start time and end time of each word, and the like are output as the recognition result.
- In the video recognition processing, video data is received, and the names of persons included in the video data, the appearance time and appearance position of each person, and the like are output as the recognition result.
- In this embodiment, voice recognition processing and video recognition processing are taken as examples, but the present invention can apply various kinds of processing for recognizing unstructured data acquired from documents, images, voice, acceleration sensors, and the like.
- In this embodiment, the system configuration is such that the video recognition unit 12 of the management server 32 and the video recognition processing unit 42 of the video server 33 cooperate to execute the video recognition processing, and the voice recognition unit 9 of the management server 32 and the voice recognition processing unit 43 of the voice server 34 cooperate to execute the voice recognition processing.
- However, the management server 32 itself may have a system configuration that executes the recognition processing.
- the voice recognition unit 9 of the management server 32 executes data reference processing to acquire the unstructured data 50 corresponding to the URL 61, and transmits the acquired unstructured data 50 to the voice server 34.
- the voice recognition processing unit 43 on the voice server 34 generates a recognition result using the voice dictionary unit 18, and returns the generated recognition result to the management server 32.
- the voice recognition unit 9 of the management server 32 receives the recognition result.
- the video recognition unit 12 cooperates with the video server 33, and the video recognition processing unit 42 generates a recognition result using the video dictionary unit 19.
- The audio post-processing unit 10 and the video post-processing unit 13 perform post-processing on the recognition results of the recognition processing (step S306).
- the audio post-processing unit 10 and the video post-processing unit 13 generate structured data including a structured recognition result, an ID unique to the recognition process, and an ID unique to the dictionary used for the recognition process. Further, the audio post-processing unit 10 and the video post-processing unit 13 can include the recognition processing completion time in the structure data.
- the URL of the server that executes the recognition process is used as the ID unique to the recognition process.
- the URL of the audio server 34 is “http://sound.hitachi.com/”
- the URL of the video server 33 is “http://video.hitachi.com/”.
- The ID unique to the recognition process may include an ID unique to the dictionary used for the recognition process.
- For example, the ID unique to the recognition process that includes "tvnews", the ID of the dictionary held by the speech dictionary unit 18, is "http://sound.hitachi.com/tvnews".
- By reflecting the generated structure data in the original structure data 51, it becomes possible to determine in step S304 whether the unstructured data 50 has already been recognized.
- The recognition result output by each recognition processing unit may be in any format, but each post-processing unit generates structure data in a unified XML format in order to simplify the structure of the structure data association processing unit 15 described later.
- An example of the XML-format structure data generated by the speech post-processing unit 10 is shown in FIG. 8, and FIG. 9 shows an example of the XML-format structure data generated by the video post-processing unit 13.
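- The post-processing in step S306 could produce XML along the following lines. The processor_url element, the dictionary-qualified ID, and the completion time follow the text, while the word/start/end element names and the overall layout are assumptions made for illustration.

```python
import xml.etree.ElementTree as ET
from datetime import datetime

def build_structured_result(words, processor_url: str, dictionary_id: str) -> str:
    """Audio post-processing sketch: wrap a recognition result in a metainfo element."""
    metainfo = ET.Element("metainfo")
    # ID unique to the recognition process, qualified by the dictionary ID,
    # e.g. "http://sound.hitachi.com/tvnews".
    ET.SubElement(metainfo, "processor_url").text = processor_url.rstrip("/") + "/" + dictionary_id
    ET.SubElement(metainfo, "processed").text = datetime.now().isoformat()
    result = ET.SubElement(metainfo, "result")
    for word, start, end in words:
        item = ET.SubElement(result, "word", start=str(start), end=str(end))
        item.text = word
    return ET.tostring(metainfo, encoding="unicode")

xml_result = build_structured_result([("hello", 0.0, 0.4)],
                                      "http://sound.hitachi.com/", "tvnews")
print(xml_result)
```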
- the audio post-processing unit 10 and the video post-processing unit 13 transmit the structure data to the recognition result receiving unit 14 (step S307).
- the recognition result receiving unit 14 includes a queue so that the structure data can be received from a plurality of recognition function units.
- the audio post-processing unit 10 and the video post-processing unit 13 each transmit a message including structure data to the queue.
- a URL 61 corresponding to the unstructured data 50 that is the recognition target in the recognition process is assigned to the header of the message transmitted to the queue.
- the structural data including the recognition result of the unstructured data 50 stored in the storage server 31 is accumulated in the queue of the recognition result receiving unit 14 by the data recognition process described above.
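- A minimal in-process stand-in for the queue in the recognition result receiving unit 14 is sketched below; the message layout (a header carrying the URL 61 plus an XML body) follows the text, while the queue object itself is an illustrative assumption.

```python
import queue

# Recognition result receiving unit 14: messages from several post-processing units accumulate here.
recognition_results: "queue.Queue[dict]" = queue.Queue()

def send_result(url: str, structure_xml: str) -> None:
    """Step S307: send a message whose header carries the URL 61 of the recognized data."""
    recognition_results.put({"header": {"url": url}, "body": structure_xml})

send_result("http://server/wav/20120401.wav", "<metainfo>...</metainfo>")
message = recognition_results.get()
print(message["header"]["url"])
```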
- each of the plurality of recognition function units includes a filter unit, so that only necessary recognition processing is executed.
- FIG. 10 is a flowchart for explaining the structure data associating process according to the first embodiment of the present invention.
- FIG. 11 is an explanatory diagram illustrating an example of structure data reflecting a plurality of structured recognition results according to the first embodiment of the present invention.
- the recognition result receiving unit 14 acquires the structure data accumulated in the queue (step S401).
- the structure data including the recognition result of the audio data is received earlier than the structure data including the recognition result of the video data.
- In this case, XML-format structure data as shown in FIG. 8 is acquired from the queue.
- Next, the structure data association processing unit 15 identifies the URL 61 corresponding to the unstructured data 50 that was the recognition target, and acquires the structure data 51 corresponding to the identified URL 61 from the storage server 31 by executing the structure data reference process (step S402). Here, as shown in FIG. 4, structure data 51 that does not yet include a recognition result is acquired.
- the structure data association processing unit 15 integrates the structure data 51 acquired from the storage server 31 and the acquired structure data (step S403).
- Specifically, the structure data association processing unit 15 generates one piece of XML-format structure data as shown in FIG. 7 by embedding the received structure data in the structure data 51 acquired from the storage server 31. The embedded portion in FIG. 7 is the recognition result of the audio data.
- the structural data association processing unit 15 analyzes the structural data 51 acquired from the storage server 31 to identify the position where the received structural data is embedded. For example, a method of specifying a position where received data is embedded using a predetermined tag as a key is conceivable. The method described above is an example, and the present invention is not limited to this.
- the structural data association processing unit 15 transmits the generated structural data to the storage unit 3 of the storage server 31 (step S404), and ends the process.
- the storage unit 3 overwrites the existing structure data 51 with the received structure data as new structure data.
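- The integration in step S403 might be sketched as follows: the received metainfo fragment is embedded into the existing structure data 51, here simply appended under the root element; the document layout and element names are assumptions.

```python
import xml.etree.ElementTree as ET

def integrate(existing_structure_xml: str, received_metainfo_xml: str) -> str:
    """Structure data association: embed the received recognition result into the stored structure data."""
    root = ET.fromstring(existing_structure_xml)
    root.append(ET.fromstring(received_metainfo_xml))   # embed at a predetermined position
    return ET.tostring(root, encoding="unicode")

existing = "<doc><owner>alice</owner></doc>"            # assumed layout of the structure data 51
merged = integrate(existing,
                   "<metainfo><processor_url>http://sound.hitachi.com/tvnews</processor_url></metainfo>")
# Repeating the process adds another metainfo element, so several recognition
# results can accumulate in a single piece of structure data.
merged = integrate(merged,
                   "<metainfo><processor_url>http://video.hitachi.com/</processor_url></metainfo>")
print(merged)
```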
- the recognition result of the recognition process for the non-structure data 50 is stored as the structure data 51 associated with the URL 61 by the above-described structure data association process.
- the process is repeatedly executed, so that a plurality of recognition results can be included in one structure data 51.
- When the recognition result of the video data is received after the recognition result of the audio data has been received, the following processing is executed.
- In step S401, the structure data association processing unit 15 acquires XML-format structure data as shown in FIG. 9 from the queue.
- In step S402, the structure data association processing unit 15 acquires the structure data 51 that includes the recognition result of the audio data, as shown in FIG. 7.
- In step S403, the structure data association processing unit 15 integrates the existing structure data and the acquired structure data to generate XML-format structure data as shown in FIG. 11. The portion indicated by the dotted frame in FIG. 11 is the embedded recognition result of the video data.
- In step S404, the structure data association processing unit 15 transmits the structure data in which the recognition result of the video data is embedded to the storage server 31. At this time, the storage server 31 overwrites the existing structure data 51 with the received structure data.
- a plurality of recognition results are integrated into the structure data 51 by repeatedly executing the structure data association process.
- FIG. 12 is a flowchart illustrating the recognition function registration process according to the first embodiment of the present invention.
- the recognition function registration unit 17 receives the recognition function unit to be added (step S501). Specifically, the recognition function registration unit 17 receives a program for realizing a predetermined recognition unit.
- the recognition function unit is realized by the same configuration as the above-described voice recognition function unit and video recognition function unit. That is, the recognition function unit includes a filter unit, a recognition processing unit, a dictionary unit, and a post-processing unit.
- the recognition function registration unit 17 adds a recognition processing unit by storing the received program in the memory 36 of the management server 32 (step S502).
- The recognition function registration unit 17 notifies the data distribution management unit 16 of the identification information of the received program, and registers the recognition function unit realized by the program as a subscriber of the messages distributed from the data distribution unit 7 (step S503), after which the processing ends.
- the recognition function registration unit 17 can add an arbitrary recognition function unit to the computer system. At this time, by using the publish / subscribe model for the message processing of the data distribution unit 7, it can be ensured that the processing of the existing recognition processing unit is not affected.
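- Registration of a new recognition function unit (steps S501 to S503) could look like the following, echoing the publish/subscribe sketch earlier; the RecognitionFunctionUnit class and its callback are illustrative assumptions.

```python
from typing import Callable

subscribers: list[Callable[[list[str]], None]] = []   # data distribution management unit 16

class RecognitionFunctionUnit:
    """Bundle of filter, recognition, dictionary, and post-processing parts (illustrative)."""
    def __init__(self, name: str, handle_urls: Callable[[list[str]], None]):
        self.name = name
        self.handle_urls = handle_urls

def register_recognition_function(unit: RecognitionFunctionUnit) -> None:
    """Steps S501 to S503: store the received unit and register it as a message subscriber.
    Existing subscribers are left untouched, so already-registered recognition units are unaffected."""
    subscribers.append(unit.handle_urls)

register_recognition_function(
    RecognitionFunctionUnit("document", lambda urls: print("document filter received", urls)))
```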
- the post-processing unit generates structure data, but the present invention is not limited to this. For example, the following modifications can be considered.
- step S306 the post-processing unit generates a structured recognition result from the recognition result received from the recognition unit, and transmits a message including the structured recognition result to the recognition result receiving unit 14.
- The URL 61, the ID unique to the recognition process, the ID unique to the dictionary, and the recognition process completion time are added to the header of the message.
- the recognition result receiving unit 14 or the structure data association processing unit 15 generates structure data from the received message.
- the structure data association processing unit 15 integrates the structure data and the existing structure data 51.
- However, the present invention is not limited to this.
- a plurality of target recognition function units may be registered in advance, and the structure data association process may be started when structure data is received from all the recognition function units.
- the structure data association processing unit 15 integrates a plurality of structure data and the existing structure data 51 at a time.
- As described above, the storage server 31 stores the received unstructured data, and further stores the recognition result indicating the contents of the unstructured data, associated with information unique to the recognition process and information on the dictionary, as structure data accompanying the unstructured data.
- As a result, the recognition result for the unstructured data can be managed as structure data associated with the same URL that is used when referring to the unstructured data.
- the database function for storing the recognition result and the function for determining the completion of the recognition process can be realized only by the access process to the storage server 31 using the URL.
- recognition results output from a plurality of recognition function units can be integrated into a single XML structure data for a single unstructured data.
- In the first embodiment, the storage processing of unstructured data is realized by the computer system as a whole.
- The second embodiment differs in that the storage processing of unstructured data is realized using a single apparatus.
- the second embodiment will be described focusing on differences from the first embodiment.
- FIG. 13 is a block diagram illustrating the configuration of the unstructured data storage device 1 according to the second embodiment of the present invention.
- The hardware configuration of the unstructured data storage device 1 is the same as that of the storage server 31, the management server 32, and the like, and includes a CPU (not shown), a memory (not shown), a communication device (not shown), and a storage device (not shown).
- the unstructured data storage device 1 includes a data reception unit 2, a storage unit 3, a data reference unit 4, a structural data reference unit 5, a crawling processing unit 6, a data distribution unit 7, a voice filter unit 8, a voice recognition unit 9, Audio post-processing unit 10, video filter unit 11, video recognition unit 12, video post-processing unit 13, recognition result reception unit 14, structural data association processing unit 15, data distribution management unit 16, recognition function registration unit 17, audio dictionary unit 18 and a video dictionary unit 19.
- the video recognition unit 12 has a function realized by the video recognition unit 12 of the management server 32 and the video recognition processing unit 42 of the video server 33.
- the voice recognition unit 9 has a function realized by the voice recognition unit 9 of the management server 32 and the voice recognition processing unit 43 of the voice server 34.
- the unstructured data storage device 1 provides a user interface for operating the data receiving unit 2, the data reference unit 4, the structural data reference unit 5, and the recognition function registration unit 17 to the user.
- When the data receiving unit 2 receives unstructured data from the user, it executes the data storage processing in cooperation with the storage unit 3. In addition, the data reference unit 4 executes the data reference processing when a reference request is received.
- The crawling processing unit 6 and the data distribution unit 7 execute the data crawling processing periodically or when receiving an instruction from the user.
- the crawling processing unit 6 generates a URL list, and inputs the generated URL list to the data distribution unit 7.
- the data distribution unit 7 inputs a list of URLs to a filter unit that constitutes a predetermined recognition function unit.
- a URL list is input to at least one of the audio filter unit 8 and the video filter unit 11. Thereby, the data recognition process is started.
- The audio filter unit 8 and the video filter unit 11 determine whether the unstructured data 50 corresponding to the URL is a recognition target, and whether the recognition processing for the unstructured data 50 has already been executed.
- the audio filter unit 8 and the video filter unit 11 request the audio recognition unit 9 and the video recognition unit 12 to execute processing based on the determination result.
- the voice recognition unit 9 performs a voice data recognition process on the unstructured data 50 in cooperation with the voice dictionary unit 18 and inputs a recognition result to the voice post-processing unit 10.
- the video recognition unit 12 performs a video data recognition process on the unstructured data 50 in cooperation with the video dictionary unit 19, and inputs a recognition result to the video post-processing unit 13.
- the speech post-processing unit 10 generates structure data including the recognition result, the ID unique to the recognition process of the speech data, and the process completion time, and inputs the structure data to the recognition result receiving unit 14. Further, the video post-processing unit 13 generates structural data including the recognition result, the ID unique to the recognition processing of the video data, and the completion time of the processing, and inputs the structural data to the recognition result receiving unit 14.
- the recognition result receiving unit 14 executes the structural data association processing in cooperation with the structural data association processing unit 15.
- the structure data association processing unit 15 inputs new structure data into which the input structure data is integrated into the storage unit 3.
- the storage unit 3 updates the input structure data by overwriting the existing structure data 51.
- The recognition function registration unit 17 adds a new recognition function unit to the unstructured data storage device 1 by executing the recognition function registration processing, and registers, in the data distribution management unit 16, subscriber information for distributing URLs to that recognition function unit.
- The configurations of the computer, the processing units, and the processing means described in the present invention may be partially or entirely realized by dedicated hardware.
- The various software exemplified in the present embodiment can be stored in various recording media such as electromagnetic, electronic, and optical media (for example, non-transitory storage media), and can be downloaded to a computer through a communication network such as the Internet.
- the present invention is not limited to the above-described embodiment, and includes various modifications.
- For example, although the above embodiments assume a computer system that stores unstructured data, the present invention can also be applied to a portable information management system in which a portable device has the functions of the management server 32 and the storage server 31 and a recognition server is placed on the cloud.
- the present invention can be applied to apparatuses and systems having various configurations.
Landscapes
- Engineering & Computer Science (AREA)
- Databases & Information Systems (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Information Transfer Between Computers (AREA)
Abstract
A framework that associates a storage computer with a recognition system is generally complex. The reason is that it is necessary to consider many such matters as a database that stores recognition results, a function that reports that recognition with respect to data has completed, throughput if a large quantity of data is entered at the same time, and coordination among a plurality of recognition systems. The present invention is a computer which manages unstructured data and structural data, wherein the computer is characterized in being provided with: a recognition unit which, with respect to the unstructured data, performs recognition processing of predetermined data types using predetermined dictionaries; and a structural data generation unit which, as a result of the recognition processing performed by the recognition unit, generates structural data which includes identification information of the recognition unit and identification information of the dictionaries used by the recognition unit.
Description
本発明は、記憶装置に格納される非構造データに対して認識処理を実行し、記憶装置に、認識処理の結果を含むメタデータを生成する計算機、システム、及び方法に関する。
The present invention relates to a computer, a system, and a method for executing recognition processing on unstructured data stored in a storage device and generating metadata including the result of recognition processing in the storage device.
非構造データからの情報抽出作業の自動化は、大量データを扱う多くの事業者から求められている。非構造データから情報を抽出するためには、画像認識、音声認識、及び文書構造認識といった技術が必要となる。さらに、大規模な記憶装置と認識システムとを連携させる仕組みも重要となる。
Automating information extraction from unstructured data is required by many businesses that handle large amounts of data. In order to extract information from unstructured data, techniques such as image recognition, speech recognition, and document structure recognition are required. Furthermore, a mechanism for linking a large-scale storage device and a recognition system is also important.
記憶装置と認識システムとを連携させる仕組みの一例としては、映像データ及び音声データを個別に処理し、オブジェクトデータとメタデータとを関連付けてデータベースに格納する方法が開示されている(例えば、特許文献1参照)。
As an example of a mechanism for linking a storage device and a recognition system, a method is disclosed in which video data and audio data are individually processed, and object data and metadata are associated with each other and stored in a database (for example, Patent Documents). 1).
しかし、特許文献1に開示されるシステムは、映像データ及び音声データ専用のシステムであって、文書等のデータ種別が異なるデータを格納する記憶装置と連動させ、文書も認識できるように構成することは困難である。
However, the system disclosed in Patent Document 1 is a system dedicated to video data and audio data, and is configured to be able to recognize a document in conjunction with a storage device that stores data of different data types such as a document. It is difficult.
また、記憶装置と認識システムとを連携させる仕組みは、一般に複雑である。なぜならば、認識結果を格納するデータベース、データに対して認識が完了したことを通知する機能、大量のデータが同時に入力された場合のスループット、及び複数の認識システム間の連動等、多くの事項を考慮する必要があるためである。
Also, the mechanism for linking the storage device and the recognition system is generally complicated. This is because there are many items such as a database for storing recognition results, a function for notifying that data has been recognized, throughput when a large amount of data is input simultaneously, and linkage between multiple recognition systems. This is because it is necessary to consider.
本発明は、このような点に鑑みてなされたものであり、その目的は、記憶装置と任意の認識システムとを柔軟に連携させることが可能な装置、システム、及び方法を提供することにある。
The present invention has been made in view of these points, and an object thereof is to provide an apparatus, a system, and a method capable of flexibly linking a storage device and an arbitrary recognition system. .
本願において開示される発明の代表的な一例を示せば以下の通りである。すなわち、一定のデータ構造を有さない非構造データ及び一定のデータ構造を有する構造データを管理する計算機であって、前記計算機は、プロセッサ、前記プロセッサに接続されるメモリ、前記プロセッサに接続される記憶デバイス、及び前記プロセッサに接続されるI/Oインタフェースを備え、前記非構造データに対して、所定の辞書を用いて所定のデータ種別の認識処理を実行する少なくとも一つの認識部と、前記認識部が実行する認識処理の結果、前記認識部の識別情報、及び前記認識部が使用した辞書の識別情報を含む前記構造データを生成する構造データ生成部と、を備えることを特徴とする。
A typical example of the invention disclosed in the present application is as follows. That is, a computer that manages unstructured data that does not have a fixed data structure and structured data that has a fixed data structure, the computer being connected to a processor, a memory connected to the processor, and the processor A storage device; and an I / O interface connected to the processor; and at least one recognition unit that executes recognition processing of a predetermined data type using a predetermined dictionary for the unstructured data; and the recognition And a structural data generating unit that generates the structural data including identification information of the recognizing unit and identification information of a dictionary used by the recognizing unit as a result of recognition processing executed by the recognizing unit.
本発明によれば、非構造データに対する認識処理の結果、認識処理の識別情報、及び認識処理に用いられた辞書の識別情報を含む構造データを生成することによって、検索システムとの連動、複数の認識システムの同時稼働、不要な認識処理の抑止、及び複数の認識システムから出力される認識結果の統合等、非構造データに対する認識処理の結果を用いた様々な制御が可能となる。
According to the present invention, as a result of recognition processing for non-structural data, structure data including identification information for recognition processing and identification information for a dictionary used for recognition processing is generated. Various controls using the results of recognition processing on unstructured data, such as simultaneous operation of recognition systems, suppression of unnecessary recognition processing, and integration of recognition results output from a plurality of recognition systems, are possible.
前述した以外の課題、構成及び効果は、以下の実施形態の説明によって明らかにされる。
Issues, configurations, and effects other than those described above will be clarified by the following description of the embodiments.
以下、実施例を、図面を用いて説明する。
Hereinafter, examples will be described with reference to the drawings.
本実施例では、画像や音声を含む非構造データを格納する記憶装置の例を説明する。
In this embodiment, an example of a storage device that stores unstructured data including images and sounds will be described.
図1は、本発明の実施例1における計算機システムの構成例を示す説明図である。
FIG. 1 is an explanatory diagram showing a configuration example of a computer system according to the first embodiment of the present invention.
実施例1の計算機システムは、記憶サーバ31、管理サーバ32、映像サーバ33、及び音声サーバ34から構成される。記憶サーバ31、管理サーバ32、映像サーバ33、及び音声サーバ34は、中継装置38を介して互いに接続される。なお、計算機システムは、ユーザ等が使用する端末を備えてもよい。
The computer system according to the first embodiment includes a storage server 31, a management server 32, a video server 33, and an audio server 34. The storage server 31, the management server 32, the video server 33, and the audio server 34 are connected to each other via the relay device 38. Note that the computer system may include a terminal used by a user or the like.
以下では、記憶サーバ31、管理サーバ32、映像サーバ33、及び音声サーバ34を区別しない場合、サーバとも記載する。
Hereinafter, when the storage server 31, the management server 32, the video server 33, and the audio server 34 are not distinguished, they are also referred to as servers.
本実施例の記憶サーバ31は、CPU35、メモリ36、通信装置37、及び記憶デバイス39を有する。記憶デバイス39は、例えば、HDD(Hard Disk Drive)及びSSD(Solid State Drive)等が考えられる。なお、記憶サーバ31は、制御部、I/Oインタフェース、及び複数の記憶デバイスを有する外部ストレージ装置と接続されてもよい。
The storage server 31 of this embodiment includes a CPU 35, a memory 36, a communication device 37, and a storage device 39. As the storage device 39, for example, an HDD (Hard Disk Drive) and an SSD (Solid State Drive) can be considered. The storage server 31 may be connected to an external storage apparatus having a control unit, an I / O interface, and a plurality of storage devices.
また、本実施例の管理サーバ32、映像サーバ33、及び音声サーバ34は同一のハードウェア構成である。具体的には、管理サーバ32、映像サーバ33、及び音声サーバ34は、CPU35、メモリ36、通信装置37を有する。
Further, the management server 32, the video server 33, and the audio server 34 of the present embodiment have the same hardware configuration. Specifically, the management server 32, the video server 33, and the audio server 34 include a CPU 35, a memory 36, and a communication device 37.
CPU35は、メモリ36に格納されたプログラムを実行する。CPU35がプログラムを実行することによってサーバが備える機能を実現することができる。メモリ36は、CPU35によって実行されるプログラム及び当該プログラムを実行するために必要な各種情報を格納する。通信装置37は、他のサーバと通信するための装置である。通信装置37は、例えば、ネットワークインタフェース等が考えられる。
CPU 35 executes a program stored in memory 36. The functions of the server can be realized by the CPU 35 executing the program. The memory 36 stores a program executed by the CPU 35 and various information necessary for executing the program. The communication device 37 is a device for communicating with other servers. The communication device 37 may be a network interface, for example.
CPU35によって実行されるプログラムは、通信装置37を用いて、他のサーバと通信することによって、互いにデータを送受信する。
The program executed by the CPU 35 transmits / receives data to / from each other by communicating with other servers using the communication device 37.
なお、記憶サーバ31、管理サーバ32、映像サーバ33、及び音声サーバ34のソフトウェア構成については後述する。
Note that the software configurations of the storage server 31, the management server 32, the video server 33, and the audio server 34 will be described later.
中継装置38は、任意の装置からデータを受信し、他の装置に受信したデータを送信することによって、装置間のデータの送受信を中継する。なお、中継装置38は、CPU(図示省略)、メモリ(図示省略)、及び通信装置(図示省略)を有する。
The relay device 38 receives data from an arbitrary device and relays data transmission / reception between devices by transmitting the received data to other devices. The relay device 38 includes a CPU (not shown), a memory (not shown), and a communication device (not shown).
記憶サーバ31は、各種データを格納する計算機である。記憶サーバ31のメモリ36には、データ受付部2、記憶部3、データ参照部4、構造データ参照部5を実現するプログラムが格納される。また、記憶サーバ31の記憶デバイス39には、非構造データ50、構造データ51、及び関連情報52が格納される。
The storage server 31 is a computer that stores various data. The memory 36 of the storage server 31 stores programs for realizing the data receiving unit 2, the storage unit 3, the data reference unit 4, and the structural data reference unit 5. Further, the storage device 39 of the storage server 31 stores unstructured data 50, structured data 51, and related information 52.
データ受付部2は、ユーザ等から、記憶サーバ31に格納するデータを受け付ける。記憶部3は、受け付けたデータを記憶デバイス39に格納する。
The data receiving unit 2 receives data stored in the storage server 31 from a user or the like. The storage unit 3 stores the received data in the storage device 39.
データ参照部4は、ユーザ等からの指示にしたがって、記憶デバイス39に格納された非構造データ50を応答として返す。構造データ参照部5は、ユーザ等からの指示にしたがって、記憶デバイス39に格納された構造データ51を応答として返す。
The data reference unit 4 returns the unstructured data 50 stored in the storage device 39 as a response in accordance with an instruction from the user or the like. The structure data reference unit 5 returns the structure data 51 stored in the storage device 39 as a response in accordance with an instruction from the user or the like.
非構造データ50は、構造が定義されていないデータであり、データベースで容易に管理できないデータである。構造データ51は、構造が定義されたデータあり、データベースで容易に管理可能な形式のデータである。なお、構造データ51は、非構造データ50のメタデータに対応する。
The unstructured data 50 is data whose structure is not defined and cannot be easily managed by the database. The structure data 51 is data in which a structure is defined, and is in a format that can be easily managed in a database. The structural data 51 corresponds to the metadata of the unstructured data 50.
関連情報52は、非構造データ50及び構造データ51との対応関係を管理する情報である。
The related information 52 is information for managing the correspondence relationship between the non-structure data 50 and the structure data 51.
The management server 32 is a computer that manages the data stored in the storage server 31. The memory 36 of the management server 32 stores programs that implement the crawling processing unit 6, the data distribution unit 7, the audio filter unit 8, the audio recognition unit 9, the audio post-processing unit 10, the video filter unit 11, the video recognition unit 12, the video post-processing unit 13, the recognition result receiving unit 14, the structure data association processing unit 15, the data distribution management unit 16, and the recognition function registration unit 17.
The crawling processing unit 6 extracts the unstructured data 50 to be processed from the unstructured data 50 stored in the storage device 39. The data distribution unit 7 transmits the extracted unstructured data 50 to a predetermined recognition function unit or device.
The audio filter unit 8 determines whether audio recognition processing should be executed on the unstructured data 50. The audio recognition unit 9 executes audio recognition processing on the unstructured data 50 and outputs the recognition result. The audio post-processing unit 10 converts the recognition result output by the audio recognition unit 9 into data in a format that can be added to the structure data 51.
The video filter unit 11 determines whether video recognition processing should be executed on the unstructured data 50. The video recognition unit 12 executes video recognition processing on the unstructured data 50 and outputs the recognition result. The video post-processing unit 13 converts the recognition result output by the video recognition unit 12 into data in a format that can be added to the structure data 51.
The recognition result receiving unit 14 receives the recognition results output by the audio post-processing unit 10 and the video post-processing unit 13 and holds them temporarily.
The structure data association processing unit 15 reflects the recognition result for the unstructured data 50 in the currently stored structure data 51.
The data distribution management unit 16 manages information used to determine the recognition function units to which the data distribution unit 7 distributes data.
The recognition function registration unit 17 executes processing for adding a new recognition function unit.
The video server 33 is a computer that executes video recognition processing. The memory 36 of the video server 33 stores programs that implement the video dictionary unit 19 and the video recognition processing unit 42. The video dictionary unit 19 manages the dictionaries used for video recognition processing, and the video recognition processing unit 42 executes the video recognition processing. Since known techniques can be used for the video recognition processing itself, its description is omitted.
The audio server 34 is a computer that executes audio recognition processing. The memory 36 of the audio server 34 stores programs that implement the audio dictionary unit 18 and the audio recognition processing unit 43. The audio dictionary unit 18 manages the dictionaries used for audio recognition processing, and the audio recognition processing unit 43 executes the audio recognition processing. Since known techniques can be used for the audio recognition processing itself, its description is omitted.
FIG. 2 is an explanatory diagram showing an example of the related information 52 in the first embodiment of the present invention.
The related information 52 stores information for managing, in a unified manner, the unstructured data 50 and the structure data 51 associated with it. Specifically, the related information 52 includes a URL 61, an unstructured data path 62, a structure data path 63, and an update time 64.
The URL 61 stores the URL (Uniform Resource Locator) used to access the unstructured data 50 or the structure data 51 stored in the storage server 31.
The unstructured data path 62 stores the path name of the storage area in which the unstructured data 50 is stored. The structure data path 63 stores the path name of the storage area in which the structure data 51 is stored. The update time 64 stores the time at which the corresponding data was stored or updated.
In the present invention, by holding the related information 52, the storage server 31 can manage a single URL in association with both the unstructured data 50 and the structure data 51.
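As a concrete illustration only, the related information 52 can be pictured as a table keyed by URL 61. The following minimal sketch in Python uses hypothetical field names; the actual storage format of the related information is not prescribed by this embodiment.

```python
from dataclasses import dataclass
from typing import Dict, Optional

@dataclass
class RelatedInfoEntry:
    """One entry of the related information 52 (illustrative field names)."""
    url: str                            # URL 61 used to access the data
    unstructured_data_path: str         # unstructured data path 62
    structure_data_path: Optional[str]  # structure data path 63 (None until structure data exists)
    update_time: float                  # update time 64, e.g. UNIX epoch seconds

# One URL 61 resolves to both the unstructured data 50 and its structure data 51.
related_info: Dict[str, RelatedInfoEntry] = {}
```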
Next, the processing of the computer system in this embodiment will be described. The processing of the system is divided into seven processes: data storage processing, data reference processing, structure data reference processing, data crawling processing, data recognition processing, structure data association processing, and recognition function registration processing.
The characteristic processing of this embodiment is as follows.
In the data recognition processing, predetermined recognition processing is executed on the stored unstructured data 50. At this time, the storage server 31 and the management server 32 cooperate with each other to generate structure data from the result of the recognition processing.
In the structure data association processing, the management server 32 reflects the newly generated structure data in the structure data 51 associated with the unstructured data 50.
First, the data storage processing in this embodiment will be described.
FIG. 3 is a flowchart illustrating the data storage processing in the first embodiment of the present invention. FIG. 4 is an explanatory diagram showing an example of the structure data in the first embodiment.
When the storage server 31 receives unstructured data from an external device such as a PC or another server, it starts the data storage processing.
The data receiving unit 2 receives the unstructured data transmitted from the external device via the relay device 38 (step S101). The data receiving unit 2 receives, for example, unstructured data transmitted using HTTP (HyperText Transfer Protocol). The present invention does not depend on the type of unstructured data; the data receiving unit 2 can receive arbitrary files (unstructured data) such as documents, images, audio, and video.
Next, the data receiving unit 2 generates a URL for accessing the received unstructured data (step S102).
One way to generate the URL is to use the URL specified in the HTTP request as it is. The data receiving unit 2 may also generate the URL from the name of the transmitted file, its extension, the time of receipt, and so on. In that case, a URL such as "http://server/wav/20120401.wav" is generated, for example.
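A minimal sketch of this URL generation step, assuming the file name and receipt time are available; the path layout simply follows the "http://server/wav/20120401.wav" example above and is not a fixed naming rule.

```python
from datetime import datetime
from pathlib import Path

def generate_url(server: str, filename: str, received_at: datetime) -> str:
    """Step S102: build an access URL from the file extension and the receipt time."""
    ext = Path(filename).suffix.lstrip(".") or "bin"      # e.g. "wav", "mpg"
    return f"http://{server}/{ext}/{received_at:%Y%m%d%H%M%S}.{ext}"

# generate_url("server", "news.wav", datetime(2012, 4, 1))
#   -> "http://server/wav/20120401000000.wav"
```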
Next, the storage unit 3 stores the received unstructured data in the storage device 39 (step S103) and updates the related information 52 (step S104). The storage server 31 then ends the processing. Specifically, the following processing is executed.
The storage unit 3 adds a new entry to the related information 52 and stores the URL generated in step S102 in the URL 61 of that entry. The storage unit 3 also stores the path name under which the received unstructured data is stored in the unstructured data path 62 of the added entry, and stores the time at which the unstructured data was stored in the update time 64.
At this point the structure data path 63 is left blank, because structure data is normally not included when the unstructured data is first stored.
However, the data receiving unit 2 can also accept arbitrary structure data together with the unstructured data. For example, structure data containing information such as the owner of the unstructured data, as shown in FIG. 4, may be attached to the unstructured data. In this case, in step S103 the storage unit 3 stores both the unstructured data and the structure data in the storage device 39, and in step S104 it stores the path name under which the structure data is stored in the structure data path 63 of the added entry.
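Putting steps S101 to S104 together, a simplified sketch of the data storage processing might look as follows. It reuses the illustrative RelatedInfoEntry and generate_url above, and the local directory layout is an assumption made only for this sketch.

```python
import time
from datetime import datetime
from pathlib import Path
from typing import Optional

def store_unstructured_data(server: str, filename: str, payload: bytes,
                            structure_xml: Optional[str] = None) -> str:
    """Steps S101-S104: store the received file, store any attached structure data,
    and register both in the related information 52. Returns the generated URL."""
    url = generate_url(server, filename, datetime.now())            # step S102

    data_path = Path("/data/unstructured") / Path(filename).name    # step S103
    data_path.parent.mkdir(parents=True, exist_ok=True)
    data_path.write_bytes(payload)

    structure_path = None
    if structure_xml is not None:                                    # optional attached structure data
        structure_path = Path("/data/structure") / (Path(filename).stem + ".xml")
        structure_path.parent.mkdir(parents=True, exist_ok=True)
        structure_path.write_text(structure_xml, encoding="utf-8")

    related_info[url] = RelatedInfoEntry(                            # step S104
        url=url,
        unstructured_data_path=str(data_path),
        structure_data_path=str(structure_path) if structure_path else None,
        update_time=time.time(),
    )
    return url
```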
As described above, the storage unit 3 stores the unstructured data 50 in association with a URL in the data storage processing, which makes the following data reference processing and structure data reference processing possible.
In the data reference processing, the data reference unit 4 searches the URL 61 of the related information 52 for the entry corresponding to the URL specified by the user. The data reference unit 4 then refers to the unstructured data path 62 of the retrieved entry, acquires the unstructured data 50, and returns the acquired unstructured data 50 to the user.
In the structure data reference processing, the structure data reference unit 5 searches the URL 61 of the related information 52 for the entry corresponding to the URL specified by the user. The structure data reference unit 5 then refers to the structure data path 63 of the retrieved entry, acquires the structure data 51, and returns the acquired structure data 51 to the user.
For example, the system can be configured to return, over HTTP, the unstructured data 50 or the structure data 51 retrieved for the requested URL. When the unstructured data 50 is returned over HTTP in the data reference processing, the system can also be configured so that the data reference unit 4 returns the unstructured data 50 together with an HTTP header carrying the content type (data type) of the unstructured data 50. When only the HTTP header is requested, the data reference unit 4 may return only the content type without returning the unstructured data 50 itself.
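The data reference processing described above can be pictured as a small HTTP handler; the sketch below uses Python's standard library, guesses the content type from the file extension, and looks entries up in the illustrative related_info table. The URL matching and MIME mapping shown are assumptions, not part of the embodiment.

```python
import mimetypes
from http.server import BaseHTTPRequestHandler
from pathlib import Path

class DataReferenceHandler(BaseHTTPRequestHandler):
    """GET returns the unstructured data 50 with its content type in the HTTP header;
    HEAD returns only the header, i.e. only the content type."""

    def _entry(self):
        return related_info.get(f"http://{self.headers.get('Host', 'server')}{self.path}")

    def _send_headers(self, entry, length=None):
        ctype, _ = mimetypes.guess_type(entry.unstructured_data_path)
        self.send_response(200)
        self.send_header("Content-Type", ctype or "application/octet-stream")
        if length is not None:
            self.send_header("Content-Length", str(length))
        self.end_headers()

    def do_HEAD(self):
        entry = self._entry()
        if entry is None:
            self.send_error(404)
        else:
            self._send_headers(entry)          # content type only, no body

    def do_GET(self):
        entry = self._entry()
        if entry is None:
            self.send_error(404)
            return
        body = Path(entry.unstructured_data_path).read_bytes()
        self._send_headers(entry, len(body))
        self.wfile.write(body)
```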
Next, the data crawling processing in this embodiment will be described.
FIG. 5 is a flowchart illustrating the data crawling processing in the first embodiment of the present invention.
The management server 32 executes the data crawling processing repeatedly, for example periodically or when an instruction is received from a user or the like.
The crawling processing unit 6 queries the storage unit 3 of the storage server 31 and acquires a list of URLs 61 from the related information 52 (step S201). In other words, the unstructured data 50 to be processed is extracted.
In this embodiment, only the URLs 61 associated with newly stored unstructured data 50 are extracted. The crawling processing unit 6 therefore includes the target time in its query. On receiving the query, the storage unit 3 refers to the update times 64 stored in the related information 52, lists only the URLs 61 of the newly updated data, and returns the list of URLs 61 to the crawling processing unit 6.
To issue this query, the crawling processing unit 6 temporarily holds the latest update time 64 seen in the previous list of URLs 61 and queries for URLs 61 whose update time is later than that time.
When a large amount of unstructured data is stored within a short period, the list of URLs 61 may become very large. In that case, the storage unit 3 may list only a predetermined number of URLs 61, starting from the oldest update time 64. Because the data crawling processing is executed repeatedly after waiting a fixed time, as described below, it is not necessary to list all target URLs 61 at once.
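A minimal sketch of this incremental query over the illustrative related_info table; the batch size of 100 is an arbitrary example of the cap on list size mentioned above.

```python
from typing import List, Tuple

def crawl_new_urls(since: float, limit: int = 100) -> Tuple[List[str], float]:
    """Step S201: return up to `limit` URLs 61 updated after `since`, oldest first,
    together with the newest update time seen (kept for the next query)."""
    candidates = [e for e in related_info.values() if e.update_time > since]
    candidates.sort(key=lambda e: e.update_time)         # oldest update time 64 first
    batch = candidates[:limit]
    next_since = batch[-1].update_time if batch else since
    return [e.url for e in batch], next_since
```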
Next, the data distribution unit 7 distributes the list of URLs 61 acquired by the crawling processing unit 6 to the predetermined recognition function units (step S202).
Here, a recognition function unit is a functional unit that executes recognition processing, and consists of a filter unit, a recognition unit, a dictionary unit, and a post-processing unit.
The filter unit determines, based on the URL 61, whether the unstructured data 50 is a recognition target.
The recognition unit acquires the unstructured data 50 from the storage server 31 based on the URL 61 and executes recognition processing on the acquired unstructured data 50 using the dictionary data held by the dictionary unit.
The post-processing unit generates structure data from the recognition result; that is, the post-processing unit corresponds to the functional unit that generates structure data (the structure data generation unit). Specifically, the post-processing unit converts the recognition result describing the contents of the unstructured data 50 into data with a fixed structure, and generates the structure data by attaching to that data an ID unique to the recognition processing and the ID of the dictionary that was used.
In this embodiment the recognition result is converted into XML-format data, but the present invention is not limited to this; any data format having at least a fixed structure may be used.
Specifically, the audio recognition function unit, which performs audio recognition processing, consists of the audio filter unit 8, the audio recognition unit 9, the audio recognition processing unit 43, the audio dictionary unit 18, and the audio post-processing unit 10. The video recognition function unit, which performs video recognition processing, consists of the video filter unit 11, the video recognition unit 12, the video recognition processing unit 42, the video dictionary unit 19, and the video post-processing unit 13.
In this embodiment, a publish/subscribe model is used as the messaging model for distributing the URLs 61. Specifically, the audio filter unit 8 and the video filter unit 11, to which messages are to be delivered, are registered in advance in the data distribution management unit 16 as subscriber information. The data distribution unit 7 distributes the list of URLs 61 as a message to the audio filter unit 8 and the video filter unit 11 based on the subscriber information registered in the data distribution management unit 16.
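A minimal sketch of this publish/subscribe distribution, assuming an in-process broker; in the embodiment the subscribers are the audio filter unit 8 and the video filter unit 11, and each message carries a list of URLs 61.

```python
from typing import Callable, List

class DataDistributionManager:
    """Keeps the subscriber information (data distribution management unit 16)."""
    def __init__(self) -> None:
        self.subscribers: List[Callable[[List[str]], None]] = []

    def register(self, filter_callback: Callable[[List[str]], None]) -> None:
        # Used by the recognition function registration unit 17 (step S503).
        self.subscribers.append(filter_callback)

class DataDistributionUnit:
    """Publishes URL lists to every registered filter unit (data distribution unit 7)."""
    def __init__(self, manager: DataDistributionManager) -> None:
        self.manager = manager

    def publish(self, url_list: List[str]) -> None:       # step S202
        for deliver in self.manager.subscribers:           # every subscriber gets the same list
            deliver(url_list)
```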
Finally, the crawling processing unit 6 waits for a fixed time (step S203) and then returns to step S201 to execute the same processing again.
Through the data crawling processing described above, the URLs 61 associated with the unstructured data 50 stored in the storage device 39 are notified to each recognition function unit. This processing also makes it possible to distribute URLs 61 repeatedly, every time new unstructured data is stored in the storage server 31.
Next, the data recognition processing in this embodiment will be described.
FIG. 6 is a flowchart illustrating the data recognition processing in the first embodiment of the present invention. FIG. 7 is an explanatory diagram showing an example of structure data in which a structured recognition result has been reflected in the first embodiment. FIGS. 8 and 9 are explanatory diagrams showing examples of structured recognition results in the first embodiment.
Each recognition function unit starts processing when it receives a list of URLs 61. The audio recognition function unit and the video recognition function unit are described below as examples.
The audio filter unit 8 and the video filter unit 11 receive the list of URLs 61 transmitted from the data distribution unit 7 (step S301).
Because the list of URLs 61 is distributed using the publish/subscribe model in the data crawling processing described above, each filter unit receives the same list of URLs 61. This makes it possible, for example, to execute multiple recognition processes, such as audio recognition processing and video recognition processing, on a single piece of moving image data.
The audio filter unit 8 and the video filter unit 11 select one URL 61 from the list of URLs 61 and execute the following processing on the selected URL 61.
Next, the audio filter unit 8 and the video filter unit 11 determine, based on the type of the unstructured data 50 corresponding to the selected URL 61, whether that unstructured data 50 is a recognition target (step S302).
For example, the audio filter unit 8 and the video filter unit 11 can determine the content type (data type) of the unstructured data 50 from the extension in the URL 61. In this case, the audio filter unit 8 treats unstructured data 50 whose URL 61 ends in ".wav" or ".mpg" as a recognition target, and the video filter unit 11 treats unstructured data 50 whose URL ends in ".mpg" as a recognition target.
As another method, the audio filter unit 8 and the video filter unit 11 may execute the data reference processing based on the URL 61 to obtain the content type of the unstructured data 50 and determine, from the obtained content type, whether the unstructured data 50 is a recognition target.
As yet another method, the audio filter unit 8 and the video filter unit 11 may execute the data reference processing based on the URL 61 to acquire the unstructured data 50 itself and determine, from the result of analyzing the acquired unstructured data 50, whether it is a recognition target. One way to analyze the acquired unstructured data 50 is to examine its header portion and so on to determine its content type.
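A sketch of the extension-based check in step S302, under the assumption stated above that the audio filter accepts ".wav" and ".mpg" while the video filter accepts only ".mpg"; a real filter could instead fall back to the HTTP content type or to content analysis as described.

```python
AUDIO_EXTENSIONS = (".wav", ".mpg")
VIDEO_EXTENSIONS = (".mpg",)

def is_audio_target(url: str) -> bool:
    """Audio filter unit 8: decide from the extension in the URL 61 (step S302)."""
    return url.lower().endswith(AUDIO_EXTENSIONS)

def is_video_target(url: str) -> bool:
    """Video filter unit 11: decide from the extension in the URL 61 (step S302)."""
    return url.lower().endswith(VIDEO_EXTENSIONS)

# is_audio_target("http://server/wav/20120401.wav") -> True
# is_video_target("http://server/wav/20120401.wav") -> False
```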
If it is determined in step S302 that the unstructured data 50 corresponding to the URL 61 is not a recognition target, the recognition function unit ends the processing.
If it is determined in step S302 that the unstructured data 50 corresponding to the URL 61 is a recognition target, the audio filter unit 8 and the video filter unit 11 acquire the structure data 51 corresponding to the URL 61 (step S303). This can be done with the structure data reference processing described above.
Next, the audio filter unit 8 and the video filter unit 11 analyze the contents of the acquired structure data 51 and determine whether the unstructured data 50 has already been recognized (step S304).
Here, an example of how to determine whether the data has already been recognized will be described with reference to FIG. 7. FIG. 7 shows the structure data after the structure data association processing, described later, has been executed on the structure data shown in FIG. 4. Comparing FIG. 4 and FIG. 7, it can be seen that a metainfo tag has been added. In this embodiment, the structured recognition result is added under the metainfo tag.
The simplest way for the filter unit to determine whether the data has been recognized is to detect this tag. However, the tag may have been added by a different recognition process, so this alone is not sufficient for a correct determination.
Therefore, in this embodiment, an ID unique to the recognition processing is set in a processor_url tag inside the metainfo tag. The filter unit can then determine from this ID whether its own recognition processing has already been performed. That is, when the structure data 51 contains the ID unique to the corresponding recognition processing, the filter unit determines that the unstructured data 50 has already been recognized.
As another method, the time at which the recognition processing was completed can be set in a processed tag inside the metainfo tag. With this, for example, when recognition processing is to be executed again after the recognition function unit has been updated, the filter unit determines that the unstructured data 50 is a recognition target only when the completion time of the previous recognition processing is earlier than the update time of the recognition function unit.
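A sketch of this already-recognized check over the metainfo portion of the structure data 51, using Python's standard XML library. The processor_url and processed tag names follow the description above; the surrounding element layout and the ISO-format timestamp are assumptions.

```python
import xml.etree.ElementTree as ET
from datetime import datetime

def already_recognized(structure_xml: str, processor_id: str,
                       function_updated_at: datetime) -> bool:
    """Step S304: treat the data as recognized when a metainfo entry carries this
    recognizer's processor_url and its processed time is not older than the last
    update of the recognition function unit."""
    root = ET.fromstring(structure_xml)
    for metainfo in root.iter("metainfo"):
        url_tag = metainfo.find("processor_url")
        processed_tag = metainfo.find("processed")
        if url_tag is None or url_tag.text != processor_id:
            continue                          # result produced by a different recognition process
        if processed_tag is None or processed_tag.text is None:
            return True                       # no completion time recorded: treat as recognized
        if datetime.fromisoformat(processed_tag.text) >= function_updated_at:
            return True                       # recognized after the latest unit/dictionary update
    return False
```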
If it is determined in step S304 that the unstructured data 50 has already been recognized, the recognition function unit ends the processing.
If it is determined in step S304 that the unstructured data 50 has not been recognized, the audio recognition unit 9 and the video recognition unit 12 execute recognition processing on the unstructured data 50 corresponding to the URL 61 (step S305).
Specifically, the audio recognition unit 9 executes audio recognition processing on the unstructured data 50 in cooperation with the audio recognition processing unit 43 and the audio dictionary unit 18, and the video recognition unit 12 executes video recognition processing on the unstructured data 50 in cooperation with the video recognition processing unit 42 and the video dictionary unit 19.
In the audio recognition processing, audio data is received, and the words contained in the audio data, together with the start and end times of each word and the like, are output as the recognition result. In the video recognition processing, video data is received, and the names of the persons appearing in the video data, together with their appearance times and positions and the like, are output as the recognition result.
Audio recognition processing and video recognition processing are used here as examples, but the present invention can apply any kind of processing for recognizing unstructured data obtained from documents, images, audio, acceleration sensors, and so on.
In this embodiment, as described above, the video recognition unit 12 of the management server 32 and the video recognition processing unit 42 of the video server 33 cooperate to execute the video recognition processing, and the audio recognition unit 9 of the management server 32 and the audio recognition processing unit 43 of the audio server 34 cooperate to execute the audio recognition processing.
In general, video recognition processing and audio recognition processing take longer than processing such as message transfer. The system is therefore configured as described above so that executing the recognition processing on separate servers avoids degrading the processing performance of the system as a whole. A system configuration in which the management server 32 itself executes the recognition processing is also possible.
In this system configuration, the audio recognition unit 9 of the management server 32 executes the data reference processing to acquire the unstructured data 50 corresponding to the URL 61 and transmits the acquired unstructured data 50 to the audio server 34. The audio recognition processing unit 43 on the audio server 34 then generates a recognition result using the audio dictionary unit 18 and returns the generated recognition result to the management server 32, where the audio recognition unit 9 receives it. Similarly, the video recognition unit 12 cooperates with the video server 33, and the video recognition processing unit 42 generates a recognition result using the video dictionary unit 19.
Next, the audio post-processing unit 10 and the video post-processing unit 13 execute post-processing on the recognition results of the recognition processing (step S306).
Specifically, the audio post-processing unit 10 and the video post-processing unit 13 generate structure data containing the structured recognition result, the ID unique to the recognition processing, and the ID unique to the dictionary used for the recognition processing. The audio post-processing unit 10 and the video post-processing unit 13 can also include the completion time of the recognition processing in the structure data.
In this embodiment, the URL of the server that executes the recognition processing is used as the ID unique to the recognition processing. Here, the URL of the audio server 34 is "http://sound.hitachi.com/" and the URL of the video server 33 is "http://video.hitachi.com/". The ID unique to the recognition processing can also include the ID unique to the dictionary used for the recognition processing. If the system is configured so that the dictionary used for the recognition processing is also specified by URL, the recognition-processing ID that includes "tvnews", the ID of a dictionary held by the audio dictionary unit 18, is determined as "http://sound.hitachi.com/tvnews".
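A sketch of the post-processing in step S306, producing the kind of XML fragment that is later embedded under the metainfo tag. Apart from processor_url and processed, the element names and the word/time fields of the recognition result are illustrative assumptions.

```python
import xml.etree.ElementTree as ET
from datetime import datetime, timezone
from typing import List, Tuple

def build_structured_result(words: List[Tuple[str, float, float]],
                            processor_url: str, dictionary_id: str) -> str:
    """Audio post-processing unit 10 (step S306): wrap the raw recognition result in a
    metainfo element carrying the recognition-processing ID and the dictionary ID."""
    metainfo = ET.Element("metainfo")
    ET.SubElement(metainfo, "processor_url").text = processor_url + dictionary_id
    ET.SubElement(metainfo, "processed").text = datetime.now(timezone.utc).isoformat()
    result = ET.SubElement(metainfo, "result")
    for word, start, end in words:
        item = ET.SubElement(result, "word", start=str(start), end=str(end))
        item.text = word
    return ET.tostring(metainfo, encoding="unicode")

# build_structured_result([("weather", 1.2, 1.8)], "http://sound.hitachi.com/", "tvnews")
```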
As described later, the generated structure data is reflected in the original structure data 51, which is what makes it possible to determine in step S304 whether the unstructured data 50 has already been recognized.
The recognition results output by the individual recognition processing units may be in any format, but each post-processing unit generates structured data in a unified XML format in order to keep the configuration of the structure data association processing unit 15, described later, simple. FIG. 8 shows an example of the XML-format structure data generated by the audio post-processing unit 10, and FIG. 9 shows an example of the XML-format structure data generated by the video post-processing unit 13.
Next, the audio post-processing unit 10 and the video post-processing unit 13 transmit the structure data to the recognition result receiving unit 14 (step S307).
Here, the recognition result receiving unit 14 is assumed to have a queue so that it can receive structure data from multiple recognition function units. In this case, the audio post-processing unit 10 and the video post-processing unit 13 each send a message containing the structure data to the queue, and the URL 61 corresponding to the unstructured data 50 that was the target of the recognition processing is attached to the header of each message sent to the queue.
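A minimal sketch of step S307 using Python's standard queue module; modelling the message as a header/body pair so that the URL 61 travels in the header, as described above, is an illustrative choice rather than a prescribed format.

```python
import queue
from typing import Dict

# Queue held by the recognition result receiving unit 14.
recognition_result_queue: "queue.Queue[Dict]" = queue.Queue()

def send_recognition_result(target_url: str, structure_xml: str) -> None:
    """Step S307: a post-processing unit enqueues its structured result, carrying the
    URL 61 of the recognized unstructured data 50 in the message header."""
    recognition_result_queue.put({
        "header": {"url": target_url},
        "body": structure_xml,
    })
```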
Through the data recognition processing described above, structure data containing the recognition results for the unstructured data 50 stored in the storage server 31 accumulates in the queue of the recognition result receiving unit 14.
In this embodiment, because each of the recognition function units has its own filter unit, only the necessary recognition processing is executed.
Next, the structure data association processing in this embodiment will be described.
FIG. 10 is a flowchart illustrating the structure data association processing in the first embodiment of the present invention. FIG. 11 is an explanatory diagram showing an example of structure data in which multiple structured recognition results have been reflected in the first embodiment.
First, the recognition result receiving unit 14 takes the structure data accumulated in the queue (step S401). Here, assume that the structure data containing the audio recognition result is received earlier than the structure data containing the video recognition result. In this case, XML-format structure data such as that shown in FIG. 8 is taken from the queue.
Next, the structure data association processing unit 15 identifies the URL 61 corresponding to the unstructured data 50 that was recognized, and acquires the structure data 51 corresponding to the identified URL 61 from the storage server 31 by executing the structure data reference processing (step S402). Here, structure data 51 that does not yet contain a recognition result, such as that shown in FIG. 4, is acquired.
Next, the structure data association processing unit 15 integrates the structure data 51 acquired from the storage server 31 with the structure data taken from the queue (step S403).
Specifically, the structure data association processing unit 15 embeds the received structure data inside the structure data 51 acquired from the storage server 31, producing a single piece of XML-format structure data such as that shown in FIG. 7. The portion enclosed by the dotted frame in FIG. 7 is the embedded audio recognition result.
As for the method of embedding the received structure data, the structure data association processing unit 15 determines the position at which to embed it by analyzing the structure data 51 acquired from the storage server 31, for example by using a predetermined tag as a key to locate the embedding position. This method is only an example, and the present invention is not limited to it.
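A sketch of the integration in step S403 with the same standard XML library; appending the received metainfo fragment under an element found by a predetermined tag (falling back to the document root) is one possible reading of the embedding method described above, not the only one.

```python
import xml.etree.ElementTree as ET
from typing import Optional

def integrate_structure_data(existing_xml: str, received_metainfo_xml: str,
                             anchor_tag: Optional[str] = None) -> str:
    """Step S403: embed the received structured recognition result into the
    structure data 51 retrieved from the storage server 31."""
    root = ET.fromstring(existing_xml)
    fragment = ET.fromstring(received_metainfo_xml)
    anchor = root.find(anchor_tag) if anchor_tag else None   # locate position by a key tag
    (anchor if anchor is not None else root).append(fragment)
    return ET.tostring(root, encoding="unicode")
```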
Next, the structure data association processing unit 15 transmits the generated structure data to the storage unit 3 of the storage server 31 (step S404) and ends the processing.
At this point, the storage unit 3 overwrites the existing structure data 51 with the received structure data as the new structure data.
Through the structure data association processing described above, the recognition result of the recognition processing performed on the unstructured data 50 is stored as structure data 51 associated with the URL 61. Because this processing is executed every time a recognition result is received from a recognition function unit, multiple recognition results can be included in a single piece of structure data 51.
When the video recognition result is received after the audio recognition result, the following processing is executed.
In step S401, the structure data association processing unit 15 takes XML-format structure data such as that shown in FIG. 9 from the queue.
In step S402, the structure data association processing unit 15 acquires from the storage server 31 the structure data 51 that already contains the audio recognition result, such as that shown in FIG. 7.
In step S403, the structure data association processing unit 15 integrates the existing structure data with the acquired structure data to generate XML-format structure data such as that shown in FIG. 11. The portion enclosed by the dotted frame in FIG. 11 is the embedded video recognition result.
In step S404, the structure data association processing unit 15 transmits the structure data in which the video recognition result has been embedded to the storage server 31, and the storage server 31 overwrites the existing structure data 51 with the received structure data.
As described above, multiple recognition results are integrated into the structure data 51 by repeatedly executing the structure data association processing.
Next, the recognition function registration processing in this embodiment will be described.
FIG. 12 is a flowchart illustrating the recognition function registration processing in the first embodiment of the present invention.
The recognition function registration unit 17 receives the recognition function unit to be added (step S501). Specifically, the recognition function registration unit 17 receives a program that implements the new recognition function unit.
Here, the recognition function unit has the same configuration as the audio recognition function unit and the video recognition function unit described above; that is, it consists of a filter unit, a recognition processing unit, a dictionary unit, and a post-processing unit.
Next, the recognition function registration unit 17 adds the recognition processing unit by storing the received program in the memory 36 of the management server 32 (step S502).
Next, the recognition function registration unit 17 notifies the data distribution management unit 16 of the identification information of the received program, registers the recognition function unit implemented by the program as a subscriber to the messages distributed by the data distribution unit 7 (step S503), and ends the processing.
Through the above processing, the recognition function registration unit 17 can add an arbitrary recognition function unit to the computer system. Because the publish/subscribe model is used for the message processing of the data distribution unit 7, it can be guaranteed that adding a unit does not affect the processing of the existing recognition processing units.
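Against the illustrative broker sketched earlier, registering a new recognition function unit (step S503) amounts to adding one more subscriber, so existing subscribers are untouched. The document filter below is a hypothetical example of such an added unit.

```python
# Hypothetical filter for a newly added document-recognition function unit.
def document_filter(url_list):
    for url in url_list:
        if url.lower().endswith(".pdf"):
            print(f"document recognition candidate: {url}")

manager = DataDistributionManager()
distributor = DataDistributionUnit(manager)

manager.register(document_filter)          # step S503: register as a subscriber
distributor.publish(["http://server/pdf/report.pdf",
                     "http://server/wav/20120401.wav"])
```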
In the data recognition processing described above, the post-processing unit generates the structure data, but the present invention is not limited to this. For example, the following modifications are possible.
In step S306, the post-processing unit may generate a structured recognition result from the recognition result received from the recognition unit and transmit a message containing the structured recognition result to the recognition result receiving unit 14. In this case, the URL 61, the ID unique to the recognition processing, the ID unique to the dictionary, and the completion time of the recognition processing are attached to the message header, and the recognition result receiving unit 14 or the structure data association processing unit 15 generates the structure data from the received message.
In the structure data association processing described above, the structure data association processing unit 15 integrates the structure data with the existing structure data 51 every time structure data is stored in the queue, but the present invention is not limited to this. For example, the target recognition function units may be registered in advance, and the structure data association processing may be started only after structure data has been received from all of them. In this case, the structure data association processing unit 15 integrates the multiple pieces of structure data with the existing structure data 51 at once.
According to the first embodiment, the storage server 31 stores the received unstructured data and also stores the recognition result describing the contents of the unstructured data, associated with information unique to the recognition processing and information on the dictionary, as structure data accompanying the unstructured data. As a result, the recognition result for unstructured data can be managed as structure data associated with the same URL that is used when referring to the unstructured data.
Therefore, a database function for storing recognition results and a function for determining whether recognition processing has been completed can both be realized simply through URL-based access to the storage server 31.
Furthermore, when multiple recognition function units are operated together, there is no need to design a storage location for the recognition results or the relationships between the recognition function units, and the processing performance of the computer system when multiple recognition function units run simultaneously can easily be controlled according to the performance of each recognition function unit.
Furthermore, unnecessary recognition processing can be avoided when unstructured data is moved or copied, and also when a recognition function unit is added or updated.
Furthermore, when multiple recognition function units are operated together, the recognition results output by the multiple recognition function units for a single piece of unstructured data can be integrated into a single piece of XML-format structure data.
In the first embodiment, the storage of unstructured data was realized by the computer system as a whole. The second embodiment differs in that the storage of unstructured data is realized by a single device. The second embodiment is described below with a focus on the differences from the first embodiment.
FIG. 13 is a block diagram illustrating the configuration of the unstructured data storage device 1 in the second embodiment of the present invention.
The hardware configuration of the unstructured data storage device 1 is the same as that of the storage server 31, the management server 32, and the like: it has a CPU (not shown), a memory (not shown), a communication device (not shown), and a storage device (not shown).
The unstructured data storage device 1 includes the data receiving unit 2, the storage unit 3, the data reference unit 4, the structure data reference unit 5, the crawling processing unit 6, the data distribution unit 7, the audio filter unit 8, the audio recognition unit 9, the audio post-processing unit 10, the video filter unit 11, the video recognition unit 12, the video post-processing unit 13, the recognition result receiving unit 14, the structure data association processing unit 15, the data distribution management unit 16, the recognition function registration unit 17, the audio dictionary unit 18, and the video dictionary unit 19.
Here, the video recognition unit 12 has the functions realized in the first embodiment by the video recognition unit 12 of the management server 32 and the video recognition processing unit 42 of the video server 33. Similarly, the audio recognition unit 9 has the functions realized by the audio recognition unit 9 of the management server 32 and the audio recognition processing unit 43 of the audio server 34.
The rest of the configuration is the same as in the first embodiment, and its description is therefore omitted.
The unstructured data storage device 1 provides the user with a user interface for operating the data receiving unit 2, the data reference unit 4, the structure data reference unit 5, and the recognition function registration unit 17.
When the data receiving unit 2 receives unstructured data from the user, it executes the data storage processing in cooperation with the storage unit 3. When the data reference unit 4 receives from the user a reference request for unstructured data containing a URL, it executes the data reference processing. When the structure data reference unit 5 receives from the user a reference request for structure data containing a URL, it executes the structure data reference processing.
The crawling processing unit 6 and the data distribution unit 7 execute the data crawling processing periodically or upon receiving an instruction from the user. Specifically, the crawling processing unit 6 generates a list of URLs and passes the generated list to the data distribution unit 7. Based on the subscriber information stored in the data distribution management unit 16, the data distribution unit 7 passes the list of URLs to the filter units of the predetermined recognition function units. In the example shown in FIG. 13, the list of URLs is passed to at least one of the audio filter unit 8 and the video filter unit 11, which starts the data recognition processing.
The audio filter unit 8 and the video filter unit 11 determine whether the unstructured data 50 corresponding to a URL is a recognition target and whether recognition processing has already been executed for that unstructured data 50. Based on these determinations, the audio filter unit 8 and the video filter unit 11 request the audio recognition unit 9 and the video recognition unit 12 to execute their processing.
The audio recognition unit 9 executes audio recognition processing on the unstructured data 50 in cooperation with the audio dictionary unit 18 and passes the recognition result to the audio post-processing unit 10. The video recognition unit 12 executes video recognition processing on the unstructured data 50 in cooperation with the video dictionary unit 19 and passes the recognition result to the video post-processing unit 13.
The audio post-processing unit 10 generates structure data containing the recognition result, the ID unique to the audio recognition processing, and the completion time of the processing, and passes the structure data to the recognition result receiving unit 14. The video post-processing unit 13 generates structure data containing the recognition result, the ID unique to the video recognition processing, and the completion time of the processing, and passes the structure data to the recognition result receiving unit 14.
When structure data is received, the recognition result receiving unit 14 executes the structure data association processing in cooperation with the structure data association processing unit 15. The structure data association processing unit 15 passes the new structure data, into which the received structure data has been integrated, to the storage unit 3, and the storage unit 3 updates the existing structure data 51 by overwriting it with the new structure data.
The recognition function registration unit 17 adds a new recognition function unit to the unstructured data storage device 1 by executing the recognition function registration processing and registers in the data distribution management unit 16 the subscriber information used to distribute URLs to that recognition function unit.
The specific content of each process is the same as in the first embodiment, and its description is therefore omitted.
The configurations of the computers, processing units, processing means, and the like described in the present invention may be realized partly or entirely by dedicated hardware. The various software illustrated in these embodiments can be stored on various recording media (for example, non-transitory storage media) such as electromagnetic, electronic, and optical media, and can be downloaded to a computer through a communication network such as the Internet.
The present invention is not limited to the embodiments described above and includes various modifications. Although these embodiments assume a computer system that stores unstructured data, the invention can be applied to devices and systems of various configurations, for example a portable information management system in which a portable device has the functions of the management server 32 and the storage server 31 and the recognition servers are placed in the cloud.
Claims (15)
- A computer that manages unstructured data having no fixed data structure and structure data having a fixed data structure, wherein the computer comprises a processor, a memory connected to the processor, a storage device connected to the processor, and an I/O interface connected to the processor, and comprises: at least one recognition unit that executes recognition processing of a predetermined data type on the unstructured data using a predetermined dictionary; and a structure data generation unit that generates the structure data including the result of the recognition processing executed by the recognition unit, identification information of the recognition unit, and identification information of the dictionary used by the recognition unit.
- The computer according to claim 1, wherein the structure data generation unit generates structure data having a data structure that can be integrated with the structure data managed by the computer.
- The computer according to claim 2, further comprising a structure data association processing unit that generates new structure data by integrating the structure data related to the unstructured data with the structure data generated by the structure data generation unit.
前記計算機は、第1の構造データ生成部、及び第2の構造データ生成部を備え、
前記第1の構造データ生成部は、第1の構造データを生成し、
前記第2の構造データ生成部は、第2の構造データを生成し、
前記構造データ関連づけ処理部は、
前記第1の構造データ生成部から前記第1の構造データが入力された場合に、前記非構造データに関連する第3の構造データを取得し、
前記取得された第3の構造データと、前記入力された第1の構造データとを統合することによって、第4の構造データを生成し、
前記第4の構造データが格納された後に、前記第2の構造データ生成部から前記第2の構造データが入力された場合に、前記第4の構造データを取得し、
前記取得された第4の構造データと、前記入力された第2の構造データとを統合することによって、第5の構造データを生成することを特徴とする計算機。 The computer according to claim 3, wherein
The computer includes a first structure data generation unit and a second structure data generation unit,
The first structure data generation unit generates first structure data,
The second structure data generation unit generates second structure data,
The structural data association processing unit
When the first structure data is input from the first structure data generation unit, the third structure data related to the non-structure data is acquired,
Generating fourth structure data by integrating the acquired third structure data and the input first structure data;
After the fourth structure data is stored, when the second structure data is input from the second structure data generation unit, the fourth structure data is acquired,
A computer that generates fifth structure data by integrating the acquired fourth structure data and the input second structure data. - 請求項3に記載の計算機であって、
認識処理の対象となるデータの種別に応じて、前記認識部は複数設けられ、
前記計算機は、
前記非構造データに関連する前記構造データを参照して、前記非構造データが所定のデータ種別の認識処理の対象であるか否かを判定する複数のフィルタ部を備え、
前記複数のフィルタ部は、
前記複数の認識部のいずれかに対応づけられ、
前記構造データを参照して、前記非構造データが、前記対応づけられる認識部が対象とする所定のデータ種別を有するデータであるか否かを判定し、
前記構造データを参照して、前記対応づけられる認識部が前記非構造データに対する認識処理を完了したか否かを判定することを特徴とする計算機。 The computer according to claim 3, wherein
Depending on the type of data to be recognized, a plurality of recognition units are provided,
The calculator is
A plurality of filter units for referring to the structural data related to the non-structural data and determining whether the non-structural data is a target of recognition processing of a predetermined data type;
The plurality of filter units are:
Is associated with one of the plurality of recognition units,
With reference to the structure data, it is determined whether or not the non-structure data is data having a predetermined data type targeted by the associated recognition unit,
A computer characterized by referring to the structure data to determine whether or not the associated recognition unit has completed recognition processing for the non-structure data. - 請求項5に記載の計算機であって、
前記計算機は、
前記複数の認識部のうち、処理対象となる前記非構造データを入力する前記少なくとも一つの認識部に関する入力情報を管理するデータ入力管理部と、
前記入力情報を参照して、前記処理対象となる非構造データを入力する前記少なくとも一つの認識部を特定し、前記特定された認識部に、前記処理対象となる非構造データを入力するデータ入力部と、
を備えることを特徴とする計算機。 The computer according to claim 5, wherein
The calculator is
A data input management unit that manages input information related to the at least one recognition unit that inputs the unstructured data to be processed among the plurality of recognition units;
Data input for specifying the at least one recognition unit that inputs the non-structure data to be processed with reference to the input information and inputting the non-structure data to be processed to the specified recognition unit And
A computer comprising: - 請求項3に記載の計算機であって、
前記非構造データと、前記非構造データに関連する構造データとを対応づけて管理する記憶部を備え、
前記記憶部は、前記構造データ関連づけ処理部が入力した新たな構造データを、前記非構造データと対応づけて格納することを特徴とする計算機。 The computer according to claim 3, wherein
A storage unit that manages the unstructured data and the structure data related to the unstructured data in association with each other;
The storage unit stores the new structure data input by the structure data association processing unit in association with the non-structure data. - 複数の計算機を備える計算機システムであって、
前記複数の計算機の各々は、プロセッサ、前記プロセッサに接続されるメモリ、前記プロセッサに接続される記憶デバイス、及び前記プロセッサに接続されるI/Oインタフェースを備え、
前記複数の計算機は、一定のデータ構造を有さない非構造データ及び一定のデータ構造を有する構造データを管理するストレージサーバと、前記非構造データに対する所定の処理の結果を含む構造データを生成する管理サーバとを含み、
前記管理サーバは、
前記非構造データに対して、所定の辞書を用いて所定のデータ種別の認識処理を実行する少なくとも一つの認識部と、
前記認識部が実行する認識処理の結果、前記認識部の識別情報、及び前記認識部が使用した辞書の識別情報を含む前記構造データを生成する構造データ生成部と、を有することを特徴とする計算機システム。 A computer system comprising a plurality of computers,
Each of the plurality of computers includes a processor, a memory connected to the processor, a storage device connected to the processor, and an I / O interface connected to the processor,
The plurality of computers generate storage data for managing unstructured data not having a fixed data structure and structured data having a fixed data structure, and structured data including a result of predetermined processing on the unstructured data. Including a management server,
The management server
For the unstructured data, at least one recognition unit that executes a recognition process of a predetermined data type using a predetermined dictionary;
And a structural data generating unit that generates the structural data including identification information of the recognizing unit and identification information of a dictionary used by the recognizing unit as a result of recognition processing executed by the recognizing unit. Computer system. - 請求項8に記載の計算機システムであって、
前記構造データ生成部は、前記ストレージサーバが管理する前記構造データと統合可能なデータ構造の構造データを生成することを特徴とする計算機システム。 A computer system according to claim 8, wherein
The computer system characterized in that the structure data generation unit generates structure data having a data structure that can be integrated with the structure data managed by the storage server. - 請求項9に記載の計算機システムであって、
前記管理サーバは、前記非構造データに関連する前記構造データと、前記構造データ生成部によって生成された前記構造データとを統合することによって、新たな構造データを生成する構造データ関連づけ処理部を有することを特徴とする計算機システム。 A computer system according to claim 9, wherein
The management server includes a structure data association processing unit that generates new structure data by integrating the structure data related to the non-structure data and the structure data generated by the structure data generation unit. A computer system characterized by that. - 請求項10に記載の計算機システムであって、
前記管理サーバは、第1の構造データ生成部、及び第2の構造データ生成部を有し、
前記第1の構造データ生成部は、第1の構造データを生成し、
前記第2の構造データ生成部は、第2の構造データを生成し、
前記構造データ関連づけ処理部は、
前記第1の構造データ生成部から前記第1の構造データが入力された場合に、前記非構造データに関連する第3の構造データを取得し、
前記取得された第3の構造データと、前記入力された第1の構造データとを統合することによって、第4の構造データを生成し、
前記第4の構造データが格納された後に、前記第2の構造データ生成部から前記第2の構造データが入力された場合に、前記第4の構造データを取得し、
前記取得された第4の構造データと、前記入力された第2の構造データとを統合することによって、第5の構造データを生成することを特徴とする計算機システム。 A computer system according to claim 10, wherein
The management server has a first structure data generation unit and a second structure data generation unit,
The first structure data generation unit generates first structure data,
The second structure data generation unit generates second structure data,
The structural data association processing unit
When the first structure data is input from the first structure data generation unit, the third structure data related to the non-structure data is acquired,
Generating fourth structure data by integrating the acquired third structure data and the input first structure data;
After the fourth structure data is stored, when the second structure data is input from the second structure data generation unit, the fourth structure data is acquired,
5. A computer system, characterized in that fifth structure data is generated by integrating the acquired fourth structure data and the input second structure data. - 請求項10に記載の計算機システムであって、
認識処理の対象となるデータの種別に応じて、前記認識部は複数設けられ、
前記管理サーバは、前記非構造データに関連する前記構造データを参照して、前記非構造データが所定のデータ種別の認識処理の対象であるか否かを判定する複数のフィルタ部を有し、
前記複数のフィルタ部は、
前記複数の認識部のいずれかに対応づけられ、
前記構造データを参照して、前記非構造データが、前記対応づけられる認識部が対象とする所定のデータ種別を有するデータであるか否かを判定し、
前記構造データを参照して、前記対応づけられる認識部が前記非構造データに対する認識処理を完了したか否かを判定することを特徴とする計算機システム。 A computer system according to claim 10, wherein
Depending on the type of data to be recognized, a plurality of recognition units are provided,
The management server has a plurality of filter units that determine whether or not the unstructured data is a target of recognition processing of a predetermined data type with reference to the structure data related to the unstructured data,
The plurality of filter units are:
Is associated with one of the plurality of recognition units,
With reference to the structure data, it is determined whether or not the non-structure data is data having a predetermined data type targeted by the associated recognition unit,
A computer system that refers to the structural data and determines whether or not the associated recognition unit has completed recognition processing for the unstructured data. - 請求項12に記載の計算機システムであって、
前記管理サーバは、
処理対象となる前記非構造データが入力される少なくとも一つの前記認識部に関する入力情報を管理するデータ入力管理部と、
前記入力情報を参照して、前記処理対象となる非構造データを入力する少なくとも一つの認識部を特定し、前記特定された認識部に、前記処理対象となる非構造データを入力するデータ入力部と、を有することを特徴とする計算機システム。 A computer system according to claim 12, wherein
The management server
A data input management unit that manages input information related to at least one recognition unit to which the unstructured data to be processed is input;
A data input unit that refers to the input information, specifies at least one recognition unit that inputs the non-structure data to be processed, and inputs the non-structure data to be processed to the specified recognition unit And a computer system characterized by comprising: - 請求項10に記載の計算機システムであって、
前記ストレージサーバは、前記非構造データと、前記非構造データに関連する構造データとを対応づけて管理する記憶部を有し、
前記記憶部は、前記構造データ関連づけ処理部から前記新たな構造データが入力された場合に、前記新たな構造データを前記非構造データと対応づけて格納することを特徴とする計算機システム。 A computer system according to claim 10, wherein
The storage server has a storage unit that manages the unstructured data and the structure data related to the unstructured data in association with each other,
The storage system stores the new structural data in association with the non-structural data when the new structural data is input from the structural data association processing unit. - 一定のデータ構造を有さない非構造データ及び一定のデータ構造を有する構造データを管理する計算機におけるデータ管理方法であって、
前記計算機は、プロセッサ、前記プロセッサに接続されるメモリ、前記プロセッサに接続される記憶デバイス、及び前記プロセッサに接続されるI/Oインタフェースを備え、
前記方法は、
前記プロセッサが、前記非構造データに対して、データ種別毎に、所定の辞書を用いた複数の認識処理を実行するステップと、
前記プロセッサが、前記複数の認識処理毎に、前記認識処理の結果、前記認識処理の識別情報、及び前記認識処理において用いられた辞書の識別情報に基づいて、統合可能なデータ構造の構造データを生成するステップと、
前記プロセッサが、複数の前記構造データを統合することによって、新たな構造データを生成するステップと、
前記プロセッサが、前記非構造データと、前記新たな構造データとを対応づけて格納するステップと、を含むことを特徴とするデータ管理方法。 A data management method in a computer for managing unstructured data not having a fixed data structure and structured data having a fixed data structure,
The computer includes a processor, a memory connected to the processor, a storage device connected to the processor, and an I / O interface connected to the processor,
The method
The processor executing a plurality of recognition processes using a predetermined dictionary for each data type with respect to the unstructured data;
For each of the plurality of recognition processes, the processor obtains structure data having a data structure that can be integrated based on the result of the recognition process, the identification information of the recognition process, and the identification information of the dictionary used in the recognition process. Generating step;
The processor generates new structure data by integrating a plurality of the structure data; and
A data management method comprising the step of storing the unstructured data and the new structured data in association with each other.
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2014549661A JP5891313B2 (en) | 2012-11-27 | 2012-11-27 | Computer, computer system, and data management method |
PCT/JP2012/080591 WO2014083608A1 (en) | 2012-11-27 | 2012-11-27 | Computer, computer system, and data management method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
PCT/JP2012/080591 WO2014083608A1 (en) | 2012-11-27 | 2012-11-27 | Computer, computer system, and data management method |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2014083608A1 true WO2014083608A1 (en) | 2014-06-05 |
Family
ID=50827284
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/JP2012/080591 WO2014083608A1 (en) | 2012-11-27 | 2012-11-27 | Computer, computer system, and data management method |
Country Status (2)
Country | Link |
---|---|
JP (1) | JP5891313B2 (en) |
WO (1) | WO2014083608A1 (en) |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2004053645A2 (en) * | 2002-12-06 | 2004-06-24 | Attensity Corporation | Systems and methods for providing a mixed data integration service |
CN101086741A (en) * | 2006-06-09 | 2007-12-12 | 索尼株式会社 | Information processing apparatus and information processing method |
EP1883026A1 (en) * | 2006-07-26 | 2008-01-30 | Xerox Corporation | Reference resolution for text enrichment and normalization in mining mixed data |
US20080114725A1 (en) * | 2006-11-13 | 2008-05-15 | Exegy Incorporated | Method and System for High Performance Data Metatagging and Data Indexing Using Coprocessors |
US20080114724A1 (en) * | 2006-11-13 | 2008-05-15 | Exegy Incorporated | Method and System for High Performance Integration, Processing and Searching of Structured and Unstructured Data Using Coprocessors |
WO2008063974A2 (en) * | 2006-11-13 | 2008-05-29 | Exegy Incorporated | Method and system for high performance integration, processing and searching of structured and unstructured data using coprocessors |
- 2012-11-27 WO PCT/JP2012/080591 patent/WO2014083608A1/en active Application Filing
- 2012-11-27 JP JP2014549661A patent/JP5891313B2/en not_active Expired - Fee Related
Patent Citations (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2006509307A (en) * | 2002-12-06 | 2006-03-16 | アテンシティ コーポレーション | Providing system and providing method for mixed data integration service |
CA2508791A1 (en) * | 2002-12-06 | 2004-06-24 | Attensity Corporation | Systems and methods for providing a mixed data integration service |
AU2003297732A1 (en) * | 2002-12-06 | 2004-06-30 | Attensity Corporation | Systems and methods for providing a mixed data integration service |
US20040167870A1 (en) * | 2002-12-06 | 2004-08-26 | Attensity Corporation | Systems and methods for providing a mixed data integration service |
US20040167883A1 (en) * | 2002-12-06 | 2004-08-26 | Attensity Corporation | Methods and systems for providing a service for producing structured data elements from free text sources |
EP1588277A2 (en) * | 2002-12-06 | 2005-10-26 | Attensity Corporation | Systems and methods for providing a mixed data integration service |
WO2004053645A2 (en) * | 2002-12-06 | 2004-06-24 | Attensity Corporation | Systems and methods for providing a mixed data integration service |
CN101655867A (en) * | 2006-06-09 | 2010-02-24 | 索尼株式会社 | Information processing apparatus, information processing method |
CN101086741A (en) * | 2006-06-09 | 2007-12-12 | 索尼株式会社 | Information processing apparatus and information processing method |
EP1865426A2 (en) * | 2006-06-09 | 2007-12-12 | Sony Corporation | Information processing apparatus, information processing method, and computer program |
KR20070118038A (en) * | 2006-06-09 | 2007-12-13 | 소니 가부시끼 가이샤 | Information processing apparatus, information processing method, and computer program |
JP2007328675A (en) * | 2006-06-09 | 2007-12-20 | Sony Corp | Information processor, information processing method, and computer program |
US20080010060A1 (en) * | 2006-06-09 | 2008-01-10 | Yasuharu Asano | Information Processing Apparatus, Information Processing Method, and Computer Program |
EP1883026A1 (en) * | 2006-07-26 | 2008-01-30 | Xerox Corporation | Reference resolution for text enrichment and normalization in mining mixed data |
JP2008033931A (en) * | 2006-07-26 | 2008-02-14 | Xerox Corp | Method for enrichment of text, method for acquiring text in response to query, and system |
US20080027893A1 (en) * | 2006-07-26 | 2008-01-31 | Xerox Corporation | Reference resolution for text enrichment and normalization in mining mixed data |
US20080114725A1 (en) * | 2006-11-13 | 2008-05-15 | Exegy Incorporated | Method and System for High Performance Data Metatagging and Data Indexing Using Coprocessors |
US20080114724A1 (en) * | 2006-11-13 | 2008-05-15 | Exegy Incorporated | Method and System for High Performance Integration, Processing and Searching of Structured and Unstructured Data Using Coprocessors |
WO2008063974A2 (en) * | 2006-11-13 | 2008-05-29 | Exegy Incorporated | Method and system for high performance integration, processing and searching of structured and unstructured data using coprocessors |
WO2008063973A2 (en) * | 2006-11-13 | 2008-05-29 | Exegy Incorporated | Method and system for high performance data metatagging and data indexing using coprocessors |
EP2092419A2 (en) * | 2006-11-13 | 2009-08-26 | Exegy Incorporated | Method and system for high performance data metatagging and data indexing using coprocessors |
EP2092440A2 (en) * | 2006-11-13 | 2009-08-26 | Exegy Incorporated | Method and system for high performance integration, processing and searching of structured and unstructured data using coprocessors |
JP2010509691A (en) * | 2006-11-13 | 2010-03-25 | エクセジー・インコーポレイテツド | High-performance data metatagging and data indexing method and system using a coprocessor |
JP2010511925A (en) * | 2006-11-13 | 2010-04-15 | エクセジー・インコーポレイテツド | Method and system for high performance integration, processing and search of structured and unstructured data using coprocessors |
US20100094858A1 (en) * | 2006-11-13 | 2010-04-15 | Exegy Incorporated | Method and System for High Performance Integration, Processing and Searching of Structured and Unstructured Data Using Coprocessors |
Also Published As
Publication number | Publication date |
---|---|
JPWO2014083608A1 (en) | 2017-01-05 |
JP5891313B2 (en) | 2016-03-22 |
Similar Documents
Publication | Title |
---|---|
CN105516233B (en) | Method and system for application deployment portable on one or more cloud systems | |
JP5172714B2 (en) | RSS data processing object | |
KR101777392B1 (en) | Central server and method for processing of voice of user | |
CN108874558B (en) | Message subscription method of distributed transaction, electronic device and readable storage medium | |
CN101090337B (en) | System and method for scalable distribution of semantic web updates | |
US10306022B2 (en) | Facilitating the operation of a client/server application while a client is offline or online | |
CN110851681B (en) | Crawler processing method, crawler processing device, server and computer readable storage medium | |
JP4880376B2 (en) | Support apparatus, program, information processing system, and support method | |
CN102971707A (en) | Configuring a computer system for a software package installation | |
CN110321544B (en) | Method and device for generating information | |
GB2520246A (en) | Method for accessing business object resources and machine-to-machine communication environment | |
CN104468189B (en) | A kind of method for the automatic upgrading BIOS of different clients version | |
CN108701130A (en) | Hints model is updated using auto-browsing cluster | |
JP2009104381A (en) | Cache control program, recording medium recording this program, cache control unit, and cache control method | |
KR20110008179A (en) | Generating sitemaps | |
US9128886B2 (en) | Computer implemented method, computer system, electronic interface, mobile computing device and computer readable medium | |
CN109299124A (en) | Method and apparatus for more new model | |
CN111159590A (en) | Serial connection method and device based on front-end and back-end service call links | |
Biörnstad et al. | Let it flow: Building mashups with data processing pipelines | |
CN105653360A (en) | Method and system for cross-app function acquisition | |
CN102640126A (en) | Management apparatus and method therefor | |
CN109271238A (en) | Support the task scheduling apparatus and method of a variety of programming languages | |
JP5891313B2 (en) | Computer, computer system, and data management method | |
US20200092385A1 (en) | Information provision control system and information provision control method | |
CN108073638B (en) | Data diagnosis method and device |
Legal Events
Code | Title | Description |
---|---|---|
121 | Ep: the epo has been informed by wipo that ep was designated in this application | Ref document number: 12889234; Country of ref document: EP; Kind code of ref document: A1 |
ENP | Entry into the national phase | Ref document number: 2014549661; Country of ref document: JP; Kind code of ref document: A |
NENP | Non-entry into the national phase | Ref country code: DE |
122 | Ep: pct application non-entry in european phase | Ref document number: 12889234; Country of ref document: EP; Kind code of ref document: A1 |