JP7163845B2 - Information processing device and program

Info

Publication number: JP7163845B2
Application number: JP2019063559A
Authority: JP
Inventors: 直樹梶谷
Original assignee: Oki Electric Industry Co Ltd
Current assignee: Oki Electric Industry Co Ltd
Priority date: 2019-03-28
Filing date: 2019-03-28
Publication date: 2022-11-01
Anticipated expiration: 2039-03-28
Also published as: JP2020166021A

Description

The present disclosure relates to an information processing device and a program.

Techniques that use speech recognition to convert speech into text are known. In industries where voice calls are handled, such as call centers, these techniques are used to improve operational efficiency.

For example, one technique performs speech recognition simultaneously with a plurality of speech recognition engines and selects a recognition result according to the reliability of each engine (see Patent Document 1).

Another technique lets the user select which of the results recognized by a plurality of speech recognition engines to adopt (see Patent Document 2).

Patent Document 1: JP 2017-76139 A
Patent Document 2: JP 2010-85536 A

In voice call operations, speech recognition results are used to create a correct text. In a call center, for example, the correct text is a text summarizing the content of an inquiry from a user. Conventionally, the correct text was created by hand while listening to the audio, which took a great deal of working time. In some cases a speech recognition engine is used, and the correct text is created while referring to the recognition results. Because a single vendor's engine is used in that case, the same error occurs repeatedly, which makes the work laborious.

When speech recognition engines are used, there are methods that run multiple recognition methods in parallel and methods that select the recognition result closest to the correct answer. However, performing the work of creating the correct text on a screen has not been considered.

An object of the present disclosure is to provide an information processing device and a program with which recognized important parts can be corrected and registered easily and efficiently.

An information processing device according to the present disclosure corrects and registers the results of speech recognition. The device includes: a plurality of speech recognition units, each of which performs speech recognition and stores text in a storage unit; an important part narrowing-down unit that searches the plurality of texts stored in the storage unit for text matching a predetermined rule, stores the retrieved text in the storage unit as important text, and reads the important text from the storage unit and displays it; a correction work unit that reads from the storage unit, and displays, the important text selected from the important texts displayed by the important part narrowing-down unit, has a correct text, which is a text corrected on the basis of information about the displayed important text, entered in a first form, which is a predetermined form, and stores the correct text entered in the first form in the storage unit; and an information registration unit that recognizes the correct text and has information entered in a second form, which is a predetermined form, according to predetermined conditions.

A program according to the present disclosure causes a computer to function as each unit of the information processing device described in the present disclosure.

According to the information processing device and the program of the present disclosure, recognized important parts can be corrected and registered easily and efficiently.

FIG. 1 is a block diagram showing an example of the configuration of the information processing device of the present embodiment.
FIG. 2 is a schematic block diagram of a computer functioning as the information processing device of the present embodiment.
FIG. 3 is a diagram showing an example of the screen interface output by the display unit.
FIG. 4 is a diagram showing an example in which an area containing text other than the important text is selected.
FIG. 5 is a flowchart of the processing routine executed by the information processing device.
FIG. 6 is a diagram showing an example in which a button for switching between the correction mode and the registration mode is provided in the important part narrowing-down area.

Embodiments of the present invention are described in detail below. The information processing device according to the present embodiment is described using an example in which it is used in call center operations and operated by an operator. Operation has two modes: a correction mode, in which correction work is performed, and a registration mode, in which no correction is performed. The present embodiment assumes that the correction mode has been selected in advance.

<Configuration of the Information Processing Device According to the Embodiment of the Present Invention>

FIG. 1 is a block diagram showing an example of the configuration of the information processing device 10 of the present embodiment. The information processing device 10 shown in FIG. 1 can be implemented by a computer including a CPU, a RAM, and a ROM that stores various data and the programs for executing the processing routines described later.

For example, the information processing device 10 can be implemented by the computer 50 shown in FIG. 2. The computer 50 includes a CPU 51, a memory 52 serving as a temporary storage area, and a non-volatile storage unit 53. The computer 50 also includes an input/output interface (I/F) 54, to which input/output devices and the like (not shown) are connected, and a read/write (R/W) unit 55 that controls reading data from and writing data to a recording medium. The computer 50 further includes a network I/F 56 connected to a network such as the Internet. The CPU 51, the memory 52, the storage unit 53, the input/output I/F 54, the R/W unit 55, and the network I/F 56 are connected to one another via a bus 57.

The storage unit 53 can be implemented by a hard disk drive (HDD), a solid state drive (SSD), flash memory, or the like. The storage unit 53, as a storage medium, stores a program for causing the computer 50 to function. The CPU 51 reads the program from the storage unit 53, loads it into the memory 52, and sequentially executes the processes of the program. The storage unit 53 also stores the information of the recognition result storage unit 22.

This concludes the description of an example of the electrical configuration of the computer in FIG. 2.

Functionally, as shown in FIG. 1, the information processing device 10 includes a plurality of speech recognition units 20, a recognition result storage unit 22, an important part narrowing-down unit 24, a correction work unit 26, an information registration unit 28, and a display unit 30. The recognition result storage unit 22 is an example of the storage unit of the present invention.

An example of the screen interface output by the display unit 30 is described here. FIG. 3 is a diagram showing an example of the screen interface output by the display unit 30. As shown in FIG. 3, the screen interface presents the important part narrowing-down area 24A, the correction work area 26A, and the information registration area 28A together on a single display screen. These areas correspond to the important part narrowing-down unit 24, the correction work unit 26, and the information registration unit 28, respectively. The information processing device 10 reflects the texts resulting from each unit's processing in the corresponding area of the screen interface. Because the screen is divided into three areas by task, all of the tasks can be completed on one screen. As a result, work such as a call center operator recording the history of a conversation can be done efficiently in a short time, and the operator can be expected to handle more calls. The specific methods are described later together with the processing of the unit corresponding to each area.

Each speech recognition unit 20 is a speech recognition engine that recognizes the speech of a dialogue. Each speech recognition unit 20 uses a different recognition method. Each speech recognition unit 20 performs speech recognition and stores the resulting text in the recognition result storage unit 22. Where the engine can identify the speaker, the recognized speech is grouped by each speaker's utterances, and a speaker identification label is attached to the text.

The recognition result storage unit 22 stores the labeled text recognized by each speech recognition unit 20. It also stores the important text retrieved by the important part narrowing-down unit 24, the important text corrected with the correct text in the correction work unit 26, and the recognition rate of each speech recognition engine calculated by the correction work unit 26.
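The contents of the recognition result storage unit 22 can be pictured as a small data model. The following is a minimal sketch only; the class names, field names, and the idea of keying utterances by their position in the dialogue are assumptions made for illustration and are not specified in the disclosure.

```python
from dataclasses import dataclass, field


@dataclass
class Utterance:
    engine: str   # which speech recognition unit produced the text (hypothetical name)
    speaker: str  # speaker identification label, e.g. "user" or "operator"
    index: int    # position of the utterance in the dialogue (assumed key)
    text: str     # recognized text


@dataclass
class RecognitionStore:
    utterances: list = field(default_factory=list)  # labeled text from every engine
    important: dict = field(default_factory=dict)   # index -> important text
    corrected: dict = field(default_factory=dict)   # index -> correct text
    rates: dict = field(default_factory=dict)       # engine -> recognition rate

    def add(self, utterance):
        self.utterances.append(utterance)

    def by_index(self, index):
        """Return every engine's text for one utterance position."""
        return {u.engine: u.text for u in self.utterances if u.index == index}


store = RecognitionStore()
store.add(Utterance("A", "user", 0, "hello"))
store.add(Utterance("B", "user", 0, "hallo"))
```

Keeping one record per engine and utterance makes it straightforward to retrieve all engines' versions of a selected utterance for side-by-side display.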

The important part narrowing-down unit 24 searches the plurality of texts stored in the recognition result storage unit 22 for text that matches a predetermined rule. It stores the retrieved text in the recognition result storage unit 22 as important text, then reads the important text from the recognition result storage unit 22 and displays it in the important part narrowing-down area 24A. The important part narrowing-down unit 24 also accepts the selection of an important text. The selection may be made by clicking the important text in a balloon 24B or by selecting text by drag and drop or the like.

The important part narrowing-down unit 24 extracts from the dialogue the utterances, and the parts within utterances, that are judged to be important and, as shown in FIG. 3, displays the important text in the important part narrowing-down area 24A for each speaker identified by speech recognition. In the example of FIG. 3, the icon on the right represents the user who made the inquiry, the icon on the left represents the operator, and the important text is displayed in balloons 24B. As for the predetermined rule for judging an important part, a rule or a discriminative model for judging important parts may be learned in advance by adopting or combining various methods. Examples of the predetermined rule include extracting sentences that contain specific keywords, extracting the relevant sentences after understanding the context with natural language analysis, and extracting the relevant sentences by eliminating unnecessary ones.
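Of the example rules listed above, keyword matching is the simplest to illustrate. The sketch below assumes utterances are given as (speaker, text) pairs and uses a hypothetical keyword list; the disclosure does not fix the rule, and a learned model could take the place of this function.

```python
def narrow_important(utterances, keywords):
    """Return the (speaker, text) pairs whose text contains any keyword."""
    hits = []
    for speaker, text in utterances:
        lowered = text.lower()
        if any(keyword.lower() in lowered for keyword in keywords):
            hits.append((speaker, text))
    return hits


# Hypothetical dialogue and keyword list, for illustration only.
dialogue = [
    ("operator", "Thank you for calling."),
    ("user", "I would like a refund for my order."),
    ("user", "The weather is nice today."),
]
important = narrow_important(dialogue, ["refund", "order"])
```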

The displayed important texts may not contain the part that is actually important. The important part narrowing-down area 24A therefore has areas 24C, shown as "...", that contain text other than the displayed important texts. Each area 24C contains the text recorded, in the chronological order of the dialogue, between the important text above it and the important text below it.

FIG. 4 is a diagram showing an example in which an area 24C containing text other than the important text is selected. Selecting the area displays other utterances that were not picked up as important text so that they can be extracted. An utterance that is judged to be important and extracted can be added to the important part narrowing-down area 24A as important text in a balloon 24B. The balloons of the other utterances may be displayed in a pop-up or the like.

The correction work unit 26 reads from the recognition result storage unit 22 the important text selected from the important texts displayed in the important part narrowing-down area 24A, and displays information about the selected important text in the display area 26B of the correction work area 26A. The correction work unit 26 accepts input of the correct text through the correction work form 26C of the correction work area 26A. The correct text is a text corrected on the basis of the information about the important text displayed in the display area 26B, namely the recognition results of the speech recognition engines displayed side by side. The correction work unit 26 accepts a press of the correct text registration button and stores the correct text in the recognition result storage unit 22 as corrected important text. The operator performs the input into the correction work form 26C and the registration. The correction work unit 26 also reads the important text corrected with the correct text from the recognition result storage unit 22 and reflects it in the important part narrowing-down area 24A as corrected important text. The correction work form is an example of the first form.

The correction work area 26A is the area in which the text judged to be important by the important part narrowing-down unit 24, that is, the sentence of the selected important text, can be corrected when it contains a recognition error. For the selected important text, the results of the plurality of speech recognition engines can be displayed side by side in the display area 26B. A draft of the correct sentence can be generated automatically in the correction work form 26C, for example by treating the portions on which the results of the plurality of engines agree as correct. The correction work area 26A is thus an area in which the operator checks the draft by hand, corrects it where necessary, and completes the correct sentence.
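One way to picture the automatic draft is to keep only the words on which every engine agrees and mark the rest for the operator. The following is a rough sketch under several assumptions: whitespace tokenization, the first engine's result as the reference, the `[?]` gap marker, and Python's `difflib` for alignment are all choices made for this example, not details taken from the disclosure.

```python
from difflib import SequenceMatcher


def draft_from_consensus(results, gap="[?]"):
    """Keep the tokens of the first result that every other result also matches."""
    reference = results[0].split()
    agreed = [True] * len(reference)
    for other in results[1:]:
        matched = [False] * len(reference)
        matcher = SequenceMatcher(None, reference, other.split(), autojunk=False)
        for block in matcher.get_matching_blocks():
            for i in range(block.a, block.a + block.size):
                matched[i] = True
        # A token stays "agreed" only if this engine matched it as well.
        agreed = [a and m for a, m in zip(agreed, matched)]
    return " ".join(tok if ok else gap for tok, ok in zip(reference, agreed))


# Hypothetical results from three engines for the same utterance.
draft = draft_from_consensus([
    "please cancel my order today",
    "please cancel the order today",
    "please cancel my order to day",
])
```

In this example the draft keeps "please cancel" and "order" and leaves two gaps, so the operator only has to decide the disputed words.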

When the operator completes the correct text in the correction work form 26C and presses the correct text registration button, the correction work unit 26 also calculates, for the speech recognition results displayed side by side in the display area 26B, the recognition rate of each speech recognition engine and stores it in the recognition result storage unit 22. The recognition result can also be displayed in the display area 26B. The recognition rate may be calculated, for example, as the match rate between an engine's original text and the entered correct text. The recognition rate can be used to evaluate the speech recognition engines.
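The match rate is not defined further in the text. As one possible reading, the sketch below uses the character-level similarity ratio from Python's `difflib`; this metric, the function names, and the sample strings are assumptions for illustration, and a word or character error rate would serve equally well.

```python
from difflib import SequenceMatcher


def recognition_rate(engine_text, correct_text):
    """Similarity in [0, 1] between an engine's text and the correct text."""
    return SequenceMatcher(None, engine_text, correct_text, autojunk=False).ratio()


def rates_per_engine(engine_texts, correct_text):
    """Map each engine name to its recognition rate for one utterance."""
    return {name: recognition_rate(text, correct_text)
            for name, text in engine_texts.items()}


# Hypothetical engine outputs compared against the operator's correct text.
rates = rates_per_engine(
    {"A": "cancel my order", "B": "cancel my odor"},
    "cancel my order",
)
```

Averaging these per-utterance rates over many calls would give a per-engine figure usable for the evaluation mentioned above.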

When the correction is reflected in the important part narrowing-down area 24A, the fact that an important text has been corrected may be indicated by an annotation, a change of font color, or the like. Reflecting the corrected text in an identifiable form lets the operator see which important texts have been corrected when there are several correction points.

The information registration unit 28 recognizes the corrected important text in the important part narrowing-down area 24A and, according to predetermined conditions, has information entered in the information registration forms 28B to 28E of the information registration area 28A. The information registration unit 28 accepts a press of the registration button and stores the registered information in the recognition result storage unit 22.

Each of the information registration forms 28B to 28E is assigned a category. For example, forms 28B and 28C can be defined as "free entry", form 28D as "matter of inquiry", and form 28E as "answer". In the information registration area 28A, sentences generated automatically for each category from the important texts displayed in the important part narrowing-down area 24A, or sentences copied from the important texts, are reflected in the information registration forms 28B to 28E prepared in advance and matched to the content of the user's inquiry, so that information can be registered easily. Sentences may be generated automatically when an important text is selected, or when an information registration form is clicked. Copying may be done by double-clicking an important text.
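The predetermined conditions for filling the forms are left open. Purely as an illustration, the sketch below assumes that the user's important texts go to form 28D, the operator's go to form 28E, and anything else goes to the free-entry form 28B; this speaker-based routing is a hypothetical condition, not one stated in the disclosure.

```python
def route_to_forms(important_texts):
    """Distribute (speaker, text) pairs over the registration forms."""
    forms = {"28B": [], "28D": [], "28E": []}
    for speaker, text in important_texts:
        if speaker == "user":
            forms["28D"].append(text)   # what the user asked about
        elif speaker == "operator":
            forms["28E"].append(text)   # what the operator answered
        else:
            forms["28B"].append(text)   # free entry for unlabeled text
    return forms


# Hypothetical corrected important texts.
forms = route_to_forms([
    ("user", "I would like a refund."),
    ("operator", "The refund will be issued within five days."),
])
```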

For automatic sentence generation, an automatic sentence creation tool may be used that, for example, automatically removes unnecessary words and makes the sentence concise when the balloon 24B in the important part narrowing-down area 24A contains a long passage of 10 lines or more. This is because the important text displayed in the balloon 24B is spoken language and may contain many unnecessary words.
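A very small stand-in for such a tool is sketched below. It assumes a fixed, hypothetical list of English filler words and only acts on passages of 10 lines or more, following the example in the text; a practical tool for spoken Japanese would need morphological analysis rather than whitespace splitting.

```python
FILLERS = {"um", "uh", "er", "ah"}  # hypothetical filler list


def simplify(text, min_lines=10):
    """Strip filler words from passages of min_lines lines or more."""
    lines = text.splitlines()
    if len(lines) < min_lines:
        return text  # short passages are left untouched
    cleaned = []
    for line in lines:
        words = [w for w in line.split() if w.lower().strip(",.") not in FILLERS]
        if words:
            cleaned.append(" ".join(words))
    return "\n".join(cleaned)


long_passage = "\n".join(["um, I want uh a refund"] * 10)
concise = simplify(long_passage)
```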

<Operation of the Information Processing Device According to the Embodiment of the Present Invention>

Next, the operation of the information processing device 10 is described with reference to FIG. 5. FIG. 5 is a flowchart of the processing routine executed by the information processing device 10. The processing routine is executed, for example, as soon as a dialogue with a user begins at the call center.

In step S100, each speech recognition unit 20 performs speech recognition and stores labeled text in the recognition result storage unit 22.

In step S102, the important part narrowing-down unit 24 searches the plurality of texts stored in the recognition result storage unit 22 for text that matches a predetermined rule.

In step S104, the important part narrowing-down unit 24 stores the retrieved text in the recognition result storage unit 22 as important text, then reads the important text from the recognition result storage unit 22 and displays it in the important part narrowing-down area 24A.

In step S106, the important part narrowing-down unit 24 accepts the selection of an important text from the important texts displayed in the important part narrowing-down area 24A.

In step S108, the correction work unit 26 reads from the recognition result storage unit 22 the important text selected from the important texts displayed in the important part narrowing-down area 24A, and displays information about the selected important text in the display area 26B of the correction work area 26A.

In step S110, the correction work unit 26 accepts input of the correct text through the correction work form 26C of the correction work area 26A. The operator enters the correct text in the correction work form 26C on the basis of the displayed content.

In step S112, the correction work unit 26 accepts a press of the correct text registration button and stores the correct text in the recognition result storage unit 22 as corrected important text.

In step S114, the correction work unit 26 reads the important text corrected with the correct text from the recognition result storage unit 22 and reflects the corrected important text in the important part narrowing-down area 24A.

In step S116, the correction work unit 26 calculates, for the speech recognition results displayed side by side in the display area 26B, the recognition rate of each speech recognition engine and stores it in the recognition result storage unit 22.

In step S118, the information registration unit 28 recognizes the corrected important text in the important part narrowing-down area 24A and, according to predetermined conditions, accepts input of information into the information registration forms 28B to 28E of the information registration area 28A.

In step S120, the information registration unit 28 accepts a press of the registration button, stores the registered information in the recognition result storage unit 22, and ends the processing.
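Steps S100 to S120 can be summarized as a single routine. The sketch below is only an outline under stated assumptions: the engines, the importance rule, and the operator's actions (selecting, correcting, registering) are passed in as hypothetical callables; the first engine's output is used for the importance search; and the recognition rate uses `difflib` similarity as a stand-in for the match rate.

```python
from difflib import SequenceMatcher


def run_routine(audio, engines, is_important, select, correct, register):
    store = {}
    # S100: every engine recognizes the same audio; results are kept per engine.
    store["texts"] = {name: engine(audio) for name, engine in engines.items()}
    first = next(iter(store["texts"].values()))
    # S102-S104: search for text matching the rule and keep it as important text.
    store["important"] = {i: t for i, t in enumerate(first) if is_important(t)}
    # S106-S108: the operator selects one important text; gather every engine's version.
    idx = select(store["important"])
    candidates = {name: texts[idx] for name, texts in store["texts"].items()}
    # S110-S114: the operator enters the correct text, which replaces the important text.
    fixed = correct(candidates)
    store["important"][idx] = fixed
    # S116: recognition rate of each engine against the correct text.
    store["rates"] = {
        name: SequenceMatcher(None, text, fixed, autojunk=False).ratio()
        for name, text in candidates.items()
    }
    # S118-S120: fill the registration forms and store the registered information.
    store["registered"] = register(store["important"])
    return store


# Hypothetical engines and operator actions, for illustration only.
result = run_routine(
    audio=b"",
    engines={
        "A": lambda audio: ["hello", "i want a refund"],
        "B": lambda audio: ["hello", "i want a re fund"],
    },
    is_important=lambda text: "fund" in text,
    select=lambda important: min(important),
    correct=lambda candidates: "I want a refund.",
    register=lambda important: {"28D": list(important.values())},
)
```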

As described above, the information processing device 10 according to the present embodiment makes it possible to correct and register recognized important parts easily and efficiently.

The present invention is not limited to the embodiment described above, and various modifications and applications are possible without departing from the gist of the invention.

The embodiment above was described for the case in which the correction mode is selected in advance, but the invention is not limited to this. For example, the important part narrowing-down area 24A may provide a screen interface for switching between selecting a correction target and registering information. FIG. 6 is a diagram showing an example in which a button for switching between the correction mode and the registration mode is provided in the important part narrowing-down area 24A. As shown in FIG. 6, the mode is selected in the mode selection area 24D. When the correction mode is selected, selecting an important text in the important part narrowing-down area 24A reflects that text in the display area 26B and the correction work form 26C of the correction work area 26A. When the registration mode is selected, selecting an important text in the important part narrowing-down area 24A automatically reflects it in the information registration forms 28B to 28E according to predetermined conditions. The operator can also register information by copying an important text from the important part narrowing-down area 24A into the information registration forms 28B to 28E. When the registration mode is selected, the correction work area 26A may be hidden. Making the mode switchable in this way lets the operator perform the registration work easily on one screen.
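The mode switch amounts to routing the same selection event to different areas. A minimal sketch follows, assuming the screen state is a plain dictionary with hypothetical keys and that hiding the correction work area in registration mode (an option the text allows) is enabled.

```python
from enum import Enum


class Mode(Enum):
    CORRECTION = "correction"
    REGISTRATION = "registration"


def on_select(mode, text, screen):
    """Route a selected important text according to the current mode."""
    if mode is Mode.CORRECTION:
        # Reflect the text in the display area 26B and the correction work form 26C.
        screen["display_26B"] = text
        screen["form_26C"] = text
        screen["area_26A_visible"] = True
    else:
        # Reflect the text in the registration forms and hide the correction area.
        screen["registration_forms"].append(text)
        screen["area_26A_visible"] = False
    return screen


screen = {"registration_forms": []}
on_select(Mode.CORRECTION, "i want a re fund", screen)
on_select(Mode.REGISTRATION, "I want a refund.", screen)
```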

Although this specification describes an embodiment in which the program is installed in advance, the program can also be provided stored on a computer-readable recording medium.

10 Information processing device
20 Speech recognition unit
22 Recognition result storage unit
24 Important part narrowing-down unit
26 Correction work unit
28 Information registration unit
30 Display unit

Claims

1. An information processing device for correcting and registering speech recognition results, comprising:
a plurality of speech recognition units, each of which performs speech recognition and stores text in a storage unit;
an important part narrowing-down unit that searches the plurality of texts stored in the storage unit for text matching a predetermined rule, stores the retrieved text in the storage unit as important text, and reads the important text from the storage unit and displays it;
a correction work unit that reads from the storage unit and displays the important text selected from the important texts displayed by the important part narrowing-down unit, has a correct text, which is a text corrected on the basis of information about the displayed important text, entered in a first form, which is a predetermined form, and stores the correct text entered in the first form in the storage unit; and
an information registration unit that recognizes the correct text and has information entered in a second form, which is a predetermined form, according to predetermined conditions.

2. The information processing device according to claim 1, further comprising a display unit, wherein the display unit integrally displays a display area corresponding to the important part narrowing-down unit, a display area corresponding to the correction work unit, and a display area corresponding to the information registration unit.

3. The information processing device according to claim 1, wherein the correction work unit reads the selected important text and a plurality of texts corresponding to the selected important text from the storage unit and displays them.

4. The information processing device according to any one of claims 1 to 3, wherein the recognition rate of each of the speech recognition units is measured on the basis of the correct text registered by the correction work unit and is registered in the storage unit.

5. The information processing device according to any one of claims 1 to 4, wherein the important part narrowing-down unit displays, upon selection of an area containing text other than the important text, candidate texts other than the important text, and stores text selected from the candidate texts in the storage unit as important text.

6. The information processing device according to one of the preceding claims, wherein
the predetermined conditions include conditions related to automatic reflection of the text,
a plurality of the second forms are provided, one for each category, and
the information registration unit reflects, according to the predetermined conditions, the information candidate in the second form of the category corresponding to the important text and the correct text.

7. A program for causing a computer to function as each unit of the information processing device according to any one of claims 1 to 6.