JPH11175518A

JPH11175518A - Character string inputting device and method therefor and program recording medium for storing character string input program

Info

Publication number: JPH11175518A
Application number: JP9341104A
Authority: JP
Inventors: Seiji Kuwari; 聖二桑理
Original assignee: Omron Corp; Omron Tateisi Electronics Co
Current assignee: Omron Corp
Priority date: 1997-12-11
Filing date: 1997-12-11
Publication date: 1999-07-02

Abstract

PROBLEM TO BE SOLVED: To allow a user to almost exactly confirm input miss, and to easily correct an inputted character string. SOLUTION: A control part 3 allows a user to confirm input miss by displaying a synonym as a candidate word when the synonym of a character string similar to a character string inputted from an inputting part 2 has a cooccurrence relation with a word which is already used in a document which is being prepared. Therefore, even when the inputted character string is a character string which exists as a word, the user is allowed to confirm the input miss when the possibility of the input miss is high. Thus, it is possible to allow the user to almost exactly confirm the input miss, and to prevent the efficiency of document preparation from being deteriorated.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】この発明は、ユーザの入力し
た文字列がユーザの意図している単語の文字列でなく入
力ミスである可能性が高いときに、ユーザに入力ミスで
ないか確認させ、入力ミスであったときに入力された文
字列を修正させる文字列入力装置、文字列入力方法およ
び文字列入力プログラムを記録したプログラム記録媒体
に関する。BACKGROUND OF THE INVENTION The present invention provides a method for allowing a user to confirm whether a character string entered by a user is not a character string of a word intended by the user and is likely to be an input error, The present invention relates to a character string input device, a character string input method, and a program recording medium that records a character string input program for correcting a character string input when an input error occurs.

【０００２】[0002]

【従来の技術】従来より、ユーザによって入力された文
字列が単語として適正な文字列であるか否かを判定し、
この文字列が単語として適正でなければ入力された文字
列と類似する文字列の単語（類似語）を候補単語として
抽出し、この候補単語を表示することによって、ユーザ
に入力ミスを認識させる（確認させる）文字列入力装置
がある。また、この種の文字列入力装置では、ユーザが
表示された候補単語の中から自分の意図する単語を選択
すると、入力された文字列がここで選択された単語の文
字列に置き換えられ、入力ミスを容易に修正することが
できる。2. Description of the Related Art Conventionally, it has been determined whether a character string input by a user is a proper character string as a word,
If the character string is not appropriate as a word, a word (similar word) of a character string similar to the input character string is extracted as a candidate word, and the candidate word is displayed to allow the user to recognize an input error ( There is a character string input device. In this type of character string input device, when the user selects a word intended by the user from the displayed candidate words, the input character string is replaced with the character string of the word selected here, and Mistakes can be easily corrected.

【０００３】例えば、この種の文字列入力装置が適用さ
れた英文ワープロでは、英和辞書（電子データ化された
辞書データである。）を検索してユーザが入力した文字
列の単語が存在するか否かを確認し、該文字列の英単語
が存在しなければ入力ミス（スペルミス）であると判定
し、入力された文字列と類似する文字列の単語を英和辞
書から抽出し、これを候補単語として表示することによ
ってユーザに入力ミスを認識させていた。また、ユーザ
が表示されている候補単語の中から自分の意図する単語
を選択すると、入力された文字列を選択された候補単語
の文字列に置き換え、入力ミスを容易に修正できるよう
にもしていた。なお、上記英和辞書を各単語について日
本語の意味を対応させて記憶させていない辞書に置き換
えたものもある。[0003] For example, in an English word processor to which this type of character string input device is applied, an English-Japanese dictionary (which is dictionary data converted into electronic data) is searched to determine whether a word of a character string input by a user exists. If there is no English word of the character string, it is determined that there is an input error (spelling error), and a word of a character string similar to the input character string is extracted from the English-Japanese dictionary. By displaying the words as words, the user was made to recognize input errors. Further, when the user selects a word intended by the user from the displayed candidate words, the input character string is replaced with the character string of the selected candidate word, so that input errors can be easily corrected. Was. There is also a dictionary in which the English-Japanese dictionary is replaced with a dictionary that does not store the meaning of Japanese in correspondence with each word.

【０００４】さらに、候補単語を表示する場合、候補単
語毎にその意味も同時に表示することによって、ユーザ
が候補単語の中から自分の意図している単語を容易に判
断できるようにしたものもある。Further, when displaying candidate words, there is also a display that displays the meaning of each candidate word at the same time, so that the user can easily determine the intended word from the candidate words. .

【０００５】[0005]

【発明が解決しようとする課題】しかしながら、従来の
文字列入力装置は、入力された文字列と同一の文字列の
単語が存在しているか否かによってユーザの入力ミスで
あるか否かを判定している。例えば、英文ワープロで
は、ユーザの入力した文字列と同一の文字列の単語が英
和辞書に存在していれば、この入力された文字列を正し
い単語であると判定している（入力ミスでないと判定し
ている。）。したがって、ユーザの入力した文字列がユ
ーザの意図する単語（文字列）でなくても（入力ミスで
あったとしても）、入力された文字列の単語が英和辞書
に存在していれば、入力された文字列を正しい単語であ
る（入力ミスでない）と判定してしまうという問題があ
った。例えば、ユーザの意図する単語が『ｌｉｇｈｔ』
であるときに、ユーザが入力を誤って『ｒｉｇｈｔ』と
入力しても、入力された文字列を正しい単語であると判
定してしまう。特に、日本人の場合には発音が似ている
ことから『ｌ』と『ｒ』を間違えて入力することが多
い。このような場合には、ユーザに入力ミスを認識さ
せ、これを修正させることができなかった。However, the conventional character string input device determines whether or not an input error has been made by the user based on whether or not a word having the same character string as the input character string exists. doing. For example, in an English word processor, if a word having the same character string as the character string input by the user exists in the English-Japanese dictionary, the input character string is determined to be a correct word. Has been determined.) Therefore, even if the character string input by the user is not the word (character string) intended by the user (even if the input is incorrect), if the word of the input character string exists in the English-Japanese dictionary, There is a problem that the determined character string is determined to be a correct word (not an input error). For example, if the word intended by the user is "light"
, Even if the user erroneously inputs "right", the input character string is determined to be a correct word. In particular, in the case of Japanese, "l" and "r" are often mistakenly input because the pronunciation is similar. In such a case, the user cannot recognize the input error and cannot correct it.

【０００６】この発明の目的は、入力された文字列と同
一の文字列の単語が存在するか否かだけでなく、作成中
の文書内に既に入力された単語との関係から、入力され
た文字列がユーザの意図する単語であるか否かを判定す
ることによって、ユーザの入力ミスをより確実に検出
し、これをユーザに認識させて、入力した文字列を修正
させることができる文字列入力装置、文字列入力方法、
および、文字列入力プログラムを記録したプログラム記
録媒体を提供することにある。An object of the present invention is not only to determine whether or not a word having the same character string as the input character string exists, but also to determine whether or not a word is already input in a document being created. By determining whether or not a character string is a word intended by the user, a character string that can more reliably detect a user's input error, make the user recognize this, and correct the input character string Input device, character string input method,
Another object of the present invention is to provide a program recording medium on which a character string input program is recorded.

【０００７】また、この発明の目的は、入力された文字
列が予め設定された使用頻度の低い単語である場合に、
ユーザに入力ミスでないかを確認させることにより、ユ
ーザに入力ミスを認識させ、入力した文字列を修正させ
る文字列入力装置、文字列入力方法、および、文字列入
力プログラムを記録したプログラム記録媒体を提供する
ことにある。[0007] Another object of the present invention is to provide a method in which an input character string is a preset infrequently used word.
A character string input device, a character string input method, and a program recording medium on which a character string input program is recorded so that the user recognizes the input error and corrects the input character string by making the user confirm whether or not there is an input error. To provide.

【０００８】[0008]

【課題を解決するための手段】請求項１に記載した発明
は、文字列を入力する入力部と、単語毎に、文字列が類
似するものとして予め設定した類似語、同義語や反意語
等意味上の観念が関連するものとして予め設定した共起
語、および共起状態の有無を直接または間接に記憶する
記憶部と、入力された単語の類似語が前記記憶部にある
とき、該類似語が共起状態にあれば該類似語を入力候補
として表示部に表示させるとともに、入力単語の確定時
には、確定した単語の共起語について共起状態をセット
する制御部と、を備えている。According to a first aspect of the present invention, there is provided an input unit for inputting a character string, and a similar word, a synonym, antonym, etc., which are set in advance for each word as a similar character string. A storage unit for directly or indirectly storing the co-occurrence word set in advance as a related idea, and the presence or absence of a co-occurrence state, and when a similar word of the input word is present in the storage unit, the similar word And a control unit for setting the co-occurrence state of the co-occurred word of the determined word when the input word is determined, on the display unit, when the similar word is in the co-occurrence state.

【０００９】まず、この発明で言う共起とは、同義語や
反意語等意味上の観念が関連し、同一の文書内で同時に
使用される可能性が高い単語間の関係である。なお、単
語間で意味上の観念が関連しているかどうかについて
は、辞書作成者が自分のイメージにより判断するもので
あり、該辞書作成者によって単語毎に共起語が設定され
る。例えば、辞書作成者によって『ｗｅｉｇｈｔ（重
さ）』と『ｌｉｇｈｔ（軽い）』や、『ｌｉｇｈｔ
（光）』と『ｃｏｌｏｒ（色）』を意味上の観念が関連
している（共起関係にある）共起語として予め設定され
る。なお、辞書作成者は『ｌｉｇｈｔ』は『ｗｅｉｇｈ
ｔ』と『ｃｏｌｏｒ』とのそれぞれについて共起関係に
ある共起語と設定しても、『ｗｅｉｇｈｔ』と『ｃｏｌ
ｏｒ』とが共起関係にないと判断し（『ｗｅｉｇｈｔ』
と『ｃｏｌｏｒ』とは意味上の観念が関連していないと
判断し）、これらの単語を共起語として設定しない場合
もある。また、共起状態とは、上記の意味上の観念が関
連する共起語が入力されたことにより、他の単語に比べ
て入力される可能性が高くなった状態を言う。First, the co-occurrence referred to in the present invention is a relation between words that are related to meanings such as synonyms and antonyms and are highly likely to be used simultaneously in the same document. Note that whether or not the meaning is related between words is determined by the dictionary creator based on his / her own image, and the co-occurrence word is set for each word by the dictionary creator. For example, depending on the dictionary creator, "weight (weight)" and "light (light)" or "light"
(Light) ”and“ color (color) ”are set in advance as co-occurrence words that are related (in a co-occurrence relationship) with a semantic idea. Note that the dictionary creator changed “light” to “weigh”.
t "and" color "are set as co-occurring words in a co-occurrence relationship, but" weight "and" color "
or "is not in a co-occurrence relationship (" weight "
And "color" are not related to a semantic idea), and these words may not be set as co-occurring words. In addition, the co-occurrence state refers to a state in which a co-occurrence word related to the above-described semantic concept is more likely to be input compared to other words.

【００１０】なお、類似語も辞書作成者が各単語につい
て自分のイメージで文字列が類似する単語を予め設定し
ている。[0010] For the similar words, the dictionary creator sets in advance, for each word, a word whose character string is similar in its own image.

【００１１】この構成の文字列入力装置では、単語毎に
共起状態の有無を記憶し、入力単語の確定時に、確定し
た単語の共起語（他の単語に比べて入力される可能性が
高くなった単語）について共起状態をセットする。ま
た、入力された単語の類似語があるときに、該類似語が
共起状態にあれば（他の単語に比べて入力される可能性
が高ければ）、該類似語を入力候補として表示する。In the character string input device having this configuration, the presence / absence of a co-occurrence state is stored for each word, and when the input word is determined, the co-occurrence word of the determined word (the possibility of being input as compared to other words is higher). The co-occurrence state is set for the word (elevated). In addition, when there is a similar word of the input word, if the similar word is in a co-occurrence state (if it is more likely to be input than other words), the similar word is displayed as an input candidate. .

【００１２】ここで、ユーザによって入力された文字列
の単語が存在していても、入力された単語の類似語が他
の単語に比べて入力される可能性が高い共起状態にあれ
ば、ユーザの意図している単語がこの類似語である可能
性が高い（ユーザの入力ミスである可能性が高い。）。
一方、入力された単語の類似語が他の単語に比べて入力
される可能性が高い共起状態になければ、ユーザの意図
している単語が類似語である可能性が低い（ユーザの入
力ミスでない可能性が高い。）。ユーザの入力ミスであ
る可能性が高い場合に、入力候補を表示することによっ
て、ユーザに入力ミスでないかを確認させることで、ユ
ーザが入力ミスを略確実に認識できるようになる。Here, even if a word of a character string input by the user exists, if a similar word of the input word is in a co-occurrence state in which there is a high possibility that the word is input compared to other words, There is a high possibility that the word intended by the user is this similar word (it is highly likely that the user has made an input error).
On the other hand, if the similar word of the input word is not in a co-occurrence state in which there is a high possibility that the word is input compared to other words, it is unlikely that the word intended by the user is a similar word (user input It is likely that it is not a mistake.) When there is a high possibility that the user has made an input error, the input candidate is displayed to allow the user to confirm whether or not the input error has occurred, so that the user can almost certainly recognize the input error.

【００１３】なお、単に入力された単語の類似語があれ
ば類似語を表示してユーザに入力ミスでないかを確認さ
せることも考えられるが、この方式では上述したユーザ
の入力ミスでない可能性が高いときにも、入力候補を表
示してユーザに入力ミスでないかを確認させることとな
って、ユーザの文書作成作業を頻繁に中断させてしまう
ため、文書作成効率を低下させるという問題が生じる。
しかし、本願発明では、ユーザの入力ミスである可能性
が高いときのみ、入力候補を表示してユーザに入力ミス
でないかを確認させているので、上記の問題が生じるこ
とがない。[0013] Incidentally, if there is a similar word of the input word, it is conceivable to display the similar word to make the user confirm whether there is an input error. However, in this method, there is a possibility that the above-mentioned user input error is not made. Even when the price is high, input candidates are displayed and the user is asked to confirm whether there is any input error, and the user's document creation work is frequently interrupted, thereby causing a problem of lowering document creation efficiency.
However, in the present invention, the input problem is displayed only when there is a high possibility that the user has made an input error, and the user is made to check whether there is an input error. Therefore, the above problem does not occur.

【００１４】請求項２に記載した発明では、入力された
単語自身が既に共起状態にあれば、前記類似語を表示部
に表示させることなく、該入力された単語を確定させる
ようにした。In the invention described in claim 2, if the input word itself is already in a co-occurrence state, the input word is determined without displaying the similar word on the display unit.

【００１５】この構成では、入力された文字列の単語が
他の単語に比べて入力される可能性の高い単語であると
きには、ユーザの入力ミスである可能性が殆ど無いこと
から、入力された単語が確定される。In this configuration, when the word of the input character string is a word that is more likely to be input than other words, there is almost no possibility of a user input error. The word is determined.

【００１６】請求項３に記載した発明では、確定した単
語の共起語と類似する類似語が共起状態にあれば、該類
似語の共起状態をリセットするようにしたため、類似関
係にある複数の単語が同時に共起状態となることがな
い。ところで、類似関係にある複数の単語が同時に共起
状態となった場合、入力された単語が入力ミスである可
能性が高いかどうかを適正に判定できなくなる。例え
ば、最初に『ｌｉｇｈｔ』に共起する単語『ｗｅｉｇｈ
ｔ』が入力され、その後、『ｒｉｇｈｔ』に共起する単
語『ｗｒｏｎｇ』が入力され、最後に『ｒｉｇｈｔ』が
入力された場合、殆どの場合『ｗｒｏｎｇ』に共起する
『ｒｉｇｈｔ』が入力されたと考えられるが、『ｌｉｇ
ｈｔ』が共起状態であることを理由にユーザに入力ミス
でないかを確認させることで、ユーザの文書作成作業を
中断させてしまうという問題が生じる。In the invention described in claim 3, if a similar word similar to the co-occurred word of the determined word is in the co-occurrence state, the co-occurrence state of the similar word is reset, so that there is a similar relationship. A plurality of words do not co-occur. By the way, when a plurality of words having a similar relationship are in a co-occurrence state at the same time, it cannot be properly determined whether or not the input word is highly likely to be an input error. For example, first, the word “weigh” co-occurring with “light”
t ”is input, and then the word“ word ”co-occurring with“ right ”is input. When“ right ”is input at the end,“ right ”co-occurring with“ word ”is input in most cases. It is considered that
ht "is a co-occurrence state, causing the user to check for an input error, thereby causing a problem of interrupting the user's document creation work.

【００１７】一方、入力された単語が共起状態にあれ
ば、その単語を確定する構成が採用されていると、上記
の問題は解決されるが、最初に『ｌｉｇｈｔ』に共起す
る単語『ｗｅｉｇｈｔ』が入力され、その後、『ｒｉｇ
ｈｔ』に共起する単語『ｗｒｏｎｇ』が入力され、最後
に『ｌｉｇｈｔ』が入力された場合、上記のようにこの
『ｌｉｇｈｔ』は『ｒｉｇｈｔ』の入力ミスであるのに
『ｌｉｇｈｔ』が共起状態であることから入力が確定さ
れ、ユーザに入力ミスを認識させることができなくなる
という問題が生じる。On the other hand, if the input word is in a co-occurrence state, the above problem can be solved by adopting a structure for determining the word, but the word "co-occurrence" in "light" first is solved. weight ”is input, and then“ rig ”
When the word "word" co-occurring with "ht" is input, and "light" is input at the end, "light" is a co-occurrence of "right" even though "right" is an input error of "right" as described above. Since the state is in the state, the input is determined, and there is a problem that the user cannot recognize the input error.

【００１８】本構成では、上記構成により類似語の中で
同時に複数の単語が共起状態となることを禁止している
ため、上述の問題が生じることを防止している。In this configuration, a plurality of words among the similar words are prohibited from co-occurring at the same time by the above configuration, thereby preventing the above-mentioned problem from occurring.

【００１９】請求項５に記載した発明は、文字列を入力
する入力部と、単語毎に、文字列が類似するものとして
予め設定した類似語、および予め設定した使用頻度の低
い単語であるか否かを示す情報記憶する記憶部と、入力
された単語が使用頻度の低い単語であるか否かを前記記
憶部を参照して判断し、使用頻度の低い単語であれば、
該入力された単語の類似語を入力候補として表示部に表
示させる制御部と、を備えた。According to a fifth aspect of the present invention, there is provided an input unit for inputting a character string, a similar word set in advance as a similar character string for each word, and a word set in a predetermined low frequency of use. A storage unit for storing information indicating whether or not the input word is a word with a low use frequency, determining whether or not the input word is a word with a low use frequency;
A control unit for displaying a similar word of the input word on a display unit as an input candidate.

【００２０】この構成では、予め使用頻度の低い単語で
あると設定されている単語（稀にしか使用されない単
語）が入力されたときに、類似語を表示して、ユーザに
入力ミスでないかを確認させる。ここで、一般のユーザ
にとって殆ど使用することがない単語（使用頻度の低い
単語）が入力された場合、ユーザの意図している単語が
入力された文字列と類似する文字列の単語である可能性
が高い。すなわち、ユーザの入力ミスであった可能性が
高いと言える。このような場合、ユーザに入力ミスを確
認させるようにしたので、ユーザの入力ミスを略確実に
検出し、これをユーザに認識させることができる。In this configuration, when a word set in advance as a word with low frequency of use (a word that is rarely used) is input, a similar word is displayed to determine whether the user has made an input error. Ask for confirmation. Here, when a word that is rarely used by a general user (a word that is rarely used) is input, the word intended by the user may be a word having a character string similar to the input character string. High in nature. In other words, it can be said that there is a high possibility that the user made an input error. In such a case, the user is made to confirm the input error, so that the input error of the user can be detected almost certainly, and the user can recognize the input error.

【００２１】なお、請求項４に記載した発明は、請求項
１、２または３に記載した発明の構成に請求５に記載し
た発明の構成を加えた構成である。The invention described in claim 4 is a structure obtained by adding the structure of the invention described in claim 5 to the structure of the invention described in claim 1, 2 or 3.

【００２２】請求項６に記載した発明は、前記使用頻度
の低いか否かの情報を入力する手段を設けた。According to a sixth aspect of the present invention, there is provided a means for inputting information on whether or not the frequency of use is low.

【００２３】通常、単語の使用頻度はユーザによって異
なるものである。例えば、一般のユーザにとっては使用
頻度の低い単語であっても、特定のユーザが頻繁に使用
する単語であると、該特定のユーザに対してこの単語が
入力される度に入力ミスを確認させることは、該ユーザ
の文書作成効率を低下させることになる。Normally, the frequency of use of a word differs depending on the user. For example, even if a word is rarely used by a general user, if the word is frequently used by a specific user, an input error is confirmed each time the word is input to the specific user. This lowers the document creation efficiency of the user.

【００２４】この構成では、単語毎にユーザ自身が使用
頻度の低い単語であるか否かを設定することができるた
め、上記の問題が生じることがなく、ユーザの文書作成
効率を低下させることがない。In this configuration, the user can set for each word whether or not the word is an infrequently used word. Therefore, the above problem does not occur, and the user's document creation efficiency can be reduced. Absent.

【００２５】[0025]

【発明の実施の形態】図１はこの発明の実施形態である
文字列入力装置の構成を示すブロック図である。この実
施形態の文字列入力装置１は、文字列（英単語）を入力
する入力部２と、入力された文字列に対して後述する処
理を実行する制御部３と、英単語（以下、単に単語と言
う。）の文字列とその意味を対応させた英和辞書４ａお
よび文字列の類似する類似語を示す情報等を登録した類
似語テーブル４ｂを記憶した記憶部４と、作成中の文書
や入力された文字列等を表示する表示部５と、を備えて
いる。６は、制御部３において後述する処理を実行させ
るためのプログラムを記録したＣＤ−ＲＯＭやフロッピ
ィディスク等のプログラム記録媒体であり、このプログ
ラム記録媒体６から予め制御部３にプログラムが供給さ
れる。なお、制御部３にこのプログラムを記録したＲＯ
Ｍを設けておき、このＲＯＭから制御部３にプログラム
を供給する構成とすることもできる。FIG. 1 is a block diagram showing a configuration of a character string input device according to an embodiment of the present invention. The character string input device 1 of this embodiment includes an input unit 2 for inputting a character string (English word), a control unit 3 for executing a process to be described later on the input character string, and an English word (hereinafter simply referred to as “English word”). A storage unit 4 storing an English-Japanese dictionary 4a in which character strings and their meanings are associated with each other and a similar word table 4b in which information indicating similar words similar to each other in the character strings are registered. A display unit 5 for displaying an input character string and the like. Reference numeral 6 denotes a program recording medium, such as a CD-ROM or a floppy disk, on which a program for causing a later-described process to be executed by the control unit 3 is recorded. The program is supplied from the program recording medium 6 to the control unit 3 in advance. In addition, RO which recorded this program in control part 3
M may be provided, and a program may be supplied from the ROM to the control unit 3.

【００２６】図２は、記憶部に記憶されている英和辞書
の構成を示す図である。英和辞書４ａには、単語毎に単
語レコード１１が登録されている。単語レコード１１に
は英和辞書４ａにおける該レコードの位置を示す辞書ｉ
ｎｄｅｘ、単語の文字列（スペル）、該単語の意味、意
味上の観念が関連し同一の文書内で同時に使用される可
能性が高い共起関係にある共起語の単語レコード１１の
位置を示す共起語辞書ｉｎｄｅｘ、および、後述する類
似語テーブル４ｂに登録されている該単語の類似語レコ
ードの位置を示す類似語テーブルｉｎｄｅｘが登録され
ている。なお、共起語が無い単語は、共起語辞書ｉｎｄ
ｅｘに−１を登録している。例えば、図２に示す例は、
『ｗｅｉｇｈｔ』と『ｌｉｇｈｔ』とが共起関係にあ
り、『ｌｉｇｈｔ』と『ｃｏｌｏｒ』とが共起関係にあ
ることを示している。なお、この例では『ｌｉｇｈｔ』
は『ｗｅｉｇｈｔ』および『ｃｏｌｏｒ』の両方と共起
関係にあるが、『ｗｅｉｇｈｔ』と『ｃｏｌｏｒ』とは
共起関係にない。このように、単語Ａが単語Ｂおよび単
語Ｃと共起関係にあっても、単語Ｂと単語Ｃとが共起関
係にない場合もある。これは、単語Ａが単語Ｂおよび単
語Ｃと意味上の観念が関連していても、単語Ｂと単語Ｃ
の意味上の観念が関連するとは限らないからである。な
お、意味上の観念が関連しているかどうかについては、
上述のように辞書作成が自分のイメージで判断し、各単
語について共起語を予め設定している。また、『ｌｉｃ
ｅ』は共起語が存在しない例である。また、類似語のな
い単語は、類似語テーブルｉｎｄｅｘに−１を登録して
いる。図２では、類似語テーブル４ｂ（図３参照）に
『ｗｅｉｇｈｔ』の類似語レコードが登録されていない
ことを示している（類似語がないことを示してい
る。）。FIG. 2 is a diagram showing the structure of the English-Japanese dictionary stored in the storage unit. A word record 11 is registered for each word in the English-Japanese dictionary 4a. The word record 11 has a dictionary i indicating the position of the record in the English-Japanese dictionary 4a.
ndex, the character string (spelling) of the word, the meaning of the word, the meaning of the meaning, the position of the word record 11 of the co-occurring word that is likely to be used simultaneously in the same document and is likely to be used simultaneously. A co-occurrence word dictionary index and a synonym table index indicating the position of a synonym record of the word registered in a synonym table 4b to be described later are registered. Note that words without co-occurring words are described in the co-occurring word dictionary ind
ex is registered with -1. For example, the example shown in FIG.
This indicates that “weight” and “light” are in a co-occurrence relationship, and that “light” and “color” are in a co-occurrence relationship. In this example, "light"
Is co-occurring with both "weight" and "color", but "weight" and "color" are not. As described above, even when the word A has a co-occurrence relationship with the words B and C, the word B and the word C may not have a co-occurrence relationship. This is because even if the word A is related to the words B and C in a semantic sense, the words B and C
Is not always relevant. It should be noted that whether semantic ideas are relevant
As described above, the dictionary is determined based on its own image, and co-occurrence words are set in advance for each word. Also, "lic
"e" is an example in which no co-occurrence word exists. For words having no similar words, -1 is registered in the similar word table index. FIG. 2 shows that the similar word record of “weight” is not registered in the similar word table 4b (see FIG. 3) (indicating that there is no similar word).

【００２７】図３は類似語テーブルの構成を示す図であ
る。類似語テーブル４ｂには、類似語が存在する単語の
類似語レコード１５が登録されている（類似語が無い単
語にの類似語レコード１５は登録されていない。）。類
似語レコード１５には、類似語テーブル４ｂにおける該
レコードの位置を示すテーブルｉｎｄｅｘ、単語の文字
列、該単語の英和辞書４ａにおける単語レコード１１の
位置を示す辞書ｉｎｄｅｘ、作成中の文書で該単語が使
用される可能性が高いか否かを示す共起フラグ、使用頻
度が低い単語であるか否かを示す表示フラグ、および、
文字列の類似する単語の類似語レコード１５の位置を示
す類似語ｉｎｄｅｘが登録されている。各類似後レコー
ド１５には、類似する全ての単語が登録されているので
はなく、図３に矢印で示すように類似語は所謂チェーン
構造で関連付けられており、この類似語ｉｎｄｅｘがチ
ェーン構造のリンク情報となっている。すなわち、図示
の矢印で示すように『ｆｉｇｈｔ』、『ｌｉｇｈｔ』、
『ｎｉｇｈｔ』、『ｒｉｇｈｔ』の４つが類似語である
場合（他には類似語が無いものとする。）、『ｆｉｇｈ
ｔ』の類似語レコード１５に『ｌｉｇｈｔ』の類似語レ
コード１５を示すテーブルｉｎｄｅｘを登録し、『ｌｉ
ｇｈｔ』の類似語レコード１５に『ｎｉｇｈｔ』の類似
語レコード１５を示すテーブルｉｎｄｅｘを登録し、
『ｎｉｇｈｔ』の類似語レコード１５に『ｒｉｇｈｔ』
の類似語レコード１５を示すテーブルｉｎｄｅｘを登録
し、『ｒｉｇｈｔ』の類似語レコード１５に『ｆｉｇｈ
ｔ』の類似語レコード１５を示すテーブルｉｎｄｅｘを
登録することによって、こられの類似語をチェーン構造
で関連付けている。FIG. 3 is a diagram showing the structure of the similar word table. In the similar word table 4b, a similar word record 15 of a word having a similar word is registered (a similar word record 15 for a word having no similar word is not registered). The synonym record 15 includes a table index indicating the position of the record in the synonym table 4b, a character string of the word, a dictionary index indicating the position of the word record 11 in the English-Japanese dictionary 4a of the word, and the word in the document being created. A co-occurrence flag indicating whether or not the word is likely to be used, a display flag indicating whether the word is infrequently used, and
A synonym index indicating the position of a synonym record 15 of a word having a similar character string is registered. Not all the similar words are registered in each post-similar record 15, but the similar words are related in a so-called chain structure as shown by an arrow in FIG. Link information. That is, as shown by the arrows in the figure, “fight”, “light”,
If the four words “night” and “right” are similar words (there is no other similar word), “fight”
The table index indicating the synonym record 15 of “light” is registered in the synonym record 15 of “t”, and “li” is registered.
The table index indicating the synonym record 15 of “night” is registered in the synonym record 15 of “ght”,
"Right" is added to the synonym record 15 of "night"
A table index indicating the synonym record 15 of “right” is registered, and “fight” is registered in the synonym record 15 of “right”.
By registering a table index indicating the similar word record 15 of "t", these similar words are associated in a chain structure.

【００２８】なお、この実施形態では、英和辞書４ａと
類似語テーブル４ｂとを２つのファイルに分割し、各単
語についてその意味、類似語、共起語等を間接的に記憶
させたが、これらを１つのファイルにして各単語につい
てその意味、類似語、共起語等を直接的に記憶させても
よい。ここでは、ハードディスクに英和辞書４ａと類似
語テーブル４ｂとを記憶させた場合に、英和辞書４ａと
類似語テーブル４ｂのデータを読み出すときのシーク時
間を短縮し、アクセスが高速に行えるようにするために
分割している。また、類似語をチェーン構造で関連付け
るとしたが、各単語の類似語インデックスに類似語のテ
ーブルインデックスを全て登録するようにしてもよい。In this embodiment, the English-Japanese dictionary 4a and the similar word table 4b are divided into two files, and the meaning, similar words, co-occurrence words, etc. of each word are indirectly stored. Into a single file, the meaning, similar words, co-occurrence words, etc. of each word may be directly stored. Here, when the English-Japanese dictionary 4a and the synonym table 4b are stored in the hard disk, the seek time for reading the data of the English-Japanese dictionary 4a and the synonym table 4b is reduced, and the access can be performed at high speed. Is divided into Although the similar words are associated in a chain structure, all of the similar word table indexes may be registered in the similar word index of each word.

【００２９】以下、この実施形態にかかる文字列入力装
置の動作を詳細に説明する。図４〜図６は、この実施形
態にかかる文字列入力装置の処理を示すフローチャート
である。なお、制御部３は、ユーザが新規文書の作成を
開始するときには、全ての類似語レコード１５における
共起フラグ４ｂをオフする（０にする。）。また、この
実施形態では表示フラグがオンである（１である。）単
語が使用頻度の低い単語である。なお、表示フラグはユ
ーザが任意にセットまたはリセットすることができ、ユ
ーザ自身が使用環境に応じて、単語毎に使用頻度の低い
単語であるか否かを設定できる。ユーザは、入力部２を
操作して文字列（英文字）等を入力し、文書を作成す
る。Hereinafter, the operation of the character string input device according to this embodiment will be described in detail. 4 to 6 are flowcharts showing the processing of the character string input device according to this embodiment. When the user starts creating a new document, the control unit 3 turns off the co-occurrence flag 4b in all the similar word records 15 (sets it to 0). Further, in this embodiment, the word whose display flag is on (is 1) is a word that is used less frequently. The display flag can be arbitrarily set or reset by the user, and the user can set for each word whether or not the word is a less frequently used word according to the use environment. The user operates the input unit 2 to input a character string (English characters) and the like to create a document.

【００３０】制御部３は、入力部２においてキー操作等
による入力があると（ｎ１）、操作されたキーが文字キ
ー、スペースキー、またはその他のキーであるかを判定
する（ｎ２、ｎ３）。入力されたキーが文字キーであれ
ば、制御部３に設けられている入力バッファ（不図示）
に対応する文字を追加する（ｎ４）。なお、表示部５に
は入力バッファ内の文字列が表示されており、この表示
によってユーザが入力した文字列を確認することができ
る。また、英語では単語間にスペースを設けるというき
まりがあるため、操作されたキーがスペースキーである
と、１つの単語が入力されたとして、ここで入力された
単語に対して後述するｎ６以降の処理を実行する。ま
た、文字キーおよびスペースキーでない例えば改行キー
等のその他のキーが操作された場合には、操作されたキ
ーに対応する処理（改行処理等）を行う（ｎ５）。When there is an input by a key operation or the like on the input unit 2 (n1), the control unit 3 determines whether the operated key is a character key, a space key, or another key (n2, n3). . If the input key is a character key, an input buffer (not shown) provided in the control unit 3
Is added (n4). The character string in the input buffer is displayed on the display unit 5, and the character string input by the user can be confirmed by this display. Also, in English, there is a rule that a space is provided between words, so if the operated key is the space key, it is determined that one word is input, and that the input word is n6 and later described later. Execute the process. When another key other than the character key and the space key, such as a line feed key, is operated, a process (line feed process, etc.) corresponding to the operated key is performed (n5).

【００３１】ｎ６に進むと、制御部３は入力された文字
列と同一の文字列の単語（マッチする単語）の類似語レ
コード１５が類似語テーブル４ｂに登録されているか否
かを検索し、この検索で該当する類似語レコード１５が
無ければ、ｎ７からｎ８に進み、英和辞書４ａに入力さ
れた文字列と同一の文字列の単語（マッチする単語）が
あるか否かを検索し、マッチする単語があれば入力され
た文字列（単語）を確定し（ｎ９→ｎ１５）、ｎ１６に
進む。When proceeding to n6, the control section 3 searches whether or not a synonym record 15 of a word (matching word) having the same character string as the input character string is registered in the synonym table 4b. If there is no corresponding synonym record 15 in this search, the process proceeds from n7 to n8 to search for a word (matching word) having the same character string as the character string input to the English-Japanese dictionary 4a. If there is a word to be executed, the input character string (word) is determined (n9 → n15), and the process proceeds to n16.

【００３２】例えば、ユーザが『ｗｅｉｇｈｔ』と言う
文字列を入力した場合、この文字列と同一の文字列から
なる単語の類似語レコード１５が類似語テーブル４ａに
登録されていない（この実施形態では、『ｗｅｉｇｈ
ｔ』に類似する文字列の単語が存在しないものとしてい
る。）のでｎ７からｎ８に進む。また、この文字列と同
一の文字列からなる単語が英和辞書４ａに登録されてい
るのでｎ９からｎ１５に進んで入力された文字列『ｗｅ
ｉｇｈｔ』を確定する。For example, when the user inputs a character string “weight”, a similar word record 15 of a word having the same character string as this character string is not registered in the similar word table 4a (in this embodiment, , "Weigh
It is assumed that a word having a character string similar to “t” does not exist. ), The process proceeds from n7 to n8. Also, since a word consisting of the same character string as this character string is registered in the English-Japanese dictionary 4a, the character string "we
"right".

【００３３】一方、ｎ７でマッチする単語があると判定
すると、ｎ２０にジャンプしてこの単語の共起フラグが
オンしているかどうかを判定する。共起フラグがオンし
ていれば、後述するように他の単語に比べて作成中の文
書で使用される可能性が高い単語であるため、ユーザの
入力ミスである可能性が殆どない。この理由からｎ２０
で共起フラグがオンしていると、ｎ１５に進んで入力さ
れた文字列を確定する。一方、ｎ２０で共起フラグがオ
ンしていなければ、図５に示すｎ２１に進む。なお、ｎ
２１以降の処理は後述する。On the other hand, if it is determined at n7 that there is a matching word, the process jumps to n20 to determine whether the co-occurrence flag of this word is on. If the co-occurrence flag is on, the word is more likely to be used in the document being created as compared to other words, as described later, so that there is almost no possibility of a user input error. For this reason n20
If the co-occurrence flag is turned on, the program proceeds to n15 to determine the input character string. On the other hand, if the co-occurrence flag is not turned on at n20, the process proceeds to n21 shown in FIG. Note that n
The processing after 21 will be described later.

【００３４】ｎ９でもマッチする単語がなければ（入力
された文字列の単語が存在しなければ）ユーザの入力ミ
ス（スペルミス）であるため、英和辞書４ａを検索して
入力された文字列と類似する文字列の単語を候補単語と
して抽出する（ｎ１０）。表示部５では、これらの候補
単語を表示し（ｎ１１）、制御部３がユーザによって表
示された候補単語の中から１つの候補単語が選択される
のを待つ（ｎ１２）。ユーザが表示された候補単語の中
から１つの候補単語を選択すると、入力された文字列を
選択された候補単語の文字列に置き換え（ｎ１３）、こ
の置き換えた文字列（単語）を確定し（ｎ１４）、ｎ１
６に進む。If there is no matching word even in n9 (if there is no word of the input character string), it is an input error (spelling error) of the user, so that the English-Japanese dictionary 4a is searched and similar to the input character string. The word of the character string to be extracted is extracted as a candidate word (n10). The display unit 5 displays these candidate words (n11), and the control unit 3 waits until one candidate word is selected from the candidate words displayed by the user (n12). When the user selects one candidate word from the displayed candidate words, the input character string is replaced with the character string of the selected candidate word (n13), and the replaced character string (word) is determined ( n14), n1
Proceed to 6.

【００３５】例えば、ユーザが『ｗｅｉｇｈｔ』という
単語を誤って『ｗａｉｇｈｔ』と入力した場合（ただ
し、『ｗａｉｇｈｔ』という文字列は英和辞書４ａに登
録されていない単語（存在しない単語）である。）、制
御部３がこの文字列と類似する文字列の『ｗｅｉｇｈ
ｔ』や『ｗｒｉｇｈｔ』等の単語を英和辞書４ａより候
補単語として抽出し、表示部５にこれらの候補単語を表
示する（図７参照）。ここでユーザが自分の意図してい
た単語である『ｗｅｉｇｈｔ』を選択すると、入力され
た文字列『ｗａｉｇｈｔ』をここで選択された候補単語
の文字列『ｗｅｉｇｈｔ』に置き換え、ｎ１５に進んで
この文字列を確定する。なお、単語の選択は選択する候
補単語の上にカーソル（不図示）を移動させて、図示し
ていないマウスのボタンをクリックすることによって行
うようにしてもよいし、また、特定のキー操作によって
行うようにしてもよい。For example, when the user erroneously inputs the word "weight" as "weight" (however, the character string "weight" is a word not registered in the English-Japanese dictionary 4a (a word that does not exist)). , The control unit 3 sets a character string similar to this character string to “weigh”.
Words such as "t" and "right" are extracted from the English-Japanese dictionary 4a as candidate words, and these candidate words are displayed on the display unit 5 (see FIG. 7). When the user selects the word "weight" intended by the user, the input character string "weight" is replaced with the character string "weight" of the candidate word selected here, and the process proceeds to n15. Determine the character string. The selection of a word may be performed by moving a cursor (not shown) on a candidate word to be selected and clicking a mouse button (not shown), or by a specific key operation. It may be performed.

【００３６】制御部３は、ｎ１５で単語を確定すると、
ｎ１６で確定した単語と共起関係にある単語（共起語）
が存在するか否かを判定し、共起関係にある単語が存在
しなければｎ１に戻って上記の処理を繰り返す。なお、
共起関係にある単語が存在するか否かは、英和辞書４ａ
における確定した単語の単語レコード１１の共起語辞書
ｉｎｄｅｘが−１であるか否かによって判定できる。確
定した単語と共起関係にある単語が存在していると、ｎ
１７に進んで確定した単語と共起関係にある共起語の中
で類似語テーブル４ｂに類似語レコード１５が登録され
ている単語を抽出する（ｎ１７）。この処理は、英和辞
書４ａを検索して確定した単語の単語レコード１１の共
起語辞書ｉｎｄｅｘを抽出し、ここで抽出した共起語辞
書ｉｎｄｅｘで示される単語レコード１１の類似語テー
ブルｉｎｄｅｘを抽出する処理である。そして、類似語
テーブル４ａを検索して、ｎ１７で抽出した類似語テー
ブルｉｎｄｅｘで示される全ての類似語レコード１５の
共起フラグをオンする（ｎ１８）。すなわち、ｎ１５で
確定された単語と共起関係にあって、且つ、類似語が存
在する単語の共起フラグがオンされる。確定された単語
と共起関係にある単語は、他の単語に比べて入力される
可能性が高い単語であると言える。また、ｎ１８で共起
フラグをオンした単語毎に、文字列が類似する類似単語
の中に共起フラグがオンしているものがあれば、この単
語の共起フラグをオフし（ｎ１９）、ｎ１に戻って上記
の処理を実行する。When the control unit 3 determines the word at n15,
Words co-occurring with the word determined in n16 (co-occurrence words)
Is determined, and if there is no co-occurring word, the process returns to n1 and the above processing is repeated. In addition,
Whether or not there is a word having a co-occurrence relationship is determined by the English-Japanese dictionary 4a.
Can be determined based on whether or not the co-occurrence word dictionary index of the word record 11 of the determined word in is 1. If a word co-occurring with the determined word exists, n
Then, a word whose synonym record 15 is registered in the synonym table 4b is extracted from the co-occurrence words having a co-occurrence relation with the determined word (n17). In this processing, the co-occurrence word dictionary index of the word record 11 of the word determined by searching the English-Japanese dictionary 4a is extracted, and the similar word table index of the word record 11 indicated by the co-occurrence word dictionary extracted here is extracted. This is the processing to be performed. Then, the similar word table 4a is searched, and the co-occurrence flags of all the similar word records 15 indicated by the similar word table index extracted at n17 are turned on (n18). That is, the co-occurrence flag of a word having a co-occurrence relationship with the word determined in n15 and having a similar word is turned on. It can be said that a word that has a co-occurrence relationship with the determined word is a word that is more likely to be input than other words. Also, for each word for which the co-occurrence flag is turned on in n18, if there is a similar word whose character string is similar and the co-occurrence flag is on, the co-occurrence flag of this word is turned off (n19). Returning to n1, the above processing is executed.

【００３７】例えば、確定した単語が『ｗｅｉｇｈｔ』
であった場合、ｎ１７では『ｗｅｉｇｈｔ』の単語レコ
ード１１に登録されている共起語辞書ｉｎｄｅｘ『１３
９８１』、『２１０８３』等を抽出し、この共起語辞書
ｉｎｄｅｘで示される単語レコード１１を検索し、ここ
で検索した単語レコード１１の類似語テーブルｉｎｄｅ
ｘ『３６８９』等を抽出する。ｎ１８では、ｎ１７で抽
出した類似語テーブルｉｎｄｅｘで示される『ｌｉｇｈ
ｔ』等の類似語レコード１５を検索し、これらの類似語
レコード１５における共起フラグをオンする（１にす
る。）。さらに、ｎ１８で共起フラグをオンした単語
『ｌｉｇｈｔ』等に類似する単語の中で共起フラグがオ
ンしている単語があれば、この単語の共起フラグをオフ
する。例えば『ｎｉｇｈｔ』の共起フラグがオンしてい
れば、この共起フラグをオフする。これにより、ユーザ
が『ｌｉｇｈｔ』を入力する際に、誤って『ｎｉｇｈ
ｔ』と入力した場合には、後述するように入力候補を表
示して、ユーザに入力を確認させて、入力ミスを認識さ
せることができるとともに、正確に『ｌｉｇｈｔ』と入
力した場合には、ユーザに入力を確認させることなくこ
の単語を確定するため、ユーザの文書作成効率を低下さ
せることがない。For example, if the determined word is "weight"
, At n17, the co-occurrence word dictionary index “13” registered in the word record 11 of “weight”
981 "," 21083 ", etc., and searches the word record 11 indicated by the co-occurrence word dictionary index, and finds a similar word table inde of the word record 11 searched here.
x “3689” or the like is extracted. In n18, “light” shown in the similar word table index extracted in n17
t "and the like, and the co-occurrence flag in these similar word records 15 is turned on (set to 1). Further, if there is a word whose co-occurrence flag is on among words similar to the word "light" or the like for which the co-occurrence flag is on in n18, the co-occurrence flag of this word is turned off. For example, if the co-occurrence flag of “night” is on, this co-occurrence flag is turned off. Thereby, when the user inputs “light”, the user may mistakenly input “light”
When "t" is input, an input candidate is displayed as described later to allow the user to confirm the input and recognize an input error, and when "right" is input accurately, Since this word is determined without prompting the user to confirm the input, the document creation efficiency of the user is not reduced.

【００３８】次に、ｎ２０に戻って、類似語テーブル４
ｂに入力された文字列にマッチする単語があると判定し
た場合について説明する（図５参照）。制御部３は、入
力された文字列にマッチする単語（元の単語）の類似語
レコード１５における類似語ｉｎｄｅｘが示す類似語レ
コード１５に注目し（ｎ２１）、共起フラグがオンして
いるか否かを判定する（ｎ２２）。共起フラグがオンし
ていなければ、注目している類似語レコード１５の類似
語ｉｎｄｅｘが指す類似語レコード１５に注目し（ｎ２
３）、注目している類似語レコード１５が元の単語の類
似語レコード１５であるか否かを判定する（ｎ２４）。
ｎ２４で元の単語でなければ、ｎ２２に戻って上記の処
理を繰り返す。ｎ２２〜ｎ２４の処理によって、入力さ
れた文字列と類似する文字列の類似語の中に共起フラグ
がオンしている単語があるか否かを判定している。Next, returning to n20, the synonym table 4
The case where it is determined that there is a word that matches the character string input to b will be described (see FIG. 5). The control unit 3 focuses on the synonym record 15 indicated by the synonym index in the synonym record 15 of the word (original word) that matches the input character string (n21), and determines whether the co-occurrence flag is on. Is determined (n22). If the co-occurrence flag is not turned on, attention is paid to the synonym record 15 indicated by the synonym index of the synonym record 15 of interest (n2
3) It is determined whether or not the similar word record 15 of interest is a similar word record 15 of the original word (n24).
If n24 is not the original word, the process returns to n22 and the above processing is repeated. By the processing of n22 to n24, it is determined whether or not there is a word whose co-occurrence flag is on among the similar words of the character string similar to the input character string.

【００３９】ここで、共起フラグがオンしている類似語
があれば、この共起フラグがオンしている類似語は他の
単語に比べて作成中の文書で使用される（入力される）
可能性が高いため、ユーザの意図している単語がこの共
起フラグがオンしている類似語である可能性が高いと言
える。すなわち、ユーザの入力ミスであった可能性が高
い。ｎ２２で共起フラグがオンであると判定すると、ｎ
２５に進んで候補単語の表示形態をタイプ１に設定し、
ｎ２９以降の処理を実行する。Here, if there is a similar word for which the co-occurrence flag is on, the similar word for which the co-occurrence flag is on is used in the document being created as compared with other words (input). )
Since the possibility is high, it can be said that the word intended by the user is likely to be a similar word for which the co-occurrence flag is on. In other words, there is a high possibility that the user made an input error. When it is determined that the co-occurrence flag is on at n22, n
Proceeding to step 25, the display mode of the candidate word is set to type 1,
The processing after n29 is executed.

【００４０】例えば、図３に示すように『ｌｉｇｈｔ』
の共起フラグがオンである場合、『ｌｉｇｈｔ』に類似
する文字列『ｒｉｇｈｔ』が入力されると、ｎ２４で表
示部５における候補単語の表示形態をタイプ１に設定す
る。なお、ここでは『ｌｉｇｈｔ』と共起関係にある
『ｗｅｉｇｈｔ』が入力されたことによって、『ｌｉｇ
ｈｔ』が他の単語に比べて入力される可能性が高くなっ
ている。したがって、ユーザの意図している単語は『ｌ
ｉｇｈｔ』であるが、入力ミスにより『ｒｉｇｈｔ』と
入力された可能性が高いと言える。For example, as shown in FIG.
When the co-occurrence flag is turned on and a character string "right" similar to "light" is input, the display mode of the candidate word on the display unit 5 is set to type 1 in n24. It should be noted that, here, “weight”, which is co-occurring with “light”, is input to “light”.
ht ”is more likely to be input than other words. Therefore, the word intended by the user is “l
It can be said that there is a high possibility that "right" has been input due to an input error.

【００４１】一方、ｎ２２〜ｎ２４の処理において、共
起フラグがオンしている類似語が無ければｎ２４からｎ
２６に進んで、入力された文字列の単語が使用頻度の低
い単語であるか否か（表示フラグがオンしているか否
か）を判定する。ｎ２６で使用頻度の低い単語でないと
判定すると（表示フラグがオフであると）、ユーザの入
力ミスである可能性は低く、ｎ１５に進んで入力された
文字列の単語を確定し、ｎ１６以降の処理を実行する。On the other hand, in the processing of n22 to n24, if there is no similar word for which the co-occurrence flag is on, n24 to n
Proceeding to 26, it is determined whether or not the word of the input character string is a word of low use frequency (whether or not the display flag is on). If it is determined in n26 that the word is not an infrequently used word (if the display flag is off), the possibility of a user input error is low, and the process proceeds to n15 to determine the word of the input character string. Execute the process.

【００４２】また、ｎ２６で入力された文字列が使用頻
度の低い単語であると判定すると、ｎ２７で候補単語の
表示形態をタイプ２に設定してｎ２８に進む。ここで、
入力された文字列の単語の使用頻度が低く、且つ、この
入力された文字列と類似する文字列の単語が存在する場
合、ユーザの意図している単語は使用頻度の低い単語
（入力された文字列の単語）ではなく、入力した文字列
と類似する文字列の単語である可能性が高い。例えば、
ユーザが『ｌｉｃｅ』と入力した場合、『ｌｉｃｅ』の
使用頻度が低いと、ユーザの意図している単語が『ｌｉ
ｃｅ』に類似した『ｒｉｃｅ』等である可能性が高い。
すなわち、ユーザの入力ミスである可能性が高い。If it is determined in step n26 that the input character string is a word with a low frequency of use, the display mode of the candidate word is set to type 2 in step n27, and the flow advances to step n28. here,
When the frequency of use of the word of the input character string is low and there is a word of a character string similar to the input character string, the word intended by the user is a word of low frequency of use (input word It is highly likely that the word is not a character string word) but a character string similar to the input character string. For example,
If the user inputs “license” and the frequency of use of “license” is low, the word intended by the user is changed to “license”.
It is highly possible that the name is “rice” similar to “ce”.
That is, there is a high possibility that an input error has been made by the user.

【００４３】本実施形態の文字列入力装置では、ユーザ
の入力ミスであった可能性が高いと判定したときに、候
補単語の表示形態をタイプ１またはタイプ２に設定し
て、ｎ２８以降の処理を実行する。ｎ２８では、英和辞
書４ａを検索して入力された文字列の単語の意味と文字
列が類似する類似語および類似語の意味を抽出する。候
補単語の表示形態がタイプ１であるか否かを判定し（ｎ
２９）、タイプ１であれば図８（Ａ）に示すように入力
された文字列の単語とｎ２８で抽出した類似語とを候補
単語として表示し（ｎ３０）、ユーザに入力ミスでない
かを確認させる。なお、入力された文字列の単語を表示
しているのは、ユーザの入力ミスでない場合も想定され
るため、候補単語の中には入力された文字列の単語も含
ませている。なお、図８（Ａ）は、『ｌｉｇｈｔ』の共
起フラグがオンしているときに、文字列が類似する『ｒ
ｉｇｈｔ』が入力されたときの例である。制御部３はユ
ーザが表示されている候補単語の中から１つの候補単語
を選択すると（ｎ３１）、入力された文字列を候補単語
に置き換え（ｎ３２）、ｎ１５に進んでここで置き換え
た文字列の単語を確定し、ｎ１６以降の処理を実行す
る。なお、候補単語の選択は、マウスの操作によってカ
ーソル（不図示）を意図する単語の上に位置させ、マウ
スのボタンをクリックすることによって選択することが
できる。In the character string input device according to the present embodiment, when it is determined that there is a high possibility that an input error has been made by the user, the display form of the candidate word is set to type 1 or type 2 and the processing after n28 is performed. Execute In n28, the English-Japanese dictionary 4a is searched to extract similar words whose meaning is similar to the meaning of the word of the input character string and the meaning of the similar word. It is determined whether or not the display form of the candidate word is type 1 (n
29) If it is type 1, as shown in FIG. 8 (A), the word of the input character string and the similar word extracted in n28 are displayed as candidate words (n30), and it is confirmed to the user whether there is any input error. Let it. Note that it is assumed that a word of the input character string is not a user input error, and thus the candidate word includes a word of the input character string. FIG. 8A shows a case where the character string is similar to “r” when the co-occurrence flag of “light” is on.
This is an example when “light” is input. When the user selects one candidate word from the displayed candidate words (n31), the control unit 3 replaces the input character string with the candidate word (n32), proceeds to n15, and replaces the character string here. Is determined, and the processing after n16 is executed. Note that the candidate word can be selected by operating a mouse to position a cursor (not shown) on the intended word and clicking a mouse button.

【００４４】このように、入力された文字列と類似する
文字列の単語が存在することのみを条件としてユーザに
入力ミスでないかを確認させるのではなく、入力された
単語の類似語が他の単語に比べて作成中の文書で使用さ
れる可能性が高いとき（類似語と共起関係にある単語が
作成中の文書に既に使用されているとき）に、ユーザに
入力ミスでないかを確認させるので、ユーザにこの確認
を頻繁に行わせることを防止でき、文書の作成効率を低
下させることがないとともに、略確実にユーザに入力ミ
スを認識させることができる。なお、図８（Ａ）の表示
に代えて、図８（Ｂ）に示すように共起フラグがオンで
あった類似語のみ候補単語として表示するようにしても
よい（ここでは、『ｆｉｇｈｔ』、『ｎｉｇｈｔ』の共
起フラグがオフであったとする。）。このようにすれ
ば、表示が煩雑にならずユーザに単語の選択を一層容易
に行わせることができるようになる。As described above, instead of having the user confirm whether there is a mistake in inputting only on the condition that a word having a character string similar to the input character string exists, a similar word of the input word is replaced with another word. When the word is more likely to be used in the document being created than the word (when a word that has a co-occurrence with a similar word is already used in the document being created), check if the user has made an input mistake Therefore, it is possible to prevent the user from frequently performing this check, not to lower the document creation efficiency, and to allow the user to almost certainly recognize an input error. Note that instead of the display of FIG. 8A, only similar words for which the co-occurrence flag is on may be displayed as candidate words as shown in FIG. 8B (here, “fight”). , The co-occurrence flag of “night” is off.) In this way, the display is not complicated, and the user can more easily select a word.

【００４５】ｎ２９で表示形態がタイプ１でないと判定
すると、ｎ３４でタイプ２であるか否かを判定する（図
６参照）。タイプ２であれば入力単語とその意味を表示
部５に図９に示すように表示し（ｎ３５）、ユーザに入
力ミスでないかを確認させる。ここでは、上述のように
ユーザの入力した単語が使用頻度の低い単語であったこ
とから、ユーザの入力ミスである可能性が高いと判断
し、ユーザに入力ミスでないかを確認させている。図９
に示すタイプ２の表示形態では、入力された文字列の単
語の意味を表示することによって、ユーザに入力ミスで
ないかを確認させるようにしている。なお、図９におい
て、２１は表示ボタンであり、２２は類似語表示ボタン
である。If it is determined in step n29 that the display mode is not type 1, it is determined in step n34 whether or not the display mode is type 2 (see FIG. 6). In the case of type 2, the input word and its meaning are displayed on the display unit 5 as shown in FIG. 9 (n35), and the user is made to confirm whether there is an input error. Here, since the word input by the user is a word used infrequently as described above, it is determined that there is a high possibility of an input error by the user, and the user is checked whether there is an input error. FIG.
In the type 2 display mode shown in FIG. 5, the meaning of the word of the input character string is displayed to allow the user to confirm whether or not there is an input error. In FIG. 9, reference numeral 21 denotes a display button, and reference numeral 22 denotes a similar word display button.

【００４６】制御部３は、表示部５に図９に示すタイプ
２の表示画面が表示されると、候補選択、表示オフ、類
似語表示のいずれかの入力を待つ（ｎ３６〜ｎ３８）。
なお、表示オフボタン２１の上にカーソル（不図示）を
移動してマウスのボタンをクリックすることによって表
示オフの入力が行え、類似語表示ボタン２２上にカーソ
ルを移動してマウスのボタンをクリックすることによっ
て類似語表示の入力が行える。また、特定のキー操作に
よって、表示オフや類似語表示を入力できる構成として
もよい。ユーザが類似語表示を入力すると、ｎ３９で現
在の表示形態がタイプ２であるか否かを判定し、タイプ
２であれば表示形態をタイプ３に変更し（ｎ４０）、タ
イプ２でなければ（タイプ３であれば）表示形態をタイ
プ２に変更し（ｎ４１）、ｎ３４に戻る。なお、ｎ３４
でタイプ２でないと判定した場合、ｎ４２で入力単語と
その意味だけでなく図１０に示すように類似語の文字列
およびその意味も表示し、ｎ３６〜ｎ３８でユーザの入
力を待つ。なお、この図１０に示す状態で、ユーザが類
似語表示の入力を行うと、表示部５の画面は図９に示す
画面に戻る。When the type 2 display screen shown in FIG. 9 is displayed on the display unit 5, the control unit 3 waits for input of any of candidate selection, display off, and similar word display (n36 to n38).
Note that the display can be turned off by moving the cursor (not shown) on the display off button 21 and clicking the mouse button, and moving the cursor on the similar word display button 22 and clicking the mouse button. By doing so, a similar word display can be input. Further, the display may be turned off or a similar word may be input by a specific key operation. When the user inputs the similar word display, it is determined whether or not the current display mode is type 2 at n39. If the current display mode is type 2, the display mode is changed to type 3 (n40). The display mode is changed to type 2 (if it is type 3) (n41), and the process returns to n34. Note that n34
If it is determined that the input word is not Type 2, not only the input word and its meaning but also the character string of the similar word and its meaning as shown in FIG. 10 are displayed at n42, and the user's input is waited at n36 to n38. When the user inputs a similar word display in the state shown in FIG. 10, the screen of the display unit 5 returns to the screen shown in FIG.

【００４７】ユーザが候補単語を選択すると、入力され
た文字列を選択された文字列に置き換え（ｎ４３）、ｎ
１５に進んでここで置き換えた単語を確定し、ｎ１６以
降の処理を実行する。また、ユーザが表示オフの入力を
行うと、元の単語の類似語テーブル１５における表示フ
ラグをオフするとともに（ｎ４５）、ｎ１５に進んで入
力された文字列の単語を確定し、ｎ１６以降の処理を実
行する。When the user selects a candidate word, the input character string is replaced with the selected character string (n43), and n
Proceeding to step 15, the replaced word is determined, and the processing after n16 is executed. When the user inputs the display off, the display flag in the similar word table 15 of the original word is turned off (n45), the process proceeds to n15, the word of the input character string is determined, and the process after n16 is performed. Execute

【００４８】このように、ユーザが表示オフの入力を行
うと、入力した文字列の単語の表示フラグがオフされ、
この単語を使用頻度が低くない単語に設定される。この
ため、一般的には使用頻度の低い単語であっても、ユー
ザ自身にとっては頻繁に使用する単語であれば、使用頻
度の低くない単語に設定することができ、図９に示す画
面が表示されることがなくなる。したがって、このよう
なユーザに対して、この単語が入力される度に入力ミス
でないかを確認させることがなくなり、該ユーザの文書
の作成効率を低下させることもない。As described above, when the user inputs the display off, the display flag of the word of the input character string is turned off,
This word is set as a word that is not frequently used. For this reason, even if the word is generally used infrequently, if the word is frequently used by the user himself / herself, the word can be set as a word not infrequently used, and the screen shown in FIG. 9 is displayed. Will not be done. Therefore, it is not necessary for such a user to confirm whether or not there is an input error every time this word is input, and the document creation efficiency of the user is not reduced.

【００４９】なお、この実施形態では、本願発明の文字
列入力装置を英文ワープロに適用した例で説明を行った
が、ドイツ語等の他の言語のワープロにも適用すること
ができる。また、単語が入力された時点で、この単語が
入力ミスであるか否かを判定するとしたが、作成された
文書の先頭から順次単語を抽出し、抽出した単語毎に上
記のｎ６以降の処理を行い、入力ミスである可能性が高
いときに、ユーザに入力ミスでないかを確認させる文字
列入力装置とすることもできる。In this embodiment, the character string input device according to the present invention is applied to an English word processor. However, the present invention can be applied to other languages such as German. In addition, when a word is input, it is determined whether or not this word is an input error. However, words are sequentially extracted from the beginning of the created document, and the above-described processing after n6 is performed for each extracted word. And a character string input device that allows the user to confirm whether there is an input error when there is a high possibility of an input error.

【００５０】[0050]

【発明の効果】以上のように、この発明によれば、入力
された文字列と同一の文字列の単語が存在するか否かだ
けでなく、既に入力された単語との関係から、入力され
た文字列がユーザの意図する単語であるか否かを判定
し、ユーザの入力ミスである可能性が高いときに、ユー
ザに入力ミスでないかを確認させるため、ユーザの文書
作成の効率を低下させることなく、略確実にユーザに入
力ミスを認識させることができる。As described above, according to the present invention, not only is it determined whether or not a word having the same character string as the input character string exists, but also based on the relationship with the already input word. Decreases the efficiency of the user's document creation by determining whether the entered character string is a word intended by the user and, when it is highly probable that the user has made an input error, prompting the user to check for the input error. Without making the user input, it is possible to make the user recognize the input error almost certainly.

[Brief description of the drawings]

【図１】この発明の実施形態である文字列入力装置の構
成を示すブロック図である。FIG. 1 is a block diagram showing a configuration of a character string input device according to an embodiment of the present invention.

【図２】この実施形態の文字列入力装置における英和辞
書の構成を示す図である。FIG. 2 is a diagram showing a configuration of an English-Japanese dictionary in the character string input device of this embodiment.

【図３】この実施形態の文字列入力装置における類似語
テーブルの構成を示す図である。FIG. 3 is a diagram showing a configuration of a similar word table in the character string input device of the embodiment.

【図４】同実施形態の文字列入力装置の処理を示すフロ
ーチャートである。FIG. 4 is a flowchart showing processing of the character string input device of the embodiment.

【図５】同実施形態の文字列入力装置の処理を示すフロ
ーチャートである。FIG. 5 is a flowchart showing processing of the character string input device of the embodiment.

【図６】同実施形態の文字列入力装置の処理を示すフロ
ーチャートである。FIG. 6 is a flowchart showing processing of the character string input device of the embodiment.

【図７】表示部における表示画面の例を示す図である。FIG. 7 is a diagram illustrating an example of a display screen on a display unit.

【図８】表示部における表示画面の例を示す図である。FIG. 8 is a diagram illustrating an example of a display screen on a display unit.

【図９】表示部における表示画面の例を示す図である。FIG. 9 is a diagram illustrating an example of a display screen on a display unit.

【図１０】表示部における表示画面の例を示す図であ
る。FIG. 10 is a diagram illustrating an example of a display screen on a display unit.

[Explanation of symbols]

１−文字列入力装置２−入力部３−制御部４−辞書ファイル４ａ−英和辞書４ｂ−類似語テーブル５−表示部６−プログラム記憶媒体１１−単語レコード１５−類似語レコード２１−表示ボタン２２−類似語表示ボタン 1-character string input device 2-input unit 3-control unit 4-dictionary file 4a-English-Japanese dictionary 4b-similar word table 5-display unit 6-program storage medium 11-word record 15-similar word record 21-display button 22 -Similar word display button

Claims

[Claims]

An input unit for inputting a character string, and a co-occurrence set in advance for each word as a related word having a similar idea such as a similar word, a synonym or an antonym that is similar to the character string. A storage unit for directly or indirectly storing words and the presence or absence of a co-occurrence state; and when a similar word of the input word is in the storage unit, the similar word is an input candidate if the similar word is in a co-occurrence state. And a control unit for setting a co-occurrence state of the co-occurred word of the determined word when the input word is determined.

2. The character according to claim 1, wherein, if the input word itself is already in a co-occurrence state, the control unit determines the input word without displaying the similar word on a display unit. Column input device.

3. The character string input according to claim 1, wherein the control unit resets the co-occurrence state of the similar word if a similar word similar to the co-occurrence word of the determined word is in the co-occurrence state. apparatus.

4. The storage unit further stores information indicating whether or not each of the words is a low-frequency word set in advance, and the control unit determines that the input word is a low-frequency word. 4. The storage unit according to claim 1, wherein the storage unit determines whether or not there is a word, and if the word is infrequently used, a similar word of the input word is displayed on the display unit as an input candidate. A character string input device according to Crab.

5. An input unit for inputting a character string, and storing, for each word, a similar word set in advance as a similar character string and information indicating whether the word is a predetermined infrequently used word or not. A storage unit that determines whether or not the input word is an infrequently used word by referring to the storage unit. If the input word is an infrequently used word, a similar word of the input word is input. A character string input device comprising: a control unit configured to display a candidate on a display unit.

6. The character string input device according to claim 4, further comprising means for inputting information on whether the use frequency is low.

7. For each word, a similar word set in advance as a similar character string, a co-occurrence word set in advance as a meaning having a semantic idea such as a synonym or antonym, and presence / absence of a co-occurrence state are determined. It is stored directly or indirectly, and by referring to the stored contents, it is determined whether or not there is a similar word of the input word. If there is a similar word, if the similar word is in a co-occurrence state, the similar word A character string input method that displays words as input candidates and sets the co-occurrence state for the co-occurred words of the determined word when the input word is determined.

8. The character string input method according to claim 1, wherein if the input word itself is already in a co-occurrence state, the input word is determined without displaying the similar word on a display unit.

9. The character string input device according to claim 7, wherein if a similar word similar to the determined word co-occurrence word is in the co-occurrence state, the co-occurrence state of the similar word is reset.

10. In addition to the storage contents, information indicating whether or not each word is a word of low use frequency set in advance is stored, and whether or not the input word is a word of low use frequency is determined. 10. The character string input method according to claim 7, wherein if the word is a word that is not used frequently, a similar word similar to the input word is displayed as an input candidate.

11. For each word, a similar word set in advance as a similar character string and information indicating whether or not each word is a preset infrequently used word are stored. A reference is made to determine whether or not the input word is an infrequently used word. If the input word is an infrequently used word, a character string indicating a similar word similar to the input word as an input candidate input method.

12. The character string input method according to claim 10, wherein information on whether the use frequency is low is input.

13. For each word, a similar word set in advance as a character string is similar, a co-occurrence word set in advance as a semantic idea such as a synonym or antonym is related, and the presence / absence of a co-occurrence state. A first step of determining whether or not there is a similar word of the input word by referring to the memory unit, the first step being applied to an apparatus having a storage unit for directly or indirectly storing the similar word; A second step of determining whether the word is in a co-occurrence state; a third step of displaying the similar word as an input candidate if the word is in a co-occurrence state; And a fourth step of setting a co-occurrence state with respect to the utterance.

14. A storage unit for directly or indirectly storing, for each word, a similar word preset as a similar character string and information indicating whether or not the word is a preset word having a low frequency of use. A first step of determining whether or not an input word is an infrequently used word by referring to the storage unit; and, if the input word is an infrequently used word, Displaying a similar word similar to the input word as an input candidate.