JPH0877294A

JPH0877294A - Image processor for document

Info

Publication number: JPH0877294A
Application number: JP6212951A
Authority: JP
Inventors: Yasuto Ishitani; 康人石谷
Original assignee: Toshiba Corp
Current assignee: Toshiba Corp
Priority date: 1994-09-06
Filing date: 1994-09-06
Publication date: 1996-03-22

Abstract

PURPOSE: To provide the document image processor which can accurately specify the document format of a document, etc., and efficiently extract and read a character string. CONSTITUTION: Figure feature quantities extracted by a feature extraction part 12 from an input image of the document generated by an image input part 11 are grouped by a feature structuring part 13, and the relation of the respective features is extracted and managed. The kind of the format structure of the input document is estimated by using the structured features and information (format structure model) regarding the format structure of a document to be processed which is previously registered in a format structure kind identification part 15. A format structure information collation part 16 extracts detailed correspondence relation between the format structure model corresponding to the estimated kind of the format structure and the structured features of the input document. After noncorrespondence and contradict correspondence finding and correction part 18 obtains the matching of the correspondence relation, a document structure acquisition part 19 copies information regarding the previously registered document structure model to the input document on the basis of the correspondence relation to acquire the structure and relative knowledge of the input document.

Description

Detailed Description of the Invention

【０００１】[0001]

【産業上の利用分野】本発明は、例えば帳票などの文書
上に記載された文字の読みとり、データベースへの自動
入力、帳票画像の自動ファイリングに用いられる文書画
像処理装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to a document image processing apparatus used for reading characters written on a document such as a form, automatically inputting into a database, and automatic filing of form images.

【０００２】[0002]

【従来の技術】近年、書類形態で管理・利用されていた
文書を電子化して計算機に入力し、多用な用途で活用し
ている。様々な業務処理で用いられている文書には、表
形式のものが多く、文書中の特定の位置に記載されてい
る所望の文字列を所望の形式で計算機に効率よく入力
し、管理したいという要求が高まっている。これらの期
待に答えるためにこれまでに光学的文字読みとり装置が
開発され、実用化されてきた。2. Description of the Related Art In recent years, documents that have been managed and used in the form of documents are digitized and input to a computer, and are used for various purposes. Many documents used in various business processes are in tabular form, and I want to efficiently input and manage the desired character string written in a specific position in the document in the desired format on the computer. The demand is increasing. In order to meet these expectations, optical character reading devices have been developed and put into practical use.

【０００３】このような帳票上の特定の位置に記載され
ている文字列を読み取り対象とする場合、帳票の書式構
造を事前に知る必要がある。帳票のように定型的な書式
構造を持つ文書に対する文字列の読み取り処理では、書
式構造を事前に覚えさせておき、それに基づいて効率良
く文字列領域を特定し、読み取るようにしている。When reading a character string written in a specific position on such a form, it is necessary to know the format structure of the form in advance. In the process of reading a character string for a document having a standard format structure such as a form, the format structure is memorized in advance, and the character string area is efficiently specified and read based on the stored structure structure.

【０００４】このようなアプローチをとるものとして
は、対象文書の書式構造に関する知識とそれを利用する
処理を分離することで書式構造の多用性への拡張性を高
めている。これらのものは書式構造に関する知識として
文書構造の物理情報、例えば位置、大きさ、幾何学的関
係などの情報を用いている。代表的な研究として、文献
「信学論（Ｄ）、Ｊ71−Ｄ、10、pp.2050-2058(1988-1
0）」と文献「信学論（Ｄ−II）、Ｊ72−Ｄ−II、７、p
p.1029-1039(1989-07）」がある。According to such an approach, the expandability to the versatility of the format structure is enhanced by separating the knowledge about the format structure of the target document and the process using it. These items use physical information of the document structure as knowledge about the format structure, for example, information such as position, size, and geometrical relationship. As a typical research, reference is made to the literature “Shingaku theory (D), J71-D, 10, pp.2050-2058 (1988-1).
0) ”and the literature“ Shingaku Theory (D-II), J72-D-II, 7, p.
p.1029-1039 (1989-07) ”.

【０００５】一方、最近になって、文書構造に関する物
理情報を用いるだけでは対応できない帳票に対しても文
字列領域の特定を可能にすることが考えられている。こ
の場合、文字領域を空間的な隣接・接続関係に基づいた
論理情報で表現し、識別することを可能にしている。On the other hand, recently, it has been considered to make it possible to specify a character string area even for a form that cannot be handled only by using physical information regarding the document structure. In this case, the character area can be expressed and identified by logical information based on the spatial adjacency / connection relationship.

【０００６】このようなアプローチをとるものとして
は、種々の帳票文書の書式構造に関する構成規則を一般
化してメタ知識と表現されたものを用いて対象文書画像
の書式構造を認識している。代表的な研究として文献
「信学論（Ｄ−II）、Ｊ76−Ｄ−II、３、pp.534-545(1
993-03）」がある。In order to take such an approach, the format rule of the target document image is recognized by using the one expressed as meta-knowledge by generalizing the configuration rules regarding the format structure of various form documents. As a typical research, the literature “Biological theory (D-II), J76-D-II, 3, pp.534-545 (1)
993-03) ”.

【０００７】[0007]

【発明が解決しようとする課題】文献「信学論（Ｄ）、
Ｊ71−Ｄ、10、pp.2050-2058(1988-10）」や文献「信学
論（Ｄ−II）、Ｊ72−Ｄ−II、７、pp.1029-1039(1989-
07）」のようなアプローチをとる手法では、物理情報に
依存しているので文書が大幅にずれて入力されたり、拡
大・縮小によるスケール変換を受けている場合には対応
できない。また、文献「信学論（Ｄ−II）、Ｊ76−Ｄ−
II、３、pp.534-545(1993-03）」のようなアプローチを
とる手法では、入力文書は高品質で画像の劣化がないこ
とを前提としているため、処理対象文書の品質が悪く画
像情報が不足している場合には所望の処理結果を得られ
ないという問題がある。[Problems to be Solved by the Invention] The literature "Dieology (D),
J71-D, 10, pp. 2050-2058 (1988-10) "and the literature," Theoretical theory (D-II), J72-D-II, 7, pp.1029-1039 (1989-).
The method such as "07)" cannot be applied when the document is input with a large deviation or undergoes scale conversion due to enlargement / reduction because it relies on physical information. In addition, the literature, “Sociology (D-II), J76-D-
II, 3, pp.534-545 (1993-03) ”, it is assumed that the input document is of high quality and there is no deterioration of the image. If the information is insufficient, there is a problem that the desired processing result cannot be obtained.

【０００８】また、何れの手法においても、以下の
（１）〜（６）の問題がある。Further, any of the methods has the following problems (1) to (6).

【０００９】（１）表が複数混在している場合には対
応できない。(1) It is not possible to deal with the case where a plurality of tables are mixed.

【００１０】（２）表が分裂している場合には対応で
きない。(2) If the table is divided, it cannot be dealt with.

【００１１】（３）罫線がかすれていたり、欠落して
いる場合には対応できない。(3) If the ruled line is faint or missing, it cannot be dealt with.

【００１２】（４）罫線分布が局所的に変動している
場合には対応できない。(4) It is not possible to deal with the case where the ruled line distribution locally changes.

【００１３】（５）入力された文書の処理方法を限定
している（左右に９０度および１８０度回転している文
書に対応できない）。(5) The processing method of the input document is limited (it cannot support documents rotated 90 degrees and 180 degrees to the left and right).

【００１４】（６）種々の書式構造の文書を一括して
読み取らせる場合、適用すべき書式構造モデルの自動同
定ができない。(6) When documents of various format structures are read collectively, the format structure model to be applied cannot be automatically identified.

【００１５】本発明は上記事情に鑑みてなされたもの
で、表形式の帳票などの書式構造を正確に認識でき、効
率の良い文字列の領域の特定を可能にした文書画像処理
装置を提供することを目的とする。The present invention has been made in view of the above circumstances, and provides a document image processing apparatus capable of accurately recognizing a format structure of a tabular form or the like and efficiently specifying a character string area. The purpose is to

【００１６】[0016]

【課題を解決するための手段】本発明は、文書より入力
画像を生成する画像入力手段と、入力文書の書式構造を
認識するために用いられる処理対象文書の書式構造に関
する情報を登録する際に、正立した処理対象文書の書式
構造に関する情報を複数の所定角度で回転させたものを
発生させ、それぞれに正立したものから何度回転してい
るかに関する情報を付与し、それらすべてを処理対象文
書の書式構造に関する情報として登録する書式構造情報
登録手段と、前記画像入力手段により生成された入力画
像から線分と文字成分に関する図形特徴を抽出し、さら
に前記入力画像における文字成分以外の領域から線分に
関する特徴を罫線を構成する図形特徴とみなして抽出す
る特徴抽出手段と、前記特徴抽出手段より抽出された罫
線に関する図形特徴をグループ化することにより表に関
する特徴を抽出し、各表に関する特徴において罫線が交
差・接続する部分に生じる接合部に関する情報を抽出
し、それぞれの特徴間の関係を抽出・管理する特徴構造
化手段と、前記特徴構造化手段で得られた入力画像の構
造化された画像特徴と、予め前記書式構造情報登録手段
によって登録されている処理対象文書の書式構造に関す
る情報を用いて、類似度を計算し、最も類似度の高い書
式構造モデルあるいは類似度の高いものから順に複数個
の書式構造モデルあるいはある一定値以上の類似度を有
する書式構造モデルを選び、前記入力文書の書式構造の
種別を一つあるいは複数個の候補に絞りこむ書式構造種
別同定手段と、前記特徴構造化手段により得られた入力
文書の表に関する特徴と、前記書式構造情報登録手段に
より登録されている当該書式構造モデルを構成する表に
関する特徴との間い照合処理を行ない、表間対応関係を
獲得する表照合手段と、前記表照合手段により得られた
表の対応関係において入力文書の表を構成する罫線と同
罫線に対応付く書式構造モデルの表を構成する罫線との
間の対応関係を獲得する罫線照合手段と、前記照合処理
結果に基づき特徴間の対応付きの程度を表す照合度を計
算し、正しい対応付けが行なわれているか否かの判断を
行なう照合結果判定手段とにより構成されているモデル
照合手段と、前記モデル照合手段で選択された書式構造
文書と入力文書における構造化特徴間の対応付けにおい
て、不完全な対応付けおよび矛盾した対応付けを解消す
ることにより整合のとれた前記書式構造モデルと入力文
書の構造化された特徴間の対応関係を獲得する未対応・
矛盾対応発見修正手段と、前記モデル照合手段で入力画
像に対応付いた書式構造モデルが所定角度で回転させた
ものである場合には、その回転角度を正立する方向に入
力画像を回転し、正立した当該書式モデルと対応付ける
画像回転手段と、前記書式構造モデルと入力文書の構造
化された特徴間の対応関係に基づいて、予め登録されて
いる当該書式構造モデルに関する情報を入力文書にコピ
ーすることにより入力文書の書式構造と関連情報を獲得
する文書構造獲得手段とを具備し、この文書構造獲得手
段により得られる結果に基づいて入力画像の書式構造を
認識するように構成されている。SUMMARY OF THE INVENTION According to the present invention, an image input means for generating an input image from a document and an information regarding the format structure of a processing target document used for recognizing the format structure of the input document are registered. , Generates information about the format structure of an upright document to be processed at multiple predetermined angles, gives information about how many times it is rotated from the upright document, and processes all of them. Form structure information registration means for registering as information about the format structure of the document, and graphic features regarding line segments and character components are extracted from the input image generated by the image input means, and further extracted from areas other than the character components in the input image. Feature extraction means for extracting the features related to the line segment as the graphic features forming the ruled lines, and the graphic feature related to the ruled lines extracted by the feature extraction means. Feature structuring means for extracting features related to a table by grouping, extracting information about a joint portion at a portion where ruled lines intersect / connect in the features related to each table, and extracting / managing a relationship between the features. And calculating the similarity using the structured image features of the input image obtained by the feature structuring means and the information about the format structure of the processing target document registered in advance by the format structure information registration means. Then, a plurality of format structure models or a format structure model having a similarity of a certain value or more are selected in order from the format structure model having the highest similarity or the one having the highest similarity, and the type of the format structure of the input document is set to one. Format structure type identifying means for narrowing down to one or a plurality of candidates, features relating to the table of the input document obtained by the feature structuring means, and the format structure Correspondence between the table obtained by the table collating means and the table collating means for performing the collating process with the features relating to the tables constituting the format structure model registered by the information registering means to obtain the inter-table correspondence relationship. In the relation, ruled line collating means for acquiring the correspondence between the ruled lines forming the table of the input document and the ruled lines forming the table of the format structure model corresponding to the ruled line, and the correspondence between the features based on the result of the collation processing. Of the structured structure document selected by the model matching means, and a model matching means configured by a matching result determining means for calculating a matching degree indicating the degree of matching and determining whether or not correct matching is performed. In the correspondence between the structured features in the input document and the input document, the formal structure model and the input document are matched by eliminating incomplete correspondence and inconsistent correspondence. Uncorrespondence that acquires the correspondence between the structured features of
When the format structure model associated with the input image by the contradiction correspondence finding and correction means and the model matching means is rotated by a predetermined angle, the input image is rotated in a direction in which the rotation angle is erect, Based on the correspondence between the image rotation means associated with the upright format model and the structured features of the format structure model and the input document, information on the previously registered format structure model is copied to the input document. The document structure acquisition means for acquiring the format structure and the related information of the input document is provided, and the format structure of the input image is recognized based on the result obtained by the document structure acquisition means.

【００１７】[0017]

【作用】この結果、本発明は、入力文書画像から罫線に
関する図形特徴量と文字成分に関する図形特徴量を抽出
し、これらのかかわり合いから文字成分以外の領域で正
確に罫線の特徴量を抽出できる。また、この罫線特徴を
グループ化することにより表に関する特徴量が得られ、
さらに各表において罫線に関する特徴とそれらの交差・
接合により生じる接合部を抽出し、全体−部分関係で記
述・管理することにより図形特徴量に対してより豊富な
情報を付加することができる。それらを効率的に検索す
ることができる。これにより文書内に表が複数個混在し
ていても各々の情報を抽出することができ、また罫線に
関する特徴も表ごとに区別することができる。As a result, according to the present invention, the feature amount of the ruled line and the feature amount of the character component related to the ruled line are extracted from the input document image, and the feature amount of the ruled line can be accurately extracted in the area other than the character component based on the relation between them. Also, by grouping these ruled line features, the feature amount related to the table can be obtained,
Furthermore, in each table, the features related to ruled lines and their intersections
It is possible to add more abundant information to the graphic feature amount by extracting the joint portion generated by the joint and describing and managing the joint portion by the whole-part relation. You can search them efficiently. As a result, even if a plurality of tables are mixed in the document, the respective information can be extracted, and the characteristics regarding ruled lines can be distinguished for each table.

【００１８】本発明では、入力画像から得られた構造化
された特徴と予め登録されている書式構造モデルの構造
化特徴との間で照合処理を行なうことにより入力文書を
解釈するが、照合処理の前に書式構造モデルの種別を同
定することで登録されているすべての書式構造モデルと
の照合処理を行なうことを避けることができる。この処
理では、文書中に含まれている接合部の数や罫線の数を
用いているので入力文書と書式構造モデルの間で拡大縮
小に伴うスケール変換がなされていたり、表や罫線の構
成要素の大きさが部分的に変動していても安定した結果
を得ることができる。書式構造モデルの種別を同定する
時に唯一の結果を出力するのではなく複数個の候補を出
力することで同定誤りが生じないようにしている。候補
となっている書式構造モデルの構造化特徴と入力文書の
構造化特徴との間で行なわれる照合処理では、もっとも
良く対応付く書式構造モデルを選ぶことができ、得られ
た対応関係が正しく獲得されているか否かの判断を行な
うことにより処理結果の信頼性を高めている。このと
き、照合処理まず表単位に行なわれ、次いで罫線単位に
行なわれる。この結果、局所的にまったく同じ特徴量を
持つ罫線でもそれが所属する表が異なれば誤った対応付
けが行なわれることはない。According to the present invention, the input document is interpreted by performing the matching process between the structured feature obtained from the input image and the structured feature of the format structure model registered in advance. By identifying the type of the format structure model before, it is possible to avoid performing the matching process with all the registered format structure models. In this process, since the number of joints and the number of ruled lines included in the document are used, scale conversion is performed between the input document and the format structure model due to scaling, or the components of tables and ruled lines are used. It is possible to obtain a stable result even if the size of is partially varied. The identification error is prevented by outputting a plurality of candidates instead of outputting a single result when identifying the type of the format structure model. In the matching process performed between the structured features of the candidate format structure model and the structured features of the input document, the format structure model with the best correspondence can be selected, and the obtained correspondence relationship can be acquired correctly. The reliability of the processing result is improved by determining whether or not the processing result is obtained. At this time, the collating process is first performed for each table and then for each ruled line. As a result, even if a ruled line having exactly the same local feature amount locally belongs to a different ruled line, incorrect association is not performed.

【００１９】また、本発明では、対応関係に対してどち
らかの構造化特徴の欠落による不完全な対応付きや矛盾
した対応付きの有無およびこの箇所を発見し、それを解
消する。この結果、整合のとれた対応関係を獲得するこ
とができ、安定した処理結果を得ることができる。この
対応関係に基づいて予め登録されている書式構造モデル
に関連する情報を入力文書に複写することにより入力文
書の文書構造を獲得し、読みとるべき文字列の記載位置
を正確に特定することができる。Further, in the present invention, the presence or absence of incomplete correspondence or inconsistent correspondence due to the lack of one of the structured features in the correspondence relation and this portion are found and solved. As a result, a matching correspondence can be obtained, and a stable processing result can be obtained. The document structure of the input document can be acquired by copying the information related to the pre-registered format structure model based on this correspondence to the input document, and the position where the character string to be read can be accurately specified. .

【００２０】[0020]

【実施例】以下、図面を参照して本発明の一実施例を説
明する。図１は本実施例に係わる画像処理装置の概略構
成を示すブロック図である。An embodiment of the present invention will be described below with reference to the drawings. FIG. 1 is a block diagram showing the schematic arrangement of an image processing apparatus according to this embodiment.

【００２１】本発明は、画像入力部１１、特徴抽出部１
２、特徴構造化部１３、書式構造情報登録部１４、書式
構造種別同定部１５、書式構造情報照合部１６、未対応
・矛盾対応発見修正部１７、照合結果判定部１８、及び
文書構造獲得部１９の各機能部から構成されている。The present invention includes an image input unit 11 and a feature extraction unit 1.
2, feature structuring unit 13, form structure information registration unit 14, form structure type identifying unit 15, form structure information collating unit 16, uncorrespondence / contradiction correspondence finding and correcting unit 17, collation result judging unit 18, and document structure acquiring unit. It is composed of 19 functional units.

【００２２】また、処理対象とする帳票は、図２に示す
ように、罫線により文字列領域が規定されているものと
する。Further, it is assumed that the form to be processed has a character string area defined by ruled lines as shown in FIG.

【００２３】画像入力部１１は、スキャナ装置やＴＶカ
メラ、ＦＡＸデータの入力部などによって構成されるも
のである。画像入力部１１は、処理対象である帳票等の
文書の画像を検出してシステム内に取り込む。画像入力
部１１によって入力された画像（入力画像）は、特徴抽
出部１２に送られる。The image input unit 11 is composed of a scanner device, a TV camera, a FAX data input unit, and the like. The image input unit 11 detects an image of a document such as a form to be processed and takes it into the system. The image (input image) input by the image input unit 11 is sent to the feature extraction unit 12.

【００２４】特徴抽出部１２は、画像入力部１１から得
られた入力画像から、罫線や文字成分に関する幾何学的
な図形特徴量をそれぞれ基本特徴として抽出する。特徴
抽出部１２により抽出された基本特徴の集合は特徴構造
化部１３に送られる。The feature extraction unit 12 extracts, from the input image obtained from the image input unit 11, geometrical figure feature amounts regarding ruled lines and character components as basic features. The set of basic features extracted by the feature extraction unit 12 is sent to the feature structuring unit 13.

【００２５】特徴構造化部１３は、特徴抽出部１２によ
り抽出された基本特徴の集合について、例えば罫線に関
する特徴をグループ化することにより得られる表や、罫
線が交差・接続する部分に生じる接合部などに関する情
報を構造化特徴として抽出し、さらにそれぞれの特徴間
の関係を記述、管理する。The feature structuring unit 13 is a table obtained by, for example, grouping features related to ruled lines in the set of basic features extracted by the feature extraction unit 12, and a joint portion generated at a portion where ruled lines intersect / connect. Information related to etc. is extracted as a structured feature, and the relationship between each feature is described and managed.

【００２６】書式構造情報登録部１４は、システムとオ
ペレータとの対話的な入力作業により、処理対象文書に
関する知識の登録を行なう。本実施例では、処理対象文
書の種類ごとに用意された種々の知識の総体をモデルと
呼び、モデルを定義したときに用いた文書をモデル文書
と呼ぶことにする。モデルは例えば、そのモデル文書の
構造化特徴に関する情報などを有している。The format structure information registration unit 14 registers knowledge about the document to be processed by interactive input work between the system and the operator. In the present embodiment, the total of various knowledge prepared for each type of document to be processed is called a model, and the document used when the model is defined is called a model document. The model has, for example, information about the structured features of the model document.

【００２７】このとき、書式構造情報登録部１４は、正
立している構造化特徴から、右に９０度回転したもの、
左に９０度回転したもの、１８０度回転したものの３種
類の構造化特徴を発生させ、それぞれが正立したものか
ら何度回転されているものであるかという情報を付与し
て知識ベースとして格納してもよい。At this time, the format structure information registration unit 14 rotates the structure feature standing upright by 90 degrees to the right,
Generates three types of structured features, one rotated 90 degrees to the left and one rotated 180 degrees, and added as information about how many times each of them has been rotated from an upright one and stored as a knowledge base. You may.

【００２８】この他に、例えば罫線により規定されてい
る領域内に含まれる文字列を文字認識装置で認識・コー
ド化させるような場合には、各種情報を構造化特徴に関
連付けて知識ベースとして登録・管理するようにしても
よい。In addition to this, for example, when a character string included in an area defined by a ruled line is to be recognized and coded by a character recognition device, various kinds of information are registered as a knowledge base in association with structured features.・ It may be managed.

【００２９】各種情報には、例えば、文字認識対象領域
の指定に関する情報、文字認識対象領域の文字列方向の
情報、文字読み取り処理時に拘束条件として働く文字種
情報及び筆記形態情報、文字認識後処理のための項目属
性情報、文字認識結果の対応関係などの情報がある。The various information includes, for example, information relating to designation of the character recognition target area, information on the character string direction of the character recognition target area, character type information and writing form information which acts as a constraint condition at the time of character reading processing, and post-character recognition processing. There is information such as item attribute information and a correspondence relationship between character recognition results.

【００３０】書式構造種別同定部１５は、特徴構造化部
１３によって抽出された入力画像に対する構造化特徴
と、予め書式構造情報登録部１４によって登録されてい
るモデル文書のそれぞれの構造化特徴との間で、類似度
計算を行なうことにより入力文書のフォーマットの種別
を同定する。すなわち類似度値が高いモデルのフォーマ
ットが入力文書のフォーマットである可能性が高いもの
と判別する。書式構造種別同定部１５は、該当する構造
化特徴を次段の書式構造情報照合部１６に出力する。The format structure type identifying unit 15 includes a structured feature for the input image extracted by the feature structuring unit 13 and each structured feature of the model document registered in advance by the format structure information registration unit 14. In between, the type of format of the input document is identified by performing similarity calculation. That is, it is determined that the model format having a high similarity value is highly likely to be the input document format. The format structure type identifying unit 15 outputs the corresponding structured feature to the format structure information matching unit 16 in the next stage.

【００３１】なお、書式構造種別同定部１５は、最も類
似度値の高いモデルのみに注目するのではなく、例えば
ある事前に定められたしきい値以上の類似度値を示す全
てのモデル文書の構造化特徴を候補として、書式構造情
報照合部１６に入力構造化特徴と共に送り込むようにし
てもよい。It should be noted that the format structure type identifying unit 15 does not focus only on the model having the highest similarity value, for example, for all model documents showing a similarity value equal to or higher than a certain predetermined threshold value. The structured feature may be sent to the format structure information matching unit 16 together with the input structured feature as a candidate.

【００３２】書式構造情報照合部１６は、書式構造種別
同定部１５によって候補とされた全てのモデル文書と入
力文書の間で構造化特徴間の対応関係を獲得する。ま
た、書式構造情報照合部１６は、入力文書の構造化特徴
と各モデル文書の構造化特徴との対応付け結果に対して
対応付きの度合いを表す尺度（以後、照合度と呼ぶ）を
計算する。書式構造情報照合部１６は、構造化特徴間の
対応関係を示す情報、及び照合度を示す情報を照合結果
判定部１７に出力する。The format structure information collating unit 16 acquires the correspondence relation between the structured features between all the model documents and the input documents which are candidates by the format structure type identifying unit 15. Further, the format structure information matching unit 16 calculates a scale (hereinafter, referred to as a matching degree) indicating the degree of correspondence with the correspondence result between the structured feature of the input document and the structured feature of each model document. . The format structure information matching unit 16 outputs the information indicating the correspondence between the structured features and the information indicating the matching degree to the matching result determination unit 17.

【００３３】なお、書式構造種別同定部１５から複数の
モデル文書の構造化特徴が送られてきた場合には、書式
構造情報照合部１６は、さらに最大照合度を示すモデル
文書を選択・出力するようになっていてもよい。When the structured features of a plurality of model documents are sent from the format structure type identifying unit 15, the format structure information matching unit 16 further selects and outputs the model document showing the maximum matching degree. It may be like this.

【００３４】照合結果判定部１７は、書式構造情報照合
部１６によって得られた情報に基づいて、最大照合度を
示す文書モデルと入力文書との間で対応関係が取れてい
るか否かを判定する。ここで対応関係が取れていると判
定された場合には、後段の未対応・矛盾対応発見修正部
１８に着目入力文書とモデル文書間の対応関係に関する
情報が送られるようにする。対応関係が取れていないと
みなされる場合には、入力文書を棄却して、次の文書を
入力するようにオペレータに促す。Based on the information obtained by the format structure information matching unit 16, the matching result judging unit 17 judges whether or not there is a correspondence between the document model showing the maximum matching degree and the input document. . If it is determined that the correspondence is established, the information about the correspondence between the input document of interest and the model document is sent to the uncorresponding / contradiction correspondence finding / correcting unit 18 in the subsequent stage. When it is considered that the correspondence is not established, the input document is rejected and the operator is urged to input the next document.

【００３５】未対応・矛盾対応発見修正部１８は、書式
構造情報照合部１６で獲得された入力文書の構造化特徴
とモデル文書の構造化特徴の対応関係において、誤った
特徴の抽出や必要な特徴の欠落のために不完全な対応や
矛盾した対応が生じているか否かを発見する。未対応・
矛盾対応発見修正部１８は、不完全な対応や矛盾した対
応を発見した場合には、それらを解消することにより整
合のとれた入力・モデル間の対応関係を獲得した上で、
文書構造獲得部１９に出力する。The non-correspondence / contradiction correspondence finding / correction unit 18 extracts an erroneous feature in the correspondence relation between the structured feature of the input document and the structured feature of the model document acquired by the format structure information collation unit 16 and extracts a necessary feature. Discover whether incomplete or inconsistent correspondences occur due to missing features. Not compatible·
When the inconsistent correspondence finding / correcting unit 18 finds an incomplete correspondence or an inconsistent correspondence, the inconsistent correspondence finding / correcting unit 18 eliminates them to obtain a matching correspondence between the input and the model, and
It is output to the document structure acquisition unit 19.

【００３６】文書構造獲得部１９は、未対応・矛盾対応
発見修正部１８で得られた整合のとれた入力文書とモデ
ル文書の構造化特徴間の対応関係に基づいて、予め書式
構造情報登録部１４で登録されているモデル文書に関す
る知識を入力文書にコピーすることにより入力文書の文
書構造及び関連知識を獲得する。The document structure acquisition unit 19 preliminarily uses the format structure information registration unit based on the correspondence between the structured features of the matched input document and the model document obtained by the uncorrespondence / contradiction correspondence finding / correction unit 18. The document structure of the input document and the related knowledge are acquired by copying the knowledge about the model document registered in 14 to the input document.

【００３７】次に、上述した必須構成要素を含む具体的
な文書画像処理システムについて説明する。ここでは、
図３に示すように、文字認識装置と組み合わせた文書画
像処理システムについて説明する。このシステムは、入
力文書から得られた入力画像において、特定の位置に記
載されている文字列領域を自動的に抽出し、さらにその
文字列画像を認識・コード化して、所望の出力様式で出
力するという動作をするものである。以後、図２に示す
文書（以下、特に断らない限り図２の文書を入力文書と
呼ぶ）を用いて説明する。Next, a specific document image processing system including the above-mentioned essential components will be described. here,
As shown in FIG. 3, a document image processing system combined with a character recognition device will be described. This system automatically extracts the character string area described at a specific position in the input image obtained from the input document, recognizes and codes the character string image, and outputs it in the desired output format. It is the action of doing. Hereinafter, description will be made using the document shown in FIG. 2 (hereinafter, the document of FIG. 2 is referred to as an input document unless otherwise specified).

【００３８】図３に示すように、文書画像処理システム
は、画像入力部２１、２値化処理部２３、前処理部２
５、特徴抽出部２７、特徴構造化部２９、モデル登録部
３１、フォント種別同定部３３、モデル照合部３５、照
合結果判定部３７、未対応矛盾対応発見修正部３９、文
書構造獲得部４１、文字列領域抽出部４３、文字認識部
４５、及び文字認識結果出力部４７によって構成されて
いる。As shown in FIG. 3, the document image processing system includes an image input section 21, a binarization processing section 23, and a preprocessing section 2.
5, feature extraction unit 27, feature structuring unit 29, model registration unit 31, font type identification unit 33, model collation unit 35, collation result determination unit 37, uncorresponding contradiction correspondence finding and correction unit 39, document structure acquisition unit 41, The character string area extraction unit 43, the character recognition unit 45, and the character recognition result output unit 47 are included.

【００３９】画像入力部２１は、図１において説明した
画像入力部１１と同じものとして説明を省略する。The image input section 21 is the same as the image input section 11 described with reference to FIG.

【００４０】２値化処理部２３は、画像入力部２３から
取り込まれた文書画像を、公知である２値化処理により
白と黒の２値の画像データに変換し、前処理部２５に出
力する。The binarization processing unit 23 converts the document image fetched from the image input unit 23 into binary image data of white and black by a known binarization process, and outputs it to the preprocessing unit 25. To do.

【００４１】前処理部２５は、２値化処理部２３から出
力された２値画像について、例えば文献「信学技報、Ｐ
ＲＵ９２−３２、１９９２」に記載されている傾き検出
・補正処理により、傾きのない２値画像に変換する。さ
らに、前処理部２５は、傾きが補正された２値画像に対
して、公知である黒連結成分抽出処理により、連結する
黒画素のまとまりを囲む外接矩形枠を生成し、その大き
さ（縦幅と横幅の長さ）や位置座標値を抽出、管理し、
その結果を特徴抽出部２７に出力する。The pre-processing unit 25 processes the binary image output from the binarization processing unit 23, for example, in the literature “Science Technical Report, P.
RU92-32, 1992 ”, the image is converted into a binary image having no inclination by the inclination detection / correction processing. Further, the preprocessing unit 25 generates a circumscribing rectangular frame surrounding a group of black pixels to be connected to the binary image whose inclination has been corrected by a known black connected component extraction process, and determines its size (vertical length). (Width and width) and position coordinate values are extracted and managed,
The result is output to the feature extraction unit 27.

【００４２】なお、入力文書の位置座標は左上端を原点
とし、ｘ座標値は右方向に次第に大きくなり、ｙ座標値
は下方向に次第に大きくなるように定義されているもの
とする。本実施例で触れる文書画像は全てこの座標系で
定義されている。この内、縦幅と横幅の長さが、それぞ
れ動的に検出されたしきい値th1 、th2 よりも小さく、
かつ最近傍の他の黒連結成分矩形から距離th3 以上離れ
ているものを、ノイズあるいは網点であるとして除去す
るようにしてもよい。The position coordinates of the input document are defined such that the upper left corner is the origin, the x coordinate value gradually increases in the right direction, and the y coordinate value gradually increases in the downward direction. All the document images touched in this embodiment are defined in this coordinate system. Of these, the width and height are smaller than the dynamically detected thresholds th1 and th2, respectively.
In addition, it is also possible to remove those that are separated from the other nearest black connected component rectangle by a distance th3 or more as noise or halftone dots.

【００４３】以後、画像入力部２１から入力された文書
画像に対して、２値化処理部２３における２値化処理、
前処理部２５における前処理を施して得た画像を入力文
書画像または単に入力画像と呼ぶ。入力画像は、特徴抽
出部２７に送られる特徴抽出部２７は、前処理部２５か
ら得られた入力画像から幾何学的な図形特徴を抽出する
ものであり、図４に示すように、線分抽出部２７ａ、文
字候補矩形抽出部２７ｂ、及び罫線特徴抽出部２７ｃに
よって構成されている。特徴抽出部２７は、以下に示す
手順で幾何学的な図形特徴を抽出する。After that, the document image input from the image input unit 21 is binarized by the binarization processing unit 23.
An image obtained by performing the preprocessing in the preprocessing unit 25 is called an input document image or simply an input image. The input image is sent to the feature extraction unit 27. The feature extraction unit 27 extracts geometrical graphic features from the input image obtained from the preprocessing unit 25. As shown in FIG. The extraction unit 27a, the character candidate rectangle extraction unit 27b, and the ruled line feature extraction unit 27c are configured. The feature extraction unit 27 extracts geometrical graphic features by the following procedure.

【００４４】まず、線分抽出部２７ａは、例えば、以下
に述べる手順に基づいて入力画像から線分を抽出する。
このとき線分抽出処理は、垂直方向と水平方向の２つの
方向に限定して実施されるようにしてもよい。具体的な
例として、垂直方向の線分を抽出する場合について説明
する。なお、水平方向の線分についても同様の手順によ
り抽出することができる。First, the line segment extraction unit 27a extracts a line segment from the input image based on the procedure described below, for example.
At this time, the line segment extraction process may be performed only in two directions, that is, the vertical direction and the horizontal direction. As a specific example, a case of extracting a line segment in the vertical direction will be described. Note that line segments in the horizontal direction can also be extracted by the same procedure.

【００４５】Ｓｔｅｐ１：文書画像を垂直方向に順次走
査し、各走査線において、予め定めた例えば３ドット以
上の長さを持つ連続する黒画素の連なりの集合（ＢＬ＝
｛ｂｌi ｜ｉ＝１，２，…，ｎ｝）を抽出する。Step 1: A document image is sequentially scanned in the vertical direction, and a series of consecutive black pixels (BL ==) having a predetermined length of, for example, 3 dots or more is scanned in each scanning line.
{Bli | i = 1, 2, ..., N}) is extracted.

【００４６】Ｓｔｅｐ２：ＢＬの要素のうち、水平方向
に隣接しているものを統合してまとめることにより、さ
らに、その集合（ＢＬＧ＝｛ｂｌｇj ｜ｊ＝１，２，…
Ｍ｝）を抽出する。Step 2: Of the elements of BL, those that are adjacent in the horizontal direction are integrated and put together, and the set (BLG = {blgj | j = 1, 2, ...
M}) is extracted.

【００４７】Ｓｔｅｐ３：ＢＬＧの各要素に含まれるＢ
Ｌにおいて、垂直方向に最も長い黒画素の連なりｂｌma
x を抽出し、その長さｂｌmax のα（例えばα＝０．
３）倍未満の長さを持つｂｌi を削除する。Step3: B included in each element of BLG
In L, a series of black pixels that are longest in the vertical direction blma
x is extracted, and its length blmax is α (for example, α = 0.
3) Delete bli with length less than double.

【００４８】Ｓｔｅｐ４：残ったｂｌi を内接する矩形
を抽出し、その集合（ＲＬ＝｛ｒｌk ｜ｋ＝１，２，
…，Ｐ｝）を抽出する（このとき、ｒｌk とそれを構成
するｂｌi とを相互に関連づける）。ｒｌk の左上端お
よび右下端のｙ座標値を検出し、それぞれｒｌk ｙ1 、
ｒｌk ｙ2 とする。Step 4: A rectangle inscribed in the remaining bli is extracted and its set (RL = {rlk | k = 1, 2,
, P}) (at this time, rlk and bli that compose it are correlated with each other). The y coordinate values of the upper left corner and the lower right corner of rlk are detected, and rlk y1, respectively.
Let rlk y2.

【００４９】Ｓｔｅｐ５：ＲＬの各要素を水平方向（左
から右への方向）に順次走査し、各走査線において最初
に出現するｂｌi のｘ座標（ｘis）と最後に出現するｂ
ｌiのｘ座標（ｘie）を保持する。Step 5: Each element of RL is sequentially scanned in the horizontal direction (from left to right), and the x coordinate (xis) of bli that first appears in each scanning line and the b that appears last in bli.
Hold the x coordinate (xie) of li.

【００５０】Ｓｔｅｐ６：各ｒｌk の各水平走査線にお
いて得られたｘisの集まりと、ｅieの集まりの平均値ｒ
ｌk ｘ1 ，ｒｌk ｘ2 をそれぞれ計算する。Step 6: The average value r of the collection of xis and the collection of eie obtained in each horizontal scanning line of each rlk.
lk x1 and rlk x2 are calculated respectively.

【００５１】Ｓｔｅｐ７：各ｒｌk の左上端および右下
端の座標をそれぞれ（ｒｌk ｘ1 ，ｒｌk ｙ1 ）、（ｒ
ｌk ｘ2 ，ｒｌk ｙ2 ）に設定し、それを線分として抽
出する。Step 7: The coordinates of the upper left corner and the lower right corner of each rlk are (rlk x1, rlk y1) and (r
lk x2, rlk y2) and extract it as a line segment.

【００５２】線分抽出部２７ａにより、入力画像から抽
出された線分の例を図５に示している。図５に示すよう
に、この段階では文字の部分において、文字を構成する
短い線分を検出している。これらを取り除くように以下
の処理を行なってもよい。すなわち、線分抽出部２７ａ
による線分抽出処理に前後して、あるいは同時に黒連結
成分群に対して、文字候補矩形抽出部２７ｂは、以下に
述べる手順により、文字とみなすことのできる黒連結成
分（以後、文字候補矩形と呼ぶ）を選出する。FIG. 5 shows an example of the line segment extracted from the input image by the line segment extraction unit 27a. As shown in FIG. 5, at this stage, short line segments forming a character are detected in the character portion. The following processing may be performed to remove these. That is, the line segment extraction unit 27a
Before or after the line segment extraction processing by, the character candidate rectangle extraction unit 27b uses the procedure described below to determine a black connected component that can be regarded as a character (hereinafter referred to as a character candidate rectangle). Call).

【００５３】Ｓｔｅｐ１：各黒連結成分を内接する矩形
を抽出し、その縦幅ｃｈと横幅ｃｗを求める。Step 1: A rectangle inscribed in each black connected component is extracted, and its vertical width ch and horizontal width cw are obtained.

【００５４】Ｓｔｅｐ２：ｃｈとｃｗの各値に対して出
現頻度を求め、最頻値を抽出する。最頻値を示すｃｈを
入力文書中の文字の平均的な文字高さ（ＣＨ）、最頻値
を示すｃｗを入力文書中の文字の平均的な文字幅（Ｃ
Ｗ）とみなす。Step 2: The appearance frequency is calculated for each value of ch and cw, and the mode value is extracted. The ch indicating the mode value is the average character height (CH) of the characters in the input document, and the cw indicating the mode value is the average character width (C) of the character in the input document.
W).

【００５５】Ｓｔｅｐ３：（ＣＷ−ｔｈ）≦ｃｗ≦（Ｃ
Ｗ＋ｔｈ）かつ（ＣＨ−ｔｈ）≦ｃｈ≦（ＣＨ＋ｔｈ）
を満たす黒連結成分を文字候補矩形として抽出する。Step 3: (CW-th) ≤cw≤ (C
W + th) and (CH-th) ≦ ch ≦ (CH + th)
Black connected components that satisfy the above are extracted as character candidate rectangles.

【００５６】文字候補矩形抽出部２７ｂにより抽出され
た文字候補矩形の抽出結果例を図６（文字部を外枠矩形
で囲んだ例）に示している。An example of the extraction result of the character candidate rectangles extracted by the character candidate rectangle extraction unit 27b is shown in FIG. 6 (an example in which the character portion is surrounded by an outer frame rectangle).

【００５７】なお、文字候補矩形抽出処理を線分抽出処
理より先に行ない、得られた文字候補矩形以外の入力画
像上の部分に対して垂直方向と水平方向の両方向に、以
下に述べるフィルタリング処理を適用して、罫線がかす
れていたり、とぎれているような場合でも線分抽出処理
結果を安定させるようにしてもよい。The character candidate rectangle extraction process is performed before the line segment extraction process, and the filtering process described below is performed in both the vertical direction and the horizontal direction with respect to the portion other than the obtained character candidate rectangle on the input image. May be applied to stabilize the line segment extraction processing result even when the ruled line is faint or broken.

【００５８】この場合、特徴抽出部２７は、図７に示す
ように構成される。すなわち、文字候補矩形抽出部２７
ｂの処理結果がフィルタリング処理部２７ｄに出力され
フィルタリング処理が施される。そして、フィルタリン
グ処理の後に、線分抽出部２７ａによる線分抽出処理実
行される。In this case, the feature extraction unit 27 is constructed as shown in FIG. That is, the character candidate rectangle extraction unit 27
The processing result of b is output to the filtering processing unit 27d and subjected to filtering processing. Then, after the filtering process, the line segment extracting process is executed by the line segment extracting unit 27a.

【００５９】フィルタリング処理部２７ｄによるフィル
タリング処理は、例えば、２値画像において水平（垂
直）方向の走査線上のある決まった長さ以内で連続する
白画素を全て黒画素に置き換えるという処理で実現され
る。これにより、かすれやとぎれのある罫線部分が補正
されるので、線分抽出処理結果が安定される。The filtering process by the filtering process unit 27d is realized by, for example, a process of replacing all continuous white pixels within a predetermined length on a horizontal (vertical) scanning line in a binary image with black pixels. . As a result, the ruled line portion having faintness or discontinuity is corrected, so that the line segment extraction processing result is stabilized.

【００６０】罫線特徴抽出部２７ｃは、線分抽出部２７
ａによって抽出された線分群において、図８に示すよう
な文字候補矩形に交差あるいは包含されている線分を除
去し、残った線分を罫線特徴として抽出する。The ruled line feature extraction unit 27c is a line segment extraction unit 27.
In the line segment group extracted by a, line segments that intersect or are included in the character candidate rectangle as shown in FIG. 8 are removed, and the remaining line segments are extracted as ruled line features.

【００６１】この時点では、各罫線特徴は、入力画像に
おける当該画像を囲む外接矩形の左上端と右下端の位置
座標、外接矩形の縦幅及び横幅などの種々の情報で表現
されるようにしてもよい。また、罫線特徴に「対応付い
た罫線特徴の識別番号」を用意しておいて、後段のモデ
ル照合部３５で実施される照合処理において、対応付い
た相手の識別番号を格納するようにしてもよい。At this point, each ruled line feature is represented by various information such as the position coordinates of the upper left and lower right edges of the circumscribing rectangle surrounding the image in the input image, and the vertical and horizontal widths of the circumscribing rectangle. Good. In addition, a ruled line feature “corresponding ruled line feature identification number” may be prepared, and the corresponding partner identification number may be stored in the matching process performed by the model matching unit 35 in the subsequent stage. Good.

【００６２】得られた罫線特徴の集合は、さらに水平罫
線と垂直罫線の２種類に分類され、水平罫線特徴の集合
とその要素数、垂直罫線特徴の集合とその要素数が抽出
される。The obtained set of ruled line features is further classified into two types of horizontal ruled lines and vertical ruled lines, and the set of horizontal ruled line features and the number of elements thereof, and the set of vertical ruled line features and the number of elements thereof are extracted.

【００６３】入力画像に対する罫線特徴抽出結果の例を
図９に示す。ここで、入力画像から抽出された水平罫線
の集合（水平罫線特徴集合）と垂直罫線の集合（垂直罫
線特徴集合）の各要素に、ＩＨＬ＝（ｉｈｌ₁，ｉｈｌ₂，ｉｈｌ₃，ｉｈｌ₄，
ｉｈｌ₅，ｉｈｌ₆，ｉｈｌ₇），ＩＶＬ＝（ｉｖｌ₁，ｉｖｌ₂，ｉｖｌ₃，ｉｖｌ₄，
ｉｖｌ₅，ｉｖｌ₆，ｉｖｌ₇，ｉｖｌ₈，ｉｖｌ₉）となるように識別番号を付与する。FIG. 9 shows an example of ruled line feature extraction results for the input image. Here, IHL = (ihl ₁ , ihl ₂ , ihl ₃ , ihl ₄ , ihl ₄ , ihl ₄ , ihl ₄ , ihl ₄ , ihl ₄ , ihl ₄ , ihl ₄ , ihl ₄ , ihl ₄ , ihl ₄ , ihl ₄ , ihl ₄ , ihl ₄ , ihl ₄ , ihl ₄ ,
ihl ₅ , ihl ₆ , ihl ₇ ), IVL = (ivl ₁ , ivl ₂ , ivl ₃ , ivl ₄ ,
(ivl ₅ , ivl ₆ , ivl ₇ , ivl ₈ , ivl ₉ ) are assigned identification numbers.

【００６４】また、水平罫線特徴集合の要素数をｉｈ-n
um、垂直罫線特徴集合の要素数をｉｖ-numとし、水平罫
線特徴集合と垂直罫線特徴集合をまとめて入力罫線特徴
集合と呼ぶことにする。入力罫線特徴集合は、後段の特
徴構造化部２９に送られる。以後の説明において上記記
号を用いることにする。The number of elements of the horizontal ruled line feature set is ih-n.
um and the number of elements of the vertical ruled line feature set are iv-num, and the horizontal ruled line feature set and the vertical ruled line feature set are collectively referred to as an input ruled line feature set. The input ruled line feature set is sent to the subsequent feature structuring unit 29. In the following description, the above symbols will be used.

【００６５】特徴構造化部２９は、特徴抽出部２７から
得られた入力罫線特徴集合から以下に述べる手順で罫線
に関する構造化特徴を抽出するもので、図１０に示すよ
うに、罫線グループ化処理部２９ａ、表特徴抽出部２９
ｂ、罫線接合部検出部２９ｃ、及び特徴間関係記述部２
９ｄによって構成されている。The feature structuring unit 29 extracts structured features relating to ruled lines from the input ruled line feature set obtained from the feature extracting unit 27 by the procedure described below. As shown in FIG. 10, ruled line grouping processing is performed. Unit 29a, table feature extraction unit 29
b, the ruled line joint detection unit 29c, and the inter-feature relationship description unit 2
It is composed of 9d.

【００６６】まず、罫線グループ化処理部２９ａは、交
差・接続する罫線特徴を一まとめにグループ化する。罫
線グループ化処理アルゴリズムは例えば以下のようにな
る。このアルゴリズムは、交差・接続する罫線には同じ
ラベルを与えることにより罫線をグループ化するもので
ある。First, the ruled line grouping processing unit 29a groups the ruled line features intersecting / connecting into one group. The ruled line grouping processing algorithm is as follows, for example. This algorithm groups ruled lines by giving the same label to the ruled lines that intersect and connect.

【００６７】Ｓｔｅｐ１：ラベル番号を初期化する。Step 1: Initialize the label number.

【００６８】Ｓｔｅｐ２：１本の垂直罫線を選択する。Step 2: Select one vertical ruled line.

【００６９】Ｓｔｅｐ３：ステップ２で選択された当該
垂直罫線に直交する全ての水平罫線を抽出する。Step 3: All horizontal ruled lines orthogonal to the vertical ruled line selected in step 2 are extracted.

【００７０】Ｓｔｅｐ４：当該垂直罫線か当該水平罫線
のいずれかに既にラベルが付与されている場合には、そ
の中で最小値のラベルを当該罫線の全てに付与する。Step 4: If a label is already attached to either the vertical ruled line or the horizontal ruled line, the minimum value label is given to all of the ruled lines.

【００７１】Ｓｔｅｐ５：当該垂直罫線と当該水平罫線
のいずれにもラベルが付与されていない場合には、それ
ら全てに新しいラベル番号を付与し、ラベル番号を更新
する。Step 5: If no label is given to either the vertical ruled line or the horizontal ruled line, new label numbers are given to all of them and the label numbers are updated.

【００７２】Ｓｔｅｐ６：Ｓｔｅｐ２からＳｔｅｐ５を
全ての垂直罫線に適用する。（Ｓｔｅｐ２からＳｔｅｐ
６までを手続きＡとする）Ｓｔｅｐ７：１本の水平罫線を選択する。Step 6: Steps 2 to 5 are applied to all vertical ruled lines. (Step 2 to Step
6 is the procedure A) Step 7: Select one horizontal ruled line.

【００７３】Ｓｔｅｐ８：当該水平罫線に直交する全て
の垂直罫線を抽出する。Step 8: Extract all vertical ruled lines orthogonal to the horizontal ruled line.

【００７４】Ｓｔｅｐ９：当該水平罫線か当該垂直罫線
のいずれかに既にラベルが付与されている場合にはその
中で最小値のラベルを当該罫線の全てに付与する。Step 9: When a label is already attached to either the horizontal ruled line or the vertical ruled line, the minimum value label is given to all the ruled lines.

【００７５】Ｓｔｅｐ10：当該垂直罫線と当該水平罫線
のいずれにもラベルが付与されていない場合には、それ
ら全てに新しいラベル番号を付与し、ラベル番号を更新
する。Step 10: If no label is given to either the vertical ruled line or the horizontal ruled line, new label numbers are given to all of them and the label numbers are updated.

【００７６】Ｓｔｅｐ11：Ｓｔｅｐ7 からＳｔｅｐ10を
全ての垂直罫線に対して適用する（Ｓｔｅｐ7 からＳｔ
ｅｐ11までを手続きＢとする）。Step11: Apply Step7 to Step10 to all vertical ruled lines (Step7 to St)
(Procedure B is up to ep11).

【００７７】Ｓｔｅｐ12：手続きＡと手続きＢをラベル
番号の更新がなくなるまで繰り返す。Step 12: Procedures A and B are repeated until the label number is no longer updated.

【００７８】Ｓｔｅｐ13：同じラベルが付与されている
罫線特徴をグループ化する。Step 13: The ruled line features having the same label are grouped.

【００７９】例えば、図９に示す入力罫線特徴集合に対
して、前述のようなグループ化処理を適用すると以下に
示す２つのグループが得られる。For example, when the above-described grouping process is applied to the input ruled line feature set shown in FIG. 9, the following two groups are obtained.

【００８０】Ｇroup１：（ｉｈｌ₁，ｉｈｌ₂，ｉｈｌ
₃，ｉｈｌ₄，ｉｈｌ₅，ｉｖｌ₁，ｉｖｌ₂，ｉｖｌ
₃，ｉｖｌ₄，ｉｖｌ₅，ｉｖｌ₆），Ｇroup２：（ｉｈｌ6 ，ｉｈｌ7 ，ｉｖｌ7 ，ｉｖｌ8
，ｉｖｌ9 ）表特徴抽出部２９ｂは、罫線グループ化処理部２９ａに
よって得られたグループごとに、グループに含まれる罫
線を内接する矩形（以後、表枠と呼ぶ）を抽出する。さ
らに、表特徴抽出部２９ｂは、表特徴として、例えば左
上端の位置座標、右下端の位置座標、表の縦幅、表の横
幅、重心の位置座標、水平罫線数、当該表に含まれる水
平罫線の集合、垂直罫線数、当該表に含まれる垂直罫線
の集合、当該表に含まれる接合部数、他の表に内接され
ているか否かの情報などを抽出する。この時、水平（垂
直）罫線特徴は、その左上端のｙ（ｘ）座標値の昇順に
ソートされていてもよい。Group1: (ihl ₁ , ihl ₂ , ihl
₃ , ihl ₄ , ihl ₅ , ivl ₁ , ivl ₂ , ivl
₃ , ivl ₄ , ivl ₅ , ivl ₆ ), Group2: (ihl6, ihl7, ivl7, ivl8)
, Ivl9) The table feature extraction unit 29b extracts, for each group obtained by the ruled line grouping processing unit 29a, a rectangle (hereinafter referred to as a table frame) inscribed with the ruled lines included in the group. Further, the table feature extraction unit 29b, as the table feature, for example, the position coordinates of the upper left corner, the position coordinates of the lower right corner, the vertical width of the table, the horizontal width of the table, the positional coordinates of the center of gravity, the number of horizontal ruled lines, the horizontal included in the table. A set of ruled lines, the number of vertical ruled lines, a set of vertical ruled lines included in the table, the number of joints included in the table, and information about whether or not the table is inscribed in another table are extracted. At this time, the horizontal (vertical) ruled line features may be sorted in ascending order of the y (x) coordinate values at the upper left end thereof.

【００８１】ここで接合部数については、後段の罫線接
合部検出部２９ｃで抽出され、表特徴として格納され
る。例えば、Ｇroup１からは、図１１に示すように以下
の表特徴ｉｔ₁が得られる。また、Ｇroup２から得られ
る表特徴をｉｔ2 とする。Here, the number of joints is extracted by the ruled line joint detector 29c in the subsequent stage and stored as a table feature. For example, the following table feature it ₁ is obtained from Group1 as shown in FIG. Also, the table feature obtained from Group2 is set to it2.

【００８２】表特徴：ｉｔ₁ 左上端の位置座標（ｉｔ₁ｘ1 ，ｉｔ₁ｙ1 ）右下端の位置座標（ｉｔ₁ｘ2 ，ｉｔ₁ｙ2 ）表の縦幅ｉｔ₁height 表の横幅ｉｔ₁width 重心の位置座標（ｉｔ₁ｃｘ，ｉｔ₁ｃｙ）水平罫線数ｉｈｌ₁num 当該表に含まれる水平罫線の集合ＩＨＬ₁ 垂直罫線数ｉｖｌ₁num 当該表に含まれる垂直罫線の集合ＩＶＬ₁ 当該表に含まれる接合部数ｉ₁junc-num 他の表に内接されているか否かの情報 nest flag ｉｔ₁height＝ｉｔ₁ｙ2 −ｉｔ₁ｙ1 ＋１，ｉｔ₁width ＝ｉｔ₁ｘ2 −ｉｔ₁ｘ1 ＋１，ｉｔ₁ｃｘ＝（ｉｔ₁ｘ1 ＋ｉｔ₁ｘ2 ）／２，ｉｔ₁ｃｙ＝（ｉｔ₁ｙ1 ＋ｉｔ₁ｙ2 ）／２，ｉｈｌ₁num ＝５，ＩＨＬ₁＝（ｉｈｌ₁，ｉｈｌ₂，ｉｈｌ₃，ｉｈ
ｌ₄，ｉｈｌ₅），ｉｖｌ₁num ＝６，ＩＶＬ₁＝（ｉｖｌ₁，ｉｖｌ₂，ｉｖｌ₃，ｉｖ
ｌ₄，ｉｖｌ₅，ｉｖｌ₆）， nest flag ＝０（すなわち他の表に含まれていない）１枚の入力帳票に複数の表が含まれていることを考慮し
て、さらにページ特徴を抽出、管理する。ページ特徴
は、例えば、表数、表特徴の集合、水平罫線数、垂直罫
線数、及び接合部数によって定義される。Table features: it ₁ top left position coordinates (it ₁ x1, it ₁ y1) bottom right position coordinates (it ₁ x2, it ₁ y2) table vertical width it ₁ height table width it ₁ width centroid Position coordinates (it ₁ cx, it ₁ cy) Number of horizontal ruled lines ihl ₁ num Set of horizontal ruled lines included in the table IHL ₁ Number of vertical ruled lines ivl ₁ num Set of vertical ruled lines included in the table IVL ₁ Included in the table Number of joints i ₁ junc-num Information on whether or not it is inscribed in another table nest flag it ₁ height = it ₁ y2 −it ₁ y1 +1, it ₁ width = it ₁ x2 −it ₁ x1 +1, it ₁ cx = (it ₁ x1 + it ₁ x2) / 2, it ₁ cy = (it ₁ y1 + it ₁ y2) / 2, ihl ₁ num = 5, IHL ₁ = (ihl ₁ , ihl ₂ , ihl ₃ , ih
l ₄ , ihl ₅ ), ivl ₁ num = 6, IVL ₁ = (ivl ₁ , ivl ₂ , ivl ₃ , iv
l ₄ , ivl ₅ , ivl ₆ ), nest flag = 0 (that is, not included in other tables) Considering that one input form contains multiple tables, page features are further extracted. ,to manage. The page feature is defined by, for example, the number of tables, the set of table features, the number of horizontal ruled lines, the number of vertical ruled lines, and the number of joints.

【００８３】例えば、図９の入力罫線集合からは、ペー
ジ特徴：ＩＰとして、表数＝２、表特徴の集合ＩＴ＝（ｉｔ₁，ｉｔ₂）、水平罫線数ｉｈｌ-num＝７、垂直罫線数ｉｖｌ-num＝
９、接合部数ｉjunc-num＝２８、が得られる。For example, from the set of input ruled lines in FIG. 9, as the page feature: IP, the number of tables = 2, the set of table features IT = (it ₁ , it ₂ ), the number of horizontal ruled lines ihl-num = 7, the vertical ruled lines Number ivl-num =
9, the number of junctions ijunc-num = 28 is obtained.

【００８４】次に、罫線接合部検出部２９ｃは、各表に
おいて、そこに含まれる水平罫線と垂直罫線の交差部分
・接続部分（以後、接合部と呼ぶ）を求め、さらに各表
特徴で接合部の個数を管理する。罫線接合部検出部２９
ｃの動作手順は例えば次のようになる。Next, the ruled line joining portion detecting unit 29c finds the intersections / connections (hereinafter referred to as joining portions) of the horizontal ruled lines and vertical ruled lines contained therein in each table, and joins them according to the characteristics of each table. Manage the number of copies. Ruled line joint detection unit 29
The operation procedure of c is as follows, for example.

【００８５】Ｓｔｅｐ１：一本の水平罫線を選択する。Step 1: Select one horizontal ruled line.

【００８６】Ｓｔｅｐ２：着目した水平罫線に交差・接
続する全ての垂直罫線を抽出する。Step 2: Extract all vertical ruled lines that intersect / connect to the focused horizontal ruled line.

【００８７】Ｓｔｅｐ３：当該水平罫線と当該垂直罫線
の交点を求め、水平罫線ごとにその座標値を管理する。
各水平罫線では接合部特徴はｘ座標値の昇順にソートし
ておく。Step 3: The intersection between the horizontal ruled line and the vertical ruled line is obtained, and the coordinate value is managed for each horizontal ruled line.
In each horizontal ruled line, the joint features are sorted in ascending order of the x coordinate value.

【００８８】Ｓｔｅｐ４：Ｓｔｅｐ1 からＳｔｅｐ3 ま
でを全ての水平罫線に対して実施する。Step 4: Steps 1 to 3 are executed for all horizontal ruled lines.

【００８９】Ｓｔｅｐ５：一本の垂直罫線を選択する。Step 5: Select one vertical ruled line.

【００９０】Ｓｔｅｐ６：着目した垂直罫線に交差・接
続する全ての水平罫線を抽出する。Step 6: Extract all horizontal ruled lines that intersect and connect with the vertical ruled line of interest.

【００９１】Ｓｔｅｐ７：当該垂直罫線と当該水平罫線
の交点を求め、垂直罫線ごとにその座標値を管理する。
各垂直罫線では接合部特徴はｙ座標値の昇順にソートし
ておく。Step 7: The intersection of the vertical ruled line and the horizontal ruled line is obtained, and the coordinate value is managed for each vertical ruled line.
In each vertical ruled line, the joint features are sorted in ascending order of the y coordinate value.

【００９２】Ｓｔｅｐ８：Ｓｔｅｐ５からＳｔｅｐ７ま
でを全ての垂直罫線に対して実施する。Step 8: Steps 5 to 7 are executed for all vertical ruled lines.

【００９３】例えば、表特徴ｉｔ₁では、水平罫線集合
ＩＨＬ₁と、垂直罫線集合ＩＶＬ₂から、図１２に示す
接合部が得られ、その接合部数「２２」を表特徴に加え
る（ｉ₁junc-num＝２２とする）。For example, in the table feature it ₁ , the joint shown in FIG. 12 is obtained from the horizontal ruled line set IHL ₁ and the vertical ruled line set IVL ₂ , and the number of joints “22” is added to the table feature (i ₁ junc -num = 22).

【００９４】特徴間関係記述部２９ｄは、以上の処理結
果より得られた情報を、例えば図１３のように関係づけ
て管理する。この結果、ページ特徴から表特徴、罫線特
徴、接合特徴と階層的に関連づけられて管理され、特徴
に関する情報を効率的に検索できるようになる。The inter-feature relation description part 29d manages the information obtained from the above processing results in relation to each other as shown in FIG. As a result, the page features are managed by being hierarchically associated with the table features, the ruled line features, and the joining features, and the information about the features can be efficiently searched.

【００９５】以後、これらの特徴を総称して構造化特徴
と呼ぶ。また、入力文書から抽出された構造化特徴を入
力構造化特徴、モデル文書から抽出された構造化特徴を
モデル構造化特徴と呼ぶ。Hereinafter, these features will be collectively referred to as structured features. In addition, the structured features extracted from the input document are called input structured features, and the structured features extracted from the model document are called model structured features.

【００９６】一方、処理対象文書に対する処理とは別
に、モデル登録部３１は、オペレータによって提示され
た処理対象文書の種類ごとの一例であるサンプル文書を
もとに、オペレータとの間の対話的入力作業によりモデ
ルの登録を行なう。ここでは、モデル登録作業時のモデ
ル登録部３１の動作の一例について説明する。On the other hand, in addition to the processing on the processing target document, the model registration unit 31 interactively inputs the operator based on the sample document presented by the operator, which is an example for each type of the processing target document. The model is registered by work. Here, an example of the operation of the model registration unit 31 during the model registration work will be described.

【００９７】まず、画像入力部２１を介して登録対象の
サンプル文書を入力し、２値化処理部２３、前処理部２
５を経て文書画像（以後、モデル画像と呼ぶ）を取得す
る。次いで、特徴抽出部２７においてモデル画像から罫
線特徴を抽出する。First, the sample document to be registered is input via the image input unit 21, and the binarization processing unit 23 and the preprocessing unit 2 are input.
A document image (hereinafter referred to as a model image) is acquired via step 5. Next, the feature extraction unit 27 extracts ruled line features from the model image.

【００９８】モデル登録部３１は、特徴抽出部２７にお
ける抽出結果に対して特徴抽出処理の誤りを推定し、そ
れを修正する旨のメッセージを、表示部３１ａのディス
プレイの画面上に表示して、修正作業をオペレータに指
示する。The model registration unit 31 estimates the error of the feature extraction processing with respect to the extraction result of the feature extraction unit 27 and displays a message to the effect that the error is corrected on the screen of the display of the display unit 31a. Instruct the operator to make corrections.

【００９９】特徴抽出処理の誤りの推定方式は、例え
ば、端点が接合部となっていない罫線を見つけて、その
罫線が正しく抽出されていないとみなすことにより実現
できる。モデル登録部３１は、表示部３１ａに表示され
た指示に従いオペレータによって画面上で修正された罫
線特徴を入力部３１ｂを介して入力する。The error estimation method of the feature extraction processing can be realized by, for example, finding a ruled line whose end point is not a joint portion and assuming that the ruled line is not correctly extracted. The model registration unit 31 inputs, through the input unit 31b, the ruled line feature corrected on the screen by the operator according to the instruction displayed on the display unit 31a.

【０１００】修正作業が完了すると、修正された罫線特
徴は特徴構造化部２９に送られ、構造化特徴が抽出され
る。この構造化特徴は、未知入力文書を処理する際に、
フォーマット種別同定部３３、モデル照合部３５、未対
応・矛盾対応発見修正部３９、照合結果判定部３７など
で用いられる。When the modification work is completed, the modified ruled line feature is sent to the feature structuring unit 29, and the structured feature is extracted. This structured feature is useful when processing unknown input documents.
It is used by the format type identifying unit 33, the model matching unit 35, the unsupported / contradiction correspondence finding / correction unit 39, the matching result determination unit 37, and the like.

【０１０１】抽出・修正された罫線特徴は、画面上にモ
デル画像と重ねて表示される。オペレータは、書式構造
情報登録部３１が画面上に出力するメッセージに従いな
がら、モデル文書に対応付くべき入力文書画像を処理す
るために必要な知識を構築していく。The extracted / corrected ruled line features are displayed on the screen so as to be superimposed on the model image. The operator builds the knowledge necessary for processing the input document image to be associated with the model document while following the message output by the format structure information registration unit 31 on the screen.

【０１０２】この結果、モデルとして構造化特徴の情
報、文字認識対象領域の指定に関する情報、文字認識対
象領域の文字列方向の情報、文字読み取り処理時に拘束
条件として働く文字種情報、筆記形態情報、文字認識後
処理のための項目属性情報、文字認識結果の対応関係な
どの情報が知識ベース３１ｃに格納される。As a result, the structured feature information as the model, the information regarding the designation of the character recognition target area, the information about the character string direction of the character recognition target area, the character type information serving as a constraint condition during the character reading process, the writing form information, and the character Information such as item attribute information for post-recognition processing and correspondence between character recognition results is stored in the knowledge base 31c.

【０１０３】このうち構造化特徴は、正立した画像に対
して作られている。そこで、モデル登録部３１は、知識
ベース３１ｃへの登録時に、左に９０度回転したものと
右に９０度回転したものと１８０度回転したものをそれ
ぞれ発生させ、さらに何度回転されているかを示す情報
を付加するようにしてもよい。Of these, structured features are created for upright images. Therefore, the model registration unit 31 generates one rotated 90 degrees to the left, one rotated 90 degrees to the right, and one rotated 180 degrees at the time of registration in the knowledge base 31c. You may make it add the information shown.

【０１０４】これら４方向の構造化特徴と入力構造化特
徴は、後段のモデル照合部３５において、それぞれの方
向の特徴と入力画像とを照合させることにより、入力文
書の文書方向が未知であっても、モデルマッチングで最
良マッチングした構造化特徴に付与されている回転角度
を知ることができる。従って、その角度を０度にする方
向に入力画像を回転させれば、必ず正立した入力画像を
得ることができる。In the model collating unit 35 in the subsequent stage, the structured feature in the four directions and the input structured feature are collated with the feature in each direction and the input image, and the document direction of the input document is unknown. Also, it is possible to know the rotation angle given to the structured feature best matched by the model matching. Therefore, if the input image is rotated in the direction in which the angle is set to 0 degree, an upright input image can be obtained without fail.

【０１０５】モデル登録部３１は、以上の処理を全ての
モデル文書に対して行なう。ここでは、一例として図１
４と図１５に示す２種類の構造化特徴がモデルとして登
録されたものとする（以後、図１４をモデル１、図１５
をモデル２と呼ぶ）。The model registration unit 31 performs the above processing on all model documents. Here, as an example, FIG.
It is assumed that two types of structured features shown in FIG. 4 and FIG. 15 are registered as models (hereinafter, FIG. 14 is model 1, FIG.
Is called model 2).

【０１０６】フォーマット種別同定部３３は、モデル登
録部３１によって予め登録されているモデルから、入力
文書に対応するもの（あるいは対応する可能性のあるも
の）を選出する。The format type identifying unit 33 selects a model corresponding to (or possibly corresponding to) the input document from the models registered in advance by the model registering unit 31.

【０１０７】ここでは、入力文書の構造化特徴と登録さ
れている全てのモデルの構造化特徴との間で類似度計算
を行ない、最も類似度の高いモデルあるいは、ある一定
の値以上の類似度を有するモデルの構造化特徴を後段の
モデル照合部３５に送り込む。Here, the similarity is calculated between the structured features of the input document and the structured features of all the registered models, and the model with the highest similarity or the similarity of a certain value or more is calculated. The structured features of the model having the above are sent to the model matching unit 35 in the subsequent stage.

【０１０８】類似度計算に用いる特徴としては、例えば
文書中に含まれる、特徴構造化２９によって抽出された
接合部数を用いる。接合部数という特徴は、未知入力文
書が寸法の拡大・縮小、さらには各表が独立してスケー
ル変換されている場合にも、影響を受けない特徴である
ので書式構造種別同定のための類似度計算に適してい
る。As the feature used for the similarity calculation, for example, the number of joints extracted by the feature structuring 29 included in the document is used. The number of joints is a feature that is not affected even when the unknown input document is scaled up or down, and when each table is scale-converted independently. Suitable for calculation.

【０１０９】この他に、文書中に含まれる水平罫線特徴
数や垂直罫線特徴数、表特徴数なども類似度計算のため
の特徴として有効である。ここでモデル文書は適宜に順
序づけがなされているとする。Besides this, the number of horizontal ruled line features, the number of vertical ruled line features, the number of table features, etc. included in the document are also effective as features for similarity calculation. Here, it is assumed that the model documents are appropriately ordered.

【０１１０】また、類似度は以下のようにして求まる。
ここで、モデルの総数：model-num、入力文書中の接合
部数：ｉｊｎ、ｋ番目のモデル文書中の接合部数：ｍｊ
ｎ_k（ただし１≦ｋ≦model-num ）、入力文書とｋ番目
のモデル文書との類似度：ｓｉｍ_k（ただし− 100≦ｓ
ｉｍ_k≦ 100）とすると、類似度は例えば、ｉｊｎ＞０またはｍｊｎ_k＞０のときｓｉｍ_k＝ 100− 200×｜ｉｊｎ−ｍｊｎ_k｜／（ｉｊ
ｎ＋ｍｊｎ_k）ｉｊｎ＝０かつｍｊｎ_k＝０のときｓｉｍ_k＝−100 により求まる。The degree of similarity can be obtained as follows.
Here, the total number of models: model-num, the number of joints in the input document: ijn, the number of joints in the kth model document: mj
n _k (where 1 ≦ k ≦ model-num), the similarity between the input document and the kth model document: sim _k (where −100 ≦ s
Im _k ≦ 100), the similarity is, for example, when ijn> 0 or mjn _k > 0 sim _k = 100−200 × | ijn−mjn _k | / (ij
n + mjn _k ) ijn = 0 and mjn _k = 0, it is obtained by sim _k = −100.

【０１１１】得られた類似度のうち類似度が最大である
もの、あるいは予め設定されたしきい値th5 以上のもの
を候補のモデルとして入力文書の構造化特徴とともにモ
デル照合部３５に送る。このとき、全てのモデルにおい
て類似度がしきい値th5 未満である場合、以後の処理を
中断して、入力文書を棄却してもよいし、また、全ての
モデルの構造化特徴をモデル照合部に送り込むようにし
てもよい。Among the obtained similarities, the one having the maximum similarity or one having a preset threshold value th5 or more is sent to the model matching unit 35 as a candidate model together with the structured feature of the input document. At this time, if the similarity is less than the threshold value th5 in all models, the subsequent processing may be interrupted and the input document may be rejected. You may send it to.

【０１１２】例えば、登録されているモデル文書が図１
４と図１５の２種類である場合、入力文書とそれら２つ
のモデル文書との類似度計算は以下のようになる。ここ
で入力文書の接合部数：ｉｊｎ＝28、モデル１の接合部
数：ｍｊｎ₁＝31、モデル２の接合部数：ｍｊｎ₂＝26
とし、入力文書とモデル１との類似度をｓｉｍ₁、入力
文書とモデル２との類似度をｓｉｍ₂、しきい値：th5
＝80とする。For example, the registered model document is shown in FIG.
4 and FIG. 15, the similarity calculation between the input document and these two model documents is as follows. Here, the number of joints in the input document: ijn = 28, the number of joints in the model 1: mjn ₁ = 31, the number of joints in the model 2: mjn ₂ = 26
, The similarity between the input document and the model 1 is sim ₁ , the similarity between the input document and the model 2 is sim ₂ , and the threshold value is th5.
= 80.

【０１１３】ｓｉｍ₁＝ 100− 200×｜28−31｜／（28＋31）＝89.83 ｓｉｍ₂＝ 100− 200×｜28−26｜／（28＋26）＝92.59 以上の計算結果からｓｉｍ1 とｓｉｍ2 が共にしきい値
th5 を越えているので、双方とも入力文書に対応付く可
能性のあるモデルとしてそれらの構造化特徴を後段のモ
デル照合部３５に送る。Sim ₁ = 100−200 × | 28−31 | / (28 + 31) = 89.83 sim ₂ = 100−200 × | 28−26 | / (28 + 26) = 92.59 From the above calculation results, sim1 and sim2 are combined. Threshold
Since th5 is exceeded, both of them send their structured features to the model matching unit 35 in the subsequent stage as models that may correspond to the input document.

【０１１４】モデル照合部３５は、入力文書から抽出さ
れた構造化特徴（以後、入力構造化特徴と呼ぶ）と、書
式構造種別同定部で選出された一つあるいは複数個のモ
デル文書の構造化特徴（以後、モデル構造化特徴と呼
ぶ）を受け取り、それらの間で以下に述べる照合処理を
行ない、照合度を計算する。The model matching unit 35 structures the structured features extracted from the input document (hereinafter referred to as input structured features) and one or more model documents selected by the format structure type identifying unit. The features (hereinafter referred to as model structured features) are received, and the matching process described below is performed between them to calculate the matching degree.

【０１１５】照合処理は、当該入力構造化特徴と当該モ
デル構造化特徴との間の対応付け処理のことであり、照
合度はその対応付きの程度を表す尺度である。ここで
は、最も高い照合度を示すモデル文書が入力文書に対応
するものであるとし、入力構造化特徴とそのモデル構造
化特徴との間の対応関係に関する情報を、後段の照合結
果判定部に送り込む。最大照合度を示すモデル文書と入
力文書との間の対応関係が獲得されているか否かの最終
的な判断は照合結果判定部で行なわれる。The matching process is a matching process between the input structured feature and the model structured feature, and the matching degree is a scale indicating the degree of correspondence. Here, it is assumed that the model document having the highest matching degree corresponds to the input document, and information regarding the correspondence between the input structured feature and the model structured feature is sent to the matching result determination unit in the subsequent stage. . The collation result judging unit makes a final judgment as to whether or not the correspondence between the model document showing the maximum matching degree and the input document is acquired.

【０１１６】特徴構造化部２９で得られた構造化特徴
は、図１３に示すように、ページ特徴から表特徴へ、さ
らに表特徴から罫線特徴へと全体−部分という階層的な
関係が抽出されて構造化されている。この特質を用いる
場合、照合処理は階層的に実施される。これに対応し
て、モデル照合部３５は、さらに図１６に示すように、
選択部３５ａ、表照合部３５ｂ、罫線照合部３５ｃ、照
合度計算部３５ｄ、及び照合結果出力部３５ｅによって
構成される。As shown in FIG. 13, the structured features obtained by the feature structuring unit 29 have a hierarchical relationship of whole-part extracted from page features to table features, and from table features to ruled line features. Is structured. When using this feature, the matching process is performed hierarchically. In response to this, the model matching unit 35 further, as shown in FIG.
The selection unit 35a, the table matching unit 35b, the ruled line matching unit 35c, the matching degree calculation unit 35d, and the matching result output unit 35e.

【０１１７】まず、選択部３５ａは、複数個のモデルの
構造化特徴から一つのモデルの構造化特徴を選択し、入
力文書の構造化特徴とともに表照合部３５ｂに送る。First, the selecting unit 35a selects the structured feature of one model from the structured features of a plurality of models and sends it to the table collating unit 35b together with the structured feature of the input document.

【０１１８】次いで、表照合部３５ｂは、入力文書の表
特徴の集合（以後、入力表特徴集合と呼ぶ）とモデル文
書の表特徴の集合（以後、モデル表特徴集合と呼ぶ）と
の間で対応付けを行なう。Next, the table collating unit 35b performs processing between the set of table features of the input document (hereinafter referred to as the input table feature set) and the set of table features of the model document (hereinafter referred to as the model table feature set). Correspond.

【０１１９】表間対応がとれた場合には、さらに罫線照
合部３５ｃは、入力文書の当該表の内部に含まれる罫線
特徴（以後、入力罫線特徴と呼ぶ）とモデル文書の当該
表の内部に含まれる罫線特徴（以後、モデル罫線特徴と
呼ぶ）との間で対応付けを行なう。When the correspondence between the tables is established, the ruled line collating unit 35c further determines whether the ruled line features included in the table of the input document (hereinafter referred to as the input ruled line feature) and the table of the model document are included. Correspondence is performed between the included ruled line features (hereinafter referred to as model ruled line features).

【０１２０】罫線間の対応がとれた場合には、照合度計
算部３５ｄは、（入力文書と当該モデル文書との間の）
照合度を計算する。ここで、表間対応と罫線間対応のど
ちらも得られなかった場合には、照合度に最低値（−10
0 ）を代入する。When the ruled lines are matched with each other, the matching degree calculation unit 35d determines (between the input document and the model document).
Calculate the degree of matching. If neither table correspondence nor ruled line correspondence is obtained, the minimum matching degree (-10
0) is substituted.

【０１２１】照合結果出力部３５ｅは、それまでに計算
されている最も高い照合度を示すモデル文書と入力文書
の組を選び、その構造化特徴間の対応関係に関する情報
を後段の照合結果判定部３７に出力する。The collation result output unit 35e selects a set of the model document and the input document showing the highest collation degree calculated up to that time, and provides information on the correspondence between the structured features in the collation result determination unit in the subsequent stage. Output to 37.

【０１２２】以下では、フォーマット種別同定部３３か
ら図１４と図１５の２種類のモデルと入力文書の構造化
特徴がそれぞれ送られてきた場合のモデル照合部３５の
処理動作例について説明する。In the following, an example of the processing operation of the model collating unit 35 when the two types of models of FIGS. 14 and 15 and the structured feature of the input document are respectively sent from the format type identifying unit 33 will be described.

【０１２３】２種類のモデルのうち、選択部３５ａによ
ってモデル１が選ばれ、入力文書とともにそれぞれの構
造化特徴が表照合部３５ｂに送り込まれたとする。ここ
で、入力文書の表特徴数をｉｔ-num＝２とし、表特徴集
合をＩＴとすると、ＩＴ＝（ｉｔ₁，ｉｔ₂）（図９参
照）モデル１の表特徴数をｍｔ1-num ＝２とし、表特徴
集合をＭＴ１とすると、ＭＴ１＝（ｍｔ１₁，ｍｔ
１₂）（図１４参照）とする。It is assumed that, out of the two types of models, Model 1 is selected by the selection unit 35a and the structured features thereof are sent to the table matching unit 35b together with the input document. Here, if the number of table features of the input document is it-num = 2 and the table feature set is IT, IT = (it ₁ , it ₂ ) (see FIG. 9) mt 1-num = 2 and the table feature set is MT1, MT1 = (mt1 ₁ , mt
1 ₂ ) (see FIG. 14).

【０１２４】表照合部３５ｂは、さらに図１７に示すよ
うに、対応可能ペア検出部３５ｂ−１、異種対応可能ペ
ア間両立関係判定部３５ｂ−２、及び最良マッチング抽
出部３５ｂ−３によって構成されており、次のように動
作する。As shown in FIG. 17, the table collating unit 35b further includes a compatible pair detecting unit 35b-1, a heterogeneous compatible inter-pair compatibility determining unit 35b-2, and a best matching extracting unit 35b-3. And operates as follows.

【０１２５】まず、対応可能ペア検出部３５ｂ−１は、
選択部３５ａで選択されたモデルの表特徴集合の各要素
に対して、それと対応付く可能性のある入力の表特徴を
すべて検出し、対応可能ペアとして管理する。すなわ
ち、Ｓｔｅｐ１：任意のＭＴから任意の一つの表特徴ｍｔ_k
（着目モデルの表特徴集合のうちｋ番目の表特徴）を選
ぶ。First, the available pair detection unit 35b-1
For each element of the table feature set of the model selected by the selection unit 35a, all the input table features that may be associated with it are detected and managed as a pair that can be associated. That is, Step 1: any MT to any one table feature mt _k
(Kth table feature of the table feature set of the model of interest) is selected.

【０１２６】Ｓｔｅｐ２：表特徴ｍｔ_kの接合部数とＩ
Ｔの各表特徴の接合部数との間で類似度を計算をする。Step 2: The number of joints of the table feature mt _k and I
Similarity is calculated between the number of joints of each table feature of T.

【０１２７】Ｓｔｅｐ３：例えばｍｔ_kとｉｔ_j（入力
表特徴集合のｊ番目の表特徴）の類似度ｓｉｍ_kjが、あ
らかじめ設定したしきい値th6 以上である場合には、対
応可能であるとして、その組（ｍｔ_k，ｉｔ_j）を対応
可能ペアとして保持する。Step 3: For example, if the similarity sim _kj between mt _k and it _j (the j-th table feature of the input table feature set) is equal to or greater than a preset threshold th6, it is possible to handle it. The set (mt _k , it _j ) is held as a compatible pair.

【０１２８】Ｓｔｅｐ４：Ｓｔｅｐ３を表特徴集合ＩＴ
のすべての要素に対して適用する。Step4: Step3 is a table feature set IT
Applies to all elements of.

【０１２９】Ｓｔｅｐ５：Ｓｔｅｐ１〜Ｓｔｅｐ４まで
を着目ＭＴのすべての要素に対して適用する。Step 5: Steps 1 to 4 are applied to all the elements of the MT of interest.

【０１３０】ここで、ｉｔj の接合部数をｉｊｎ_j、ｍ
ｔ_kの接合部数をｍｊｎ_kとすると、類似度は、例えば
次の式で求まるとする。Here, the number of junctions of itj is ijn _j , m
Assuming that the number of joints of t _k is mjn _k , it is assumed that the degree of similarity is obtained by the following equation, for example.

【０１３１】ｉｊｎ_j＞０またはｍｊｎ_j＞０のとき、ｓｉｍ_kj＝ 100− 200×｜ｉｊｎ_j−ｍｊｎ_k｜／（ｉ
ｊｎ_j＋ｍｊｎ_k）ｉｊｎ_j＝０かつｍｊｎ_k＝０のとき、ｓｉｍ_kj＝−100 上記動作をＩＴとＭＴ１を例として具体的に説明すると
以下のようになる。th6 ＝75、ｉｔ₁の接合部数＝22、
ｉｔ₂の接合部数＝６、ｍｔ１₁の接合部数＝25、ｍｔ
１₂の接合部数＝６とすると式から、ｍｔ１₁とｉｔ₁の間の類似度：ｓｉｍ１₁₁＝86.67 と
なり、ｓｉｍ１₁₁＞th6 であるから、対応可能であると
して、その組（ｍｔ１₁，ｉｔ₁）を対応可能ペアとし
て保持する。When ijn _j > 0 or mjn _j > 0, sim _kj = 100-200 × | ijn _j −mjn _k | / (i
jn _j + mjn _k ) ijn _j = 0 and mjn _k = 0, sim _kj = −100 The above operation will be described in detail using IT and MT1 as examples. th6 = 75, number of junctions of it ₁ = 22,
Number of joints of it ₂ = 6, number of joints of mt1 ₁ = 25, mt
If the number of joints of 1 ₂ = 6, from the formula, the similarity between mt1 ₁ and it ₁ is sim1 ₁₁ = 86.67, and sim1 ₁₁ > th6. Therefore, it is considered that the pair (mt1 ₁ , it) ₁ ) is held as a compatible pair.

【０１３２】ｍｔ１₁とｉｔ₂の間の類似度：ｓｉｍ１
₁₂＝−22.58 となり、ｓｉｍ１₁₂＜th6 であるから、対
応不可能であるとする。Similarity between mt1 ₁ and it ₂ : sim1
_{Since 12} = −22.58, and sim1 ₁₂ <th6, it cannot be handled.

【０１３３】ｍｔ１₂とｉｔ₁の間の類似度：ｓｉｍ１
₂₁＝−14.28 となり、ｓｉｍ１₂₁＜th6 であるから、対
応可能であるとする。Similarity between mt1 ₂ and it ₁ : sim1
_{Since 21} = -14.28, and sim1 ₂₁ <th6, it is possible to deal with it.

【０１３４】ｍｔ１₂とｉｔ₂の間の類似度：ｓｉｍ１
₂₂＝ 100となり、ｓｉｍ１₂₂＞th6であるから、その組
（ｍｔ１₂，ｉｔ₂）を対応可能ペアとして保持する。Similarity between mt1 ₂ and it ₂ : sim1
_{Since 22} = 100 and sim1 ₂₂ > th6, the set (mt1 ₂ , it ₂ ) is held as a compatible pair.

【０１３５】以上の結果、対応可能ペアとして、ＩＴと
ＭＴ１との間では、p1＝（ｍｔ１₁，ｉｔ₁）とp2＝
（ｍｔ１₂，ｉｔ₂）が検出された。As a result, as a compatible pair, p1 = (mt1 ₁ , it ₁ ) and p2 = between IT and MT1.
(Mt1 ₂ , it ₂ ) was detected.

【０１３６】次に、異種対応可能ペア間両立関係判定部
３５ｂ−２は、当該モデル文書と入力文書の間で、対応
可能ペア検出部３５ｂ−１で検出された２つの異なる対
応可能ペアが両立するものであるか否かを判定する。Next, the heterogeneous compatible pair compatibility relationship determining unit 35b-2 is compatible with the two different compatible pairs detected by the compatible pair detecting unit 35b-1 between the model document and the input document. It is determined whether or not to do.

【０１３７】ここでいう「両立する」ということは、２
つの対応可能ペアが同時に存在することに矛盾が無いこ
とを意味する。ここでの判定条件としては以下のものが
上げられる。判定対象となる対応可能ペアをそれぞれ
（ｍｔ_k，ｉｔ_j）、（ｍｔ′_k，ｉｔ′_j）とする。
ここで、ｍｔ_kの左上端の位置座標を（ｍｔ_kｘ1 ，ｍ
ｔ_kｙ1 ）、右下端の位置座標を（ｍｔ_kｘ2 ，ｍｔ_k
ｙ2 ）、ｍｔ′_kの左上端の位置座標を（ｍｔ′_kｘ1
，ｍｔ′_kｙ1 ）、右下端の位置座標を（ｍｔ′_kｘ2
，ｍｔ′_kｙ2 ）、ｉｔ_jの左上端の位置座標を（ｉ
ｔ_jｘ1 ，ｉｔ_jｙ1 ）、右下端の位置座標を（ｉｔ_j
ｘ2 ，ｉｔ_jｙ2 ）、ｉｔ′_jの左上端の位置座標を
（ｉｔ′_jｘ1 ，ｉｔ′_jｙ1 ）、右下端の位置座標を
（ｉｔ′_jｘ2 ，ｉｔ′_jｙ2 ）とする。"Compatible" here means 2
It means that there is no contradiction that two corresponding pairs exist at the same time. The determination conditions here include the following. Corresponding pairs to be judged are (mt _k , it _j ) and (mt ′ _k , it ′ _j ), respectively.
Here, the position coordinates of the upper left point of the mt _{_k} (mt _k x1, m
t _k y1), and the position coordinates of the lower right corner are (mt _k x2, mt _k
y2), 'the position coordinates of the upper left point of the _{_k} (mt' mt _k x1
, Mt ′ _k y1) and the position coordinates of the lower right corner are (mt ′ _k x2
, Mt ′ _k y2), and the position coordinate of the upper left end of it _j is (i
t _j x1, it _j y1), and the position coordinates of the lower right corner are (it _j
_{x2, it j y2), it} 'j of the coordinates of the upper left end (it' _j x1, it is' _j y1), the position coordinates of the lower right (it 'and _{_{j x2, it' j y2)}} .

【０１３８】また、文書画像の座標系（ｘ，ｙ）：０＜
ｘ≦ＷＩＤＴＨ，０＜ｙ≦ＨＥＩＧＨＴにおいて、図１
８に示すような座標系を定義する。すなわち、図１８
（ａ）中に示す矩形領域Ｔ：（左上端の位置座標を（ｔ
ｘ1 ，ｔｙ1 ）、右下端の位置座標を（ｔｘ2 ，ｔｙ2
）とする）に対して、図１８（ｂ）に示す０＜ｘ＜ｔ
ｘ1 かつ０＜ｙ≦ＨＥＩＧＨＴを満たす領域を領域１、
図１８（ｃ）に示すｔｘ2＜ｘ≦ＷＩＤＴＨかつ０＜ｙ
≦ＨＥＩＧＨＴを満たす領域を領域２、図１８（ｄ）に
示す０＜ｘ≦ＷＩＤＴＨかつ０＜ｙ＜ｔｙ1 を満たす領
域を領域３、図１８（ｅ）に示す０＜ｘ≦ＷＩＤＴＨか
つｔｙ2 ＜ｙ≦ＨＥＩＧＨＴを満たす領域を領域４と定
義する。The coordinate system (x, y) of the document image: 0 <
In the case of x ≦ WIDTH and 0 <y ≦ HEIGHT, FIG.
A coordinate system as shown in 8 is defined. That is, in FIG.
Rectangular area T shown in (a): (The position coordinate of the upper left corner is (t
x1, ty1) and the position coordinates of the lower right corner are (tx2, ty2)
) And 0 <x <t shown in FIG.
The region where x1 and 0 <y≤HEIGHT is satisfied is region 1,
18 (c), tx2 <x ≦ WIDTH and 0 <y
The region satisfying ≤HEIGHT is region 2, the region satisfying 0 <x≤WIDTH and 0 <y <ty1 shown in FIG. 18D is region 3, and the region satisfying 0 <x≤WIDTH and ty2 <y shown in FIG. 18E. A region that satisfies ≦ HEIGHT is defined as a region 4.

【０１３９】また、判定条件として、以下の条件１から
条件３までのすべての条件を満たさない場合のみ、２つ
の対応可能ペアが両立可能であると見なす。Also, it is considered that two compatible pairs are compatible only when all the following conditions 1 to 3 are not satisfied as the determination conditions.

【０１４０】条件１：ｍｔ_k＝ｍｔ′_k、条件２：ｉｔ_j＝ｉｔ′_j、条件３：配置関係に逆転があること。条件３は、以下の
条件３−１〜３−４の条件を満たす場合である。すなわ
ち、条件３−１：ｉｔ_j（ｉｔ′_j）に対して、ｉｔ′
_j（ｉｔ_j）が領域１にあり、ｍｔ_k（ｍｔ′_k）に対
して、ｍｔ′_k（ｍｔ_k）が領域２にある。Condition 1: mt _k = mt ' _k , Condition 2: it _j = it' _j , Condition 3: There is an inversion in the arrangement relationship. Condition 3 is a case where the following conditions 3-1 to 3-4 are satisfied. That is, condition 3-1: it ′ for it _j (it ′ _j )
_j (it _j ) is in region 1 and mt ' _k (mt _k ) is in region 2 for mt _k (mt' _k ).

【０１４１】条件３−２：ｉｔ_j（ｉｔ′_j）に対し
て、ｉｔ′_j（ｉｔ_j）が領域２にあり、ｍｔ_k（ｍ
ｔ′_k）に対して、ｍｔ′_k（ｍｔ_k）が領域１にあ
る。Condition 3-2: For it _j (it ' _j ), it' _j (it _j ) is in the region 2 and mt _k (m
For t ′ _k ), mt ′ _k (mt _k ) is in region 1.

【０１４２】条件３−３：ｉｔ_j（ｉｔ′_j）に対し
て、ｉｔ′_j（ｉｔ_j）が領域３にあり、ｍｔ_k（ｍ
ｔ′_k）に対して、ｍｔ′_k（ｍｔ_k）が領域４にあ
る。Condition 3-3: For it _j (it ' _j ), it' _j (it _j ) is in the region 3, and mt _k (m
'to _{_{k), mt' t k (}} mt k) is in the region 4.

【０１４３】条件３−４：ｉｔ_j（ｉｔ′_j）に対し
て、ｉｔ′_j（ｉｔ_j）が領域４にあり、ｍｔ_k（ｍ
ｔ′_k）に対して、ｍｔ′_k（ｍｔ_k）が領域３にあ
る。Condition 3-4: For it _j (it ' _j ), it' _j (it _j ) is in the region 4, and mt _k (m
'to _{_{k), mt' t k (}} mt k) is in the region 3.

【０１４４】異種対応可能ペア間両立関係判定部３５ｂ
−２の動作を、前述した対応可能ペアp1とp2を用いて具
体的に説明する。ＩＴとＭＴ１の間における２つの異な
る対応可能ペアの組として（p1，p2）がある。このペア
は条件１から条件３までのすべてを満たさないので両立
可能であると判断される。異種対応可能ペア間両立関係
判定部３５ｂ−２は、両立可能と判断された対応可能ペ
アを、最良マッチング抽出部３５ｂ−３に出力する。Different type compatible compatibility determination unit 35b
The operation -2 will be specifically described by using the compatible pairs p1 and p2 described above. There is (p1, p2) as a set of two different compatible pairs between IT and MT1. Since this pair does not satisfy all of the conditions 1 to 3, it is judged that they are compatible with each other. The heterogeneous compatible pair compatibility relationship determination unit 35b-2 outputs the compatible pair determined to be compatible to the best matching extraction unit 35b-3.

【０１４５】最良マッチング抽出部３５ｂ−３は、異種
対応可能ペア間両立関係判定部３５ｂ−２において両立
すると判定された対応可能ペアのうち、すべてが互いに
両立可能な対応可能ペアの最大の集合を求めることによ
り、入力表特徴集合とモデル表特徴集合間の最良マッチ
ングを抽出する。The best matching extraction unit 35b-3 selects the maximum set of compatible pairs that are compatible with each other among the compatible pairs determined to be compatible by the heterogeneous compatible pair compatibility relationship determination unit 35b-2. By finding, the best matching between the input table feature set and the model table feature set is extracted.

【０１４６】最良マッチング抽出部３５ｂ−３は、さら
に図１９に示すように、連合グラフ作成部３５ｂ−３ａ
と最大クリーク抽出部３５ｂ−３ｂにより構成されてい
る。連合グラフ作成部３５ｂ−３ａは、異種対応可能ペ
ア間両立関係判定部３５ｂ−２の判定結果を受け取り、
その情報をもとに連合グラフを作成する。この連合グラ
フの節点は対応可能ペアを示し、節点間を結ぶ弧は２つ
の対応可能ペアが両立可能であることを示すものであ
り、異種対応可能ペア間両立関係判定部３５ｂ−２の判
定結果を表す補助的なデータ構造である。The best matching extraction unit 35b-3, as shown in FIG. 19, further includes an association graph creation unit 35b-3a.
And the maximum clique extraction unit 35b-3b. The association graph creation unit 35b-3a receives the determination result of the inter-pair compatible compatibility determination unit 35b-2,
An association graph is created based on the information. The nodes of this union graph indicate compatible pairs, and the arc connecting the nodes indicates that two compatible pairs are compatible, and the determination result of the compatibility compatible pair determining unit 35b-2 for different types is possible. Is an auxiliary data structure that represents.

【０１４７】具体的には、対応可能ペアp1とp2が連合グ
ラフ作成部３５ｂ−３ａに送られ、この結果、図２０に
示す連合グラフが得られる。上述した「すべてが互いに
両立可能な対応可能ペアの最大の集合」はこの連合グラ
フにおいては全体的に連結した（完全に互いに両立可能
な）最大の節点集合であると捉えることができる。それ
はクリークであり、本実施例ではより大きなクリークは
より良いマッチングを表している。Specifically, the available pairs p1 and p2 are sent to the association graph creating section 35b-3a, and as a result, the association graph shown in FIG. 20 is obtained. The above-mentioned “maximum set of compatible pairs that are all compatible with each other” can be regarded as the maximum connected node set (completely compatible with each other) in this association graph. It is a clique, with larger cliques representing better matching in this example.

【０１４８】最大クリーク検出部３５ｂ−３ｂは、連合
グラフ作成部３５ｂ−３ａによって作成された連合グラ
フから、節点数が最大である完全連結部分グラフ（最大
クリーク）を抽出する。なお、最大クリーク検出部３５
ｂ−３ｂの動作は、例えば文献「信学論（Ｄ），J68-
Ｄ，3,pp221-228,1985」に記載されている方式を適用す
ることができる。The maximum clique detection unit 35b-3b extracts a fully connected subgraph (maximum clique) having the maximum number of nodes from the association graph created by the association graph creation unit 35b-3a. The maximum clique detection unit 35
The operation of b-3b is described in, for example, the literature “Science theory (D), J68-
D, 3, pp 221-228, 1985 ”can be applied.

【０１４９】最大クリーク検出部３５ｂ−３ｂによって
抽出されたクリーク、すなわち、「すべてが互いに両立
可能な対応可能ペアの最大の集合」を構成する要素の対
応可能ペアは、入力文書の表特徴集合ＩＴと当該モデル
文書の表特徴集合ＭＴ１との間で対応の取れた表の組を
表す。具体的には、最大クリーク：mclique ＝p1，p2が
得られ、ｍｔ１₁とｉｔ₁、ｍｔ１₂とｉｔ₂の対応が
それぞれ得らたことになる。この対応結果は、後段の罫
線照合部３５ｃに送られる。The cliques extracted by the maximum clique detectors 35b-3b, that is, the corresponding pairs of elements that make up the "maximum set of compatible pairs that are all compatible with each other" are the table feature set IT of the input document. And a table feature set MT1 of the model document are associated with each other. Specifically, the maximum cliques: mclique = p1, p2 are obtained, and the correspondences of mt1 ₁ and it ₁ and mt1 ₂ and it ₂ are obtained. This correspondence result is sent to the ruled line collating unit 35c in the subsequent stage.

【０１５０】このとき、入力文書とモデル文書の表特徴
集合の各表では、その内部に含まれる罫線特徴の座標値
は、表の左上端の座標値で正規化されている。また、入
力文書の各表の罫線特徴の座標値は、対応付くモデル文
書の表に重ね合わせられるようにスケール変換されてい
る。At this time, in each table of the table feature set of the input document and the model document, the coordinate value of the ruled line feature contained therein is normalized by the coordinate value of the upper left end of the table. Further, the coordinate values of the ruled line feature of each table of the input document are scale-converted so as to be superimposed on the corresponding model document table.

【０１５１】すなわち、入力表の縦幅がｉｔ-height 、
横幅がｉｔ-width、対応付いたモデル表の縦幅がｍｔ-h
eight 、横幅がｍｔ-widthであるとき、当該入力表の正
規化された罫線特徴の座標値は、さらに、ｘ座標値では
（ｍｔ-width／ｉｔ-width）倍、ｙ座標値では（ｍｔ-h
eight ／ｉｔ-height ）倍されることによりスケール変
換される。この結果、後段の罫線照合部３５ｃでは同じ
スケールかつ同じ座標系のもとで入力表とモデル表に含
まれる罫線集合間の対応付け処理が可能になる。That is, the vertical width of the input table is it-height,
The width is it-width, and the corresponding model table height is mt-h.
When the width is eight and the width is mt-width, the coordinate value of the normalized ruled line feature in the input table is further multiplied by (mt-width / it-width) for the x coordinate value and (mt-width for the y coordinate value. h
Eight / it-height) Scale conversion by multiplication. As a result, the ruled line collating unit 35c in the subsequent stage can perform the matching process between the ruled line sets included in the input table and the model table under the same scale and the same coordinate system.

【０１５２】前述した対応可能ペア検出部３５ｂ−１の
動作を示すＳｔｅｐ３において、ｓｉｍ_kj＜th6 である
場合には、ｍｔ_kとｉｔ_jの構造が異なっている他に、
例えば図２１（ａ）に示すように、入力文書の印刷品質
が悪いために表が分離している場合（図中（ａ−１））
や、隣接する表の間隔が狭いために画像入力時に接触し
てしまったりする場合（図中（ｂ−１））も考えられ
る。In Step 3 showing the operation of the above-mentioned compatible pair detection unit 35b-1, if sim _kj <th6, the structures of mt _k and it _j are different, and
For example, as shown in FIG. 21A, when the input document has poor print quality and the table is separated ((a-1) in the figure)
Alternatively, there may be a case where the adjacent tables come into contact with each other at the time of image input because of a narrow interval ((b-1) in the figure).

【０１５３】このような問題点に対応するために対応可
能ペア検出部３５ｂ−１は、さらに以下の処理を実施す
るようにしてもよい。In order to deal with such a problem, the available pair detection unit 35b-1 may further execute the following processing.

【０１５４】Ｓｔｅｐ３−１：ｍｊｎ_k＞ｉｊｎ_j（ｍ
ｊｎ_k＜ｉｊｎ_j）である場合には、ＩＴ（ＭＴ１）の
中から以下の条件を満たすｉｔ_j（ｍｔ_k）に隣接する
表特徴ｉｔ_l（ｍｔ_l）を１つ検出し、それらを統合し
て仮想的な表特徴ｉｔ′_jを新たに生成し、その際に生
じる接合部数ｉｊｎ′_j（ｍｊｎ′_k）とｍｊｎ_k（ｉ
ｊｎ_j）との類似度ｓｉｍｌ′_kjを計算する。Step 3-1: mjn _k > ijn _j (m
If it is jn _k <ijn _j) is a table wherein it _{_l} (mt _l) detecting one adjacent to the following conditions are met it _j (mt _k) from the IT (MT1), integrate them Then, a new virtual table feature it ′ _j is newly generated, and the number of joints ijn ′ _j (mjn ′ _k ) and mjn _k (i
jn _j ) and the similarity degree siml ′ _kj .

【０１５５】類似度：ｓｉｍｌ′_kjがしきい値th6 以上
である場合には、対応可能であるとして、（ｍｔ_k，ｉ
ｔ′_j）（（ｍｔ′_k，ｉｔ_j））を対応可能ペアとし
て保持する。Similarity: If siml ' _kj is greater than or equal to the threshold value th6, it is considered that it is possible to cope with (mt _k , i
t ′ _j ) ((mt ′ _k , it _j )) is held as a compatible pair.

【０１５６】条件：ｉｔ_j（ｍｔ_k）とｉｔ_l（ｍ
ｔ_l）の間に他の表が存在しないこと。Condition: it _j (mt _k ) and it _l (m
No other table exists during t _l ).

【０１５７】Ｓｔｅｐ３−２：Ｓｔｅｐ３−１で求めた
ｓｉｍｌ′_kjがしきい値th6 に満たない場合、類似度：
ｓｉｍｌ′_kjがしきい値th6 以上となるまで、あるいは
条件を満たす統合すべき表が見つからなくなるまでＳｔ
ｅｐ３−１を繰り返す。Step 3-2: If the siml ' _kj obtained in Step 3-1 is less than the threshold value th6, the similarity:
St is continued until siml ' _kj becomes equal to or greater than the threshold value th6, or until a table that satisfies the conditions cannot be found.
Repeat ep3-1.

【０１５８】この場合、表照合部３５ｂの最良マッチン
グ部３５ｂ−３では、最大クリークを構成する「対応付
き」のすべての組み合わせを出力するようにして、その
後に次の条件を適用することにより候補を絞り込む。そ
れでもなお複数の組み合わせが生じている場合には、そ
れらすべてを後段の罫線照合部３５ｃに送り込み、その
結果に基づいて（それらの中で照合度が最も高くなる組
み合わせを選ぶことにより）最終的に表間対応関係を一
意に決めるようにしてもよい。In this case, the best matching unit 35b-3 of the table matching unit 35b outputs all the combinations of "corresponding" that make up the maximum clique, and then applies the following conditions to the candidates. Narrow down. If a plurality of combinations still occur, all of them are sent to the ruled line matching unit 35c in the subsequent stage, and finally based on the result (by selecting the combination with the highest matching degree). The correspondence between tables may be uniquely determined.

【０１５９】条件：モデル表特徴集合のすべての要素が
入力表特徴集合のいずれかの要素に対応していること。Condition: All the elements of the model table feature set correspond to any of the elements of the input table feature set.

【０１６０】次に、罫線照合部３５ｃでは、表照合部３
５ｂで対応付いた入力表とモデル表にそれぞれ含まれる
罫線集合間で対応付け処理を行なう。このとき水平罫線
集合間の対応付けと垂直罫線集合間の対応付けを独立に
行なうようにしてもよい。そして両者の整合を、各々の
対応付け処理が済んだあとで得るようにしてもよい。こ
の場合の罫線照合部３５ｃは、さらに図２２に示すよう
に構成される。すなわち、表対応選択部３５ｃ−１、垂
直罫線照合部３５ｃ−２、水平罫線照合部３５ｃ−３、
及び方向間整合獲得部３５ｃ−４によって構成されてい
る。Next, in the ruled line collating unit 35c, the table collating unit 3
Correspondence processing is performed between the ruled line sets included in the input table and the model table associated with each other in 5b. At this time, the association between the horizontal ruled line sets and the association between the vertical ruled line sets may be performed independently. Then, the matching between the two may be obtained after the corresponding processing is completed. The ruled line matching unit 35c in this case is further configured as shown in FIG. That is, the table correspondence selection unit 35c-1, the vertical ruled line matching unit 35c-2, the horizontal ruled line matching unit 35c-3,
And an inter-direction matching acquisition unit 35c-4.

【０１６１】まず、表対応選択部３５ｃ−１は、表照合
部３５ｂで抽出された表間対応付きの中から任意の対応
を選択する。垂直罫線照合部３５ｃ−２は、表対応選択
部３５ｃ−１によって選択された入力表とモデル表の垂
直罫線集合間で対応付けを行なう。この対応付けに成功
すれば、さらに水平罫線照合部３５ｃ−３は、水平罫線
集合間で対応付けを行なう。方向間整合獲得部３５ｃ−
３は、垂直罫線照合部３５ｃ−２による対応付けと水平
罫線照合部３５ｃ−３による対応付けとの間の整合を獲
得する。First, the table correspondence selecting unit 35c-1 selects an arbitrary correspondence from the correspondences between tables extracted by the table collating unit 35b. The vertical ruled line collating unit 35c-2 associates the vertical ruled line set of the input table selected by the table correspondence selecting unit 35c-1 with the vertical ruled line set of the model table. If this matching is successful, the horizontal ruled line matching unit 35c-3 further matches the horizontal ruled line sets. Direction matching acquisition unit 35c-
3 obtains the matching between the matching by the vertical ruled line matching unit 35c-2 and the matching by the horizontal ruled line matching unit 35c-3.

【０１６２】なお、垂直罫線照合部３５ｃ−２と水平罫
線照合部３５ｃ−２は、さらに図２３に示すように構成
されている。これらの処理動作は、罫線方向の違いを考
慮する以外は、基本的に同じである。The vertical ruled line matching unit 35c-2 and the horizontal ruled line matching unit 35c-2 are further configured as shown in FIG. These processing operations are basically the same except that the difference in ruled line direction is taken into consideration.

【０１６３】以下に、具体的な説明として、垂直罫線照
合部３５ｃ−２における、ｍｔ１₁の垂直罫線集合Ｍ₁
ＶＬ₁とｉｔ₁の垂直罫線集合ＩＶＬ₁の間の対応付け
処理の動作について説明する。ただし、図１４より、Ｍ₁ＶＬ₁＝（ｍ₁ｖｌ₁，ｍ₁ｖｌ₂，ｍ₁ｖｌ₃，
ｍ₁ｖｌ₄，ｍ₁ｖｌ₅，ｍ₁ｖｌ₆，ｍ₁ｖｌ₇，ｍ
₁ｖｌ₈）とする。以後、垂直罫線を単に罫線と略して
説明する。As a concrete description, the vertical ruled line set M _{1 of} mt1 _{1 in} the vertical ruled line matching unit 35c-2 will be described below.
The operation of the associating process between the vertical ruled line set IVL ₁ of VL ₁ and it ₁ will be described. However, from FIG. 14, M ₁ VL ₁ = (m ₁ vl ₁ , m ₁ vl ₂ , m ₁ vl ₃ ,
m ₁ vl ₄ , m ₁ vl ₅ , m ₁ vl ₆ , m ₁ vl ₇ , m
₁ vl ₈ ). Hereinafter, the vertical ruled line will be simply described as a ruled line.

【０１６４】図２３中に示す対応可能罫線特徴ペア検出
部３５ｃ−２ａは、モデルの罫線集合の各要素に対し
て、要素と対応付く可能性のある入力の罫線特徴をすべ
て検出し、対応可能な罫線特徴のペアとして管理する。
すなわち、以下に説明する手順を実行する。The applicable ruled line feature pair detection unit 35c-2a shown in FIG. 23 detects, for each element of the ruled line set of the model, all of the input ruled line features that may be associated with the element, and can handle them. Manage as a pair of different ruled line features.
That is, the procedure described below is executed.

【０１６５】Ｓｔｅｐ１：任意のＭ₁ＶＬ₁から任意の
一つの罫線特徴ｍ₁ｖｌ_k（ｋ番目の罫線特徴を意味す
る）を選ぶ。Step 1: Select any one ruled line feature m ₁ vl _k (meaning the kth ruled line feature) from any M ₁ VL ₁ .

【０１６６】Ｓｔｅｐ２：ｍ₁ｖｌ_kに対応付く入力罫
線特徴を検出するための探索範囲（ａｒｅａ_mvl）を設
定する。Step 2: A search range (area _mvl ) for detecting the input ruled line feature corresponding to m ₁ vl _k is set.

【０１６７】ここで、ｍ₁ｖｌ_kの罫線特徴の左上端と
右下端の位置座標を（ｍｌｘ1 ，ｍｌｙ1 ）、（ｍｌｘ
2 ，ｍｌｙ2 ）とすると、探索範囲は例えば（ｍｌｘ1
−th9 ，ｍｌｙ1 ）と（ｍｌｘ2 ＋th9 ，ｍｌｙ2 ）の
座標値で構成される矩形の内部としてもよい。ここで、
しきい値th9 が予め与えられているものとする。Here, the position coordinates of the upper left end and the lower right end of the ruled line feature of m ₁ vl _k are (mlx1, mly1), (mlx
2, mly2), the search range is, for example, (mlx1
-Th9, mly1) and (mlx2 + th9, mly2) may be used as the inside of a rectangle constituted by coordinate values. here,
It is assumed that the threshold th9 is given in advance.

【０１６８】Ｓｔｅｐ３：Ｓｔｅｐ２で求められた探索
範囲に内包・交差する入力罫線特徴を抽出する。Step 3: The input ruled line feature that is included / intersects in the search range obtained in Step 2 is extracted.

【０１６９】この処理は例えば以下のようにして行なわ
れる。すなわち、探索対象となる入力罫線の左上端と右
下端の位置座標値を（ｉｘ1 ，ｉｙ1 ）、（ｉｘ2 ，ｉ
ｙ2）とした場合、以下の条件を満たす入力罫線を抽出
する。This processing is performed, for example, as follows. That is, the position coordinate values of the upper left corner and the lower right corner of the input ruled line to be searched are (ix1, iy1), (ix2, i
If y2), the input ruled line satisfying the following conditions is extracted.

【０１７０】条件：ｍｉｎ（ｍｌｘ2 ＋th9 ，ｉｘ2 ）
−ｍａｘ（ｍｌｘ1 ＋th9 ，ｉｘ1）＋１＞０でかつｍ
ｉｎ（ｍｌｙ2 ，ｉｙ2 ）−ｍａｘ（ｍｌｙ1 ，ｉｙ1
）＋１＞０である。Condition: min (mlx2 + th9, ix2)
-Max (mlx1 + th9, ix1) +1> 0 and m
in (mly2, iy2) -max (mly1, iy1
) +1> 0.

【０１７１】Ｓｔｅｐ４：抽出された入力罫線特徴の一
つを選ぶ（例えば、ｉｖｌ_j（入力罫線特徴集合のｊ番
目の罫線特徴）を選んだものとする）。Step 4: Select one of the extracted input ruled line features (for example, ivl _j ( _jth ruled line feature of the input ruled line feature set) is selected).

【０１７２】Ｓｔｅｐ５：ｍ₁ｖｌ_kの縦幅：ｍｌ-hei
ght とｉｖｌ_jの縦幅：ｉｌ-height との類似度：ｓｉ
ｍｌ_kjを計算する。Vertical width of Step 5: m ₁ vl _k : ml-hei
Vertical width of ght and ivl _j : il-height similarity: si
Calculate ml _kj .

【０１７３】ここで、類似度：ｓｉｍｌ_kjは例えば次の
式で求まるとする。Here, it is assumed that the degree of similarity: siml _kj is obtained by the following equation, for example.

【０１７４】ｍｌ-height ＞０またはｉｌ-height ＞０のときｓｉｍｌ_kj＝ 100− 200×｜ｍｌ-height −ｉｌ-heigh
t ｜／（ｍｌ-height＋ｉｌ-height ）ｍｌ-height ＝０かつｉｌ-height ＝０のときｓｉｍｌ_kj＝−100 Ｓｔｅｐ６：類似度：ｓｉｍｌ_kjがあらかじめ設定した
しきい値th10以上である場合には、対応可能であるとし
て、その組（ｍ₁ｖｌ_k，ｉｖｌ_j）を対応可能ペアと
して保持する。When ml-height> 0 or il-height> 0 siml _kj = 100−200 × | ml-height−il-heigh
t | / (ml-height + il-height) ml-height = 0 and il-height = 0 siml _kj = -100 Step 6: Similarity: If siml _kj is equal to or greater than a preset threshold th10, Assuming that the correspondence is possible, the pair (m ₁ vl _k , ivl _j ) is held as a correspondence pair.

【０１７５】Ｓｔｅｐ７：類似度：ｓｉｍｌ_kjがしきい
値th10未満である場合には、さらに以下の処理を行な
う。Step 7: Similarity: When siml _kj is less than the threshold value th10, the following processing is further performed.

【０１７６】Ｓｔｅｐ７−１：図２４に示すように、１
本であるべき線がかすれたり、途切れたりしているため
に分離している場合に対応するために以下の処理を行な
う。Step 7-1: As shown in FIG.
The following processing is performed in order to deal with the case where a line that should be a book is faint or is broken and is separated.

【０１７７】Ｓｔｅｐ７−１−１：Ｓｔｅｐ３で抽出さ
れた入力罫線特徴のうち、以下の条件を満たす罫線特徴
のうち最も近接するものを１つ抽出し、それらを図２５
に示すように統合した際に生じる縦幅：ｉl-height′と
ｍｌ-height との類似度：ｓｉｍｌ′_kjを計算する。た
だし、ｉｖｌ_jの左上端と右下端の位置座標値を、（ｉ
ｖｌ_jｘ1 ，ｉｖｌ_jｙ1 ）、（ｉｖｌ_jｘ2 ，ｉｖｌ
_jｙ2 ）、抽出対象となる入力罫線の左上端と右下端の
位置座標値を（ｉｘ１，ｉｙ１）、（ｉｘ２，ｉｙ２）
とする。Step 7-1-1: Among the input ruled line features extracted in Step 3, one of the ruled line features that satisfies the following conditions is extracted, and the closest one is extracted.
As shown in (1), the similarity between the vertical width: il-height 'and ml-height: siml' _kj is calculated. However, the position coordinate values of the upper left corner and the lower right corner of ivl _j are (i
vl _j x1, ivl _j y1), (ivl _j x2, ivl
_j y2), and the position coordinate values of the upper left corner and the lower right corner of the input ruled line to be extracted are (ix1, iy1), (ix2, iy2)
And

【０１７８】条件：ｍｉｎ（ｉｖｌ_jｘ2 ，ｉｘ2 ）−
ｍａｘ（ｉｖｌ_jｘ1 ，ｉｘ1 ）＋１＞０、類似度：ｓ
ｉｍｌ′_kjがしきい値th10以上である場合には、対応可
能であるとして、その組（ｍ₁ｖｌ_k，ｉｖｌ′_j）を
対応可能ペアとして保持する。Condition: min (ivl _j x2, ix2)-
max (ivl _j x1, ix1) +1> 0, similarity: s
If iml ' _kj is greater than or equal to the threshold value th10, it is determined that they can be supported, and the set (m ₁ vl _k , ivl' _j ) is held as a compatible pair.

【０１７９】Ｓｔｅｐ７−１−２：Ｓｔｅｐ７−１−１
で求めたｓｉｍｌ′_kj＜th10である場合、類似度：ｓｉ
ｍｌ′_kjがしきい値th10以上となるまで、あるいは条件
を満たす統合すべき罫線が見つからなくなるまでＳｔｅ
ｐ７−１−１を繰り返す。Step 7-1-2: Step 7-1-1
When siml ′ _kj <th10 obtained in step S1, the similarity: si
Step until ml ′ _kj becomes equal to or greater than the threshold value th10, or until no ruled line that meets the conditions is found.
Repeat p7-1-1.

【０１８０】Ｓｔｅｐ７−２：図２６に示すようにモデ
ルの複数本の罫線特徴と入力の複数本の罫線特徴とが対
応付くような場合に対応するために以下の処理を行な
う。Step 7-2: As shown in FIG. 26, the following processing is performed in order to cope with a case where a plurality of ruled line features of the model correspond to a plurality of input ruled line features.

【０１８１】Ｓｔｅｐ７−２−１：ｍｌ-height ＞ｉｌ
-height （ｍｌ-height ＜ｉｌ-height ）である場合に
は、ＩＶＬ₁（Ｍ₁ＶＬ₁）の中から以下の条件を満た
すｉｖｌ_j（ｍ₁ｖｌ_k）に最も近接する罫線特徴ｉｖ
ｌ′_j（ｍ₁ｖｌ′_k）を１つ検出し、それらを統合し
た際に生じる縦幅ｉｌ-height ′（ｍｌ-height ′)と
ｍｌ-height(ｉｌ-height)との類似度：ｓｉｍｌ′_kjを
計算する。Step 7-2-1: ml-height> il
-height (ml-height <il-height), the ruled line feature iv that is closest to ivl _j (m ₁ vl _k ) satisfying the following condition out of IVL ₁ (M ₁ VL ₁ ).
Similarity between the vertical widths il-height ′ (ml-height ′) and ml-height (il-height) generated by detecting _one l ′ _j (m ₁ vl ′ _k ) and integrating them: siml Calculate ′ _kj .

【０１８２】ただし、ｉｖｌ_jの左上端と右下端の位置
座標値を、（ｉｖｌ_jｘ１′，ｉｖｌ_jｙ１′）、（ｉ
ｖｌ_jｘ２′，ｉｖｌ_jｙ２′）、予め設定されたしき
い値をth11とする。[0182] However, the position coordinates of the left upper and right lower ends of _{_{ivl j, (ivl j x1 '}} , ivl j y1'), (i
vl _j x2 ', ivl _j y2'), and a preset threshold value is th11.

【０１８３】条件：ｍｉｎ（ｉｖｌ_jｘ２＋th11，ｉｖ
ｌ_jｘ２′）−ｍａｘ（ｉｖｌ_jｘ１−th11，ｉｖｌ_j
ｘ１′）＋１＞０類似度：ｓｉｍｌ′_kjがしきい値th10以上である場合に
は、対応可能であるとして、（ｍ₁ｖｌ_k，ｉｖｌ_j）
と（ｍ₁ｖｌ_k，ｉｖｌ′_j）（（ｍ₁ｖｌ_k，ｉｖｌ
_j）と（ｍ₁ｖｌ′_k，ｉｖｌ_j))を対応可能ペアとし
て保持する。Condition: min (ivl _j x2 + th11, iv
l _j x2 ′)-max (ivl _j x1-th11, ivl _j
x1 ′) + 1> 0 Similarity: When siml ′ _kj is greater than or equal to the threshold value th10, it is possible to support (m ₁ vl _k , ivl _j ).
And _{_{(m 1 vl k, ivl '}} j) ((m 1 vl k, ivl
_j ) and (m ₁ vl ′ _k , ivl _j )) are held as a pair capable of correspondence.

【０１８４】Ｓｔｅｐ７−２−２：Ｓｔｅｐ７−２−１
で求めたｓｉｍｌ′_kj＜th10である場合、類似度：ｓｉ
ｍｌ′_kjがしきい値th10以上となるまで、あるいは条件
を満たす統合すべき罫線が見つからなくなるまでＳｔｅ
ｐ７−２−１を繰り返す。Step 7-2-2: Step 7-2-1
When siml ′ _kj <th10 obtained in step S1, the similarity: si
Step until ml ′ _kj becomes equal to or greater than the threshold value th10, or until no ruled line that meets the conditions is found.
Repeat p7-2-1.

【０１８５】Ｓｔｅｐ８：Ｓｔｅｐ４〜Ｓｔｅｐ７まで
をＳｔｅｐ３で抽出されたすべての入力罫線特徴に対し
て適用する。Step 8: Steps 4 to 7 are applied to all the input ruled line features extracted in Step 3.

【０１８６】Ｓｔｅｐ９：Ｓｔｅｐ１〜Ｓｔｅｐ８まで
を着目Ｍ₁ＶＬ₁のすべての要素に対して適用する。Step 9: Steps ₁ to 8 are applied to all the elements of the attention M ₁ VL ₁ .

【０１８７】ここで、ｍｉｎ（）は（）内の２変数の
内、小さい方を出力する関数であり、ｍａｘ（）は（）
内の２変数の内、大きい方を出力する関数である。Here, min () is a function that outputs the smaller one of the two variables in (), and max () is ().
This is a function that outputs the larger of the two variables.

【０１８８】対応可能罫線特徴ペア検出部３５ｃ−２ａ
の前述した処理動作を、Ｍ₁ＶＬ₁とＩＶＬ₁を用いて
具体的に説明する。対応可能罫線特徴ペア検出部３５ｃ
−２ａにおいて、Ｍ₁ＶＬ₁の各要素で探索範囲を設け
て、それぞれ対応付く可能性を持つＩＶＬ₁の要素を抽
出した結果、以下のような罫線特徴ペアが得られたもの
とする。Correspondence Ruled Line Feature Pair Detection Unit 35c-2a
The above-mentioned processing operation will be specifically described by using M ₁ VL ₁ and IVL ₁ . Available ruled line feature pair detection unit 35c
-2a, it is assumed that the following ruled line feature pairs are obtained as a result of extracting the elements of IVL ₁ that have a possibility of being associated with each other by providing a search range for each element of M ₁ VL ₁ .

【０１８９】ｐｌ₁＝（ｍ₁ｖｌ₁，ｉｖｌ₁）、ｐｌ₂＝（ｍ₁ｖｌ₂，ｉｖｌ₂）、ｐｌ₃＝（ｍ₁ｖｌ₃，ｉｖｌ₃）、ｐｌ₄＝（ｍ₁ｖｌ₄，ｉｖｌ₂）、ｐｌ₅＝（ｍ₁ｖｌ₄，ｉｖｌ₄）、ｐｌ₆＝（ｍ₁ｖｌ₅，ｉｖｌ₄）、ｐｌ₇＝（ｍ₁ｖｌ₆，ｉｖｌ₄）、ｐｌ₈＝（ｍ₁ｖｌ₇，ｉｖｌ₅）、ｐｌ₉＝（ｍ₁ｖｌ₈，ｉｖｌ₆）。Pl ₁ = (m ₁ vl ₁ , ivl ₁ ), pl ₂ = (m ₁ vl ₂ , ivl ₂ ), pl ₃ = (m ₁ vl ₃ , ivl ₃ ), pl ₄ = (m ₁ vl _4). , Ivl ₂ ), pl ₅ = (m ₁ vl ₄ , ivl ₄ ), pl ₆ = (m ₁ vl ₅ , ivl ₄ ), pl ₇ = (m ₁ vl ₆ , ivl ₄ ), pl ₈ = (m ₁ vl ₇ , ivl ₅ ), pl ₉ = (m ₁ vl ₈ , ivl ₆ ).

【０１９０】このうち、ｐｌ₁、ｐｌ₂、ｐｌ₃、ｐｌ
₄、ｐｌ₈、ｐｌ₉は、Ｓｔｅｐ２からＳｔｅｐ６まで
の処理（以後、Ｓｔｅｐ２からＳｔｅｐ６までの処理で
得られた対応可能特徴ペアのみを１対１対応可能ペアと
呼ぶ）で得られる。Of these, pl ₁ , pl ₂ , pl ₃ , pl
₄ , pl ₈ and pl ₉ are obtained by the processing from Step 2 to Step 6 (hereinafter, only the corresponding feature pairs obtained by the processing from Step 2 to Step 6 are referred to as one-to-one correspondence pair).

【０１９１】ｐｌ5 は、Ｓｔｅｐ２からＳｔｅｐ６まで
の処理で類似度：ｓｉｍｌ₄₄がしきい値th10未満であっ
たために、Ｓｔｅｐ７−２−１によりｉｖｌ₄と「ｍ₁
ｖｌ₄とｍ1 ｖｌ5 を統合したもの」のペアを検出し
た。そして、そのペアをｐｌ₅とｐｌ₆の対応可能なペ
アに分けて管理している。ｐｌ₇についても同様にＳｔ
ｅｐ７−２−１でｉｖｌ₄と「ｍ₁ｖｌ₄とｍ₁ｖｌ₆
を統合したもの」のペアを検出していることにより対応
可能なペアとして抽出されている。[0191] pl5 is similarity in the process from Step2 to Step6: To Siml ₄₄ is less than the threshold value TH10, and IVL ₄ by Step7-2-1 "m ₁
v1 ₄ and m1 vl5 integrated "pair was detected. Then, the pair is managed by dividing it into a compatible pair of pl ₅ and pl ₆ . Similarly for pl ₇ , St
In ep7-2-1, ivl ₄ and “m ₁ vl ₄ and m ₁ vl ₆
Is detected as a pair that is integrated, and is extracted as a compatible pair.

【０１９２】対応可能罫線特徴ペア検出部３５ｃ−２ａ
における処理動作のＳｔｅｐ２における探索範囲の設定
は、例えば以下に述べる手順で行なわれてもよい。例え
ば、図２７に示すモデル垂直罫線ＶＬ₁の探索範囲は、
モデル垂直罫線ＶＬ₁に左側で隣接するモデル垂直罫線
ＶＬ₂と、右側で隣接するモデル垂直罫線ＶＬ₃との距
離に応じて設定するようにしてもよい。Correspondence Ruled Line Feature Pair Detection Unit 35c-2a
The setting of the search range in Step 2 of the processing operation may be performed by the procedure described below, for example. For example, the search range of the model vertical ruled line VL ₁ shown in FIG.
It may be set according to the distance between the model vertical ruled line VL ₂ adjacent to the model vertical ruled line VL ₁ on the left side and the model vertical ruled line VL ₃ adjacent to the right side.

【０１９３】すなわち、ＶＬ₁の左上端の位置座標値を
（ＶＬ₁ｘ1 ，ＶＬ₁ｙ1 ）、右下端の位置座標値を
（ＶＬ₁ｘ2 ，ＶＬ₁ｙ2 ）、ＶＬ₂の左上端の位置座
標値を（ＶＬ₂ｘ1 ，ＶＬ₂ｙ1 ）、右下端の位置座標
値を（ＶＬ₂ｘ2 ，ＶＬ₂ｙ2）、ＶＬ₃の左上端の位
置座標値を（ＶＬ₃ｘ1 ，ＶＬ₃ｙ1 ）、右下端の位置
座標値を（ＶＬ₃ｘ2 ，ＶＬ₃ｙ2 ）とすると、ＶＬ₁
とＶＬ₂の間の距離：ｄｉｓｔ12と、ＶＬ₁とＶＬ₃の
間の距離：ｄｉｓｔ13はそれぞれ、ｄｉｓｔ12＝ＶＬ₁ｘ1 −ＶＬ₂ｘ2 ＋１、ｄｉｓｔ13＝ＶＬ₃ｘ1 −ＶＬ₁ｘ2 ＋１、より求まる
こととする。[0193] That is, the position coordinates of the upper left point of the _{_{VL 1 (VL 1 x1, VL}} 1 y1), the position coordinates of the lower right end _{_{(VL 1 x2, VL 1 y2}} ), the position coordinates of the upper left point of the VL ₂ value _{_{(VL 2 x1, VL 2 y1}} ), the position coordinates of the lower right end _{_{(VL 2 x2, VL 2 y2}} ), the position coordinates of the upper left point of the _{_{VL 3 (VL 3 x1, VL}} 3 y1), right If the position coordinate value of the lower end is (VL ₃ x2, VL ₃ y2), VL ₁
The distance between the VL _2: a Dist12, the distance between the VL ₁ and VL _3: dist13 _{_{respectively, dist12 = VL 1 x1 -VL 2}} x2 +1, dist13 = VL 3 x1 -VL 1 x2 +1, more determined that And

【０１９４】そして探索範囲を、((ＶＬ₁ｘ1 −ｄｉｓ
ｔ12／２），（ＶＬ₁ｙ1 ＋th9)）と、((ＶＬ₁ｘ2 ＋
ｄｉｓｔ13／２），（ＶＬ₁ｙ2 ＋th9)）、の位置座標
値で構成される矩形領域としてもよい。Then, the search range is set to ((VL ₁ x1 -dis
t12 / 2), (VL ₁ y1 + th9)) and ((VL ₁ x2 +
_{dist13 / 2), (VL 1} y2 + th9)), it may be a rectangular region composed of position coordinates of the.

【０１９５】ここで、ＶＬ₁とその左側で隣接するモデ
ル垂直罫線ＶＬ₂は、min(ＶＬ₁ｙ2 ，ＶＬ₂ｙ2 ）−
max(ＶＬ₁ｙ1 ，ＶＬ₂ｙ1 ）＋１＞th13を満たす、距
離ｄｉｓｔ12が最小であるモデル垂直罫線を検出するこ
とにより求めることができ、右側で隣接するモデル垂直
罫線ＶＬ3 は、min(ＶＬ₁ｙ2 ，ＶＬ₃ｙ2 ）−max(Ｖ
Ｌ₁ｙ1 ，ＶＬ₃ｙ1 ）＋１＞th13を満たす、距離ｄｉ
ｓｔ13が最小であるモデル垂直罫線を検出することによ
り求めることができる。ここでth13をしきい値とする。
各モデル水平罫線の探索範囲も同様に求めることができ
る。Here, the model vertical ruled line VL ₂ adjacent to VL ₁ on its left side is min (VL ₁ y2, VL ₂ y2) −
The model vertical ruled line VL3 adjacent to the right side can be obtained by detecting the model vertical ruled line having the minimum distance dist12, which satisfies max (VL ₁ y1, VL ₂ y1) +1> th13, and min (VL ₁ y2 , VL ₃ y2) -max (V
L ₁ y1, VL ₃ y1) +1> th13, distance di
This can be obtained by detecting the model vertical ruled line for which st13 is the minimum. Here, th13 is the threshold.
The search range of each model horizontal ruled line can be similarly obtained.

【０１９６】この他にも、Ｓｔｅｐ２における探索範囲
を次のようにして設定してもよい。例えばｋ番目のモデ
ル罫線に着目したとき、探索対象となっている入力罫線
のうちｋ±α以内の番号を有するものを着目モデル罫線
の探索範囲とするようにしてもよい。また、ある大きさ
のパラメータでスケール変換がなされた状態で、着目モ
デル罫線と同じ長さを持つ全ての入力罫線を探索範囲と
しても良い。In addition to this, the search range in Step 2 may be set as follows. For example, when the k-th model ruled line is focused, the input ruled line that is the search target and has a number within k ± α may be set as the search range of the focused model ruled line. Further, all the input ruled lines having the same length as the target model ruled line may be set as the search range in the state where the scale conversion is performed with the parameter of a certain size.

【０１９７】次に、対応可能罫線特徴ペア間両立性判定
部３５ｃ−２ｂは、対応可能罫線特徴ペア検出部３５ｃ
−２ａで検出されたすべての２つの異なる対応可能ペア
の組み合わせにおいて、それらが両立するものであるか
否かを判定する。Next, the compatible ruled line feature pair compatibility determination unit 35c-2b determines the applicable ruled line feature pair detection unit 35c.
-2a, it is determined whether or not they are compatible with each other in the combination of all the two different compatible pairs detected.

【０１９８】ここでの判定条件としては以下のものが上
げられる。判定対象となる対応可能ペアをそれぞれｐ＝
（ｍｌ_k，ｉｌ_j）、ｐ′＝（ｍｌ′_k，ｉｌ′_j）と
する。The following are given as the judgment conditions here. Each available pair to be judged is p =
Let (ml _k , il _j ) and p ′ = (ml ′ _k , il ′ _j ).

【０１９９】ここで、ｍｌ_kの左上端の位置座標を（ｍ
ｌ_kｘ1 ，ｍｌ_kｙ1 ）、右下端の位置座標を（ｍｌ_k
ｘ2 ，ｍｌ_kｙ2 ）、ｍｌ′_kの左上端の位置座標を
（ｍｌ′_kｘ1 ，ｍｌ′_kｙ1 ）、右下端の位置座標を
（ｍｌ′_kｘ2 ，ｍｌ′_kｙ2）、ｉｌ_jの左上端の位
置座標を（ｉｌ_jｘ1 ，ｉｌ_jｙ1 ）、右下端の位置座
標を（ｉｌ_jｘ2 ，ｉｌ_jｙ2 ）、ｉｌ′_jの左上端の
位置座標を（ｉｌ′_jｘ1 ，ｉｌ′_jｙ1 ）、右下端の
位置座標を（ｉｌ′_jｘ2 ，ｉｌ′_jｙ2 ）とする。Here, the position coordinate of the upper left corner of ml _k is (m
l _k x1, ml _k y1), and the position coordinates of the lower right corner are (ml _k
x2, ml _k y2), 'the position coordinates of the upper left point of the _{_{k (ml' ml k x1,}} ml 'k y1), the position coordinates (ml lower right _{_{end' k x2, ml 'k y2}} ), the il _j the position coordinates of the upper left corner _{_{(il j x1, il j y1}} ), the position coordinates of the lower right end _{_{(il j x2, il j y2}} ), il ' the position coordinates of the upper left point of the _{_{j (il' j x1, il}} ' _j y1) and the position coordinates of the lower right corner are (il ′ _j x2, il ′ _j y2).

【０２００】判定条件は、条件１から条件４までのすべ
ての条件を満たさない場合のみ２つの対応可能ペアが両
立可能であるとみなす。It is considered that the two compatible pairs are compatible only when all the conditions 1 to 4 are not satisfied as the determination conditions.

【０２０１】条件１：ｐとｐ′のどちらかが１対１対応
可能ペアでり、かつｍｌ_k＝ｍｌ′_kである。Condition 1: Either p or p'can be a one-to-one correspondence pair, and ml _k = ml ' _k .

【０２０２】条件２：ｐとｐ′のどちらかが１対１対応
可能ペアでり、かつｉｌ_j＝ｉｌ′_jである。Condition 2: Either p or p'is a one-to-one correspondence pair, and il _j = il ' _j .

【０２０３】条件３：ｐとｐ′のどちらも１対１対応可
能ペアでり、かつ以下の条件３−１，３−２のどちらか
を満たす。Condition 3: Both p and p'are one-to-one correspondence pairs, and satisfy either of the following conditions 3-1 and 3-2.

【０２０４】条件３−１：（min(ｍｌ_kｘ2 ，ｍｌ′_k
ｘ2 ）−max(ｍｌ_kｘ1 ，ｍｌ′_kｘ1 ）＋１）＞０か
つ（min(ｍｌ_kｙ2 ，ｍｌ′_kｙ2 ）−max(ｍｌ_kｙ1
，ｍｌ′_kｙ1 ）＋１）＞０。Condition 3-1: (min (ml _k x2, ml ' _k
x2) -max (ml _k x1, ml ′ _k x1) +1)> 0 and (min (ml _k y2, ml ′ _k y2) −max (ml _k y1
, Ml ′ _k y1) +1)> 0.

【０２０５】条件３−２：（min(ｉｌ_kｘ2 ，ｉｌ′_k
ｘ2 ）−max(ｉｌ_kｘ1 ，ｉｌ′_kｘ1 ）＋１）＞０か
つ（min(ｉｌ_kｙ2 ，ｉｌ′_kｙ2 ）−max(ｉｌ_kｙ1
，ｉｌ′_kｙ1 ）＋１）＞０。Condition 3-2: (min (il _k x2, il ' _k
_{x2) -max (il k x1,} il 'k x1) +1)> 0 and _{(min (il k y2, il} ' k y2) -max (il k y1
, Il ′ _k y1) +1)> 0.

【０２０６】条件４：配置関係に逆転がある。すなわ
ち、以下の４−１〜４−４の状態にある。Condition 4: The arrangement relationship is reversed. That is, it is in the following states 4-1 to 4-4.

【０２０７】４−１：ｉｌ_j（ｉｌ′_j）に対して、ｉ
ｌ′_j（ｉｌ_j）が領域１にあり、ｍｌ_k（ｍｌ′_k）
に対して、ｍｌ′_k（ｍｌ_k）が領域２にある。4-1: For il _j (il ' _j ), i
l ′ _j (il _j ) is in the region 1 and ml _k (ml ′ _k )
, There is ml ′ _k (ml _k ) in region 2.

【０２０８】４−２：ｉｌ_j（ｉｌ′_j）に対して、ｉ
ｌ′_j（ｉｌ_j）が領域２にあり、ｍｌ_k（ｍｌ′_k）
に対して、ｍｌ′_k（ｍｌ_k）が領域１にある。4-2: For il _j (il ′ _j ), i
l ′ _j (il _j ) is in the region 2, and ml _k (ml ′ _k )
, There is ml ' _k (ml _k ) in region 1.

【０２０９】４−３：ｉｌ_j（ｉｌ′_j）に対して、ｉ
ｌ′_j（ｉｌ_j）が領域３にあり、ｍｌ_k（ｍｌ′_k）
に対して、ｍｌ′_k（ｍｌ_k）が領域４にある。4-3: For il _j (il ′ _j ), i
l ′ _j (il _j ) is in the region 3, and ml _k (ml ′ _k )
, There is ml ′ _k (ml _k ) in region 4.

【０２１０】４−４：ｉｌ_j（ｉｌ′_j）に対して、ｉ
ｌ′_j（ｉｌ_j）が領域４にあり、ｍｌ_k（ｍｌ′_k）
に対して、ｍｌ′_k（ｍｌ_k）が領域３にある。4-4: For il _j (il ' _j ), i
l ′ _j (il _j ) is in the region 4, and ml _k (ml ′ _k )
, There is ml ′ _k (ml _k ) in region 3.

【０２１１】対応可能罫線特徴ペア間両立性判定部３５
ｃ−２ｂの動作を、前述した対応可能罫線特徴ペアｐｌ
₁〜ｐｌ₉を用いて具体的に説明する。ｐｌ₁からｐｌ
₉までの９個の対応可能罫線特徴ペアのすべての組み合
わせは３６通りある。Compatible ruled line feature pair compatibility determination unit 35
The operation of c-2b can be performed by the corresponding ruled line feature pair pl described above.
_A specific description will be given using _{1 to} pl ₉ . pl ₁ to pl
_There are 36 combinations of all 9 possible ruled line feature pairs up to 9.

【０２１２】そのうち上記条件１〜４をすべて満たさな
い組み合わせは、（ｐｌ₁，ｐｌ₂）、（ｐｌ₁，ｐｌ
₃）、（ｐｌ₁，ｐｌ₄）、（ｐｌ₁，ｐｌ₅）、（ｐ
ｌ₁，ｐｌ₆）、（ｐｌ₁，ｐｌ₇）、（ｐｌ₁，ｐｌ
₈）、（ｐｌ₁，ｐｌ₉）、（ｐｌ₂，ｐｌ₃）、（ｐ
ｌ₂，ｐｌ₅）、（ｐｌ₂，ｐｌ₆）、（ｐｌ₂，ｐｌ
₇）、（ｐｌ₂，ｐｌ₈）、（ｐｌ₂，ｐｌ₉）、（ｐ
ｌ₃，ｐｌ₅）、（ｐｌ₃，ｐｌ₆）、（ｐｌ₃，ｐｌ
₇）、（ｐｌ₃，ｐｌ₈）、（ｐｌ₃，ｐｌ₉）、（ｐ
ｌ₄，ｐｌ₆）、（ｐｌ₄，ｐｌ₇）、（ｐｌ₄，ｐｌ
₈）、（ｐｌ₄，ｐｌ₉）、（ｐｌ₅，ｐｌ₆）、（ｐ
ｌ₅，ｐｌ₇）、（ｐｌ₅，ｐｌ₈）、（ｐｌ₅，ｐｌ
₉）、（ｐｌ₆，ｐｌ₇）、（ｐｌ₆，ｐｌ₈）、（ｐ
ｌ₆，ｐｌ₉）、（ｐｌ₇，ｐｌ₈）、（ｐｌ₇，ｐｌ
₉）、（ｐｌ₈，ｐｌ₉）の３２通りである。これらの
組み合わせの各々において、それを構成する対応可能罫
線特徴ペアは、両立可能であると判断される。Among the combinations which do not satisfy all of the above conditions 1 to 4, (pl ₁ , pl ₂ ), (pl ₁ , pl)
₃ ), (pl ₁ , pl ₄ ), (pl ₁ , pl ₅ ), (p
l ₁ , pl ₆ ), (pl ₁ , pl ₇ ), (pl ₁ , pl
₈ ), (pl ₁ , pl ₉ ), (pl ₂ , pl ₃ ), (p
l ₂ , pl ₅ ), (pl ₂ , pl ₆ ), (pl ₂ , pl
₇ ), (pl ₂ , pl ₈ ), (pl ₂ , pl ₉ ), (p
l ₃ , pl ₅ ), (pl ₃ , pl ₆ ), (pl ₃ , pl
₇ ), (pl ₃ , pl ₈ ), (pl ₃ , pl ₉ ), (p
l ₄ , pl ₆ ), (pl ₄ , pl ₇ ), (pl ₄ , pl
₈ ), (pl ₄ , pl ₉ ), (pl ₅ , pl ₆ ), (p
l ₅ , pl ₇ ), (pl ₅ , pl ₈ ), (pl ₅ , pl
₉ ), (pl ₆ , pl ₇ ), (pl ₆ , pl ₈ ), (p
l ₆ , pl ₉ ), (pl ₇ , pl ₈ ), (pl ₇ , pl
₉ ) and (pl ₈ , pl ₉ ) in 32 ways. In each of these combinations, the corresponding ruled line feature pairs that make up the combination are determined to be compatible.

【０２１３】次に、最良マッチング抽出部３５ｃ−２ｃ
は，対応可能罫線特徴ペア間両立性判定部３５ｃ−２ｂ
において両立すると判定されたもののうち、すべてが互
いに両立可能な対応可能ペアの最大の集合を求めること
により、入力表特徴集合とモデル表特徴集合間の最良マ
ッチングを抽出する。Next, the best matching extraction unit 35c-2c
Is a compatible ruled line feature pair compatibility determination unit 35c-2b.
The best matching between the input table feature set and the model table feature set is extracted by finding the maximum set of compatible pairs that are all compatible with each other.

【０２１４】なお、最良マッチング抽出部３５ｃ−２ｃ
は、図１７に示す最良マッチング抽出部３５ｂ−３と同
一の機能を有している（詳細は図１９に示している）。
最良マッチング抽出部３５ｃ−２ｃを構成している連合
グラフ作成部では、対応可能罫線特徴ペア間両立性判定
部３５ｃ−２ｂで得られた結果から連合グラフを作る
か、ｐｌ₁からｐｌ₉までの９個の対応可能罫線特徴ペ
アに対しては、図２８に示すものが作られる。The best matching extraction units 35c-2c
Has the same function as the best matching extraction unit 35b-3 shown in FIG. 17 (details are shown in FIG. 19).
In the association graph creation unit that constitutes the best matching extraction unit 35c-2c, an association graph is created from the results obtained by the compatible ruled line feature pair compatibility determination unit 35c-2b, or from the pl ₁ to pl ₉ For 9 possible ruled line feature pairs, the one shown in FIG. 28 is created.

【０２１５】Ｍ₁ＶＬ₁とＩＶＬ₁の間の対応付け処理
に関しては、最良マッチング抽出部３５ｃ−２ｃにおい
て、（ｍ₁ｖｌ₁，ｉｖｌ₁）、（ｍ₁ｖｌ₂，ｉｖｌ
₂）、（ｍ₁ｖｌ₃，ｉｖｌ₃）、（ｍ₁ｖｌ₅，ｉｖ
ｌ₄）、（ｍ₁ｖｌ₆，ｉｖｌ₄）、（ｍ₁ｖｌ₇，ｉ
ｖｌ₅）、（ｍ₁ｖｌ₈，ｉｖｌ₆）、の対応が抽出さ
れたものとする。Regarding the matching process between M ₁ VL ₁ and IVL ₁ , in the best matching extraction units 35c-2c, (m ₁ vl ₁ , ivl ₁ ), (m ₁ vl ₂ , ivl)
₂ ), (m ₁ vl ₃ , ivl ₃ ), (m ₁ vl ₅ , iv
l ₄ ), (m ₁ vl ₆ , ivl ₄ ), (m ₁ vl ₇ , i
It is assumed that the correspondence of vl ₅ ) and (m ₁ vl ₈ , ivl ₆ ) is extracted.

【０２１６】ｍｔ１₁の水平罫線集合：Ｍ₁ＨＬ₁＝
（ｍ₁ｈｌ₁，ｍ₁ｈｌ₂，ｍ₁ｈｌ₃，ｍ₁ｈｌ₄，
ｍ₁ｈｌ₅）とｉｔ₁の水平罫線集合：ＩＨＬ₁＝（ｉ
ｈｌ₁，ｉｈｌ₂，ｉｈｌ₃，ｉｈｌ₄，ｉｈｌ₅）の
間の対応付け処理も、水平罫線照合部３５ｃ−３におい
て、同様に、（ｍ1 ｈｌ₁，ｉｈｌ₁）、（ｍ₁ｈ
ｌ₂，ｉｈｌ₂）、（ｍ₁ｈｌ₃，ｉｈｌ₃）、（ｍ₁
ｈｌ₄，ｉｈｌ₄）、（ｍ₁ｈｌ₅，ｉｈｌ₅）、の対
応が抽出されたものとする。Set of horizontal ruled lines of mt1 ₁ : M ₁ HL ₁ =
(M ₁ hl ₁ , m ₁ hl ₂ , m ₁ hl ₃ , m ₁ hl ₄ ,
horizontal ruled line set of m ₁ hl ₅ ) and it ₁ : IHL ₁ = (i
Similarly, in the matching process among the hl ₁ , ihl ₂ , ihl ₃ , ihl ₄ , ihl ₅ ), the horizontal ruled line matching unit 35c-3 similarly (m1 hl ₁ , ihl ₁ ), (m ₁ h
l ₂ , ihl ₂ ), (m ₁ hl ₃ , ihl ₃ ), (m ₁
It is assumed that the correspondence of hl ₄ , ihl ₄ ) and (m ₁ hl ₅ , ihl ₅ ) is extracted.

【０２１７】表照合部３５ｂで抽出されたｍｔ１₂とｉ
ｔ₂の対応における、罫線集合間の対応付け処理も罫線
照合部３５ｃで同様に行なわれ、ｍｔ１₂の垂直罫線集
合：Ｍ₁ＶＬ₂＝（ｍ₁ｖｌ₉，ｍ₁ｖｌ₁₀，ｍ₁ｖｌ
₁₁）とｉｔ₂の垂直罫線集合：ＩＶＬ₂＝（ｉｖｌ₇，
ｉｖｌ₈，ｉｖｌ₉）の間では、（ｍ₁ｖｌ₉，ｉｖｌ
₇）、（ｍ₁ｖｌ₁₀，ｉｖｌ₈）、（ｍ₁ｖｌ₁₁，ｉｖ
ｌ₉）、ｍｔ１₂の水平罫線集合：Ｍ₁ＨＬ₁＝（ｍ₁
ｈｌ₆，ｍ₁ｈｌ₇）とｉｔ₂の水平罫線集合：ＩＨＬ
₂＝（ｉｈｌ₇，ｉｈｌ₈）の間では、（ｍ₁ｖｌ₆，
ｉｖｌ₆）、（ｍ₁ｖｌ₇，ｉｖｌ₇）の罫線特徴間の
対応が得られたものとする。Mt1 ₂ and i extracted by the table collating unit 35b
Correspondence processing between ruled line sets in correspondence of t ₂ is similarly performed by the ruled line collating unit 35c, and a vertical ruled line set of mt1 ₂ is: M ₁ VL ₂ = (m ₁ vl ₉ , m ₁ vl ₁₀ , m ₁ vl
₁₁ ) and it ₂ vertical ruled line set: IVL ₂ = (ivl ₇ ,
Between ivl ₈ and ivl ₉ ), (m ₁ vl ₉ and ivl
₇ ), (m ₁ vl ₁₀ , ivl ₈ ), (m ₁ vl ₁₁ , iv)
l ₉ ), mt1 ₂ horizontal ruled line set: M ₁ HL ₁ = (m ₁
hl ₆ , m ₁ hl ₇ ) and it ₂ horizontal ruled line set: IHL
_{Between 2} = (ihl ₇ , ihl ₈ ), (m ₁ vl ₆ ,
It is assumed that the correspondence between the ruled line features of ivl ₆ ) and (m ₁ vl ₇ , ivl ₇ ) is obtained.

【０２１８】照合度計算部３５ｄは、罫線照合部３５ｃ
によって対応関係が抽出されたモデル文書と入力文書の
間で、当該構造化特徴間の対応付きを数量化することに
よりその度合い（照合度）を計算する。The collation degree calculation unit 35d includes a ruled line collation unit 35c.
The degree (matching degree) is calculated by quantifying the correspondence between the structured features between the model document and the input document from which the correspondence relationship is extracted by.

【０２１９】照合度は、モデル照合部３５に送られてき
たすべてのモデル文書と入力文書との間で計算され、照
合結果出力部３５ｅに出力される。照合度：matching-m
etric は、モデル水平罫線数をsmhl-num、モデル垂直罫
線数をsmvl-num、入力水平罫線のうちモデル水平罫線と
対応付いたものの総数をsmch-num、入力垂直罫線のうち
モデル垂直罫線と対応付いたものの総数をsmcv-numとし
たときに、例えば以下の式で定義される。The matching degree is calculated between all the model documents sent to the model matching unit 35 and the input document, and is output to the matching result output unit 35e. Matching degree: matching-m
etric is the number of model horizontal ruled lines smhl-num, the number of model vertical ruled lines smvl-num, the total number of input horizontal ruled lines that are associated with the model horizontal ruled line is smch-num, and the corresponding input vertical ruled line is the model vertical ruled line When the total number of attached items is smcv-num, it is defined by the following formula, for example.

【０２２０】matching-metric ＝ 100− 200×（｜smhl
-num−smch-num｜＋｜smvl-num−smcv-num｜）／（（sm
hl-num−smch-num）＋（smvl-num−smcv-num））例えば、図９の入力文書と図１４のモデル文書との間の
照合度： matching-metric1 は、smhl-num＝７，smvl-n
um＝11，smch-num＝７，smcv-num＝10より、 matching-metric₁＝ 100− 200×（｜７−７｜＋｜11
−10｜／（７＋７）＋（11＋11））＝94.44 、となる。Matching-metric = 100-200 × (| smhl
-num-smch-num | + | smvl-num-smcv-num |) / ((sm
hl-num-smch-num) + (smvl-num-smcv-num)) For example, the matching degree between the input document of FIG. 9 and the model document of FIG. 14: matching-metric1 is smhl-num = 7, smvl-n
From um = 11, smch-num = 7, smcv-num = 10, matching-metric ₁ = 100-200 x (| 7-7 | + | 11
−10 | / (7 + 7) + (11 + 11)) = 94.44.

【０２２１】また、図９の入力文書と図１５のモデル文
書との間の照合度： matching-metric₂は、当該構造化
特徴間の対応関係が抽出可能であったので（−100 ）を
設定する。The matching degree: matching-metric ₂ between the input document in FIG. 9 and the model document in FIG. 15 is set to (-100) because the corresponding relationship between the structured features can be extracted. To do.

【０２２２】照合結果出力部３５ｅは、モデル照合部３
５に送られてきたすべてのモデル構造化特徴と入力構造
化特徴との間の照合度のうち、最大値を示すモデル文書
と入力文書の組み合わせを選び、それらの構造化特徴間
の対応関係と共に照合結果判定部３７に出力する。The matching result output unit 35e is the model matching unit 3
Among the matching degrees between all the model structured features and the input structured features sent to S5, the combination of the model document and the input document showing the maximum value is selected, and the correspondence between these structured features is selected. The result is output to the collation result determination unit 37.

【０２２３】照合結果判定部３７は、モデル照合部３５
で最大照合度を示したモデル文書と入力文書の構造化特
徴間の対応関係（以後、構造化特徴間対応関係と呼ぶ）
が獲得できたか否かを判定する。The collation result judging unit 37 is the model collating unit 35.
Correspondence between the structured features of the model document and the input document showing the maximum matching degree in (hereinafter referred to as the correspondence between structured features)
It is determined whether or not was acquired.

【０２２４】すなわち、モデル照合部３５で計算された
最大照合度が予め与えられているしきい値：th7 以上で
ある場合には、構造化特徴間対応関係を獲得できたと判
定する。また、最大照合度がth7 未満である場合には、
構造化特徴間対応関係を獲得できなかったとして、入力
文書を棄却して、次の文書の入力を実施する。That is, when the maximum matching degree calculated by the model matching unit 35 is equal to or greater than the threshold value: th7 which is given in advance, it is determined that the structured feature correspondence can be acquired. When the maximum matching degree is less than th7,
Assuming that the structured feature correspondence cannot be acquired, the input document is rejected and the next document is input.

【０２２５】また、この場合、フォーマット種別同定部
３３で選出されなかったモデルのすべてに対して、モデ
ル照合部３５で照合処理を行ない、その結果をこの照合
結果判定部３７で判定するようにしてもよい。こうする
と、フォーマット種別同定部３３における処理誤りを救
済でき、処理結果の精度が高まる。In this case, the model collation unit 35 performs collation processing on all the models not selected by the format type identification unit 33, and the result is discriminated by the collation result determination unit 37. Good. By doing so, the processing error in the format type identifying unit 33 can be relieved, and the accuracy of the processing result can be improved.

【０２２６】具体的に説明すると、例えば、モデル照合
部３５で図９の入力構造化特徴と図１４のモデル構造化
特徴との間で最大照合度が計算されたとすると、その
値：max-sim は matching-metric₁より、max-sim ＝9
4.44 であり、 th7＝60とするとth7＜max-sim より、当
該構造化特徴間の対応関係は獲得されたと見なされる。More specifically, for example, when the model matching unit 35 calculates the maximum matching degree between the input structured feature of FIG. 9 and the model structured feature of FIG. 14, its value: max-sim. Is from matching-metric ₁ , max-sim = 9
It is 4.44, and if th7 = 60, then th7 <max-sim, and it is considered that the corresponding relationship between the structured features has been acquired.

【０２２７】構造化特徴間の対応関係は、例えば次のよ
うな形式により保持される。すなわち、入力罫線特徴モ
デル罫線特徴の各対応付きにおいて、入力罫線特徴のim
ageに、それに対応するモデル罫線特徴の識別子を格納
し、モデル罫線特徴のimageに、それに対応付くモデル
罫線の識別子を格納する。予め、それぞれの罫線集合の
各罫線特徴のimage に−１をセットしておけば、対応付
かない罫線特徴のimage には常に−１が設定されている
ことになる。The correspondence between the structured features is held in the following format, for example. That is, in each input ruled line feature model ruled line feature correspondence, the input ruled line feature im
The age of the model ruled line feature corresponding thereto is stored in age, and the identifier of the model ruled line corresponding thereto is stored in image of the model ruled line feature. If -1 is set in advance to the image of each ruled line feature of each ruled line set, -1 is always set to the image of the ruled line feature that is not associated.

【０２２８】モデル照合部３５と照合結果判定部３７を
経て獲得された構造化特徴対応関係のうちで最も重要な
ものは、入力文書の罫線集合とモデル文書の罫線集合の
間の対応関係（以後、罫線特徴間対応関係と呼ぶ）であ
る。後段の文字列領域抽出部４３は、この対応関係に基
づいて入力画像から文字列領域を切り出す。The most important structured feature correspondence obtained through the model matching unit 35 and the matching result judging unit 37 is the correspondence between the ruled line set of the input document and the ruled line set of the model document (hereinafter , Which is referred to as a ruled line feature correspondence). The character string region extraction unit 43 in the subsequent stage cuts out the character string region from the input image based on this correspondence.

【０２２９】このとき罫線間対応関係が、どちらかの構
造化特徴の欠落のため不完全であったり、矛盾を含んで
いる場合には、処理不能となってしまう。このような問
題点を解決するために、未対応・矛盾対応発見修正部４
１は、文字列領域抽出部４３による処理の前に、罫線間
対応関係に対する未対応・矛盾対応を修正する。At this time, if the correspondence between ruled lines is incomplete due to the lack of one of the structured features or contains a contradiction, it becomes impossible to process. In order to solve such a problem, the unfixed / contradictory correspondence discovery / correction unit 4
1 corrects the non-correspondence / contradiction correspondence with respect to the correspondence between ruled lines before the processing by the character string area extraction unit 43.

【０２３０】未対応・矛盾対応発見修正部４１は、以下
の処理（１）（２）を実施する。The non-correspondence / contradiction correspondence finding / correcting unit 41 carries out the following processes (1) and (2).

【０２３１】（１）モデルの罫線集合を構成する罫線特
徴のうち入力罫線に対応付いてないものを検出し、すで
に対応付いている他の対応関係を利用し、入力罫線集合
において対応付くものを発見するか、対応付くべきもの
を仮想入力罫線として自動的に発生させて、新たに入力
罫線集合に加える。(1) Among the ruled line features that make up the model ruled line set, those that are not associated with the input ruled line are detected, and the other features that are already associated are used to identify those that are associated with the input ruled line set. A virtual input ruled line to be discovered or associated is automatically generated and newly added to the input ruled line set.

【０２３２】（２）上記（１）の処理を行なうと図２９
に示すような矛盾が生じてしまう場合には、それを解消
し、無矛盾な対応関係を生みだすようにする。(2) When the processing of (1) above is performed, FIG.
If a contradiction such as the one shown in Figure 6 occurs, it is resolved and a consistent relationship is created.

【０２３３】このような未対応・矛盾対応発見修正部３
９の動作は、例えば図３０に示すフローチャートに従
う。図３０に示すフローチャートの各ステップの処理動
作は以下のようになる。ただし、この時点でも入力罫線
集合の各要素は対応付いているモデル罫線集合の座標系
に変換されたままであるものとする。Such unsupported / contradictory correspondence discovery / correction unit 3
The operation of 9 follows the flowchart shown in FIG. 30, for example. The processing operation of each step of the flowchart shown in FIG. 30 is as follows. However, it is assumed that each element of the input ruled line set is still converted to the coordinate system of the associated model ruled line set at this point.

【０２３４】ｆ１：当該モデル罫線集合の要素のうち、
特徴のimage に−１が付与されているものを検出する。F1: Of the elements of the model ruled line set,
The feature image with -1 is detected.

【０２３５】ｆ２：着目モデル罫線特徴の左上端の座標
値（ｍｘ1 ，ｍｙ1 ）と右下端の座標値（ｍｘ2 ，ｍｙ
2 ）に対して、それぞれ以下のように探索範囲を設け
る。ここで、しきい値th8 が予め設定されているものと
する。F2: Coordinate values (mx1, my1) at the upper left end and coordinate values (mx2, my) at the lower right end of the target model ruled line feature
For 2), the search ranges are set as follows. Here, it is assumed that the threshold value th8 is preset.

【０２３６】ｌｉｍ-x1 ＝ｍｘ1 −th8 ｌｉｍ-y1 ＝ｍｙ1 −th8 ｌｉｍ-x2 ＝ｍｘ2 ＋th8 ｌｉｍ-y2 ＝ｍｙ2 ＋th8 この探索範囲に含まれる入力罫線特徴のうち、着目モデ
ル罫線に最も近いものを検出する。この時、近さの尺度
を表す距離値は、例えば罫線特徴の重心間のユークリッ
ド距離で定義されてもよい。Lim-x1 = mx1−th8 lim-y1 = my1−th8 lim-x2 = mx2 + th8 lim-y2 = my2 + th8 Among the input ruled line features included in this search range, the one closest to the target model ruled line is detected. To do. At this time, the distance value representing the measure of the closeness may be defined by, for example, the Euclidean distance between the centers of gravity of the ruled line features.

【０２３７】ｆ３：仮想入力罫線として、以下の特徴
（罫線特徴）を有するものを生成する。罫線特徴は、左
上端の位置座標（ｋｘ1 ，ｋｙ1 ）、右下端の位置座標
（ｋｘ2 ，ｋｙ2 ）、縦幅（ｋ−height）、横幅（ｋ−
width ）、重心の位置座標（ｋｃｘ，ｋｃｙ）を含む。F3: A virtual input ruled line having the following features (ruled line feature) is generated. The ruled line features are position coordinates (kx1, ky1) at the upper left end, position coordinates (kx2, ky2) at the lower right end, vertical width (k-height), and horizontal width (k-
width) and the position coordinates (kcx, kcy) of the center of gravity.

【０２３８】ｆ４：着目モデル罫線が、１．水平罫線の場合、着目モデル罫線の両端に接続する
垂直モデル罫線をそれぞれ検出する。これらの垂直モデ
ル罫線が、(a) 存在し、かつそれに対応付く入力罫線が
存在する場合には、その垂直入力罫線特徴の左上端と右
下端の位置座標のうちｘ座標値のみを、それぞれｋｘ1
とｋｘ2 に格納する。F4: The target model ruled line is 1. In the case of horizontal ruled lines, vertical model ruled lines connected to both ends of the model ruled line of interest are detected. If these vertical model ruled lines exist (a) and the input ruled lines corresponding to them exist, only the x coordinate value of the position coordinates of the upper left end and the lower right end of the vertical input ruled line feature is respectively kx1.
And kx2.

【０２３９】(b) 存在しない場合、もしくは垂直モデル
罫線が存在してもそれに対応付く入力罫線が存在しない
場合には、着目水平モデル罫線の位置座標値であるｍｘ
1 とｍｘ2 をそれぞれｋｘ1 とｋｘ2 に格納する。(B) If it does not exist, or if there is a vertical model ruled line but no corresponding input ruled line, the position coordinate value mx of the horizontal model ruled line of interest.
Store 1 and mx2 in kx1 and kx2, respectively.

【０２４０】２．垂直罫線の場合、着目モデル罫線の位
置座標値であるｍｘ1 とｍｘ2 をそれぞれｋｘ1 とｋｘ
2 に格納する。2. In the case of a vertical ruled line, the position coordinate values mx1 and mx2 of the model ruled line of interest are set to kx1 and kx, respectively.
Store in 2.

【０２４１】ｆ５：着目モデル罫線が、１．垂直罫線の場合、着目モデル罫線の両端に接続する
水平モデル罫線をそれぞれ検出する。これらの水平モデ
ル罫線が、(a) 存在し、かつそれに対応付く入力罫線が
存在する場合には、その垂直入力罫線特徴の左上端と右
下端の位置座標のうちｙ座標値のみを、それぞれｋｙ1
とｋｙ2 に格納する。F5: The target model ruled line is 1. In the case of vertical ruled lines, the horizontal model ruled lines connected to both ends of the model ruled line of interest are detected. If these horizontal model ruled lines (a) exist and the corresponding input ruled lines also exist, only the y coordinate value of the position coordinates of the upper left end and the lower right end of the vertical input ruled line feature is determined by ky1
And ky2.

【０２４２】(b) 存在しない場合、もしくは水平モデル
罫線が存在してもそれに対応付く入力罫線が存在しない
場合には、着目垂直モデル罫線の位置座標値であるｍｙ
1 とｍｙ2 をそれぞれｋｙ1 とｋｙ2 に格納する。(B) If it does not exist, or if there is a horizontal model ruled line but no corresponding input ruled line, the position coordinate value my of the vertical model ruled line of interest is my.
Store 1 and my2 in ky1 and ky2, respectively.

【０２４３】２．水平罫線の場合、着目モデル罫線の位
置座標値であるｍｙ1 とｍｙ2 をそれぞれｋｙ1 とｋｙ
2 に格納する。2. In the case of a horizontal ruled line, the position coordinate values my1 and my2 of the target model ruled line are set to ky1 and ky, respectively.
Store in 2.

【０２４４】ｆ６：ｆ２で検出モデル罫線特徴のimage
を調べる。image に−１がセットされている場合には、
図３０のフローチャートでＮＯの方向に、−１以外の値
がセットされている場合にはＹＥＳの方向に処理を進め
る。F6: An image of the feature of the detected model ruled line at f2
Find out. If image is set to -1,
If a value other than -1 is set in the direction of NO in the flowchart of FIG. 30, the process proceeds in the direction of YES.

【０２４５】ｆ７：着目モデル罫線が水平罫線の場合、
着目モデル罫線の両端に接続する垂直モデル罫線をそれ
ぞれ検出する。これらの垂直モデル罫線が、１．存在し、かつそれら対応付く入力罫線が存在する場
合には、その垂直入力罫線特徴の左上端と右下端の位置
座標のうちｘ座標値のみをそれぞれ着目水平入力罫線の
左上端と右下端のｘ座標値に格納する。F7: When the target model ruled line is a horizontal ruled line,
The vertical model ruled lines connected to both ends of the model ruled line of interest are detected. These vertical model lines are: If they exist and there is an input ruled line corresponding to them, only the x-coordinate value of the position coordinates of the upper left corner and the lower right corner of the vertical input ruled line feature is respectively considered as the x at the upper left corner and the lower right corner of the horizontal input ruled line. Store in coordinate values.

【０２４６】２．存在しない場合、もしくは垂直モデル
罫線が存在してもそれに対応付く入力罫線が存在しない
場合には、着目水平モデル罫線の位置座標値であるｍｘ
1とｍｘ2 をそれぞれ着目水平入力罫線の左上端と右下
端のｘ座標値に格納する。2. If it does not exist, or if there is a vertical model ruled line but no corresponding input ruled line, the position coordinate value mx of the horizontal model ruled line of interest.
1 and mx2 are stored in the x coordinate values of the upper left corner and the lower right corner of the horizontal input ruled line of interest, respectively.

【０２４７】ｆ８：着目モデル罫線が垂直罫線の場合、
着目モデル罫線の両端に接続する水平モデル罫線をそれ
ぞれ検出する。これらの水平モデル罫線が、１．存在し、かつそれに対応付く入力罫線が存在する場
合には、その水平入力罫線特徴の左上端と右下端の位置
座標のうちｙ座標値のみをそれぞれ着目水平入力罫線の
左上端と右下端のｙ座標値に格納する。F8: When the target model ruled line is a vertical ruled line,
The horizontal model ruled lines connected to both ends of the target model ruled line are respectively detected. These horizontal model lines are: If it exists and there is an input ruled line corresponding to it, only the y coordinate value of the position coordinates of the upper left corner and the lower right corner of the horizontal input ruled line feature is only the y coordinates of the upper left corner and the lower right corner of the horizontal input ruled line of interest, respectively. Store in coordinate values.

【０２４８】２．存在しない場合、もしくは水平モデル
罫線が存在してもそれに対応付く入力罫線が存在しない
場合には、着目垂直モデル罫線の位置座標値であるｍｙ
1とｍｙ2 をそれぞれ着目垂直入力罫線の左上端と右下
端のｙ座標値に格納する。2. If it does not exist, or if there is a horizontal model ruled line but no corresponding input ruled line, the position coordinate value my of the vertical model ruled line of interest is my.
1 and my2 are stored in the y coordinate values of the upper left corner and the lower right corner of the vertical input ruled line of interest, respectively.

【０２４９】図３０に示すフローチャートの具体的な動
作を図９に示す入力構造化特徴と図１４に示すモデル構
造化特徴を用いて説明する。当該モデル罫線特徴集合の
うちｍ₁ｖｌ₄のみ、対応付く入力罫線が存在しない。
従って、図３０のフローチャートのステップｆ１でｍ₁
ｖｌ₄が検出される。次いで、ステップｆ２において、
ｍ₁ｖｌ₄の探索範囲に含まれるｉｖｌ₄を検出し、さ
らにステップｆ６でｉｖｌ₄に対応付くモデル罫線とし
てｍ₁ｖｌ₅とｍ₁ｖｌ₆を検出する。このうち、ｍ₁
ｖｌ₆とｍ₁ｖｌ₄は共存できる（同時に存在できる）
が、ｍ₁ｖｌ₄とｍ₁ｖｌ₅の両立性は矛盾する。そこ
で、ステップｆ９でｍ₁ｖｌ₄とｍ₁ｖｌ₅のどちらが
当該入力罫線に近いかが判断され、その結果、ｍ₁ｖｌ
₄の方が近いことが分かる。これによりステップｆ１０
でｉｖｌ₄とｍ₁ｖｌ₅の対応関係が無効とされ、ステ
ップｆ１１，ｆ１２において、ｍ₁ｖｌ₄とｉｖｌ₄の
対応が新たに生成され、これにより生じる座標値の変更
がステップｆ７，ｆ８で行なわれる。The specific operation of the flow chart shown in FIG. 30 will be described with reference to the input structured feature shown in FIG. 9 and the model structured feature shown in FIG. Of the model ruled line feature set, only m ₁ vl ₄ has no associated input ruled line.
Thus, m ₁ in step f1 of the flow chart of FIG. 30
vl ₄ is detected. Then, in step f2,
ivl ₄ included in the search range of m ₁ vl ₄ is detected, and m ₁ vl ₅ and m ₁ vl ₆ are detected as model ruled lines corresponding to ivl ₄ in step f6. Of these, m ₁
vl ₆ and m ₁ vl ₄ can coexist (can exist at the same time)
However, the compatibility of m ₁ vl ₄ and m ₁ vl ₅ is contradictory. Therefore, in step f9, it is determined which of m ₁ vl ₄ and m ₁ vl ₅ is closer to the input ruled line, and as a result, m ₁ vl 4 is determined.
You can see that ₄ is closer. As a result, step f10
Then, the correspondence between ivl ₄ and m ₁ vl ₅ is invalidated, a correspondence between m ₁ vl ₄ and ivl ₄ is newly generated in steps f11 and f12, and the resulting change in the coordinate values is performed in steps f7 and f8. Done.

【０２５０】ステップｆ１０で対応関係を無効とされた
ことにより、ｍ₁ｖｌ₅に対応付く入力罫線が存在しな
くなった。これをステップｆ１４で検知したあと、ｍ₁
ｖｌ₅に対応付く入力罫線の発見・設定処理が行なわれ
る。まず、ステップｆ２でｍ₁ｖｌ₅の探索範囲でｉｖ
ｌ₄を発見し、ステップｆ６でｉｖｌ₄に対応付くモデ
ル罫線としてｍ₁ｖｌ₄とｍ₁ｖｌ₆を検出する。この
うち、ｍ₁ｖｌ₆とｍ₁ｖｌ₅は共存できる（同時に存
在できる）が、ｍ₁ｖｌ₄とｍ₁ｖｌ₅の両立性は矛盾
することがわかっており、さらにｍ₁ｖｌ₄の方がｍ₁
ｖｌ₅より近いことが分かっているので、ｍ₁ｖｌ₅に
対応する入力罫線は当該入力罫線集合では見つからなか
ったとして、ステップｆ３で仮想入力罫線を生成させ、
その位置座標をステップｆ４とｆ５で設定し、ステップ
ｆ１３で当該罫線集合に加える。この結果、当該モデル
罫線集合に未対応モデル罫線がなくなり、未対応・矛盾
対応発見修正手段３９における処理は終了となる。Since the correspondence is invalidated in step f10, there is no input ruled line associated with m ₁ vl ₅ . After detecting this in step f14, m ₁
The input ruled line corresponding to vl ₅ is found and set. First, in step f2, iv is set in the search range of m ₁ vl _5.
l ₄ is found, and m ₁ vl ₄ and m ₁ vl ₆ are detected as model ruled lines corresponding to ivl ₄ in step f6. Of these, m ₁ vl ₆ and m ₁ vl ₅ can coexist (can exist at the same time), but it is known that the compatibility of m ₁ vl ₄ and m ₁ vl ₅ is inconsistent, and m ₁ vl ₄ is more compatible. Is m ₁
Since it is known that the input ruled line corresponding to m ₁ vl ₅ is not found in the input ruled line set since it is closer than vl ₅ , a virtual input ruled line is generated in step f3,
The position coordinates are set in steps f4 and f5, and added to the ruled line set in step f13. As a result, there is no uncorresponding model ruled line in the model ruled line set, and the process in the uncorresponding / contradictory correspondence finding / correcting means 39 ends.

【０２５１】以上の処理結果により得られた罫線マッチ
ングに対して、以下の手順によって示されるマッチング
後処理を適用することによって、さらにマッチング結果
の精度を上げることができる。あるモデル罫線に対応付
くべき入力罫線の付近に当該モデル罫線との間の類似度
が高い線分（例えば取り消し線）が存在する場合、誤っ
てそれに対応づけてしまうことがある。以下の処理は、
このような誤りを解消するものである。By applying the post-matching processing shown in the following procedure to the ruled line matching obtained by the above processing result, the accuracy of the matching result can be further improved. When there is a line segment (for example, a strike-through line) having a high degree of similarity with the model ruled line near the input ruled line that should be associated with a certain model ruled line, it may be erroneously associated with it. The following process
Such an error is eliminated.

【０２５２】Ｓｔｅｐ１：モデル罫線（ｍｌ）を１つ抽
出する。Step 1: One model ruled line (ml) is extracted.

【０２５３】Ｓｔｅｐ２：その探索範囲（上述したａｒ
ｅａ_ml）内に存在する入力罫線（ｉｌ）を抽出し、モデ
ルとの類似度（ｌｓｉｍ_ml）を計算する。ここで、ｍｌ
ｈ：ｍｌの縦幅、ｍｌｗ：ｍｌの横幅、ｉｌｈ：ｉｌの
縦幅、ｉｌｗ：ｉｌの横幅とすると、ｍｌが水平罫線の
場合には、ｌｓｉｍ_ml＝ 100− 200×｜ｍｌｗ−ｉｌｗ｜／（ｍｌ
ｗ＋ｉｌｗ）とし、垂直罫線の場合には、ｌｓｉｍ_ml＝ 100− 200×｜ｍｌｈ−ｉｌｈ｜／（ｍｌ
ｈ＋ｉｌｈ）とする。Step 2: its search range (ar described above
The input ruled line (il) existing in (ea _ml ) is extracted, and the similarity (lsim _ml ) with the model is calculated. Where ml
If h: ml vertical width, mlw: ml horizontal width, ilh: il vertical width, and ilw: il horizontal width, then if ml is a horizontal ruled line, lsim _ml = 100−200 × | mlw−ilw | / (Ml
w + ilw), and in the case of a vertical ruled line, lsim _ml = 100−200 × | mlh−ilh | / (ml
h + ilh).

【０２５４】Ｓｔｅｐ３：ｌｓｉｍ_ml≧th10である入力
罫線（ｉｌ′）をすべて抽出する。Step 3: All input ruled lines (il ′) for which lsim _ml ≧ th10 are extracted.

【０２５５】Ｓｔｅｐ４：当該モデル罫線と対応づいて
いた入力罫線とｉｌ′の中で最も距離ｄｄ_mlが近い罫線
を選び、あらためてそれを当該モデルに対応づける。こ
こで、ｍｌの左上端と右下端の座標をそれぞれ（ｍｌｘ
1 ，ｍｌｙ1 ）、（ｍｌｘ2，ｍｌｙ2 ）、ｉｌ′の左
上端と右下端をそれぞれ（ｉｌ′ｘ1 ，ｉｌ′ｙ1 ）、
（ｉｌ′ｘ2 ，ｉｌ′ｙ2 ）とすると、ｍｌが水平罫線
の場合には、ｄｄ_ml＝ min（ｍｌｙ1 ，ｉｌ′ｙ1 ）− max（ｍｌｙ
2 ，ｉｌ′ｙ2 ）＋１とし、垂直罫線の場合には、ｄｄ_ml＝ min（ｍｌｘ1 ，ｉｌ′ｘ1 ）− max（ｍｌｘ
2 ，ｉｌ′ｘ2 ）＋１とする。Step 4: Select the ruled line having the shortest distance dd _ml among the input ruled lines that have been associated with the model ruled line and il ′, and associate it with the model again. Here, the coordinates of the upper left corner and the lower right corner of ml are (mlx
1, mly1), (mlx2, mly2), and the upper left and lower right ends of il 'are (il'x1, il'y1),
(Il'x2, il'y2), if ml is a horizontal ruled line, dd _ml = min (mly1, il'y1) -max (mly
2, il'y2) +1, and in the case of a vertical ruled line, dd _ml = min (mlx1, il'x1) -max (mlx
2, il'x2) +1.

【０２５６】以上の結果、入力構造化とモデル構造化特
徴間の対応付け処理は終了したことになり、入力文書の
すべての罫線特徴の座標系を元に戻すようにする。As a result of the above, the association processing between the input structuring and the model structuring features is completed, and the coordinate systems of all ruled line features of the input document are restored.

【０２５７】未対応・矛盾対応発見修正手段３９により
整合の得られた入力構造化特徴とモデル構造化特徴の対
応関係を用いて、入力文書の罫線特徴集合に対して、以
下に述べる罫線成形処理を施すことによって以後の処理
が安定に行なわれるようにしてもよい。The ruled line forming process described below is performed on the ruled line feature set of the input document by using the correspondence relationship between the input structured feature and the model structured feature that are matched by the unsupported / contradictory correspondence finding / correcting means 39. The subsequent processing may be performed stably by applying the above.

【０２５８】例えば、図３１に示すように、罫線特徴
（図中Ｌ１）の端点（図中Ｅ）がそれに直交する他の罫
線特徴（図中Ｌ２）に接していない場合には、Ｌ１の端
点の位置座標を変更して、Ｌ２に接するようにすること
で罫線特徴を成形する。この成形処理は、入力画像の水
平罫線集合と垂直罫線集合の両方に適用される。For example, as shown in FIG. 31, when the end point (E in the figure) of the ruled line feature (L1 in the figure) is not in contact with another ruled line feature (L2 in the figure) orthogonal thereto, the end point of L1 The ruled line feature is formed by changing the position coordinates of the so as to be in contact with L2. This shaping process is applied to both the horizontal ruled line set and the vertical ruled line set of the input image.

【０２５９】水平罫線集合に対する罫線成形処理部の動
作は、例えば次のようになる。ここでＬ１の左上端の座
標値を（Ｌ₁ｘ1 ，Ｌ₁ｙ1 ）、右下端の座標値を（Ｌ
₁ｘ2 ，Ｌ₁ｙ2 ）、Ｌ２の左上端の座標値を（Ｌ₂ｘ
1 ，Ｌ₂ｙ1 ）、右下端の座標値を（Ｌ₂ｘ2 ，Ｌ₂ｙ
2 ）とする。The operation of the ruled line forming processing section for a set of horizontal ruled lines is as follows, for example. Here, the coordinate value of the upper left corner of L1 is (L ₁ x1, L ₁ y1) and the coordinate value of the lower right corner is (L ₁
₁ x2, L ₁ y2), the coordinate value of the upper left corner of L2 is (L ₂ x
1, L ₂ y 1) and the coordinate value of the lower right corner is (L ₂ x 2, L ₂ y
2)

【０２６０】Ｓｔｅｐ１：入力画像に対応するモデル文
書の水平罫線特徴集合の任意の罫線特徴を選択する。Step 1: Select an arbitrary ruled line feature of the horizontal ruled line feature set of the model document corresponding to the input image.

【０２６１】Ｓｔｅｐ２：着目水平罫線の両端に接する
モデル垂直罫線を検出する。Step 2: Detect model vertical ruled lines in contact with both ends of the target horizontal ruled line.

【０２６２】例えば、図３１のＬ１の左端（点Ｅ）に接
するべき垂直罫線Ｌ２は、次のようにして検出される。
まず、探索領域を設定する。予め設定してあるしきい値
をth12とすると、探索領域は例えば、（Ｌ₁ｘ1 −th1
2，（Ｌ₁ｙ1 ＋Ｌ₁ｙ2 ）／２−th12）、（Ｌ₁ｘ1
＋th12，（Ｌ₁ｙ1 ＋Ｌ₁ｙ2 ）／２＋th12）で示され
る矩形（図３１中の破線で示されている矩形）の内部と
する。For example, the vertical ruled line L2 that should contact the left end (point E) of L1 in FIG. 31 is detected as follows.
First, the search area is set. Assuming that the threshold value set in advance is th12, the search area is, for example, (L ₁ x1 −th1
2, (L ₁ y1 + L ₁ y2) / 2-th12), (L ₁ x1
+ Th12, (L ₁ y1 + L ₁ y2) / 2 + th12) is the inside of the rectangle (the rectangle shown by the broken line in FIG. 31).

【０２６３】次に、探索領域に交差する垂直罫線のうち
以下に定義する距離：ｄｉｓｔが最小のものを抽出す
る。Next, of the vertical ruled lines intersecting the search area, the one having the smallest distance: dist defined below is extracted.

【０２６４】ｄｉｓｔ＝｜Ｌ₁ｘ1 −（Ｌ₂ｘ1 ＋Ｌ₂ｘ2 ）／２｜Ｓｔｅｐ３：着目モデル水平罫線に対応付いている入力
垂直罫線を抽出する。ここで、着目入力水平罫線の左上
端の位置座標値を（ｉｘ1 ，ｉｙ1 ）、右下端の位置座
標値を（ｉｘ2 ，ｉｙ2 ）とする。Dist = | L ₁ x1 − (L ₂ x1 + L ₂ x2) / 2 | Step 3: Extract the input vertical ruled line associated with the target model horizontal ruled line. Here, the position coordinate value of the upper left end of the focused input horizontal ruled line is (ix1, iy1), and the position coordinate value of the lower right end is (ix2, iy2).

【０２６５】Ｓｔｅｐ４：着目モデル水平罫線の左端に
接するモデル垂直罫線に対応付いている入力垂直罫線を
抽出する。ここで、その左上端の位置座標値を（ｌｘ1
，ｌｙ1 ）、右下端の位置座標値を（ｌｘ2 ，ｌｙ2
）とする。Step 4: The input vertical ruled line corresponding to the model vertical ruled line in contact with the left end of the model horizontal ruled line of interest is extracted. Here, the position coordinate value of the upper left corner is (lx1
, Ly1), and the position coordinate value of the lower right corner is (lx2, ly2
).

【０２６６】Ｓｔｅｐ５：着目モデル水平罫線の右端に
接するモデル垂直罫線に対応付いている入力垂直罫線を
抽出する。ここで、その左上端の位置座標値を（ｒｘ1
，ｒｙ1 ）、右下端の位置座標値を（ｒｘ2 ，ｒｙ2
）とする。Step 5: The input vertical ruled line associated with the model vertical ruled line in contact with the right end of the target model horizontal ruled line is extracted. Here, the position coordinate value of the upper left end is (rx1
, Ry1) and the position coordinate value of the lower right corner is (rx2, ry2
).

【０２６７】Ｓｔｅｐ６：着目入力水平罫線の左上端の
位置座標を（（ｌｘ1 ＋ｌｘ2 ）／２，ｉｙ1 ）に変更
する。Step 6: The position coordinate of the upper left end of the input horizontal ruled line of interest is changed to ((lx1 + lx2) / 2, iy1).

【０２６８】Ｓｔｅｐ７：着目入力水平罫線の右下端の
位置座標を（（ｒｘ1 ＋ｒｘ2 ）／２，ｉｙ2 ）に変更
する。Step 7: Change the position coordinates of the lower right corner of the input horizontal ruled line of interest to ((rx1 + rx2) / 2, iy2).

【０２６９】Ｓｔｅｐ８：Ｓｔｅｐ２からＳｔｅｐ７ま
での処理を、すべてのモデル水平罫線特徴に対して行な
う。Step 8: The processing from Step 2 to Step 7 is performed on all the model horizontal ruled line features.

【０２７０】文書構造獲得部４１は、入力文書とモデル
文書との間で矛盾のない構造化特徴間対応関係が得られ
た場合には、以下に示す処理を行なうことにより入力文
書の構造を獲得する。すなわち、モデル登録部３１で予
め登録されている当該モデル文書の知識を、構造化特徴
間対応関係に基づいて入力文書にコピーすることによ
り、入力文書の構造を獲得したものと見なす。The document structure acquisition unit 41 acquires the structure of the input document by performing the following processing when a consistent structured feature correspondence relationship is obtained between the input document and the model document. To do. That is, it is considered that the structure of the input document is acquired by copying the knowledge of the model document registered in advance in the model registration unit 31 into the input document based on the correspondence between the structured features.

【０２７１】具体的に説明すると、まず、モデル文書の
各罫線特徴に付与されている識別番号（以後、ｉｄと呼
ぶ）を罫線対応関係に基づいて、それに対応付いている
入力文書の罫線特徴に付与する。More specifically, first, the identification number (hereinafter referred to as id) given to each ruled line feature of the model document is determined based on the ruled line correspondence to the ruled line feature of the input document associated therewith. Give.

【０２７２】次いで、モデル文書に対して定義されてい
る種々の知識を入力文書に対して用意されている、知識
を格納するためのメモリ４３ｂの所定の領域にコピーす
る。後段の文字列領域抽出部４３、文字認識部４５、文
字認識結果出力部４７は、メモリ４３ｂの所定の領域に
コピーされた知識に基づいて動作する。Next, the various kinds of knowledge defined for the model document are copied to predetermined areas of the memory 43b for storing the knowledge prepared for the input document. The character string area extraction unit 43, the character recognition unit 45, and the character recognition result output unit 47 in the subsequent stage operate based on the knowledge copied to a predetermined area of the memory 43b.

【０２７３】文字列領域抽出部４３は、文書構造獲得部
４１で得られた当該文書に関する知識を用いて、認識対
象文字列領域として定義されている領域のみを入力画像
から切り出す。認識対象文字列領域は、例えば領域を囲
んでいる上下左右の罫線のそれぞれの識別番号で定義さ
れていてもよい。この場合、文字列領域抽出部４３で
は、入力文書画像中に位置するそれらの罫線の内側の部
分を入力画像から切り出すことにより文字列領域を抽出
する。The character string area extracting unit 43 cuts out only the area defined as the character string area to be recognized from the input image by using the knowledge about the document obtained by the document structure acquiring unit 41. The recognition target character string area may be defined by, for example, respective identification numbers of upper, lower, left, and right ruled lines surrounding the area. In this case, the character string area extracting unit 43 extracts the character string area by cutting out the portions inside the ruled lines located in the input document image from the input image.

【０２７４】文字認識部４５は、文字列領域抽出部４３
において抽出された文字列画像を、その文字列領域につ
いて定義されている知識を制約条件として用いて、例え
ば文献「信学技報、ＰＲＵ９３−４７、１９９３」に記
載されている方式に基づいた処理により、文字切り出し
／認識処理を行ない、コードデータに変換する。The character recognizing unit 45 has a character string area extracting unit 43.
Using the knowledge defined for the character string area as a constraint condition, the character string image extracted in 1) is processed based on the method described in, for example, the literature “Science Technical Report, PRU93-47, 1993”. By this, character cutting / recognition processing is performed and converted into code data.

【０２７５】このとき、各文字認識結果は類似度の降順
にソートされており、上位Ｎ位まで保持されているよう
にしてもよい。また、認識結果はオペレータとシステム
との対話的な修正作業により修正されるようになってい
てもよい。At this time, the character recognition results are sorted in descending order of similarity, and the upper N ranks may be held. Further, the recognition result may be corrected by interactive correction work between the operator and the system.

【０２７６】文字認識結果出力部４７は、文字認識部４
５による文字認識結果に対して、文字単位で修正が済ん
だ文字コードデータに応じて、入力文書に対応付いたモ
デルに関して予め指定されている出力形態に基づき、デ
ィスプレイあるいはファイルに出力する。The character recognition result output unit 47 is the character recognition unit 4
The character recognition result of 5 is output to a display or a file based on an output form designated in advance for a model associated with an input document according to the character code data corrected in character units.

【０２７７】このようにして、本発明では、入力画像か
ら抽出した罫線特徴を、特徴構造化部２９において構造
化して、さらに表特徴、接合部に関する特徴などを抽出
し、それらの関係を抽出・管理する。これらの情報を用
いて、モデル登録部３１によって予め登録されているモ
デル文書のフォーマットに関する構造化特徴間で対応付
け処理を行なう。このとき、モデル照合部３５によっ
て、入力文書とモデル文書の間で表特徴集合間の照合処
理を行ない、さらにその中に含まれる罫線特徴集合間で
照合処理を行なうことにより、以下のような効果が得ら
れる。As described above, according to the present invention, the ruled line feature extracted from the input image is structured by the feature structuring unit 29, and the table feature and the feature relating to the joint are extracted, and the relation between them is extracted. to manage. Using these pieces of information, the associating process is performed between the structured features related to the format of the model document registered in advance by the model registration unit 31. At this time, the model matching unit 35 performs the matching process between the table feature sets between the input document and the model document, and further performs the matching process between the ruled line feature sets included therein, whereby the following effects are obtained. Is obtained.

【０２７８】１．階層的な照合処理を行なうので計算量
を少なくすることができる。1. Since a hierarchical matching process is performed, the amount of calculation can be reduced.

【０２７９】２．表が複数混在している場合も取り扱う
ことができる。2. It is possible to handle the case where multiple tables are mixed.

【０２８０】３．表単位の照合処理を行なうことにより
全体的な配置関係を考慮することができ、局所的に見て
同じ特徴量を有する場合でも対応付け誤りが生じない。3. By performing the table-by-table matching process, the overall layout relationship can be taken into consideration, and even if the same feature amount is seen locally, a matching error does not occur.

【０２８１】４．表間対応ごとに大きさの倍率に関する
パラメータを求めることができ、モデル表に対応する入
力表の大きさに関するパラメータがそれぞれ独立した値
を持つような文書を取り扱うことができる。4. It is possible to obtain a parameter related to the size magnification for each correspondence between tables, and it is possible to handle a document in which the parameters related to the size of the input table corresponding to the model table have independent values.

【０２８２】５．表が印刷品質の劣化などにより分裂し
ている場合も取り扱うことができる。5. Even if the table is divided due to deterioration of print quality, it can be handled.

【０２８３】６．入力文書とモデル文書の間で罫線間の
対応関係を求めるので、罫線がかすれていたり、途切れ
ていたり、欠落している場合や余分な特徴抽出結果があ
る場合にも対応できる。6. Since the correspondence between the ruled lines is obtained between the input document and the model document, it is possible to deal with the case where the ruled lines are faint, broken or missing, or when there is an extra feature extraction result.

【０２８４】７．罫線単位で対応付け処理を行なうとき
に、複数対複数の対応を許しているので、罫線分布が局
所的に変動している場合にも対応できる。7. When the matching process is performed in ruled line units, plural-to-plural correspondence is permitted, so that it is possible to cope with a case where the ruled line distribution locally changes.

【０２８５】照合処理結果に対しては、照合結果判定部
３７によって、照合度を用いてその妥当性を評価するこ
とにより、正しい対応付け結果のみを採用することがで
きる。With respect to the collation processing result, the collation result determination section 37 evaluates the validity of the collation degree using the degree of collation, so that only the correct association result can be adopted.

【０２８６】さらに、照合処理結果に対して、未対応・
矛盾対応発見修正部３９によって、不完全な対応結果を
発見し、修正するので以下のような効果が得られる。Furthermore, the collation processing result is not supported.
Since the inconsistent correspondence finding / correcting unit 39 finds and corrects the incomplete correspondence result, the following effects can be obtained.

【０２８７】１．印刷の品質の悪い文書にも対応でき
る。1. It can handle documents with poor print quality.

【０２８８】２．特徴抽出結果が不完全である場合にも
対応できる。2. It is possible to deal with incomplete feature extraction results.

【０２８９】３．対応関係に基づいている後段の処理で
処理不能となることがない。3. There is no case where processing cannot be performed in the subsequent processing based on the correspondence relationship.

【０２９０】４．未対応箇所に対して、特徴抽出時のパ
ラメータを調整して未検出な画像特徴の抽出が可能とな
る。4. It is possible to extract undetected image features by adjusting the parameters at the time of feature extraction for uncorresponding parts.

【０２９１】また、本発明ではモデル照合処理の前に、
フォーマット種別同定部３３によって、入力文書の書式
構造種別の同定処理を行なうことにより、照合処理で適
用すべきモデルの種類を絞りこむので、無駄な照合処理
を行なわないため、以下のような効果がある。In the present invention, before the model matching process,
Since the format type identification unit 33 performs the identification process of the format structure type of the input document to narrow down the types of models to be applied in the collation process, wasteful collation process is not performed, and the following effects are obtained. is there.

【０２９２】１．計算量が少ない。1. The calculation amount is small.

【０２９３】２．構造のかけ離れたものにむりやり対応
付けることがないため高精度な処理結果が得られる。2. Highly accurate processing results can be obtained because there is no need to unreasonably associate objects with dissimilar structures.

【０２９４】３．オペレータが対象文書ごとにモデルを
手動で与える必要がないので、システムの自動運転が可
能となる。3. Since the operator does not have to manually give a model for each target document, the system can be automatically operated.

【０２９５】４．モデル登録時に正立したモデルフォー
マットから、左および右に９０度回転させたもの、１８
０度回転させたものの４種類を登録し、モデル照合時に
これらと対応付け処理を行なうことにより、文書の入力
方向を限定しなくてもよい。4. From the model format that was upright when the model was registered, rotated 90 degrees to the left and right, 18
It is not necessary to limit the input direction of the document by registering the four types that have been rotated by 0 ° and performing the matching process with these at the time of model matching.

【０２９６】[0296]

【発明の効果】以上詳述したように本発明によれば、表
形式の帳票などの書式構造を正確に認識でき、効率の良
い文字列の領域の特定が可能となるものである。As described in detail above, according to the present invention, the format structure of a tabular form or the like can be accurately recognized, and the area of a character string can be efficiently specified.

[Brief description of drawings]

【図１】本発明の一実施例に係わる画像処理装置の概略
構成を示すブロック図。FIG. 1 is a block diagram showing a schematic configuration of an image processing apparatus according to an embodiment of the present invention.

【図２】本実施例における処理対象文書の一例を示す
図。FIG. 2 is a diagram showing an example of a document to be processed in this embodiment.

【図３】本発明の一実施例である文字認識装置と組み合
わせた文書画像処理システムの概略構成を示すブロック
図。FIG. 3 is a block diagram showing a schematic configuration of a document image processing system combined with a character recognition device which is an embodiment of the present invention.

【図４】本実施例における特徴抽出部２７の構成を示す
ブロック図。FIG. 4 is a block diagram showing the configuration of a feature extraction unit 27 in this embodiment.

【図５】入力画像から抽出された線分素の例を示す図。FIG. 5 is a diagram showing an example of line segment elements extracted from an input image.

【図６】入力画像から抽出された文字候補矩形の例を示
す図。FIG. 6 is a diagram showing an example of character candidate rectangles extracted from an input image.

【図７】本実施例における特徴抽出部２７の他の構成を
示すブロック図。FIG. 7 is a block diagram showing another configuration of a feature extraction unit 27 in this embodiment.

【図８】文字候補矩形に交差・内包する線分素の例を示
す図。FIG. 8 is a diagram showing an example of line segment elements that intersect / include a character candidate rectangle.

【図９】入力画像から抽出された罫線特徴の例を示す
図。FIG. 9 is a diagram showing an example of ruled line features extracted from an input image.

【図１０】本実施例における特徴構造化部２９の構成を
示すブロック図。FIG. 10 is a block diagram showing the configuration of a characteristic structuring unit 29 in this embodiment.

【図１１】入力画像から抽出された表特徴の例を示す
図。FIG. 11 is a diagram showing an example of table features extracted from an input image.

【図１２】入力画像から抽出された接合部特徴の例を示
す図。FIG. 12 is a diagram showing an example of a joint feature extracted from an input image.

【図１３】階層的に関連づけられて管理される特徴に関
する情報の一例を示す図。FIG. 13 is a diagram showing an example of information about features that are hierarchically associated and managed.

【図１４】モデル文書の一例を示す図。FIG. 14 is a diagram showing an example of a model document.

【図１５】モデル文書の一例を示す図。FIG. 15 is a diagram showing an example of a model document.

【図１６】本実施例におけるモデル照合部３５の構成を
示すブロック図。FIG. 16 is a block diagram showing the configuration of a model matching unit 35 in this embodiment.

【図１７】本実施例における表照合部３５ｂの構成を示
すブロック図。FIG. 17 is a block diagram showing the configuration of a table matching unit 35b in this embodiment.

【図１８】任意の矩形領域の周辺に関する領域を定義す
るための説明に用いる図。FIG. 18 is a diagram used for description to define a region related to the periphery of an arbitrary rectangular region.

【図１９】本実施例における最良マッチング抽出部３５
ｂ−３の構成を示すブロック図。FIG. 19 is the best matching extraction unit 35 in this embodiment.
The block diagram which shows the structure of b-3.

【図２０】連合グラフの一例を示す図。FIG. 20 is a diagram showing an example of a association graph.

【図２１】表特徴の分裂および接触の例を示す図。FIG. 21 shows an example of splitting and contacting table features.

【図２２】本実施例における罫線照合部３５ｃの構成を
示すブロック図。FIG. 22 is a block diagram showing the configuration of a ruled line matching unit 35c in this embodiment.

【図２３】本実施例における垂直罫線照合部３５ｃ−２
および水平罫線照合部３５ｃ−３の構成を示すブロック
図。FIG. 23 is a vertical ruled line matching unit 35c-2 according to the present embodiment.
3 is a block diagram showing the configuration of a horizontal ruled line matching unit 35c-3. FIG.

【図２４】１本の罫線が分離している場合の例を示す
図。FIG. 24 is a diagram showing an example in which one ruled line is separated.

【図２５】分離している罫線を統合した例を示す図。FIG. 25 is a view showing an example in which separated ruled lines are integrated.

【図２６】複数のモデル罫線と複数の入力罫線が対応付
く例を示す図。FIG. 26 is a diagram showing an example in which a plurality of model ruled lines are associated with a plurality of input ruled lines.

【図２７】任意の１本のモデル罫線特徴に対応付く可能
性のある入力罫線特徴を抽出するための探索範囲の例を
示す図。FIG. 27 is a diagram showing an example of a search range for extracting an input ruled line feature that may be associated with any one model ruled line feature.

【図２８】連合グラフの一例を示す図。FIG. 28 is a diagram showing an example of the association graph.

【図２９】未対応・矛盾対応発見抽出処理により矛盾し
た対応が生じてしまう例を示す図。FIG. 29 is a diagram showing an example in which inconsistent correspondence occurs due to an unsupported / contradictory correspondence discovery / extraction process.

【図３０】未対応・矛盾対応発見抽出処理の流れ示すフ
ローチャートの例を表す図。FIG. 30 is a diagram showing an example of a flowchart showing a flow of uncorrespondence / contradiction correspondence discovery / extraction processing.

【図３１】罫線特徴の端点がそれに直交する他の罫線特
徴に接していない例を示す図。FIG. 31 is a diagram showing an example in which an end point of a ruled line feature is not in contact with another ruled line feature orthogonal to it.

[Explanation of symbols]

１１，２１…画像入力部、１２…特徴抽出部、１３…特
徴構造化部、１４…書式構造情報登録部、１５…書式構
造種別同定部、１６…書式構造情報照合部、１７…未対
応・矛盾対応発見修正部、１８…照合結果判定部、１９
…文書構造獲得部、２３…２値化処理部、２５…前処理
部、２７…特徴抽出部、２７ａ…線分抽出部、２７ｂ…
文字候補矩形抽出部、２７ｃ…罫線特徴抽出部、２７ｄ
…フィルタリング処理部、２９…特徴構造化部、２９ａ
…罫線グループ化処理部、２９ｂ…表特徴抽出部、２９
ｃ…罫線接合部検出部、２９ｄ…特徴間関係記述部、３
１…モデル登録部、３３…フォント種別同定部、３５…
モデル照合部、３５ａ…選択部、３５ｂ…表照合部、３
５ｂ−１…対応可能ペア検出部、３５ｂ−２…異種対応
可能ペア間両立関係判定部、３５ｂ−３…最良マッチン
グ抽出部、３５ｂ−３ａ…連合グラフ作成部，３５ｂ−
３ｂ…最大クリーク抽出部、３５ｃ…罫線照合部、３５
ｃ−１…表対応選択部、３５ｃ−２…垂直罫線照合部、
３５ｃ−２ａ…対応可能罫線特徴ペア検出部、３５ｃ−
２ｂ…対応可能罫線特徴ペア間両立性判定部、３５ｃ−
２ｃ…最良マッチング抽出部、３５ｃ−３…水平罫線照
合部、３５ｃ−４…方向間整合獲得部、３５ｄ…照合度
計算部、３５ｅ…照合結果出力部、３７…照合結果判定
部、３９…未対応矛盾対応発見修正部、４１…文書構造
獲得部、４３…文字列領域抽出部、４５…文字認識部、
４７…文字認識結果出力部。11, 21 ... Image input section, 12 ... Feature extraction section, 13 ... Feature structuring section, 14 ... Format structure information registration section, 15 ... Format structure type identifying section, 16 ... Format structure information collating section, 17 ... Not supported Conflict correspondence finding / correction unit, 18 ... Collation result determination unit, 19
... Document structure acquisition unit, 23 ... Binarization processing unit, 25 ... Pre-processing unit, 27 ... Feature extraction unit, 27a ... Line segment extraction unit, 27b ...
Character candidate rectangle extraction unit, 27c ... Ruled line feature extraction unit, 27d
... Filtering processing unit, 29 ... Feature structuring unit, 29a
... Ruled line grouping processing unit, 29b ... Table feature extraction unit, 29
c ... Ruled line joint detecting section, 29d ... Inter-feature relation describing section, 3
1 ... Model registration unit, 33 ... Font type identification unit, 35 ...
Model collating unit, 35a ... Selecting unit, 35b ... Table collating unit, 3
5b-1 ... Corresponding pair detection unit, 35b-2 ... Heterogeneous correspondence inter-pair compatibility relationship determination unit, 35b-3 ... Best matching extraction unit, 35b-3a ... Union graph creation unit, 35b-
3b ... Maximum clique extraction unit, 35c ... Ruled line collation unit, 35
c-1 ... Table correspondence selection unit, 35c-2 ... Vertical ruled line collation unit,
35c-2a ... Compatible ruled line feature pair detection unit, 35c-
2b ... Compatible ruled line feature pair compatibility determination unit, 35c-
2c ... best matching extraction unit, 35c-3 ... horizontal ruled line collation unit, 35c-4 ... direction matching acquisition unit, 35d ... collation degree calculation unit, 35e ... collation result output unit, 37 ... collation result determination unit, 39 ... not yet Correspondence contradiction correspondence finding / correction unit, 41 ... Document structure acquisition unit, 43 ... Character string region extraction unit, 45 ... Character recognition unit,
47 ... Character recognition result output unit.

Claims

[Claims]

1. An image input means for generating an input image from a document, and a format structure information registration for previously registering information (format structure model) on the format structure of a processing target document used for recognizing the format structure of the input image. Means, a feature extraction means for extracting a geometric figure feature quantity from the input image generated by the image input means, and a graphic feature quantity extracted by the feature extraction means are grouped to generate an image feature, Feature structuring means for extracting and managing the relationship between the respective image features, and image features of the input image obtained by the feature structuring means,
A format structure type identifying unit that narrows down candidates for the format structure type of the input document by using information about the format structure of the processing target document registered in advance by the format structure information registering unit, and a candidate by the format structure identifying unit. All the format structure models that have become and the features of the input document structured by the feature structuring means are associated with each other, and the set of the format structure model and the input document that best correspond to each other is selected. In the format structure information matching unit that obtains the correspondence relationship and in the correspondence between the format structure document selected by the format structure information matching unit and the structured feature in the input document, incomplete correspondence and inconsistent correspondence are eliminated. An uncorrespondence / contradiction correspondence finding / correcting means for acquiring the correspondence between the format structure model and the structured features of the input document which are matched by Copying, in the input document, information about the pre-registered format structure model based on the correspondence between the structure structure model and the structured features of the input document obtained by the response / contradiction correspondence finding / correcting means. A document image processing apparatus, comprising: a document structure acquisition means for acquiring the format structure of an input document and related information.

2. An image input unit for generating an input image from a document, a line segment and a graphic feature relating to a character component are extracted from the input image generated by the image input unit, and the region other than the character component in the input image is further extracted. From the feature extraction means for extracting the features related to the line segment as the graphic features forming the ruled lines, and the feature related to the table by grouping the graphic features related to the ruled lines extracted by the feature extraction means,
Document image processing characterized by comprising: feature structuring means for extracting information about a joint portion generated at a portion where ruled lines intersect / connect in the feature relating to each table, and extracting / managing a relation between the features. apparatus.

3. An image input unit for generating an input image from a document, a line segment and a graphic feature relating to a character component are extracted from the input image generated by the image input unit, and the region other than the character component in the input image is further extracted. The ruled line feature extracting means for extracting the feature relating to the line segment as a graphic feature forming the ruled line and the set of the graphic feature relating to the ruled line extracted by the ruled line feature extracting means are the same group of ruled lines intersecting / connecting A document image processing apparatus comprising: a table feature extracting unit that extracts a feature related to a table by combining the above.

4. An image input means for generating an input image from a document, and format structure information registration for previously registering information (format structure model) relating to the format structure of a processing target document used for recognizing the format structure of the input image. Means and the graphic feature relating to the line segment and the character component from the input image generated by the image input means, and further extracting the feature relating to the line segment from the area other than the character component in the input image by regarding it as the graphic feature relating to the ruled line. A feature extraction unit and a feature related to a table are extracted by grouping graphic features related to ruled lines extracted by the feature extraction unit,
Feature structuring means for extracting information on a joint portion generated at a portion where ruled lines intersect / connect in the feature relating to each table, and extracting / managing a relation between the respective features, and the input obtained by the feature structuring means. Table collating means for performing collation processing between the characteristic relating to the table of the document and the characteristic relating to the table forming the format structure model registered in advance by the format structure information registering means, A ruled line collating unit that obtains a correspondence relationship between the ruled lines that form the table of the input document in the table correspondence obtained by the table matching unit and the ruled lines that form the table of the format structure model that corresponds to the ruled line, Collation result determining means for calculating the degree of collation indicating the degree of correspondence between the features based on the result of the collation processing, and determining whether or not the correct correspondence is performed. A document image processing device characterized by the above.

5. An image input means for generating an input image from a document, and format structure information registration for previously registering information (format structure model) on the format structure of a processing target document used for recognizing the format structure of the input image. Means, a feature extraction means for extracting a geometric figure feature amount from the input image generated by the image input means, and a group of graphic feature amounts extracted by the feature extraction means to generate image features, Feature structuring means for extracting and managing the relationship between the respective features, structured image features of the input image obtained by the feature structuring means, and processing registered in advance by the format structure information registering means The similarity is calculated using the information about the format structure of the target document, and the format structure model with the highest similarity or a plurality of format structure models in order from the highest similarity is calculated. Or a format structure model having a similarity of a certain value or more, and a format structure type identifying means for narrowing down the type of the format structure of the input document to one or a plurality of candidates. Document image processing device.

6. An image input means for generating an input image from a document, and format structure information registration for previously registering information (format structure model) relating to the format structure of a processing target document used for recognizing the format structure of the input image. Means and the graphic feature relating to the line segment and the character component from the input image obtained by the image input means, and further extracting the feature relating to the line segment from the region other than the character component in the input image as the graphic feature relating to the ruled line. A feature extraction unit and a feature related to a table are extracted by grouping graphic features related to ruled lines extracted by the feature extraction unit,
Feature structuring means for extracting information about a joint portion generated at a portion where ruled lines intersect / connect in the feature relating to each table, and extracting / managing a relation between the features, and an input image obtained by the feature structuring means. Of the structured feature of the document and the information about the format structure of the document to be processed registered in advance by the format structure information registration means, the similarity is calculated, and the format structure model or the similarity of the highest similarity is calculated. A format structure type identifying means for selecting a plurality of format structure models in order from the highest one or a format structure model having a similarity of a certain value or more and narrowing down the format structure type of the input document to one or more candidates. , For each of the format structure models selected by the format structure type identifying means, with respect to the table of the input document obtained by the feature structuring means. And a table collating unit that performs a collation process between the features related to the table that constitutes the format structure model registered by the format structure information registering unit and obtains an inter-table correspondence relationship; The ruled line collating means for acquiring the correspondence between the ruled lines forming the table of the input document and the ruled lines forming the table of the format structure model corresponding to the ruled lines in the correspondence relation of the created table, and the ruled line collating means. The matching degree calculation means for calculating the matching degree representing the degree of correspondence between the features with respect to the obtained correspondence relationship, and the maximum matching among the matching degrees of the respective format structure models calculated by the matching degree calculation means. Of the input document and the format structure model using the matching result output means for extracting the format structure model indicating the degree, and the maximum matching degree of the format structure model extracted by the matching result output means. Document image processing apparatus characterized by comprising a comparison result determining means for determining whether or not the correct correspondence between Zoka feature is being performed, the.

7. The format structure information registration means registers information about the format structure of an upright processing target document when registering information about the format structure of a processing target document used for recognizing the format structure of an input document. Characters that are generated by rotating at a plurality of predetermined angles, give information about how many times it is rotated from the upright one, and register all of them as information about the format structure of the processing target document The document image processing apparatus according to claim 1, claim 4, claim 5, or claim 6.

8. An image input means for generating an input image from a document and an upright processing target document when registering information on the format structure of the processing target document used for recognizing the format structure of the input document. Generates information about the format structure rotated at multiple predetermined angles, gives information about how many times it has rotated from the upright one, and registers all of them as information about the format structure of the processing target document. Format structure information registering means, feature extracting means for extracting geometrical graphic feature quantities from the input image generated by the image input means, and grouping graphic feature quantities extracted by the feature extracting means. The feature structuring means for generating image features by using the feature structuring means for extracting and managing the relationship between the features, and the structuring of the input image obtained by the feature structuring means. Format structure type identification for narrowing down the type structure of the input document to one or a plurality of candidates using the image feature and the information on the format structure of the processing target document registered in advance by the format structure information registration means. Means, and all the format structure models that are candidates in the format structure type identifying means and the features structured by the feature structuring means of the input document are associated with each other, and the format structure with the best correspondence is obtained. Model matching means for selecting a pair of a model and an input document and acquiring the corresponding relationship, and incomplete matching in the correspondence between the structured structure document and the structured feature in the input document selected by the model matching means. Between the structured structure model of the input document and the structured features of the input document, which are matched by discovering whether or not there is an inconsistent correspondence and eliminating them. When the unstructured and inconsistent correspondence finding / correcting means for acquiring the correspondence and the format structure model associated with the input image by the model matching means are rotated by a predetermined angle, the rotation angle is set upright. The image rotation means for rotating the input image in the direction to correspond to the erect format model, and the format registered in advance based on the correspondence between the format structure model and the structured features of the input document. A document image processing apparatus, comprising: a document structure acquisition unit that acquires the format structure of an input document and related information by copying information about a structure model into the input document.

9. An image input means for generating an input image from a document, and format structure information registration for previously registering information (format structure model) on the format structure of a processing target document used for recognizing the format structure of the input image. Means for extracting graphic features relating to line segments and character components from the input image generated by the image input means, and determining features relating to line segments from regions other than character components in the input image as graphic features relating to ruled lines (ruled line features). A feature extraction unit that considers and extracts a feature (table feature) related to a table by grouping the ruled line features extracted by the feature extraction unit, and joins that occur in the portions where the ruled lines intersect / connect in each table feature. Feature structuring means for extracting information about parts and extracting / managing relations between the respective features; and Table collating means for collating processing between the table features of the input document and table features forming the format structure model registered in advance by the format structure information registering means, and acquiring table correspondence relation; A ruled line collating unit that obtains a correspondence between the ruled line features that form the table of the input document and the corresponding ruled lines that form the table of the format structure model corresponding to the table correspondence obtained by the matching unit; Collation result judging means for calculating a collation degree indicating a degree of correspondence between the features with respect to the correspondence relation acquired by the means, and judging whether or not correct correspondence is made; In the correspondence relationship between the input document and the ruled line feature of the format structure model, which is determined to be correct by the above, the ruled line feature of the input document (input ruled line feature) is not associated. A means for extracting ruled line features (unsupported model ruled line features) of the format structure model, and a correspondence relationship when an input ruled line feature to be associated with the unsupported model ruled line feature is associated with another model ruled line feature Means for associating an unsupported model ruled line feature with its input ruled line feature, and if there is a missing unsupported input ruled line feature to be associated with the unsupported model ruled line feature, A means for associating input ruled line features, and when an input ruled line feature to be associated with the unsupported model ruled line feature is not found, a new input ruled line feature to be associated is newly generated and an unsupported model ruled line feature is newly generated. A document image processing apparatus, comprising: means for correcting non-correspondence and inconsistency correspondence by means for associating the input ruled line features.

10. An image input means for generating an input image from a document, and format structure information registration for previously registering information (format structure model) on the format structure of a processing target document used for recognizing the format structure of the input image. Means for extracting graphic features relating to line segments and character components from the input image generated by the image inputting means, and determining features relating to line segments from regions other than character components in the input image as graphic features relating to ruled lines (ruled line features). A feature extraction unit that considers and extracts a feature (table feature) related to a table by grouping the ruled line features extracted by the feature extraction unit, and joins that occur in the portions where the ruled lines intersect / connect in each table feature. Feature structuring means for extracting information about parts and extracting / managing relations between the respective features; and Table collating means for performing collation processing between the table features of the input document and the table features constituting the format structure model registered in advance by the format structure information registering means, and table collating means for obtaining the correspondence relation between tables; Ruled line matching means for acquiring a correspondence relationship between the ruled line feature forming the table of the input document and the ruled line feature forming the table of the format structure model corresponding thereto in the correspondence relationship of the table obtained by the means; In the correspondence relationship between the ruled line features acquired by the means, the ruled line matching that corrects the connection relationship between the ruled lines in the ruled line feature set of the corresponding input document based on the connection relationship between the ruled line features in the ruled line feature set of the format structure model. A document image processing apparatus comprising: a post-processing unit.