JP6876300B2

JP6876300B2 - Predictor

Info

Publication number: JP6876300B2
Application number: JP2017167634A
Authority: JP
Inventors: 越智　大介; 大介越智; 隆史山内; 淑希恒元; 檜山　聡; 聡檜山; 正朗長崎; 準一菅原; 修田邉; 理宇山下
Original assignee: Tohoku University NUC; NTT Docomo Inc
Current assignee: Tohoku University NUC; NTT Docomo Inc
Priority date: 2017-08-31
Filing date: 2017-08-31
Publication date: 2021-05-26
Anticipated expiration: 2037-08-31
Also published as: JP2019045264A

Description

本発明は、一般に医療診断技術に関し、より詳細には、非侵襲的に早産の発症予測又は異常検出を行うために情報を収集し早産発症予測を行う予測装置及び方法に関する。 The present invention relates generally to medical diagnostic techniques, and more specifically to predictors and methods for collecting information and predicting the onset of preterm birth in order to non-invasively predict the onset of preterm birth or detect abnormalities.

妊娠関連疾患として、早産は、原因不明が約3分の1を占める周産期医療上の代表的な疾患であり、複雑な相互作用により発症する多因子疾患である。妊婦の歯周炎と早産・低体重児出産との関連が報告されているが、その詳細なメカニズムは不明な点が多い。 As a pregnancy-related disease, preterm birth is a typical perinatal medical disease in which the cause is unknown in about one-third, and it is a multifactorial disease that develops through complex interactions. The relationship between periodontitis in pregnant women and the birth of premature and low birth weight infants has been reported, but the detailed mechanism remains unclear.

非特許文献１によると、妊婦124名に対し、妊娠期間中あるいは産後3日以内にClinical attachment level (CAL)、Probing depth (PD)を測定し、PLBW(Preterm Low Birth Weight)だったケース群とそうでないコントロール群で評価したところ、ケース群（31名）のCAL平均値がコントロール群よりも有意に大きかった。また、CALが3mm以上の歯が60%以上を占めた妊婦においては、PLBW発症危険率が全体で5.9倍、初産で6.7倍だった。 According to Non-Patent Document 1, 124 pregnant women were measured for clinical attachment level (CAL) and Probing depth (PD) during pregnancy or within 3 days after delivery, and were PLBW (Preterm Low Birth Weight). When evaluated in the control group that did not, the CAL mean value of the case group (31 persons) was significantly larger than that of the control group. In pregnant women with CALs of 3 mm or more accounting for 60% or more, the risk of developing PLBW was 5.9 times overall and 6.7 times at the first delivery.

また、非特許文献２によると、切迫早産から早産となった18名とそうでない正期産40名の第2トライメスターの歯垢細菌叢を比較したところ、歯周病関連細菌の一つであるTannerella forsythenesisの割合が有意に高かった。このTannerella forsythenesisは、歯周病に関連のある細菌と言われる３つのred complexのうちの一つである。 In addition, according to Non-Patent Document 2, a comparison of the plaque flora of the second trimester of 18 patients who had a preterm birth due to imminent preterm birth and 40 patients who did not have a term birth was one of the periodontal disease-related bacteria. The proportion of certain Tannerella forsythenesis was significantly higher. This Tannerella forsythenesis is one of three red complexes called bacteria related to periodontal disease.

Offenbacher S, et al, Periodontal Infection as a Possible Risk Factor for Preterm Low Birth Weight, J Peridontol, 67, 1103-1113, 1996Offenbacher S, et al, Periodontal Infection as a Possible Risk Factor for Preterm Low Birth Weight, J Peridontol, 67, 1103-1113, 1996 Hasegawa K, et al, Associations between systemic status, oeruidibtak statysm seryn cytokine levels and delivery outcomes in pregnant women with a diagnosis of threatened premature labor (TPL), J Peridontol, 74, 1764-1770, 2003Hasegawa K, et al, Associations between systemic status, oeruidibtak statysm seryn cytokine levels and delivery outcomes in pregnant women with a diagnosis of threatened premature labor (TPL), J Peridontol, 74, 1764-1770, 2003

Offenbacherらの報告を発端として、世界各地で同様に歯周病と早産・低体重との関連が報告されているが、これらの関係は人種差があり、日本人の妊婦を対象にした研究が少ない。長谷川らの報告によると、時系列で1点のみの測定であるため、経時的な変化による早産への影響が不明である。また、単に関連性のみを報告しているが、それが早産の予測にどれだけ役立つのかの評価がない。 Starting with the report by Offenbacher et al., The association between periodontal disease and preterm birth / underweight has been reported all over the world, but there are racial differences in these relationships, and studies targeting pregnant Japanese women have been conducted. Few. According to the report by Hasegawa et al., Since only one point was measured in chronological order, the effect of changes over time on preterm birth is unknown. It also reports only relevance, but there is no assessment of how useful it is in predicting preterm birth.

上述した問題点を鑑み、本発明の課題は、口腔内細菌叢から早産発症予測を実現するための予測装置及び方法を提供することである。 In view of the above-mentioned problems, an object of the present invention is to provide a prediction device and a method for realizing prediction of the onset of preterm birth from the oral bacterial flora.

上記課題を解決するため、本発明の一態様は、正期産群の妊婦と早産群の妊婦との口腔内の細菌叢に関する細菌叢データを取得するデータ取得機能と、前記細菌叢データに基づき早産予測モデルを生成する早産予測モデル生成機能と、前記早産予測モデルに基づき、前記取得した細菌叢データから予測対象の妊婦の早産発症予測を実行する早産予測機能と、前記早産発症予測の結果を通知する結果通知機能と、を有し、前記データ取得機能は、前記予測対象の妊婦の妊娠前後の何れかの時期から出産前後の何れかの時期までのデータ取得期間において、前記細菌叢データを複数の時点にて取得し、前記早産予測機能は、前記複数の時点において取得された前記細菌叢データに基づき前記早産発症予測を実行する予測装置に関する。
To solve the above problems, one aspect of the present invention includes a data acquisition function of acquiring flora data relating flora in the oral cavity of the pregnant women pregnant with preterm group full-term production group, based on the flora data The preterm birth prediction model generation function that generates a preterm birth prediction model, the preterm birth prediction function that executes the preterm birth prediction prediction of the pregnant woman to be predicted from the acquired bacterial flora data based on the preterm birth prediction model, and the result of the preterm birth onset prediction. It has a result notification function for notifying, and the data acquisition function obtains the bacterial flora data during a data acquisition period from any time before or after pregnancy to any time before or after childbirth of the pregnant woman to be predicted. The preterm birth prediction function, which is acquired at a plurality of time points, relates to a prediction device that executes the preterm birth onset prediction based on the bacterial flora data acquired at the plurality of time points.

本発明によると、口腔内細菌叢から早産発症予測を実現するための予測装置及び方法を提供することができる。 According to the present invention, it is possible to provide a prediction device and a method for realizing prediction of the onset of preterm birth from the oral bacterial flora.

図１は、本発明の一実施例による予測システムの構成を示す概略図である。FIG. 1 is a schematic view showing a configuration of a prediction system according to an embodiment of the present invention. 図２は、本発明の一実施例による予測装置の機能構成を示すブロック図である。FIG. 2 is a block diagram showing a functional configuration of a prediction device according to an embodiment of the present invention. 図３は、本発明の一実施例による予測処理を示すフローチャートである。FIG. 3 is a flowchart showing a prediction process according to an embodiment of the present invention. 図４は、本発明の一実施例による特徴量抽出・モデル作成処理を示すフローチャートである。FIG. 4 is a flowchart showing a feature amount extraction / model creation process according to an embodiment of the present invention. 図５は、本発明の他の実施例による特徴量抽出・モデル作成処理を示すフローチャートである。FIG. 5 is a flowchart showing a feature amount extraction / model creation process according to another embodiment of the present invention. 図６は、本発明の他の実施例による特徴量抽出・モデル作成処理を示すフローチャートである。FIG. 6 is a flowchart showing a feature amount extraction / model creation process according to another embodiment of the present invention.

以下、図面に基づいて本発明の実施の形態を説明する。 Hereinafter, embodiments of the present invention will be described with reference to the drawings.

後述される実施例では、早産発症予測のための予測装置が開示される。後述される実施例によると、複数の妊娠フェーズにおいて採取された妊婦の口腔内の細菌叢に関する細菌叢データを取得すると、予測装置は、早産発症と相関を有する細菌種を抽出し、抽出した細菌種を用いて生成した早産予測モデルに基づき、取得した細菌叢データから妊婦の早産発症予測を実行する。これにより、人種間の差があると共に、経時的に変化すると考えられている細菌叢データからの早産予測に対して、特定の人種に適した早産予測モデルが生成可能になる。 In the examples described later, a prediction device for predicting the onset of preterm birth is disclosed. According to the examples described below, when the bacterial flora data on the oral flora of pregnant women collected in multiple pregnancy phases is acquired, the predictor extracts the bacterial species that correlates with the onset of preterm birth and the extracted bacteria. Based on the preterm birth prediction model generated using seeds, the onset of preterm birth in pregnant women is predicted from the acquired bacterial flora data. This makes it possible to generate a preterm birth prediction model suitable for a specific race, as opposed to the preterm birth prediction from bacterial flora data, which is considered to change over time as well as being different between races.

まず、図１を参照して、本発明の一実施例による予測システムを説明する。図１は、本発明の一実施例による予測システムの構成を示す概略図である。 First, a prediction system according to an embodiment of the present invention will be described with reference to FIG. FIG. 1 is a schematic view showing a configuration of a prediction system according to an embodiment of the present invention.

図１に示されるように、予測システム１０は、細菌叢計測装置５０及び予測装置１００を有する。 As shown in FIG. 1, the prediction system 10 includes a bacterial flora measuring device 50 and a predicting device 100.

細菌叢計測装置５０は、採取された歯垢検体から口腔内の細菌叢を測定し、生成された細菌叢データを予測装置１００に提供する。具体的には、検査技師が妊婦の歯垢検体を採取し、細菌叢計測装置５０を用いて採取した歯垢検体に存在する細菌種を示す細菌叢データを取得し、取得した細菌叢データを予測装置１００に提供する。 The bacterial flora measuring device 50 measures the bacterial flora in the oral cavity from the collected plaque sample, and provides the generated bacterial flora data to the prediction device 100. Specifically, an inspection engineer collects a plaque sample of a pregnant woman, acquires bacterial flora data indicating the bacterial species present in the plaque sample collected using the bacterial flora measuring device 50, and obtains the acquired bacterial flora data. Provided to the prediction device 100.

人間の口腔内に存在する細菌種として、例えば、Porphyromonas gingivalis, Treponema denticola, Tabberella forsythensis, Bergeyella, Kingella denitrificans, Scardovia wiggsiae, Streptococus mutans, Selenomonas, Capnocytophaga, Treponema vincentiiなどを含む約200種類以上の細菌種が知られており、特にPorphyromonas gingivalis, Treponema denticola, Tabberella forsythensisは、歯周病に関わるred complexとして分類され、妊娠関連疾患との相関が指摘されている。細菌叢計測装置５０は、口腔内の細菌種を計測するのに用いられる何れか既知のものであってもよい。 More than 200 bacterial species present in the human oral cavity, including, for example, Porphyromonas gingivalis, Treponema denticola, Tabberella forsythensis, Bergeyella, Kingella denitrificans, Scardovia wiggsiae, Streptococus mutans, Selenomonas, Capnocytophaga, Treponema vincentii, etc. Known, especially Porphyromonas gingivalis, Treponema denticola, and Tabberella forsythensis are classified as a red complex related to periodontal disease, and their correlation with pregnancy-related diseases has been pointed out. The bacterial flora measuring device 50 may be any known one used for measuring the bacterial species in the oral cavity.

予測装置１００は、提供された細菌叢データから早産発症を予測する。具体的には、予測装置１００は、以下で詳細に説明されるように、早産群の細菌叢データ及び非早産群又は正期産群の細菌叢データを含む既知のデータから早産予測モデルを作成し、作成した早産予測モデルに予測対象となる妊婦の細菌叢データを入力することによって、当該妊婦の早産の発症の有無を予測する。なお、本発明による早産発症予測は、早産の発症の有無に限定されず、後述されるように、出産日の予測、早産リスクの算出などを含むものであってもよい。 The prediction device 100 predicts the onset of preterm birth from the provided bacterial flora data. Specifically, the prediction device 100 creates a preterm birth prediction model from known data including preterm birth group bacterial flora data and non-preterm birth group or term birth group bacterial flora data, as described in detail below. Then, by inputting the bacterial flora data of the pregnant woman to be predicted into the created preterm birth prediction model, the presence or absence of the onset of preterm birth of the pregnant woman is predicted. The prediction of the onset of preterm birth according to the present invention is not limited to the presence or absence of the onset of preterm birth, and may include prediction of the date of birth, calculation of the risk of preterm birth, and the like, as will be described later.

ここで、予測装置１００は、典型的には、コンピュータ、サーバ、スマートフォン、タブレットなどの通信機能を備えた情報処理装置により実現されてもよい。例えば、情報処理装置に搭載されたプロセッサが、メモリ装置に格納されたデータやプログラムを処理及び実行することによって、後述される各種機能及び処理を実行する。しかしながら、予測装置１００は、何れか特定のハードウェア構成に限定されるものでなく、適切なハードウェア構成により実現されてもよい。 Here, the prediction device 100 may be typically realized by an information processing device having a communication function such as a computer, a server, a smartphone, or a tablet. For example, a processor mounted on an information processing device executes various functions and processes described later by processing and executing data and programs stored in the memory device. However, the prediction device 100 is not limited to any specific hardware configuration, and may be realized by an appropriate hardware configuration.

次に、図２を参照して、本発明の一実施例による予測装置を説明する。図２は、本発明の一実施例による予測装置の機能構成を示すブロック図である。 Next, a prediction device according to an embodiment of the present invention will be described with reference to FIG. FIG. 2 is a block diagram showing a functional configuration of a prediction device according to an embodiment of the present invention.

図２に示されるように、予測装置１００は、データ取得機能１１０、データ保存機能１２０、早産予測モデル生成機能１３０、早産予測機能１４０及び結果通知機能１５０を有する。 As shown in FIG. 2, the prediction device 100 has a data acquisition function 110, a data storage function 120, a premature birth prediction model generation function 130, a premature birth prediction function 140, and a result notification function 150.

データ取得機能１１０は、細菌叢計測装置５０から細菌叢データを取得する。具体的には、データ取得機能１１０は、妊婦の妊娠前後の何れかの時期から出産前後の何れかの時期までのデータ取得期間において、妊婦に関する細菌叢データを複数の時点にて取得する。例えば、データ取得機能１１０は、細菌叢計測装置５０から、異なる妊娠フェーズ（例えば、１２〜２３週と２４〜３４週）に採取された妊婦の口腔内の細菌叢データを取得してもよい。 The data acquisition function 110 acquires bacterial flora data from the bacterial flora measuring device 50. Specifically, the data acquisition function 110 acquires bacterial flora data related to a pregnant woman at a plurality of time points during a data acquisition period from any time before and after pregnancy of a pregnant woman to any time before and after childbirth. For example, the data acquisition function 110 may acquire the bacterial flora data in the oral cavity of a pregnant woman collected in different pregnancy phases (for example, 12 to 23 weeks and 24 to 34 weeks) from the bacterial flora measuring device 50.

データ保存機能１２０は、取得した妊婦の口腔内の細菌叢データを保存する。具体的には、データ保存機能１２０は、取得した細菌叢データを各妊娠フェーズで採取された細菌種（例えば、２００菌種など）のそれぞれを独立した変数として保存する。例えば、２つの妊娠フェーズでそれぞれ２００菌種の細菌種が採取された場合、経時的な複数時点の歯垢細菌組成の変化を加味したモデルを作成できるように、４００個（２００菌種×２妊娠フェーズ）の変数が設定されてもよい。 The data storage function 120 stores the acquired bacterial flora data in the oral cavity of a pregnant woman. Specifically, the data storage function 120 stores the acquired bacterial flora data as independent variables for each of the bacterial species (for example, 200 bacterial species) collected in each pregnancy phase. For example, when 200 bacterial species are collected in each of the two pregnancy phases, 400 (200 bacterial species x 2) can be created so that a model that takes into account changes in the composition of dental plaque bacteria at multiple time points over time can be created. The variable of pregnancy phase) may be set.

早産予測モデル生成機能１３０は、取得された妊婦の口腔内の細菌叢データに基づき、早産の発症の有無を予測する早産予測モデルを生成する。具体的には、早産予測モデル生成機能１３０は、正期産により出産した妊婦（正期産群）と早産により出産した妊婦（早産群）とを含む複数の妊婦から取得した口腔内の細菌叢データに基づき、早産発症予測モデルを生成及び更新してもよい。例えば、以下で詳細に説明されるように、早産予測モデル生成機能１３０は、取得した細菌叢データから早産群と正期産群とをよく分ける細菌種を特定し、変数を削減するため、二群をよく分ける細菌種として１０〜２０個の細菌種×妊娠フェーズを抽出してもよい。早産群と正期産群とをよく分ける細菌種として、red complexとして分類されるPorphyromonas gingivalis, Treponema denticola, Tabberella forsythensisや、Bergeyella, Kingella denitrificans, Scardovia wiggsiae, Streptococus mutans, Selenomonas, Capnocytophaga, Treponema vincentiiなどが抽出されてもよい。 The preterm birth prediction model generation function 130 generates a preterm birth prediction model that predicts the presence or absence of the onset of preterm birth based on the acquired bacterial flora data in the oral cavity of the pregnant woman. Specifically, the preterm birth prediction model generation function 130 is a bacterial flora in the oral cavity acquired from a plurality of pregnant women including a pregnant woman who gave birth by term birth (term birth group) and a pregnant woman who gave birth by preterm birth (preterm birth group). Preterm birth prediction models may be generated and updated based on the data. For example, as described in detail below, the preterm birth prediction model generation function 130 identifies the bacterial species that distinguishes the preterm birth group and the term birth group from the acquired bacterial flora data, and reduces the variables. 10 to 20 bacterial species x pregnancy phases may be extracted as bacterial species that divide the group well. Porphyromonas gingivalis, Treponema denticola, Tabberella forsythensis, Bergeyella, Kingella denitrificans, Scardovia wiggsiae, Streptococus mutans, Selenomonas, Capnocytophaga, Treponema vinsis, etc. It may be extracted.

続いて、早産予測モデル生成機能１３０は、抽出された細菌種を特徴量として用いて早産予測モデルを作成する。具体的な予測モデルの作成手法として、例えば、早産予測モデル生成機能１３０は、Linear SVMを用いて、特徴量として用いられる細菌種をforward stepwise selectionによって選択してもよい。例えば、最初に２０個の特徴量（妊娠フェーズごとの細菌種）について、早産予測モデル生成機能１３０は、１つの特徴量だけを用いて早産予測モデルM(1)〜M(20)を作成し、最も性能の良いM(A)（特徴量Aを使用）を特定する。次に、早産予測モデル生成機能１３０は、当該特徴量を除いた残りの１９個の特徴量から１つの特徴量を選択してAと併せた２変数の予測モデルM(A,1)〜M(A,19)を作成し、M(A)を含めて最適なモデルを選択する。早産予測モデル生成機能１３０は、上述した一連の処理を再帰的に実行し、最終的に予測モデルM(A,B,C,...)を特定する。 Subsequently, the preterm birth prediction model generation function 130 creates a preterm birth prediction model using the extracted bacterial species as a feature amount. As a specific method for creating a prediction model, for example, the preterm birth prediction model generation function 130 may use Linear SVM to select a bacterial species to be used as a feature by forward stepwise selection. For example, for the first 20 features (bacterial species for each pregnancy phase), the preterm birth prediction model generation function 130 creates preterm birth prediction models M (1) to M (20) using only one feature amount. , Identify the best performing M (A) (using feature A). Next, the preterm birth prediction model generation function 130 selects one feature amount from the remaining 19 feature amounts excluding the feature amount and combines it with A, which is a two-variable prediction model M (A, 1) to M. Create (A, 19) and select the optimal model including M (A). The premature birth prediction model generation function 130 recursively executes the above-mentioned series of processes, and finally identifies the prediction model M (A, B, C, ...).

なお、上述した実施例では、早産の発症有無を示す早産発症予測モデルについて説明したが、本発明による早産予測モデルは、これに限定されず、出産日を予測する出産日予測及び早産発症リスクモデル又は発症確率を予測する早産発症リスクモデルであってもよい。 In the above-mentioned examples, the preterm birth prediction model showing the onset of preterm birth has been described, but the preterm birth prediction model according to the present invention is not limited to this, and the birth date prediction and the preterm birth risk model for predicting the birth date are not limited to this. Alternatively, it may be a preterm birth risk model that predicts the onset probability.

早産予測機能１４０は、早産予測モデルに基づき、取得した妊婦の細菌叢データから当該妊婦の早産発症予測を実行する。具体的には、早産予測機能１４０は、複数の時点において取得した妊婦の細菌叢データを早産予測モデルに投入し、複数の時点において取得した細菌叢データに基づく早産予測モデルの出力結果に基づき当該妊婦の早産有無予測、出産日予測又は早産発症リスク算出を実行してもよい。すなわち、早産予測機能１４０は、当該妊婦に早産の可能性があるか予測するか（早産発症有無予測）、当該妊婦の出産日を予測するか（出産日予測）、あるいは、当該妊婦の早産発症リスクを算出してもよい（早産発症リスク算出）。 The preterm birth prediction function 140 executes the prediction of the onset of preterm birth of the pregnant woman from the acquired bacterial flora data of the pregnant woman based on the preterm birth prediction model. Specifically, the preterm birth prediction function 140 inputs the pregnant woman's bacterial flora data acquired at a plurality of time points into the preterm birth prediction model, and based on the output result of the preterm birth prediction model based on the bacterial flora data acquired at the plurality of time points. Preterm birth prediction, birth date prediction, or preterm birth risk calculation for pregnant women may be performed. That is, the preterm birth prediction function 140 predicts whether the pregnant woman has a possibility of preterm birth (preterm birth onset prediction), predicts the birth date of the pregnant woman (birth date prediction), or predicts the preterm birth onset of the pregnant woman. The risk may be calculated (preterm birth risk calculation).

結果通知機能１５０は、早産予測機能１４０によって実行された早産予測の結果を妊婦、医療関係者などに通知する。 The result notification function 150 notifies pregnant women, medical personnel, and the like of the result of the premature birth prediction executed by the premature birth prediction function 140.

次に、図３〜６を参照して、本発明の一実施例による予測処理を説明する。当該予測処理は、上述した予測装置１００によって実行される。図３は、本発明の一実施例による予測処理を示すフローチャートである。 Next, the prediction process according to the embodiment of the present invention will be described with reference to FIGS. 3 to 6. The prediction process is executed by the prediction device 100 described above. FIG. 3 is a flowchart showing a prediction process according to an embodiment of the present invention.

図３に示されるように、ステップＳ１００において、予測装置１００は、細菌叢計測装置５０から細菌叢データを取得し、取得した細菌叢データを保存する。具体的には、細菌叢計測装置５０は、妊婦から採取された歯垢検体を計測し、当該歯垢検体に含まれる細菌種量を示す細菌叢データを予測装置１００に提供する。予測装置１００は、細菌叢計測装置５０から取得した細菌叢データを妊婦毎に保存する。本実施例によると、細菌叢データは、例えば、妊娠フェーズ毎など、妊娠前後の何れかの時期から出産前後の何れかの時期までのデータ取得期間において各妊婦に対して複数回採取される。 As shown in FIG. 3, in step S100, the prediction device 100 acquires the bacterial flora data from the bacterial flora measuring device 50 and stores the acquired bacterial flora data. Specifically, the bacterial flora measuring device 50 measures a plaque sample collected from a pregnant woman, and provides the predictive device 100 with bacterial flora data indicating the amount of bacterial species contained in the plaque sample. The prediction device 100 stores the bacterial flora data acquired from the bacterial flora measuring device 50 for each pregnant woman. According to this example, bacterial flora data is collected multiple times for each pregnant woman during the data acquisition period from any time before or after pregnancy to any time before and after childbirth, for example, for each pregnancy phase.

ステップｓ１０１において、予測装置１００は、取得した細菌叢データに対して異常値を検出及び除去し、欠測値を補完する。例えば、基準値から閾値範囲外の細菌種量が測定された場合、予測装置１００は、当該測定値を実際の値ではなく計測エラーとみなして、当該測定値を除去すると共に、他の細菌種量の値が類似する別の歯垢検体の当該測定値にランダムなノイズを加えて流用するか、他の細菌種量の値が類似する別の歯垢検体を複数抽出してそれらの当該測定値の分布からその分布に従うように１つの推定値を算出するなど何れかの方法により当該測定値を補完してもよい。例えば、検体Sの細菌種spA-2が閾値範囲外または欠測値である場合、まず検体S以外の検体群においてspA-2の細菌種量の分散Vを計算する。つぎに、細菌種群B_ALL={spA-1, spB-1,…, spA-2, spB-2}からspA-2を抜いた細菌種群B_SUBの各細菌種量が検体Sと近い検体を1検体抽出し、そのspA-2の細菌種量に正規分布N(0,V)に従うランダムな１点を足し合わせた値を検体SのspA-2の推定量とする方法がある。このとき、検体Sと近い検体の抽出には、各細菌種量を各細菌種量の標準偏差で割って標準化した値を用いてユーグリッド距離を計算しても良いし、MSE（平均平方誤差）又はR²（決定係数）などを用いても良い。また別の補完方法として、細菌種群B_SUBの各細菌種量が検体Sと近い検体を例えば10検体だけ抽出し、その平均値E2および分散V2を求め、正規分布N(E2,V2)に従うランダムな１点を検体SのspA-2の推定量とする方法がある。また、分布は必ずしも正規分布とせず、任意の分布で推定してもよい。 In step s101, the prediction device 100 detects and removes abnormal values from the acquired bacterial flora data and supplements the missing values. For example, when the amount of bacterial species outside the threshold range is measured from the reference value, the prediction device 100 considers the measured value as a measurement error instead of an actual value, removes the measured value, and removes the measured value, and other bacterial species. Random noise is added to the measured value of another toothpaste sample with similar amount values and used, or multiple other toothpaste samples with similar amount values are extracted and measured. The measured value may be supplemented by any method such as calculating one estimated value from the distribution of values so as to follow the distribution. For example, when the bacterial species spA-2 of sample S is out of the threshold range or is a missing value, the variance V of the bacterial species amount of spA-2 is first calculated in the sample group other than sample S. Next, a sample in which the amount of each bacterial species in _{the bacterial species group B SUB} , which is obtained by removing spA-2 from the bacterial species group B _ALL = {spA-1, spB-1,…, spA-2, spB-2}, is close to that of the sample S. There is a method in which one sample is extracted and the value obtained by adding one random point according to the normal distribution N (0, V) to the amount of bacterial species of spA-2 is used as the estimated amount of spA-2 of sample S. At this time, in order to extract a sample close to the sample S, the Eugrid distance may be calculated by dividing each bacterial species amount by the standard deviation of each bacterial species amount and using a standardized value, or MSE (mean square error). ) Or R ² (coefficient of determination) or the like may be used. As another complementary method, for example, only 10 samples in which the amount of each bacterial species in the bacterial species group B _SUB is close to that of the sample S are extracted, the average value E2 and the dispersion V2 are obtained, and the random distribution follows the normal distribution N (E2, V2). There is a method of using one point as the estimated amount of spA-2 of the sample S. Moreover, the distribution is not necessarily a normal distribution, and may be estimated by any distribution.

ステップｓ１０２において、予測装置１００は、既に出産して出産状況が確定している妊婦のデータを用いて、早産群と正期産群とをよく分ける変数を特徴量として抽出する。これには、ステューデントのt検定、ウェルチのt検定、多変量ロジスティック回帰、カイ二乗検定などを用いて変数を限定する方法、Lasso回帰、Ridge回帰、Elastic Netなどにより有用な変数を限定する方法、主成分分析の寄与率などから変数を限定する方法などがあるが、それらに限定されるものでない。 In step s102, the prediction device 100 uses data of pregnant women who have already given birth and whose delivery status has been determined, and extracts variables that often separate the preterm birth group and the term birth group as feature quantities. This includes limiting variables using Student's t-test, Welch's t-test, multivariate logistic regression, chi-square test, Lasso regression, Ridge regression, Elastic Net, etc. There are methods for limiting variables based on the contribution rate of principal component analysis, but they are not limited to these.

ステップｓ１０３、ｓ１０５及びｓ１０７において、予測装置１００は、抽出した特徴量を用いて早産予測モデルを生成する。具体的には、予測装置１００は、ステップｓ１０３において、早産になるかどうかを予測する分類モデルとして早産発症予測モデルを生成する。また、予測装置１００は、ステップｓ１０５において、各妊婦の出産日を予測する回帰モデルとして出産日予測モデルを生成する。また、予測装置１００は、ｓ１０７において、早産発症のリスクを算出する早産発症リスク算出モデルを生成する。回帰モデルには、線形回帰モデル、ロジスティック回帰モデル、SVMによる回帰モデル、多層ニューラルネットワークによる回帰モデルなどがあるが、それらに限定されるものでない。また、分類モデルにおいても、線形分類、ロジスティック回帰、ベイジアンネットワーク、SVM、k近傍法、ランダムフォレスト、多層ニューラルネットワークによる分類モデルなどがあるが、それらに限定されるものでない。早産発症リスク算出モデルについては、オッズ比、寄与危険度、相対危険度などがあるが、それらに限定されるものでない。また、ステップｓ１０３、ｓ１０５、ｓ１０７は、ｓ１０２と切り離さずに一体で処理されてもよい。 In steps s103, s105 and s107, the prediction device 100 generates a preterm birth prediction model using the extracted features. Specifically, in step s103, the prediction device 100 generates a premature birth onset prediction model as a classification model for predicting whether or not premature birth will occur. Further, in step s105, the prediction device 100 generates a birth date prediction model as a regression model for predicting the birth date of each pregnant woman. In addition, the prediction device 100 generates a preterm birth risk calculation model for calculating the risk of preterm birth in s107. Regression models include, but are not limited to, linear regression models, logistic regression models, regression models using SVMs, and regression models using multi-layer neural networks. In addition, classification models include, but are not limited to, linear classification, logistic regression, Bayesian network, SVM, k-nearest neighbor method, random forest, and multi-layer neural network classification model. The model for calculating the risk of preterm birth includes, but is not limited to, odds ratio, attribution risk, and relative risk. Further, steps s103, s105, and s107 may be processed integrally without being separated from s102.

ステップｓ１０４、ｓ１０６及びｓ１０８において、予測装置１００は、生成された早産発症予測モデル、出産日予測モデル及び早産発症リスク算出モデルに予測対象の妊婦の細菌叢データをそれぞれ投入し、早産予測を実行する。具体的には、予測装置１００は、ステップｓ１０４において、早産発症予測モデルを利用して予測対象の妊婦の細菌叢データから将来の早産発症の有無を予測する。また、予測装置１００は、ステップｓ１０６において、出産日予測モデルを利用して予測対象の妊婦の細菌叢データから出産日を予測する。また、予測装置１００は、ステップｓ１０８において、早産発症リスク算出モデルを利用して予測対象の妊婦の細菌叢データから早産発症リスクを算出する。なお、早産有無予測は、最終的な早産発症有無の予測、この先n1週間（n1は任意の数値）で早産を発症しない予測、この先n2週間（n2は任意の数値）で早産を発症する予測、などを含むが、それらに限定されるものでない。 In steps s104, s106 and s108, the prediction device 100 inputs the bacterial flora data of the pregnant woman to be predicted into the generated preterm birth onset prediction model, birth date prediction model and preterm birth onset risk calculation model, respectively, and executes the preterm birth prediction. .. Specifically, in step s104, the prediction device 100 predicts the presence or absence of future preterm birth from the bacterial flora data of the pregnant woman to be predicted by using the preterm birth onset prediction model. Further, in step s106, the prediction device 100 predicts the birth date from the bacterial flora data of the pregnant woman to be predicted by using the birth date prediction model. Further, in step s108, the prediction device 100 calculates the risk of preterm birth from the bacterial flora data of the pregnant woman to be predicted by using the preterm birth risk calculation model. The prediction of the presence or absence of preterm birth is the final prediction of the presence or absence of preterm birth, the prediction of not developing preterm birth in the next n1 weeks (n1 is an arbitrary value), the prediction of developing preterm birth in the next n2 weeks (n2 is an arbitrary value), Etc., but not limited to them.

ステップｓ１０９において、予測装置１００は、早産発症予測結果、出産日予測結果及び／又は早産発症リスク計算結果を妊婦又は医療機関に通知し、当該予測処理を終了する。 In step s109, the prediction device 100 notifies the pregnant woman or the medical institution of the preterm birth onset prediction result, the birth date prediction result and / or the preterm birth onset risk calculation result, and ends the prediction process.

図４は、本発明の一実施例による特徴量抽出・モデル作成処理を示すフローチャートである。当該特徴量抽出・モデル作成処理は、上述された図３のステップｓ１０２，ｓ１０３において実行可能であり、後述されるステップｓ１０２−１〜５がステップｓ１０２の特徴量の抽出処理に対応し、ステップｓ１０３−１〜３がステップｓ１０３の早産発症有無予測モデルの生成処理に対応する。当該特徴量抽出・モデル作成処理では、早産予測モデル生成機能１３０は、取得したデータを早産群と正期産群とに分け、各変数のうち二群をよく分ける（すなわち、区別する、判別する、分離するなど）変数、例えば、二群をよく分ける上位の所定数の変数を特徴量として抽出し、抽出された特徴量を用いて早産予測モデルを作成し、最も性能の良い特徴量による早産予測モデルを特定する。次に、早産予測モデル生成機能１３０は、当該特徴量を除いた残りの特徴量と特定された特徴量とを併せた２変数の予測モデルを作成し、これらのモデルを含めて最適なモデルを選択する。早産予測モデル生成機能１３０は、上述した一連の処理を再帰的に実行し、最終的に早産予測モデルを特定する。なお、上記は早産発症有無予測モデルに利用する特徴量の抽出方法や早産発症有無予測モデルの選定方法の一例であって、早産発症有無予測モデルはこれに制限されない。例えば、多層ニューラルネットワークまたはLasso回帰などによって特徴量を抽出し、早産発症有無予測モデルを生成する方法もある。 FIG. 4 is a flowchart showing a feature amount extraction / model creation process according to an embodiment of the present invention. The feature amount extraction / model creation process can be executed in steps s102 and s103 of FIG. 3 described above, and steps s102-1 to 5 described later correspond to the feature amount extraction process of step s102, and step s103. -1 to 3 correspond to the generation process of the preterm birth onset prediction model in step s103. In the feature amount extraction / model creation process, the preterm birth prediction model generation function 130 divides the acquired data into a preterm birth group and a term birth group, and divides (that is, distinguishes, discriminates) two groups of each variable well. , Separation, etc.) Variables, for example, a predetermined number of variables that divide the two groups well are extracted as features, a preterm birth prediction model is created using the extracted features, and preterm birth is based on the best-performing features. Identify the predictive model. Next, the preterm birth prediction model generation function 130 creates a two-variable prediction model that combines the remaining feature amount excluding the feature amount and the specified feature amount, and includes these models to obtain the optimum model. select. The premature birth prediction model generation function 130 recursively executes the above-mentioned series of processes, and finally identifies the premature birth prediction model. The above is an example of a method for extracting features used in a model for predicting the presence or absence of preterm birth and a method for selecting a model for predicting the presence or absence of preterm birth, and the model for predicting the presence or absence of preterm birth is not limited thereto. For example, there is also a method of extracting features by a multi-layer neural network or Lasso regression to generate a model for predicting the presence or absence of preterm birth.

図４に示されるように、ステップｓ１０２−１において、予測装置１００は、取得した細菌叢データを加工する。具体的には、予測装置１００は、例えば、細菌種の計測量を相対量に標準化してもよいし、また、相対量が少ない細菌種を削除してもよい。 As shown in FIG. 4, in step s102-1, the predictor 100 processes the acquired bacterial flora data. Specifically, the prediction device 100 may standardize the measured amount of the bacterial species to a relative amount, or may delete the bacterial species having a small relative amount, for example.

ステップｓ１０２−２において、予測装置１００は、モデル生成のために、既に出産済みの妊婦の細菌叢データを抽出する。 In step s102-2, the predictor 100 extracts bacterial flora data of a pregnant woman who has already given birth for model generation.

ステップｓ１０２−３において、予測装置１００は、妊娠フェーズ毎に採取された歯垢検体の細菌叢データについて、各妊娠フェーズにおいて採取された各細菌種のそれぞれを独立した変数として保存する。例えば、２つの妊娠フェーズでそれぞれ２００菌種の細菌種が採取された場合、４００個（２００菌種×２妊娠フェーズ）の変数が設定される。例えば、４００個のデータは、当該妊婦のＩＤ（userid）と関連付けて、"spA-1, spB-1, ...,spA-2, spB-2, ..."の変数列により表されてもよい。 In step s102-3, the prediction device 100 stores each bacterial species collected in each pregnancy phase as an independent variable for the bacterial flora data of the plaque sample collected in each pregnancy phase. For example, when 200 bacterial species are collected in each of the two pregnancy phases, 400 variables (200 bacterial species x 2 pregnancy phases) are set. For example, 400 data are represented by a variable sequence of "spA-1, spB-1, ..., spA-2, spB-2, ..." in association with the ID (userid) of the pregnant woman. You may.

ステップＳ１０２−４において、予測装置１００は、早産群と正期産群とに細菌叢データを分け、検定にてp値がa未満（例えば、a=0.01）となる二群間をよく分ける細菌種を抽出する。 In step S102-4, the predictor 100 divides the bacterial flora data into the preterm birth group and the term birth group, and divides the bacteria well between the two groups whose p-value is less than a (for example, a = 0.01) in the test. Extract the seeds.

ステップＳ１０２−５において、予測装置１００は、抽出した細菌種をp値の昇順にソートし、これらを特徴量とする。 In step S102-5, the prediction device 100 sorts the extracted bacterial species in ascending order of p-value, and uses these as feature quantities.

ステップＳ１０３−１において、予測装置１００は、細菌叢データを早産群D1と正期産群D2とに分け、それぞれの比率e（例えば、e=0.9）で早産群D1を学習用データD1a及び評価用データD1bに分け、正期産群D2を学習用データD2a及び評価用データD2bに分ける。 In step S103-1, the prediction device 100 divides the bacterial flora data into the preterm production group D1 and the term production group D2, and evaluates the preterm production group D1 as learning data D1a and evaluation at each ratio e (for example, e = 0.9). The term data D1b is divided, and the term production group D2 is divided into learning data D2a and evaluation data D2b.

ステップＳ１０３−２において、予測装置１００は、予測性能を向上させるため、学習用データD1a, D2aにおいてD1aをオーバーサンプリングしてもよく、及び／又はD2aをアンダーサンプリングしてもよい。 In step S103-2, the prediction device 100 may oversample D1a in the training data D1a, D2a and / or undersample D2a in order to improve the prediction performance.

ステップＳ１０３−３において、予測装置１００は、学習用データD1a, D2aを用いてforward stepwise selectionによって特徴量を選択し、早産予測モデルMを作成する。例えば、最初に２０個の特徴量（妊娠フェーズごとの細菌種）について、予測装置１００は、１つの特徴量だけを用いて早産予測モデルM(1)〜M(20)を作成する。そして、予測装置１００は、評価データ用D1b, D2bを用いて、最も性能の良いM(A)（特徴量Aを使用）を特定する。性能スコアとして、例えば、AUC(Area under the curve)又はf-scoreが利用されてもよい。次に、予測装置１００は、学習用データD1a, D2aを用いて、当該特徴量を除いた残りの１９個の特徴量から１つの特徴量を選択してAと併せた２変数の早産予測モデルM(A,1)〜M(A,19)を作成し、評価用データD1b, D2bを用いて、M(A)を含めて最適なモデルを選択する。予測装置１００は、上述した一連の処理を再帰的に実行し、最終的に早産予測モデルM(A,B,C,...)を特定する。 In step S103-3, the prediction device 100 selects the feature amount by forward stepwise selection using the training data D1a and D2a, and creates the preterm birth prediction model M. For example, for the first 20 features (bacterial species for each pregnancy phase), the prediction device 100 creates preterm birth prediction models M (1) to M (20) using only one feature. Then, the prediction device 100 identifies M (A) (using the feature amount A) having the best performance by using the evaluation data D1b and D2b. As the performance score, for example, AUC (Area under the curve) or f-score may be used. Next, the prediction device 100 uses the training data D1a and D2a to select one feature amount from the remaining 19 feature amounts excluding the feature amount and combine it with A as a two-variable premature delivery prediction model. Create M (A, 1) to M (A, 19), and use the evaluation data D1b and D2b to select the optimum model including M (A). The prediction device 100 recursively executes the series of processes described above, and finally identifies the premature birth prediction model M (A, B, C, ...).

なお、早産予測モデルMは、SVM、線形ロジスティック回帰、ニューラルネットワーク、ランダムフォレストなどに基づき生成されてもよい。また、性能スコアは、AUC又はf-scoreに限定されず、sensitivityとspecificityとの調和平均などが用いられてもよい。また、最適な特徴量を選択する方法としてはforward step selectionに限定されず、backward stepwise selection又はbidirectional stepwise selectionが利用されてもよい。また、全細菌種を利用し、PCAやMDS、NMDSなどで次元削減した後、PC1, PC2, ..., PCNのN個の特徴量に対してforward/backward/bidirectional stepwise selectionを適用してもよい。 The preterm birth prediction model M may be generated based on SVM, linear logistic regression, neural network, random forest, or the like. Further, the performance score is not limited to AUC or f-score, and a harmonic mean of sensitivity and specificity may be used. Further, the method of selecting the optimum feature amount is not limited to forward step selection, and backward stepwise selection or bidirectional stepwise selection may be used. In addition, after using all bacterial species and reducing the dimensions with PCA, MDS, NMDS, etc., apply forward / backward / bidirectional stepwise selection to N features of PC1, PC2, ..., PCN. May be good.

図５は、本発明の他の実施例による特徴量抽出・モデル作成処理を示すフローチャートである。当該特徴量抽出・モデル作成処理は、上述された図３のステップｓ１０２，ｓ１０５において実行可能であり、後述されるステップｓ１０２−１〜５がステップｓ１０２の特徴量の抽出処理に対応し、ステップｓ１０５−１〜３がステップｓ１０５の出産日予測モデルの生成処理に対応する。なお、ステップＳ１０２−１〜５は図４のステップＳ１０２−１〜５と同じであるため、その説明は省く。 FIG. 5 is a flowchart showing a feature amount extraction / model creation process according to another embodiment of the present invention. The feature amount extraction / model creation process can be executed in steps s102 and s105 of FIG. 3 described above, and steps s102-1 to 5 described later correspond to the feature amount extraction process of step s102, and step s105. -1 to 3 correspond to the generation process of the birth date prediction model in step s105. Since steps S102-1 to 5 are the same as steps S102-1 to 5 in FIG. 4, the description thereof will be omitted.

ステップＳ１０５−１において、予測装置１００は、細菌叢データを早産群D1と正期産群D2とに分け、それぞれの比率e（例えば、e=0.9）で早産群D1を学習用データD1a及び評価用データD1bに分け、正期産群D2を学習用データD2a及び評価用データD2bに分ける。 In step S105-1, the prediction device 100 divides the bacterial flora data into the preterm production group D1 and the term production group D2, and evaluates the preterm production group D1 as learning data D1a and evaluation at each ratio e (for example, e = 0.9). The term data D1b is divided, and the term production group D2 is divided into learning data D2a and evaluation data D2b.

ステップＳ１０５−２において、予測装置１００は、予測性能を向上させるため、学習用データD1a, D2aにおいてD1aをオーバーサンプリングしてもよく、及び／又はD2aをアンダーサンプリングしてもよい。 In step S105-2, the prediction device 100 may oversample D1a in the training data D1a, D2a and / or undersample D2a in order to improve the prediction performance.

ステップＳ１０５−３において、予測装置１００は、学習用データD1a, D2aを用いてforward stepwise selectionによって特徴量を選択し、出産日予測モデルMを作成する。例えば、最初に２０個の特徴量（妊娠フェーズごとの細菌種）について、予測装置１００は、１つの特徴量だけを用いて出産日予測モデルM(1)〜M(20)を作成する。そして、予測装置１００は、評価データ用D1b, D2bを用いて、最も性能の良いM(A)（特徴量Aを使用）を特定する。性能スコアとして、例えば、MSE（平均平方誤差）又はR²（決定係数）が利用されてもよい。次に、予測装置１００は、学習用データD1a, D2aを用いて、当該特徴量を除いた残りの１９個の特徴量から１つの特徴量を選択してAと併せた２変数の出産日予測モデルM(A,1)〜M(A,19)を作成し、評価用データD1b, D2bを用いて、M(A)を含めて最適なモデルを選択する。予測装置１００は、上述した一連の処理を再帰的に実行し、最終的に出産日予測モデルM(A,B,C,...)を特定する。 In step S105-3, the prediction device 100 selects the feature amount by forward stepwise selection using the learning data D1a and D2a, and creates the birth date prediction model M. For example, first, for 20 feature amounts (bacterial species for each pregnancy phase), the prediction device 100 creates birth date prediction models M (1) to M (20) using only one feature amount. Then, the prediction device 100 identifies M (A) (using the feature amount A) having the best performance by using the evaluation data D1b and D2b. For example, MSE (mean square error) or R ² (coefficient of determination) may be used as the performance score. Next, the prediction device 100 uses the training data D1a and D2a to select one feature amount from the remaining 19 feature amounts excluding the feature amount and combine it with A to predict the birth date of two variables. Models M (A, 1) to M (A, 19) are created, and the optimum model including M (A) is selected using the evaluation data D1b and D2b. The prediction device 100 recursively executes the series of processes described above, and finally identifies the birth date prediction model M (A, B, C, ...).

なお、出産日予測モデルMは、SVM、線形ロジスティック回帰、ニューラルネットワークなどに基づき生成されてもよい。また、性能スコアは、MSE（平均平方誤差）又はR²（決定係数）に限定されなくてもよい。また、最適な特徴量を選択する方法としてはforward step selectionに限定されず、backward stepwise selection又はbidirectional stepwise selectionが利用されてもよい。また、全細菌種を利用し、PCAやMDS、NMDSなどで次元削減した後、PC1, PC2, ..., PCNのN個の特徴量に対してforward/backward/bidirectional stepwise selectionを適用してもよい。 The birth date prediction model M may be generated based on SVM, linear logistic regression, neural network, or the like. Also, the performance score does not have to be limited to MSE (mean square error) or R ^{2 (coefficient of determination).} Further, the method of selecting the optimum feature amount is not limited to forward step selection, and backward stepwise selection or bidirectional stepwise selection may be used. In addition, after using all bacterial species and reducing the dimensions with PCA, MDS, NMDS, etc., apply forward / backward / bidirectional stepwise selection to N features of PC1, PC2, ..., PCN. May be good.

図６は、本発明の他の実施例による特徴量抽出・モデル作成処理を示すフローチャートである。当該特徴量抽出・モデル作成処理は、上述された図３のステップｓ１０２，ｓ１０７において実行可能であり、後述されるステップｓ１０２−１〜５がステップｓ１０２の特徴量の抽出処理に対応し、ステップｓ１０７−１〜２がステップｓ１０７の早産リスクモデルの生成処理に対応する。なお、ステップＳ１０２−１〜５は図４のステップＳ１０２−１〜５と同じであるため、その説明は省く。 FIG. 6 is a flowchart showing a feature amount extraction / model creation process according to another embodiment of the present invention. The feature amount extraction / model creation process can be executed in steps s102 and s107 of FIG. 3 described above, and steps s102-1 to 5 described later correspond to the feature amount extraction process of step s102, and step s107. -1 to 2 correspond to the process of generating the preterm birth risk model in step s107. Since steps S102-1 to 5 are the same as steps S102-1 to 5 in FIG. 4, the description thereof will be omitted.

ステップＳ１０７−１において、予測装置１００は、細菌種（spA-1, spB-1, ..., spA-2, spB-2, ...）毎に量の閾値範囲S_spA-1=[A_spA-1, B_spa-1]を設定し、早産に対するリスク比(RR_spA-1)を計算する。ここで、リスク比はRR=a(c+d)/c(a+b)であり、aは早産群であって（細菌種の量が）閾値範囲内である歯垢検体数であり、bは正期産群であって閾値範囲内である歯垢検体数であり、cは早産群であって閾値範囲外である歯垢検体数であり、dは正期産群であって閾値範囲外である歯垢検体数である。 _{In step S107-1, the predictor 100 determines the amount threshold range S spA-1} = [for each bacterial species (spA-1, spB-1, ..., spA-2, spB-2, ...). Set A _spA-1 , B _spa-1 ] and calculate the risk ratio (RR _{spA-1) to preterm birth.} Here, the risk ratio is RR = a (c + d) / c (a + b), where a is the number of plaque samples in the preterm birth group (the amount of bacterial species) within the threshold range. b is the number of plaque samples in the term birth group and within the threshold range, c is the number of plaque samples in the preterm birth group and outside the threshold range, and d is the term birth group and threshold. The number of plaque samples that are out of range.

ステップＳ１０７−２において、予測装置１００は、予測対象の妊婦の細菌種の量を（x_spA-1, x_spB-1,...）とした場合、
RR=Π_spRR_sp ^f(x _sp ⁾
f(x_sp) = 1 if x_sp is in S_sp
f(x_sp) = -1 if x_sp is not in S_sp
によって、早産リスク予測モデルを生成してもよい。 In step S107-2, when the amount of bacterial species of the pregnant woman to be predicted is (x _spA-1 , x _spB-1 , ...), the prediction device 100
RR = Π _sp RR _sp ^{f (x} _sp ⁾
f (x _sp ) = 1 if x _sp is in S _sp
f (x _sp ) = -1 if x _sp is not in S _sp
May generate a preterm birth risk prediction model.

上記の一例を実施した結果では、早産群と正期産群をよく分ける細菌種として、Porphyromonas gingivalis, Treponema denticola, Tabberella forsythensisや、Bergeyella, Kingella denitrificans, Scardovia wiggsiae, Streptococus mutans, Selenomonas, Capnocytophaga, Treponema vincentiiなどが抽出され、これらを特徴量として用いた早産発症有無予測モデルの性能は、感度0.714、特異度0.751となった。 According to the results of the above example, Porphyromonas gingivalis, Treponema denticola, Tabberella forsythensis, Bergeyella, Kingella denitrificans, Scardovia wiggsiae, Streptococus mutans, Selenomonas, Capnocytophaga, Treponema The performance of the preterm birth onset prediction model using these as feature quantities was 0.714 for sensitivity and 0.751 for specificity.

以上、本発明の実施例について詳述したが、本発明は上述した特定の実施形態に限定されるものではなく、特許請求の範囲に記載された本発明の要旨の範囲内において、種々の変形・変更が可能である。 Although the examples of the present invention have been described in detail above, the present invention is not limited to the above-mentioned specific embodiments, and various modifications are made within the scope of the gist of the present invention described in the claims.・ Can be changed.

１０予測システム
５０細菌叢計測装置
１００予測装置
１１０データ取得機能
１２０データ保存機能
１３０早産予測モデル生成機能
１４０早産予測機能
１５０結果通知機能 10 Prediction system 50 Bacterial flora measurement device 100 Prediction device 110 Data acquisition function 120 Data storage function 130 Preterm birth prediction model generation function 140 Preterm birth prediction function 150 Result notification function

Claims

A data acquisition function of acquiring flora data relating flora in the oral cavity of the pregnant full-term production group and preterm group pregnant women,
A preterm birth prediction model generation function that generates a preterm birth prediction model based on the bacterial flora data,
Based on the preterm birth prediction model, the preterm birth prediction function that executes the prediction of the onset of preterm birth of the pregnant woman to be predicted from the acquired bacterial flora data, and
A result notification function that notifies the result of the prediction of the onset of preterm birth,
Have,
The data acquisition function acquires the bacterial flora data at a plurality of time points during the data acquisition period from any time before and after pregnancy to any time before and after childbirth of the pregnant woman to be predicted.
The preterm birth prediction function is a prediction device that executes the preterm birth onset prediction based on the bacterial flora data acquired at the plurality of time points.

The preterm birth prediction model generation function sets each bacterial species of the bacterial flora data acquired at the plurality of time points as independent variables, and extracts a predetermined number of high-ranking variables that often separate the preterm birth group and the term birth group. And
The prediction device according to claim 1, wherein the premature birth prediction model generation function generates the premature birth prediction model from the extracted variables according to a forward stepwise selection method.

The Preterm Prediction model is, Porphyromonas gingivalis, Treponema denticola, and Tabberella forsythensis, Bergeyella, Kingella denitrificans, Scardovia wiggsiae, Streptococus mutans, Selenomonas, Capnocytophaga, is generated so as to extract one or more of Treponema vincentii, claim 1 or 2. The prediction device according to 2.

The Preterm Prediction model generation function generates a premature onset whether prediction model及beauty preterm risk model,
The Preterm Prediction, based respectively on the generated preterm onset whether prediction model及beauty preterm risk model, executes the onset whether pre Haka及beauty premature risk calculating preterm claims 1 to 3 set forth in any one Predictor.