CN113808693B

CN113808693B - Drug recommendation method based on graph neural network and attention mechanism

Info

Publication number: CN113808693B
Application number: CN202111061579.9A
Authority: CN
Inventors: 万健; 岳魏琦; 张蕾; 洪高枫; 郑慧琳; 史斌彬
Original assignee: Zhejiang Lover Health Science and Technology Development Co Ltd
Current assignee: Zhejiang Lover Health Science and Technology Development Co Ltd
Priority date: 2021-09-10
Filing date: 2021-09-10
Publication date: 2024-10-22
Anticipated expiration: 2041-09-10
Also published as: CN113808693A

Abstract

The invention discloses a medicine recommendation method based on a graph neural network and an attention mechanism. The invention takes the structural characteristics of the medical treatment situation or the medication information of each patient as a node, adopts the graph neural network to capture the relation among the structural characteristics, and learns the high-order characteristics comprising the knowledge of a medical system. Meanwhile, the attention mechanism is used for modeling the history medical records of the user better, and the medicine interaction knowledge is introduced, so that the accuracy and the safety of medicine recommendation are improved effectively.

Description

Drug recommendation method based on graph neural network and attention mechanism

Technical Field

The invention belongs to the technical field of computer application, and relates to a medicine recommendation method based on a graph neural network and an attention mechanism.

Background

The development of modern medical technology has led to the widespread use of electronic medical records, accumulating a large amount of clinical data such as vital signs, clinical summaries, disease diagnoses, prescription drugs, etc. Meanwhile, the deep learning technology provides a new technical means for mining and utilizing medical data, and is a current research hotspot. The combined medicine recommendation algorithm based on the electronic medical records can assist doctors to make safe and effective prescriptions according to the change characteristics of patient conditions, the medicine attributes and the action relations among a large number of medicines, and has important research values.

Early drug recommendation techniques were mostly rule-based. Relevant specialists extract medication rules based on medical information such as diagnosis, disease classification, symptoms, detection results and the like of patients, and the disadvantage is that maintenance is complex and difficult to update and expand. The medicine recommendation technology under deep learning embeds the information of physical sign, diagnosis, past medicine and the like of a patient into a low-dimensional space, and uses the embedded representation to recommend, so that the recommendation accuracy is improved. However, they also have many problems including data sparsity, inability to effectively utilize the patient's history information, neglecting medical ontology information underlying medical codes, and the like.

A graph neural network is a neural network that acts directly on the graph structure. It has the following characteristics: neglecting the input sequence of the nodes; in the calculation process, the representation of the node is influenced by the neighboring nodes around the node, and the connection of the graph is unchanged; the representation of the graph structure allows reasoning based on the graph. Therefore, the graphic neural network becomes a great hotspot for research, and is widely applied to the fields of social networks, recommendation systems, financial wind control, physical systems, molecular chemistry, life sciences, knowledge maps, traffic prediction and the like.

Drawing attention mechanisms have been introduced into graphic neural networks and have found widespread use in many fields. The method has the advantages that attention mechanisms are benefited, the graph neural network better realizes the weighted aggregation of neighbors by learning the weights of the neighbors, noise neighbors are further filtered, model performance is improved, and the result can be interpreted to a certain extent.

Disclosure of Invention

Aiming at the defects of the prior art, the invention aims to provide the medicine recommendation method which can effectively relieve the sparsity of medical data, effectively utilize the history case information of patients and consider the medicine safety.

In order to achieve the above object, the technology provided by the present invention is a drug recommendation method based on a graph neural network and an attention mechanism, comprising the steps of:

step 1, acquiring historical electronic medical record data and carrying out structural treatment:

acquiring historical diagnosis conditions of a patient and medication information corresponding to the diagnosis conditions to construct an electronic medical record, wherein the diagnosis conditions comprise diagnosis data and operation condition data; the electronic medical record of the patient is expressed as p= [ x ₁,x₂,...,x_t-1 ], t is the current visit times of the patient, wherein the ith visit of the patient is expressed as x _i＝[d_i,p_i,m_i],i＝1,2,...,t-1,d_i which represents the diagnosis data of the ith visit of the patient, p _i which represents the operation condition data of the ith visit in the medical record of the patient, and m _i which represents the medication data of the ith visit.

Step 2, constructing three graphic neural networks for learning the patient treatment condition and the structural characteristics of the medication information; the three graphic neural networks are respectively used for inputting diagnosis data, operation condition data and medication data of a patient, and the corresponding outputs are respectively d ^e、p^e、m^e;

The three graph neural networks adopt the same structure and specifically comprise nodes and edges; the nodes comprise leaf nodes and non-leaf nodes, wherein the leaf nodes are one of input data, namely diagnosis data, operation condition data and medication data of a patient, and the non-leaf nodes are medical attribution classifications of the leaf nodes; edges are medical classification relations of two nodes;

Each non-leaf node is represented as the sum of its own vector representation and its vector representation of all child nodes, calculated by way of GAT graph annotation mechanism:

Where g _n denotes the nth non-leaf node, K denotes the total number of attentions, reLU and LeakyReLU denote nonlinear functions, ch (n) denotes the vector representation of the nth non-leaf node itself and all its children, The weight calculation coefficient representing the current non-leaf node under the kth attention and all sub-nodes thereof, W ^k representing the learning parameter of the non-leaf node under the kth attention, e _* representing the vector representation of the node, a representing a learnable matrix, and a ^T being the transpose thereof.

Each leaf node is represented as the sum of its own vector representation and all its ancestor nodes, again calculated by way of the GAT graph annotation mechanism:

Where c' _n represents the nth leaf node, an (n) represents the vector representation of the nth leaf node itself and all ancestor nodes thereof, The weight calculation coefficients representing the current leaf node itself and all its ancestor nodes at the kth attention, W' ^k represents the learning parameters of the leaf node at the kth attention.

Step 3, constructing two GRU network models with attention mechanisms, wherein the input is the output result d ^e、p^e of the step 2 respectively, and the corresponding output is k ^d、k^p with history information respectively;

the two GRU network models with the attention mechanisms adopt the same structure and comprise two parallel GRU networks and an attention mechanism module connected with the outputs of the two parallel GRU networks;

Two parallel GRU models are hidden layer output information of historical treatment conditions (i.e. diagnosis or operation condition information) obtained by adopting different activation functions, and the hidden layer output information is specifically as follows:

H＝GRU₁(r) (5)

W^h＝softmax(F_h(H)) (6)

H＝GRU₂(r) (7)

W′^htanh(F_h′(H′)) (8)

Wherein, H, H' respectively represent hidden layer information output by a first GRU network and a second GRU network model, W ^h,W′^h respectively represent attention mechanism weights obtained by the first GRU network and the second GRU network through softmax and tanh activation functions, F _h,F′_h respectively represent linear transformation matrix functions which can be learned by the first GRU network and the second GRU network, and r represents d ^e or p ^e;

the attention mechanism module calculates k ^d、k^p with history information according to formula (9), i.e For diagnostic information with historical information for different time scales,Is the operation condition information with history information of different time scales.

Wherein t represents the total number of patient visits, W ^h(i),W′^h (i) represents the attention mechanism weight obtained through the softmax and tanh activation functions corresponding to the ith visit,Representing element-by-element multiplication; k represents k ^d or k ^p;

Step 4, constructing two memory neural networks MANN with the same structure; wherein the key-value pair stored in the first memory neural network is the "ith visit diagnostic data fusion information Medication information of "-" graph neural network"; The key-value pair stored in the second memory neural network is' fusion information of the ith treatment operation conditionMedication information of "-" graph neural network”；

The ith treatment condition and the historical treatment condition are subjected to para-multiplication, and the weight of the ith treatment condition is calculated

Wherein the method comprises the steps ofOr (b)The treatment condition information which indicates that the treatment is carried out for the ith treatment and has history information;

By weight Obtaining historical medication vectorThe ith dose is as follows:

further obtaining the key of the memory neural network

Wherein the method comprises the steps ofRepresenting learning weights, keysCorresponding value is

According toCan obtainAnd

Step 5, constructing a drug interaction knowledge base

The drug interaction knowledge is introduced, and the coexistence relationship of the drugs in the electronic medical record is represented by using an adjacency matrix A _C, and the drug interaction relationship is represented by an adjacency matrix A _D. And (3) learning a drug co-occurrence relationship and a drug interaction relationship by adopting a graph convolution neural network, and combining the drug interaction and co-occurrence relationship with the key value pair in the step (4) to generate a recommended drug list.

5-1 In step 4AndCombining to obtain query vectors containing historical medical record information, ith diagnosis information and ith operation information

Wherein W _s represents the comparative weight of the diagnostic and surgical information.

5-2 Constructing a drug coexistence relationship matrix and a drug interaction relationship matrix in the electronic medical record

A_*＝D^-1(A_*+I)D^-1 (14)

Wherein D represents a diagonal matrix transformed by A _*, D ^-1 is the inverse thereof, I is an identity matrix, and A _* represents a drug coexistence relationship matrix A _C or a drug interaction relationship matrix A _D in the electronic medical record.

5-3 Learning relationships between drugs using a graph convolution neural network, combining interactions and coexistence relationships of drugs into an embedded representation, resulting in a representation matrix Z _C of a drug co-occurrence graph, and a representation matrix Z _D of a drug interaction graph:

Z_C＝A_Ctanh(A_Cm^e)W_C (15)

ZD＝A_Dtanh(A_Dm^e)W_D (16)

Wherein W _C,W_D is the parameter matrix of the drug contribution graph and the drug interaction graph, and m ^e is the set of key value pairs obtained in the step 4.

Based on matrix Z _C、Z_D and query vectorCalculate attention λ _i:

Wherein W _CD represents a comparative weight of drug coexistence relationship and interaction.

Finally, a recommended drug list y _i is obtained:

Wherein W _y represents the weight coefficient when the recommended medicine list is calculated.

When the recommendation probability y _i of a drug in the drug list is greater than the recommendation probability threshold ρ, the drug is recommended.

It is another object of the present invention to provide a drug recommendation device based on a graph neural network and an attention mechanism, comprising

The data preprocessing module is used for carrying out structural processing on the historical treatment condition of the patient and the medication information corresponding to the treatment condition to construct corresponding electronic medical record data;

The graphic neural network module is used for learning the patient treatment condition and the structural characteristics of the medication information; the three graphic neural networks are respectively used for inputting diagnosis data, operation condition data and medication data of a patient, and the corresponding outputs are respectively d ^e、p^e、m^e;

And the GRU network model module with an attention mechanism is used for extracting characteristics of the diagnosis data and the operation condition data output by the graphic neural network module through the GRU network and combining the characteristics with the current diagnosis condition to obtain the diagnosis data and the operation condition data with history information.

The memory neural network MANN module is used for constructing a key value pair of the i-th diagnosis data fusion information output by the GRU network model module with the attention mechanism and the medication information of the graphic neural network, and a key value pair of the i-th diagnosis operation condition fusion information output by the GRU network model module with the attention mechanism and the medication information of the graphic neural network;

and the medicine interaction knowledge base module is used for learning medicine co-occurrence relations and medicine interaction relations by adopting a graph convolution neural network, combining the medicine interaction and co-occurrence relations to the medicine embedding representation of the memory neural network MANN module and generating a recommended medicine list.

Yet another object of the present invention is a computer readable storage medium having stored thereon a computer program which, when executed in a computer, causes the computer to perform the above-mentioned method.

Yet another object of the present invention is a computing device comprising a memory having executable code stored therein and a processor which, when executing the executable code, implements the method described above.

The invention has the following advantages: the invention takes the structural characteristics of the medical treatment situation or the medication information of each patient as a node, adopts the graph neural network to capture the relation among the structural characteristics, and learns the high-order characteristics comprising the knowledge of a medical system. Meanwhile, the attention mechanism is used for modeling the history medical records of the user better, and the medicine interaction knowledge is introduced, so that the accuracy and the safety of medicine recommendation are improved effectively.

Drawings

Fig. 1 is a drug recommendation process based on a graph neural network and an attention mechanism.

FIG. 2 is a tree diagram of a medical code encoding architecture.

FIG. 3is a graph showing the comparison of F1 values for different visits by different methods.

Detailed Description

The invention will be further analyzed with reference to specific examples.

The technology provided by the invention is a medicine recommendation method based on a graph neural network and an attention mechanism, and comprises the following steps:

Step 3, constructing two GRU network models with attention mechanisms, wherein the input is the output results de and p ^e of the step 2 respectively, and the corresponding output is k ^d、k^p with history information respectively;

the two GRU network models with the attention mechanisms adopt the same structure, and each GRU network model comprises a GRU network and an attention mechanism module connected with the output of the GRU network;

The GRU model uses the attention mechanism to incorporate hidden layer output information of historical medical conditions (i.e., diagnostic or surgical condition information) into the current information representation in the following manner:

H＝GRU₁(r) (5)

W^h＝softmax(F_h(H)) (6)

H′＝GRU₂(r) (7)

W′^h＝tanh(F_h′(H′)) (8)

Wherein, H, H' respectively represent hidden layer information output by the first and second GRU network models, W ^h,W′^h respectively represent attention mechanism weights obtained by softmax and tanh activation functions, F _h,F′_h represents a linear transformation matrix function which can be learned by the first and second GRU network models, and r represents d ^e or p ^e;

Wherein t represents the total number of patient visits, W ^h(i),W^h (i) represents the attention mechanism weight obtained through the softmax and tanh activation functions corresponding to a specific visit,Representing element-by-element multiplication; k ^* represents k ^d or k ^p;

step 4, constructing two memory neural networks MANN with similar structures; wherein the key-value pair stored in the first memory neural network is the "ith visit diagnostic data fusion information Medication information of "-" graph neural network"; The key-value pair stored in the second memory neural network is' fusion information of the ith treatment operation conditionMedication information of "-" graph neural network”；

Wherein the method comprises the steps ofOr (b)Surgical condition information representing the i-th visit with history information;

By weight Obtaining historical medication vectorThe ith dose is as follows:

further obtaining the key of the memory neural network

Step 5, constructing a drug interaction knowledge base

The drug interaction knowledge is introduced, and the coexistence relationship of the drugs in the electronic medical record is represented by using an adjacency matrix A _C, and the drug interaction relationship is represented by an adjacency matrix A _D. And (3) learning a drug co-occurrence relationship and a drug interaction relationship by adopting a graph convolution neural network, and combining the drug interaction and the co-occurrence relationship with the drug embedded representation obtained in the step (4) to generate a recommended drug list.

5-1 In step fourAndCombining to obtain query vectors containing historical medical record information, ith diagnosis information and ith operation information

Wherein W _s represents a comparative weight of the diagnostic and surgical information.

A_*＝D^-1(A_*+I)D^-1 (14)

z_C＝A_Ctanh(A_Cm^e)W_C (15)

Z_D＝A_Dtanh(A_Dm^e)W_D (16)

Wherein W _C,W_D is the parameter matrix of the drug contribution graph and the drug interaction graph, and me is the structural characteristic of the drug output in the step 2.

Based on matrix Z _C、Z_D and query vectorCalculate attention λ _i:

Finally, a recommended drug list y _i is obtained:

The obtained medicine list is a group of one-dimensional matrixes with absolute values smaller than 1, the horizontal and vertical coordinates respectively represent the kind of the medicine and the recommendation probability, and when the recommendation probability of the medicine in the medicine list is larger than a preset threshold value of 0.5, the medicine is recommended.

The experimental process comprises the following steps:

The experiment used electronic medical record data from the MIMIC-III (Medical Information Mark for INTENSIVE CARE) database, a free public intensive care data set issued by the institute of technology computing physiology laboratory. The present invention uses the diagnostic, surgical and prescription data in the database to screen patients for medications that are received within 24 hours after entering the ICU.

In order to measure the recommendation accuracy, the invention uses a Jaccard similarity coefficient (Jaccard), namely the size of the intersection of the real drug and the recommended drug divided by the union size, the average F1 value (F1), namely the harmonic average of the precision rate and the recall rate, and an accuracy calling curve (PRAUC) as a measurement index of the precision rate.

In order to measure the safety of recommended drugs, the drug interaction rate DDI, i.e. the ratio of the recommended combination drugs containing DDI drugs, is used.

Compared with the six methods which are more effective at present, the Nearst method recommends according to the similarity between the current visit and the past diagnosis, and recommends the combined medicament which is the same as the past diagnosis; the LR method is L2 regularized logistic regression, using multiple thermal vectors to represent the input data, with the dichotomous used to process the multi-tag output. The Leap method uses a recurrent neural network to model tag dependencies, and uses a content-based attention mechanism to capture mappings between tag instances. The RETAIN method is based on a sequential data drug combination of a two-layer attention network model that selects clinical variables important in past visits. The GAMENet method is a method of integrating historical medication and medication interaction DDI usage graph rolling networks through a storage module. The PREMIER method learns patient history using the attention mechanism and drug interactions in conjunction with the graph attention mechanism.

Table 1 model comparison experiments

Table 1 is the performance of the drug recommendation task on the data set for various methods. Experimental results show that the model of the method can achieve the best effect in all methods. In particular, the method proposed by the present invention is 0.97%, 0.89% and 0.93% higher than the latest method (PREMIER) in Jaccard, PRAUC and F1 scores, respectively. Meanwhile, the method of the invention gives consideration to drug interaction, and the minimum DDI is 0.0705 under the condition of taking the first 40 drug interaction rates in all similar deep learning methods. In addition, the average medicine quantity recommended by the invention is 14.98, and the average medicine quantity closest to the actual medical record in comparison with each deep learning method is 14.68.

GRAD-mkg represents the experimental results of the invention, with the knowledge base of drug interactions removed. Under the condition that a medicine interaction knowledge base is not available, the accuracy of the recommended medicines is not changed greatly, but DDI in the recommended medicines of the model is increased to 0.767, which shows that knowledge of interaction relations among medicines and query vectors with historical treatment information are combined in the method, so that the interaction rate in the recommended medicines is reduced, and the medication safety is improved. Grad-tree represents a model generated using only the neural network to learn structural features of patient's medical condition and medication information. After the medical code body structure is not embedded, the accuracy of the medicine recommendation is obviously reduced, which indicates that the graph neural network used in the text has the coding capability on the high-order structural characteristics, can enrich the embedded representation of the medical code body, makes up the problem of sparse training data to a certain extent, and improves the accuracy of the medicine recommendation.

Since the number of visits is different for each patient, the influence of the number of previous visits should be considered. For different timing lengths, the invention is superior to all other methods. As shown in FIG. 3, the present invention has the highest F1 value among all classifications with number of visits. In particular, for data with more times of treatment, higher accuracy can be maintained compared with other methods, which shows that the method has better modeling capability on long time sequence dependence in patient medical records.

TABLE 2 comparative experiments with different degrees of DDI

Further experiments were carried out with respect to the effect on drug interactions. The first 40, 60, 80, 100 DDI types were used separately to investigate the impact of the present invention and those methods compared to, considering the use of different numbers of DDIs. Results as shown in table 2, GRAD is the only algorithm that can achieve DDI reduction when the number of DDI types considered is changed from 40 to 100, although the Δddi rate rises from-18.48 to-0.26%, and is always greater than zero, regardless of the DDI type. This shows that the invention can reduce the interaction rate of recommended drugs after introducing the knowledge of drug interactions, and has more safety.

It can thus be seen that the present invention has the following advantages: the proposed medicine recommendation algorithm based on the graph neural network and the attention mechanism takes the structural characteristics of the medical condition or the medication information of each patient as a node, adopts the graph neural network to capture the relation among the nodes, and learns the higher-order characteristics comprising the medical classification relation. Meanwhile, the attention mechanism is used for modeling the history medical records of the user better, and the medicine interaction knowledge is introduced, so that the accuracy and the safety of medicine recommendation are improved effectively.

Claims

1. The medicine recommending method based on the graph neural network and the attention mechanism is characterized by comprising the following steps of:

Acquiring historical diagnosis conditions of a patient and medication information corresponding to the diagnosis conditions to construct an electronic medical record, wherein the diagnosis conditions comprise diagnosis data and operation condition data; the electronic medical record of the patient is expressed as p= [ x ₁,x₂,...,x_t-1 ], t is the current visit times of the patient, wherein the ith visit of the patient is expressed as x _i＝[d_i,p_i,m_i],i＝1,2,...,t-1,d_i which represents the diagnosis data of the ith visit of the patient, p _i which represents the operation condition data of the ith visit in the medical record of the patient, and m _i which represents the medication data of the ith visit;

Step 5, constructing a drug interaction knowledge base

Wherein W _s represents a contrasting weight of the diagnostic and surgical information;

A_*＝D^-1(A_*+I)D^-1 (1)

Wherein D represents a diagonal matrix transformed by A _*, D ^-1 is the inverse thereof, I is an identity matrix, and A _* represents a drug coexistence relationship matrix A _C or a drug interaction relationship matrix A _D in the electronic medical record;

Z_c＝A_ctanh(A_cm^e)W_c (15)

Z_D＝A_Dtanh(A_Dm^e)W_D (16)

Wherein W _C,W_D is the parameter matrix of the drug contribution graph and the drug interaction graph, and m ^e is the set of key value pairs obtained in the step 4;

based on matrix Z _C、Z_D and query vector Calculate attention λ _i:

wherein W _CD represents a comparative weight of drug coexistence relationship and interaction;

finally, a recommended medicine list yi is obtained:

wherein W _y represents a weight coefficient when calculating a recommended medicine list;

When the recommendation probability yi of the drug in the drug list is larger than the recommendation probability threshold ρ, the corresponding drug is recommended.

2. The medicine recommendation method based on the graph neural network and the attention mechanism as claimed in claim 1, wherein in the step2, three graph neural networks adopt the same structure and all comprise nodes and edges; the nodes comprise leaf nodes and non-leaf nodes, wherein the leaf nodes are input data, namely one of diagnosis data, operation condition data and medication data of a patient, and the non-leaf nodes are medical attribution classification of the leaf nodes; edges are medical classification relations of two nodes;

Where g _n denotes the nth non-leaf node, K denotes the total number of attentions, reLU and LeakyReLU denote nonlinear functions, ch (n) denotes the vector representation of the nth non-leaf node itself and all its children, The weight calculation coefficient of the current non-leaf node and all child nodes thereof under the kth attention is represented, W ^k represents the learning parameter of the non-leaf node under the kth attention, e _* represents the vector representation of the node, a represents a leavable matrix, and a ^T is the transpose thereof;

3. The method for recommending drugs based on a graph neural network and an attention mechanism as claimed in claim 1 or 2, wherein in the step 3, two GRU network models with the attention mechanism are of the same structure, and each GRU network model comprises two parallel GRU networks and an attention mechanism module connected with the outputs of the two parallel GRU networks;

the two parallel GRU models are specifically a first GRU network and a second GRU network, wherein the first GRU network and the second GRU network respectively adopt different activation functions to acquire hidden information of historical visit conditions, and the hidden information is specifically as follows:

H＝GRU₁(r) (5)

w^h＝softmax(F_h(H)) (6)

H′＝GRU₂(r) (7)

w′^h＝tanh(F_h′(H′)) (8)

Wherein, H, H' respectively represent hidden layer information output by a first GRU network and a second GRU network model, w _h,w′^h respectively represent attention mechanism weights obtained by the first GRU network and the second GRU network through softmax and tanh activation functions, F _h,F′_h respectively represent linear transformation matrix functions which can be learned by the first GRU network and the second GRU network, and r represents d ^e or p ^e;

the attention mechanism module calculates k ^d、k^p with history information according to formula (9), i.e For diagnostic information with historical information for different time scales,Surgical condition information with history information for different time scales;

wherein t represents the total number of patient visits, W ^h(i),W′^h (i) represents the attention mechanism weight obtained through softmax and tanh activation functions corresponding to the ith visit, Representing element-by-element multiplication; k ^* represents k ^d or k ^p.

4. The method for recommending drugs based on graphic neural network and attention mechanism as set forth in claim 1 or 2, wherein the memory neural network in step 4 performs para-multiplication on the i-th diagnosis situation and its historical diagnosis situation, calculates the weight of the i-th diagnosis situation

By weight Obtaining historical medication vectorThe ith dose is as follows:

further obtaining the key of the memory neural network

According toCan obtainAnd

5. A graph neural network and attention mechanism based drug recommendation device implementing the method of any one of claims 1-4, comprising:

The GRU network model module is used for extracting characteristics of the diagnosis data and the operation condition data output by the graphic neural network module through the GRU network, and then combining the characteristics with the current diagnosis condition to obtain the diagnosis data and the operation condition data with history information;

6. A computer readable storage medium, characterized in that it has stored thereon a computer program which, when executed in a computer, causes the computer to perform the method of any of claims 1-4.

7. A computing device comprising a memory having executable code stored therein and a processor that, when executing the executable code, performs the method of any of claims 1-4.