Nothing Special   »   [go: up one dir, main page]

CN112365989B - Equivalent signal mining method for SRS combined adverse reaction signals - Google Patents

Equivalent signal mining method for SRS combined adverse reaction signals Download PDF

Info

Publication number
CN112365989B
CN112365989B CN202011304774.5A CN202011304774A CN112365989B CN 112365989 B CN112365989 B CN 112365989B CN 202011304774 A CN202011304774 A CN 202011304774A CN 112365989 B CN112365989 B CN 112365989B
Authority
CN
China
Prior art keywords
signal
adverse reaction
equivalent
equivalence
srs
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011304774.5A
Other languages
Chinese (zh)
Other versions
CN112365989A (en
Inventor
袁洪
刘心瑶
吴俏玉
陆瑶
蔡菁菁
李莹
黄志军
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Changsha Hongyuan Cardiovascular Health Research Institute
Original Assignee
Changsha Hongyuan Cardiovascular Health Research Institute
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Changsha Hongyuan Cardiovascular Health Research Institute filed Critical Changsha Hongyuan Cardiovascular Health Research Institute
Priority to CN202011304774.5A priority Critical patent/CN112365989B/en
Publication of CN112365989A publication Critical patent/CN112365989A/en
Application granted granted Critical
Publication of CN112365989B publication Critical patent/CN112365989B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H50/00ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics
    • G16H50/70ICT specially adapted for medical diagnosis, medical simulation or medical data mining; ICT specially adapted for detecting, monitoring or modelling epidemics or pandemics for mining of medical data, e.g. analysing previous cases of other patients
    • GPHYSICS
    • G16INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR SPECIFIC APPLICATION FIELDS
    • G16HHEALTHCARE INFORMATICS, i.e. INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR THE HANDLING OR PROCESSING OF MEDICAL OR HEALTHCARE DATA
    • G16H70/00ICT specially adapted for the handling or processing of medical references
    • G16H70/40ICT specially adapted for the handling or processing of medical references relating to drugs, e.g. their side effects or intended usage

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Public Health (AREA)
  • Epidemiology (AREA)
  • Data Mining & Analysis (AREA)
  • Primary Health Care (AREA)
  • General Health & Medical Sciences (AREA)
  • Databases & Information Systems (AREA)
  • Pathology (AREA)
  • Biomedical Technology (AREA)
  • Chemical & Material Sciences (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Medicinal Chemistry (AREA)
  • Pharmacology & Pharmacy (AREA)
  • Toxicology (AREA)
  • Medical Treatment And Welfare Office Work (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses an equivalent signal mining method facing an SRS combined adverse reaction signal, which comprises the steps of acquiring data through an SRS spontaneous presentation system of adverse drug events to obtain a combined adverse reaction signal set X ═ D1,D2,…,Dx},DxA drug combination which is judged to be an adverse reaction signal; preferentially processing the signal with the minimum size of the drug combination set and solving the maximum equivalence group D of the DGCombining with DGThere is a maximum equivalence set to a maximum equivalence set D of the intersectionG(ii) a Will no longer belong to DGAnd a signal equivalent to D is incorporated into DGD isGAdding XGOutputting the maximum equivalence set XG。XGEach maximum equivalence group in the group is a signal set with pairwise equivalence relation between the combined adverse reaction signals, and one signal is selected from the maximum equivalence groups to represent the signal set (namely the maximum equivalence group) to participate in other data analysis, so that the workload of the overall analysis is reduced.

Description

Equivalent signal mining method for SRS combined adverse reaction signals
Technical Field
The invention relates to the technical field of signal mining, in particular to an equivalent signal mining method for adverse reaction signals of SRS combined medication, wherein SRS represents a spontaneous report system for adverse drug events.
Background
The study on the interaction of all the combined drugs cannot be completed in the pre-marketing stage due to the factors such as the experimental range, the study object, the time and the like, and many potential drug interactions can be discovered only in the process of large-scale and long-time use after the drugs are marketed. The Spontaneous drug adverse event Reporting System (SRS) provides an important data source for drug interaction signal mining after marketing. For example, the FDA Adverse Event Reporting System (FAERS) in the united states reflects the complexity of medication safety in real life, and it is investigated that 60% of Adverse Event reports have more than one drug, 70% have more than one drug, and 84% have at least 3 total drugs or Adverse events. These features present opportunities and challenges for combination drug adverse reaction signal mining.
In order to break through the dilemma of 'mass data and lack of information', mainstream researches utilize a computer to discover adverse reaction signals of combined medication in batches in spontaneous presentation system data. The signal contains two elements: drug combination D and target adverse reaction AE. A drug combination is a collection of drugs, which may also be referred to as a combination or co-drug. If combination D is a signal for an adverse reaction AE, it means that the targeted adverse reaction AE may occur when the patient is taking all of the drugs in D at the same time. When AE is determined, the signal can be expressed as a drug combination. It should be noted that the signal is only a clue and does not prove the causal relationship between the drug combination and the adverse reaction, which needs more complete medical experiment and mechanism analysis to confirm that the signal that can be confirmed is called a positive signal and the signal that cannot be confirmed is called a false positive signal. The higher the positive signal percentage in the signal, the higher the quality of signal mining. The signal strength metric is a key to determining the quality of signal mining. The imbalance measurement is the basic idea of signal metrics, i.e., "imbalance" or "dissimilarity" of an event of interest as compared to other events. The method comprises two major categories, namely a frequency method and a Bayesian method, wherein the frequency method comprises a relative hazard ratio (RR), a ratio report ratio (PRR), a report ratio (ROR) and the like, and the Bayesian method comprises a Bayesian confidence degree progressive neural network (BCPNN), an empirical Bayesian gamma Poisson distribution reduction Method (MGPS), a BCPNN high-dimensional expansion version, an omega contraction measurement method and the like. The methods have advantages, but on the whole, the common problems of large signal clue quantity and low accuracy (4%) exist, the advantages of large data cannot be truly exerted, and the defect is more prominent when high-order combined drug adverse reaction signals are mined. This is because the real-life drug combination mode is very complicated, and the known imbalance measurement method estimates the risk of adverse signal reaction by assuming independent and irrelevant drug combination, only evaluates the drug combination without considering the influence of other drug combinations, so that other drugs (combination) daily combined with positive signals can be estimated as false positive signals. One of the very typical problems is that several apparently distinct combination adverse reaction signals are expressed from the same set of patient reports in the SRS database. The presence of such signals can cause two problems for subsequent signal analysis by medical researchers: (1) performing data analysis independently on signals from the same patient subgroup can result in duplication of labor; (2) from the point of view of data analysis, these signals are confounding each other, in other words, the number of positive signals therein is much smaller than the number of signals. At present, how to describe and mine such signals in a normative way, no relevant research work can be used for reference.
Disclosure of Invention
Technical problem to be solved
Aiming at the problems, the invention provides an equivalent signal mining method facing an adverse reaction signal of SRS combined medication. If the coincidence degree of the patient groups corresponding to the two signals is high, case difference is insufficient to support signal intensity judgment, the two signals are equivalent to the equivalent description of adverse drug reactions of the same patient group, and the two signals are called to have an equivalent relationship, so that the invention provides a judgment method. On the basis, a transmission hypothesis and a combination hypothesis are adopted, and a signal set with pairwise equivalence relation between the combined adverse reaction signals is excavated, and the signal set is called as a maximum equivalence group. And selecting a signal from the maximum equivalence group to represent the whole set to participate in other data analysis, so that the workload of the whole analysis is reduced, and the equivalent signal of the signal can be regarded as a confounding factor, which is beneficial to improving the quality of the signal analysis.
(II) technical scheme
Based on the technical problems, the invention provides an equivalent signal mining method for adverse reaction signals of SRS combined medication, which comprises the following steps:
s1, making the unprocessed set Z as X, and making the maximum equivalence set X of the combined drug adverse reaction signal set XGIn the empty set, the combined adverse reaction signal set X acquires data and preprocesses the data through an SRS (sounding reference signal) of a drug adverse event spontaneous presentation system, and then the combined adverse reaction signal set X ═ { D ═ is obtained by mining1,D2,…,Dx},DxA drug combination for determining an adverse reaction signal;
s2, judging whether Z is an empty set, if so, entering a step S9, and otherwise, entering a step S3;
s3, priority processing the signal with the smallest medicine combination size, wherein the medicine combination size represents the number of medicines contained in the medicine combination, i.e. processing the signal
Figure BDA0002788021500000041
If | represents the size of the set, the unprocessed set Z ═ Z- { D }, and D is initializedG={D};
S4, judgment XGIf the current set is an empty set, if so, the step S6 is executed, otherwise, the step S5 is executed;
s5, traverse XGCombining with DGThere is a maximum equivalence set to a maximum equivalence set D of the intersectionG
S6, judging whether Z is an empty set, if so, entering a step S8, and if not, entering a step S7;
s7, traversing Z, adding the signals equivalent to D into the maximum equivalent group D of DG
S8, order XG=XG∪{DGReverting to step S2;
s9, outputting the maximum equivalence group set XGThat is, the adverse side effect of combined medicationThe method comprises the steps of collecting maximum equivalence groups of response signals, wherein each maximum equivalence group is a combined adverse reaction signal collection with pairwise equivalence relation, selecting a signal, namely a drug combination, from each maximum equivalence group, and representing the maximum equivalence group to participate in further analysis of combined adverse reaction data, wherein the data are obtained from demographic information, medication information and adverse reaction information of a spontaneous drug adverse event report system SRS.
Further, the method for determining the drug combination with adverse reaction signals comprises the following steps: if the frequency of the target adverse reaction signals AE reported by the patients taking the medicine combination D is greater than the support threshold; the length of the drug combination D is not more than the drug combination signal length threshold MAX _ D; d obtained from SRS data of PS patient set is used as lower bound Q (PS, D, AE) CI of signal intensity confidence interval of target adverse reaction signal AE->A signal strength threshold δ; d is determined to be the drug combination of the targeted adverse reaction signal AE.
Further, Q (PS, D, AE) is a signal metric function of the non-equilibrium metric class, and the metric methods adopted include RR, PRR, RoR, BCPNN, and MGPS.
Further, the preprocessing comprises data deduplication and drug name normalization.
Further, the step S5 includes the following steps:
s5.1, traversing XGSelect not traversed member Dk GJudgment of Dk GAnd DGIf not, go to step S5.2, if yes, DG=DG∪Dk G,XG=XG-{Dk GStep S5.2;
s5.2, judging and traversing XGIf not, the process goes to step S6, otherwise, the process goes to step S5.1.
Further, the step S7 includes the following steps:
s7.1, selecting non-traversed member D of Zi
S7.2, judging whether D is equivalent to DiIf yes, let DG=DG∪{DiStep 7.3, if not, step 7.3 is carried out;
s7.3, judging whether the traversal Z is finished, if not, entering the step S7.1, and if so, indicating DGAfter the solution is completed, the process proceeds to step S8.
Further, it is determined in step S7.2 whether D is equivalent to DiThe method comprises the following steps:
s7.2.1, and Q (PS, D, AE) CI-≤Q(PS,Di,AE).CI+And Q (PS, D)i,AE).CI-≤Q(PS,D,AE).CI+If yes, go to step S7.2.2; otherwise, go to step S7.3; wherein Q (e.,. e.) represents a signal metric function, Q (PS, D, AE)+,Q(PS,D,AE).CI-Representing the upper and lower bounds of a signal intensity confidence interval of a target adverse reaction signal AE, which is D obtained according to SRS data of a patient set PS;
s7.2.2, determination of absolute values | C (G (D), AE) -C (G (D)i) AE) | is greater than θ, if yes, go to step S7.3, otherwise go to step S7.2.3; wherein G (·) represents a medication case function, C (,) represents an adverse reaction report statistical function, C (G (D), AE) represents the number of reports of adverse reactions AE from a case set G (D) taking a drug combination D, and θ represents an adverse reaction report number threshold;
s7.2.3, determining whether D is DiIf so, go to step S7.2.4, otherwise go to step S7.2.5;
s7.2.4, let C (G (D) -G (D)i),AE)=C(G(D),AE)-C(G(Di),AE),C(G(Di) -g (d), AE) ═ 0, go to step S7.2.6;
s7.2.5, determining whether | D | is equal to the medication combination signal length threshold MAX _ D, if yes, entering step S7.2.6; otherwise, go to step S7.3;
s7.2.6, traversing the SRS data, and counting to obtain C (G), (D) -G (D)i) AE) and C (G (D)i) -values of g (d), AE);
s7.2.7, judging C (G), (D) -G (D)i),AE)<θ and C (G (D)i)-G(D),AE)<Whether or not θ becomesIf both are true, go to step S7.2.8; otherwise, go to step S7.3;
s7.2.8, D is equivalent to DiLet DG=DG∪{Di}。
The invention discloses a server, comprising:
at least one processor; and at least one memory communicatively coupled to the processor, wherein:
the memory stores program instructions executable by the processor, and the processor invokes the program instructions to perform the equivalent signal mining method for SRS-oriented combined adverse drug reaction signals.
The invention discloses a non-transitory computer-readable storage medium storing computer instructions for causing a computer to execute the equivalent signal mining method for an adverse reaction signal of an SRS combination.
(III) advantageous effects
The technical scheme of the invention has the following advantages:
(1) the invention first takes the value D from X and solves the maximum equivalence group D of DGAdding XGThen, the value D is re-taken and X is traversedGMerging the maximum equivalent sets with intersection, and solving the new maximum equivalent set D of DGIncorporating XGRepeating the steps in the cycle until each drug combination in the X has been valued, and obtaining the final maximum equivalent group set XG;XGEach maximum equivalence group in the drug combination adverse reaction signal set is a drug combination adverse reaction signal set with pairwise equivalence relation, and one signal is selected from the set according to an equivalence substitution assumption so as to represent the whole set to participate in other data analysis, thereby being beneficial to reducing the workload of signal analysis; when finer-grained data analysis is carried out on a specific signal, an equivalent signal of the signal can be used as a confounding factor, and the signal analysis quality is improved;
(2) step S7 of the present invention is to solve the maximum equivalence group D of D according to the equivalence relation determination methodGIn the process, traverseZ, adding the signal equivalent to D to the maximum equivalence group D of DGThe accuracy is high, and before the equivalence relation judging method is utilized, special conditions that equivalence relation judgment is not established without inquiring SRS data are eliminated, SRS data inquiry is avoided, and the calculation efficiency is improved;
(3) in step S7, the equivalence relation is determined by using the method of | D | and the size of the combined medication signal length threshold MAX _ D in combination with the equivalence transfer hypothesis and the equivalence combination hypothesis, so that the execution of equivalence relation determination is avoided, SRS data query is avoided, and the calculation efficiency is further improved;
(4) step S5 of the present invention at XGAfter non-empty set, traverse XGMerging and D according to the equal-cost transfer assumptionGThere is a maximum equivalence set to a maximum equivalence set D of the intersectionGSo that and DGSignals contained in the maximum equivalent group with intersection do not need to be subjected to equivalent judgment one by one, so that the number of relation judgment times in the equivalent group is reduced, the calculation amount is reduced, and the analysis efficiency is improved.
Drawings
The features and advantages of the present invention will be more clearly understood by reference to the accompanying drawings, which are illustrative and not to be construed as limiting the invention in any way, and in which:
FIG. 1 is a schematic overall flow chart of a combined medication adverse reaction equivalence relation signal mining method based on SRS data according to an embodiment of the invention;
FIG. 2 is an example of equivalence relations and maximum equivalence sets for an embodiment of the present invention;
FIG. 3 is a flowchart of a signal mining method for adverse reaction equivalence relation based on SRS combined medication according to an embodiment of the invention;
FIG. 4 is a flowchart of an embodiment of the present invention for determining whether D is equivalent to DiA flow chart of the method of (1).
Detailed Description
The following detailed description of embodiments of the present invention is provided in connection with the accompanying drawings and examples. The following examples are intended to illustrate the invention but are not intended to limit the scope of the invention.
The invention relates to a mining method for analyzing adverse reaction equivalence relation signals of combined drug bleeding by using an adverse event report file issued by FAERS, which comprises the following steps of:
in the stage I, data are acquired and preprocessed through an SRS (sounding reference signal) of a drug adverse event spontaneous reporting system: data are acquired through an SRS (sounding reference signal) of a drug adverse event spontaneous presentation system, data are deduplicated, drug names are normalized, and target adverse reactions AE are selected. This is the basis and precondition for all signal mining efforts. After data preprocessing, the signal is analyzed in the last two stages. For the purposes of the following expressions, the following symbols are defined: let the total patient set in the SRS be PS ═ p1,p2,…,pmAll drugs are set as DS ═ d }1,d2,…,dn}. Let C (,) denote the adverse reaction reporting statistical function, given a subset of patients P, C (P, AE) denotes the number of reports of adverse reaction AEs from subset of patients P. Let G (.) denote the medication case function, given signals D, G (D) as a set of cases taking medication combination D.
Obtaining adverse event report files published by FAERS 2018 three-quarter data to obtain 420,915 case reports, 1,651,966 medication records (including 51083 medicine names) and 1,329,530 adverse event records (including 11,944 adverse event records), integrating the demographic information, medication information and adverse drug reactions related by case IDs, and carrying out standardized processing on the adverse drug reaction names and the medicine names. Bleeding events were selected as target adverse events AE, involving 194 adverse events (all generalizable to bleeding events but stated differently) for a total of 19,067 bleeding event records.
And II, excavating adverse reaction signals of the combined medicine: for selected SRS data and target adverse reaction AE, excavating combined drug adverse reaction signals to obtain a combined drug adverse reaction signal set X ═ D1,D2,…,DxIn which the signal D isiIs a combination of drugs (drug set) taken simultaneously by patientsiFor each drug in (1), the patient is said to take Di. . Wherein the signal evaluation is jointly determined by the following three parameters: (1) threshold of supportSupport: the frequency of target adverse reaction AE reported by the medication patients from the drug combination D is more than Support; (2) length threshold MAX _ D: the length of drug combination D (i.e., the number of drugs contained in D) is not greater than MAX _ D, i.e., | D<MAX _ D; (3) signal metric function Q (,) and signal strength threshold δ: the function Q is input into a patient set PS, a drug combination D and a target adverse reaction AE, and Q (P, D, AE) is output as a Confidence Interval (CI) of signal intensity of an AE signal of D obtained from SRS data of the patient set PS, wherein CI is [ CI ]-,CI+]If the confidence interval is lower bound CI->And delta, D is judged as a signal of the target adverse reaction AE. The signal measurement function Q can adopt a non-equilibrium measurement method, which is a category of traditional research work, and the known measurement methods include a relative hazard ratio (RR), a ratio report ratio (PRR), a report ratio (ROR), a bayesian confidence degree progressive neural network (BCPNN), an empirical bayesian gamma poisson distribution subtraction Method (MGPS), and the like, and the corresponding relationship between Q and a typical method is shown in table 1. When solving Q (PS, D, AE) by the non-equilibrium metric method, C (g, (D), AE) needs to be calculated based on SRS data. In order to increase the efficiency of the subsequent calculation, each signal D in the set X has additional information Q (PS, D, AE) and C (g (D), AE).
TABLE 1 exemplary Signal metric method and confidence interval selection therefor
Figure BDA0002788021500000101
Drug combination bleeding event signals were mined using the FAERS 2018 three-quarter data. Let the Support threshold Support be 20, the length threshold MAX _ D be 3, the signal metric function Q (·) be RoR, and the signal strength threshold δ be 2, to obtain 1,530 drug combination bleeding event signals.
And III, excavating an adverse reaction equivalent relation signal of the combined medication from the adverse reaction signal set of the combined medication: the input of the stage is a signal set with pairwise equivalence relation between the combined drug adverse reaction signals to be mined, wherein the pairwise equivalence relation means that the patient groups corresponding to the two signals have high coincidence degree, case difference is insufficient to support signal intensity judgment, and the two signals are equivalent to equivalent description of the same patient group. The signal set obtained in the second stage is analyzed, the equivalence relation among the signals is identified, and the maximum equivalent signal set is obtained. This stage generally employs the same signal metric function Q as the second stage.
In order to disclose the characteristics of the equivalence relation, the ideal equivalence relation is analyzed firstly, the inference of the ideal equivalence relation is extended to the hypothesis under the general condition, and then the equivalence relation judgment standard and the mining algorithm under the general condition are obtained:
the ideal equivalent relationship is as follows: given signal Da,DbIf G (D)a)=G(Db) G (-) denotes the case function, drug combination DaAnd DbThe same set of cases is called DaAnd DbHave an equivalence relation between them, is called DaIs equivalent to Db. From the definition of the ideal equivalence relation, the following reasoning holds:
inference 1 equivalent substitutions: if signal DaIs equivalent to DbFor a given signal metric function Q, patient set P and signal DcThen the following equation holds: (i) q (P, D)a,AE)=Q(P,DbAE), namely under the same conditions, DaAnd DbThe signal intensities are consistent; (ii) q (P-G (D)a),Dc,AE)=Q(P-G(Db),DcAE), namely under the same conditions, Da、DbThe effect on the other signal evaluations was consistent.
Equivalence substitution inference indicates that for an equivalent signal Da,DbOnly one of the representatives needs to be selected to participate in the signal correlation analysis, which reduces the workload of the overall analysis.
Inference 2 equivalent transfer: if signal DaEquivalent to signal DbAnd D isbEquivalent to signal DcThen D isaIs equivalent to Dc
Inference 3 merge equivalencies: if signal DaEquivalent to signal DbOrder signal
Figure BDA0002788021500000111
And is
Figure BDA0002788021500000112
Then D iscIs equivalent to Da
However, the ideal equivalence relation is a special case, and the possibility of the ideal equivalence relation among signals is very little in view of the complexity of the combined medication in reality. For the assessment of adverse drug reaction signals, a small number of adverse reaction report differences were not statistically significant. Therefore, the equivalence between the determination signals focuses on two points: (1) the signal strengths have no significant difference, namely, the confidence intervals of the signal strengths have intersection; (2) for the target adverse reaction AE, a proper amount of difference exists between the adverse reaction reports of the allowable signals, and the difference is restrained by adopting a parameter 'adverse reaction report number threshold value theta'. The equivalence relation decision is defined as follows.
And (3) judging equivalence relation: for a given signal Da,DbIf the following 2 conditions are satisfied, DaAnd DbHave an equivalence relation between them, called DaIs equivalent to Db. : (i) there was no significant difference between the two signal strength estimates, i.e., Q (PS, D)a,AE).CI-≤Q(PS,Db,AE).CI+,Q(PS,Db,AE).CI-≤Q(PS,Da,AE).CI+. (ii) Adverse case record differences between the two signals were less than θ, i.e., C (G (D)a)-G(Db),AE)<=θ,C(G(Db)-G(Da),AE)<θ. Wherein the set subtraction operation relationship is defined as G (D)a)=G(Da)-G(Da)∩G(Db)。
Analyzing the above judgment process, in the condition (i), Q (PS, D)a,AE)、Q(PS,DbAE) is the signal Da、DbThe attached information of (2) is directly compared with the numerical value, so that the calculation cost is not large; in the condition (ii), in general, C (G (D)a)-G(Db),AE)、C(G(Db)-G(Da) AE) needs to query SRS data, which is high in calculation overhead. However, there are two special conditions bySignal Da、DbAuxiliary information C (G (D))a),AE)、C(G(Db) AE), SRS data may not need to be queried: (ii-1) if
Figure BDA0002788021500000121
Then C (G (D)a)-G(Db),AE)=C(G(Da),AE)-C(G(Db),AE),C(G(Db)-G(Da) AE) 0; (ii-2) apparently, | C (G (D)a),AE)-C(G(Db),AE)|<=Max(C(G(Da)-G(Db),AE),C(G(Db)-G(Da) AE)), therefore, if | C (G (D))a),AE)-C(G(Db),AE)|>θ, it is understood that the condition (ii) does not hold.
Considering that the equivalence relation judgment process relates to the SRS data query, in order to further improve the calculation efficiency, the equivalence transmission inference and the merging equivalence inference of the ideal equivalence relation are used for reference, the two inferences are used as the equivalence transmission assumption and the equivalence merging assumption, and the SRS data query is avoided as much as possible. When the signal D is known from the equivalent transfer assumptionaEquivalent to signal DbAnd D isbEquivalent to signal DcIn this case, D can be determined without executing the equivalence relation determination conditionaAnd DcAnd equivalently, SRS data query is avoided. Given signal D, as can be seen from the combined equivalence assumptionsaAnd DbWherein | Da|<MAX _ D, if DaIs equivalent to DbThen there must be a signal D in the set of signals XcSatisfy the following requirements
Figure BDA0002788021500000131
And DcIs equivalent to DaAnd Db. In this case, pair D is discarded based on the equivalent transfer assumptionaAnd DbD is finally obtained by judging the equivalence relationaAnd DbAnd (4) an equivalent conclusion. At the same time, this is also advantageous for reducing the computational overhead when the conditions are met
Figure BDA0002788021500000132
Then, D is judgedbAnd DcAn equivalence relation ofSRS data queries need not be performed.
For an input signal set X, the patent intends to group the signals therein, and for selected signals, if there is pairwise equivalence between the signals, put the signals into a signal set, which is called a maximum equivalence set. The definition is as follows:
maximum equivalence set definition: given set of signals
Figure BDA0002788021500000133
If for DGOf any two signals Da,DbSatisfy DaEquivalent DbSet of signals DGReferred to as an equivalence group. If there is no equivalence group DG’So that
Figure BDA0002788021500000134
Then D isGReferred to as the maximum equivalence group.
Based on the equivalence transfer assumption, it can be demonstrated that any one signal belongs to at most one maximum equivalence group. As shown in fig. 2, the inference of the ideal equivalence relation is extended to the hypothesis under the general situation, and then the equivalence relation decision criterion and the mining algorithm under the general situation are obtained: shown, { D, { when no equivalent transfer assumptions are assumed1,D2,D3And { D }1,D2,D4Are the largest equivalence sets. Wherein D is1,D2Belong to the two largest equivalence groups, thus creating a conceptual divergence, according to the equivalence transfer assumption, D1,D2Belong to a maximum equivalence group { D1,D2,D3,D4}。
In summary, the signal set X is divided into several maximal equivalence groups without intersection, and the set of these maximal equivalence groups is called the maximal equivalence group set XG
When SRS data and a signal set X (including Q (PS, D, AE) and C (G (D) related to each signal) are inputteda) AE) data) as follows, as shown in fig. 3:
s1, making Z as X in the untreated set, and making X as X in the combined adverse reaction signal setMaximum equivalence set XGIs an empty set phi;
s2, judging whether Z is an empty set, if so, entering a step S9, and if not, entering a step S3;
if Z is an empty set, the combined drug adverse reaction signal set X is completely processed;
s3, priority processing the signal with the smallest size of the medicine combination (the number of medicines contained in the medicine combination), i.e. order to process the signal
Figure BDA0002788021500000141
Initialization DGWhere | represents the size of the set (i.e., the number of elements in the set), then the unprocessed set Z ═ Z- { D };
s4, judgment XGIf the current set is an empty set, if so, the step S6 is executed, otherwise, the step S5 is executed;
if XGIf it is an empty set, the process proceeds to step S6;
s5, traverse XGCombining with DGThere is a maximum equivalence set to a maximum equivalence set D of the intersectionG
S5.1, traversing XGSelect not traversed member Dk GJudgment of Dk GAnd DGIf not, go to step S5.2, if yes, DG=DG∪Dk G,XG=XG-{Dk GStep S5.2;
according to the equivalence transfer assumption, if two maximum equivalence groups DGAnd Dk GIf there is an intersection, the signals of the two maximum equivalence sets are equivalent, so that the signals are combined into a maximum equivalence set DGNo longer need to match DGD with intersectionk GThe medicine combination is subjected to equivalence judgment one by one, the operation amount is greatly reduced through equivalence transmission hypothesis, and the analysis efficiency is improved.
S5.2, judging and traversing XGIf yes, the step S6 is executed, otherwise, the step S5.1 is executed;
s6, judging whether Z is an empty set, if so, entering a step S8, and if not, entering a step S7;
if Z is an empty set, the combined drug adverse reaction signal set X is completely processed;
s7, traversing Z, adding the signals equivalent to D into the maximum equivalent group D of DG(ii) a S7.1, selecting non-traversed member D of Zi
S7.2, judging whether D is equivalent to DiIf yes, let DG=DG∪{DiStep 7.3, if not, step 7.3 is carried out;
s7.3, judging whether the traversal Z is finished, if not, entering the step S7.1, and if so, indicating DGAfter the solution is completed, the process proceeds to step S8.
Judging whether D is equivalent to D in step S7.2iAs shown in fig. 4:
s7.2.1, determining whether the following two inequalities, Q (PS, D, AE) CI, are true-≤Q(PS,Di,AE).CI+,Q(PS,Di,AE).CI-≤Q(PS,D,AE).CI+If both are true, go to step S7.2.2; otherwise, go to step S7.3;
s7.2.2, determination of absolute values | C (G (D), AE) -C (G (D)i) AE) | is greater than θ, if yes, go to step S7.3, otherwise go to step S7.2.3;
s7.2.3, determining whether D is DiIf so, go to step S7.2.4, otherwise go to step S7.2.5;
s7.2.4, let C (G (D) -G (D)i),AE)=C(G(D),AE)-C(G(Di),AE),C(G(Di) -g (d), AE) ═ 0, go to step S7.2.7;
s7.2.5, determining whether | D | is equal to the medication combination signal length threshold MAX _ D, if yes, entering step S7.2.6; otherwise, go to step S7.3;
s7.2.6, traversing the SRS data, and counting to obtain C (G), (D) -G (D)i) AE) and C (G (D)i) -values of g (d), AE);
s7.2.7, judging whether the following two inequalities hold: c (G (D) -G (D)i),AE)<=θ,C(G(Di)-G(D),AE)<θ; if both are true, go to step S7.2.8; otherwise, go to step S7.3;
s7.2.8, D is equivalent to DiLet DG=DG∪{Di};
Step S7.2.1 is to make a decision first based on the equivalence relation decision (i), step S7.2.2 eliminates the special case condition that the equivalence relation decision (ii) that does not require querying for SRS data does not hold, and SRS data querying is avoided, and steps S7.2.3 to S7.2.6 show that when D is DiIs equal to the combination signal length threshold MAX _ D, and if one is true, the method proceeds to step S7.2.6, wherein steps S7.2.3-S7.2.4 determine (ii-1) based on the equivalence relation, steps S7.2.5-S7.2.6 avoid the SRS data query based on the equivalence propagation assumption and the equivalence combination assumption, and step S7.2.7 further determines (ii-2) based on the equivalence relationiEquivalent to D, the signal equivalent to D is added to the maximum equivalence set D of D in step S7.2.8G
S8, order XG=XG∪{DGReverting to step S2;
s9, outputting the maximum equivalence group set XG
The invention first takes the value D from X and solves the maximum equivalence group D of DGAdding XGThen, the value D is re-taken and X is traversedGMerging the maximal equivalent sets with intersection to solve the new maximal equivalent set D of DGIncorporating XGRepeating the steps in the cycle until each drug combination in the X has been valued, and obtaining the final maximum equivalent group set XG
For 1,530 signals obtained in the previous stage, the adverse reaction report number threshold θ is set to 4, and a total of 783 maximum equivalence sets are obtained through mining. Wherein, the maximum equivalence group containing 2 and more than 2 signals has 47, and the 47 maximum equivalence groups contain 794 signals in total, when each maximum equivalence group selects a representative signal for analysis, 794 signal analysis processes can be compressed into 47. From the perspective of a single maximum equivalence group, the maximum equivalence group containing the most signals contains 399 signals,one of the signals is { hydrocinnamatoxin, risperidone }, the RoR thereof02510.51, significantly above the signal decision threshold (δ 2). The signal has an equivalent relationship with 398 signals { levo-promazine, hydrocinnamatoxin }, { glutelin, levo-promazine }, etc., and the patient data of the 399 signals are nearly consistent, which also means that only a small number of positive signals exist. The pharmaceutical researchers need to take full consideration in experimental confirmation and mechanistic analysis of these signals, i.e. if one of the signals is selected for analysis, the other 398 signals are suggested as confounding factors.
The method can be used for analyzing the combined adverse reaction signals independently, and can also be used in cooperation with a strong signal screening method facing the SRS combined adverse reaction signals and/or an in-doubt signal mining method facing the SRS combined adverse reaction signals. The three methods are all used for analyzing the relation between signals, but the targets are respectively emphasized, strong signal screening is used for finding out a positive signal clue, doubtful signal mining is used for finding out a false positive signal clue, equivalence relation signal mining is used for finding out a signal clue with consistent data height, and the clues are respectively beneficial to improving the signal analysis quality. And (4) carrying out equivalent relation signal mining firstly, then carrying out strong signal screening and doubt signal mining execution. The equivalence relation signal mining method obtains a maximum equivalence set, only one signal in each maximum equivalence set needs to be selected to participate in subsequent analysis, and the calculation cost of the whole analysis process can be reduced.
Finally, it should be noted that the above-described method can be converted into software program instructions, and can be implemented by using a control system including a processor and a memory, or by using computer instructions stored in a non-transitory computer-readable storage medium. The integrated unit implemented in the form of a software functional unit may be stored in a computer readable storage medium. The software functional unit is stored in a storage medium and includes several instructions to enable a computer device (which may be a personal computer, a server, or a network device) or a processor (processor) to execute some steps of the methods according to the embodiments of the present invention. And the aforementioned storage medium includes: various media capable of storing program codes, such as a usb disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk, or an optical disk.
In conclusion, the equivalent signal mining method for the adverse reaction signal of the SRS combination drug has the following advantages:
(1) the invention first takes the value D from X and solves the maximum equivalence group D of DGAdding XGThen, the value D is re-taken and X is traversedGMerging the maximum equivalent sets with intersection, and solving the new maximum equivalent set D of DGIncorporating XGRepeating the steps in the cycle until each drug combination in the X has been valued, and obtaining the final maximum equivalent group set XG;XGEach maximum equivalence group in the drug combination adverse reaction signal set is a drug combination adverse reaction signal set with pairwise equivalence relation, and one signal is selected from the set according to an equivalence substitution assumption so as to represent the whole set to participate in other data analysis, thereby being beneficial to reducing the workload of signal analysis; when finer-grained data analysis is carried out on a specific signal, an equivalent signal of the signal can be used as a confounding factor, and the signal analysis quality is improved;
(2) step S7 of the present invention is to solve the maximum equivalence group D of D according to the equivalence relation determination methodGIn the process, Z is traversed, and the signals in which D is equivalent are added to the maximum equivalence group D of DGThe accuracy is high, and before the equivalence relation judging method is utilized, special conditions that equivalence relation judgment is not established without inquiring SRS data are eliminated, SRS data inquiry is avoided, and the calculation efficiency is improved;
(3) in step S7, the equivalence relation is determined by using the method of | D | and the size of the combined medication signal length threshold MAX _ D in combination with the equivalence transfer hypothesis and the equivalence combination hypothesis, so that the execution of equivalence relation determination is avoided, SRS data query is avoided, and the calculation efficiency is further improved;
(4) step S5 of the present invention at XGAfter non-empty set, traverse XGMerging and D according to the equal-cost transfer assumptionGMaximum equivalence with intersectionGroup to maximum equivalence group DGSo that and DGSignals contained in the maximum equivalent group with intersection do not need to be subjected to equivalent judgment one by one, so that the number of relation judgment times in the equivalent group is reduced, the calculation amount is reduced, and the analysis efficiency is improved.
Finally, it should be noted that: the above examples are only intended to illustrate the technical solution of the present invention, but not to limit it; although the embodiments of the present invention have been described in conjunction with the accompanying drawings, those skilled in the art may make various modifications and variations without departing from the spirit and scope of the invention, and such modifications and variations fall within the scope defined by the appended claims.

Claims (10)

1. An equivalent signal mining method for SRS combined adverse reaction signals is characterized by comprising the following steps:
s1, making the unprocessed set Z as X, and making the maximum equivalence set X of the combined drug adverse reaction signal set XGIn the empty set, the combined adverse reaction signal set X acquires data and preprocesses the data through an SRS (sounding reference signal) of a drug adverse event spontaneous presentation system, and then the combined adverse reaction signal set X ═ { D ═ is obtained by mining1,D2,…,Dx},DxA drug combination for determining an adverse reaction signal;
s2, judging whether Z is an empty set, if so, entering a step S9, and if not, entering a step S3;
s3, priority processing the signal with the smallest size of the drug combination set, i.e. order processing the signal
Figure FDA0002788021490000011
If | represents the size of the set, the unprocessed set Z ═ Z- { D }, and D is initializedG={D};
S4, judgment XGIf the current set is an empty set, if so, the step S6 is executed, otherwise, the step S5 is executed;
s5, traverse XGCombining with DGThere is a maximum equivalence set to a maximum equivalence set D of the intersectionG
S6, judging whether Z is an empty set, if so, entering a step S8, and if not, entering a step S7;
s7, traversing Z, adding the signals equivalent to D into the maximum equivalent group D of DG
S8, order XG=XG∪{DGReverting to step S2;
s9, outputting the maximum equivalence group set XGThe maximum equivalence group is a set of the maximum equivalence groups of the combined adverse reaction signals, each maximum equivalence group is a set of the combined adverse reaction signals with pairwise equivalence relation, and one signal, namely one drug combination, is selected from each maximum equivalence group, so that the maximum equivalence group can be represented to participate in the data analysis of the combined adverse reaction.
2. The SRS combination adverse drug reaction signal-oriented equivalent signal mining method according to claim 1, wherein the data is obtained from demographic information, medication information, adverse drug reaction information of a spontaneous report system SRS for adverse drug events.
3. The method for mining an equivalent signal for an adverse reaction signal to SRS combination according to claim 1, wherein the method for determining the drug combination for an adverse reaction signal comprises: if the frequency of the target adverse reaction signals AE reported by the patients taking the medicine combination D is greater than the support threshold; the length of the drug combination D is not more than the drug combination signal length threshold MAX _ D; d obtained from SRS data of PS patient set is used as lower bound Q (PS, D, AE) CI of signal intensity confidence interval of target adverse reaction signal AE->A signal strength threshold δ; d is determined to be the drug combination of the targeted adverse reaction signal AE.
4. The method as claimed in claim 3, wherein Q (PS, D, AE) is a signal metric function of the non-equilibrium metric class, and the metric methods adopted include RR, PRR, RoR, BCPNN, MGPS.
5. The SRS combination adverse reaction signal-oriented equivalence signal mining method according to claim 1, wherein the preprocessing comprises de-duplicating data and normalizing drug names.
6. The method for mining an equivalent signal to an SRS combined adverse drug reaction signal according to claim 1, wherein the step S5 includes the steps of:
s5.1, traversing XGSelect not traversed member Dk GJudgment of Dk GAnd DGIf not, go to step S5.2, if yes, DG=DG∪Dk G,XG=XG-{Dk GStep S5.2;
s5.2, judging and traversing XGIf not, the process goes to step S6, otherwise, the process goes to step S5.1.
7. The method for mining an equivalent signal to an SRS combined adverse drug reaction signal according to claim 1, wherein the step S7 includes the steps of:
s7.1, selecting non-traversed member D of Zi
S7.2, judging whether D is equivalent to DiIf yes, let DG=DG∪{DiStep 7.3, if not, step 7.3 is carried out;
s7.3, judging whether the traversal Z is finished, if not, entering the step S7.1, and if so, indicating DGAfter the solution is completed, the process proceeds to step S8.
8. The SRS combination adverse reaction signal-oriented equivalent signal mining method according to claim 7, wherein in step S7.2, whether D is equivalent to D is judgediThe method comprises the following steps:
s7.2.1, and Q (PS, D, AE) CI-≤Q(PS,Di,AE).CI+And Q (PS, D)i,AE).CI-≤Q(PS,D,AE).CI+If yes, go to step S7.2.2; otherwise, go to step S7.3; wherein Q (e.,. e.) represents a signal metric function, Q (PS, D, AE)+,Q(PS,D,AE).CI-Representing the upper and lower bounds of a signal intensity confidence interval of a target adverse reaction signal AE, which is D obtained according to SRS data of a patient set PS;
s7.2.2, determination of absolute values | C (G (D), AE) -C (G (D)i) AE) | is greater than θ, if yes, go to step S7.3, otherwise go to step S7.2.3; wherein G (·) represents a medication case function, C (,) represents an adverse reaction report statistical function, C (G (D), AE) represents the number of reports of adverse reactions AE from a case set G (D) taking a drug combination D, and θ represents an adverse reaction report number threshold;
s7.2.3, determining whether D is DiIf so, go to step S7.2.4, otherwise go to step S7.2.5;
s7.2.4, let C (G (D) -G (D)i),AE)=C(G(D),AE)-C(G(Di),AE),C(G(Di) -g (d), AE) ═ 0, go to step S7.2.6;
s7.2.5, determining whether | D | is equal to the medication combination signal length threshold MAX _ D, if yes, entering step S7.2.6; otherwise, go to step S7.3;
s7.2.6, traversing the SRS data, and counting to obtain C (G), (D) -G (D)i) AE) and C (G (D)i) -values of g (d), AE);
s7.2.7, judging C (G), (D) -G (D)i),AE)<θ and C (G (D)i)-G(D),AE)<If θ is true, the routine proceeds to step S7.2.8; otherwise, go to step S7.3;
s7.2.8, D is equivalent to DiLet DG=DG∪{Di}。
9. A server, comprising:
at least one processor; and at least one memory communicatively coupled to the processor, wherein:
the memory stores program instructions executable by the processor, the processor invoking the program instructions to perform the equivalent signal mining method for SRS combination adverse reaction oriented signals as claimed in any one of claims 1 to 8.
10. A non-transitory computer-readable storage medium storing computer instructions for causing a computer to perform the method of equivalent signal mining for SRS combination adverse reaction signals according to any one of claims 1 to 8.
CN202011304774.5A 2020-11-19 2020-11-19 Equivalent signal mining method for SRS combined adverse reaction signals Active CN112365989B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011304774.5A CN112365989B (en) 2020-11-19 2020-11-19 Equivalent signal mining method for SRS combined adverse reaction signals

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011304774.5A CN112365989B (en) 2020-11-19 2020-11-19 Equivalent signal mining method for SRS combined adverse reaction signals

Publications (2)

Publication Number Publication Date
CN112365989A CN112365989A (en) 2021-02-12
CN112365989B true CN112365989B (en) 2022-05-03

Family

ID=74533346

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011304774.5A Active CN112365989B (en) 2020-11-19 2020-11-19 Equivalent signal mining method for SRS combined adverse reaction signals

Country Status (1)

Country Link
CN (1) CN112365989B (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2722395A1 (en) * 2001-10-15 2014-04-23 Bioarray Solutions Ltd Multiplexed analysis of polymorphic loci by concurrent interrogation and enzyme-mediated detection
CN110767319A (en) * 2019-09-30 2020-02-07 南京邮电大学 Method for detecting adverse reaction signals of combined medication

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP2722395A1 (en) * 2001-10-15 2014-04-23 Bioarray Solutions Ltd Multiplexed analysis of polymorphic loci by concurrent interrogation and enzyme-mediated detection
CN110767319A (en) * 2019-09-30 2020-02-07 南京邮电大学 Method for detecting adverse reaction signals of combined medication

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
基于国家药品不良反应监测自发呈报系统的心元胶囊不良反应风险信号分析;王连心等;《中国中药杂志》;20200531(第10期);第96-101页 *
联合用药不良反应信号检测研究进展;唐利等;《华西医学》;20151213(第03期);第28-33页 *
西格列汀上市后致皮肤不良反应的信号挖掘与评价;彭媛等;《中国药房》;20140115(第02期);第66-68页 *

Also Published As

Publication number Publication date
CN112365989A (en) 2021-02-12

Similar Documents

Publication Publication Date Title
US10347365B2 (en) Systems and methods for visualizing a pattern in a dataset
US11954614B2 (en) Systems and methods for visualizing a pattern in a dataset
JP2017224283A (en) Big data analytical method and mass analytical system using the same
CN105917336A (en) Database key identification
US20210090686A1 (en) Single cell rna-seq data processing
CN116364182A (en) Integrated analysis method for single cell transcriptome and TCR and BCR sequencing data
Hosseinpoor et al. Gene regulatory elements extraction in breast cancer by Hi-C data using a meta-heuristic method
CN112365989B (en) Equivalent signal mining method for SRS combined adverse reaction signals
CN107273718A (en) A kind of Analysis of Survival Time method of fusion dna methylation characteristic
CN112365990B (en) Strong signal screening method for adverse reaction signals of SRS combined medication
KR20040093743A (en) Method of visualizing non-targeted metabolomic data generated from fourier transform ion cyclotron resonance mass spectrometers
JP2016014944A (en) Correlation rule analysis device and correlation rule analysis method
CN112365991B (en) Method for mining doubt signal facing SRS combined adverse reaction signal
Egho et al. Healthcare trajectory mining by combining multidimensional component and itemsets
López-Oriona et al. Analyzing categorical time series with the R package ctsfeatures
KR20190126606A (en) IDENTIFYING METHOD FOR TUMOR PATIENT BASED ON miRNA IN EXOSOME AND APPARATUS FOR THE SAME
US12079992B1 (en) Utilizing machine learning and digital embedding processes to generate digital maps of biology and user interfaces for evaluating map efficacy
Kemmo Tsafack et al. Meta-Analytic Gene-Clustering Algorithm for Integrating Multi-Omics and Multi-Study Data
US20230212560A1 (en) Systems, methods, and media for determining relative quality of oligonucleotide preparations
WO2023238439A1 (en) Analysis device, analysis method, and analysis program
EP4404106A1 (en) Analysis device, analysis method, and analysis program
Ettetuani et al. Meta-analysis for a therapeutic target involved in the activation of the genes associated with c3 glomerulopathy
Liu et al. Construction of a prognostic model of lung adenocarcinoma based on machine learning
Esteves et al. Recommendation engine for optimal Spark delta tables partitioning
Triananda et al. Use of Deep Learning to Identify COVID-19 through DNA/RNA Sequencing

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant