
CN111493935B - Artificial intelligence-based automatic prediction and identification method and system for echocardiogram - Google Patents

Artificial intelligence-based automatic prediction and identification method and system for echocardiogram

Info

Publication number
CN111493935B
CN111493935B
Authority
CN
China
Prior art keywords
video
color doppler
video frame
frame
neural network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010353559.8A
Other languages
Chinese (zh)
Other versions
CN111493935A (en)
Inventor
何昆仑
杨菲菲
刘博罕
王秋霜
李宗任
陈煦
郭华源
张璐
邓玉娇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Chinese PLA General Hospital
Original Assignee
Chinese PLA General Hospital
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Chinese PLA General Hospital filed Critical Chinese PLA General Hospital
Priority to CN202010353559.8A priority Critical patent/CN111493935B/en
Publication of CN111493935A publication Critical patent/CN111493935A/en
Application granted granted Critical
Publication of CN111493935B publication Critical patent/CN111493935B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Classifications

    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B8/00Diagnosis using ultrasonic, sonic or infrasonic waves
    • A61B8/08Detecting organic movements or changes, e.g. tumours, cysts, swellings
    • A61B8/0883Detecting organic movements or changes, e.g. tumours, cysts, swellings for diagnosis of the heart
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B8/00Diagnosis using ultrasonic, sonic or infrasonic waves
    • A61B8/48Diagnostic techniques
    • A61B8/488Diagnostic techniques involving Doppler signals
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B8/00Diagnosis using ultrasonic, sonic or infrasonic waves
    • A61B8/52Devices using data or image processing specially adapted for diagnosis using ultrasonic, sonic or infrasonic waves
    • AHUMAN NECESSITIES
    • A61MEDICAL OR VETERINARY SCIENCE; HYGIENE
    • A61BDIAGNOSIS; SURGERY; IDENTIFICATION
    • A61B8/00Diagnosis using ultrasonic, sonic or infrasonic waves
    • A61B8/52Devices using data or image processing specially adapted for diagnosis using ultrasonic, sonic or infrasonic waves
    • A61B8/5207Devices using data or image processing specially adapted for diagnosis using ultrasonic, sonic or infrasonic waves involving processing of raw data to produce diagnostic data, e.g. for generating an image

Landscapes

  • Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Engineering & Computer Science (AREA)
  • Medical Informatics (AREA)
  • Surgery (AREA)
  • Pathology (AREA)
  • Radiology & Medical Imaging (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Heart & Thoracic Surgery (AREA)
  • Physics & Mathematics (AREA)
  • Molecular Biology (AREA)
  • Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
  • Animal Behavior & Ethology (AREA)
  • General Health & Medical Sciences (AREA)
  • Public Health (AREA)
  • Veterinary Medicine (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Cardiology (AREA)
  • Ultra Sonic Diagnosis Equipment (AREA)
  • Image Processing (AREA)

Abstract

The application discloses an artificial intelligence-based automatic prediction and identification method and system for echocardiograms, wherein the method comprises the following steps: acquiring a color Doppler video of at least one section of an echocardiogram of a detected object; extracting each video frame in the color Doppler video and inputting each video frame into a trained convolutional neural network to obtain an N-dimensional feature vector corresponding to each video frame; generating, through an attention module, the weight corresponding to each video frame from its N-dimensional feature vector; calculating a weighted sum of the N-dimensional feature vectors of the video frames using these weights to obtain an overall feature representation of the color Doppler video; and calculating, based on the overall feature representation, a predicted value that the video contains the pre-identified image feature. With this method, whether the image feature to be identified is present in the echocardiogram can be accurately predicted.

Description

Artificial intelligence-based automatic prediction and identification method and system for echocardiogram
Technical Field
The application relates to the technical field of intelligent identification of medical video images, and in particular to an artificial intelligence-based automatic prediction and identification method and system for echocardiograms.
Background
Echocardiography is currently one of the most important methods for assessing cardiac structure and function. To identify heart valve regurgitation features in an echocardiogram, the physician typically checks for abnormal color flow by visual inspection. This judgment is susceptible to the velocity range and color gain settings, so the severity of the feature may be overestimated or underestimated, making visual inspection unsuitable for accurate assessment of valve regurgitation. Methods such as the vena contracta method and continuous-wave Doppler can quantitatively analyze the degree of heart valve regurgitation, but echocardiography shows marked inter-individual differences in image acquisition, measurement, analysis, and judgment, and is strongly influenced by the experience and skill of the medical staff, so the accuracy and consistency of the examination are difficult to guarantee, which often creates great difficulty for clinical identification.
Compared with manual operation by specialist physicians, artificial intelligence technology has great advantages in the automatic measurement, analysis, and interpretation of cardiac ultrasound images. It can standardize data analysis and identification, thereby eliminating the interference of subjective human factors, reducing inter- and intra-observer variability in cardiac ultrasound judgment, and improving the accuracy and consistency of ultrasound image interpretation. Artificial intelligence also makes cardiac ultrasound more efficient and better suited to practical clinical needs, greatly improving the efficiency of the medical system and reducing medical costs and the burden on families, society, and the economy.
In recent years, processing medical images with artificial intelligence has become a research hotspot, but most existing work adopts traditional machine learning algorithms such as decision trees, clustering, Bayesian classification, support vector machines, and EM. These algorithms do not use the dynamic video of the echocardiogram; they classify only randomly extracted ultrasound images, so the motion information of the heart is lost. Color Doppler video contains the flow-direction information of blood within the heart, which aids the identification of valve regurgitation; if this information is ignored, the classification is not accurate enough and the practical effect is mediocre.
Disclosure of Invention
In view of the above defects or shortcomings in the prior art, the present application provides an artificial intelligence-based automatic prediction and identification method and system for echocardiograms. The method adopts a deep learning model designed specifically for the echocardiogram, automatically predicts the pre-identified feature in the echocardiogram, and outputs the video frame most relevant to that feature, thereby greatly improving the accuracy of artificial intelligence processing of echocardiograms, meeting the clinical requirements of the medical system, and reducing the workload of medical staff.
A first aspect of the invention provides an artificial intelligence-based automatic prediction and identification method for echocardiograms, comprising the following steps:
acquiring a color Doppler video of at least one section in an echocardiogram of a detected object;
extracting each video frame in the color Doppler video, and inputting each video frame into a trained convolutional neural network to obtain an N-dimensional feature vector corresponding to each video frame;
generating the weight corresponding to each video frame by the N-dimensional feature vector of each video frame through an attention module;
calculating a weighted sum of the N-dimensional feature vectors of each video frame by using the weights to obtain an overall feature representation of the color Doppler video;
and calculating a predicted value of the color Doppler video containing the pre-identified image characteristics based on the overall characteristic representation.
Further, the method also comprises the following steps: and outputting the frame with the maximum weight as a key frame.
Further, the at least one slice comprises an apical four-chamber cardiac slice.
Further, the pre-identified image is characterized by valve regurgitation.
Further, the method also comprises the following steps:
measuring the relative area of the left atrial region in the keyframe;
measuring the relative area of the regurgitation stream within the left atrial region in said keyframe;
calculating a ratio of the relative area of the regurgitation stream to the relative area of the left atrium.
Further, before inputting each video frame to the pre-trained convolutional neural network, the method further comprises: and inputting the color Doppler video of at least one section of the echocardiogram containing the pre-identified image characteristics into a pre-trained convolutional neural network as a training sample, and training the pre-trained convolutional neural network.
Further, the method also comprises the following steps:
function of current loss
Figure 100002_DEST_PATH_IMAGE002
No longer decreases, or loss functions
Figure 100002_DEST_PATH_IMAGE002A
Stopping the training of the convolutional neural network when the value of (a) is lower than a predetermined value;
said loss function
Figure 100002_DEST_PATH_IMAGE002AA
Expressed as:
Figure DEST_PATH_IMAGE004
wherein, the
Figure 100002_DEST_PATH_IMAGE006
Representing a classification loss calculated at the video level, said
Figure 100002_DEST_PATH_IMAGE008
Represents the sparse loss used to adjust the weight of each video frame,
Figure 100002_DEST_PATH_IMAGE010
is composed of
Figure DEST_PATH_IMAGE011
A normalization constant of (d);
the above-mentioned
Figure DEST_PATH_IMAGE006A
Calculated according to the following formula:
Figure 100002_DEST_PATH_IMAGE013
wherein,
Figure 100002_DEST_PATH_IMAGE015
indicating whether the nth color Doppler video contains the pre-identified image characteristicsN is (1, 2, … … N), N is the total number of color Doppler videos, if
Figure 100002_DEST_PATH_IMAGE017
Indicating that the nth color Doppler video does not contain the pre-identified image features; if it is
Figure 100002_DEST_PATH_IMAGE019
Indicating that the nth color Doppler video contains the pre-identified image characteristics;
Figure 100002_DEST_PATH_IMAGE021
the predicted value of the characteristic of the pre-identified image contained in the nth color Doppler video is shown;
Figure 100002_DEST_PATH_IMAGE023
for the global characterization of the nth color doppler video,
Figure 100002_DEST_PATH_IMAGE025
then is
Figure DEST_PATH_IMAGE021A
And
Figure DEST_PATH_IMAGE015A
cross entropy of (d);
the above-mentioned
Figure DEST_PATH_IMAGE011A
Calculated according to the following formula:
Figure 100002_DEST_PATH_IMAGE027
wherein,
Figure 100002_DEST_PATH_IMAGE029
,
Figure 100002_DEST_PATH_IMAGE031
representing the t-th in the n-th color Doppler videoThe weight of the video frame, T is (1, 2, … …, T), T is the frame number of the video; the value range of N is (1, 2, … … N), and N is the total number of color Doppler videos.
A second aspect of the present invention provides an artificial intelligence-based automatic prediction and identification system for echocardiograms, comprising:
the video acquisition module is used for acquiring a color Doppler video of at least one section in an echocardiogram of the detected object;
the input extraction module is used for extracting each video frame in the color Doppler video and inputting each video frame to a trained convolutional neural network so as to obtain an N-dimensional feature vector corresponding to each video frame;
the weight generating module is used for generating the weight corresponding to each video frame by the N-dimensional characteristic vector of each video frame through the attention module;
the overall characteristic calculation module is used for calculating the weighted sum of the N-dimensional characteristic vectors of each video frame by using the weights so as to obtain the overall characteristic representation of the color Doppler video;
and the prediction output module is used for calculating a prediction value of the color Doppler video containing the pre-identified image characteristics based on the overall characteristic representation.
Further, the method also comprises the following steps: and the key frame output module is used for outputting the frame with the maximum weight as a key frame.
Further, the at least one section comprises an apical four-chamber section; the pre-identified image feature is valve regurgitation; the system further comprises: a pre-identified image feature measurement module for measuring the relative area of the left atrial region in the key frame; measuring the relative area of the regurgitation stream within the left atrial region in said key frame; and calculating a ratio of the relative area of the regurgitation stream to the relative area of the left atrium.
In summary, the artificial intelligence-based automatic prediction and identification method and system for echocardiograms of the invention identify important cardiac ultrasound image features from a video by identifying the group of video frames in which those features appear; based on a new deep neural network, they learn how to measure the importance of each frame in the video and automatically select a sparse subset of representative frames to predict the video-level classification. The method and system can accurately identify the mitral regurgitation image feature in an echocardiogram, and can equally be used to accurately identify other cardiac ultrasound image features, such as tricuspid regurgitation, aortic regurgitation, pulmonary regurgitation, atrial septal defect, ventricular septal defect, and patent ductus arteriosus.
Drawings
Other features, objects and advantages of the present application will become more apparent upon reading of the following detailed description of non-limiting embodiments thereof, made with reference to the accompanying drawings in which:
FIG. 1 is a flow chart of an artificial intelligence-based echocardiography automatic prediction identification method according to an embodiment of the present invention;
FIG. 2 is a flowchart of an echocardiogram automatic prediction identification method with a key frame output function according to another embodiment of the present invention;
fig. 3 is a flowchart of an echocardiogram automatic prediction identification method with a saliency estimation function according to another embodiment of the present invention;
FIG. 4 is a flow chart of a method for automated predictive identification of an echocardiogram with a pre-training function according to another embodiment of the present invention;
FIG. 5 is a functional block diagram of an echocardiographic automatic predictive identification system according to another embodiment of the present invention;
fig. 6 is a block diagram of an electronic device according to another embodiment of the present invention;
fig. 7 is a component assembly diagram of an electronic device according to another embodiment of the invention.
Detailed Description
The present application will be described in further detail with reference to the following drawings and examples. It is to be understood that the specific embodiments described herein are merely illustrative of the relevant invention and not restrictive of the invention. It should be noted that, for convenience of description, only the portions related to the present invention are shown in the drawings.
It should be noted that the embodiments and features of the embodiments in the present application may be combined with each other without conflict. The present application will be described in detail below with reference to the embodiments with reference to the attached drawings.
The echocardiograms described herein are ultrasound images that examine the anatomical and functional state of the heart and great vessels using the special physical characteristics of ultrasound.
Referring to fig. 1, an artificial intelligence-based echocardiogram automatic prediction and identification method according to an embodiment of the present invention is shown. This embodiment takes the prediction and identification of the mitral regurgitation feature in an echocardiogram as an example, but the method is equally applicable to the prediction and identification of other cardiac blood-flow features in ultrasound images, such as tricuspid regurgitation, aortic regurgitation, pulmonary regurgitation, atrial septal defect, ventricular septal defect, and patent ductus arteriosus.
Step S101, acquiring a color Doppler video of at least one section in an echocardiogram of a detected object.
The most common Doppler ultrasound techniques are pulsed-wave Doppler, continuous-wave Doppler, and color Doppler flow imaging. Color Doppler is an area-display visualization technique in which many beams are transmitted into, and received back from, the same region. In color Doppler flow imaging, blood flowing towards the probe is shown in red and blood flowing away from the probe in blue. Flow through the orifices of the heart and the great vessels is normally a single antegrade stream; once retrograde flow appears, an abnormality should be considered.
Specifically, during the ultrasound examination, original echocardiogram images containing a plurality of sequences can be obtained for different postures of the detected object and different sections of the heart, each sequence corresponding to one section of the examination.
In this embodiment, a color Doppler video related to at least one section of the echocardiogram (for example, the video of one section, or a combination of the videos of several sections) is first acquired; the color Doppler video consists of multiple video frames and serves as the input of the deep learning algorithm module. Video is used instead of a single image as input because video provides more information in the time dimension, including the frame-to-frame variation; a single image loses the motion information of the heart, which tends to make the classification inaccurate. In addition, color Doppler video displays the direction of blood flow through the valve, making the feature more evident and further improving the classification accuracy of the algorithm module.
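Step S101 can be illustrated with a short, hedged sketch. The Python code below reads every frame of a color Doppler clip with OpenCV; the file path, frame size, and RGB conversion are illustrative assumptions, since the embodiment does not specify a container format or resolution.

```python
# A minimal sketch of the video-input step (S101), assuming the clip is stored
# as an ordinary video file readable by OpenCV.
import cv2
import numpy as np

def load_video_frames(path: str, size=(224, 224)) -> np.ndarray:
    """Return all T frames of a color Doppler clip as an array (T, H, W, 3)."""
    cap = cv2.VideoCapture(path)
    frames = []
    while True:
        ok, frame = cap.read()
        if not ok:                                      # end of stream
            break
        frame = cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)  # OpenCV decodes as BGR
        frames.append(cv2.resize(frame, size))
    cap.release()
    if not frames:
        raise ValueError(f"no frames decoded from {path}")
    return np.stack(frames)                             # (T, H, W, 3)
```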
Step S102, extracting each video frame in the color Doppler video, and inputting each video frame into a trained convolutional neural network to obtain an N-dimensional feature vector corresponding to each video frame.
Specifically, the color Doppler video consists of a group of video frames, each of which can be understood as one frame of ultrasound image. All T video frames of a single video are first extracted and then input frame by frame into the trained convolutional neural network. This embodiment does not restrict the type of convolutional neural network: for example, an I3D network, which combines a two-stream network with 3D convolution, may be used, or a ResNet residual network.
After passing through the ResNet network, each video frame yields an N-dimensional feature vector $f_t$, where $t$ is the index of the video frame, $t$ ranges over (1, 2, ..., T), and T is the number of frames in the video. The dimension N determines how much information the vector carries about the corresponding frame. N should be neither too small nor too large: too small a setting causes the vector to carry too little characteristic information, while too large a setting causes it to carry too much useless information and waste computational resources. In this embodiment N is preferably 1024; in practice, an appropriate value of N can be found through repeated manual trials.
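As one concrete but non-authoritative reading of step S102, the sketch below uses a torchvision ResNet-50 with its classification head removed, followed by a linear projection from the backbone's 2048-dimensional output down to N = 1024. The projection layer is this sketch's own addition; the embodiment only states that each frame is mapped to an N-dimensional vector, with N preferably 1024.

```python
# A sketch of step S102 under stated assumptions: ResNet-50 backbone + linear
# projection to N = 1024 (the projection is an assumption of this sketch).
import torch
import torch.nn as nn
from torchvision.models import resnet50

class FrameEncoder(nn.Module):
    def __init__(self, feat_dim: int = 1024):
        super().__init__()
        backbone = resnet50(weights=None)  # pretrained weights are optional here
        self.features = nn.Sequential(*list(backbone.children())[:-1])  # drop fc
        self.project = nn.Linear(2048, feat_dim)       # 2048 -> N

    def forward(self, frames: torch.Tensor) -> torch.Tensor:
        # frames: (T, 3, H, W) -> per-frame feature vectors f_t: (T, N)
        x = self.features(frames).flatten(1)           # (T, 2048)
        return self.project(x)                         # (T, N)
```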
Step S103, generating the weight corresponding to each video frame by the attention module according to the N-dimensional feature vector of each video frame.
In clinical examination, often only a few key frames in a color Doppler video contain the characteristic information indicating whether a sample shows regurgitation. Whether a video frame carries such characteristic information, and how much of it, determines the weight of that frame; $\alpha_t$ denotes the weight of the t-th frame, where $t$ ranges over (1, 2, ..., T) and T is the number of frames in the video.
The weights are obtained by feeding the N-dimensional feature vectors $f_t$ output by the ResNet network into an attention module. The attention module applies the attention model from deep learning, which imitates the human brain: when looking at a picture, although the whole picture is visible, a close and careful observer focuses the eyes on only a small patch, and the brain attends mainly to that patch; the brain's attention over the whole picture is not uniform but is distributed with different weights. After the attention module's computation, each N-dimensional feature vector $f_t$ yields the weight $\alpha_t$ of the corresponding video frame. Since concrete implementations of attention modules are widely used in deep learning models, the details are omitted in this embodiment.
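Since the embodiment leaves the internals of the attention module unspecified, the following is only one plausible minimal sketch: a small scoring MLP followed by a per-frame sigmoid. A sigmoid is used here instead of a softmax over frames so that the L1 sparsity loss described later stays meaningful — most weights can be driven towards 0 while a few key frames approach 1.

```python
# One plausible minimal attention module for step S103 (internals are this
# sketch's assumption, not the patent's specification).
import torch
import torch.nn as nn

class FrameAttention(nn.Module):
    def __init__(self, feat_dim: int = 1024, hidden: int = 256):
        super().__init__()
        self.score = nn.Sequential(
            nn.Linear(feat_dim, hidden),
            nn.Tanh(),
            nn.Linear(hidden, 1),
        )

    def forward(self, f: torch.Tensor) -> torch.Tensor:
        # f: (T, N) per-frame features -> alpha: (T,) weights in (0, 1)
        s = self.score(f).squeeze(-1)  # (T,) unnormalized frame scores
        return torch.sigmoid(s)        # each frame weighted independently
```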
And step S104, calculating the weighted sum of the N-dimensional feature vectors of each video frame by using the weights so as to obtain the overall feature representation of the color Doppler video.
Specifically, each video frame corresponds to a 1024-dimensional feature vector $f_t$ that represents that frame. The overall feature representation of the video is calculated as the weighted sum of the per-frame feature vectors $f_t$ with the per-frame weights $\alpha_t$:
$\bar{f} = \sum_{t=1}^{T} \alpha_t f_t$
wherein $\bar{f}$ is the overall feature representation of the video, $f_t$ is the video feature representation of the t-th frame, $\alpha_t$ is the weight of the t-th frame, and T is the number of frames of the video.
Step S105, calculating, based on the overall feature representation, a predicted value that the video contains the pre-identified image feature, through an FC fully connected layer and a sigmoid activation function.
After the overall feature representation $\bar{f}$ of the video is obtained, it passes through two layers, an FC fully connected layer followed by a sigmoid activation function, to produce the final predicted value $\hat{y}$:
$\hat{y} = \mathrm{sigmoid}(\mathrm{FC}(\bar{f}))$
The value of $\hat{y}$ lies between 0 and 1 and represents the probability that the mitral regurgitation feature is present in the sample video.
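Steps S104 and S105 can be sketched together: the attention-weighted sum of the frame features, then one FC layer and a sigmoid. The layer size follows the N = 1024 choice above; `FrameEncoder` and `FrameAttention` refer to the earlier illustrative sketches, not to anything named in the patent.

```python
# A joint sketch of steps S104 and S105, under the assumptions stated above.
import torch
import torch.nn as nn

class VideoHead(nn.Module):
    def __init__(self, feat_dim: int = 1024):
        super().__init__()
        self.fc = nn.Linear(feat_dim, 1)

    def forward(self, f: torch.Tensor, alpha: torch.Tensor) -> torch.Tensor:
        # f: (T, N) frame features, alpha: (T,) frame weights
        f_bar = (alpha.unsqueeze(-1) * f).sum(dim=0)      # overall representation
        return torch.sigmoid(self.fc(f_bar)).squeeze(-1)  # P(feature present)
```

The same `alpha` also yields the key frame described next: `int(alpha.argmax())` is the index of the most heavily weighted frame.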
referring to fig. 2, it is preferable to further include on the basis of the embodiment shown in fig. 1:
Step S106, outputting the frame with the largest weight $\alpha_t$ as the key frame.
The video frame with the largest weight $\alpha_t$ is the frame that displays the mitral regurgitation feature most clearly, i.e. the frame contributing most to the prediction. Automatically outputting this key frame onto the ultrasound report spares the physician the step of selecting a screenshot and improves the efficiency of generating the ultrasound report.
Further, the at least one section in the above embodiments preferably comprises the apical four-chamber (A4C) view. The A4C view is a standard view in clinical echocardiography and the most widely used one; image features such as mitral regurgitation can be identified more accurately through it.
Referring to fig. 3, preferably, after the embodiment shown in fig. 1 has identified whether the pre-identified image feature exists in the color Doppler video of the detected object, the method further includes:
in step S107, the degree of saliency of the pre-identified image features is identified.
Taking the identification of the mitral regurgitation feature as an example, the specific method is to measure, on the key frame where mitral regurgitation is most evident, the ratio of the regurgitation area to the left atrial area, and to output this ratio to the ultrasound report.
First, the left atrial area is measured. Typically, hundreds of groups of delineated echocardiograms are available, carefully delineated in advance by annotators (a delineation is a hand-drawn boundary around a key structure of the heart, such as the left atrium). These data are fed into the neural network for learning, so that the model learns to identify the left atrium automatically. The relative area of the left atrium is then obtained by counting the number of pixels inside the delineated region.
Next, the area of the regurgitation stream is measured. Because color Doppler ultrasound is used, the regurgitant jet is usually rendered blue (non-regurgitant blood is usually red), so the relative area of the regurgitation stream can be obtained simply by counting the blue pixels inside the left atrium.
Finally, the ratio of the relative area of the regurgitation stream to the relative area of the left atrium is calculated; this ratio gives the degree of saliency of the regurgitation image feature in the ultrasound video.
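To make the measurement concrete, here is a minimal sketch under stated assumptions: the left-atrium delineation is available as a binary mask, and a "blue" Doppler pixel is decided by a crude channel comparison. Real color Doppler colormaps vary by vendor, so the blue test and its threshold are illustrative only.

```python
# A hedged sketch of the saliency measurement in step S107.
import numpy as np

def regurgitation_ratio(frame: np.ndarray, la_mask: np.ndarray) -> float:
    """frame: (H, W, 3) RGB key frame; la_mask: (H, W) boolean left-atrium mask."""
    r = frame[..., 0].astype(int)
    g = frame[..., 1].astype(int)
    b = frame[..., 2].astype(int)
    blue = (b > r) & (b > g) & (b > 60)     # crude "blue pixel" test
    la_area = int(la_mask.sum())            # relative LA area, in pixels
    jet_area = int((blue & la_mask).sum())  # blue pixels inside the LA
    return jet_area / la_area if la_area else 0.0
```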
Referring to fig. 4, before the method shown in fig. 1 is performed, the method preferably further comprises
Step S100, inputting a color Doppler video of at least one section in an echocardiogram containing pre-recognition image characteristics into a pre-trained convolutional neural network as a training sample, and training the pre-trained convolutional neural network.
Training the model is in fact a process of continually adjusting parameters so that the prediction results improve, with a loss function serving as the reference; the loss function evaluates the gap between the model's predicted values and the true values. The neural network model here has two main tasks: accurately judging whether a sample echocardiogram contains the pre-identified image feature, and outputting the video key frame that contributes most to that judgment. The corresponding loss function $L$ is therefore designed in two parts, a classification loss $L_c$ and a sparse loss $L_s$.
$L_c$ represents the classification loss calculated at the video level:
$L_c = -\frac{1}{N}\sum_{n=1}^{N}\big[p_n \log \hat{y}_n + (1-p_n)\log(1-\hat{y}_n)\big]$
wherein $p_n$ indicates whether the n-th color Doppler video contains the pre-identified image feature, the value range of n is (1, 2, ..., N), and N is the total number of color Doppler videos; $p_n = 0$ indicates that the n-th color Doppler video does not contain the pre-identified image feature; $p_n = 1$ indicates that the n-th color Doppler video contains the pre-identified image feature; $\hat{y}_n$ is the predicted value that the n-th color Doppler video contains the pre-identified image feature, computed from $\bar{f}_n$, the overall feature representation of the n-th color Doppler video; $L_c$ is then the cross entropy of $\hat{y}_n$ and $p_n$.
The sparse loss $L_s$ adjusts the weights $\alpha_t^{n}$ of individual video frames within a video. In clinical examinations, often only a few key frames show the regurgitation of the sample, so only a few of all the frames are important; that is, only a few frames have weights close to 1 and the remaining frames have weights close to 0. In other words, the frame weights $\alpha_t^{n}$ are sparse. This embodiment uses the L1 norm to measure the sparsity of the whole set of weights: by its mathematical definition, the L1 norm is the sum of the absolute values of the elements of a vector, and the smaller the L1 norm, the sparser the elements as a whole. $L_s$ is calculated according to the following formula:
$L_s = \frac{1}{N}\sum_{n=1}^{N}\lVert \alpha^{n}\rVert_1 = \frac{1}{N}\sum_{n=1}^{N}\sum_{t=1}^{T}\lvert \alpha_t^{n}\rvert$
wherein $\alpha_t^{n}$ represents the weight of the t-th video frame in the n-th color Doppler video, the value range of t is (1, 2, ..., T), and T is the number of frames of the video; the value range of n is (1, 2, ..., N), and N is the total number of color Doppler videos.
The loss function $L$ used during training in this embodiment is expressed as:
$L = L_c + \beta L_s$
wherein $\beta$ is a normalization constant for $L_s$. In the actual training process, the training samples for the neural network can be reused cyclically; when the set number of cycles is reached and the value of the loss function $L$ falls below a predetermined value, or the value of $L$ no longer decreases, training of the convolutional neural network can be stopped.
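The two-part loss above translates almost directly into code. The sketch below computes the video-level binary cross entropy plus the β-weighted L1 sparsity term on the frame weights; β = 0.1 is an illustrative value, since the embodiment only describes β as a normalization constant for the sparse loss.

```python
# A sketch of the two-part training loss L = L_c + beta * L_s defined above.
import torch
import torch.nn.functional as F

def total_loss(y_hat: torch.Tensor, p: torch.Tensor,
               alphas: list, beta: float = 0.1) -> torch.Tensor:
    """y_hat, p: (N,) predictions and 0/1 float labels;
    alphas: list of per-video frame-weight tensors of shape (T_n,)."""
    l_cls = F.binary_cross_entropy(y_hat, p)                        # L_c
    l_sparse = torch.stack([a.abs().sum() for a in alphas]).mean()  # L_s (L1)
    return l_cls + beta * l_sparse
```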
It should be noted that while the operations of the method of the present invention are depicted in FIG. 1 in a particular order, this does not require or imply that the operations must be performed in this particular order, or that all of the illustrated operations must be performed, to achieve desirable results. Rather, the steps depicted in the flowcharts may change order of execution if desired.
Referring to fig. 5, an artificial intelligence based echocardiography automatic predictive identification system 200 is shown according to another embodiment of the invention, including:
a video obtaining module 201, configured to obtain a color doppler video of at least one slice in an echocardiogram of a detected object;
an input extraction module 202, configured to extract each video frame in the color doppler video, and input each video frame to a trained convolutional neural network to obtain an N-dimensional feature vector corresponding to each video frame;
a weight generating module 203, configured to generate, through the attention module, the weight corresponding to each video frame from its N-dimensional feature vector;
an overall feature calculation module 204, configured to calculate a weighted sum of the N-dimensional feature vectors of the video frames by using the weights to obtain an overall feature representation of the color Doppler video;
and a prediction output module 205, configured to calculate, based on the overall feature representation, a predicted value that the video contains the pre-identified image feature, through an FC fully connected layer and a sigmoid activation function.
Further, the echocardiogram automatic prediction and identification system 200 includes: a pre-identified image feature measurement module 206 for measuring the area of the left atrial region in the key frame; measuring the area of the regurgitation stream within the left atrial region in said key frame; and calculating a ratio of the area of the regurgitation stream to the area of the left atrium.
It should be understood that the modules described in the echocardiographic automatic predictive identification system 200 in this embodiment correspond to the steps in the method described in fig. 1. Therefore, the operations and features described above for the method are also applicable to each module of the present embodiment, and are not described herein again. The system of this embodiment may be implemented in the electronic device in advance, or may be loaded into the electronic device by downloading or the like. The corresponding modules in the system of this embodiment may cooperate with units in the electronic device to implement the solution of this embodiment. In addition, the modules described in the present embodiment may be implemented by software or hardware. The names of these units or modules do not in some cases constitute a limitation on the units or modules themselves, e.g., the video acquisition module 201 may also be described as "module 201 for acquiring color doppler video of at least one slice in an echocardiogram of a test subject".
Referring to fig. 6, there is shown an electronic device 300 according to another embodiment of the invention, comprising:
at least one processor 301; and the number of the first and second groups,
a memory 302 communicatively coupled to the at least one processor 301; wherein,
the memory 302 stores instructions executable by the at least one processor 301 to enable the at least one processor 301 to perform the steps of the above-described method embodiments.
Referring to fig. 7, the electronic device in the embodiment shown in fig. 6 may be, for example, a B-mode ultrasound machine. The B-mode ultrasound machine may also comprise a computer system 700 including a Central Processing Unit (CPU) 701 which may perform various appropriate actions and processes in accordance with a program stored in a Read Only Memory (ROM) 702 or a program loaded from a storage section 708 into a Random Access Memory (RAM) 703. In the RAM 703, various programs and data necessary for the operation of the system 700 are also stored. The CPU 701, the ROM 702, and the RAM 703 are connected to each other via a bus 704. An input/output (I/O) interface 705 is also connected to bus 704. The following components are connected to the I/O interface 705: an input portion 706 including a keyboard, a mouse, and the like; an output section 707 including a display such as a Cathode Ray Tube (CRT), a Liquid Crystal Display (LCD), and the like, and a speaker; a storage section 708 including a hard disk and the like; and a communication section 709 including a network interface card such as a LAN card, a modem, or the like. The communication section 709 performs communication processing via a network such as the internet. A drive 710 is also connected to the I/O interface 705 as needed. A removable medium 711 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, or the like is mounted on the drive 710 as necessary, so that a computer program read out therefrom is mounted into the storage section 708 as necessary.
As another aspect, the present application also provides a computer-readable storage medium, which may be a computer-readable storage medium included in the system or the electronic device described in the above embodiments; or it may be a separate computer readable storage medium not incorporated into the device. The computer readable storage medium stores one or more programs for use by one or more processors in performing the methods for automated predictive identification of echocardiograms described herein.
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
The above description is only a preferred embodiment of the application and is illustrative of the principles of the technology employed. It will be appreciated by a person skilled in the art that the scope of the invention as referred to in the present application is not limited to the embodiments with a specific combination of the above-mentioned features, but also covers other embodiments with any combination of the above-mentioned features or their equivalents without departing from the inventive concept. For example, the above features may be replaced with (but not limited to) features having similar functions disclosed in the present application.

Claims (10)

1. An artificial intelligence-based echocardiogram automatic prediction and identification method, characterized by comprising the following steps:
acquiring a color Doppler video of at least one section in an echocardiogram of a detected object;
extracting each video frame in the color Doppler video, and inputting each video frame into a trained convolutional neural network to obtain an N-dimensional feature vector corresponding to each video frame;
generating the weight corresponding to each video frame by the N-dimensional feature vector of each video frame through an attention module;
calculating a weighted sum of the N-dimensional feature vectors of each video frame by using the weights to obtain an overall feature representation of the color Doppler video;
and calculating a predicted value of the color Doppler video containing the pre-identified image characteristics based on the overall characteristic representation.
2. The method according to claim 1, further comprising the steps of:
and outputting the frame with the maximum weight as a key frame.
3. The method according to claim 2, wherein the at least one section comprises an apical four-chamber section.
4. The method according to claim 3, wherein the pre-identified image is characterized by valve regurgitation.
5. The method according to claim 4, further comprising the steps of:
measuring the relative area of the left atrial region in the keyframe;
measuring the relative area of the regurgitation stream within the left atrial region in said keyframe;
calculating a ratio of the relative area of the regurgitation stream to the relative area of the left atrium.
6. The artificial intelligence-based echocardiogram automatic prediction and identification method of claim 1, further comprising, prior to inputting each video frame to the pre-trained convolutional neural network:
and inputting the color Doppler video of at least one section of the echocardiogram containing the pre-identified image characteristics into a pre-trained convolutional neural network as a training sample, and training the pre-trained convolutional neural network.
7. The method according to claim 6, further comprising:
function of current loss
Figure DEST_PATH_IMAGE002
No longer decreases, or loss functions
Figure DEST_PATH_IMAGE002A
Stopping the training of the convolutional neural network when the value of (a) is lower than a predetermined value;
said loss function
Figure DEST_PATH_IMAGE002AA
Expressed as:
Figure DEST_PATH_IMAGE006
wherein, the
Figure DEST_PATH_IMAGE008
Representing a classification loss calculated at the video level, said
Figure DEST_PATH_IMAGE010
Represents the sparse loss used to adjust the weight of each video frame,
Figure DEST_PATH_IMAGE012
is composed of
Figure DEST_PATH_IMAGE013
A normalization constant of (d);
the above-mentioned
Figure DEST_PATH_IMAGE008A
Calculated according to the following formula:
Figure DEST_PATH_IMAGE015
wherein,
Figure DEST_PATH_IMAGE017
whether the nth color Doppler video contains the pre-recognition image features or not is shown, the value range of N is (1, 2, … … N), N is the total number of the color Doppler videos used for training the convolutional neural network model, and if the value range of N is (1, 2, … … N), the number of the color Doppler videos is not less than the total number of the color Doppler videos used for training the convolutional neural network model
Figure DEST_PATH_IMAGE019
Indicating that the nth color Doppler video does not contain the pre-identified image features; if it is
Figure DEST_PATH_IMAGE021
Indicating that the nth color Doppler video contains the pre-identified image characteristics;
Figure DEST_PATH_IMAGE023
the predicted value of the characteristic of the pre-identified image contained in the nth color Doppler video is shown;
Figure DEST_PATH_IMAGE025
for the global characterization of the nth color doppler video,
Figure DEST_PATH_IMAGE027
then is
Figure DEST_PATH_IMAGE023A
And
Figure DEST_PATH_IMAGE017A
cross entropy of (d);
the above-mentioned
Figure DEST_PATH_IMAGE013A
Calculated according to the following formula:
Figure DEST_PATH_IMAGE029
wherein,
Figure DEST_PATH_IMAGE031
,
Figure DEST_PATH_IMAGE033
representing the nth color Doppler video
Figure DEST_PATH_IMAGE035
The weight of each of the video frames is,
Figure DEST_PATH_IMAGE035A
the value range is (1, 2, … …, T), and T is the frame number of the video; the value range of N is (1, 2, … … N), and N is the total number of color Doppler videos.
8. An artificial intelligence-based echocardiogram automatic prediction and identification system, characterized by comprising:
the video acquisition module is used for acquiring a color Doppler video of at least one section in an echocardiogram of the detected object;
the input extraction module is used for extracting each video frame in the color Doppler video and inputting each video frame to a trained convolutional neural network so as to obtain an N-dimensional feature vector corresponding to each video frame;
the weight generating module is used for generating the weight corresponding to each video frame by the N-dimensional characteristic vector of each video frame through the attention module;
the overall characteristic calculation module is used for calculating the weighted sum of the N-dimensional characteristic vectors of each video frame by using the weights so as to obtain the overall characteristic representation of the color Doppler video;
and the prediction output module is used for calculating a prediction value of the color Doppler video containing the pre-identified image characteristics based on the overall characteristic representation.
9. The system according to claim 8, further comprising:
and the key frame output module is used for outputting the frame with the maximum weight as a key frame.
10. The system according to claim 9, wherein said system comprises:
the at least one section comprises an apical four-chamber section;
the pre-identified image feature is valve regurgitation;
the system further comprises:
a pre-identified image feature measurement module for measuring the relative area of the left atrial region in the key frame; measuring the relative area of the regurgitation stream within the left atrial region in said key frame; and calculating a ratio of the relative area of the regurgitation stream to the relative area of the left atrium.
CN202010353559.8A 2020-04-29 2020-04-29 Artificial intelligence-based automatic prediction and identification method and system for echocardiogram Active CN111493935B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010353559.8A CN111493935B (en) 2020-04-29 2020-04-29 Artificial intelligence-based automatic prediction and identification method and system for echocardiogram

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010353559.8A CN111493935B (en) 2020-04-29 2020-04-29 Artificial intelligence-based automatic prediction and identification method and system for echocardiogram

Publications (2)

Publication Number Publication Date
CN111493935A CN111493935A (en) 2020-08-07
CN111493935B (en) 2021-01-15

Family

ID=71866649

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010353559.8A Active CN111493935B (en) 2020-04-29 2020-04-29 Artificial intelligence-based automatic prediction and identification method and system for echocardiogram

Country Status (1)

Country Link
CN (1) CN111493935B (en)

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112101413A (en) * 2020-08-12 2020-12-18 海南大学 Intelligent system for predicting cerebral apoplexy risk
CN112258476B (en) * 2020-10-22 2024-07-09 东软教育科技集团有限公司 Method, system and storage medium for analyzing abnormal motion pattern of heart muscle of echocardiography
CN112435247B (en) * 2020-11-30 2022-03-25 中国科学院深圳先进技术研究院 Patent foramen ovale detection method, system, terminal and storage medium
CN112419313B (en) * 2020-12-10 2023-07-28 清华大学 Multi-section classification method based on heart disease ultrasound
CN112489043B (en) * 2020-12-21 2024-08-13 无锡祥生医疗科技股份有限公司 Heart disease detection device, model training method, and storage medium
CN113180737B (en) * 2021-05-06 2022-02-08 中国人民解放军总医院 Artificial intelligence-based oval hole closure detection method, system, equipment and medium
CN113487665B (en) * 2021-06-04 2022-03-11 中国人民解放军总医院 Method, device, equipment and medium for measuring cavity gap
CN114469176B (en) * 2021-12-31 2024-07-05 深圳度影医疗科技有限公司 Fetal heart ultrasonic image detection method and related device
CN114666571B (en) * 2022-03-07 2024-06-14 中国科学院自动化研究所 Video sensitive content detection method and system
CN114723710A (en) * 2022-04-11 2022-07-08 安徽鲲隆康鑫医疗科技有限公司 Method and device for detecting ultrasonic video key frame based on neural network
CN115797330B (en) * 2022-12-30 2024-04-05 北京百度网讯科技有限公司 Algorithm correction method based on ultrasonic video, ultrasonic video generation method and equipment

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10321892B2 (en) * 2010-09-27 2019-06-18 Siemens Medical Solutions Usa, Inc. Computerized characterization of cardiac motion in medical diagnostic ultrasound
BR112013015639A8 (en) * 2010-12-23 2018-02-06 Koninklijke Philips Nv DIAGNOSTIC ULTRASOUND SYSTEM TO EVALUATE REGURGITANT FLOW
CN103824284B (en) * 2014-01-26 2017-05-10 中山大学 Key frame extraction method based on visual attention model and system
US10271817B2 (en) * 2014-06-23 2019-04-30 Siemens Medical Solutions Usa, Inc. Valve regurgitant detection for echocardiography
JP6727286B2 (en) * 2015-04-02 2020-07-22 カーディアウェイブ Method and apparatus for treating pericardial disease
CN105913084A (en) * 2016-04-11 2016-08-31 福州大学 Intensive track and DHOG-based ultrasonic heartbeat video image classifying method
CN108171141B (en) * 2017-12-25 2020-07-14 淮阴工学院 Attention model-based cascaded multi-mode fusion video target tracking method

Also Published As

Publication number Publication date
CN111493935A (en) 2020-08-07

Similar Documents

Publication Publication Date Title
CN111493935B (en) Artificial intelligence-based automatic prediction and identification method and system for echocardiogram
KR101908680B1 (en) A method and apparatus for machine learning based on weakly supervised learning
Liao et al. On modelling label uncertainty in deep neural networks: Automatic estimation of intra-observer variability in 2d echocardiography quality assessment
US11508061B2 (en) Medical image segmentation with uncertainty estimation
US11367001B2 (en) Neural network image analysis
CN111612756B (en) Coronary artery specificity calcification detection method and device
US11995823B2 (en) Technique for quantifying a cardiac function from CMR images
Jafari et al. Deep Bayesian image segmentation for a more robust ejection fraction estimation
CN114170478A (en) Defect detection and positioning method and system based on cross-image local feature alignment
Lin et al. Echocardiography-based AI detection of regional wall motion abnormalities and quantification of cardiac function in myocardial infarction
CN113011340B (en) Cardiovascular operation index risk classification method and system based on retina image
US20240127432A1 (en) Image sequence analysis
CN113222985B (en) Image processing method, image processing device, computer equipment and medium
CN114010227B (en) Right ventricle characteristic information identification method and device
Hatfaludi et al. Deep learning based aortic valve detection and state classification on echocardiographies
US20220338816A1 (en) Fully automated cardiac function and myocardium strain analyses using deep learning
US20230196557A1 (en) Late Gadolinium Enhancement Analysis for Magnetic Resonance Imaging
Anton et al. Automated quantification of myocardial tissue characteristics from native T1 mapping using neural networks with Bayesian inference for uncertainty-based quality-control
Zhang et al. Image quality assessment for population cardiac magnetic resonance imaging
Van De Vyver et al. Regional quality estimation for echocardiography using deep learning
Thennakoon et al. Automatic classification of left ventricular function of the human heart using echocardiography
WO2024127209A1 (en) Automated analysis of echocardiographic images
Tripathi et al. Learning the Imaging Landmarks: Unsupervised Key point Detection in Lung Ultrasound Videos
CN118141419A (en) Method and device for evaluating myocardial motion, electronic device and storage medium
Kumari et al. Gestational age determination of ultrasound foetal images using artificial neural network

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant