Disclosure of Invention
In order to overcome the defects, the invention aims to provide a brand-new physiological parameter measuring method based on a multi-scale fusion network. The method of the invention fully exerts the complementary advantages of the information of different scales by combining the information of different scales, and realizes accurate physiological parameter measurement. Meanwhile, the adopted end-to-end neural network avoids complicated feature point detection and feature engineering. The method extracts multi-scale fusion characteristics from physiological signals through a multi-scale fusion network, obtains estimated values of physiological parameters to be measured by regression of the characteristics, and outputs a static mean value or dynamic continuous measured value of the physiological parameters to be measured by corresponding operation according to a measurement mode identifier.
The invention provides a physiological parameter measuring method based on a multi-scale fusion network, which comprises the following specific steps:
(1) collecting physiological signals of biological individuals under static or dynamic conditions; preprocessing the acquired physiological signals, namely performing data segment cutting, resampling and normalization operation, and then removing the interference of baseline drift, power frequency, respiration, motion artifacts, electromyographic noise and the like by adopting a filtering or other signal processing method to obtain a one-dimensional signal segment;
(2) for the one-dimensional signal segment obtained in the step (1), the dimensionality is expanded by adopting mathematical transformation to construct more complete and rich physiological signal representation, and a three-dimensional matrix is obtainedB, C, N]Wherein:Bas the total number of data segments after the segmentation,Cfor the dimensions after the mathematical transformation to be used,Nthe length of the data segment, namely the number of sample points;
(3) for the product obtained in step (2)Three-dimensional matrix [ 2 ]B, C, N]Divided into training sets according to a certain ratio (e.g., 9:1, 8:2 or 7: 3)B 1 , C, N]And the test setB 2 , C, N],B 1 AndB 2 the number of fragments in the training set and test set, respectively, andB 1 andB 2 the sum being equal toB;
(4) Constructing a physiological parameter measurement model, wherein the model comprises three parts: a multi-scale fusion network, a hybrid attention mechanism and a convolution network layer; according to a predetermined scale numberIDesigning a multi-scale fusion convolutional layer, and then fusing the number of convolutional layers according to the preset multi-scaleMSetting a multi-scale fusion network, and enabling the three-dimensional matrix obtained in the step (2)B, C, N]Inputting the multi-scale fusion network to obtain a first characteristic data three-dimensional matrixS(ii) a Digging out a second characteristic data three-dimensional matrix related to the physiological parameter to be measured by using a mixed attention machineS 1 (ii) a According to the set number of layers of the convolutional networkMUsing a convolutional network layer to perform three-dimensional matrix on the second characteristic dataS 1 Performing regression calculation to generate an estimated value of the physiological parameter to obtain a two-dimensional data matrixB, X]Wherein:Xis an estimated value of the physiological parameter to be measured;
(5) obtaining a measurement mode identifier, the measurement mode identifier comprising a static mode and a dynamic mode:
when the measurement mode identifier is a static mode, the two-dimensional data matrix obtained in step (4)B, X]Carrying out mean value operation to generate a mean value data matrix [1,X M]average value ofX MAs a physiological parameter measurement in static mode;
when the measurement mode identifier is in a dynamic mode, continuously extracting all estimated values of the physiological parameters to be measured obtained in the step (4) to form a one-dimensional continuous data sequence as a continuous physiological parameter measured value in the dynamic mode;
(6) the training set [ 2 ] obtained in the step (3)B 1 , C, N]Inputting the physiological parameters into the model in the step (4) for training and optimizing to obtain a physiological parameter estimation model; will test set 2B 2 , C, N]Inputting a physiological parameter estimation model for testing, and checking the accuracy of the model.
In the present invention, the physiological parameter includes, but is not limited to, one or more of heart rate, blood pressure, respiration rate, cardiac function index or arteriosclerosis index.
In the invention, the signal collected in step (1) is a physiological signal containing the physiological and pathological information of a cardiovascular system, and mainly comprises the following types: one or more of electrocardio signals, pulse wave signals, heart impact signals or heart sound signals.
In the present invention, the mathematical transformation method for dimension expansion of physiological signals in step (2) comprises the following types: any one of difference, integral, fourier transform, wavelet transform, empirical mode decomposition, or variational mode decomposition.
In the invention, the multi-scale fusion convolutional layer in the step (4) is formed by the following method:
(4.1) setting the convolution layer with the convolution kernel size of 1 to adjust the channel number of the input data according to the output channel number of the set multi-scale fusion convolution layer;
(4.2) number of scales according to settingISetting upIMulti-scale fusion convolutional layer with different convolutional kernel sizesF 1 , F 2 ,…F I This isIThe convolution kernel sizes of the multiple multi-scale fusion convolutional layers are spaced at 2 intervals.IThe input matrix is simultaneously convolved by a plurality of multi-scale fusion convolution layers to obtainIAn output matrixY 1 , Y 2 …Y I Using global pooling pairsIReducing the dimension of each output matrix to obtain a one-dimensional embedded vectorz 1 , z 2 , …z I Then using two fully-connected layers andsoftmaxweight matrix obtained by compressing and recovering information of embedded vectorW 1 , W 2 , …W I The weight matrixW 1 , W 2 , …W I And output matrixY 1 , Y 2 …Y I And correspondingly multiplying and summing to obtain the multi-scale fusion feature.
In the present invention, convolution operation is performed on an input matrix by using convolution layers, and the convolution operation includes the following types: a classical convolution or a dilated convolution.
In the invention, global pooling is used for reducing the dimension of the output matrix, and the method comprises the following types: global average pooling or global maximum pooling.
In the invention, a mixed attention mechanism is utilized in the step (4) to further mine the characteristics related to the physiological parameters to be estimated, and the method comprises the following types: a Bottleneck Attention Module (BAM) or convolution module attention mechanism (CBAM).
In the invention, the average value of all estimated values is used as the physiological parameter measured value in the static mode in the step (5); and taking the one-dimensional continuous data sequence formed by all the estimation values as the continuous measurement value of the physiological parameter in the dynamic mode.
In the invention, the training set in the step (6) is used for training the weights in the model, and the test set is used for verifying the performance of the physiological parameter measurement model on an unknown data set. According to different division modes of the data set, the physiological parameter measurement model can be divided into a calibration model and a non-calibration model, the data of the same subject appearing in the training set and the test set is the calibration model, and otherwise, the data is the non-calibration model.
The invention has the following beneficial effects:
1. the method can be used for realizing noninvasive and portable physiological parameter measurement, and is favorable for daily monitoring;
2. the invention realizes the dimension expansion by using mathematical transformation on the physiological signals, and can more comprehensively mine the potential information contained in the physiological signals. The measurement of the physiological parameter can be achieved more accurately than using only the original physiological signal;
3. the invention fully extracts the information of different scales in the original signal by utilizing the multi-scale fusion network, realizes the complementation between different scales and can more accurately realize the measurement of physiological parameters;
4. the application range of the method disclosed by the invention covers the measurement of all physiological parameters, and the method has a certain application value in the fields of cardiovascular disease research and signal processing research.
Detailed Description
The method and the application of the invention are further explained below with reference to the figures and the examples. These embodiments do not limit the invention; variations in structure, method, or function that may be apparent to those of ordinary skill in the art upon reading the foregoing description are intended to be within the scope of the present invention.
Example 1: the physiological parameter measurement model based on the multi-scale fusion network is applied to a dynamic systolic pressure and diastolic pressure measurement task, and the task is realized by adopting an MIMIC database. The MIMIC database contains ECG (electrocardiogram signal), PPG (pulse wave signal) and ABP (arterial blood pressure signal), all at a sampling rate of 125 Hz. The ECG and PPG signals are used to measure the blood pressure value, and the ABP signal is compared as the true value with the blood pressure value measured by the model. The method for measuring the blood pressure value by adopting the multi-scale fusion network model comprises the following specific steps:
(1) the pulse wave signals in the MIMIC database are observed as shown in fig. 1 (a). The pulse wave signals have serious baseline drift and contain a certain degree of power frequency interference. The pulse wave signal is first Discrete Wavelet Transform (DWT) decomposed with a db8 wavelet basis function. Then setting the wavelet coefficient corresponding to the noise frequency range to zero, and finally reconstructing according to the wavelet coefficient. Obtaining a clean pulse wave signal through the above pre-processing, as shown in fig. 1 (b);
dividing the pulse wave signals subjected to noise reduction according to the duration of 8 seconds (namely the number of sampling points is 1000) to obtain all data segments, selecting the slope as an index to evaluate the signal quality of all the data segments, discarding the current segment if the index is less than 0, and keeping the current segment if the index is more than 0;
(2) expanding dimensionality of the pulse wave data segments by adopting first-order difference and second-order difference, and splicing to obtain a three-dimensional input matrix [3, 1000], wherein 3 is input dimensionality, and 1000 is sampling point number;
(3) dividing the data of 1825 subjects in the database into a training set and a test set according to the ratio of 8:2 to form a data set of the embodiment;
(4) the overall Multi-scale Fusion Network model framework is shown in FIG. 2, where Multi-scale Fusion CNN backbone is a Multi-scale Fusion Network, where Attention is the Attention layer, DBP Network is the diastolic convolution Network, SBP Network is the systolic convolution Network, and Loss isauxIs fromLoss function, Loss, of multi-scale converged networkssbpAnd LossdbpThe loss functions from the systolic and diastolic convolution networks, respectively. The construction of the multi-scale fusion network consists of 5 steps, wherein the 5 steps respectively comprise 1, 2, 3, 3 and 3 multi-scale fusion convolutional layers, and the first multi-scale fusion convolutional layer of each step is used for changing the number of channels and reducing sampling. The first step is to increase the number of channels to 32, and the rest of the stages are increased by 2 times on this basis. Meanwhile, the signal length is decreased by 2 times at each step;
the specific structure of the multi-scale fusion convolutional layer in this embodiment is shown in fig. 3, where ConvX (X is 1, 7, 9, 11, respectively) represents a convolutional layer with a convolution kernel size of X. The number of scales of the multi-scale fusion convolutional layers is set to be 3, the convolutional layers adopt expansion convolution, and the sizes of the three convolutional layers are respectively selected to be 7, 9 and 11. After the multi-scale fusion network model, the original input matrix [ 2 ]B, 3, 1000]First, the matrix is converted into a feature matrixB, 256, 32],BIndicating the number of samples. The mixed attention mechanism adopts a bottleneck attention module, and a systolic pressure and diastolic pressure convolution network consists of two layers of ordinary convolution layers with the channel number of 512, a global average pooling layer and convolution layers with the convolution kernel size of 1. Characteristic matrix [ alpha ]B, 256, 32]The physiological parameter matrix is finally obtained after the mixed attention module and the convolution networkB, 2];
(5) Acquiring a measurement mode identifier, wherein the measurement mode identifier is a dynamic mode, and extracting all physiological parameter estimation values from the physiological parameter matrix finally obtained in the step (4) to form a one-dimensional continuous data sequence: systolic pressure sequence [ alpha ], [ alpha ] and [ alpha ], [ alpha ] aB S , 1]And diastolic pressure sequence [ alpha ], [ beta ] -andB D , 1],B S the number of measurements of the systolic blood pressure is indicated,B D representing the number of measurements of diastolic pressure;
(6) the training set is used to train the weights in the model, and the test set is used to verify the performance of the physiological parameter measurement model on the unknown data set. According to different division modes of the data set, the physiological parameter measurement model can be divided into a calibration model and a non-calibration model, the data of the same subject appearing in the training set and the test set is the calibration model, and otherwise, the data is the non-calibration model. The results of the performance test of this example are shown in fig. 4. FIGS. 4 (a), (c) reflect the correlation between the measured values and the true values of the non-calibrated model (Cal-free) on the test set; fig. 4 (b), (d) reflect the correlation between the measured values and the actual values of the calibration model (Cal-based) on the test set. The result shows that the physiological parameter measurement model constructed by the embodiment can accurately measure the systolic pressure and the diastolic pressure.