CN112806977B

CN112806977B - Physiological parameter measuring method based on multi-scale fusion network

Info

Publication number: CN112806977B
Application number: CN202110160054.4A
Authority: CN
Inventors: 杨翠微; 胡启晗
Original assignee: Fudan University
Current assignee: Fudan University
Priority date: 2021-02-05
Filing date: 2021-02-05
Publication date: 2022-03-18
Anticipated expiration: 2041-02-05
Also published as: CN112806977A

Abstract

The invention provides a physiological parameter measurement method based on a multi-scale fusion network. The method includes: sampling physiological signals and generating a one-dimensional physiological signal data sequence; performing noise filtering on data segments that meet signal quality requirements through data segment division and signal quality evaluation; and then performing mathematical transformation on the data segments to generate multi-dimensional The input data tensor is used to extract the latent features of the input data by using a multi-scale fusion network to obtain the estimated values of physiological parameters. By identifying the measurement mode identifier, the mean of all estimated values is taken as the measured value of the physiological parameter in the static mode; the one-dimensional continuous data sequence formed by all the estimated values is taken as the continuous measured value of the physiological parameter in the dynamic mode. The method can fully extract the complementary information of different scales in the signal, and realize the accurate measurement of physiological parameters.

Description

Physiological parameter measuring method based on multi-scale fusion network

Technical Field

The invention relates to a physiological parameter measuring method based on a multi-scale fusion network.

Background

The heart is the power center of human blood circulation, and supplies blood to the whole body through regular pulsation to meet the metabolism of the human body, thereby maintaining the normal life activities of the human body. It is of utmost importance to effectively measure physiological parameters related to the cardiovascular system to monitor the health status of a human body to prevent cardiovascular diseases. With the aging population and the increase of working pressure, the prevalence of cardiovascular diseases in China is increasing year by year. Cardiovascular diseases have become a leading cause of human death worldwide as reported by the world health organization in 2020. Therefore, the dynamic detection of the physiological parameters related to the cardiovascular system and the accurate evaluation of the health condition have great practical significance for the prevention and treatment of cardiovascular diseases.

Currently, a large number of medical sensors are required to be worn on commonly used physiological parameter detection equipment, such as a dynamic blood pressure monitor, an arteriosclerosis detector, a microcirculation detector and the like. Although these devices work accurately during the measurement process, the long wear can be uncomfortable for the patient, and the devices are too bulky and expensive to use in everyday life. Therefore, portability and comfort are the key of the physiological parameter measuring system facing daily life.

In recent years, the development of wearable technology and high-performance processing chips lays a hardware foundation for physiological signal processing, and the rise of deep learning makes accurate measurement of physiological parameters possible, so more and more researches are focused on automatically extracting characteristic parameters from physiological signals by using deep learning algorithms to estimate physiological parameters. The physiological signals are mostly one-dimensional time series with periodicity, and the change of the physiological signals reflects physiological information of various systems of a human body. At present, most of researches adopt a convolutional neural network with a single scale to realize automatic feature extraction, but the method ignores information which may be useful on other scales, so that the measurement accuracy cannot be further improved. Meanwhile, for the same physiological signal, the variability among individuals and the variability of waveforms of the physiological signal also cause that the convolutional neural network with a single scale cannot effectively extract features.

Disclosure of Invention

In order to overcome the defects, the invention aims to provide a brand-new physiological parameter measuring method based on a multi-scale fusion network. The method of the invention fully exerts the complementary advantages of the information of different scales by combining the information of different scales, and realizes accurate physiological parameter measurement. Meanwhile, the adopted end-to-end neural network avoids complicated feature point detection and feature engineering. The method extracts multi-scale fusion characteristics from physiological signals through a multi-scale fusion network, obtains estimated values of physiological parameters to be measured by regression of the characteristics, and outputs a static mean value or dynamic continuous measured value of the physiological parameters to be measured by corresponding operation according to a measurement mode identifier.

The invention provides a physiological parameter measuring method based on a multi-scale fusion network, which comprises the following specific steps:

(1) collecting physiological signals of biological individuals under static or dynamic conditions; preprocessing the acquired physiological signals, namely performing data segment cutting, resampling and normalization operation, and then removing the interference of baseline drift, power frequency, respiration, motion artifacts, electromyographic noise and the like by adopting a filtering or other signal processing method to obtain a one-dimensional signal segment;

(2) for the one-dimensional signal segment obtained in the step (1), the dimensionality is expanded by adopting mathematical transformation to construct more complete and rich physiological signal representation, and a three-dimensional matrix is obtainedB, C, N]Wherein:Bas the total number of data segments after the segmentation,Cfor the dimensions after the mathematical transformation to be used,Nthe length of the data segment, namely the number of sample points;

(3) for the product obtained in step (2)Three-dimensional matrix [ 2 ]B, C, N]Divided into training sets according to a certain ratio (e.g., 9:1, 8:2 or 7: 3)B ₁, C, N]And the test setB ₂, C, N]，B ₁AndB ₂the number of fragments in the training set and test set, respectively, andB ₁andB ₂the sum being equal toB；

(4) Constructing a physiological parameter measurement model, wherein the model comprises three parts: a multi-scale fusion network, a hybrid attention mechanism and a convolution network layer; according to a predetermined scale numberIDesigning a multi-scale fusion convolutional layer, and then fusing the number of convolutional layers according to the preset multi-scaleMSetting a multi-scale fusion network, and enabling the three-dimensional matrix obtained in the step (2)B, C, N]Inputting the multi-scale fusion network to obtain a first characteristic data three-dimensional matrixS(ii) a Digging out a second characteristic data three-dimensional matrix related to the physiological parameter to be measured by using a mixed attention machineS ₁(ii) a According to the set number of layers of the convolutional networkMUsing a convolutional network layer to perform three-dimensional matrix on the second characteristic dataS ₁Performing regression calculation to generate an estimated value of the physiological parameter to obtain a two-dimensional data matrixB, X]Wherein:Xis an estimated value of the physiological parameter to be measured;

(5) obtaining a measurement mode identifier, the measurement mode identifier comprising a static mode and a dynamic mode:

when the measurement mode identifier is a static mode, the two-dimensional data matrix obtained in step (4)B, X]Carrying out mean value operation to generate a mean value data matrix [1,X _M]average value ofX _MAs a physiological parameter measurement in static mode;

when the measurement mode identifier is in a dynamic mode, continuously extracting all estimated values of the physiological parameters to be measured obtained in the step (4) to form a one-dimensional continuous data sequence as a continuous physiological parameter measured value in the dynamic mode;

(6) the training set [ 2 ] obtained in the step (3)B ₁, C, N]Inputting the physiological parameters into the model in the step (4) for training and optimizing to obtain a physiological parameter estimation model; will test set 2B ₂, C, N]Inputting a physiological parameter estimation model for testing, and checking the accuracy of the model.

In the present invention, the physiological parameter includes, but is not limited to, one or more of heart rate, blood pressure, respiration rate, cardiac function index or arteriosclerosis index.

In the invention, the signal collected in step (1) is a physiological signal containing the physiological and pathological information of a cardiovascular system, and mainly comprises the following types: one or more of electrocardio signals, pulse wave signals, heart impact signals or heart sound signals.

In the present invention, the mathematical transformation method for dimension expansion of physiological signals in step (2) comprises the following types: any one of difference, integral, fourier transform, wavelet transform, empirical mode decomposition, or variational mode decomposition.

In the invention, the multi-scale fusion convolutional layer in the step (4) is formed by the following method:

(4.1) setting the convolution layer with the convolution kernel size of 1 to adjust the channel number of the input data according to the output channel number of the set multi-scale fusion convolution layer;

(4.2) number of scales according to settingISetting upIMulti-scale fusion convolutional layer with different convolutional kernel sizesF ₁, F ₂,…F _IThis isIThe convolution kernel sizes of the multiple multi-scale fusion convolutional layers are spaced at 2 intervals.IThe input matrix is simultaneously convolved by a plurality of multi-scale fusion convolution layers to obtainIAn output matrixY ₁, Y ₂…Y _IUsing global pooling pairsIReducing the dimension of each output matrix to obtain a one-dimensional embedded vectorz ₁, z ₂, …z _IThen using two fully-connected layers andsoftmaxweight matrix obtained by compressing and recovering information of embedded vectorW ₁, W ₂, …W _IThe weight matrixW ₁, W ₂, …W _IAnd output matrixY ₁, Y ₂…Y _IAnd correspondingly multiplying and summing to obtain the multi-scale fusion feature.

In the present invention, convolution operation is performed on an input matrix by using convolution layers, and the convolution operation includes the following types: a classical convolution or a dilated convolution.

In the invention, global pooling is used for reducing the dimension of the output matrix, and the method comprises the following types: global average pooling or global maximum pooling.

In the invention, a mixed attention mechanism is utilized in the step (4) to further mine the characteristics related to the physiological parameters to be estimated, and the method comprises the following types: a Bottleneck Attention Module (BAM) or convolution module attention mechanism (CBAM).

In the invention, the average value of all estimated values is used as the physiological parameter measured value in the static mode in the step (5); and taking the one-dimensional continuous data sequence formed by all the estimation values as the continuous measurement value of the physiological parameter in the dynamic mode.

In the invention, the training set in the step (6) is used for training the weights in the model, and the test set is used for verifying the performance of the physiological parameter measurement model on an unknown data set. According to different division modes of the data set, the physiological parameter measurement model can be divided into a calibration model and a non-calibration model, the data of the same subject appearing in the training set and the test set is the calibration model, and otherwise, the data is the non-calibration model.

The invention has the following beneficial effects:

1. the method can be used for realizing noninvasive and portable physiological parameter measurement, and is favorable for daily monitoring;

2. the invention realizes the dimension expansion by using mathematical transformation on the physiological signals, and can more comprehensively mine the potential information contained in the physiological signals. The measurement of the physiological parameter can be achieved more accurately than using only the original physiological signal;

3. the invention fully extracts the information of different scales in the original signal by utilizing the multi-scale fusion network, realizes the complementation between different scales and can more accurately realize the measurement of physiological parameters;

4. the application range of the method disclosed by the invention covers the measurement of all physiological parameters, and the method has a certain application value in the fields of cardiovascular disease research and signal processing research.

Drawings

In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings used in the embodiments will be briefly described below. It is noted that the following drawings illustrate only certain embodiments of the invention and are therefore not to be considered limiting of its scope.

Fig. 1 is a schematic diagram of noise reduction of a pulse wave signal. The abscissa is time and the ordinate is signal amplitude. In fig. 1, (a) is a pulse wave signal segment before noise reduction, which includes a certain power frequency interference and baseline drift; (b) is a clean signal after noise reduction.

Fig. 2 shows the architecture of a multi-scale fusion network model for measuring systolic and diastolic pressures.

FIG. 3 shows the specific architecture and operation flow of the multi-scale fusion convolutional layer with the number of scales set to 3.

Fig. 4 shows a correlation analysis between the output values and the actual values of the blood pressure measurement model. (a) Correlation analysis of diastolic pressure for a non-calibrated model, (b) correlation analysis of diastolic pressure for a calibrated model; (c) the correlation analysis of the systolic pressure of a non-calibration model, and (d) the correlation analysis of the systolic pressure of a calibration model. In the coordinates of fig. 4, the horizontal axis represents the true value, and the vertical axis represents the output value of the model.

Detailed Description

The method and the application of the invention are further explained below with reference to the figures and the examples. These embodiments do not limit the invention; variations in structure, method, or function that may be apparent to those of ordinary skill in the art upon reading the foregoing description are intended to be within the scope of the present invention.

Example 1: the physiological parameter measurement model based on the multi-scale fusion network is applied to a dynamic systolic pressure and diastolic pressure measurement task, and the task is realized by adopting an MIMIC database. The MIMIC database contains ECG (electrocardiogram signal), PPG (pulse wave signal) and ABP (arterial blood pressure signal), all at a sampling rate of 125 Hz. The ECG and PPG signals are used to measure the blood pressure value, and the ABP signal is compared as the true value with the blood pressure value measured by the model. The method for measuring the blood pressure value by adopting the multi-scale fusion network model comprises the following specific steps:

(1) the pulse wave signals in the MIMIC database are observed as shown in fig. 1 (a). The pulse wave signals have serious baseline drift and contain a certain degree of power frequency interference. The pulse wave signal is first Discrete Wavelet Transform (DWT) decomposed with a db8 wavelet basis function. Then setting the wavelet coefficient corresponding to the noise frequency range to zero, and finally reconstructing according to the wavelet coefficient. Obtaining a clean pulse wave signal through the above pre-processing, as shown in fig. 1 (b);

dividing the pulse wave signals subjected to noise reduction according to the duration of 8 seconds (namely the number of sampling points is 1000) to obtain all data segments, selecting the slope as an index to evaluate the signal quality of all the data segments, discarding the current segment if the index is less than 0, and keeping the current segment if the index is more than 0;

(2) expanding dimensionality of the pulse wave data segments by adopting first-order difference and second-order difference, and splicing to obtain a three-dimensional input matrix [3, 1000], wherein 3 is input dimensionality, and 1000 is sampling point number;

(3) dividing the data of 1825 subjects in the database into a training set and a test set according to the ratio of 8:2 to form a data set of the embodiment;

(4) the overall Multi-scale Fusion Network model framework is shown in FIG. 2, where Multi-scale Fusion CNN backbone is a Multi-scale Fusion Network, where Attention is the Attention layer, DBP Network is the diastolic convolution Network, SBP Network is the systolic convolution Network, and Loss is_auxIs fromLoss function, Loss, of multi-scale converged networks_sbpAnd Loss_dbpThe loss functions from the systolic and diastolic convolution networks, respectively. The construction of the multi-scale fusion network consists of 5 steps, wherein the 5 steps respectively comprise 1, 2, 3, 3 and 3 multi-scale fusion convolutional layers, and the first multi-scale fusion convolutional layer of each step is used for changing the number of channels and reducing sampling. The first step is to increase the number of channels to 32, and the rest of the stages are increased by 2 times on this basis. Meanwhile, the signal length is decreased by 2 times at each step;

the specific structure of the multi-scale fusion convolutional layer in this embodiment is shown in fig. 3, where ConvX (X is 1, 7, 9, 11, respectively) represents a convolutional layer with a convolution kernel size of X. The number of scales of the multi-scale fusion convolutional layers is set to be 3, the convolutional layers adopt expansion convolution, and the sizes of the three convolutional layers are respectively selected to be 7, 9 and 11. After the multi-scale fusion network model, the original input matrix [ 2 ]B, 3, 1000]First, the matrix is converted into a feature matrixB, 256, 32]，BIndicating the number of samples. The mixed attention mechanism adopts a bottleneck attention module, and a systolic pressure and diastolic pressure convolution network consists of two layers of ordinary convolution layers with the channel number of 512, a global average pooling layer and convolution layers with the convolution kernel size of 1. Characteristic matrix [ alpha ]B, 256, 32]The physiological parameter matrix is finally obtained after the mixed attention module and the convolution networkB, 2]；

(5) Acquiring a measurement mode identifier, wherein the measurement mode identifier is a dynamic mode, and extracting all physiological parameter estimation values from the physiological parameter matrix finally obtained in the step (4) to form a one-dimensional continuous data sequence: systolic pressure sequence [ alpha ], [ alpha ] and [ alpha ], [ alpha ] aB _S, 1]And diastolic pressure sequence [ alpha ], [ beta ] -andB _D, 1]，B _Sthe number of measurements of the systolic blood pressure is indicated,B _Drepresenting the number of measurements of diastolic pressure;

(6) the training set is used to train the weights in the model, and the test set is used to verify the performance of the physiological parameter measurement model on the unknown data set. According to different division modes of the data set, the physiological parameter measurement model can be divided into a calibration model and a non-calibration model, the data of the same subject appearing in the training set and the test set is the calibration model, and otherwise, the data is the non-calibration model. The results of the performance test of this example are shown in fig. 4. FIGS. 4 (a), (c) reflect the correlation between the measured values and the true values of the non-calibrated model (Cal-free) on the test set; fig. 4 (b), (d) reflect the correlation between the measured values and the actual values of the calibration model (Cal-based) on the test set. The result shows that the physiological parameter measurement model constructed by the embodiment can accurately measure the systolic pressure and the diastolic pressure.

Claims

1. A physiological parameter measuring method based on a multi-scale fusion network is characterized by comprising the following specific steps:

(1) collecting physiological signals of a biological individual in a static mode or a dynamic mode; preprocessing the acquired physiological signals, namely performing data segment cutting, resampling and normalization operation, and then removing the interference of baseline drift, power frequency, respiration, motion artifacts and electromyographic noise by adopting a filtering or other signal processing method to obtain one-dimensional signal segments;

(2) expanding dimensionality of the one-dimensional signal segment obtained in the step (1) by adopting mathematical transformation to construct more complete and rich physiological signal representation to obtain a three-dimensional matrix [ alpha ], [ alpha ] formB, C, N]Wherein:Bas the total number of data segments after the segmentation,Cfor the dimensions after the mathematical transformation to be used,Nthe length of the data segment, namely the number of sample points;

(3) subjecting the three-dimensional matrix obtained in step (2)B, C, N]Is divided into training sets according to a certain proportionB ₁, C, N]And the test setB ₂, C, N]，B ₁AndB ₂the number of fragments in the training set and test set, respectively, andB ₁andB ₂the sum being equal toB；

(4) Constructing a physiological parameter measurement model, wherein the model comprises three parts: multi-scale fusion network, hybrid attention mechanism and convolution networkA layer; according to a predetermined scale numberIDesigning a multi-scale fusion convolutional layer, and then fusing the number of convolutional layers according to the preset multi-scaleMSetting a multi-scale fusion network, and enabling the three-dimensional matrix obtained in the step (2)B, C, N]Inputting the data into the multi-scale fusion network to obtain a first characteristic data three-dimensional matrixS(ii) a Digging out a second characteristic data three-dimensional matrix related to the physiological parameter to be measured by using a mixed attention machineS ₁(ii) a According to the set number of layers of the convolutional networkMUsing a convolutional network layer to perform three-dimensional matrix on the second characteristic dataS ₁Performing regression calculation to generate an estimated value of the physiological parameter to obtain a two-dimensional data matrixB, X]Wherein:Xis an estimated value of the physiological parameter to be measured;

2. The method of claim 1, wherein the physiological parameter comprises one or more of heart rate, blood pressure, respiration rate, cardiac function index, and arteriosclerosis index.

3. The method according to claim 1, wherein the acquired signal in step (1) is a physiological signal containing physiological and pathological information of cardiovascular system, specifically: one or more of electrocardio signals, pulse wave signals, heart impact signals or heart sound signals.

4. The method according to claim 1, wherein the mathematical transformation method for dimension expansion of the physiological signal in step (2) is specifically: any one of difference, integral, fourier transform, wavelet transform, empirical mode decomposition, or variational mode decomposition.

5. The method of claim 1, wherein the multi-scale fusion convolutional layer in step (4) comprises the following steps:

(4.1) according to the number of output channels of the set multi-scale fusion convolutional layer, setting the multi-scale fusion convolutional layer with the convolutional kernel size of 1 to adjust the number of channels of input data;

(4.2) number of scales according to settingISetting upIMulti-scale fusion convolutional layer with different convolutional kernel sizesF ₁, F ₂,…F _IThis isIThe sizes of convolution kernels of the multi-scale fusion convolution layers are spaced by 2;Ithe input matrix is simultaneously convolved by a plurality of multi-scale fusion convolution layers to obtainIAn output matrixY ₁, Y ₂…Y _IUsing global pooling pairsIReducing the dimension of each output matrix to obtain a one-dimensional embedded vectorz ₁, z ₂, …z _IThen using two fully-connected layers andsoftmaxweight matrix obtained by compressing and recovering information of embedded vectorW ₁, W ₂, …W _IThe weight matrixW ₁, W ₂, …W _IAnd the transmissionGo out matrixY ₁, Y ₂…Y _IAnd correspondingly multiplying and summing to obtain the multi-scale fusion feature.

6. The method of claim 5, wherein the convolution operation is performed on the input matrix using convolutional layers, and wherein the convolution operation is either a classical convolution or a dilated convolution.

7. The method of claim 5, wherein the output matrix is reduced in dimension using global pooling, and wherein the method is global average pooling or global maximum pooling.

8. The method according to claim 1, wherein the step (4) further mines a second three-dimensional matrix of feature data related to the physiological parameter to be estimated by using a mixed attention mechanism, specifically: a bottleneck attention module or a convolution module attention mechanism.