Nothing Special   »   [go: up one dir, main page]

Next Article in Journal
Entanglement Swapping and Swapped Entanglement
Next Article in Special Issue
Decision Fault Tree Learning and Differential Lyapunov Optimal Control for Path Tracking
Previous Article in Journal
Deep Reinforcement Learning-Assisted Optimization for Resource Allocation in Downlink OFDMA Cooperative Systems
Previous Article in Special Issue
Trend Feature Consistency Guided Deep Learning Method for Minor Fault Diagnosis
You seem to have javascript disabled. Please note that many of the page functionalities won't work as expected without javascript enabled.
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Application of Fault Diagnosis Method Combining Finite Element Method and Transfer Learning for Insufficient Turbine Rotor Fault Samples

China Institute of FTZ Supply Chain, Shanghai Maritime University, Shanghai 201306, China
*
Author to whom correspondence should be addressed.
Entropy 2023, 25(3), 414; https://doi.org/10.3390/e25030414
Submission received: 13 January 2023 / Revised: 20 February 2023 / Accepted: 21 February 2023 / Published: 24 February 2023
Figure 1
<p>Diagram of residual blocks.</p> ">
Figure 2
<p>ResNet18 network structure diagram.</p> ">
Figure 3
<p>Schematic diagram of transfer learning.</p> ">
Figure 4
<p>Flow chart of turbine rotor fault diagnosis method based on FEMATL method.</p> ">
Figure 5
<p>Rotor-bearing system experimental platform: (<b>a</b>) Experimental platform. (<b>b</b>) Dimensions of the rotor model, where L1 = 140 mm, L2 = L3 = 435 mm, L4 = 180 mm, H = 50 mm, D1 = 50 mm, D2 = 60 mm, D3 = 350 mm, and the two displacement sensors are located 60 mm to the left of the turntable.</p> ">
Figure 6
<p>Finite element model of rotor bearing system.</p> ">
Figure 7
<p>Comparison of experimental and simulated signals in the X- and Y-directions.</p> ">
Figure 8
<p>50 N rubbing force in X- and Y-directions.</p> ">
Figure 9
<p>Nine faulty rotor simulation signals: (<b>a</b>–<b>c</b>) three misalignment faults, (<b>d</b>–<b>f</b>) three rubbing faults, (<b>g</b>–<b>i</b>) three unbalance faults.</p> ">
Figure 10
<p>Original signal and noisy signal with SNR = 10.</p> ">
Figure 11
<p>Time-frequency diagrams transformed by four different time-frequency analysis methods. (<b>a</b>) STFT time-frequency diagram, (<b>b</b>) CWT time-frequency diagram, (<b>c</b>) GST time-frequency diagram, (<b>d</b>) WVD time-frequency diagram.</p> ">
Figure 12
<p>Diagram of transfer learning from source domain to target domain.</p> ">
Figure 13
<p>Accuracy curves for different time-frequency diagram datasets: (<b>a</b>) STFT time-frequency diagram dataset, (<b>b</b>) CWT time-frequency diagram dataset, (<b>c</b>) GST time-frequency diagram dataset, (<b>d</b>) WVD time-frequency diagram dataset.</p> ">
Figure 14
<p>Accuracy of the validation set during the iterations of different time-frequency diagrams.</p> ">
Figure 15
<p>Multi-classification confusion matrix for transfer learning based on ResNet18 network. (<b>a</b>) STFT confusion matrix, (<b>b</b>) CWT confusion matrix, (<b>c</b>) GST confusion matrix, (<b>d</b>) WVD confusion matrix.</p> ">
Figure 16
<p>t-SNE visualization of the fully connected layer of the ResNet18 network. (<b>a</b>) STFT dataset, (<b>b</b>) CWT dataset, (<b>c</b>) GST dataset, (<b>d</b>) WVD dataset.</p> ">
Figure 17
<p>Loss and accuracy curves of ResNet18 network without migration learning for CWT time-frequency diagrams. (<b>a</b>) Loss curve. (<b>b</b>) Accuracy curve.</p> ">
Versions Notes

Abstract

:
Deep learning has led to significant progress in the fault diagnosis of mechanical systems. These intelligent models often require large amounts of training data to ensure their generalization capabilities. However, the difficulty of obtaining turbine rotor fault data poses a new challenge for intelligent fault diagnosis. In this study, a turbine rotor fault diagnosis method based on the finite element method and transfer learning (FEMATL) is proposed, ensuring that the intelligent model can maintain high diagnostic accuracy in the case of insufficient samples. This method fully exploits the finite element method (FEM) and transfer learning (TL) for small-sample problems. First, FEM is used to generate data samples with fault information, and then the one-dimensional vibration displacement signal is transformed into a two-dimensional time-frequency diagram (TFD) by taking advantage of the deep learning model to recognize the image. Finally, a pre-trained ResNet18 network was used as the input to carry out transfer learning. The feature extraction layer of the network was trained on the ImageNet dataset and a fully connected layer was used to match the specific classification problems. The experimental results show that the method requires only a small amount of training data to achieve high diagnostic accuracy and significantly reduces the training time.

1. Introduction

Rotating machines are widely used in mechanical systems and their working environment is complex and changeable. Their operating conditions directly affect the stability of the entire mechanical system; therefore, the fault diagnosis of rotating machines is receiving increasing attention. As a typical rotating machine, a turbine rotor operates under a wide range of variable conditions for a long period of time, which can have serious consequences if it fails.
To gain insight into rotor fault effects, Wang et al. [1] investigated the effects of rotor unbalance and misalignment faults on a two-rotor system. Ebrahimi et al. [2] studied the dynamic responses of unbalanced, flexible rotors. Ren et al. [3] investigated a coupled rotor-bearing system with touching and cracking faults to reveal its dynamic behavior. Zhang et al. [4] studied the dynamic response of the turbine blades. Hong et al. [5] studied the touch response of a single-disk rotor system under additional constraints.
Most rotating machinery fault diagnoses are based on vibration signals, and the processing of faulty mechanical signals using signal-processing techniques is the main method of fault detection. Specifically, signal features are extracted through different signal processing techniques to determine the fault, including improved local mean decomposition (LMD) [6], empirical mode decomposition (EMD) [7], ensemble empirical mode decomposition (EEMD) [8], and variational modal decomposition (VMD) [9]. However, all these feature extraction methods require a priori knowledge, and the feature extraction work directly affects the results of the fault diagnosis. In recent years, the emergence of deep learning theory has reformed intelligent fault diagnosis (IFD) [10]. Since it can automatically extract signal features without prior knowledge, thus freeing human feature extraction, deep learning provides useful tools for processing and analyzing big data [11] and is favored by more and more experts and scholars [12]. Wang et al. [13] proposed an extended deep belief network (EDBN) for fault classification. Zhang [14] and Wu [15] constructed one-dimensional convolutional neural networks to achieve timely and accurate fault detection for bearings and turbine rotors, respectively. Chen et al. [16] combined convolutional neural networks (CNN) with long short-term memory (LSTM) for the fault diagnosis of bearings, and the diagnostic results outperformed other intelligent algorithms based on a priori knowledge.
Although these AI models are powerful tools for troubleshooting mechanical systems, the most important factor for achieving good diagnostic results is that there must be sufficient training samples for the models to learn. The lack of or difficulty in obtaining turbine rotor fault samples poses a new challenge for intelligent fault diagnosis of turbine rotors. Some mechanical equipment failure samples are insufficient and some scholars have conducted research on small or unbalanced data through generative adversarial networks (GANs) and their variants. Liu et al. [17] proposed a diagnostic framework for a GAN and multi-sensor fusion, where generators are used to synthesize data and thus compensate for small samples of category data. The validation was performed on bearing and gearbox datasets and the results showed the good performance of the proposed method. Ding et al. [18] used a GAN to generate synthetic samples related to real samples under small sample conditions and were able to identify different fault types stably, even with fewer training samples, validating the model on bearing and gearbox datasets. Li et al. [19] proposed an ACW-GAN-GP model to address the diagnostic effects of unbalanced datasets. Luo et al. [20] proposed a C-DCGAN model for aiding the generation of synthetic samples for the fault diagnosis of highly unbalanced datasets and validated the model on bearing and gearbox datasets, proving that it exhibits better fault diagnosis capabilities in the case of small samples. Other scholars have expanded datasets using different data enhancement methods [21]. Numerical simulation is another method used to solve the problem of insufficient data samples. Xiang [22] and Liu et al. [23] established finite element models for rotors and bearings, respectively, and expanded the datasets using simulation data.
Transfer learning is another way to solve the small-sample problem. The AI model is trained on the source dataset and only requires a small number of target data samples when migrating to the target dataset, which can cause the model to converge quickly. Chen et al. [24] studied the application of transfer learning on wind turbine fault datasets with unbalanced data and different distributions and achieved good performance. Huang et al. [25] realized the fault diagnosis of rolling bearings using wavelet transform and transfer learning. Li et al. [26] implemented a fault diagnosis of rolling bearings using deep convolutional domain adversarial transfer learning. Zhu et al. [27] proposed a transfer learning method based on multisource domain adaptation and verified it on four bearing datasets, proving the effectiveness of the proposed method.
Because the diagnosis effect of the AI model with insufficient sample data is poor, different scholars have adopted various methods to solve this problem. For large rotating mechanical equipment such as turbine rotors, sample data are lacking and difficult to obtain, which can lead to poor results in the established intelligent diagnostic models.
In this study, the advantages of finite element simulation and transfer learning in small-sample problems are fully utilized to achieve high diagnostic accuracy for turbine rotors with insufficient fault data, and the proposed FEMATL method can effectively solve the problem of insufficient samples.
The specific arrangement of this paper is as follows. Section 2 introduces the basic theory and fundamental ideas of the FEMATL method and Section 3 introduces the experimental bench equipment for the rotor. The results of the FEMATL method for a specific case are presented in Section 4. The conclusions of this study are presented in Section 5.

2. Introduction to the FEMATL Method

Aiming at the problem of the difficulty in training AI models due to insufficient turbine rotor samples, the FEMATL method proposed in this study can effectively solve the problem of insufficient turbine rotor fault samples, accelerate the model training speed, and improve the diagnosis accuracy. This section describes the techniques used in this method and specifies the details of the FEMATL method.

2.1. Rotor Failure Mechanism

The rotor-bearing system is formed by a combination of a rotor shaft, disc, and plain bearing. The support at both ends of the spindle is provided by plain bearings and the disc is located at the center of the shaft and connected to the shaft using an interference fit.
By integrating the rotating shaft, disc, and bearings, the dynamic equation of the system can be expressed as:
M X ¨ + ( C + ω G ) X ˙ + K X = f
where M is the mass matrix, C is the damping matrix, G is the gyroscopic matrix resulting from rotor rotation, K is the stiffness matrix of the rotor, and f denotes the failure force, including the unbalanced force of the rotor and the force resulting from the misalignment failure. X is the displacement matrix and ω is the rotational speed.
Common failures of rotors include rotor misalignment, rubbing, and rotor unbalance. According to three common fault mechanisms [28,29], when the rotor misalignment fault is considered, the misalignment force acts on the rotating shaft through coupling and the generated excitation forces fc(x) and fc(y) are simultaneously applied to the rightmost node of the rotor. fc(x) and fc(y) can be obtained using the following equations:
f c ( x ) = m c c ω 2 cos ( 2 ω t ) f c ( y ) = m c c ω 2 sin ( 2 ω t )
where mc is the coupling mass, c represents the parallel offset distance between the two halves of the rotor, and ω represents the rotational speed of the rotor.
A rotor rubbing fault generally occurs in a rotor-bearing system with radial clearance, specifically manifested as the contact and separation between the rotor and stator of rotation. In the rotation process, when a rub-impact fault occurs between the rotor and stator, an instantaneous impact is generated. According to [29], the impact force can be equivalent to a triangle and acts on the rotating shaft.
When an unbalance failure occurs in the rotor, the centrifugal forces fm(x) and fm(y) generated by the unbalance of the rotor are applied in the X and Y directions of the disk mass unit Mass21, respectively. The unbalanced force is
f m ( x ) = m δ δ ω 2 cos ( ω t + φ u ) f m ( y ) = m δ δ ω 2 sin ( ω t + φ u )
where δ denotes the eccentric distance, ω denotes the rotational speed, φu denotes the initial phase angle, and mδ denotes the eccentric mass.

2.2. Time-Frequency Analysis Method

To transform the one-dimensional vibration signal of the turbine rotor-bearing system, which is not in an image format, into data in an image format, four time-frequency analysis methods were considered in this study: the short-time Fourier transform (STFT) [30], continuous wavelet transform (CWT) [31], generalized S-transform (GST) [32], and Wigner–Ville distribution (WVD) [33].
STFT:
When the signal passes through the STFT, a spectrum diagram is generated and the STFT can map the signal to a two-dimensional function on the time-frequency plane:
H ( t , f ) = x ( τ ) g ( τ t ) e j 2 π f τ d τ
where x(τ) is the time signal being analyzed, t is the moment of analysis, τ is the time constant, f is the frequency, g(t) is the window function, and the mode square of the coefficient H of the STFT is called the spectrogram.
CWT:
When the signal passes through the CWT, a wavelet time-frequency diagram is generated. The CWT uses a wavelet basis function to map the signal to a 2D function in the plane where the scale factor and translation factor are located.
Z ( a , b ) = 1 b x ( t ) h ( t a b ) d t
where a is the translation factor, b is the scale factor, h(t) is the wavelet basis function, and the mode square of the coefficients Z of the CWT is called the wavelet time-frequency diagram.
GST:
Based on the S-transform, the GST improves the time-frequency resolution of the S-transform by introducing the parameters λ and p to adjust the time continuation degree and attenuation trend of the window function. The formula is as follows:
S ( τ , f ) = x ( t ) λ | f | p 2 π e ( λ 2 f 2 p ( τ t ) 2 2 ) e ( i 2 π f ) d t
where λ > 0 and p > 0. When λ = 1 and p = 1, this is the standard S-transform. x(t) is the time signal being analyzed, f is the frequency, and τ is the translation.
WVD:
The WVD is a quadratic time-frequency distribution and the WVD of x(t) of the signal is defined as
W ( t , f ) = x ( t + τ 2 ) x * ( t τ 2 ) e j 2 π f τ d τ
where x(t) is the time signal and * denotes the complex conjugate. Because x(t) appears twice, it is called a bilinear transformation. The equation does not contain any window functions, thus avoiding the contradiction that its time and frequency resolutions cannot be reconciled.

2.3. Deep Residual Network (ResNet)

To address the difficulty of training neural networks as the depth of a model increases, He et al. [34] proposed the Residual Network (ResNet). The ResNet network is composed of multiple residual blocks stacked on top of each other. The residual blocks are shown in Figure 1.
Through the residual connection, the input x of the neural network is directly connected to the output of the nonlinear layer through an identity map. Thus, the residual module only needs to learn the parts that are different between the input and output, simplifying the network learning task and improving the discrimination ability of the network. The learning process is as follows:
y = F ( x ) + x
Owing to the residual connectivity, the ResNet network significantly alleviates the problems associated with deeper network layers, allowing deeper networks to be built and improving the stability of model training.
In this paper, the ResNet18 [35] network is used for model training. The basic architecture of ResNet18 is a ResNet and the depth of the network is 18 layers so it is called ResNet18. The network structure is shown in Figure 2. It consists of 17 convolutional layers and one fully connected layer. The “3 × 3 CONV, 64” indicates that the convolutional kernel size of the convolutional layer is 3 × 3 and the number of convolutional kernels is 64.

2.4. Transfer Learning and ImageNet Dataset

2.4.1. Transfer Learning

Transfer learning is the knowledge learned from data in the source domain to improve or optimize the learning effect of the target prediction function in the target domain [36]. The correlation between data from two different domains is exploited to transfer knowledge from the source domain to the target domain, thereby reducing the training time or improving the accuracy.
Given a source domain Ds = {Xs, Ps(X)} and task Ts = {Ys, fs(·)} and a target domain Dt = {Xt, Pt(X)} and task Tt = {Yt, ft(·)}, the purpose of transfer learning is to help or improve the prediction function f(·) in the target domain with the knowledge learned in the source domain. Generally, it is assumed that the source and target domains are not identical. A schematic of the transfer learning between the source and target domains is shown in Figure 3.

2.4.2. ImageNet Dataset

Instead of training network models from scratch, many people working in the field of computer vision often use open source models pre-trained on the ImageNet dataset [37] and migrate them to local models using transfer learning to shorten the model training time and achieve better results. There are 1000 categories in the ImageNet dataset, including animals, plants, buildings, vehicles, and more. There are more than 1000 images in each category of which about 1.2 million images are used as the training set, more than 50,000 images are used as the validation set, and more than 100,000 images are used as the test set. The ImageNet dataset is rich in categories and has a huge number of images. A network trained on the ImageNet dataset can extract general features from images, which can improve the recognition ability of the network. In this paper, the ResNet18 network is pre-trained on the ImageNet dataset and then migrated to the turbine rotor fault dataset for fine-tuning, which significantly shortened the training time and improved the network’s diagnostic ability.

2.5. The Proposed Method

Figure 4 shows a flow diagram of the FEMATL method. This method can effectively solve the problem of insufficient turbine rotor fault samples, accelerate the model training speed, and improve diagnostic accuracy. The method is roughly divided into three steps: finite element simulation to obtain fault samples, transformation of 1D vibration signals into 2D time-frequency maps, and transfer learning fault diagnosis based on ResNet18. The technical details are described in detail below.
Step 1: Finite element simulation to obtain fault samples.
(1) Construct a finite element model of the turbine rotor-bearing system. The finite element software ANSYS was used to build the model and generate the simulated vibration signals. The beam element was used to simulate the rotating shaft, the mass element to simulate the disc, and the bearing element to simulate the sliding bearings at both ends of the rotating shaft.
(2) Modify the rotor-bearing finite element model. Since a perfectly balanced rotor cannot exist, the eccentric load caused by the rotor eccentricity acts on the rotor and causes it to vibrate. In order to ensure that the finite element model matched the actual rotor model, it was necessary to set a comprehensive eccentricity for the rotor and continuously correct this eccentricity so that the finite element simulation signal and the actual rotor measurement signal matched. In order to measure the matching degree between the analog signal and the measured signal, cosine similarity was used to measure the similarity between them. Cosine similarity maps two curves to a vector space and calculates the cosine value of the angle between two vectors. The closer the cosine value is to 1, the more similar the two curves are. The cosine similarity calculation formula of two vectors x and y is:
cos x , y = i = 1 n x i y i i = 1 n x i 2 i = 1 n y i 2
where xi and yi are the i-th elements of vectors x and y, respectively. After this step, the finite element model of the rotor-bearing system was built to reflect the actual rotor vibration.
(3) Simulation to generate the fault samples. According to the rotor fault mechanism, the corresponding excitation force was applied to the rotor-bearing finite element model to simulate and generate different types of fault signals. To simulate the vibrations of actual machinery and equipment, Gaussian noise with a certain signal-to-noise ratio was artificially added to the vibration signal.
Step 2: Transformation of a one-dimensional vibration signal into a two-dimensional time-frequency diagram.
To take advantage of deep learning in image processing, the 1D vibration signal generated by the simulation was transformed into a time-frequency diagram in RGB format. In order to study the influence of different time-frequency analysis techniques on rotor fault diagnosis, the one-dimensional vibration signals were generated as short-time Fourier transform time-frequency diagrams (STFT TFD), continuous wavelet transform time-frequency diagrams (CWT TFD), generalized S-transform time-frequency diagrams (GST TFD), and Wigner–Ville distribution time-frequency diagrams (WVD TFD) according to the different time-frequency analysis methods.
Step 3: Transfer learning fault diagnosis based on ResNet18.
The ResNet18 network was pre-trained on the ImageNet dataset and the feature extraction part of the network had shallow features for extracting the general images. In order to transfer the knowledge learned by the network on ImageNet, in this experiment, the weight parameters of all layers of the network, except for the fully connected layer, were frozen and the fully connected layer was modified to match the brand-new fully connected layer in this experiment. The input of the ResNet18 network, which was the time-frequency map data obtained in step 2, was used and the whole model was retrained.

3. Introduction of Rotor Test Bench

In this section, the rotor experiment platform is introduced to provide a physical basis for the subsequent construction of the finite element model of the rotor-bearing system and the key parameters of the finite element model are modified through the measured signals of the rotor experiment platform.
Due to the complex structure and large structure size of actual turbine rotors, limited by experimental conditions and considerations of test safety, the incomplete geometric similarity of turbine rotors is often used to reduce its geometric size, thus simplifying the structure of prototype rotors and making the model easier to verify [38]. The similarity theory was used to realize the relationship between the prototype system and the simplified model. In recent years, there has been increasing interest among experts and scholars in rotor dynamics research based on similarity theory [39,40,41]. By using similarity theory to design and manufacture a model similar to the prototype, the physical characteristics of the prototype can be reproduced through model experiments and the physical nature and law of the prototype can be revealed [42]. The rotor used in this experiment was a simplified model of the turbine rotor. In the simplification process, we focused mainly on matching the first- and second-order natural frequencies of the experimental rotor to those of the prototype turbine rotor. Through CAE analysis, the first two natural frequencies of the model rotor were the same as those of the turbine rotor. The established model rotor is shown in Figure 5a. The rotor-bearing system is composed of a rotary table, a rotating shaft, and sliding bearings at both ends. The servo motor provides the rotating power of the rotor and lubricating oil is injected into the sliding bearing through the hydraulic system to provide oil-film support during rotor rotation. Two eddy current displacement sensors were placed in the X- and Y-directions of the rotating axis, respectively, at a distance of 60 mm from the disc in a 90° symmetry to collect the vibration signals during the rotation. Figure 5b shows the dimensional data of the rotor. The sampling frequency of the sensor was set to 2000 Hz and the motor rotation speed was 1000 rpm.
The vibration signal measured by the rotor experimental platform was used as a reference for the simulation signal of the finite element model. By constantly modifying the finite element model, the simulation signal gradually approximates the measured signal. Finally, a finite element model of the rotor bearing with high precision was obtained.

4. Results and Discussion

This section verifies the effectiveness of the FEMATL method for a specific case. First, a finite element model of the rotor-bearing system is constructed according to the rotor experimental platform and data samples with fault information are generated based on this model. Then, the vibration signal is transformed into four different time-frequency diagrams. Finally, the ResNet18 network pre-trained on ImageNet was used for transfer learning fault diagnosis. The specific experimental contents are detailed below.

4.1. Finite Element Simulation to Obtain Fault Samples

4.1.1. Finite Element Model of Rotor-Bearing System

In this study, the commercial finite element software ANSYS was used to establish the finite element model of the rotor-bearing system, and the material parameters of the rotor are listed in Table 1. The rotary shaft was simulated using the Beam189 element, the rotary table was simulated using the Mass21 element, and the Combi214 element was simulated using a sliding bearing. The unit types are listed in Table 2.
The finite element model of the rotor-bearing system established according to the above parameters is shown in Figure 6.

4.1.2. Modifying the Finite Element Model of the Rotor-Bearing System

Based on the vibration signals generated by the established finite element model of the rotor-bearing system, the vibration signals were compared with the measured signals of the experimental platform of the rotor-bearing system and the key parameters of the model were modified to match the measured signals.
As a rotor experiences wear owing to machining errors and long-term operating conditions, the rotor has a certain amount of eccentricity and the eccentric load acts on the whole rotor, which causes the rotor to vibrate. To accurately obtain the eccentric load, the eccentric distance e of the rotor must be known. To obtain the rotor eccentricity under normal conditions, the eccentric load (i.e., the unbalanced force) can be applied to the finite element model of the rotor bearing, the finite element simulation signal is compared with the measured signal of the actual rotor, and the eccentricity is constantly updated.
The eccentric force is decomposed in the X- and Y-directions according to Equation (3) and acts on the disc (Mass21) simultaneously. where mδ = 59.71 kg is the total mass of the rotor, δ is the integrated eccentricity, ω = 104.72 rad/s is the rotational speed of the rotor, and the initial phase φu = 0.
The boundary conditions of the finite element model were set to be consistent with those of the experimental rotor and the vibration signals of the simulation model were set to be consistent with those of the experimental rotor by continuously modifying the comprehensive eccentricity of the rotor. Figure 7 shows the experimental vibration signal and simulated vibration signal when the integrated eccentricity was 0.4 mm.
At this point, the cosine similarities of the simulated and measured signals in the X- and Y-directions were 0.999 and 0.998, respectively. The finite element model of the rotor-bearing system can be revised using the above modifications so that the model can correctly reflect the actual running status of the rotor; therefore, the revised integrated eccentricity was 0.4 mm.

4.1.3. Obtaining Fault Samples

The finite element model of the rotor-bearing system obtained using the above method can effectively reflect the vibration of a real rotor. Based on the rotor failure mechanism, the corresponding excitation force is applied to the modified rotor bearing finite element model, and different types of failure samples can be obtained. In this study, three fault states of the turbine rotor system were considered, including the rotor misalignment, rub impact, and rotor unbalance, with each state having three different severities, resulting in a total of nine fault types. This study also considered the health type without failure. The specific method is described below.
(1) Misalignment
In this experiment, only the parallel misalignment of the rotor was considered. According to Equation (2), the excitation forces fc(x) and fc(y) generated by the misalignment were applied to the rightmost node of the rotor simultaneously. The mass of the coupling was set at mc = 1.8 kg; the parallel offset distances c were set at 1, 1.5, and 2 mm; and the speed ω = 104.72 rad/s.
(2) Rubbing
According to the rotor rub-impact fault theory and the method in [29], the impact force generated by the rub impact between the rotor and stator is equivalent to a triangular pulse wave. In this case, impulse forces of 50 N, 70 N, and 90 N were applied simultaneously in the X- and Y-directions to simulate three types of rubbing-fault severities. Figure 8 shows the rubbing forces in the X- and Y-directions for the first degree of the rubbing fault.
(3) Unbalance
According to Equation (3), the centrifugal forces fm(x) and fm(y) generated by rotor unbalance are applied in the X- and Y-directions of mass element Mass21, respectively. In the formula, the eccentricity δ = 0.1 m, the rotational speed ω = 104.72 rad/s, the initial phase φu = 0, and the eccentricity masses mδ= 0.1, 0.2, and 0.3 kg.
For the modified finite element model of the rotor-bearing system, the excitation forces caused by the above nine faults were applied to the model and the displacement data in the X-direction were selected as the fault samples. Table 3 lists the nine fault types and one health type along with the corresponding labels. Figure 9 shows the simulation signals of the nine types of faulty rotors.
In practical applications, mechanical equipment inevitably produces a certain amount of noise. This noise may interfere with the environment and machinery itself. To reflect the effect of noise more accurately, Gaussian white noise with a signal-to-noise ratio of 10 was artificially added to the simulated signal. The signal-to-noise ratio SNR is defined as follows:
S N R = 10 log 10 ( P s P n )
where Ps denotes the signal power and Pn denotes the power of the Gaussian noise. The signal with added noise and the original signal are shown in Figure 10 (with the rotor unbalance fault signal as the example).
Fault samples were created for each fault signal with a sample length of 2048 data points. Each fault type consisted of 500 samples and a total of 5000 samples were used. The training, validation, and test sets were created using a ratio of 3:1:1. The training set consisted of 3000 samples, and the verification set and test set each consisted of 1000 samples.

4.2. Transformation of One-Dimensional Vibration Signal into Two-Dimensional Time-Frequency Diagram

The vibration signals generated by the finite element model of the rotor-bearing system were one-dimensional data that could not be directly used as the input of the ResNet18 network. To take advantage of the deep learning model for image processing, four time-frequency analysis techniques (STFT, CWT, GST, and WVD) were used to transform the one-dimensional vibration displacement signal into a time-frequency diagram. Figure 11 shows four different time-frequency diagrams of the same signal.
The fault signals of each type were converted into different time-frequency graphs using the above methods and four different datasets were generated, including the STFT TFD dataset, CWT TFD dataset, GST TFD dataset, and WVD TFD dataset. Each dataset contained sample pictures of 10 fault types and the dimensions of each picture were 875 × 656 × 3. There were 3000 pictures in the training set, 1000 pictures in the verification set, and 1000 pictures in the test set. Table 4 describes the different rotor fault datasets in detail.

4.3. Turbine Rotor Fault Diagnosis Based on ResNet18

4.3.1. Intelligent Diagnosis of Fault Data Based on Transfer Learning with ResNet18

The ResNet18 network was pre-trained on the ImageNet dataset and the feature extraction part of the network extracted the shallow features of each general image. Transferring this capability to the time-frequency graph dataset accelerated the training speed of the network in the target domain and improved training accuracy. To successfully transfer the network from the source domain to the target domain, the fully connected layer of the network was processed by replacing the fully connected layer of the network on the source domain with a new fully connected layer that matched the target domain classification task, as shown in Figure 12.
The specific operation was to freeze the weight parameters of the network trained on the ImageNet data and replace the original fully connected layer of 1000 neurons with a fully connected layer of 10 neurons. After the pre-training weights were loaded, the time-frequency diagram dataset was input into the network and the model was trained from beginning to end.
In this training, the batch size of each training was set at 128. The Adam [43] optimization algorithm was adopted and the learning rate was set at 0.0001. The cross-entropy loss function was used and the epochs were set to 10. Figure 13 shows the accuracy curves of the training process for each of the four time-frequency diagram datasets. “Acc” in the following images refers to the accuracy.
Table 5 shows the accuracy of the validation set for each iteration. Figure 14 shows the accuracy curves of the validation set during the iterations of the different time-frequency diagram datasets. By combining Table 5 and Figure 14, it can be seen that when the number of iterations was 10, the accuracy of the validation set of the STFT, CWT, and GST TFDs all reached 99.8%, whereas the WVD TFD dataset performed slightly worse, reaching 98.5%. For the four datasets, the ResNet18 network considerably shortened the training time, accelerated the convergence of the model, and achieved high accuracy in a very small number of iterations, indicating that the network successfully transferred knowledge from the ImageNet dataset to the experimental dataset.
In this study, a multi-classification confusion matrix was also introduced to compare and analyze the transfer learning fault diagnosis based on ResNet18, as shown in Figure 15. The values on the main diagonal represent the number of correctly identified samples and the other values represent the number of incorrectly identified samples.
As can be seen in Figure 15, for the four different TFD datasets, only individual samples of the ResNet18 network were identified incorrectly, whereas the rest of the samples were identified correctly, which further indicates that the transfer learning fault diagnosis can effectively identify the fault types.
Precision, recall, specificity, and F1 value are common evaluation indexes for deep learning fault diagnosis by which the effectiveness of the diagnosis can be further evaluated. The formulas for the different evaluation indexes are:
P r e c i s i o n = T P T P + F P R e c a l l = T P T P + T N S p e c i f i c i t y = T N T N + F P F 1 = 2 × P r e c i s i o n × R e c a l l P r e c i s i o n + R e c a l l
In the formulas, TP represents a true positive, TN represents a true negative, FP represents a false positive, and FN represents a false negative. The evaluation indexes of the CWT TFD dataset are shown in Table 6. It can be seen from the table that the evaluation index of each fault classification is above 0.99, which fully indicates that the network has excellent diagnostic performance for the rotor fault dataset, and proves the effectiveness of the proposed method.
To illustrate the powerful recognition ability of the migration learning-based ResNet18 network for different classes of time-frequency graphs, the fully connected layer of ResNet18 was visualized by t-SNE dimensionality reduction, as shown in Figure 16. The t-SNE is a dimensionality reduction technology that can reduce high-dimensional data to low-dimensional data. It is generally used for data visualization. t-SNE diagrams can more intuitively verify the effectiveness of the algorithm.
As shown in Figure 16, after the sample was trained on the ResNet18 network, the feature data points of the same fault type were clustered together, which differs from other fault types. This proves that the transfer learning fault diagnosis network based on ResNet18 can effectively identify different types of faults.

4.3.2. Intelligent Diagnosis of Fault Data Using Resnet18 without Transfer Learning

To illustrate the advantages of transfer learning in this experiment, a fault diagnosis of the ResNet18 network without using a pretrained model was also performed in this study. After randomly initializing all the weight parameters of the ResNet18 network, one of the time-frequency diagram datasets was used as the target data input to the network for training, and the loss versus accuracy curve of its training is shown in Figure 17. The selected time-frequency diagram data were CWT time-frequency diagrams.
When the iteration was 100 times, the accuracy of the verification set reached a maximum of 0.882, which was not as high as the 10 iterations of the ResNet18 network based on transfer learning. Table 7 summarizes the impact of the four TFD datasets with and without transfer learning. It can be seen in the table that convergence of the network was accomplished with a very small number of iterations using transfer learning and the accuracy was higher than the diagnostic results without transfer learning. The training time of the model based on transfer learning was much less than that without transfer learning. Since the fault diagnosis based on transfer learning has higher diagnostic accuracy and less training time, it can meet the requirements of practical engineering.

5. Conclusions

To address the problem of insufficient samples in the fault detection of turbine rotor-bearing systems that affect the training effectiveness of AI models, this study proposes a fault detection method for turbine rotors based on FEMATL. First, a finite element model of the rotor-bearing system was constructed using the finite element method and the model was modified based on the actual rotor measurement signal. For the modified model, different types of fault signals were generated through simulations according to the relevant fault mechanism. Then, to give full play to the image recognition advantages of the deep learning model, the generated one-dimensional vibration displacement signals were converted into corresponding time-frequency diagrams using four different time-frequency analysis techniques. Finally, the time-frequency graph data were used as the input of the deep residual network ResNet18, and intelligent fault diagnosis of the turbine rotor was carried out based on transfer learning. The main conclusions of this study are as follows:
(1) The proposed FEMATL method can realize intelligent diagnosis with high accuracy for turbine rotor fault detection with insufficient fault samples, eliminate the need for a large amount of training data typically required by deep learning models, and solve the problem of small samples in the intelligent diagnosis of turbine rotors.
(2) In this study, four time-frequency analysis techniques, the short-time Fourier transform, continuous wavelet transform, generalized S-transform, and Wigner–Ville distribution, were used to transform the one-dimensional vibration signal of the rotor-bearing system into the corresponding time-frequency diagrams, and the diagnostic effects of the four different time-frequency diagrams were observed.
(3) A well-built finite element model enables the calculation and simulation of a large number of fault samples, even for signals that are difficult or impossible to obtain through physical measurements. Finite element simulations can be very flexible for producing fault samples and expanding fault types. In contrast, transfer learning requires only a small number of fault samples, which can help the model shorten the training time and improve diagnostic accuracy.
In summary, the FEMATL method proposed in this paper can help mechanical devices with insufficient fault samples, such as turbine rotors, to achieve small-sample fault diagnosis with a short training time and high diagnostic accuracy.

Author Contributions

Resources, Q.Z.; writing—original draft preparation, Q.H.; writing—review and editing, J.Q. and J.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China, grant number 51875332.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Wang, N.; Jiang, D. Vibration Response Characteristics of a Dual-Rotor with Unbalance-Misalignment Coupling Faults: Theoretical Analysis and Experimental Study. Mech. Mach. Theory 2018, 125, 207–219. [Google Scholar] [CrossRef]
  2. Ebrahimi, R. Experimental and Theoretical Investigations of Unbalance Flexible Rotor in Active Magnetic Bearings Considering Backup Bearings Contacts. Noise Vib. Worldw. 2022, 53, 161–171. [Google Scholar] [CrossRef]
  3. Ren, Z.; Zhou, S.; Li, C.; Wen, B. Dynamic Characteristics of Multi-Degrees of Freedom System Rotor-Bearing System with Coupling Faults of Rub-Impact and Crack. Chin. J. Mech. Eng. 2014, 27, 785–792. [Google Scholar] [CrossRef]
  4. Zhang, Q.; Wang, H.; Qin, J.; Duan, J. Study on the Collision Dynamics of Integral Shroud Blade for High-Pressure Turbine in Different Integral Shroud Clearance Distance. Noise Vib. Worldw. 2021, 52, 200–211. [Google Scholar] [CrossRef]
  5. Hong, J.; Yu, P.; Zhang, D.; Ma, Y. Nonlinear Dynamic Analysis Using the Complex Nonlinear Modes for a Rotor System with an Additional Constraint Due to Rub-Impact. Mech. Syst. Signal Process. 2019, 116, 443–461. [Google Scholar] [CrossRef]
  6. Wang, Y.; He, Z.; Zi, Y. A Demodulation Method Based on Improved Local Mean Decomposition and Its Application in Rub-Impact Fault Diagnosis. Meas. Sci. Technol. 2009, 20, 025704. [Google Scholar] [CrossRef]
  7. Lei, Y.; Lin, J.; He, Z.; Zuo, M.J. A Review on Empirical Mode Decomposition in Fault Diagnosis of Rotating Machinery. Mech. Syst. Signal Process. 2013, 35, 108–126. [Google Scholar] [CrossRef]
  8. Lei, Y.; He, Z.; Zi, Y. Application of the EEMD Method to Rotor Fault Diagnosis of Rotating Machinery. Mech. Syst. Signal Process. 2009, 23, 1327–1338. [Google Scholar] [CrossRef]
  9. Jiang, X.; Li, S.; Cheng, C. A Novel Method for Adaptive Multiresonance Bands Detection Based on VMD and Using MTEO to Enhance Rolling Element Bearing Fault Diagnosis. Shock Vib. 2016, 2016, 8361289. [Google Scholar] [CrossRef] [Green Version]
  10. Lei, Y.; Yang, B.; Jiang, X.; Jia, F.; Li, N.; Nandi, A.K. Applications of Machine Learning to Machine Fault Diagnosis: A Review and Roadmap. Mech. Syst. Signal Process. 2020, 138, 106587. [Google Scholar] [CrossRef]
  11. Zhao, R.; Yan, R.; Chen, Z.; Mao, K.; Wang, P.; Gao, R.X. Deep Learning and Its Applications to Machine Health Monitoring. Mech. Syst. Signal Process. 2019, 115, 213–237. [Google Scholar] [CrossRef]
  12. Liu, R.; Yang, B.; Zio, E.; Chen, X. Artificial Intelligence for Fault Diagnosis of Rotating Machinery: A Review. Mech. Syst. Signal Process. 2018, 108, 33–47. [Google Scholar] [CrossRef]
  13. Wang, Y.; Pan, Z.; Yuan, X.; Yang, C.; Gui, W. A Novel Deep Learning Based Fault Diagnosis Approach for Chemical Process with Extended Deep Belief Network. ISA Trans. 2020, 96, 457–467. [Google Scholar] [CrossRef]
  14. Zhang, W.; Peng, G.; Li, C.; Chen, Y.; Zhang, Z. A New Deep Learning Model for Fault Diagnosis with Good Anti-Noise and Domain Adaptation Ability on Raw Vibration Signals. Sensors 2017, 17, 425. [Google Scholar] [CrossRef] [Green Version]
  15. Wu, X.; Peng, Z.; Ren, J.; Cheng, C.; Zhang, W.; Wang, D. Rub-Impact Fault Diagnosis of Rotating Machinery Based on 1-D Convolutional Neural Networks. IEEE Sens. J. 2020, 20, 8349–8363. [Google Scholar] [CrossRef]
  16. Chen, X.; Zhang, B.; Gao, D. Bearing Fault Diagnosis Base on Multi-Scale CNN and LSTM Model. J. Intell. Manuf. 2021, 32, 971–987. [Google Scholar] [CrossRef]
  17. Liu, Q.; Ma, G.; Cheng, C. Data Fusion Generative Adversarial Network for Multi-Class Imbalanced Fault Diagnosis of Rotating Machinery. IEEE Access 2020, 8, 70111–70124. [Google Scholar] [CrossRef]
  18. Ding, Y.; Ma, L.; Ma, J.; Wang, C.; Lu, C. A Generative Adversarial Network-Based Intelligent Fault Diagnosis Method for Rotating Machinery Under Small Sample Size Conditions. IEEE Access 2019, 7, 149736–149749. [Google Scholar] [CrossRef]
  19. Li, Z.; Zheng, T.; Wang, Y.; Cao, Z.; Guo, Z.; Fu, H. A Novel Method for Imbalanced Fault Diagnosis of Rotating Machinery Based on Generative Adversarial Networks. IEEE Trans. Instrum. Meas. 2021, 70, 1–17. [Google Scholar] [CrossRef]
  20. Luo, J.; Huang, J.; Li, H. A Case Study of Conditional Deep Convolutional Generative Adversarial Networks in Machine Fault Diagnosis. J. Intell. Manuf. 2021, 32, 407–425. [Google Scholar] [CrossRef]
  21. Li, X.; Zhang, W.; Ding, Q.; Sun, J.-Q. Intelligent Rotating Machinery Fault Diagnosis Based on Deep Learning Using Data Augmentation. J. Intell. Manuf. 2020, 31, 433–452. [Google Scholar] [CrossRef]
  22. Xiang, J.; Zhong, Y. A Novel Personalized Diagnosis Methodology Using Numerical Simulation and an Intelligent Method to Detect Faults in a Shaft. Appl. Sci. 2016, 6, 414. [Google Scholar] [CrossRef] [Green Version]
  23. Liu, X.; Huang, H.; Xiang, J. A Personalized Diagnosis Method to Detect Faults in a Bearing Based on Acceleration Sensors and an FEM Simulation Driving Support Vector Machine. Sensors 2020, 20, 420. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  24. Chen, W.; Qiu, Y.; Feng, Y.; Li, Y.; Kusiak, A. Diagnosis of Wind Turbine Faults with Transfer Learning Algorithms. Renew. Energy 2021, 163, 2053–2067. [Google Scholar] [CrossRef]
  25. Huang, D.; Zeng, Y.; Zhang, Y. Transfer Learning Fault Diagnosis Method of Rolling Bearing Based on Laplace Wavelet and Deep Residual Neural Network. SSRN Electron. J. 2022. [CrossRef]
  26. Li, F.; Tang, T.; Tang, B.; He, Q. Deep Convolution Domain-Adversarial Transfer Learning for Fault Diagnosis of Rolling Bearings. Measurement 2021, 169, 108339. [Google Scholar] [CrossRef]
  27. Zhu, J.; Chen, N.; Shen, C. A New Multiple Source Domain Adaptation Fault Diagnosis Method Between Different Rotating Machines. IEEE Trans. Ind. Inform. 2021, 17, 4788–4797. [Google Scholar] [CrossRef]
  28. Jalan, A.K.; Mohanty, A.R. Model Based Fault Diagnosis of a Rotor–Bearing System for Misalignment and Unbalance under Steady-State Condition. J. Sound Vib. 2009, 327, 604–622. [Google Scholar] [CrossRef]
  29. Cao, H.; Niu, L.; He, Z. Method for Vibration Response Simulation and Sensor Placement Optimization of a Machine Tool Spindle System with a Bearing Defect. Sensors 2012, 12, 8732–8754. [Google Scholar] [CrossRef] [Green Version]
  30. Chandra, N.H.; Sekhar, A.S. Fault Detection in Rotor Bearing Systems Using Time Frequency Techniques. Mech. Syst. Signal Process. 2016, 72–73, 105–133. [Google Scholar] [CrossRef]
  31. Cheng, Y.; Lin, M.; Wu, J.; Zhu, H.; Shao, X. Intelligent Fault Diagnosis of Rotating Machinery Based on Continuous Wavelet Transform-Local Binary Convolutional Neural Network. Knowl.-Based Syst. 2021, 216, 106796. [Google Scholar] [CrossRef]
  32. Li, C. Rolling Element Bearing Defect Detection Using the Generalized Synchrosqueezing Transform Guided by Time–Frequency Ridge Enhancement. ISA Trans. 2016, 60, 274–284. [Google Scholar] [CrossRef]
  33. Wu, J.-D.; Huang, C.-K. An Engine Fault Diagnosis System Using Intake Manifold Pressure Signal and Wigner–Ville Distribution Technique. Expert Syst. Appl. 2011, 38, 536–544. [Google Scholar] [CrossRef]
  34. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep Residual Learning for Image Recognition. In Proceedings of the 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
  35. Fan, M.; Xia, J.; Meng, X.; Zhang, K. Single-Phase Grounding Fault Types Identification Based on Multi-Feature Transformation and Fusion. Sensors 2022, 22, 3521. [Google Scholar] [CrossRef]
  36. Pan, S.J.; Yang, Q. A Survey on Transfer Learning. IEEE Trans. Knowl. Data Eng. 2010, 22, 1345–1359. [Google Scholar] [CrossRef]
  37. Russakovsky, O.; Deng, J.; Su, H.; Krause, J.; Satheesh, S.; Ma, S.; Huang, Z.; Karpathy, A.; Khosla, A.; Bernstein, M.; et al. ImageNet Large Scale Visual Recognition Challenge. Int. J. Comput. Vis. 2015, 115, 211–252. [Google Scholar] [CrossRef] [Green Version]
  38. Baxi, C.B.; Telengator, A.; Razvi, J. Rotor Scale Model Tests for Power Conversion Unit of GT-MHR. Nucl. Eng. Des. 2012, 251, 344–348. [Google Scholar] [CrossRef]
  39. Wu, J.-J. Prediction of Lateral Vibration Characteristics of a Full-Size Rotor-Bearing System by Using Those of Its Scale Models. Finite Elem. Anal. Des. 2007, 43, 803–816. [Google Scholar] [CrossRef]
  40. Young, Y.L. Dynamic Hydroelastic Scaling of Self-Adaptive Composite Marine Rotors. Compos. Struct. 2010, 92, 97–106. [Google Scholar] [CrossRef]
  41. Ramu, M.; Prabhu Raja, V.; Thyla, P.R. Establishment of Structural Similitude for Elastic Models and Validation of Scaling Laws. KSCE J. Civ. Eng. 2013, 17, 139–144. [Google Scholar] [CrossRef]
  42. Coutinho, C.P.; Baptista, A.J.; Dias Rodrigues, J. Reduced Scale Models Based on Similitude Theory: A Review up to 2015. Eng. Struct. 2016, 119, 81–94. [Google Scholar] [CrossRef]
  43. Chang, Z.; Zhang, Y.; Chen, W. Electricity Price Prediction Based on Hybrid Model of Adam Optimized LSTM Neural Network and Wavelet Transform. Energy 2019, 187, 115804. [Google Scholar] [CrossRef]
Figure 1. Diagram of residual blocks.
Figure 1. Diagram of residual blocks.
Entropy 25 00414 g001
Figure 2. ResNet18 network structure diagram.
Figure 2. ResNet18 network structure diagram.
Entropy 25 00414 g002
Figure 3. Schematic diagram of transfer learning.
Figure 3. Schematic diagram of transfer learning.
Entropy 25 00414 g003
Figure 4. Flow chart of turbine rotor fault diagnosis method based on FEMATL method.
Figure 4. Flow chart of turbine rotor fault diagnosis method based on FEMATL method.
Entropy 25 00414 g004
Figure 5. Rotor-bearing system experimental platform: (a) Experimental platform. (b) Dimensions of the rotor model, where L1 = 140 mm, L2 = L3 = 435 mm, L4 = 180 mm, H = 50 mm, D1 = 50 mm, D2 = 60 mm, D3 = 350 mm, and the two displacement sensors are located 60 mm to the left of the turntable.
Figure 5. Rotor-bearing system experimental platform: (a) Experimental platform. (b) Dimensions of the rotor model, where L1 = 140 mm, L2 = L3 = 435 mm, L4 = 180 mm, H = 50 mm, D1 = 50 mm, D2 = 60 mm, D3 = 350 mm, and the two displacement sensors are located 60 mm to the left of the turntable.
Entropy 25 00414 g005
Figure 6. Finite element model of rotor bearing system.
Figure 6. Finite element model of rotor bearing system.
Entropy 25 00414 g006
Figure 7. Comparison of experimental and simulated signals in the X- and Y-directions.
Figure 7. Comparison of experimental and simulated signals in the X- and Y-directions.
Entropy 25 00414 g007
Figure 8. 50 N rubbing force in X- and Y-directions.
Figure 8. 50 N rubbing force in X- and Y-directions.
Entropy 25 00414 g008
Figure 9. Nine faulty rotor simulation signals: (ac) three misalignment faults, (df) three rubbing faults, (gi) three unbalance faults.
Figure 9. Nine faulty rotor simulation signals: (ac) three misalignment faults, (df) three rubbing faults, (gi) three unbalance faults.
Entropy 25 00414 g009
Figure 10. Original signal and noisy signal with SNR = 10.
Figure 10. Original signal and noisy signal with SNR = 10.
Entropy 25 00414 g010
Figure 11. Time-frequency diagrams transformed by four different time-frequency analysis methods. (a) STFT time-frequency diagram, (b) CWT time-frequency diagram, (c) GST time-frequency diagram, (d) WVD time-frequency diagram.
Figure 11. Time-frequency diagrams transformed by four different time-frequency analysis methods. (a) STFT time-frequency diagram, (b) CWT time-frequency diagram, (c) GST time-frequency diagram, (d) WVD time-frequency diagram.
Entropy 25 00414 g011
Figure 12. Diagram of transfer learning from source domain to target domain.
Figure 12. Diagram of transfer learning from source domain to target domain.
Entropy 25 00414 g012
Figure 13. Accuracy curves for different time-frequency diagram datasets: (a) STFT time-frequency diagram dataset, (b) CWT time-frequency diagram dataset, (c) GST time-frequency diagram dataset, (d) WVD time-frequency diagram dataset.
Figure 13. Accuracy curves for different time-frequency diagram datasets: (a) STFT time-frequency diagram dataset, (b) CWT time-frequency diagram dataset, (c) GST time-frequency diagram dataset, (d) WVD time-frequency diagram dataset.
Entropy 25 00414 g013
Figure 14. Accuracy of the validation set during the iterations of different time-frequency diagrams.
Figure 14. Accuracy of the validation set during the iterations of different time-frequency diagrams.
Entropy 25 00414 g014
Figure 15. Multi-classification confusion matrix for transfer learning based on ResNet18 network. (a) STFT confusion matrix, (b) CWT confusion matrix, (c) GST confusion matrix, (d) WVD confusion matrix.
Figure 15. Multi-classification confusion matrix for transfer learning based on ResNet18 network. (a) STFT confusion matrix, (b) CWT confusion matrix, (c) GST confusion matrix, (d) WVD confusion matrix.
Entropy 25 00414 g015
Figure 16. t-SNE visualization of the fully connected layer of the ResNet18 network. (a) STFT dataset, (b) CWT dataset, (c) GST dataset, (d) WVD dataset.
Figure 16. t-SNE visualization of the fully connected layer of the ResNet18 network. (a) STFT dataset, (b) CWT dataset, (c) GST dataset, (d) WVD dataset.
Entropy 25 00414 g016
Figure 17. Loss and accuracy curves of ResNet18 network without migration learning for CWT time-frequency diagrams. (a) Loss curve. (b) Accuracy curve.
Figure 17. Loss and accuracy curves of ResNet18 network without migration learning for CWT time-frequency diagrams. (a) Loss curve. (b) Accuracy curve.
Entropy 25 00414 g017
Table 1. Material parameters of the rotor.
Table 1. Material parameters of the rotor.
Material ParametersShaftDisc
Density of material7850 kg/m37850 kg/m3
Modulus of elasticity2.05 × 1011 Pa2.05 × 1011 Pa
Poisson’s ratio0.30.3
Mass25.93 kg33.78 kg
Table 2. Rotor-bearing system components and corresponding element types.
Table 2. Rotor-bearing system components and corresponding element types.
Rotor System ComponentsType
ShaftBeam189
DiscMass21
Sliding bearingCombi214
Table 3. Samples of ten different faults and their labels.
Table 3. Samples of ten different faults and their labels.
Fault TypeLabelFault TypeLabel
1.0 mm_parallel misalignment090 N_rubbing5
1.5 mm_parallel misalignment10.1 kg_unbalance6
2.0 mm_parallel misalignment20.2 kg_unbalance7
50 N_rubbing30.3 kg_unbalance8
70 N_rubbing4health9
Table 4. Rotor-bearing system failure dataset descriptions.
Table 4. Rotor-bearing system failure dataset descriptions.
DatasetSTFT DatasetCWT DatasetGST DatasetWVD Dataset
fault category10101010
TFD dimension875 × 656 × 3875 × 656 × 3875 × 656 × 3875 × 656 × 3
number of training set samples3000300030003000
number of samples in the validation set1000100010001000
number of samples in the test set1000100010001000
Table 5. Validation set accuracy in each epoch.
Table 5. Validation set accuracy in each epoch.
Epochs12345678910
STFT0.2280.7720.8540.8840.9730.9540.9950.9980.9980.997
CWT0.4320.7980.9050.9740.9680.9770.9730.9970.9950.998
GST0.4240.8320.9150.9480.9960.9820.9870.9850.9970.998
WVD0.2290.7220.7220.9150.9640.9630.9570.9290.9850.977
Table 6. Evaluation indexes of different fault categories in CWT TFD dataset.
Table 6. Evaluation indexes of different fault categories in CWT TFD dataset.
Fault TypePrecisionRecallSpecificityF1
01.00.991.00.995
11.01.01.01.0
21.01.01.01.0
31.01.01.01.0
41.01.01.01.0
51.01.01.01.0
60.991.00.9990.995
71.01.01.01.0
81.01.01.01.0
90.990.990.9990.99
Table 7. The impact of transfer learning on rotor fault diagnosis.
Table 7. The impact of transfer learning on rotor fault diagnosis.
DatasetEpochsValidation Set AccuracyTraining TimeTest Time
Fault diagnosis using transfer learningSTFT dataset1099.8%399.73/s10.68/s
CWT dataset1099.8%397.12/s10.54/s
GST dataset1099.8%401.26/s10.77/s
WVD dataset1098.5%412.35/s11.03/s
Fault diagnosis without using transfer learningSTFT dataset10087.5%63.76/min11.23/s
CWT dataset10088.2%64.47/min10.66/s
GST dataset10087.1%64.52/min10.89/s
WVD dataset10082.3%65.01/min11.78/s
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Zhang, Q.; He, Q.; Qin, J.; Duan, J. Application of Fault Diagnosis Method Combining Finite Element Method and Transfer Learning for Insufficient Turbine Rotor Fault Samples. Entropy 2023, 25, 414. https://doi.org/10.3390/e25030414

AMA Style

Zhang Q, He Q, Qin J, Duan J. Application of Fault Diagnosis Method Combining Finite Element Method and Transfer Learning for Insufficient Turbine Rotor Fault Samples. Entropy. 2023; 25(3):414. https://doi.org/10.3390/e25030414

Chicago/Turabian Style

Zhang, Qinglei, Qunshan He, Jiyun Qin, and Jianguo Duan. 2023. "Application of Fault Diagnosis Method Combining Finite Element Method and Transfer Learning for Insufficient Turbine Rotor Fault Samples" Entropy 25, no. 3: 414. https://doi.org/10.3390/e25030414

APA Style

Zhang, Q., He, Q., Qin, J., & Duan, J. (2023). Application of Fault Diagnosis Method Combining Finite Element Method and Transfer Learning for Insufficient Turbine Rotor Fault Samples. Entropy, 25(3), 414. https://doi.org/10.3390/e25030414

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop