1. Introduction
As global climate change and environmental problems become more serious, more attention is being paid to renewable energy sources that do not emit large amounts of greenhouse gases and pollutants. Wind energy is one of the most important and cleanest renewable energy sources, which has attracted much attention globally. Its advantages are characterized by high efficiency and non-pollution, wide distribution, abundant reserves, etc. It has good competitiveness in society and good commercialization prospects [
1]. The year-on-year increase in installed wind power capacity will lead to significant challenges in the operation and maintenance of wind turbines. Wind turbines can be categorized into horizontal-axis wind turbines and vertical-axis wind turbines. Wind turbine generators (WTGs) are generally operated in sparsely populated remote areas and inevitably suffer from wind and sunshine, high and low temperatures, sand, salt erosion, and some other unique local climatic influences, and they also withstand the impact of random perturbations arising from the uncertainty of wind energy. At the same time, in order to maximize the capture of wind energy, horizontal-axis wind turbines are becoming larger and larger, the diameter of the wind power blades is increasing, and most of the unit components are lifted up to a hundred meters in the air, so the unit’s operation and maintenance costs are much higher than those of traditional power generation equipment [
2]. When downtime failure occurs, wind farms face huge losses and compensation, so it is necessary to carry out preventive maintenance on wind turbines.
Preventive maintenance refers to maintenance activities carried out prior to equipment failure and is categorized into planned maintenance and maintenance on a case-by-case basis. Preventive maintenance is done through the monitoring of equipment operation with monitoring indicators and finding fault signals before the failure of the equipment to take the necessary maintenance activities, not only to avoid the occurrence of downtime failure, but also to make the equipment work in a good condition for a long time. Planned maintenance is based on a fixed period of time for equipment maintenance activities and is done by arranging maintenance personnel and equipment in advance to save time and temporary scheduling costs. However, the formulation of the maintenance cycle does not take into account the actual state of the equipment for its subsequent operation, and often, the occurring maintenance is not enough [
3,
4,
5]. Zhou Jian [
6] proposed a failure rate model that takes into account the situation of “repair not new” and influences the formulation of preventive maintenance cycles through the influence factor, so that the maintenance plan is more in line with the failure pattern of the components and avoids the over-maintenance and under-maintenance that may be encountered in traditional preventive maintenance. He Xuehong [
7] took into account the aging phenomenon of the equipment with the increase of the operation time and the amount of maintenance, so he introduced the service age regression factor and then formulated the indefinite cycle dimension strategy according to the change of the reliability of the equipment in different maintenance cycles. Zhang Guojun [
8] proposed an improved service age regression factor, which greatly reduces the maintenance cost per unit of time during the maintenance cycle through the correction of the failure rate. Dependent maintenance evaluates the condition of the equipment by collecting parameters and data from the equipment and repairs the equipment based on the results of the condition evaluation [
9,
10,
11,
12]. Since visual maintenance can overcome many of the shortcomings of planned maintenance, visual maintenance is attracting more and more attention.
According to the different ways to detect the state, maintenance can be divided into continuous-monitoring-state maintenance and non-continuous-monitoring-state maintenance. In continuous-monitoring-state maintenance, the monitoring equipment and pre-investment costs are higher, as are the signal processing equipment requirements, so generally it is installed in nuclear power plants and other systems high security requirements. In non-continuous monitoring state maintenance, faults are prone to occurring between two consecutive state detections, so it is extremely important to predict the state between two consecutive state detections. Zhu Wen [
13] used the MSET method to estimate the state of the gearbox bearing temperature of wind turbines, and the real-time trend of the residual difference between the estimated and actual values of the bearing temperature was used to judge whether the gearbox was operating normally or not. Zhang Wenjing [
14] decomposed the denoising-processed vibration signal into wavelet packets, took its energy in each frequency band as a feature vector, and used a back-propagation (BP) neural network for modelling and prediction, so as to achieve an intelligent diagnosis of wind turbine faults. Rosmaini A [
15] focused on issues such as identification and collection of required condition monitoring data, data analysis, modelling, and decision making, and provided an analytical comparison from the perspective of a practical application. Zhang Xiaohong [
16] defined the internal recession of the system as a Markov process, and based on the Markov process, a minimization cost model was developed and solved for the optimal maintenance intervals. Ding Peng [
17] proposed a parameter estimation method for the Weibull distribution based on a small sample of fault data and the case of no fault data to provide ideas for the development of initial preventive maintenance strategies. Luo Sensen [
18] proposed an improved failure prediction and health management (PHM) model, which in turn led to the development of a state-introduced multi-component opportunistic maintenance model for wind turbines. Zhang Lupeng [
19] proposed an opportunity maintenance strategy to repair components that exceed the state maintenance threshold and those that enter the state opportunity maintenance interval at the same time, which greatly reduces the total maintenance cost. In summary, accurate prediction of equipment condition plays a crucial role in finding the most maintenance cycles and developing optimal maintenance strategies. However, when constructing state-change models, most studies do not take into account the effects due to external random perturbations, which in turn do not allow for an accurate estimation of the equipment state.
To address the above problems, this paper took the oil temperature of wind turbine gearboxes as the research object, established a stochastic differential equation model for the change of oil temperature of gearboxes, and used iterative updating and Markov-based updating for the fitted and predicted sequences of the model, respectively. The updated model could more accurately predict the oil temperature of the gearbox, providing a more accurate basis for the development of contingent maintenance strategies.
The rest of the paper is organized as follows:
Section 2 constructs a model of gearbox oil temperature variation.
Section 3 solves the drift and diffusion coefficients.
Section 4 updates the model-fitted sequence and the predicted sequence.
Section 5 verifies the fitting and generalization ability of the model.
2. Stochastic Differential Equation (SDE) Modeling of Gearbox Oil Temperature Variations
Definition 1. represents the oil temperature of the wind turbine gearbox at moment t.
Definition 2. The variation of the gearbox oil temperature is only related to the oil temperature and operating time of the gearbox.
Definition 3. The oil temperature of the gearbox is spatially averaged.
The process of change of
can be expressed as follows:
where
is the relative rate of change.
The variation of the oil temperature of the gearbox is related to the operating time on the one hand, and to random environmental influences on the other hand, the magnitude of which is related to the state in which the equipment is located. Random environmental influences include the weather, wind speed, and routine daily inspections, patrols, and maintenance. Therefore, the relative rate of change can be expressed by the sum of two items, where the first item is the inevitable increase of the gearbox oil temperature with the increase of the running time under the condition of not considering the effect of a random perturbation, i.e., the recession of the gearbox oil temperature itself, and the second item is the effect of a random perturbation on the change of the gearbox oil temperature. The random perturbation is simulated by the Brownian motion here because the perturbation is random. Thus,
can be expressed as:
where
is the recession of the gearbox oil temperature itself,
is the random perturbation received by the gearbox oil temperature, and
is a noise term. Then, Equation (1) is:
Let the expectation of the random perturbation suffered by the gearbox be 0, and the random perturbation be independent and smooth. Then, the external random perturbation can be described by Brownian motion
. Equation (3) can be expressed as follows:
where
is the drift coefficient,
is the diffusion coefficient, and
is Brownian motion.
Suppose
is any constant, for all
and
, if satisfied:
Then it is shown that Equation (4) has a continuous unique solution.
If both Equations (5) and (6) are satisfied, Equation (4) can be expressed as:
where
is the initial time.
3. Drift and Diffusion Coefficient Solutions
The drift coefficient is:
From Equation (8), is the first-order average rate of change at time t.
Assuming that
is a continuous function on
, the drift coefficient
can be approximated by Weierstrass’ first approximation theorem:
Further calculations are available:
When
and
, there is:
It is obtained from the Taylor expansion:
where
is the baseline failure rate function and
is the monitoring value change rate.
Taking expectations for both the left and right sides of Equation (4) yields:
The desired value of the gearbox temperature is:
Since the derivative of
is the product of
and
, the constructed likelihood function is:
where
N is the total number of data and
r is the fault data.
The iterative formula is constructed as:
The diffusion coefficient is:
From Equation (9), is the average second-order moment growth rate at time t.
It is assumed that the disturbance suffered by the gearbox is only related to the current temperature of the gearbox and not the running time. Therefore, the diffusion coefficient can be expressed as:
In the
cycle, take the step
, the step condition is satisfied:
The change in the temperature of the gearbox during the time period
is denoted as
, then there is:
Therefore, the parameter
k can be expressed as:
4. Gearbox Oil Temperature Iteration Update
Definition:
is the ratio of the actual value to the predicted value, then
can be expressed as:
where
is the actual oil temperature of the gearbox at moment
t and
is the predicted oil temperature of the stochastic differential equation model at moment
t.
Based on the ratio of actual to predicted values, the state space is divided into
n, respectively:
where, if
then
If , then it means that at moment , the predicted value lies in state space .
The state transfer matrix is calculated based on the determined state space and the state changes of the predicted sequence:
where
is the n-step state transfer matrix. Due to the specificity of the gearbox oil temperature, only one step state transfer matrix
P needs to be calculated.
Correction of the fitted value data: the predicted sequence is corrected according to the state space to which the predicted sequence of stochastic differential equations belongs, and the corrected fitted sequence is:
Correction of prediction data: the probability that the predicted value is located in each state at the prediction time is known from the one-step state transfer matrix, and the corrected prediction sequence is:
After the above calculations and a new machine temperature prediction sequence
, recalculate the ratio of the actual value to the predicted value
and the state space boundaries
and
, and determine whether the predicted value belongs to the state space. According to the re-determination of the state space and the prediction of the sequence of changes in the re-calculation of the state transfer matrix, the state transfer matrix is calculated as follows:
where
is the probability that the predicted value changes from state
k to state
l before correction, and
is the probability that the predicted value changes from state
k to state
l after correction.
The fitted and test sequences were calculated as above.
A new machine temperature prediction sequence can be obtained after the calculation, and the new machine temperature prediction sequence can be obtained again by re-computing , , , the state space, the new state transfer matrix, and so on for the iterative calculation.
The fitting ability and generalization ability of the stochastic differential equation model prediction sequence
and the prediction sequence after iteration
are verified by calculating the fitting error
and the testing error
. The fitting error
and the testing error
are calculated as:
where
is the predicted sequence after the lth iteration,
n is the number of model fitting samples, and
m is the number of model testing samples.
5. Example Analysis
In this paper, based on the wind turbine gearbox oil temperature monitoring data of a wind farm in Jiuquan area, the stochastic differential equation model and iterative computational method mentioned in previously were utilized to fit and predict the gearbox oil temperature. The gearbox oil temperature monitoring data are shown in
Table 1.
The independent variable is the equipment running time, and the dependent variable is gearbox oil temperature. Model training and iterative calculations were performed using the first 13 monitoring data to predict the 14th monitoring data, and the fitting error and testing error were calculated and analyzed for model accuracy.
Based on the modelling of the gearbox oil temperature variation in
Section 2 and the solution of the drift coefficient and diffusion coefficient in
Section 3, the authors carried out the modelling and simulation in MATLAB R2023a software by combining the built-in functions and custom scripts and functions. The solution of the unknown parameters in the drift parameters needed to be carried out by estimating the parameters according to the Newton–Raphson iterative method described in
Section 3 after choosing the initial values of
,
, and
by the authors themselves. The diffusion coefficient was solved for
. Ultimately, the simulation results of Equation (7) in MATLAB are plotted in
Figure 1.
Due to the addition of Brownian motion to the modelling of gearbox oil temperature changes in
Section 2 and Brownian motion producing a different sequence of random numbers each time it is run, there were differences in the results produced by each MATLAB simulation run, but this just improved the realism and reliability of the simulation results.
The relative error versus time curve is shown in
Figure 2.
With an increase in running time, gearbox oil temperature change showed a trend of slow growth followed by accelerated growth. It can be assumed that there was no significant change in the gearbox oil temperature in the first 733 h. This finding indicates that the collection interval and maintenance interval of the gearbox oil temperature should not be too small in the first 733 h. Setting a collection interval that is too small will lead to an increase in the operating cost of the equipment on the one hand and to the generation of a large amount of redundant data on the other hand, which will increase the difficulty of storing and processing. The maintenance interval being too small could easily lead to excessive maintenance, which would not only increase the operating cost of the equipment, but also reduce its operating reliability.
The difference between the predicted value of the stochastic differential equation model and the actual value of the oil temperature increased significantly after the running time reached 733 h, a phenomenon that indicates that the effect of stochastic perturbations on the gearbox oil temperature becomes more significant as oil temperature increases.
The fitting and testing errors of the stochastic differential equation model were:
The ordinary differential equation was modeled as:
where the drift parameters were solved similarly to
Section 3.
The simulation results of the prediction curve of the gearbox oil temperature ordinary differential equation model are shown in
Figure 3.
The fitting and testing errors of the ordinary differential equation model were:
As can be seen from
Figure 3, the fitting curve based on the ordinary differential equation model produced an overall performance of a smooth curve. Although the general trend changed in line with the trend of the gearbox oil temperature, it could not reflect the impact of the random perturbation, while the stochastic differential equation model with added Brownian motion had the nature of independent increments, which could be used to make the predicted value approximated to any value of the time. It can also be concluded by comparing the fitting error and testing error that the stochastic differential equation model had better fitting and generalization ability than the ordinary differential equation model.
According to the MATLAB simulation results, the first 13 predictions of the stochastic differential equation model could be obtained, and the ratio was calculated according to Equation (25). The first 13 predictions and ratio of the stochastic differential equation model are shown in
Table 2.
By comparing the national standards with other literature, this paper divides the state space into five states, which are:
Overpredicted status
:
where
is calculated as shown in Equations (27)–(29), with
before iterative calculations were performed.
Slightly larger state in the forecast
:
Predicted value accuracy status
:
Slightly smaller state in the forecast
:
Prediction understatement
:
where
is calculated as shown in Equations (27)–(29), with
before iterative calculations were performed.
Based on the above division of the state space, the states to which the predicted values belong are shown in
Table 3.
From
Table 3, the one-step state transfer matrix
P is:
Markov-corrected predicted values are:
The fitted and test data were corrected, and the fitting and test errors were calculated according to the iterative calculation method described in
Section 4. The variation of the fitting and test errors are plotted in
Figure 4 and
Figure 5.
As can be seen in
Figure 4 and
Figure 5, as the number of iterations increased, both the fitting error and the test error stabilized and were smaller than before the iterations. The fitting error eventually stabilized at around 2%, and the testing error eventually stabilized at around 8%, which optimized both the fitting and generalization effects after the iteration, compared to 35.36% and 11.46% before the iteration. It can also be seen that the fitting error tended to stabilize around eight iterations and had a tendency to disperse as the number of iterations increased, and the test error gradually became smaller as the number of iterations increased. However, after the fifteenth iteration, the degree of reduction in the test error with the increase in the number of iterations was no longer significant. The results of the above analysis show that having a higher number of iterations was not necessarily better, and the number of iterations may be too high, which may result in the dispersion of the data and waste of resources.
6. Conclusions
For the wind turbine gearbox oil temperature change problem, this paper establishes a stochastic differential equation model of gearbox oil temperature change and uses an iterative calculation method to optimize the established model. The optimized model could accurately fit the trend of gearbox oil temperature change and predict the gearbox oil temperature. Weierstrass’ first approximation theorem was used in the model to solve the drift coefficients, Brownian motion was used to simulate the external random perturbations, and iterative calculations and Markov chain-based corrections were performed on the predicted values generated by the model, which greatly improved the model’s fitting and generalization capabilities. Meanwhile, when comparing with those of the ordinary differential equation model, the fitting error and test error changed from the previous 39.33% and 22.05% to 35.36% and 11.46%, respectively, which verified the necessity of considering the external stochastic perturbation. In the example analysis, by comparing the fitting error and test error before and after the iteration, it was found that the fitting error and test error changed from the previous 35.36% and 11.46% to about 2.5% and 8%, respectively, which further verified the validity of the iterative update. Obviously, this method of accurately predicting the transmission oil temperature is very important for the development of preventive maintenance.