CN111027775A - Step hydropower station generating capacity prediction method based on long-term and short-term memory network - Google Patents
Step hydropower station generating capacity prediction method based on long-term and short-term memory network Download PDFInfo
- Publication number
- CN111027775A CN111027775A CN201911276332.1A CN201911276332A CN111027775A CN 111027775 A CN111027775 A CN 111027775A CN 201911276332 A CN201911276332 A CN 201911276332A CN 111027775 A CN111027775 A CN 111027775A
- Authority
- CN
- China
- Prior art keywords
- data
- model
- generated energy
- hydropower station
- long
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 36
- 230000007787 long-term memory Effects 0.000 title claims abstract description 16
- 230000006403 short-term memory Effects 0.000 title claims abstract description 16
- 238000010248 power generation Methods 0.000 claims abstract description 54
- 238000012360 testing method Methods 0.000 claims abstract description 25
- 238000012549 training Methods 0.000 claims abstract description 20
- 238000000354 decomposition reaction Methods 0.000 claims abstract description 18
- 238000005457 optimization Methods 0.000 claims abstract description 18
- 230000015654 memory Effects 0.000 claims abstract description 11
- 238000004364 calculation method Methods 0.000 claims description 29
- 230000008569 process Effects 0.000 claims description 6
- 238000001744 unit root test Methods 0.000 claims description 5
- 239000000654 additive Substances 0.000 claims description 4
- 230000000996 additive effect Effects 0.000 claims description 4
- 230000006870 function Effects 0.000 claims description 4
- 238000007689 inspection Methods 0.000 claims description 4
- 230000004913 activation Effects 0.000 claims description 3
- 238000012790 confirmation Methods 0.000 claims description 3
- 238000013135 deep learning Methods 0.000 claims description 3
- 230000010354 integration Effects 0.000 claims description 3
- 210000002569 neuron Anatomy 0.000 claims description 3
- 238000007781 pre-processing Methods 0.000 claims description 3
- 230000011218 segmentation Effects 0.000 claims description 3
- 238000004519 manufacturing process Methods 0.000 claims 4
- 238000013528 artificial neural network Methods 0.000 abstract description 10
- 238000013112 stability test Methods 0.000 abstract 1
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 16
- 238000010586 diagram Methods 0.000 description 13
- 239000011435 rock Substances 0.000 description 12
- 230000000694 effects Effects 0.000 description 8
- 238000004458 analytical method Methods 0.000 description 7
- 238000010219 correlation analysis Methods 0.000 description 7
- 230000007774 longterm Effects 0.000 description 4
- 241000030939 Bubalus bubalis Species 0.000 description 3
- 238000010606 normalization Methods 0.000 description 3
- 238000011160 research Methods 0.000 description 3
- 101001095088 Homo sapiens Melanoma antigen preferentially expressed in tumors Proteins 0.000 description 2
- 102100037020 Melanoma antigen preferentially expressed in tumors Human genes 0.000 description 2
- 235000021170 buffet Nutrition 0.000 description 2
- 230000008859 change Effects 0.000 description 2
- 238000011161 development Methods 0.000 description 2
- 230000018109 developmental process Effects 0.000 description 2
- 230000005611 electricity Effects 0.000 description 2
- 238000002955 isolation Methods 0.000 description 2
- 230000001105 regulatory effect Effects 0.000 description 2
- 238000012300 Sequence Analysis Methods 0.000 description 1
- 230000002159 abnormal effect Effects 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 238000009412 basement excavation Methods 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000007796 conventional method Methods 0.000 description 1
- 230000003247 decreasing effect Effects 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000004146 energy storage Methods 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000011156 evaluation Methods 0.000 description 1
- 238000000605 extraction Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 238000009472 formulation Methods 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 238000005259 measurement Methods 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 230000035772 mutation Effects 0.000 description 1
- 238000003062 neural network model Methods 0.000 description 1
- 230000010355 oscillation Effects 0.000 description 1
- 238000000513 principal component analysis Methods 0.000 description 1
- 230000001131 transforming effect Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/044—Recurrent networks, e.g. Hopfield networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/06—Energy or water supply
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y04—INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
- Y04S—SYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
- Y04S10/00—Systems supporting electrical power generation, transmission or distribution
- Y04S10/50—Systems or methods supporting the power network operation or management, involving a certain degree of interaction with the load-side end user applications
Landscapes
- Engineering & Computer Science (AREA)
- Business, Economics & Management (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Economics (AREA)
- General Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Strategic Management (AREA)
- Human Resources & Organizations (AREA)
- Software Systems (AREA)
- Marketing (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Computation (AREA)
- Mathematical Physics (AREA)
- Data Mining & Analysis (AREA)
- Computational Linguistics (AREA)
- Biophysics (AREA)
- Biomedical Technology (AREA)
- General Business, Economics & Management (AREA)
- Tourism & Hospitality (AREA)
- Artificial Intelligence (AREA)
- Molecular Biology (AREA)
- Life Sciences & Earth Sciences (AREA)
- Primary Health Care (AREA)
- Water Supply & Treatment (AREA)
- Public Health (AREA)
- Development Economics (AREA)
- Game Theory and Decision Science (AREA)
- Entrepreneurship & Innovation (AREA)
- Operations Research (AREA)
- Quality & Reliability (AREA)
- Supply And Distribution Of Alternating Current (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
The invention discloses a step hydropower station generating capacity prediction method based on a long-short term memory network, which comprises the steps of carrying out stability test on a hydropower station generating capacity time sequence; carrying out correlation test on the generated energy time series; converting the generated energy time series data into supervised learning data; establishing a generating capacity prediction model based on a long-term and short-term memory network; performing integrated empirical mode decomposition on the generated energy data to obtain a training set and a test set; training the generated energy prediction model by using a training set, and performing model hyper-parameter optimization by using an improved discrete differential evolution algorithm to obtain optimal model parameters; and adopting a power generation amount prediction model to predict the power generation amount of the cascade hydropower station. The method can be suitable for predicting the generated energy of large and medium-sized step power stations; the generated energy prediction model based on the LSTM neural network has more advantages on the generation energy prediction of the multi-year regulation power station, and the fitting precision of the model is improved by the super-parameter optimization of the model.
Description
Technical Field
The invention belongs to the field of hydropower station energy optimization, and particularly relates to a step hydropower station generating capacity prediction method based on a long-term and short-term memory network.
Background
With the high-speed development of power grid construction and the continuous expansion of the scale of the power grid in China, the requirements of safe, stable and economic operation of the power grid are higher and higher, and the increasing contradiction between the power consumption requirement and the peak regulation capacity brings great challenges to the operation of water and electricity. However, hydropower is used as a main peak shaving power supply in a power grid in China, and the peak shaving requirements of each power receiving power grid need to be fully considered in the output process, so that higher requirements are provided for power generation prediction of an excavation hydropower station based on a multi-constraint situation. The hydropower station generated energy prediction uses data of a past period of hydropower station, such as generated energy, warehousing flow, ex-warehouse flow, dam upper water level, dam lower water level, water head, energy storage value and the like, which have a correlation with hydropower station generated electricity to predict the generated energy of the hydropower station in a future period of time, so that the scheduling plan is convenient to make.
The existing research of generating capacity prediction is mainly oriented to small hydropower stations without regulating capacity, the generating capacity prediction mainly adopts the following methods, the most common method is a gray prediction model, and the method is used for transforming less input information, establishing a differential gray system and explaining the development of things; dynamic neural network models are also commonly used for predicting the generated energy of the radial hydropower station, and comprise a BP neural network, a wavelet neural network, an Artificial Neural Network (ANN) and the like. But the power generation capacity of the radial hydropower station is basically dependent on the incoming water, and the manual operable space is small. The existing research methods only mention the prediction of the power generation capacity of large and medium-sized cascade power stations, which brings difficulty to the formulation of a scheduling plan, and the current research does not consider the long-term dependence of the power generation capacity time sequence.
Disclosure of Invention
The technical problem of the invention is that the existing power generation amount prediction method only refers to the prediction of the power generation amount of large and medium-sized cascade power stations, and does not consider the long-term dependence of the power generation amount time sequence.
The invention aims to solve the problems and provides a step hydropower station electric energy generation amount prediction method based on a Long-Short Term Memory network, wherein an electric energy generation amount prediction model is constructed by using the Long-Short Term Memory network (LSTM), an integrated Empirical Mode Decomposition (EEMD) is carried out on hydropower station electric energy generation amount data to obtain a training set and a test set of the electric energy generation amount prediction model, and meanwhile, an improved discrete differential Evolution (MDDE) algorithm is adopted to carry out hyperparametric optimization on the electric energy generation amount prediction model aiming at the characteristics of multiple types, non-continuity and large manual parameter adjustment difficulty of the LSTM neural network, so that the electric energy generation amount prediction model can be predicted more accurately.
The technical scheme of the invention is a step hydropower station generating capacity prediction method based on a long-short term memory network, which comprises the following steps of 1: preprocessing data of the characteristic engineering, and carrying out stability inspection on a hydropower station generating capacity time sequence;
step 2: performing correlation test on the generated energy time sequence, and selecting a generated energy influence factor with high correlation;
and step 3: converting the generated energy time series data into supervised learning data;
and 4, step 4: establishing a generating capacity prediction model based on a long-term and short-term memory network;
and 5: performing integrated empirical mode decomposition on the generated energy data in the step 3 to obtain a training set and a test set;
step 6: training the generated energy prediction model by using a training set, and performing model hyper-parameter optimization by using an improved discrete differential evolution algorithm to obtain optimal model parameters;
and 7: testing the generated energy prediction model based on the long-term and short-term memory network by using a test set;
and 8: and adopting a power generation amount prediction model based on a long-term and short-term memory network to predict the power generation amount of the cascade hydropower station.
Preferably, in step 1, the stationarity test is performed on the hydropower station generated energy time sequence, a unit root test method is adopted, when the unit root does not exist in the sequence, the sequence is considered to have stationarity, otherwise, the sequence is considered to be a non-stationary time sequence.
Preferably, in step 2, the correlation test is performed on the power generation amount time series, and comprises calculation of a Pearson correlation coefficient and a Spearman correlation coefficient.
Further, in step 3, the power generation amount time series data are converted into supervised learning data, the dimensionality of the time series data is N _ features, the length of the time series is l, and X _ time is the length of a time window and corresponds to sample data; y _ timestamps is the length of a forecast period and corresponds to data label data; and sequentially performing sliding segmentation on the generated energy time sequence, and taking out the sequence with the length of X _ times + Y _ times each time to obtain a plurality of subsequences, wherein the number of the subsequences is N _ samples which is l-X _ times-Y _ times + 1.
Further, in step 5, the performing integrated empirical mode decomposition on the power generation amount data in step 3 specifically includes:
1) importing a data set X, wherein the capacity of the data set is N, and the standard deviation is std;
2) initializing the ratio Nstd of the additive noise to the standard deviation and the integration quantity NE;
3) rounding the logarithm of N, and then subtracting 1 to obtain the number M of IMF components;
4) adding noise to the sequence;
5) performing empirical mode decomposition to obtain an IMF component and a residual error, and entering a step 4 when the decomposition times are less than M);
6) and outputting the IMF component and the residual error.
Further, in step 6, the improved discrete differential evolution algorithm is adopted to perform model hyper-parameter optimization, and the hyper-parameter learning rate, the iteration times, the optimizer, the activation function, the time window and the number of neurons in the deep learning are subjected to unique heat coding and then subjected to differential evolution.
Preferably, the process of performing model hyper-parameter optimization by using the improved discrete differential evolution algorithm utilizes a distributed parallel computing method, and the distributed parallel computing method includes the following steps:
(1) the main node loads a cluster computing node list and initializes the population of an optimization algorithm;
(2) the main node broadcasts confirmation information to all the computing nodes according to the node list, and if the node does not respond, the node is removed from the node list; when the node list is empty, namely all the nodes cannot work, prompting a user to check node information and terminating the calculation;
(3) if the available nodes exist, the main node starts multithreading, balances the calculation tasks of the population to all the calculation nodes, then starts thread waiting, sets timeout time, and waits for all the calculation nodes to return calculation results; when the calculation of a node is overtime, abandoning the calculation result of the node and removing the node;
(4) when all the nodes are calculated, judging whether a termination condition is reached, if the termination condition is not reached, updating the population and the optimal value, and executing the step (3); and if the termination condition is met, performing model training by using the optimal parameters, and outputting the trained model.
Compared with the prior art, the invention has the beneficial effects that:
1) the step short-term power generation amount prediction method based on the long and short-term neural network can be suitable for predicting the power generation amount of large and medium-sized step power stations;
2) through correlation analysis and principal component analysis, factors which have obvious influence on the power generation amount are extracted, and meanwhile, the time sequence is subjected to feature extraction by using an integrated empirical mode decomposition technology, so that the model is favorable for learning scheduling rules from data on different scales, and the generalization capability and stability of the model are improved;
3) the generated energy prediction model based on the LSTM neural network can deal with long-term dependence and has more advantages on the generation amount prediction of a multi-year regulation power station;
4) the ultra-parameter optimization of the generated energy prediction model by adopting the improved discrete differential evolution algorithm improves the fitting precision of the generated energy prediction model based on the long-term and short-term neural network.
Drawings
The invention is further illustrated by the following figures and examples.
Fig. 1 is a flow chart of a cascade hydropower station power generation amount prediction method based on a long-short term memory network.
FIG. 2 is a flow diagram of an integrated empirical mode decomposition.
Fig. 3 is a diagram illustrating a time series conversion into supervised learning data.
FIG. 4 is a diagram of comparing one-hot coding with natural coding.
Fig. 5 is a flowchart illustrating a distributed parallel computing method.
Fig. 6 is a time chart of daily generated energy of the water buffalo power station.
Fig. 7 is a time chart of daily power generation of a river-separating rock power station.
And fig. 8-1 is a generated energy time sequence autocorrelation analysis diagram of the water buffalo power station.
And fig. 8-2 is a time series partial autocorrelation analysis diagram of the power generation amount of the water buffalo power station.
Fig. 8-3 is a diagram of the autocorrelation analysis of the power generation time series of the river-separating rock power station.
Fig. 8-4 is a partial autocorrelation analysis chart of the power generation time series of the river-separating rock power station.
And fig. 9-1 is a Pearson correlation coefficient diagram of correlation analysis of the influence factors of the power generation amount of the water puerto power station.
And FIG. 9-2 is a Spearman correlation coefficient diagram for correlation analysis of the influence factors of the power generation amount of the water puerto power station.
FIG. 10-1 is a Pearson correlation coefficient diagram of the correlation analysis of the influence factors of the power generation of the river-cut rock power station.
FIG. 10-2 is a Spearman correlation coefficient diagram of the correlation analysis of the impact factors of the power generation of the river-separating rock power station.
Fig. 11 is the EEMD decomposition result of the hydrobelock daily generated energy sequence.
FIG. 12 is a graph comparing the predicted effect of the MDDE-EEMD-LSTM model and the LSTM model of the present invention.
FIG. 13 is a graph comparing the predicted effect of the MDDE-EEMD-LSTM model and the LR model of the present invention.
FIG. 14 is a comparison graph of the predicted effect of the MDDE-EEMD-LSTM model and the SVR model of the present invention.
FIG. 15 is a graph comparing the predicted effect of the MDDE-EEMD-LSTM model and the RF model of the present invention.
FIG. 16 is a comparison graph of the predicted effect of the MDDE-EEMD-LSTM model and the MA model of the present invention.
Fig. 17 is a schematic diagram of the structure of an LSTM network.
Detailed Description
As shown in fig. 1, the method for predicting the power generation capacity of the cascade hydropower station based on the long-short term memory network comprises the following steps,
step 1: preprocessing data of characteristic engineering, performing stability inspection on a hydropower station generating capacity time sequence by using a unit root inspection method, and considering that the sequence has stability when the unit root does not exist in the sequence, otherwise, considering that the sequence is a non-stable time sequence;
step 2: performing correlation test on the generated energy time sequence, calculating a Pearson correlation coefficient and a Spearman correlation coefficient, and selecting a generated energy influence factor with high correlation;
and step 3: converting the generated energy time series data into supervised learning data;
and 4, step 4: establishing a generating capacity prediction model based on a long-term and short-term memory network;
and 5: performing integrated empirical mode decomposition on the generated energy data in the step 3 to obtain a training set and a test set;
step 6: training the generated energy prediction model by using a training set, and performing model hyper-parameter optimization by using an improved discrete differential evolution algorithm to obtain optimal model parameters;
and 7: testing the generated energy prediction model based on the long-term and short-term memory network by using a test set;
and 8: and adopting a power generation amount prediction model based on a long-term and short-term memory network to predict the power generation amount of the cascade hydropower station.
The invention utilizes an integrated empirical mode to decompose EEMD, improves a discrete differential evolution algorithm MDDE, and records a generated energy prediction model established based on a long-term and short-term memory network LSTM as MDDE-EEMD-LSTM.
In step 3, converting the multivariate time sequence into supervised learning data, wherein the sequence dimension is N _ features, the length is l, and X _ time is the length of a time window and corresponds to sample data; y _ timestamps is the length of the foreseeable period and corresponds to the data tag data. And sequentially performing sliding segmentation on the sequence, and taking out the sequence with the length of (X _ timing + Y _ timing) to obtain a plurality of subsequences, wherein N _ samples is the number of the obtained subsequences. As shown in fig. 3, taking the time sequence with N _ features being 2 and l being 10 as an example, taking X _ times being 6 and Y _ times being 2, that is, using the sequence data of the historical 6 time intervals to predict the data of the future 2 time intervals, for each feature, the whole time sequence can generate N _ samples being 3 new subsequences. Time series of arbitrary dimensions and lengths can be handled in this way.
In step 5, performing integrated empirical mode decomposition on the generated energy data to obtain a training set, wherein the process is shown in fig. 2 and includes:
1) importing a data set X with the capacity of N and the standard deviation of std;
2) initializing the ratio Nstd of the additive noise to the standard deviation and the integration quantity NE;
3) taking the logarithm of N, rounding, and then subtracting 1 to obtain the number M of IMF components;
4) adding noise to the sequence, wherein X + rand Nstd, rand is a random number, and rand is more than or equal to 0 and less than or equal to 1;
5) performing empirical mode decomposition to obtain an IMF component and a residual error, and entering a step 4 when the decomposition times are less than M);
6) and outputting the IMF component and the residual error.
The improved discrete differential evolution algorithm in step 6 improves the existing discrete differential evolution algorithm, which is disclosed in the paper "An ensemble of discrete evolution for solving the generated transformed discrete evolution protocol" published by Tasgetiren M F2010. The method is characterized in that a discrete differential evolution algorithm is improved to carry out unique hot coding on model parameters, then differential evolution is carried out, an individual cross strategy and a variation strategy are consistent with the differential evolution of the existing discrete differential evolution algorithm, and the difference lies in that normalization is carried out after population individual components are operated.
Carrying out model hyper-parameter optimization by using improved discrete differential evolution algorithm to obtain the optimumModel parameters: the method comprises the steps of carrying out unique hot coding on the hyper-parameter learning rate, iteration times, an optimizer, an activation function, a time window and the number of neurons in deep learning, carrying out differential evolution after the unique hot coding mode is as shown in figure 4, wherein an individual cross strategy and a variation strategy are consistent with those of the traditional differential evolution, and the difference lies in that normalization is carried out after the population individual components are operated. For example, the jth component of the ith individual in the populationIn other words, if the initial value is setAfter cross mutation operation becomeAt this time, the maximum value in the components is changed to 1, and the rest values are set to 0, namely the result after normalization isWhen a plurality of components are maximum values, randomly selecting one bit from the components to change the component into 1, and setting the rest components into 0; when all the components are 0, one of the components is set to 1 randomly. The process adopts a distributed parallel training scheme to train the model, and the flow is shown in fig. 5, and comprises the following steps: (1) the main node loads a cluster computing node list and initializes the population of an optimization algorithm; (2) and the main node broadcasts confirmation information to all the computing nodes according to the node list, and if the node does not respond, the node is removed from the node list. When the node list is empty, namely all the nodes cannot work, prompting a user to check node information and terminating the calculation; (3) if available nodes exist, the main node starts multithreading, balances the calculation tasks of the population to all the calculation nodes, then starts thread waiting, sets timeout time and waits for all the calculation nodes to return calculation results. When the calculation of a node is overtime, abandoning the calculation result of the node and removing the node; (4) when all the nodes are calculated, judging whether a termination condition is reached, and if the termination condition is not reached, judging whether the nodes are calculatedExecuting the step (3) by the new population and the optimal value; and if the termination condition is met, performing model training by using the optimal parameters, and outputting the trained model.
In the embodiment, a two-stage power station of a water budlock and a gazette in a Qingjiang step is used as an object to predict the power generation amount, and the selected time sequence is shown in table 1.
Table 1 step power station series selection statistical table
And analyzing the characteristics of the generated energy time series. To ensure the integrity of the time series, the missing values of the data set are first processed. And then drawing a historical power generation sequence diagram of the power station of the water budlock and the river-separated rock of the Qingjiang cascade, as shown in fig. 6 and 7. The daily full power generation amounts of the water buffet and the river-separated rock power station are known to be 4416 thousands KW.h and 2909 thousands KW.h respectively, the extreme value of a timing diagram is observed, the sequence values are in a reasonable range, and no abnormal point appears. Since zero value appears, the calculation of MAPE index is affected, and in order to ensure the sequence continuity, zero value point is not removed, but the value is processed as 1. The general characteristics of the observation sequence are easy to know that the power station has no obvious trend and periodicity in the process of generating energy for many years, and the generating energy time sequence is preliminarily judged to be a stable time sequence.
And analyzing the stationarity of the power generation sequence, and performing unit root test on the power generation time sequences of the waterflock and the river-separating rock, wherein the test results are shown in a table 2. The original hypothesis of the unit root test is that the test object has no unit root, and the p-values of the two power stations are both smaller than Critical Value (5%), so that the unit root is proved to exist, the original hypothesis is rejected, namely, the power generation amount time sequences of the two power stations are both stable time sequences, and a basis is provided for the next time sequence analysis.
Table 2. test results of unit roots of generated energy of hydropower station of buffy and gazette
The results of the autocorrelation and partial autocorrelation analysis of the generated energy time series are shown in fig. 8-1, 8-2, 8-3, and 8-4. Firstly, researching autocorrelation, wherein the images 8-1 and 8-3 are autocorrelation graphs of the generated energy time sequences of a water bufferblock power station and a river-separating rock power station respectively, a bell-shaped area is a 95% confidence interval, and correlation coefficients of the two power stations are respectively 60-order truncation and 50-order truncation; then, the partial autocorrelation of the generated energy time series is studied, and as can be seen from fig. 8-2 and 8-4, the partial autocorrelation graphs of the two power stations show a tailing trend. In summary, it can be determined that two plants can be modeled using a mobile autoregressive Model (MA), where a hydrobrak plant can be modeled using MA (60), and a gazette plant using model MA (50). The result of the correlation analysis is an applied scaling of the MA model.
And performing correlation analysis on the selected generating capacity influence factors comprehensively, calculating a Spearman correlation coefficient and a Pearson correlation coefficient respectively, and comparing the results. The known correlation coefficient has a value range of [ -1,1], which represents the strength of correlation, wherein a value greater than 0 represents positive correlation, a value less than 0 represents negative correlation, and a value equal to 0 represents no correlation. As shown in fig. 9-1 and fig. 9-2, in the two methods, the correlation degree between the generated energy of the water bufferblock power station and the ex-warehouse flow is the highest and reaches 0.92, and the correlation degree of the in-warehouse runoff is ranked secondly, and the preliminary analysis is that the water bufferblock reservoir has stronger regulating capacity. From the results of the Spearman correlation coefficient and the Pearson correlation coefficient, common factors with high correlation degrees are ex-warehouse flow and warehouse flow, however, the further analysis finds that the ex-warehouse flow and the warehouse flow have strong correlation, and in order to avoid introducing redundancy factors to influence the generalization performance of the model, only the ex-warehouse flow is selected as an influence factor of the water distribution puerperium station; for the river isolation rock power station, as shown in fig. 10-1 and fig. 10-2, both the Spearman correlation coefficient and Pearson correlation coefficient results show that the correlation degree between the warehousing flow and the power generation amount is highest, so that the warehousing flow is selected as a key factor influencing the power generation amount of the river isolation rock power station.
In the embodiment, a main structure of the power generation amount prediction model is built by stacking an LSTM layer and a full connection layer, the LSTM structure is shown in fig. 17, the MDDE algorithm is used for carrying out hyper-parameter optimization on each layer, and the parameter optimization result is shown in table 3.
TABLE 3 table of results of hyper-parametric optimization
In order to verify the performance of the power generation prediction model, single-step prediction is firstly carried out. Firstly, an original generating capacity data signal is decomposed by adopting an EEMD algorithm, taking a water buffet power station as an example, 4047-day generating capacity data, the ratio of additive noise to the standard deviation of the generating capacity is 0.2, and the value of the EEMD aggregate number is 70. The Original power generation sequence Original is decomposed into a Residual error and 10 eigenmode functions IMF through decomposition. As shown in fig. 11, it can be observed that the EEMD algorithm decomposes the original signal, and gradually decomposes the signals of different time scales. Taking the former 3600 days as a training set and the later 417 days as a testing set, respectively adopting MDDE-EEMD-LSTM, SVR, LR and RF models to carry out training and testing, wherein the index results are shown in Table 4, in the table, MAE represents mean absolute error, MAPE represents mean absolute percentage error, RMSE represents root mean square error, R represents R mean absolute percentage error, and R represents weight average absolute percentage error2The method represents and determines coefficients, the model of the invention is optimal in all indexes under the condition of single-station single-step prediction, the fitting effect of the existing LSTM neural network is better, but a certain difference is still formed between the fitting effect and the optimal value, and the improvement measure provided by the invention improves the fitting precision of the LSTM model.
In order to visually display the prediction effect, a comparison graph of 100 points of the power generation amount prediction of each model is drawn, as shown in fig. 12 to 16. Compared with the existing LSTM model, the generated energy prediction model disclosed by the invention well tracks the generated energy trend and is higher in fitting precision. The LSTM model does not learn enough data characteristics, only the power generation amount of the past day is taken as the current power generation amount, and the LSTM prediction curve is equivalent to translating the actual measurement curve by one step along with time; for models such as LR, RF and SVR, only the overall change trend of data is learned, and the predicted value has larger oscillation amplitude near the actual value and has no referential property; for the MA (60) model, the trend of the predicted value is gradually smooth along with the prolonging of time, the influence of the addition of new data on the average value is not large, and the prediction cannot be realized.
TABLE 4 comparison of the Performance of each model for single step prediction
To further evaluate the effectiveness of the MDDE-EEMD-LSTM model, 3-step and 7-step predictions were made and compared to the basic LSTM model. As shown in tables 5 and 6, each evaluation index of each model decreased with the extension of the prediction period, and the fluctuation width varied from model to model. When the prediction step length is increased, the prediction accuracy is reduced, but the model still has higher accuracy compared with other conventional methods, and can provide reference for engineering practice.
TABLE 5 prediction 3-step comparison of model performance
TABLE 6 prediction 7-step comparison table of model performance
And further researching a step power generation amount integral prediction mode, taking the power generation amounts of the water bufferas and the river rocks and the flow of entering and exiting the reservoir as input, and performing power generation amount prediction by adopting an MDDE-EEMD-LSTM model. The prediction is continued for 200 days and various indexes are calculated and compared with the single-station prediction. As shown in table 7, the prediction in steps is slightly less accurate than the single-stop prediction. When the power generation amount of the cascade hydropower station is predicted, the prediction is carried out in a single-station prediction mode, and then the prediction result of the cascade power generation amount is obtained through accumulation, so that the prediction precision can be improved.
TABLE 7 comparison table of single-stop prediction and cascade prediction
Claims (7)
1. The method for predicting the generating capacity of the cascade hydropower station based on the long-term and short-term memory network is characterized by comprising the following steps,
step 1: preprocessing data of the characteristic engineering, and carrying out stability inspection on a hydropower station generating capacity time sequence;
step 2: performing correlation test on the generated energy time sequence, and selecting a generated energy influence factor with high correlation;
and step 3: converting the generated energy time series data into supervised learning data;
and 4, step 4: establishing a generating capacity prediction model based on a long-term and short-term memory network;
and 5: performing integrated empirical mode decomposition on the generated energy data in the step 3 to obtain a training set and a test set;
step 6: training the generated energy prediction model by using a training set, and performing model hyper-parameter optimization by using an improved discrete differential evolution algorithm to obtain optimal model parameters;
and 7: testing the generated energy prediction model based on the long-term and short-term memory network by using a test set;
and 8: and adopting a power generation amount prediction model based on a long-term and short-term memory network to predict the power generation amount of the cascade hydropower station.
2. The method for predicting the power generation capacity of the cascade hydropower station based on the long-short term memory network according to claim 1, wherein in the step 1, the stationarity test is carried out on the time series of the power generation capacity of the hydropower station by adopting a unit root test method.
3. The long-short term memory network-based cascade hydropower station electric energy generation amount prediction method as claimed in claim 1, wherein in the step 2, correlation test is carried out on the electric energy generation amount time series, and comprises calculation of Pearson correlation coefficient and Spearman correlation coefficient.
4. The method for predicting the electric energy production of the cascade hydropower station based on the long-short term memory network as claimed in claim 1, wherein in the step 3, the electric energy production is time-sequencedConverting the data into supervised learning data, wherein the dimension of the time series data is N _ features, and the length of the time series is N _ featureslX _ timepieces is the length of a time window and corresponds to sample data; y _ timestamps is the length of a forecast period and corresponds to data label data; sequentially performing sliding segmentation on the generated energy time sequence, taking out the sequence with the length of X _ times + Y _ times each time to obtain a plurality of subsequences, wherein the number of the subsequences is N _ samples =l-X_timesteps-Y_timesteps+1。
5. The method for predicting the electric energy production of the cascade hydropower station based on the long-short term memory network according to claim 1, wherein in the step 5, the integrated empirical mode decomposition is performed on the electric energy production data in the step 3, and specifically comprises the following steps:
1) importing a data set X, wherein the capacity of the data set is N, and the standard deviation is std;
2) initializing the ratio Nstd of the additive noise to the standard deviation and the integration quantity NE;
3) rounding the logarithm of N, and then subtracting 1 to obtain the number M of IMF components;
4) adding noise to the sequence;
5) performing empirical mode decomposition to obtain an IMF component and a residual error, and entering a step 4 when the decomposition times are less than M);
6) and outputting the IMF component and the residual error.
6. The method for predicting the power generation amount of the cascade hydropower station based on the long and short term memory network according to claim 1, wherein in the step 6, the improved discrete differential evolution algorithm is adopted to carry out model hyper-parameter optimization, and the hyper-parameter learning rate, the iteration times, the optimizer, the activation function, the time window and the number of neurons in deep learning are subjected to one-hot coding and then differential evolution is carried out.
7. The method for predicting the power generation capacity of the cascade hydropower station based on the long-short term memory network according to any one of claims 1-6, wherein the process of optimizing the model hyper-parameters by adopting the improved discrete differential evolution algorithm utilizes a distributed parallel computing method, and the distributed parallel computing method comprises the following steps:
(1) the main node loads a cluster computing node list and initializes the population of an optimization algorithm;
(2) the main node broadcasts confirmation information to all the computing nodes according to the node list, and if the node does not respond, the node is removed from the node list; when the node list is empty, namely all the nodes cannot work, prompting a user to check node information and terminating the calculation;
(3) if the available nodes exist, the main node starts multithreading, balances the calculation tasks of the population to all the calculation nodes, then starts thread waiting, sets timeout time, and waits for all the calculation nodes to return calculation results; when the calculation of a node is overtime, abandoning the calculation result of the node and removing the node;
(4) when all the nodes are calculated, judging whether a termination condition is reached, if the termination condition is not reached, updating the population and the optimal value, and executing the step (3); and if the termination condition is met, performing model training by using the optimal parameters, and outputting the trained model.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911276332.1A CN111027775A (en) | 2019-12-12 | 2019-12-12 | Step hydropower station generating capacity prediction method based on long-term and short-term memory network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911276332.1A CN111027775A (en) | 2019-12-12 | 2019-12-12 | Step hydropower station generating capacity prediction method based on long-term and short-term memory network |
Publications (1)
Publication Number | Publication Date |
---|---|
CN111027775A true CN111027775A (en) | 2020-04-17 |
Family
ID=70206431
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911276332.1A Pending CN111027775A (en) | 2019-12-12 | 2019-12-12 | Step hydropower station generating capacity prediction method based on long-term and short-term memory network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111027775A (en) |
Cited By (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111695290A (en) * | 2020-05-14 | 2020-09-22 | 天津大学 | Short-term runoff intelligent forecasting hybrid model method suitable for variable environment |
CN112116130A (en) * | 2020-08-24 | 2020-12-22 | 长江水利委员会长江科学院 | Short-term scheduling rule extraction method based on variable structure deep learning framework |
CN112308434A (en) * | 2020-11-03 | 2021-02-02 | 中国交通通信信息中心 | Traffic safety risk assessment method and system |
CN112329997A (en) * | 2020-10-26 | 2021-02-05 | 国网河北省电力有限公司雄安新区供电公司 | Power demand load prediction method and system, electronic device, and storage medium |
CN112488404A (en) * | 2020-12-07 | 2021-03-12 | 广西电网有限责任公司电力科学研究院 | Multithreading efficient prediction method and system for large-scale power load of power distribution network |
CN113095642A (en) * | 2021-03-30 | 2021-07-09 | 中国电力科学研究院有限公司 | Method, system, equipment and storage medium for planning cascade hydropower |
CN113359573A (en) * | 2021-06-30 | 2021-09-07 | 华能澜沧江水电股份有限公司 | Intelligent dam safety prediction method and device |
CN113962456A (en) * | 2021-10-19 | 2022-01-21 | 江苏方天电力技术有限公司 | Medium-and-long-term load prediction method considering industry relevance |
CN114065878A (en) * | 2022-01-17 | 2022-02-18 | 国网山东省电力公司泰安供电公司 | Electric quantity missing value filling method based on multi-parameter Internet of things fusion technology |
CN116542392A (en) * | 2023-05-24 | 2023-08-04 | 珠江水利委员会珠江水利科学研究院 | Real-time forecasting method and system for tidal range water level and readable storage medium |
WO2024087670A1 (en) * | 2022-10-26 | 2024-05-02 | 广东电网有限责任公司 | Silicone rubber material performance prediction method and apparatus based on lstm neural network |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110135637A (en) * | 2019-05-13 | 2019-08-16 | 武汉科技大学 | Micro-capacitance sensor short-term load forecasting method based on shot and long term memory and adaptive boosting |
CN110163433A (en) * | 2019-05-21 | 2019-08-23 | 上海海事大学 | A kind of ship method for predicting |
-
2019
- 2019-12-12 CN CN201911276332.1A patent/CN111027775A/en active Pending
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110135637A (en) * | 2019-05-13 | 2019-08-16 | 武汉科技大学 | Micro-capacitance sensor short-term load forecasting method based on shot and long term memory and adaptive boosting |
CN110163433A (en) * | 2019-05-21 | 2019-08-23 | 上海海事大学 | A kind of ship method for predicting |
Non-Patent Citations (1)
Title |
---|
舒生茂: "基于深度学习的梯级水电站发电调度及系统设计研究" * |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111695290B (en) * | 2020-05-14 | 2024-04-09 | 天津大学 | Short-term runoff intelligent forecasting mixed model method suitable for changing environment |
CN111695290A (en) * | 2020-05-14 | 2020-09-22 | 天津大学 | Short-term runoff intelligent forecasting hybrid model method suitable for variable environment |
CN112116130A (en) * | 2020-08-24 | 2020-12-22 | 长江水利委员会长江科学院 | Short-term scheduling rule extraction method based on variable structure deep learning framework |
CN112329997A (en) * | 2020-10-26 | 2021-02-05 | 国网河北省电力有限公司雄安新区供电公司 | Power demand load prediction method and system, electronic device, and storage medium |
CN112308434A (en) * | 2020-11-03 | 2021-02-02 | 中国交通通信信息中心 | Traffic safety risk assessment method and system |
CN112488404A (en) * | 2020-12-07 | 2021-03-12 | 广西电网有限责任公司电力科学研究院 | Multithreading efficient prediction method and system for large-scale power load of power distribution network |
CN113095642A (en) * | 2021-03-30 | 2021-07-09 | 中国电力科学研究院有限公司 | Method, system, equipment and storage medium for planning cascade hydropower |
CN113359573A (en) * | 2021-06-30 | 2021-09-07 | 华能澜沧江水电股份有限公司 | Intelligent dam safety prediction method and device |
CN113962456A (en) * | 2021-10-19 | 2022-01-21 | 江苏方天电力技术有限公司 | Medium-and-long-term load prediction method considering industry relevance |
CN114065878A (en) * | 2022-01-17 | 2022-02-18 | 国网山东省电力公司泰安供电公司 | Electric quantity missing value filling method based on multi-parameter Internet of things fusion technology |
WO2024087670A1 (en) * | 2022-10-26 | 2024-05-02 | 广东电网有限责任公司 | Silicone rubber material performance prediction method and apparatus based on lstm neural network |
CN116542392A (en) * | 2023-05-24 | 2023-08-04 | 珠江水利委员会珠江水利科学研究院 | Real-time forecasting method and system for tidal range water level and readable storage medium |
CN116542392B (en) * | 2023-05-24 | 2024-02-13 | 珠江水利委员会珠江水利科学研究院 | Real-time forecasting method and system for tidal range water level and readable storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN111027775A (en) | Step hydropower station generating capacity prediction method based on long-term and short-term memory network | |
Lin et al. | Short-term power prediction for photovoltaic power plants using a hybrid improved Kmeans-GRA-Elman model based on multivariate meteorological factors and historical power datasets | |
Miranda et al. | One-hour-ahead wind speed prediction using a Bayesian methodology | |
CN107194495B (en) | Photovoltaic power longitudinal prediction method based on historical data mining | |
CN109558975B (en) | Integration method for multiple prediction results of power load probability density | |
CN110717610B (en) | Wind power prediction method based on data mining | |
CN105631558A (en) | BP neural network photovoltaic power generation system power prediction method based on similar day | |
CN106295899B (en) | Wind power probability density Forecasting Methodology based on genetic algorithm Yu supporting vector quantile estimate | |
CN112100911B (en) | Solar radiation prediction method based on depth BILSTM | |
CN110942194A (en) | Wind power prediction error interval evaluation method based on TCN | |
CN110443417A (en) | Multi-model integrated load prediction method based on wavelet transformation | |
CN110766200A (en) | Method for predicting generating power of wind turbine generator based on K-means mean clustering | |
CN110309603A (en) | A kind of short-term wind speed forecasting method and system based on wind speed characteristics | |
CN114757427B (en) | Autoregressive-corrected LSTM intelligent wind power plant ultra-short-term power prediction method | |
CN113141008A (en) | Data-driven power distribution network distributed new energy consumption capacity assessment method | |
CN111626473A (en) | Two-stage photovoltaic power prediction method considering error correction | |
CN115860177A (en) | Photovoltaic power generation power prediction method based on combined machine learning model and application thereof | |
CN105426989A (en) | EEMD and combined kernel RVM-based photovoltaic power short-term prediction method | |
CN113112085A (en) | New energy station power generation load prediction method based on BP neural network | |
CN111242353A (en) | Wind power combined prediction modeling and prediction method | |
CN117114160A (en) | Short-term photovoltaic power prediction method | |
CN115511657A (en) | Wind power output and photovoltaic output evaluation method based on combined prediction model | |
CN113761023A (en) | Photovoltaic power generation short-term power prediction method based on improved generalized neural network | |
CN112651576A (en) | Long-term wind power prediction method and device | |
CN117132132A (en) | Photovoltaic power generation power prediction method based on meteorological data |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20200417 |