CN111259340B - Saturation load prediction method based on logistic regression - Google Patents
Saturation load prediction method based on logistic regression Download PDFInfo
- Publication number
- CN111259340B CN111259340B CN202010048425.5A CN202010048425A CN111259340B CN 111259340 B CN111259340 B CN 111259340B CN 202010048425 A CN202010048425 A CN 202010048425A CN 111259340 B CN111259340 B CN 111259340B
- Authority
- CN
- China
- Prior art keywords
- load
- model
- logistic regression
- parameter
- parameters
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000007477 logistic regression Methods 0.000 title claims abstract description 41
- 238000000034 method Methods 0.000 title claims abstract description 31
- 229920006395 saturated elastomer Polymers 0.000 claims abstract description 8
- 239000011159 matrix material Substances 0.000 claims description 13
- 230000009191 jumping Effects 0.000 claims description 6
- 238000009795 derivation Methods 0.000 claims description 4
- 238000007689 inspection Methods 0.000 claims description 3
- 238000006467 substitution reaction Methods 0.000 claims description 2
- 238000006243 chemical reaction Methods 0.000 abstract 1
- 238000003062 neural network model Methods 0.000 description 2
- 230000002159 abnormal effect Effects 0.000 description 1
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000009738 saturating Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/10—Complex mathematical operations
- G06F17/18—Complex mathematical operations for evaluating statistical data, e.g. average values, frequency distributions, probability functions, regression analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/04—Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06Q—INFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q50/00—Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
- G06Q50/06—Energy or water supply
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y02—TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
- Y02E—REDUCTION OF GREENHOUSE GAS [GHG] EMISSIONS, RELATED TO ENERGY GENERATION, TRANSMISSION OR DISTRIBUTION
- Y02E40/00—Technologies for an efficient electrical power generation, transmission or distribution
- Y02E40/70—Smart grids as climate change mitigation technology in the energy generation sector
-
- Y—GENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
- Y04—INFORMATION OR COMMUNICATION TECHNOLOGIES HAVING AN IMPACT ON OTHER TECHNOLOGY AREAS
- Y04S—SYSTEMS INTEGRATING TECHNOLOGIES RELATED TO POWER NETWORK OPERATION, COMMUNICATION OR INFORMATION TECHNOLOGIES FOR IMPROVING THE ELECTRICAL POWER GENERATION, TRANSMISSION, DISTRIBUTION, MANAGEMENT OR USAGE, i.e. SMART GRIDS
- Y04S10/00—Systems supporting electrical power generation, transmission or distribution
- Y04S10/50—Systems or methods supporting the power network operation or management, involving a certain degree of interaction with the load-side end user applications
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Business, Economics & Management (AREA)
- General Physics & Mathematics (AREA)
- Economics (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Human Resources & Organizations (AREA)
- Strategic Management (AREA)
- Mathematical Physics (AREA)
- Computational Mathematics (AREA)
- Tourism & Hospitality (AREA)
- Operations Research (AREA)
- General Business, Economics & Management (AREA)
- Marketing (AREA)
- Health & Medical Sciences (AREA)
- Pure & Applied Mathematics (AREA)
- Mathematical Optimization (AREA)
- Mathematical Analysis (AREA)
- Life Sciences & Earth Sciences (AREA)
- Databases & Information Systems (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Bioinformatics & Cheminformatics (AREA)
- Probability & Statistics with Applications (AREA)
- Quality & Reliability (AREA)
- Algebra (AREA)
- Development Economics (AREA)
- Game Theory and Decision Science (AREA)
- Software Systems (AREA)
- General Engineering & Computer Science (AREA)
- Entrepreneurship & Innovation (AREA)
- Public Health (AREA)
- Water Supply & Treatment (AREA)
- General Health & Medical Sciences (AREA)
- Primary Health Care (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
A saturated load prediction method based on logistic regression utilizes historical load data to estimate model parameters to obtain a logistic regression load prediction model through parameter conversion; simplifying a logistic regression model; obtaining a probability model of the simplified model if Gaussian white noise is adopted; the obtained probability model and the Neyman-Fisher factorization theorem are utilized to obtain the full statistics of one parameter of the model; using the sufficient statistics of the parameter, the values of the other two parameters can be obtained using the least squares method assuming that the general solution is known; the parameter values meet the constraint conditions within a certain value range, and the parameters which obviously do not accord with the constraint formula are ignored to obtain a proper parameter range; and taking the parameter value of the optimal error value as a model parameter for all the parameters, and obtaining a final logistic model. The invention has better model prediction precision and smaller error for the existing data.
Description
Technical Field
The invention relates to a power load prediction method. In particular to a saturated load prediction method based on logistic regression, which utilizes historical load data to estimate model parameters to obtain a logistic regression load prediction model.
Background
Load prediction is one of important works of the electric power department, and accurate load prediction can bring high social benefits and economic benefits. Meanwhile, load prediction is the basis of power grid planning, and the utility and efficiency of a power grid planning project are directly affected. The load prediction has the advantages of large data information quantity, multiple uncertain factors, wide related fields, and great significance in improving the quality and speed of power distribution network planning, and can realize the rapid and accurate prediction of the load. Accurate load prediction has a great guiding effect on planning projects of government power grids.
The main method of load prediction is grey Verhulst prediction or using neural networks to implement load prediction. Compared with a neural network model, the model of logistic regression does not need a large amount of historical data, and compared with the Verhulst model, the position with higher prediction accuracy of logistic is not limited to the previous M predicted value in the whole prediction sequence. In the existing article based on logistic regression load prediction, a 3-point or 4-point method is adopted to obtain the parameters of the logistic curve, however, the method cannot avoid abnormal or erroneous data, and therefore, has room for improvement. From the theoretical and demonstration researches of domestic and foreign scholars for many years, the logistic model has very reliable identification, prediction and popularization capabilities, is simple in model form, has only three parameters in logistic equation, has very wide application fields, and can be applied to load prediction.
Disclosure of Invention
The invention aims to solve the technical problem of providing a saturated load prediction method based on logistic regression, which has better model prediction precision and smaller error for the existing data.
The technical scheme adopted by the invention is as follows: a saturation load prediction method based on logistic regression comprises the following steps:
1) Load data of a power grid are collected, and an improved logistic regression model and constraint conditions are given;
2) Determining the range of a parameter c in the logistic regression model according to the improved logistic regression model and the constraint condition;
3) Substituting each obtained c into the general solution t=m + X+ct 1 Then utilize the least square formula Obtaining parameters a and b in a logistic regression model, and substituting the obtained results into a formula Obtaining a parameter m in a logistic regression model;
wherein T is a matrix of Nx1,r is the load increasing speed, t is the predicted year vector, m=1/k, k is the regional maximum saturated load, x i The reciprocal of the load y in the ith year, N is the total year of the data;
and c meets the constraint condition of the improved logistic regression model within a certain value range, and the parameter ranges of m, a and b are obtained.
4) Judging whether the calculated results m, a and b meet the constraint condition m>0,a>0,b<0, if the direct update c is not satisfied, jumping to the step 3); if yes, calculating a combination form of the model inspection indexesThen updating c, and jumping to the step 3);
wherein, C is posterior difference test:wherein S is 1 Is the standard deviation of historical load data, S 2 Standard deviation of the load error sequence; q is the relative residual:Wherein->Error->In (1) the->As load predicted values, x (i) is an actual load value; p is the precision:
5) Comparing all S meeting constraint conditions, substituting m, a, b=argmin S corresponding to the minimum S into the improved logistic regression modelAnd the method is used for predicting the maximum saturation load of the regional power grid.
According to the saturation load prediction method based on logistic regression, a logistic probability model is considered, and parameters of the model are obtained according to a Neyman-Fisher factorization theorem and historical load data estimation. The advantages are that:
1. the method can estimate the parameters of the logistic regression model according to the characteristics of the model and the historical data, and compared with a neural network model, the method does not need a large amount of historical data, and compared with a Verhulst model, the predicted value with higher prediction precision in the method is not limited to the first predicted values in the whole predicted sequence.
2. Compared with a gray Verhulst model, the method provided by the invention has better model prediction precision and smaller error for the existing data.
Drawings
FIG. 1 is a flow chart of a method of saturating load prediction based on logistic regression of the present invention;
FIG. 2a is a plot of the S-value parameter value for each iteration;
FIG. 2b is a plot of the parameter m for each iteration as a function of the number of iterations;
FIG. 2c is a plot of the parameter a for each iteration as a function of the number of iterations;
FIG. 2d is a plot of each iteration parameter b as a function of the number of iterations;
FIG. 3 is a predictive comparison graph;
fig. 4 is a graph of the prediction results of the present invention.
Detailed Description
The following describes a method for predicting saturation load based on logistic regression in detail with reference to examples and drawings.
The invention discloses a saturation load prediction method based on logistic regression, which comprises the following steps:
1) Load data of a power grid are collected, and an improved logistic regression model and constraint conditions are given; wherein,,
the improved logistic regression model is as follows:
wherein y is the load amount of the corresponding year; m=1/k, k being the regional maximum saturation load;r is the load increasing speed of the load, y 0 Is t 0 Annual load capacity; b= -r;
the constraint conditions are as follows: m >0, a >0, b <0.
2) Determining the range of a parameter c in the logistic regression model according to the improved logistic regression model and the constraint condition; comprises calculating abs (min (M + X)), initializing c=abs (min (M) + X))+δ,
Wherein M is + Is a pseudo-inverse of M, M is a matrix of NxN, X is a matrix of Nx1, δ is an iteration step, and the ratio abs (min (M + X)) are two orders of magnitude smaller, wherein:
3) Substituting each obtained c into the general solution t=m + X+ct 1 Then utilize the least square formula Obtaining parameters a and b in a logistic regression model, and substituting the obtained results into a formula Obtaining a parameter m in a logistic regression model;
according to the general model x=m+αt b +η, where η is noise subject to a mean of 0 and variance of δ 2 The probability model for obtaining x is:
the sufficient statistics of m using the factorization theorem are:
the general solution t=m + X+ct 1 The derivation process of (2) is as follows:
order the
Namely: mt=x, wherein M + As pseudo-inverse matrix, M + X is a solution, ct 1 To go through, M + Is the pseudo-inverse of M, M is a matrix of NxN, and X is a matrix of Nx 1.
Let mt=0 to get t 1 Values of (2)
wherein T is a matrix of Nx1,r is the load increasing speed, t is the predicted year vector, m=1/k, k is the regional maximum saturated load, x i The reciprocal of the load y in the ith year, N is the total year of the data;
and c meets the constraint condition of the improved logistic regression model within a certain value range, and the parameter ranges of m, a and b are obtained.
4) Judging whether the calculated results m, a and b meet the constraint condition m>0,a>0,b<0, if the direct update c is not satisfied, jumping to the step 3); if yes, calculating a combination form of the model inspection indexesThen updating c, and jumping to the step 3);
wherein, C is posterior difference test:wherein S is 1 Is the standard deviation of historical load data, S 2 Standard deviation of the load error sequence; q is the relative residual:Wherein->Error->In (1) the->As load predicted values, x (i) is an actual load value; p is the precision:
5) Comparing all S meeting constraint conditions, substituting m, a, b=argmin S corresponding to the minimum S into the improved logistic regression modelAnd the method is used for predicting the maximum saturation load of the regional power grid.
Examples are given below:
and step 1, collecting load data of a power grid, and determining the range of c according to constraint conditions and actual conditions.
The load data used was [430.40,454.26,482.94,511.20,559.42,598.99,655.71,745.97,821.44,911.97,980.15,1072.38,1138.22,1153.38,1295.87 ]]The unit is (100 GW x h), the data source is the historical data of the power consumption requirement in the area governed by a certain power grid, and c is initialized>abs(min(M + X)), c is initialized to a value of 6.77×10 -4 Delta is 1.0X10 -6 The number of iterations n=700.
Step 2: and further determining the range of the parameters m, a and b according to the obtained c value and obtaining the optimal parameter value.
Substituting the value of c into the formula from small to large in sequence: t=m + X+ct 1 Wherein M is + X is a special solution and can be directly substituted into data to be obtained. And according toAnd +.>The value of a, b can be obtained from the value of c by the least square method. By the formula->The value of m at this value of c can be obtained. Updating the value of c, c=c+δ. The number of iterations of each corresponds to m, a, b, S as shown in fig. 2a, 2b, 2c, 2 d.
Step 3: substituting the obtained m, a and b into an S formula, and substituting the optimal parameter value corresponding to S into a logistic load prediction model.
FIG. 3 is a graph of the method of the present invention in comparison to GM (1, 1) and the grey Verhulst model predictions.
By knowing the load of the year before the high speed increase period, the load result after prediction by the complete prediction model is shown in fig. 4.
Claims (3)
1. A saturation load prediction method based on logistic regression is characterized by comprising the following steps:
1) Load data of a power grid are collected, and an improved logistic regression model and constraint conditions are given; wherein,,
the improved logistic regression model is as follows:
wherein y is the load amount of the corresponding year; m=1/k, k being the regional maximum saturation load;
r is the load increasing speed of the load, y 0 Is t 0 Annual load capacity; b= -r;
the constraint conditions are as follows: m >0, a >0, b <0;
2) Determining the range of a parameter c in the logistic regression model according to the improved logistic regression model and the constraint condition; comprises calculating abs (min (M + X)), initializing c=abs (min (M) + X))+δ,
Wherein M is + Is a pseudo-inverse of M, M is a matrix of N X N, X is a matrix of N X1, δ is an iteration step, and the ratio abs (min (M + X)) are two orders of magnitude smaller, wherein:
3) Substituting each obtained c into the general solution t=m + X+ct 1 Then utilize the least square formula Obtaining parameters a and b in a logistic regression model, and substituting the obtained results into a formula Obtaining a parameter m in a logistic regression model;
wherein T is a matrix of Nx1,r is the load increasing speed, t is the predicted year vector, m=1/k, k is the regional maximum saturated load, x i The reciprocal of the load y in the ith year, N is the total year of the data;
4) Judging whether the calculated results m, a and b meet the constraint condition m>0,a>0,b<0, if the direct update c is not satisfied, jumping to the step 3); if yes, calculating a combination form of the model inspection indexesThen updating c, and jumping to the step 3);
wherein, C is posterior difference test:wherein S is 1 Is the standard deviation of historical load data, S 2 Standard deviation of the load error sequence; q is the relative residual:Wherein->Error->In (1) the->As load predicted values, x (i) is an actual load value; p is the precision:
2. The method for predicting saturated loads based on logistic regression according to claim 1, wherein the formula in step 3) isThe derivation is as follows:
according to the general model x=m+αt b +η, where η is noise subject to a mean of 0 and variance of δ 2 The probability model for obtaining x is:
the sufficient statistics of m using the factorization theorem are:
3. the method for predicting saturated loads based on logistic regression according to claim 1, wherein the general solution t=m in step 3) + X+ct 1 The derivation process of (2) is as follows:
order the
Namely: mt=x, wherein M + As pseudo-inverse matrix, M + X is a solution, ct 1 To go through, M + Is the pseudo-inverse of M, M is a matrix of N X N, X is a matrix of N X1;
let mt=0 to get t 1 Values of (2)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010048425.5A CN111259340B (en) | 2020-01-16 | 2020-01-16 | Saturation load prediction method based on logistic regression |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202010048425.5A CN111259340B (en) | 2020-01-16 | 2020-01-16 | Saturation load prediction method based on logistic regression |
Publications (2)
Publication Number | Publication Date |
---|---|
CN111259340A CN111259340A (en) | 2020-06-09 |
CN111259340B true CN111259340B (en) | 2023-04-28 |
Family
ID=70948836
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202010048425.5A Active CN111259340B (en) | 2020-01-16 | 2020-01-16 | Saturation load prediction method based on logistic regression |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN111259340B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113435653B (en) * | 2021-07-02 | 2022-11-04 | 国网新疆电力有限公司经济技术研究院 | Method and system for predicting saturated power consumption based on logistic model |
CN117290610B (en) * | 2023-11-24 | 2024-02-09 | 苏州峰学蔚来教育科技有限公司 | University recruitment information recommendation method and system |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110135635A (en) * | 2019-04-29 | 2019-08-16 | 国网山东省电力公司经济技术研究院 | A kind of region electric power saturation load forecasting method and system |
CN110633846A (en) * | 2019-09-02 | 2019-12-31 | 北京市燃气集团有限责任公司 | Gas load prediction method and device |
-
2020
- 2020-01-16 CN CN202010048425.5A patent/CN111259340B/en active Active
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110135635A (en) * | 2019-04-29 | 2019-08-16 | 国网山东省电力公司经济技术研究院 | A kind of region electric power saturation load forecasting method and system |
CN110633846A (en) * | 2019-09-02 | 2019-12-31 | 北京市燃气集团有限责任公司 | Gas load prediction method and device |
Non-Patent Citations (3)
Title |
---|
吉兴全.饱和负荷预测中的多级聚类分析和改进Logistic模型.《电力系统及其自动化学报》.2017,28(8),全文. * |
尚芳屹 ; 李洁 ; .组合预测在饱和负荷预测中的应用.电力与能源.2017,(02),全文. * |
林勇 ; 邹品晶 ; 左郑敏 ; 欧阳旭 ; 朱向前 ; 姚建刚 ; .基于改进PSO算法的Logistic模型在饱和负荷预测中的应用.电力需求侧管理.2015,(05),全文. * |
Also Published As
Publication number | Publication date |
---|---|
CN111259340A (en) | 2020-06-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN108846517B (en) | Integration method for predicating quantile probabilistic short-term power load | |
Lei et al. | A proposed grey model for short-term electricity price forecasting in competitive power markets | |
CN108921339B (en) | Quantile regression-based photovoltaic power interval prediction method for genetic support vector machine | |
CN113572206A (en) | Wind power output interval prediction method | |
CN109919356B (en) | BP neural network-based interval water demand prediction method | |
CN110380444B (en) | Capacity planning method for distributed wind power orderly access to power grid under multiple scenes based on variable structure Copula | |
CN111259340B (en) | Saturation load prediction method based on logistic regression | |
CN107730097B (en) | Bus load prediction method and device and computing equipment | |
CN110298765B (en) | Power distribution network power consumption abnormality detection method based on objective correlation factors | |
CN103853939A (en) | Combined forecasting method for monthly load of power system based on social economic factor influence | |
CN106295877B (en) | Method for predicting electric energy consumption of smart power grid | |
CN111723982A (en) | Medium-and-long-term power load combined prediction method based on gray-Markov chain | |
CN109325880A (en) | A kind of Mid-long term load forecasting method based on Verhulst-SVM | |
CN115619028A (en) | Clustering algorithm fusion-based power load accurate prediction method | |
CN115545333A (en) | Method for predicting load curve of multi-load daily-type power distribution network | |
CN111652422A (en) | Heat supply system load prediction method, device and system based on building classification | |
CN116885703B (en) | Short-term wind-solar power prediction method for high-dimensional multi-element meteorological data fusion | |
CN109214610A (en) | A kind of saturation Methods of electric load forecasting based on shot and long term Memory Neural Networks | |
CN106600038A (en) | Load interval prediction method based on Markov model | |
CN116131255A (en) | Method and device for predicting future power generation capacity of power station based on time sequence conceptual drift | |
CN115907228A (en) | Short-term power load prediction analysis method based on PSO-LSSVM | |
CN115629576A (en) | Non-invasive flexible load aggregation characteristic identification and optimization method, device and equipment | |
CN112581311B (en) | Method and system for predicting long-term output fluctuation characteristics of aggregated multiple wind power plants | |
Lyu et al. | Multivariate-aided Power-consumption Prediction Based on LSTM-Kalman Filter | |
CN109190830B (en) | Energy demand prediction method based on empirical decomposition and combined prediction |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |