Nothing Special   »   [go: up one dir, main page]

CN113723717B - System day-ahead short-term load forecasting method, apparatus, device and readable storage medium - Google Patents

System day-ahead short-term load forecasting method, apparatus, device and readable storage medium Download PDF

Info

Publication number
CN113723717B
CN113723717B CN202111291156.6A CN202111291156A CN113723717B CN 113723717 B CN113723717 B CN 113723717B CN 202111291156 A CN202111291156 A CN 202111291156A CN 113723717 B CN113723717 B CN 113723717B
Authority
CN
China
Prior art keywords
predicted
regression model
load
historical data
training
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN202111291156.6A
Other languages
Chinese (zh)
Other versions
CN113723717A (en
Inventor
张笑晗
步允千
赵梓州
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qu Creative Technology Co ltd
Original Assignee
Beijing Qu Creative Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qu Creative Technology Co ltd filed Critical Beijing Qu Creative Technology Co ltd
Priority to CN202111291156.6A priority Critical patent/CN113723717B/en
Publication of CN113723717A publication Critical patent/CN113723717A/en
Application granted granted Critical
Publication of CN113723717B publication Critical patent/CN113723717B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F30/00Computer-aided design [CAD]
    • G06F30/20Design optimisation, verification or simulation
    • G06F30/27Design optimisation, verification or simulation using machine learning, e.g. artificial intelligence, neural networks, support vector machines [SVM] or training a model
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Evolutionary Computation (AREA)
  • Economics (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • General Engineering & Computer Science (AREA)
  • Artificial Intelligence (AREA)
  • Health & Medical Sciences (AREA)
  • Data Mining & Analysis (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Marketing (AREA)
  • Software Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Public Health (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Geometry (AREA)
  • Computer Hardware Design (AREA)
  • Medical Informatics (AREA)
  • Primary Health Care (AREA)
  • Evolutionary Biology (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Development Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • Water Supply & Treatment (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Biomedical Technology (AREA)
  • Biophysics (AREA)
  • Computational Linguistics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)

Abstract

本发明涉及一种系统日前短期负荷预测方法、装置、设备和可读存储介质,该方法包括:采集历史数据,并对历史数据进行预处理;利用预处理后的历史数据构建训练样本集;利用训练样本集对预先建立的XGBoost多目标回归模型进行训练,得到训练后的XGBoost多目标回归模型;生成预测样本特征;将预测样本特征输入至训练后的XGBoost多目标回归模型,得到预测的短期负荷。本申请提供的技术方案,不仅提高了模型训练、部署和预测的效率,还提高了短期负荷预测的精准度。

Figure 202111291156

The invention relates to a method, device, equipment and readable storage medium for short-term load forecasting in the day before a system. The method includes: collecting historical data, and preprocessing the historical data; using the preprocessed historical data to construct a training sample set; The training sample set is used to train the pre-established XGBoost multi-objective regression model, and the trained XGBoost multi-objective regression model is obtained; the predicted sample features are generated; the predicted sample features are input into the trained XGBoost multi-objective regression model to obtain the predicted short-term load . The technical solution provided by this application not only improves the efficiency of model training, deployment and prediction, but also improves the accuracy of short-term load prediction.

Figure 202111291156

Description

系统日前短期负荷预测方法、装置、设备和可读存储介质System day-ahead short-term load forecasting method, apparatus, device and readable storage medium

技术领域technical field

本发明属于能源监测设备技术领域,具体涉及一种系统日前短期负荷预测方法、装置、设备和可读存储介质。The invention belongs to the technical field of energy monitoring equipment, and in particular relates to a method, device, equipment and readable storage medium for short-term load forecasting in the day-ahead system.

背景技术Background technique

电能的生产、传输、分配和使用是同时进行的,由于电能不能够大量地存储,电力供给与需求必须保持实时平衡,电力调度需要根据未来的负荷需求提前制定机组的启停和电力设备的检修计划,因此准确的负荷预测对电网调度运行具有重要的意义,负荷预测水平直接影响电力系统的经济效益和社会效益。The production, transmission, distribution and use of electric energy are carried out at the same time. Since electric energy cannot be stored in a large amount, the power supply and demand must be balanced in real time. Power dispatching needs to formulate the start and stop of units and the maintenance of power equipment in advance according to the future load demand. Therefore, accurate load forecasting is of great significance to the dispatching operation of the power grid, and the level of load forecasting directly affects the economic and social benefits of the power system.

系统日前短期负荷预测一般是指提前对次日每15分钟一点共96个时刻的负荷需求进行预测,在机器学习领域来说属于典型的有监督回归类问题。传统的负荷预测通常会将问题建模拆解为对逐时刻的单一目标的回归问题,认为待预测时刻与历史日中同一时刻负荷值及气象因素具有相关关系,利用历史数据训练96个回归模型来实现对未来96个时刻的负荷值的预测。 这种方法往往忽略了负荷曲线在一个长时间范围上连续的特性,容易造成模型的欠拟合。同时要训练96个模型,需要对每一个模型分别进行特征工程和样本工程,耗时往往较大,且不利于使用大量数据训练复杂模型。The short-term load forecast of the system before the day generally refers to the forecast of the load demand at 96 times every 15 minutes in the next day in advance, which is a typical supervised regression problem in the field of machine learning. Traditional load forecasting usually disassembles the problem modeling into a regression problem of a single target by time. It is believed that the time to be predicted is related to the load value and meteorological factors at the same time in the historical day, and 96 regression models are trained using historical data. To realize the prediction of the load value of the next 96 moments. This method often ignores the continuous characteristics of the load curve in a long time range, which is easy to cause under-fitting of the model. To train 96 models at the same time, it is necessary to perform feature engineering and sample engineering for each model, which is often time-consuming and is not conducive to training complex models with a large amount of data.

发明内容SUMMARY OF THE INVENTION

有鉴于此,本发明的目的在于克服现有技术的不足,提供一种系统日前短期负荷预测方法、装置、设备和可读存储介质,以解决现有技术中模型欠拟合,以及训练96个模型耗时巨大,从而不利于使用大量数据训练复杂模型的问题。In view of this, the object of the present invention is to overcome the deficiencies in the prior art, and provide a method, device, equipment and readable storage medium for short-term load forecasting in the prior art, to solve the underfitting of models in the prior art, and to train 96 The model is time-consuming, which is not conducive to the problem of training complex models with large amounts of data.

根据本申请实施例的第一方面,提供一种系统日前短期负荷预测方法,所述方法包括:According to a first aspect of the embodiments of the present application, there is provided a method for predicting a system day-ahead short-term load, the method comprising:

采集历史数据,并对所述历史数据进行预处理;collecting historical data, and preprocessing the historical data;

利用预处理后的历史数据构建训练样本集;Use the preprocessed historical data to construct a training sample set;

利用所述训练样本集对预先建立的XGBoost多目标回归模型进行训练,得到训练后的XGBoost多目标回归模型;Use the training sample set to train the pre-established XGBoost multi-objective regression model to obtain the trained XGBoost multi-objective regression model;

生成预测样本特征;Generate predicted sample features;

将所述预测样本特征输入至所述训练后的XGBoost多目标回归模型,得到预测的短期负荷。The predicted sample features are input into the trained XGBoost multi-objective regression model to obtain the predicted short-term load.

进一步的,所述历史数据,包括:历史的短期负荷值、历史的气象值和历史的负荷变化量;Further, the historical data includes: historical short-term load value, historical meteorological value and historical load variation;

所述历史的气象值中的气象指标包括:温度、湿度、降雨量、风力和云量。The meteorological indicators in the historical meteorological values include: temperature, humidity, rainfall, wind and cloudiness.

进一步的,所述对所述历史数据进行预处理,包括:Further, the preprocessing of the historical data includes:

利用插值法填补所述历史数据中的缺失值,并对填补缺失值后的所述历史数据进行归一化处理,得到所述预处理后的历史数据。An interpolation method is used to fill in the missing values in the historical data, and the historical data after filling the missing values is normalized to obtain the preprocessed historical data.

进一步的,所述利用预处理后的历史数据构建训练样本集,包括:Further, the use of the preprocessed historical data to construct a training sample set includes:

将所述预处理后的历史数据的时刻划分为96个时刻点,利用该96个时刻点的预处理后的历史数据构建训练样本集。The time of the preprocessed historical data is divided into 96 time points, and a training sample set is constructed by using the preprocessed historical data of the 96 time points.

进一步的,所述利用所述训练样本集对预先建立的XGBoost多目标回归模型进行训练,得到训练后的XGBoost多目标回归模型,包括:Further, the described training sample set is used to train the pre-established XGBoost multi-objective regression model, and the trained XGBoost multi-objective regression model is obtained, including:

将所述训练样本集分为训练集和验证集;dividing the training sample set into a training set and a validation set;

利用所述训练集对所述预先建立的XGBoost多目标回归模型进行训练,直至当利用验证集对所述预先建立的XGBoost多目标回归模型进行验证时,得到的预测的历史负荷值与实际的历史负荷值的误差小于第一阈值时,训练结束,得到所述训练后的XGBoost多目标回归模型。Use the training set to train the pre-established XGBoost multi-objective regression model, until when the pre-established XGBoost multi-objective regression model is verified using the verification set, the predicted historical load value obtained is different from the actual historical load value. When the error of the load value is less than the first threshold, the training ends, and the trained XGBoost multi-objective regression model is obtained.

进一步的,所述预测样本特征,包括:Further, the predicted sample features include:

待预测时刻前k~k-M个时刻点的负荷值

Figure DEST_PATH_IMAGE001
、待预测时刻前k~k-M个时刻点的气象值
Figure 226029DEST_PATH_IMAGE002
、待预测时刻前k~k-M个时刻点的负荷变化量
Figure DEST_PATH_IMAGE003
、月份特征、日期特征、日期类型特征、待预测开始时刻所在小时特征和待预测开始时刻分钟特征;Load value at k~kM time points before the time to be predicted
Figure DEST_PATH_IMAGE001
, the meteorological value of k~kM time points before the forecast time
Figure 226029DEST_PATH_IMAGE002
, the load variation at k~kM time points before the prediction time
Figure DEST_PATH_IMAGE003
, month feature, date feature, date type feature, hour feature of the start time to be predicted, and minute feature of the start time to be predicted;

其中,k为待预测的时刻点;M为一个可调的超参数,且M是正整数值;待预测时刻前k-M个时刻点的负荷变化量

Figure 126857DEST_PATH_IMAGE004
,以此类推。Among them, k is the time point to be predicted; M is an adjustable hyperparameter, and M is a positive integer value; the load change at kM time points before the time to be predicted
Figure 126857DEST_PATH_IMAGE004
, and so on.

进一步的,所述将所述预测样本特征输入至所述训练后的XGBoost多目标回归模型,得到预测的短期负荷,包括:Further, inputting the predicted sample features into the trained XGBoost multi-objective regression model to obtain the predicted short-term load, including:

按下式确定所述训练后的XGBoost多目标回归模型的目标函数:The objective function of the trained XGBoost multi-objective regression model is determined as follows:

Figure DEST_PATH_IMAGE005
Figure DEST_PATH_IMAGE005

上式中,

Figure 382562DEST_PATH_IMAGE006
,n为预测样本特征的总数量,
Figure DEST_PATH_IMAGE007
,m为待预测的时刻点的总数量;
Figure 27039DEST_PATH_IMAGE008
为第i个预测样本特征的特征向量,
Figure DEST_PATH_IMAGE009
为第i个预测样本特征的第k个待预测的时刻点的实际负荷值,
Figure 825099DEST_PATH_IMAGE010
为第t-1轮迭代时对第i个预测样本特征第k个待预测的时刻点的预测负荷值,
Figure DEST_PATH_IMAGE011
为预测第k个待预测的时刻点的负荷值时第t轮迭代中增加的新模型,
Figure 162409DEST_PATH_IMAGE012
为预测第k个待预测的时刻点的负荷值时第t轮迭代增加的新模型的复杂度,
Figure DEST_PATH_IMAGE013
为损失函数;In the above formula,
Figure 382562DEST_PATH_IMAGE006
, n is the total number of predicted sample features,
Figure DEST_PATH_IMAGE007
, m is the total number of time points to be predicted;
Figure 27039DEST_PATH_IMAGE008
is the feature vector of the i-th predicted sample feature,
Figure DEST_PATH_IMAGE009
is the actual load value of the kth to-be-predicted moment of the i-th prediction sample feature,
Figure 825099DEST_PATH_IMAGE010
is the predicted load value of the k-th time point to be predicted for the i-th predicted sample feature during the t-1 round of iteration,
Figure DEST_PATH_IMAGE011
is the new model added in the t-th iteration when predicting the load value at the k-th time point to be predicted,
Figure 162409DEST_PATH_IMAGE012
is the complexity of the new model added by the t-th iteration when predicting the load value at the k-th time point to be predicted,
Figure DEST_PATH_IMAGE013
is the loss function;

将所述预测样本特征作为自变量输入至所述训练后的XGBoost多目标回归模型的目标函数,当

Figure 678710DEST_PATH_IMAGE014
小于等于第二阈值或迭代轮数t等于第三阈值时,输出预测的所有时刻点的负荷,所述预测的所有时刻点的负荷为预测的短期负荷。The predicted sample feature is input to the objective function of the XGBoost multi-objective regression model after the training as an independent variable, when
Figure 678710DEST_PATH_IMAGE014
When it is less than or equal to the second threshold or when the number of iteration rounds t is equal to the third threshold, the predicted loads at all time points are output, and the predicted loads at all time points are predicted short-term loads.

根据本申请实施例的第二方面,提供一种系统日前短期负荷预测装置,所述装置包括:According to a second aspect of the embodiments of the present application, a system day-ahead short-term load forecasting device is provided, the device comprising:

预处理模块,用于采集历史数据,并对所述历史数据进行预处理;a preprocessing module for collecting historical data and preprocessing the historical data;

构建模块,用于利用预处理后的历史数据构建训练样本集;The building module is used to construct a training sample set using the preprocessed historical data;

训练模块,用于利用所述训练样本集对预先建立的XGBoost多目标回归模型进行训练,得到训练后的XGBoost多目标回归模型;A training module for training the pre-established XGBoost multi-objective regression model using the training sample set to obtain the trained XGBoost multi-objective regression model;

生成模块,用于生成预测样本特征;A generation module is used to generate predicted sample features;

预测模块,用于将所述预测样本特征输入至所述训练后的XGBoost多目标回归模型,得到预测的短期负荷。A prediction module, configured to input the predicted sample features into the trained XGBoost multi-objective regression model to obtain a predicted short-term load.

根据本申请实施例的第三方面,提供一种系统日前短期负荷预测设备,包括:According to a third aspect of the embodiments of the present application, a system day-ahead short-term load forecasting device is provided, including:

存储器,其上存储有可执行程序;a memory on which an executable program is stored;

处理器,用于执行所述存储器中的所述可执行程序,以实现上述的所述一种系统日前短期负荷预测方法的步骤。The processor is configured to execute the executable program in the memory, so as to implement the steps of the above-mentioned method for short-term load forecasting for the day ahead of the system.

根据本申请实施例的第四方面,提供一种可读存储介质,其上存储有可执行程序,所述可执行程序被处理器执行时实现上述的所述一种系统日前短期负荷预测方法的步骤。According to a fourth aspect of the embodiments of the present application, a readable storage medium is provided, on which an executable program is stored, and when the executable program is executed by a processor, the above-mentioned method for predicting a system day-ahead short-term load is implemented. step.

本发明采用以上技术方案,能够达到的有益效果包括:通过采集历史数据,并对所述历史数据进行预处理,利用预处理后的历史数据构建训练样本集,利用所述训练样本集对预先建立的XGBoost多目标回归模型进行训练,得到训练后的XGBoost多目标回归模型,生成预测样本特征,将所述预测样本特征输入至所述训练后的XGBoost多目标回归模型,得到预测的短期负荷,不仅提高了模型训练、部署和预测的效率,还提高了短期负荷预测的精准度。The present invention adopts the above technical solutions, and the beneficial effects that can be achieved include: by collecting historical data, preprocessing the historical data, using the preprocessed historical data to construct a training sample set, and using the training sample set to pre-establish a The XGBoost multi-objective regression model is trained, the XGBoost multi-objective regression model after training is obtained, the predicted sample features are generated, the predicted sample features are input into the XGBoost multi-objective regression model after the training, and the predicted short-term load is obtained, not only It improves the efficiency of model training, deployment and prediction, and also improves the accuracy of short-term load prediction.

附图说明Description of drawings

为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to illustrate the embodiments of the present invention or the technical solutions in the prior art more clearly, the following briefly introduces the accompanying drawings that are used in the description of the embodiments or the prior art. Obviously, the drawings in the following description are only These are some embodiments of the present invention. For those of ordinary skill in the art, other drawings can also be obtained according to these drawings without creative efforts.

图1是根据一示例性实施例示出的一种系统日前短期负荷预测方法的流程图;FIG. 1 is a flowchart of a method for predicting a system day-ahead short-term load according to an exemplary embodiment;

图2是根据一示例性实施例示出的是一种系统日前短期负荷预测装置的结构框图。FIG. 2 is a structural block diagram of a system day-ahead short-term load forecasting apparatus according to an exemplary embodiment.

具体实施方式Detailed ways

为使本发明的目的、技术方案和优点更加清楚,下面将对本发明的技术方案进行详细的描述。显然,所描述的实施例仅仅是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动的前提下所得到的所有其它实施方式,都属于本发明所保护的范围。In order to make the objectives, technical solutions and advantages of the present invention clearer, the technical solutions of the present invention will be described in detail below. Obviously, the described embodiments are only some, but not all, embodiments of the present invention. Based on the embodiments of the present invention, all other implementations obtained by those of ordinary skill in the art without creative work fall within the protection scope of the present invention.

图1是根据一示例性实施例示出的一种系统日前短期负荷预测方法的流程图,如图1所示,该方法可以但不限于用于终端中,包括以下步骤:Fig. 1 is a flow chart of a method for predicting a system day-ahead short-term load according to an exemplary embodiment. As shown in Fig. 1 , the method can be used in a terminal, but is not limited to, including the following steps:

步骤101:采集历史数据,并对历史数据进行预处理;Step 101: collect historical data, and preprocess the historical data;

步骤102:利用预处理后的历史数据构建训练样本集;Step 102: use the preprocessed historical data to construct a training sample set;

步骤103:利用训练样本集对预先建立的XGBoost多目标回归模型进行训练,得到训练后的XGBoost多目标回归模型;Step 103: using the training sample set to train the pre-established XGBoost multi-objective regression model to obtain a trained XGBoost multi-objective regression model;

步骤104:生成预测样本特征;Step 104: generating predicted sample features;

步骤105:将预测样本特征输入至训练后的XGBoost多目标回归模型,得到预测的短期负荷。Step 105: Input the predicted sample features into the trained XGBoost multi-objective regression model to obtain the predicted short-term load.

本发明实施例提供的一种系统日前短期负荷预测方法,与传统方法不同,本申请将日前短期负荷预测问题定义为一个多目标学习任务,通过构建一个多目标的回归模型实现对未来 96 个时刻负荷值的预测。具体的,通过采集历史数据,并对历史数据进行预处理,利用预处理后的历史数据构建训练样本集,利用训练样本集对预先建立的XGBoost多目标回归模型进行训练,得到训练后的XGBoost多目标回归模型,实现了对多目标回归模型的训练和预测;通过生成预测样本特征,将预测样本特征输入至训练后的XGBoost多目标回归模型,得到预测的短期负荷,不仅提高了模型训练、部署和预测的效率,还提高了短期负荷预测的精准度。The embodiment of the present invention provides a method for short-term load forecasting for the day ahead of the system. Different from the traditional method, the present application defines the short-term load forecasting problem for the day ahead as a multi-objective learning task. Prediction of load values. Specifically, by collecting historical data and preprocessing the historical data, using the preprocessed historical data to construct a training sample set, and using the training sample set to train the pre-established XGBoost multi-objective regression model, the trained XGBoost multi-objective regression model is obtained. The target regression model realizes the training and prediction of the multi-target regression model; by generating the predicted sample features, the predicted sample features are input to the XGBoost multi-target regression model after training, and the predicted short-term load is obtained, which not only improves the model training and deployment. and forecasting efficiency, and also improve the accuracy of short-term load forecasting.

进一步的,历史数据,包括:历史的短期负荷值、历史的气象值和历史的负荷变化量;Further, historical data, including: historical short-term load value, historical meteorological value and historical load variation;

历史的气象值中的气象指标包括:温度、湿度、降雨量、风力和云量。The meteorological indicators in the historical meteorological values include: temperature, humidity, rainfall, wind and cloudiness.

进一步的,步骤101中对历史数据进行预处理,包括:Further, in step 101, the historical data is preprocessed, including:

利用插值法填补历史数据中的缺失值,并对填补缺失值后的历史数据进行归一化处理,得到预处理后的历史数据。The interpolation method is used to fill in the missing values in the historical data, and the historical data after filling the missing values is normalized to obtain the preprocessed historical data.

需要说明的是,本发明实施例涉及的“利用插值法填补历史数据中的缺失值”和“对填补缺失值后的历史数据进行归一化处理”方式,是本领域技术人员所熟知的,因此,其具体实现方式不做过多描述。It should be noted that the methods of "using interpolation to fill missing values in historical data" and "normalizing historical data after filling missing values" involved in the embodiments of the present invention are well known to those skilled in the art. Therefore, its specific implementation will not be described too much.

进一步的,步骤102,包括:Further, step 102 includes:

将预处理后的历史数据的时刻划分为96个时刻点,利用该96个时刻点的预处理后的历史数据构建训练样本集。The time of the preprocessed historical data is divided into 96 time points, and a training sample set is constructed by using the preprocessed historical data of the 96 time points.

可以理解的是,将一天的时间分为96个时刻点,即每个时刻点间隔15分钟。这样的样本构建方式与传统方法相比可以获得近 96 倍的样本量,能够有效支持在大规模特征空间上的模型参数的充分更新,避免模型欠拟合。It is understandable that the time of day is divided into 96 time points, that is, each time point is spaced 15 minutes apart. Compared with the traditional method, such a sample construction method can obtain nearly 96 times the sample size, which can effectively support the sufficient update of model parameters in a large-scale feature space and avoid model underfitting.

需要说明的是,对于多目标回归任务来说,对于样本中每一个label是使用同样的模型和特征进行建模的,因此与传统方法相比,模型训练复杂度会大大降低,且有利于使用大量数据集用于模型的训练。It should be noted that for the multi-objective regression task, the same model and features are used for modeling each label in the sample, so compared with the traditional method, the model training complexity will be greatly reduced, and it is beneficial to use A large dataset is used for model training.

进一步的,步骤103,包括:Further, step 103 includes:

将训练样本集分为训练集和验证集;Divide the training sample set into training set and validation set;

利用训练集对预先建立的XGBoost多目标回归模型进行训练,直至当利用验证集对预先建立的XGBoost多目标回归模型进行验证时,得到的预测的历史负荷值与实际的历史负荷值的误差小于第一阈值时,训练结束,得到训练后的XGBoost多目标回归模型。Use the training set to train the pre-established XGBoost multi-objective regression model, until when the validation set is used to verify the pre-established XGBoost multi-objective regression model, the error between the predicted historical load value and the actual historical load value is less than the first When a threshold is reached, the training ends, and the trained XGBoost multi-objective regression model is obtained.

进一步的,预测样本特征,包括:Further, predict the sample characteristics, including:

待预测时刻前k~k-M个时刻点的负荷值

Figure 251511DEST_PATH_IMAGE001
、待预测时刻前k~k-M个时刻点的气象值
Figure 467860DEST_PATH_IMAGE002
、待预测时刻前k~k-M个时刻点的负荷变化量
Figure 100704DEST_PATH_IMAGE003
、月份特征、日期特征、日期类型特征、待预测开始时刻所在小时特征和待预测开始时刻分钟特征;Load value at k~kM time points before the time to be predicted
Figure 251511DEST_PATH_IMAGE001
, the meteorological value of k~kM time points before the forecast time
Figure 467860DEST_PATH_IMAGE002
, the load variation at k~kM time points before the prediction time
Figure 100704DEST_PATH_IMAGE003
, month feature, date feature, date type feature, hour feature of the start time to be predicted, and minute feature of the start time to be predicted;

其中,k为待预测的时刻点;M为一个可调的超参数,且M是正整数值;待预测时刻前k-M个时刻点的负荷变化量

Figure 730400DEST_PATH_IMAGE004
,以此类推。Among them, k is the time point to be predicted; M is an adjustable hyperparameter, and M is a positive integer value; the load change at kM time points before the time to be predicted
Figure 730400DEST_PATH_IMAGE004
, and so on.

具体的,月份特征指的是一年的12个月份,日期特征指的是每个月份中每一天,日期类型特征指的是星期一到星期日。Specifically, the month feature refers to 12 months of a year, the date feature refers to each day in each month, and the date type feature refers to Monday to Sunday.

例如,根据上述的内容可知,将一天分为96个时刻点,每个时刻点间隔15分钟,那么预测样本特征的时刻点也是分为了96个时刻点,早上0点即为第一个时刻点。假设需要预测明天早上1点的负荷值,那么k为5。假设M为4,则k-M=5-4=1,k-M+1=5-4+1=2,k-M+2=5-4+2=3,k-M+3=5-4+3=4。For example, according to the above content, one day is divided into 96 time points, each time point is 15 minutes apart, then the time points for predicting the sample features are also divided into 96 time points, and 0:00 in the morning is the first time point. . Suppose you need to predict the load value at 1 am tomorrow morning, then k is 5. Suppose M is 4, then k-M=5-4=1, k-M+1=5-4+1=2, k-M+2=5-4+2=3, k-M+3=5- 4+3=4.

需要说明的是,dmlc原生的XGBoost多目标回归模型无法支持多目标回归模型的训练和预测,如下所示,单一目标回归任务的 XGBoost 模型定义的损失函数为:It should be noted that the native XGBoost multi-objective regression model of dmlc cannot support the training and prediction of the multi-objective regression model. As shown below, the loss function defined by the XGBoost model of the single-objective regression task is:

Figure 841313DEST_PATH_IMAGE015
Figure 841313DEST_PATH_IMAGE015

上式中,

Figure 240065DEST_PATH_IMAGE016
,n为样本的总数量;
Figure 43810DEST_PATH_IMAGE017
为第i个样本label,
Figure 895223DEST_PATH_IMAGE008
第i个样本的特征向量,
Figure 809827DEST_PATH_IMAGE018
为模型在第t-1轮迭代时对第i个样本的预测值,
Figure 328664DEST_PATH_IMAGE019
为第t轮迭代要新增的模型,
Figure 37732DEST_PATH_IMAGE020
为第t轮迭代要新增的模型的复杂度,
Figure 642020DEST_PATH_IMAGE021
为损失函数。In the above formula,
Figure 240065DEST_PATH_IMAGE016
, n is the total number of samples;
Figure 43810DEST_PATH_IMAGE017
is the i-th sample label,
Figure 895223DEST_PATH_IMAGE008
The eigenvector of the ith sample,
Figure 809827DEST_PATH_IMAGE018
is the predicted value of the model for the ith sample at the t-1th iteration,
Figure 328664DEST_PATH_IMAGE019
The model to be added for the t-th iteration,
Figure 37732DEST_PATH_IMAGE020
is the complexity of the model to be added for the t-th iteration,
Figure 642020DEST_PATH_IMAGE021
is the loss function.

本发明实施例提供的技术方案将XGBoost多目标回归模型的损失函数进行修改,使之可以支持多目标回归模型,则进一步的,步骤105,包括:The technical solution provided by the embodiment of the present invention modifies the loss function of the XGBoost multi-objective regression model so that it can support the multi-objective regression model. Further, step 105 includes:

按下式确定训练后的XGBoost多目标回归模型的目标函数:Determine the objective function of the trained XGBoost multi-objective regression model as follows:

Figure 94735DEST_PATH_IMAGE005
Figure 94735DEST_PATH_IMAGE005

上式中,

Figure 468079DEST_PATH_IMAGE006
,n为预测样本特征的总数量,
Figure 348048DEST_PATH_IMAGE007
,m为待预测的时刻点的总数量;
Figure 875850DEST_PATH_IMAGE008
为第i个预测样本特征的特征向量,
Figure 633722DEST_PATH_IMAGE009
为第i个预测样本特征的第k个待预测的时刻点的实际负荷值,
Figure 360107DEST_PATH_IMAGE010
为第t-1轮迭代时对第i个预测样本特征第k个待预测的时刻点的预测负荷值,
Figure 178022DEST_PATH_IMAGE011
为预测第k个待预测的时刻点的负荷值时第t轮迭代中增加的新模型,
Figure 989857DEST_PATH_IMAGE012
为预测第k个待预测的时刻点的负荷值时第t轮迭代增加的新模型的复杂度,
Figure 285841DEST_PATH_IMAGE013
为损失函数;In the above formula,
Figure 468079DEST_PATH_IMAGE006
, n is the total number of predicted sample features,
Figure 348048DEST_PATH_IMAGE007
, m is the total number of time points to be predicted;
Figure 875850DEST_PATH_IMAGE008
is the feature vector of the i-th predicted sample feature,
Figure 633722DEST_PATH_IMAGE009
is the actual load value of the kth to-be-predicted moment of the i-th prediction sample feature,
Figure 360107DEST_PATH_IMAGE010
is the predicted load value of the k-th time point to be predicted for the i-th predicted sample feature during the t-1 round of iteration,
Figure 178022DEST_PATH_IMAGE011
is the new model added in the t-th iteration when predicting the load value at the k-th time point to be predicted,
Figure 989857DEST_PATH_IMAGE012
is the complexity of the new model added by the t-th iteration when predicting the load value at the k-th time point to be predicted,
Figure 285841DEST_PATH_IMAGE013
is the loss function;

将预测样本特征作为自变量输入至训练后的XGBoost多目标回归模型的目标函数,当

Figure 155749DEST_PATH_IMAGE014
小于等于第二阈值或迭代轮数t等于第三阈值时,输出预测的所有时刻点的负荷,预测的所有时刻点的负荷为预测的短期负荷。Input the predicted sample features as independent variables to the objective function of the trained XGBoost multi-objective regression model, when
Figure 155749DEST_PATH_IMAGE014
When it is less than or equal to the second threshold or when the number of iteration rounds t is equal to the third threshold, the predicted loads at all time points are output, and the predicted loads at all time points are predicted short-term loads.

可以理解的是,本发明实施例提供的方法可以体现电力负荷作为一个连续的时间序列,待预测时刻点负荷值不仅与历史日同时刻值相关,同时与可获取的最邻近点的负荷值相关性也很大。It can be understood that the method provided by the embodiment of the present invention can reflect the power load as a continuous time series, and the load value at the moment to be predicted is not only related to the historical value at the same time, but also is related to the load value of the nearest point that can be obtained. Sex is also great.

经过实验证明,本发明实施例提供的一种系统日前短期负荷预测方法,在一段时间的稳定运行后,统计平均RMPSE精度可达到97.29%,与前期部署的传统方法94.43%的精度有明显的提高。在运行效率方面,新方法架构将模型训练流程与预测流程进行了拆解,训练流程为一个定时调度任务,而预测流程为调用任务,相比传统方法将训练流程和预测流程耦合在一起的方式,预测效率有了大幅提升,从之前的耗时近30s提高到了400ms。Experiments have shown that the method for short-term load forecasting in the day-ahead system provided by the embodiment of the present invention can achieve a statistical average RMPSE accuracy of 97.29% after a period of stable operation, which is significantly improved from the 94.43% accuracy of the traditional method deployed in the early stage. . In terms of operating efficiency, the new method architecture disassembles the model training process and the prediction process. The training process is a scheduled task, and the prediction process is a calling task. Compared with the traditional method, the training process and the prediction process are coupled together. , the prediction efficiency has been greatly improved, from the previous time-consuming nearly 30s to 400ms.

本发明实施例提供的一种系统日前短期负荷预测方法,与传统方法不同,本申请将日前短期负荷预测问题定义为一个多目标学习任务,通过构建一个多目标的回归模型实现对未来 96 个时刻负荷值的预测。具体的,通过采集历史数据,并对历史数据进行预处理,利用预处理后的历史数据构建训练样本集,利用训练样本集对预先建立的XGBoost多目标回归模型进行训练,得到训练后的XGBoost多目标回归模型,实现了对多目标回归模型的训练和预测;通过生成预测样本特征,将预测样本特征输入至训练后的XGBoost多目标回归模型,得到预测的短期负荷,不仅提高了模型训练、部署和预测的效率,还提高了短期负荷预测的精准度,与传统的方法相比。The embodiment of the present invention provides a method for short-term load forecasting for the day ahead of the system. Different from the traditional method, the present application defines the short-term load forecasting problem for the day ahead as a multi-objective learning task. Prediction of load values. Specifically, by collecting historical data and preprocessing the historical data, using the preprocessed historical data to construct a training sample set, and using the training sample set to train the pre-established XGBoost multi-objective regression model, the trained XGBoost multi-objective regression model is obtained. The target regression model realizes the training and prediction of the multi-target regression model; by generating the predicted sample features, the predicted sample features are input to the XGBoost multi-target regression model after training, and the predicted short-term load is obtained, which not only improves the model training and deployment. and forecasting efficiency, and also improve the accuracy of short-term load forecasting, compared with traditional methods.

本发明实施例还提供一种系统日前短期负荷预测装置,如图2所示,该装置包括:An embodiment of the present invention also provides a system day-ahead short-term load forecasting device, as shown in FIG. 2 , the device includes:

预处理模块,用于采集历史数据,并对历史数据进行预处理;The preprocessing module is used to collect historical data and preprocess the historical data;

构建模块,用于利用预处理后的历史数据构建训练样本集;The building module is used to construct a training sample set using the preprocessed historical data;

训练模块,用于利用训练样本集对预先建立的XGBoost多目标回归模型进行训练,得到训练后的XGBoost多目标回归模型;The training module is used to train the pre-established XGBoost multi-objective regression model by using the training sample set to obtain the trained XGBoost multi-objective regression model;

生成模块,用于生成预测样本特征;A generation module is used to generate predicted sample features;

预测模块,用于将预测样本特征输入至训练后的XGBoost多目标回归模型,得到预测的短期负荷。The prediction module is used to input the predicted sample features into the trained XGBoost multi-objective regression model to obtain the predicted short-term load.

进一步的,历史数据,包括:历史的短期负荷值、历史的气象值和历史的负荷变化量;Further, historical data, including: historical short-term load value, historical meteorological value and historical load variation;

历史的气象值中的气象指标包括:温度、湿度、降雨量、风力和云量。The meteorological indicators in the historical meteorological values include: temperature, humidity, rainfall, wind and cloudiness.

进一步的,预处理模块,具体用于:Further, the preprocessing module is specifically used for:

利用插值法填补历史数据中的缺失值,并对填补缺失值后的历史数据进行归一化处理,得到预处理后的历史数据。The interpolation method is used to fill in the missing values in the historical data, and the historical data after filling the missing values is normalized to obtain the preprocessed historical data.

进一步的,构建模块,具体用于:Further, building modules, specifically for:

将预处理后的历史数据的时刻点长度设置为96,利用该96个时刻点的预处理后的历史数据构建训练样本集。The time point length of the preprocessed historical data is set to 96, and a training sample set is constructed by using the preprocessed historical data of the 96 time points.

进一步的,训练模块,具体用于:Further, the training module is specifically used for:

将训练样本集分为训练集和验证集;Divide the training sample set into training set and validation set;

利用训练集对预先建立的XGBoost多目标回归模型进行训练,直至当利用验证集对预先建立的XGBoost多目标回归模型进行验证时,得到的预测的历史负荷值与实际的历史负荷值的误差小于第一阈值时,训练结束,得到训练后的XGBoost多目标回归模型。Use the training set to train the pre-established XGBoost multi-objective regression model, until when the validation set is used to verify the pre-established XGBoost multi-objective regression model, the error between the predicted historical load value and the actual historical load value is less than the first When a threshold is reached, the training ends, and the trained XGBoost multi-objective regression model is obtained.

进一步的,预测样本特征,包括:Further, predict the sample characteristics, including:

待预测时刻前k~k-M个时刻点的负荷值

Figure 878986DEST_PATH_IMAGE001
、待预测时刻前k~k-M个时刻点的气象值
Figure 178118DEST_PATH_IMAGE002
、待预测时刻前k~k-M个时刻点的负荷变化量
Figure 215475DEST_PATH_IMAGE003
、月份特征、日期特征、日期类型特征、待预测开始时刻所在小时特征和待预测开始时刻分钟特征;Load value at k~kM time points before the time to be predicted
Figure 878986DEST_PATH_IMAGE001
, the meteorological value of k~kM time points before the forecast time
Figure 178118DEST_PATH_IMAGE002
, the load variation at k~kM time points before the prediction time
Figure 215475DEST_PATH_IMAGE003
, month feature, date feature, date type feature, hour feature of the start time to be predicted, and minute feature of the start time to be predicted;

其中,k为待预测的时刻点;M为一个可调的超参数,且M是正整数值;待预测时刻前k-M个时刻点的负荷变化量

Figure 916453DEST_PATH_IMAGE004
,以此类推。Among them, k is the time point to be predicted; M is an adjustable hyperparameter, and M is a positive integer value; the load change at kM time points before the time to be predicted
Figure 916453DEST_PATH_IMAGE004
, and so on.

进一步的,预测模块,具体用于:Further, the prediction module is specifically used for:

按下式确定训练后的XGBoost多目标回归模型的目标函数:Determine the objective function of the trained XGBoost multi-objective regression model as follows:

Figure 76170DEST_PATH_IMAGE005
Figure 76170DEST_PATH_IMAGE005

上式中,

Figure 862598DEST_PATH_IMAGE006
,n为预测样本特征的总数量,
Figure 500384DEST_PATH_IMAGE007
,m为待预测的时刻点的总数量;
Figure 55868DEST_PATH_IMAGE008
为第i个预测样本特征的特征向量,
Figure 386486DEST_PATH_IMAGE009
为第i个预测样本特征的第k个待预测的时刻点的实际负荷值,
Figure 597893DEST_PATH_IMAGE010
为第t-1轮迭代时对第i个预测样本特征第k个待预测的时刻点的预测负荷值,
Figure 537905DEST_PATH_IMAGE011
为预测第k个待预测的时刻点的负荷值时第t轮迭代中增加的新模型,
Figure 449361DEST_PATH_IMAGE012
为预测第k个待预测的时刻点的负荷值时第t轮迭代增加的新模型的复杂度,
Figure 449415DEST_PATH_IMAGE013
为损失函数;In the above formula,
Figure 862598DEST_PATH_IMAGE006
, n is the total number of predicted sample features,
Figure 500384DEST_PATH_IMAGE007
, m is the total number of time points to be predicted;
Figure 55868DEST_PATH_IMAGE008
is the feature vector of the i-th predicted sample feature,
Figure 386486DEST_PATH_IMAGE009
is the actual load value of the kth to-be-predicted moment of the i-th prediction sample feature,
Figure 597893DEST_PATH_IMAGE010
is the predicted load value of the k-th time point to be predicted for the i-th predicted sample feature during the t-1 round of iteration,
Figure 537905DEST_PATH_IMAGE011
is the new model added in the t-th iteration when predicting the load value at the k-th time point to be predicted,
Figure 449361DEST_PATH_IMAGE012
is the complexity of the new model added by the t-th iteration when predicting the load value at the k-th time point to be predicted,
Figure 449415DEST_PATH_IMAGE013
is the loss function;

将预测样本特征作为自变量输入至训练后的XGBoost多目标回归模型的目标函数,当

Figure 711901DEST_PATH_IMAGE014
小于等于第二阈值或迭代轮数t等于第三阈值时,输出预测的所有时刻点的负荷,预测的所有时刻点的负荷为预测的短期负荷。Input the predicted sample features as independent variables to the objective function of the trained XGBoost multi-objective regression model, when
Figure 711901DEST_PATH_IMAGE014
When it is less than or equal to the second threshold or when the number of iteration rounds t is equal to the third threshold, the predicted loads at all time points are output, and the predicted loads at all time points are predicted short-term loads.

本发明实施例提供的一种系统日前短期负荷预测方法,与传统方法不同,本申请将日前短期负荷预测问题定义为一个多目标学习任务,通过构建一个多目标的回归模型实现对未来 96 个时刻负荷值的预测。具体的,通过预处理模块采集历史数据,并对历史数据进行预处理,构建模块利用预处理后的历史数据构建训练样本集,训练模块利用训练样本集对预先建立的XGBoost多目标回归模型进行训练,得到训练后的XGBoost多目标回归模型,实现了对多目标回归模型的训练和预测;通过生成模块生成预测样本特征,预测模块将预测样本特征输入至训练后的XGBoost多目标回归模型,得到预测的短期负荷,不仅提高了模型训练、部署和预测的效率,还提高了短期负荷预测的精准度,与传统的方法相比。The embodiment of the present invention provides a method for short-term load forecasting for the day ahead of the system. Different from the traditional method, the present application defines the short-term load forecasting problem for the day ahead as a multi-objective learning task. Prediction of load values. Specifically, the historical data is collected through the preprocessing module, and the historical data is preprocessed. The building module uses the preprocessed historical data to construct a training sample set, and the training module uses the training sample set to train the pre-established XGBoost multi-objective regression model. , the trained XGBoost multi-objective regression model is obtained, which realizes the training and prediction of the multi-objective regression model; the predicted sample features are generated by the generation module, and the prediction module inputs the predicted sample features into the trained XGBoost multi-objective regression model to obtain the prediction. It not only improves the efficiency of model training, deployment and prediction, but also improves the accuracy of short-term load prediction, compared with traditional methods.

可以理解的是,上述提供的装置实施例与上述的方法实施例对应,相应的具体内容可以相互参考,在此不再赘述。It can be understood that the above-mentioned apparatus embodiments correspond to the above-mentioned method embodiments, and the corresponding specific contents can be referred to each other, which will not be repeated here.

本发明实施例还提供一种系统日前短期负荷预测设备,包括:The embodiment of the present invention also provides a system day-ahead short-term load forecasting device, including:

存储器,其上存储有可执行程序;a memory on which an executable program is stored;

处理器,用于执行存储器中的可执行程序,以实现上述实施例提供的系统日前短期负荷预测方法的步骤。The processor is configured to execute the executable program in the memory, so as to implement the steps of the method for predicting the short-term load ahead of the system day provided by the foregoing embodiment.

本发明实施例还提供一种可读存储介质,其上存储有可执行程序,该可执行程序被处理器执行时实现上述实施例提供的系统日前短期负荷预测方法的步骤。Embodiments of the present invention further provide a readable storage medium on which an executable program is stored, and when the executable program is executed by a processor, implements the steps of the method for predicting system day-ahead short-term load provided by the foregoing embodiments.

本领域内的技术人员应明白,本申请的实施例可提供为方法、系统、或计算机程序产品。因此,本申请可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本申请可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器和光学存储器等)上实施的计算机程序产品的形式。As will be appreciated by those skilled in the art, the embodiments of the present application may be provided as a method, a system, or a computer program product. Accordingly, the present application may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present application may take the form of a computer program product embodied on one or more computer-usable storage media having computer-usable program code embodied therein, including but not limited to disk storage, optical storage, and the like.

本申请是参照根据本申请实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。The present application is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the present application. It will be understood that each flow and/or block in the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general purpose computer, special purpose computer, embedded processor or other programmable data processing device to produce a machine such that the instructions executed by the processor of the computer or other programmable data processing device produce Means for implementing the functions specified in one or more of the flowcharts and/or one or more blocks of the block diagrams.

这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令方法的制造品,该指令方法实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions may also be stored in a computer readable memory capable of directing a computer or other programmable data processing device to function in a particular manner, such that the instructions stored in the computer readable memory result in an article of manufacture comprising the method of the instructions, the instructions A method implements the functionality specified in a flow or flows of a flowchart and/or a block or blocks of a block diagram.

这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上,使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded on a computer or other programmable data processing device to cause a series of operational steps to be performed on the computer or other programmable device to produce a computer-implemented process such that The instructions provide steps for implementing the functions specified in one or more of the flowcharts and/or one or more blocks of the block diagrams.

以上所述,仅为本发明的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本发明的保护范围之内。因此,本发明的保护范围应以所述权利要求的保护范围为准。The above are only specific embodiments of the present invention, but the protection scope of the present invention is not limited to this. Any person skilled in the art can easily think of changes or substitutions within the technical scope disclosed by the present invention. should be included within the protection scope of the present invention. Therefore, the protection scope of the present invention should be based on the protection scope of the claims.

Claims (9)

1. A method for predicting a short-term load of a power system in the day ahead, the method comprising:
acquiring historical data and preprocessing the historical data;
constructing a training sample set by utilizing the preprocessed historical data;
training a pre-established XGboost multi-target regression model by using the training sample set to obtain a trained XGboost multi-target regression model;
generating predicted sample features;
inputting the predicted sample characteristics into the trained XGboost multi-target regression model to obtain a predicted short-term load;
inputting the predicted sample characteristics into the trained XGboost multi-objective regression model to obtain a predicted short-term load, wherein the predicted short-term load comprises the following steps:
determining an objective function of the trained XGboost multi-objective regression model according to the following formula:
Figure 328529DEST_PATH_IMAGE001
in the above formula, the first and second carbon atoms are,
Figure 273351DEST_PATH_IMAGE002
n is the total number of predicted sample features,
Figure 625835DEST_PATH_IMAGE003
m is the total number of time points to be predicted;
Figure 381433DEST_PATH_IMAGE004
for the feature vector of the ith prediction sample feature,
Figure 819367DEST_PATH_IMAGE005
for the actual load value of the power grid at the kth time point to be predicted of the ith prediction sample characteristic,
Figure 302301DEST_PATH_IMAGE006
predicting a load value of the power grid at a kth time point to be predicted of the ith prediction sample characteristic during the t-1 th iteration,
Figure 670745DEST_PATH_IMAGE007
to predict the grid load value at the kth point of time to be predicted, a new model is added in the t-th iteration,
Figure 597244DEST_PATH_IMAGE008
the complexity of a new model which is added in the t-th iteration when the grid load value of the kth time point to be predicted is predicted,
Figure 584791DEST_PATH_IMAGE009
is a loss function;
inputting the predicted sample characteristics as independent variables into the objective function of the trained XGboost multi-objective regression model
Figure 622149DEST_PATH_IMAGE010
Less than or equal to a second threshold or number of iterations t, etcAnd outputting the predicted loads of all the time points at the third threshold, wherein the predicted loads of all the time points are predicted short-term loads.
2. The method of claim 1, wherein the historical data comprises: historical short-term power grid load values, historical meteorological values and historical load variation;
the meteorological indexes in the historical meteorological values comprise: temperature, humidity, rainfall, wind, and cloud cover.
3. The method of claim 1, wherein the pre-processing the historical data comprises:
and utilizing an interpolation method to fill missing values in the historical data, and carrying out normalization processing on the historical data after the missing values are filled to obtain the preprocessed historical data.
4. The method of claim 1, wherein constructing a training sample set using the preprocessed historical data comprises:
and dividing the time of the preprocessed historical data into 96 time points, and constructing a training sample set by using the preprocessed historical data of the 96 time points.
5. The method as claimed in claim 1, wherein the training of the pre-established XGBoost multi-objective regression model by using the training sample set to obtain the trained XGBoost multi-objective regression model comprises:
dividing the training sample set into a training set and a verification set;
and training the pre-established XGboost multi-target regression model by using the training set until the error between the obtained predicted historical power grid load value and the actual historical power grid load value is smaller than a first threshold value when the pre-established XGboost multi-target regression model is verified by using the verification set, finishing training and obtaining the trained XGboost multi-target regression model.
6. The method of claim 1, wherein predicting the sample features comprises:
the power grid load value of k-M time points before the time to be predicted
Figure 559012DEST_PATH_IMAGE011
And the meteorological values of k-M time points before the time to be predicted
Figure 905679DEST_PATH_IMAGE012
Load variation of k-M time points before the time to be predicted
Figure 128326DEST_PATH_IMAGE013
A month feature, a date type feature, an hour feature where the start time to be predicted is located, and a minute feature of the start time to be predicted;
wherein k is a time point to be predicted; m is a tunable hyperparameter, and M is a positive integer value; load variation of k-M time points before time to be predicted
Figure 218642DEST_PATH_IMAGE014
And so on.
7. An apparatus for predicting a short-term load before a day in a power system, the apparatus comprising:
the preprocessing module is used for acquiring historical data and preprocessing the historical data;
the construction module is used for constructing a training sample set by utilizing the preprocessed historical data;
the training module is used for training a pre-established XGboost multi-target regression model by using the training sample set to obtain a trained XGboost multi-target regression model;
a generation module for generating predicted sample features;
the prediction module is used for inputting the characteristics of the prediction samples into the trained XGboost multi-target regression model to obtain predicted short-term load;
the prediction module is specifically configured to:
determining an objective function of the trained XGboost multi-objective regression model according to the following formula:
Figure 213273DEST_PATH_IMAGE001
in the above formula, the first and second carbon atoms are,
Figure 465263DEST_PATH_IMAGE002
n is the total number of predicted sample features,
Figure 240452DEST_PATH_IMAGE003
m is the total number of time points to be predicted;
Figure 619612DEST_PATH_IMAGE004
for the feature vector of the ith prediction sample feature,
Figure 983597DEST_PATH_IMAGE005
for the actual load value of the power grid at the kth time point to be predicted of the ith prediction sample characteristic,
Figure 154291DEST_PATH_IMAGE006
predicting a load value of the power grid at a kth time point to be predicted of the ith prediction sample characteristic during the t-1 th iteration,
Figure 603727DEST_PATH_IMAGE007
to predict the grid load value at the kth point of time to be predicted, a new model is added in the t-th iteration,
Figure 848895DEST_PATH_IMAGE008
for predicting the grid load of the kth time point to be predictedThe complexity of the new model added in the t-th iteration at load,
Figure 552540DEST_PATH_IMAGE009
is a loss function;
inputting the predicted sample characteristics as independent variables into the target function of the trained XGboost multi-target regression model
Figure 411911DEST_PATH_IMAGE010
And when the load is less than or equal to the second threshold or the iteration round number t is equal to the third threshold, outputting the predicted load of all the time points, wherein the predicted load of all the time points is the predicted short-term load.
8. A power system day-ahead short-term load prediction apparatus, comprising:
a memory having an executable program stored thereon;
a processor for executing the executable program in the memory to implement the steps of the method of any one of claims 1-6.
9. A readable storage medium having stored thereon an executable program, wherein the executable program, when executed by a processor, performs the steps of the method of any one of claims 1-6.
CN202111291156.6A 2021-11-03 2021-11-03 System day-ahead short-term load forecasting method, apparatus, device and readable storage medium Expired - Fee Related CN113723717B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202111291156.6A CN113723717B (en) 2021-11-03 2021-11-03 System day-ahead short-term load forecasting method, apparatus, device and readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202111291156.6A CN113723717B (en) 2021-11-03 2021-11-03 System day-ahead short-term load forecasting method, apparatus, device and readable storage medium

Publications (2)

Publication Number Publication Date
CN113723717A CN113723717A (en) 2021-11-30
CN113723717B true CN113723717B (en) 2022-04-29

Family

ID=78686524

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202111291156.6A Expired - Fee Related CN113723717B (en) 2021-11-03 2021-11-03 System day-ahead short-term load forecasting method, apparatus, device and readable storage medium

Country Status (1)

Country Link
CN (1) CN113723717B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114358406A (en) * 2021-12-27 2022-04-15 北京百度网讯科技有限公司 Resource data processing method, and resource prediction model training method and device
CN114529099B (en) * 2022-02-28 2025-01-03 北京京东振世信息技术有限公司 Demand data prediction method and device, electronic device, and storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108510113A (en) * 2018-03-21 2018-09-07 中南大学 A kind of application of XGBoost in short-term load forecasting
CN110472778A (en) * 2019-07-29 2019-11-19 上海电力大学 A kind of short-term load forecasting method based on Blending integrated study
CN112016734B (en) * 2020-04-07 2024-03-22 沈阳工业大学 LSTM stack-based self-coding multi-model load prediction method and system
AU2021105453A4 (en) * 2021-08-13 2021-10-14 Guangxi University Method for forecasting line loss rate in low-voltage station area based on extreme gradient lifting decision tree

Also Published As

Publication number Publication date
CN113723717A (en) 2021-11-30

Similar Documents

Publication Publication Date Title
CN102663513B (en) Utilize the wind power combined prediction modeling method of grey relational grade analysis
CN103390116B (en) Use the photovoltaic power station power generation power forecasting method of stepping mode
CN103279804B (en) The Forecasting Methodology of super short-period wind power
WO2023216576A1 (en) Photovoltaic power generation short-term power prediction method and system
CN111667093B (en) Medium-and-long-term wind power generation calculation method and device
CN103488869A (en) Wind power generation short-term load forecast method of least squares support vector machine
CN102184453A (en) Wind power combination predicting method based on fuzzy neural network and support vector machine
CN108053048A (en) A kind of gradual photovoltaic plant ultra-short term power forecasting method of single step and system
CN104732296A (en) Modeling method for distributed photovoltaic output power short-term prediction model
CN113723717B (en) System day-ahead short-term load forecasting method, apparatus, device and readable storage medium
WO2024046137A1 (en) Power prediction model construction method for multi-energy combined power generation system and power prediction method for multi-energy combined power generation system
Sogabe et al. Optimization of decentralized renewable energy system by weather forecasting and deep machine learning techniques
CN111917111B (en) Method, system, device and storage medium for online assessment of distributed photovoltaic power supply capacity of distribution network
CN102509027A (en) Wind powder combined predication method based on cross entropy theory
CN111915079B (en) Hybrid KNN wind power prediction method and system
CN109412161A (en) A kind of Probabilistic Load calculation method and system
CN114330934A (en) Model parameter self-adaptive GRU new energy short-term power generation power prediction method
CN114638396A (en) Photovoltaic power prediction method and system based on neural network instantiation
CN103489041A (en) Short-term wind power forecast method
CN117728404A (en) Method and system for predicting power generation capacity of medium-and-long-term photovoltaic field station
Phan et al. Application of a new Transformer-based model and XGBoost to improve one-day-ahead solar power forecasts
CN116663727A (en) Photovoltaic power prediction method and system
CN115809732A (en) Distributed photovoltaic power generation power prediction method and system
CN104915788A (en) Method for electric power system dynamic economic dispatching with correlation of multiple wind power plants being considered
CN115238854A (en) Short-term load prediction method based on TCN-LSTM-AM

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20220429

CF01 Termination of patent right due to non-payment of annual fee