CN117556369A

CN117556369A - Power theft detection method and system for dynamically generated residual error graph convolution neural network

Info

Publication number: CN117556369A
Application number: CN202410046231.XA
Authority: CN
Inventors: 庄伟�; 江文; 纪兆辉; 樊继利; 李之恒; 邢发男; 申义贤
Original assignee: Nanjing University of Information Science and Technology
Current assignee: Nanjing University of Information Science and Technology
Priority date: 2024-01-12
Filing date: 2024-01-12
Publication date: 2024-02-13
Anticipated expiration: 2044-01-12
Also published as: CN117556369B

Abstract

The invention discloses an electricity theft detection method based on a dynamically generated residual graph convolutional neural network, comprising the following steps: (1) collecting raw user electricity consumption data from a power system; (2) filling missing values and handling outliers in the raw data, and dividing the data into a training set, a validation set and a test set; (3) transforming the preprocessed one-dimensional power load curve into a two-dimensional power load feature matrix X as input to the graph convolutional neural network; (4) obtaining optimal parameters through training so as to obtain the adjacency matrix A that best represents the relationships in the data; (5) feeding the feature matrix X and the adjacency matrix A into the residual graph convolutional neural network to extract latent features, and obtaining the final classification through a pooling layer and a fully connected layer. By dynamically learning the deep temporal dependencies, periodic patterns and latent features among a user's electricity consumption periods, the invention improves the accuracy of electricity theft detection. The data completion method adopted effectively alleviates the data imbalance problem found in real-world data.

Description

Power theft detection method and system for dynamically generated residual error graph convolution neural network

Technical Field

The invention relates to the technical field of electricity theft detection, in particular to an electricity theft detection method and system of a dynamically generated residual error graph convolution neural network.

Background

The power industry is one of the pillar industries supporting national strategic development, and its healthy and stable development bears on the smooth operation of society. Electric energy losses in power transmission and distribution strongly affect the safety and economic performance of the power system; they comprise technical losses and non-technical losses. Research on electricity theft detection has grown in recent years. Traditional methods are blind and random, have low inspection efficiency, and are limited by the professional skill and detection experience of the inspection personnel. As the smart grid continues to develop and advanced metering infrastructure (AMI) matures, smart meters, an important component of AMI, collect large volumes of electricity load data at high frequency, and electricity theft detection has gradually evolved from traditional on-site inspection and evidence collection by staff to machine-learning-based methods.

However, most machine learning models proposed in the prior art model a single power load curve directly and cannot fully capture the temporal dependencies, spatial dependencies and latent periodic dependencies in electricity consumption data. Moreover, in practice normal electricity consumers far outnumber electricity-stealing consumers, and most methods cannot effectively analyze the consumption habits of electricity-stealing consumers, so they perform poorly in application and cannot be rolled out to real enterprises.

Disclosure of Invention

Purpose of the invention: the invention aims to provide an electricity theft detection method and system based on a dynamically generated residual graph convolutional neural network that accurately identify a user's electricity theft behavior.

The technical scheme is as follows: the invention discloses a method for detecting electricity theft of a dynamically generated residual error graph convolution neural network, which comprises the following steps:

(1) Collecting user power consumption original data in a power system;

(2) Performing missing supplement and outlier processing on the original data, and dividing a training set, a verification set and a test set;

(3) Transforming the preprocessed one-dimensional power load curve into a two-dimensional power load feature matrix X as input to the graph convolutional neural network;

(4) Taking each period of the data as a node, computing the correlations among nodes with a dynamic topological graph generation method, and obtaining the optimal parameters through training so as to obtain the adjacency matrix A that best represents the relationships in the data;

(5) Feeding the feature matrix X and the adjacency matrix A into the residual graph convolutional neural network to extract latent features, and obtaining the final classification through a pooling layer and a fully connected layer.

Further, step (1) is specifically as follows: the raw data are the daily electricity consumption records of 42372 users over 1035 days. The data of a single user form a 1×1035 time series.

Further, the step (2) includes the following steps:

(21) Handling missing electricity consumption data: users whose consumption data are more than 50% missing are deleted; otherwise, the missing data are filled by linear interpolation. For each missing value, the three values before it and the three values after it are taken out, and any of them that are themselves missing or empty are discarded directly; the six values taken out form one group, and the fill value is computed from them with the interpolation formula. The specific operation is as follows:

；

where $x_i$ denotes the electricity consumption on day $i$, recorded as NaN if it is empty;

(22) Outliers are individual values in a sample that deviate significantly from the rest of the observations; outliers are defined using the three-sigma rule, with the specific formula as follows:

；

where $\operatorname{avg}(x)$ is the mean of a single sample and $\operatorname{std}(x)$ is its standard deviation;

(23) Normalizing the data, mapping all values into the interval [0, 1];

(24) Dividing the processed data set into a training set, a verification set and a test set, wherein the proportion is 6:2:2.
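The preprocessing steps (21) to (24) can be sketched as follows. This is a minimal illustration rather than the patented implementation: the formula images are not reproduced in this text, so the sketch assumes interpolation between the nearest observed neighbours (via `numpy.interp`) instead of the six-value grouping, caps only values above mean + 3·std, and scales each user separately. The function names `preprocess` and `split_indices` are hypothetical.

```python
import numpy as np

def preprocess(load, max_missing=0.5, n_sigma=3.0):
    """load: (users, days) array with NaN marking missing readings."""
    load = np.asarray(load, dtype=float)
    # Step (21): drop users with more than 50% of their readings missing.
    keep = np.isnan(load).mean(axis=1) <= max_missing
    load = load[keep].copy()
    days = np.arange(load.shape[1])
    for row in load:  # each row is a view, so assignments modify `load`
        missing = np.isnan(row)
        if missing.any() and not missing.all():
            # Assumption: linear interpolation between nearest observed days.
            row[missing] = np.interp(days[missing], days[~missing], row[~missing])
    # Step (22): cap values above mean + n_sigma * std (assumed upper-side rule).
    upper = load.mean(axis=1, keepdims=True) + n_sigma * load.std(axis=1, keepdims=True)
    load = np.minimum(load, upper)
    # Step (23): min-max scaling into [0, 1] (assumed to be per user).
    lo = load.min(axis=1, keepdims=True)
    span = load.max(axis=1, keepdims=True) - lo
    load = (load - lo) / np.where(span == 0, 1.0, span)
    return load, keep

def split_indices(n_users, ratios=(0.6, 0.2, 0.2), seed=0):
    """Step (24): shuffle user indices and split them 6:2:2."""
    idx = np.random.default_rng(seed).permutation(n_users)
    n_train = round(ratios[0] * n_users)
    n_val = round(ratios[1] * n_users)
    return idx[:n_train], idx[n_train:n_train + n_val], idx[n_train + n_val:]
```

Calling `preprocess` on the raw user-by-day matrix returns the cleaned matrix together with a boolean mask of the users that were kept, which can then be used to filter the labels before calling `split_indices`.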

Further, step (3) is specifically as follows: the preprocessed one-dimensional power load curves cover N sample users, each with daily electricity consumption data; each curve is restructured into a two-dimensional feature matrix X which, together with the adjacency matrix A, serves as the input of the graph convolution module.
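The restructuring in step (3) might look like the sketch below. The matrix dimensions are not legible in this text, so the weekly period of 7 days and the edge padding of the last incomplete week are assumptions, and `to_feature_matrix` is a hypothetical helper name.

```python
import numpy as np

def to_feature_matrix(curve, period=7):
    """Reshape a 1-D daily load curve into a (periods, period) matrix.

    Each row is treated as one node (one period of data); `period=7`
    (weekly rows) is an assumption, not a value stated in the text.
    """
    curve = np.asarray(curve, dtype=float)
    pad = (-len(curve)) % period          # days needed to complete the last row
    padded = np.pad(curve, (0, pad), mode="edge")
    return padded.reshape(-1, period)

X = to_feature_matrix(np.arange(1035))   # one user's 1035-day curve
```

Under these assumptions a 1035-day curve becomes a matrix with 148 rows and 7 columns, so the learned graph would have 148 nodes.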

Further, the step (4) adaptively learns the graph adjacency matrix according to a dynamic topological graph generating method to capture an implicit connection relation between data, and the method comprises the following steps:

(41) Randomly generating n node vectors $E_n$, initializing a weight parameter W and a bias parameter b, and generating an initial relation measure between nodes by the following formula:

；

where $\sigma$ is the activation function; $\alpha$ is a hyperparameter controlling the oscillation amplitude of the preliminary relation measure; $V_n$ is the implicit representation of the node vector $E_n$; and $V_i$ and $V_j$ denote the $i$-th and $j$-th vectors in the sequence;

(42) Sorting the initial relation measure by rows and by columns respectively, selecting the k largest values and setting them to 1, and setting the remaining values to 0, to obtain the adjacency matrix A; where k is a hyperparameter chosen from {20, 25, 30, 35, 40, …, 80}.
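The top-k binarization in step (42) can be illustrated as below. The relation-measure formula itself is not reproduced in this text, so the score computation here (`tanh` of scaled inner products of the implicit vectors) is only an assumed stand-in; the sizes `n`, `d`, the value of `alpha`, and the name `topk_adjacency` are hypothetical. Only the row-wise selection is shown; the column-wise case is the same call on the transposed scores.

```python
import numpy as np

def topk_adjacency(scores, k):
    """Keep the k largest entries of every row as 1 and set the rest to 0."""
    scores = np.asarray(scores, dtype=float)
    top = np.argpartition(scores, -k, axis=1)[:, -k:]  # indices of k largest per row
    adj = np.zeros_like(scores)
    np.put_along_axis(adj, top, 1.0, axis=1)
    return adj

rng = np.random.default_rng(0)
n, d, alpha = 148, 16, 3.0                 # hypothetical sizes and scale
E = rng.normal(size=(n, d))                # randomly generated node vectors
W, b = rng.normal(size=(d, d)), np.zeros(d)
V = np.tanh(alpha * (E @ W + b))           # assumed implicit representation
scores = np.tanh(alpha * (V @ V.T))        # assumed initial relation measure
A = topk_adjacency(scores, k=20)
```

In training, E, W and b would be learnable parameters, so the adjacency matrix changes as the parameters are updated.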

Further, the residual graph convolutional neural network in step (5) comprises a 1×1 convolution module and a residual MixHop graph convolution module, and specifically involves the following steps:

(51) The number of channels of the feature matrix X is increased through a 1X1 convolution module;

(52) Extracting time and space characteristics and potential characteristics among user power curves through a residual MixHop graph convolution module; the residual MixHop graph convolution module consists of a plurality of graph convolution GC layers, each GC layer is formed by overlapping two MixHop modules, residual connection is adopted between the GC layers, and the formula is as follows:

Given the adjacency matrix A:

$$H^{(k)} = \beta H_{in} + (1 - \beta)\,\tilde{A}\,H^{(k-1)}$$

where $\tilde{A} = \tilde{D}^{-1}(A + I)$;

where $H^{(k)}$ is the node representation of the k-th layer obtained by the graph convolution operation; $\beta$ is a hyperparameter controlling the proportion of the root node's original state that is retained; $H_{in}$ is the input hidden state of the first layer, obtained by multiplying the adjacency matrix by the feature matrix; $H^{(k-1)}$ is the hidden state of the previous layer, i.e. the node features of the previous layer;

$\tilde{A}$ is the normalized result of adding the identity matrix I to the matrix A; $\tilde{D}$ is the degree matrix, whose diagonal elements are the node degrees; $\tilde{D}^{-1}$ is the inverse of the degree matrix;

$$H_{out} = \sum_{k=0}^{K} H^{(k)} W^{(k)}$$

where k is the propagation depth, $W^{(k)}$ are the model weight parameters, and $H_{out}$ is the output hidden state of the current layer;

(53) Filtering redundant information in the network by adopting a maximum pooling layer; selecting the maximum value in each rectangular subarea to enter the full connection layer; the formula is as follows:

$$y = \max_{(i,j) \in R} x_{i,j}$$

where $y$ is the result of the max pooling operation and $x_{i,j}$ is a value within a rectangular sub-region $R$ of the input feature map;

(54) Converting the extracted model node characteristics into predictive scores to obtain a final classification detection result; the formula is as follows:

；

where $\sigma$ is the SoftMax function, and $W$ and $B$ are the model parameters of the fully connected layer;

(55) Updating each parameter in the model through gradient descent, wherein binary cross entropy is adopted as a loss function, and the formula is as follows:

$$L = -\left[\, Y \log P + (1 - Y) \log(1 - P) \,\right]$$

where Y is the true class label, 0 or 1; P is the prediction probability of the model, i.e. the probability that the sample belongs to the class labeled 1;

(56) Training is ended when the model's loss over the training set no longer drops significantly.
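Steps (53) to (55) can be sketched with plain NumPy as below. This is an illustrative sketch under assumptions: non-overlapping pooling windows, a fully connected layer applied to the flattened pooled features, and the standard binary cross-entropy; the helper names and shapes are hypothetical and no training loop is shown.

```python
import numpy as np

def max_pool2d(x, ph, pw):
    """Step (53): maximum over non-overlapping ph x pw rectangular sub-regions."""
    h, w = (x.shape[0] // ph) * ph, (x.shape[1] // pw) * pw
    blocks = x[:h, :w].reshape(h // ph, ph, w // pw, pw)
    return blocks.max(axis=(1, 3))

def softmax(z):
    e = np.exp(z - z.max())               # shift for numerical stability
    return e / e.sum()

def classify(pooled, W, B):
    """Step (54): fully connected layer followed by SoftMax."""
    return softmax(pooled.reshape(-1) @ W + B)

def binary_cross_entropy(y, p, eps=1e-12):
    """Step (55): mean binary cross-entropy between labels y and probabilities p."""
    p = np.clip(p, eps, 1.0 - eps)
    return float(-(y * np.log(p) + (1.0 - y) * np.log(1.0 - p)).mean())

features = np.arange(16.0).reshape(4, 4)   # stand-in for extracted features
pooled = max_pool2d(features, 2, 2)
probs = classify(pooled, np.zeros((4, 2)), np.zeros(2))
loss = binary_cross_entropy(np.array([1.0]), probs[1:2])
```

With all-zero weights the two class probabilities are equal, so the loss for a positive sample is log 2; gradient descent would then adjust W and B to reduce it.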

The electricity theft detection system based on a dynamically generated residual graph convolutional neural network according to the invention comprises:

a collection module, configured to collect raw user electricity consumption data from the power system;

a preprocessing module, configured to fill missing values and handle outliers in the raw data, and to divide the data into a training set, a validation set and a test set;

a conversion module, configured to transform the preprocessed one-dimensional power load curve into a two-dimensional power load feature matrix as input to the graph convolutional neural network;

an adjacency matrix module, configured to take each period of the data as a node, compute the correlations among nodes with a dynamic topological graph generation method, and find the optimal parameters through training so as to obtain the adjacency matrix A that best represents the relationships in the data;

a classification module, configured to feed the feature matrix X and the adjacency matrix A into the residual graph convolutional neural network to extract latent features, and to obtain the final classification through a pooling layer and a fully connected layer.

The device of the invention comprises a memory, a processor and a computer program stored in the memory and executable on the processor, wherein the computer program, when loaded into the processor, implements the electricity theft detection method based on a dynamically generated residual graph convolutional neural network.

A storage medium according to the invention stores a computer program which, when executed by a processor, implements the electricity theft detection method based on a dynamically generated residual graph convolutional neural network according to any one of the above.

Beneficial effects: compared with the prior art, the invention has the following notable advantages: it uses fewer parameters while providing more accurate node relationships; and it adopts a MixHop graph convolutional network to extract the deep temporal dependencies, periodic patterns and latent features in user electricity consumption data. Adding residual connections increases the depth of the network and effectively alleviates the gradient explosion problem during training, which enhances the robustness of the model.

Drawings

FIG. 1 is a flow chart of the method of the present invention;

FIG. 2 is a diagram of the dynamically generated residual graph convolutional neural network constructed in accordance with the present invention;

FIG. 3 is a diagram of the residual graph convolution module of the present invention;

FIG. 4 is a convergence diagram of the training and validation loss values during training of the present invention;

FIG. 5 is a graph comparing the effects of the present invention with different numbers of convolution layers;

FIG. 6 is a graph comparing the effects of the present invention with different numbers of residual convolution layers;

FIG. 7 is a graph comparing the effects of the present invention at different electricity theft ratios.

Description of the embodiments

The technical scheme of the invention is further described below with reference to the accompanying drawings.

As shown in fig. 1, an embodiment of the present invention provides a method for detecting electricity theft of a dynamically generated residual graph convolution neural network, including the following steps:

(1) Collecting raw user electricity consumption data from a power system; the data set consists of real electricity consumption data recorded by the State Grid Corporation of China (SGCC) from 2014 to 2016, covering 42372 users over 1035 days, of whom 3615 are electricity-stealing users and 38757 are normal users. The data of a single user form a 1×1035 time series.

(2) Performing missing supplement and outlier processing on the original data, and dividing a training set, a verification set and a test set; the method comprises the following steps:

；

where $x_i$ denotes the electricity consumption on day $i$, recorded as NaN if it is empty;

；

where $\operatorname{avg}(x)$ is the mean of a single sample and $\operatorname{std}(x)$ is its standard deviation;

(23) Normalizing the data, mapping all values into the interval [0, 1]; the processed data are presented in Table 1.

(3) Transforming the preprocessed one-dimensional power load curve into a two-dimensional power load feature matrix X as input to the graph convolutional neural network; specifically: the preprocessed one-dimensional power load curves cover N sample users, each with daily electricity consumption data; each curve is restructured into a two-dimensional feature matrix X which, together with the adjacency matrix A, serves as the input of the graph convolution module.

(4) Taking each period of the data as a node, computing the correlations among nodes with a dynamic topological graph generation method, and finding the optimal parameters through training so as to obtain the adjacency matrix A that best represents the relationships in the data; the graph adjacency matrix is learned adaptively with the dynamic topological graph generation method to capture implicit connections in the data, comprising the following steps:

；

(5) Feeding the feature matrix X and the adjacency matrix A into the residual graph convolutional neural network to extract latent features, and obtaining the final classification through a pooling layer and a fully connected layer; the residual graph convolutional neural network comprises a 1×1 convolution module and a residual MixHop graph convolution module, and specifically involves the following steps:

given the adjacency matrix a, then:

；

wherein,；

；

Fig. 3 illustrates the steps of the MixHop convolution operation. Information is first propagated horizontally and then selected vertically. The information propagation step recursively propagates node information along the given graph structure, and during propagation a certain proportion of each node's original state is retained. Compared with the description of the module in Fig. 2, Fig. 3 shows the process of the GCN module in more detail.
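The two stages described above, propagation followed by selection, can be sketched as follows. The exact formulas are not reproduced in this text; the sketch assumes a propagation of the form H(k) = β·H_in + (1 − β)·Ã·H(k−1) with Ã = D̃⁻¹(A + I), and a selection step that sums H(k)·W(k) over the hops. The function name, the default value of `beta` and the example shapes are hypothetical.

```python
import numpy as np

def mixhop_propagate(A, H_in, weights, beta=0.05):
    """Mix-hop propagation sketch.

    A       : (n, n) adjacency matrix
    H_in    : (n, c) input hidden state
    weights : list of K + 1 matrices of shape (c, c_out), one per hop
    beta    : assumed share of the original node state kept at each hop
    """
    A_tilde = A + np.eye(A.shape[0])                       # add self-loops
    A_norm = A_tilde / A_tilde.sum(axis=1, keepdims=True)  # D^-1 (A + I)
    H = H_in
    H_out = H_in @ weights[0]                              # hop 0 term
    for W_k in weights[1:]:
        # Propagation: mix neighbour information with the original state.
        H = beta * H_in + (1.0 - beta) * (A_norm @ H)
        # Selection: each hop contributes through its own weight matrix.
        H_out = H_out + H @ W_k
    return H_out

rng = np.random.default_rng(0)
A = (rng.random((5, 5)) > 0.5).astype(float)
H_in = rng.normal(size=(5, 3))
H_out = mixhop_propagate(A, H_in, [np.eye(3)] * 3)
# A residual connection between stacked layers would then add the input back:
H_next = H_in + H_out
```

Retaining a share `beta` of the input at every hop keeps deeper propagation from washing out each node's own signal, which matches the description that part of the original node state is reserved.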

；

where $\sigma$ is the SoftMax function, and $W$ and $B$ are the model parameters of the fully connected layer;

；

The invention treats electricity theft detection as a binary classification task that determines whether a user has committed electricity theft. With imbalanced data, and especially with extremely skewed data, accuracy alone cannot objectively evaluate a model, so the model is evaluated with multiple metrics to show its advantages over other existing models. The invention adopts the area under the curve (AUC) and the mean average precision (MAP), both commonly used in the field of electricity theft detection, as model evaluation metrics; they are explained in detail in Table 2.

TABLE 2 introduction of evaluation index

The calculation method of each index is as follows:

where $\mathrm{Rank}_i$ denotes the rank value of sample $i$, $M$ denotes the normal-user samples, $N$ denotes the electricity-stealing user samples, and the samples are ranked in ascending order of score.

Before MAP is evaluated, the labels of the test set must be ranked by prediction score; in the field of electricity theft detection, the top 100 and top 200 labels are typically selected to evaluate performance.
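A rank-based AUC can be computed as in the sketch below. The patent's own formula is not reproduced in this text, so this uses the standard rank-sum (Mann-Whitney) form; treating electricity-stealing users as the positive class (label 1) and averaging ranks over tied scores are assumptions, and `auc_from_ranks` is a hypothetical name.

```python
import numpy as np

def auc_from_ranks(labels, scores):
    """AUC = (sum of positive ranks - n_pos (n_pos + 1) / 2) / (n_pos * n_neg)."""
    labels = np.asarray(labels)
    scores = np.asarray(scores, dtype=float)
    order = np.argsort(scores, kind="mergesort")      # ascending by score
    ranks = np.empty(len(scores))
    ranks[order] = np.arange(1, len(scores) + 1)
    for s in np.unique(scores):                       # average ranks over ties
        tie = scores == s
        ranks[tie] = ranks[tie].mean()
    n_pos = int((labels == 1).sum())
    n_neg = len(labels) - n_pos
    pos_rank_sum = ranks[labels == 1].sum()
    return float((pos_rank_sum - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg))

auc = auc_from_ranks([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8])
```

In this small example three of the four positive/negative pairs are ordered correctly, so the value is 0.75.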

First, the precision at k (denoted P@k) is defined:

$$\mathrm{P@}k = \frac{Y_k}{k}$$

where $Y_k$ denotes the number of correctly predicted electricity thieves up to position k. MAP@N (with N equal to 100 or 200) is then the mean of P@k over all positions at which electricity theft occurs:

$$\mathrm{MAP@}N = \frac{\sum_{i=1}^{r} \mathrm{P@}k_i}{r}$$

where r is the number of electricity thieves among the top N labels and $k_i$ is the position at which the i-th electricity theft occurs.
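A sketch of the MAP@N computation follows, assuming P@k is the share of true electricity thieves among the top k ranked samples and MAP@N averages P@k over the positions of the thieves found in the top N. Label 1 marks a thief; `map_at_n` is a hypothetical name.

```python
import numpy as np

def map_at_n(labels, scores, n):
    """Mean of P@k over the positions k of true thieves within the top n."""
    labels = np.asarray(labels)
    scores = np.asarray(scores, dtype=float)
    order = np.argsort(-scores, kind="mergesort")[:n]   # highest scores first
    top = labels[order]
    hits = np.cumsum(top)                    # thieves found up to each position
    positions = np.flatnonzero(top == 1) + 1  # 1-based positions k_i
    if len(positions) == 0:
        return 0.0
    return float((hits[positions - 1] / positions).mean())

value = map_at_n([1, 0, 1, 0], [0.9, 0.8, 0.7, 0.1], n=4)
```

Here the thieves sit at positions 1 and 3, giving P@1 = 1 and P@3 = 2/3, so the result is their mean, 5/6.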

A comparison of the test-set metrics of the embodiment of the invention and of other models is shown in Table 3:

TABLE 3 comparison of effects with mainstream models

As can be seen from Table 3, on the test set the AUC of the present model is 0.932, MAP@100 is 0.959, and MAP@200 is 0.967, all higher than those of the existing models.

Fig. 4 shows the convergence of the method over 100 training rounds with a training ratio of 60%. The horizontal axis represents the number of training rounds and the vertical axis represents the loss value. As training proceeds, both the training loss and the validation loss decrease significantly. Between rounds 35 and 50 in particular, the validation loss fluctuates slightly, which may be due to data noise; after 60 rounds the loss value stabilizes.

The final number of layers is set to 6 by comparing the effects of different numbers of graph convolution layers during model training.

Figures 5 and 6 show the effect of different number of convolution layers on the model effect without and with the addition of residual connections, respectively.

Fig. 5 shows the experimental results for the graph convolution module without residual connections. Model performance improves as the number of layers increases within a certain range, and declines once a threshold is exceeded. For example, performance rises considerably from 1 to 3 layers and drops significantly beyond 3 layers. This is because deepening the model initially increases the number of parameter iterations, but the complex operations in graph convolution cause over-fitting beyond three layers, so performance cannot improve further.

Fig. 6 shows the experimental results for the residual graph convolution module. Here too, model performance improves as the number of layers increases within a certain range. In contrast, with residual connections the model performance begins to decrease when the number of layers reaches 6. This is because the residual connection lets a lower layer retain the node information of the layer above it during the convolution operation, which allows the model to be deepened.

Meanwhile, the best performance of the model with residual connections is higher than that of the original model. For example, with 6 layers the best AUC of the model with residual connections is 0.865, which is 0.12 higher than that of the model without residual connections. This demonstrates the effectiveness of the residual connections in the model of the invention and shows that they preserve the initial information of the nodes during the graph convolution operation.

Fig. 7 compares the detection performance of the model at different electricity theft ratios. In the data set used in the experiment, electricity-stealing users account for 8%; to show the effect of the model at different electricity theft ratios, the proportion of electricity-stealing users is increased by reducing the number of normal users. It can be seen that the detection accuracy of the invention rises as the electricity theft ratio increases. It should be noted that the invention requires a certain amount of labeled electricity-stealing user data; too few electricity-stealing users cannot provide sufficient features. For example, at electricity theft ratios of 2% and 4%, the model cannot adequately extract the temporal correlation and periodicity in the electricity theft data, resulting in poor accuracy.
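Raising the electricity theft ratio by discarding normal users, as described for Fig. 7, could be done along these lines. This is a hedged sketch: the sampling procedure is not specified in the text, so random sampling without replacement is an assumption, and `resample_to_theft_ratio` is a hypothetical helper. It only raises the ratio; reaching ratios below the original one would require dropping thieves instead.

```python
import numpy as np

def resample_to_theft_ratio(labels, ratio, seed=0):
    """Return indices keeping all thieves (label 1) and enough normal
    users (label 0) so that thieves make up roughly `ratio` of the subset."""
    labels = np.asarray(labels)
    thieves = np.flatnonzero(labels == 1)
    normal = np.flatnonzero(labels == 0)
    n_normal = round(len(thieves) * (1.0 - ratio) / ratio)
    n_normal = min(n_normal, len(normal))   # cannot keep more than exist
    rng = np.random.default_rng(seed)
    kept = rng.choice(normal, size=n_normal, replace=False)
    return np.sort(np.concatenate([thieves, kept]))

labels = np.array([1] * 10 + [0] * 90)      # toy set with 10% thieves
idx = resample_to_theft_ratio(labels, ratio=0.2)
```

The returned indices can be used to subset both the feature matrices and the labels before training.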

The embodiment of the invention also provides a system for detecting electricity theft of the residual error graph convolution neural network, which comprises the following steps:

The embodiment of the invention also provides equipment, which comprises a memory, a processor and a computer program stored on the memory and capable of running on the processor, wherein the computer program realizes the electricity larceny detection method of the dynamically generated residual error graph convolution neural network when being loaded to the processor.

The embodiment of the invention also provides a storage medium, which stores a computer program, and the computer program realizes the electricity theft detection method of the dynamically generated residual graph convolution neural network when being executed by a processor.

Claims

1. An electricity theft detection method based on a residual graph convolutional neural network, characterized by comprising the following steps:

(1) Collecting user power consumption original data in a power system;

2. The method for detecting theft of a dynamically generated residual convolution neural network according to claim 1, wherein the step (1) is specifically as follows: the original data is a daily electricity record generated by 42372 users in 1035 days; the data format of a single user is a time series of 1x 1035.

3. The method for detecting theft of a dynamically generated residual convolution neural network according to claim 1, wherein said step (2) comprises the steps of:

；

where $x_i$ denotes the electricity consumption on day $i$, recorded as NaN if it is empty;

；

where $\operatorname{avg}(x)$ is the mean of a single sample and $\operatorname{std}(x)$ is its standard deviation;

4. The method for detecting theft of a dynamically generated residual convolution neural network according to claim 1, wherein step (3) is specifically as follows: the preprocessed one-dimensional power load curves cover N sample users, each with daily electricity consumption data; each curve is restructured into a two-dimensional feature matrix X which, together with the adjacency matrix A, serves as the input of the graph convolution module.

5. The method for detecting electricity theft of a dynamically generated residual graph convolutional neural network of claim 1, wherein said step (4) adaptively learns graph adjacency matrices to capture implicit connection relations between data according to a dynamic topology graph generation method, comprising the steps of:

；

6. The method for detecting theft of a dynamically generated residual convolutional neural network of claim 1, wherein said step (5) residual convolutional neural network comprises: the 1X1 convolution module and the residual MixHop graph convolution module specifically comprise the following steps:

given the adjacency matrix a, then:

；

wherein,；

where $H^{(k)}$ is the node representation of the k-th layer obtained by the graph convolution operation; $\beta$ is a hyperparameter controlling the proportion of the root node's original state that is retained; $H_{in}$ is the input hidden state of the first layer, obtained by multiplying the adjacency matrix by the feature matrix; $H^{(k-1)}$ is the hidden state of the previous layer, i.e. the node features of the previous layer;

；

7. A dynamically generated residual map convolutional neural network power theft detection system, comprising:

8. An apparatus comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the computer program when loaded into the processor implements a method of detecting theft of a dynamically generated residual graph convolutional neural network according to any one of claims 1-6.

9. A storage medium storing a computer program, wherein the computer program when executed by a processor implements a dynamically generated residual convolution neural network theft detection method according to any one of claims 1-6.