CN116106701A

CN116106701A - KNN algorithm-based photovoltaic direct current fault arc detection method and system

Info

Publication number: CN116106701A
Application number: CN202310068119.1A
Authority: CN
Inventors: 常兴智; 张军; 王再望; 李鸿; 郑果果; 孙平; 高学平; 张梦莹; 刘荣
Original assignee: Ningxia LGG Instrument Co Ltd
Current assignee: Ningxia LGG Instrument Co Ltd
Priority date: 2023-02-06
Filing date: 2023-02-06
Publication date: 2023-05-12

Abstract

The application provides a photovoltaic direct current fault arc detection method and system based on a KNN algorithm, wherein the method comprises the following steps: acquiring normal and fault current data; calculating characteristic values to generate a training set and a testing set; determining a k value and a characteristic weight; calculating the Euclidean distance; selecting k nearest data, and judging the data category of the input test set by a voting method; calculating an error rate; if the error rate is lower than a preset threshold value, an improved KNN classifier is obtained; and inputting the data to be detected into an improved KNN classifier to obtain a judging result. In the photovoltaic power generation system, normal and fault data are collected through a current sampling module, characteristic values are calculated, and the KNN classifier is trained and improved. The improved KNN classifier can timely judge whether the current signal in the loop has a fault arc or not, timely react, and loss is avoided. Comprehensively considers the factors such as the time-frequency domain of the signal, the energy normalized power spectrum and the like, comprehensively makes judgment, and improves the detection accuracy.

Description

KNN algorithm-based photovoltaic direct current fault arc detection method and system

Technical Field

The application relates to the technical field of photovoltaic power generation, in particular to a photovoltaic direct current fault arc detection method and system based on a KNN algorithm.

Background

The energy field has the contradiction of increasing energy demand, increasing exhaustion of traditional energy and taking coal as energy main power and zero carbon call, so the solar energy which is easy to obtain, clean and has huge potential is a key ring of energy transformation. As such, the photovoltaic power generation field develops rapidly, the installed amount rises year by year, and a plurality of hidden dangers of equipment aging, poor contact and possible arc faults caused by line and device damage are brought. Because the output of the photovoltaic power generation system is direct current, the generated direct current fault arc has the characteristics of large energy, continuous combustion and the like, and is extremely easy to cause large-scale electric fire and cause casualties of property loss fire. The arc prevention is urgent, and timely detection and measures are taken to avoid the loss caused by fire.

The general arc detection method mainly includes a detection method based on physical characteristics and a detection method based on time-frequency domain characteristics. The detection method based on the physical characteristics is to judge whether a fault arc occurs or not by detecting the properties such as the electromagnetic signal of the heat arc according to the property that the arc occurs at the moment; the detection method based on the time-frequency domain characteristics is to select proper criteria, collect data of normal and arc faults, obtain characteristic differences through a signal processing method, and set proper thresholds so as to judge the arc faults.

However, the detection method based on the physical characteristics needs to install the detection device at the position where the arc occurs, and is difficult to realize in actual engineering; the detection method based on the time-frequency domain characteristics has the problem of low accuracy and the like.

Disclosure of Invention

The application provides a photovoltaic direct current fault arc detection method and system based on a KNN algorithm, which are used for solving the problem that a detection device needs to be installed at an arc occurrence position in a detection method based on physical characteristics, and the detection device is difficult to realize in actual engineering; the detection method based on the time-frequency domain characteristics has the problem of low accuracy.

In a first aspect, the present application provides a method for detecting a photovoltaic direct current fault arc based on a KNN algorithm, including: constructing an improved KNN classifier; inputting data to be detected into the improved KNN classifier to obtain a judging result; the improved KNN classifier comprises: an acquisition module configured to acquire normal current data and arc fault current data; the calculation module is configured to calculate characteristic values according to the normal current data and the arc fault current data, and generate a training set and a testing set; the characteristic values include: current limit c ₁ Standard deviation c ₂ Component c of DC ₃ And one to twenty harmonics c ₄ -c ₂₃ Three-layer wavelet packet transformation of eight frequency band energies c ₂₄ -c ₃₁ Normalized power spectrum c ₃₂ The method comprises the steps of carrying out a first treatment on the surface of the The training module is configured to determine a k value and initialize a feature weight value to be 1; sequentially calculating Euclidean distance between each data in the test set and the data in the training set; selecting k nearest data, and judging the data category of the input test set by a voting method; the data category is that normal current data is classified into a first category, and arc fault current data is classified into a second category; calculating the overall error rate of the test set; if the error rate is lower than a preset threshold value, training is finished, and obtainingImproving the KNN classifier.

Optionally, the step of calculating the feature value includes: current limit c ₁ The calculation formula of (2) is as follows:

c ₁ ＝i _max -i _min

wherein ,i_max I is the maximum current value in the sample _min Is the minimum current value in the sample.

Optionally, the step of calculating the feature value includes: standard deviation c ₂ The calculation formula of (2) is as follows:

wherein ,i_n The nth current value in the sample is A, which is the average value of the sample current, and N is the sample number value.

Optionally, the step of calculating the feature value includes: DC component c ₃ And one to twenty harmonics c ₄ -c ₂₃ Extracting frequency domain information of signals through FFT, taking 5KHz as fundamental frequency, and calculating the formula as follows:

c _j ＝|FFT(i ₁ ,i ₂ …i _N )| _50*(j-3) ,3≤j≤23

wherein j is a positive integer, i ₁ …i _N N is the sample number value, which is the current value in the sample.

Optionally, the step of calculating the feature value includes: three-layer wavelet packet decomposition is used to obtain eight frequency band energy c of three-layer wavelet packet transformation ₂₄ -c ₃₁ The calculation formula is as follows:

c ₂₄ ＝∫|AAA3| ² dt

c ₂₅ ＝∫|AAD3| ² dt

c ₂₆ ＝∫|ADA3| ² dt

c ₂₇ ＝∫|ADD3| ² dt

c ₂₈ ＝∫|DAA3| ² dt

c ₂₉ ＝∫|DAD3| ² dt

c ₃₀ ＝∫|DDA3| ² dt

c ₃₁ ＝∫|DDD3| ² dt

wherein AAA3 is a characteristic signal of a frequency band of 0-12.5kHz, AAD3 is a characteristic signal of a frequency band of 12.5-25kHz, ADA3 is a characteristic signal of a frequency band of 25-37.5kHz, ADD3 is a characteristic signal of a frequency band of 37.5-50kHz, DAA3 is a characteristic signal of a frequency band of 50-62.5kHz, DAD3 is a characteristic signal of a frequency band of 62.5-75kHz, DDA3 is a characteristic signal of a frequency band of 75-87.5kHz, and DDD3 is a characteristic signal of a frequency band of 87.5-100 kHz.

Optionally, the step of decomposing the three-layer wavelet packet includes:

the original signal S is a characteristic signal of a frequency band of 0-100kHz, and after passing through a low-pass filter coefficient g (k), the characteristic signal of the frequency band of 0-50kHz of the first layer A1 is obtained; after the original signal S passes through a high-pass filter coefficient h (k), a characteristic signal of which the first layer D1 is a 50-100kHz frequency band is obtained; wherein g (k) and h (k) satisfy an orthogonal relationship:

g(k)＝(-1) ^k h(1-k)

after the first layer A1 passes through the low-pass filter coefficient g (k), obtaining a characteristic signal of which the second layer AA2 is in a frequency band of 0-25 kHz; after the first layer A1 passes through a high-pass filter coefficient h (k), obtaining a characteristic signal of which the second layer AD2 is a frequency band of 25-50 kHz;

after the first layer D1 passes through the low-pass filter coefficient g (k), obtaining a characteristic signal of which the second layer DA2 is in a frequency band of 50-75 kHz; after the first layer D1 passes through a high-pass filter coefficient h (k), obtaining a characteristic signal of which the second layer DD2 is a 75-100kHz frequency band;

after the second layer AA2 passes through the low-pass filter coefficient g (k), a characteristic signal of the third layer AAA3 with the frequency band of 0-12.5kHz is obtained; after the second layer AA2 passes through a high-pass filter coefficient h (k), obtaining a characteristic signal of a third layer AAD3 with a frequency band of 12.5-25 kHz;

after the second layer AD2 passes through the low-pass filter coefficient g (k), a characteristic signal of the third layer ADA3 with the frequency band of 25-37.5kHz is obtained; after the second layer AD2 passes through a high-pass filter coefficient h (k), obtaining a characteristic signal of which the third layer ADD3 is 37.5-50 kHz;

after the second layer DA2 passes through the low-pass filter coefficient g (k), obtaining a characteristic signal of which the third layer DAA3 is in a frequency band of 50-62.5 kHz; after the second layer DA2 passes through the high-pass filter coefficient h (k), obtaining a characteristic signal of the third layer DAD3 with the frequency band of 62.5-75 kHz;

after the second layer DD2 passes through the low-pass filter coefficient g (k), a characteristic signal of which the third layer DDA3 is 75-87.5kHz frequency band is obtained; after the second layer DD2 passes through the high-pass filter coefficient h (k), the characteristic signal of the frequency band of 87.5-100kHz of the third layer DDD3 is obtained.

Optionally, the step of calculating the feature value includes: normalized power spectrum c ₃₂ The calculation formula of (2) is as follows:

wherein T is time, and i is normalized current value.

Optionally, the step of sequentially calculating the euclidean distance between each data in the test set and the data in the training set includes:

the samples are characterized by c respectively ₁ ，c ₂ ，…，c ₃₂ The Euclidean distance between the ith sample in the test set and the jth sample in the training set is:

wherein ,

for the p-th eigenvalue of the i-th sample in the test set,/th sample in the test set>

The p-th eigenvalue of the j-th sample in the training set, M is the number of samples in the training set.

Optionally, the step of calculating the error rate of the whole test set includes: giving an initial value of 0 to the error times; judging the types of the samples in the test set by using a voting method, and if errors are judged, increasing the error times by 1; and traversing the whole test set, comparing the error times with the number of samples, and calculating the whole error rate of the test set.

In a second aspect, the present application further provides a photovoltaic direct current fault arc detection system based on a KNN algorithm, including: the photovoltaic module is configured to generate direct current through photovoltaic power generation; a fault arc generator configured to simulate the generation of a fault arc phenomenon; the current sampling module is configured to collect normal current data and arc fault current data; an inverter, comprising: the photovoltaic direct current fault arc detection device comprises a control component and a conversion component, wherein the conversion component is configured to convert direct current generated by a photovoltaic component into alternating current, and the control component is configured to execute the photovoltaic direct current fault arc detection method based on the KNN algorithm; and a power grid configured as a system load.

Compared with the prior art, the application has the following beneficial effects:

the application provides a photovoltaic direct current fault arc detection method and system based on a KNN algorithm, wherein the method comprises the following steps: acquiring normal current data and arc fault current data; calculating characteristic values to generate a training set and a testing set; determining a k value and initializing a feature weight; sequentially calculating Euclidean distance between each data in the test set and the data in the training set; selecting k nearest data, and judging the data category of the input test set by a voting method; calculating the overall error rate of the test set; if the error rate is lower than a preset threshold, training is finished, and an improved KNN classifier is obtained; and inputting the data to be detected into the improved KNN classifier to obtain a judging result. In the photovoltaic power generation system, normal and fault data are collected through a current sampling module, characteristic values are calculated, and the KNN classifier is trained and improved. The improved KNN classifier can timely judge whether the current signal in the loop has a fault arc or not, can timely react, and avoids loss. Comprehensively considers the time-frequency domain characteristics of signals, the wavelet packet frequency band energy, the normalized power spectrum and other factors, comprehensively makes judgment, and improves the detection accuracy.

Drawings

In order to more clearly illustrate the technical solutions of the present application, the drawings that are needed in the embodiments will be briefly described below, and it will be obvious to those skilled in the art that other drawings can be obtained from these drawings without inventive effort.

Fig. 1 is a schematic flow chart of a method for detecting a photovoltaic direct current fault arc based on a KNN algorithm;

FIG. 2 is a schematic diagram of a three-layer wavelet packet decomposition according to the present application;

fig. 3 is a schematic structural diagram of a photovoltaic dc fault arc detection system based on KNN algorithm according to the present application;

fig. 4 is a block diagram of an improved KNN classifier as described herein.

Detailed Description

Reference will now be made in detail to the embodiments, examples of which are illustrated in the accompanying drawings. When the following description refers to the accompanying drawings, the same numbers in different drawings refer to the same or similar elements, unless otherwise indicated. The embodiments described in the examples below do not represent all embodiments consistent with the present application. Merely as examples of systems and methods consistent with some aspects of the present application as detailed in the claims.

KNN (K-Nearest Neighbor) method, namely K Nearest Neighbor method, the idea of the method is as follows: if a sample belongs to a class for the majority of the K most similar (i.e., nearest neighbor) samples in the feature space, then the sample also belongs to that class. The method only determines the category to which the sample to be classified belongs according to the category of one or more samples which are nearest to each other in the classification decision.

The application provides a photovoltaic direct current fault arc detection method based on a KNN algorithm, as shown in fig. 1, comprising the following steps:

first, it is necessary to construct an improved KNN classifier, as shown in fig. 4, which includes: the system comprises an acquisition module, a calculation module and a training module. The method for constructing the improved KNN classifier comprises the following steps:

s100: normal current data and arc fault current data are obtained.

Simulating the occurrence of a fault arc phenomenon by using a fault arc generator; the current sampling module is used for collecting normal current data and arc fault current data. When the fault arc generator is not used, the current sampling module collects normal current data; when the fault arc generator is in use, the current sampling module collects arc fault current data. The sampling rate of the current sampling module is 200kHz, and according to the Nyquist law, current data with the frequency range of 0-100kHz can be acquired. The current sampling module takes 2000 data acquired every 10ms as one sample for standby.

S200: calculating characteristic values to generate a training set and a testing set; the characteristic values include: current limit c ₁ Standard deviation c ₂ Component c of DC ₃ And one to twenty harmonics c ₄ -c ₂₃ Three-layer wavelet packet transformation of eight frequency band energies c ₂₄ -c ₃₁ Normalized power spectrum c ₃₂ 。

Processing the normal current data and arc fault current data samples collected by the current sampling module respectively, calculating the current range, standard deviation, direct current component, one to twenty harmonic wave, three-layer wavelet packet conversion eight frequency band energy and normalized power spectrum of each sample as characteristic values, and using c respectively ₁ ,c ₂ ,…,c ₃₂ And (3) representing.

In an exemplary embodiment, the step of calculating the characteristic value includes: current limit c ₁ The calculation formula of (2) is as follows:

c ₁ ＝i _max -i _min

In an exemplary embodiment, the step of calculating the characteristic value includes: standard deviation c ₂ The calculation formula of (2) is as follows:

In an exemplary embodiment, the step of calculating the characteristic value includes: DC component c ₃ And one to twenty harmonics c ₄ -c ₂₃ Extracting frequency domain information of signals through FFT, taking 5KHz as fundamental frequency, and calculating the formula as follows:

c _j ＝|FFT(i ₁ ,i ₂ …i _N )| _50*(j-3) ,3≤j≤23

In an exemplary embodiment, the step of calculating the characteristic value includes: three-layer wavelet packet decomposition is used to obtain eight frequency band energy c of three-layer wavelet packet transformation ₂₄ -c ₃₁ The calculation formula is as follows:

c ₂₄ ＝∫|AAA3| ² dt

c ₂₅ ＝∫|AAD3| ² dt

c ₂₆ ＝∫|ADA3| ² dt

c ₂₇ ＝∫|ADD3| ² dt

c ₂₈ ＝∫|DAA3| ² dt

c ₂₉ ＝∫|DAD3| ² dt

c ₃₀ ＝∫|DDA3| ² dt

c ₃₁ ＝∫|DDD3| ² dt

As shown in fig. 2, the step of decomposing the three-layer wavelet packet includes:

g(k)＝(-1) ^k h(1-k)

after the first layer A1 passes through the low-pass filter coefficient g (k), obtaining a characteristic signal of which the second layer AA2 is in a frequency band of 0-25 kHz; after the first layer A1 passes through the high-pass filter coefficient h (k), a characteristic signal of the second layer AD2 with the frequency band of 25-50kHz is obtained.

After the first layer D1 passes through the low-pass filter coefficient g (k), obtaining a characteristic signal of which the second layer DA2 is in a frequency band of 50-75 kHz; after the first layer D1 passes through the high-pass filter coefficient h (k), a characteristic signal of which the second layer DD2 is a 75-100kHz frequency band is obtained.

After the second layer AA2 passes through the low-pass filter coefficient g (k), a characteristic signal of the third layer AAA3 with the frequency band of 0-12.5kHz is obtained; after the second layer AA2 passes through the high-pass filter coefficient h (k), the characteristic signal of the third layer AAD3 with the frequency band of 12.5-25kHz is obtained.

After the second layer AD2 passes through the low-pass filter coefficient g (k), a characteristic signal of the third layer ADA3 with the frequency band of 25-37.5kHz is obtained; after the second layer AD2 passes through the high-pass filter coefficient h (k), the characteristic signal of the third layer ADD3 with 37.5-50kHz frequency band is obtained.

After the second layer DA2 passes through the low-pass filter coefficient g (k), obtaining a characteristic signal of which the third layer DAA3 is in a frequency band of 50-62.5 kHz; after the second layer DA2 passes through the high-pass filter coefficient h (k), the characteristic signal of the frequency band of 62.5-75kHz of the third layer DAD3 is obtained.

In an exemplary embodiment, the step of calculating the characteristic value includes: normalized power spectrum c ₃₂ The calculation formula of (2) is as follows:

wherein T is time, and i is normalized current value.

And obtaining all the characteristic values, and generating a training set and a testing set after adding the category.

S300: and determining a k value and initializing the characteristic weight value as 1.

The selection of the optimal k-value is a necessary condition for building a reasonable and accurate KNN model. If the k value is too low, the model becomes too specific and does not generalize well. It is also sensitive to noise. The model achieves high accuracy over the training set, but the model has poor predictive power for new, previously unseen data points. If k is chosen too large, the model becomes too generalized to accurately predict the data points in the training and testing set. This condition is known as under-fitting. Through a large number of theoretical analysis and calculation, the k value is determined to be 5-10, and a better effect can be obtained.

S400: and sequentially calculating the Euclidean distance between each datum in the test set and the datum in the training set.

In an exemplary embodiment, the step of sequentially calculating the euclidean distance of each data in the test set from the data in the training set includes:

wherein ,

S500: selecting k nearest data, and judging the data category of the input test set by a voting method; the data category is that normal current data is classified into a first category, and arc fault current data is classified into a second category.

Through the operation, the ith sample in the test set can obtain M d values, the M d values are arranged from small to large, k data with the first sequence, namely k data with the smallest distance, are selected, the categories of the k data are respectively read, and the category of the ith sample in the test set is judged by using a voting method.

S600: calculating the overall error rate of the test set;

in an exemplary embodiment, the step of calculating the error rate of the test set as a whole includes:

giving an initial value of 0 to the number of errors, namely T _f ＝0。

Judging the types of the samples in the test set by using a voting method, and if the judgment is wrong, increasing the number of errors by 1, namely T _f ＝T _f +1。

Traversing the whole test set, comparing the number of errors with the number of samples, and calculating the overall error rate of the test set:

A _f ＝T _f /N

wherein ,A_f For error rate, T _f N is the number of samples of the test set for the number of errors.

S700: and if the error rate is lower than a preset threshold value, after training is finished, obtaining the improved KNN classifier.

If the error rate is lower than a preset threshold, the training is required, and the improved KNN classifier can be used for judging the data to be detected.

If the error rate is not lower than the preset threshold, the training is not required, the feature weight needs to be updated, and the step S400 is skipped for retraining.

After the improved KNN classifier is constructed, the improved KNN classifier can be used for analyzing and judging the data to be detected, so that fault arc judgment is realized.

S800: and inputting the data to be detected into the improved KNN classifier to obtain a judging result.

And inputting the data to be detected, namely current data to be judged in a loop acquired in real time, after the characteristic calculation is finished, into an improved KNN classifier to obtain a judging result so as to realize fault arc judgment.

Based on the method for detecting a photovoltaic direct current fault arc based on the KNN algorithm described in the foregoing embodiments, some embodiments of the present application further provide a system for detecting a photovoltaic direct current fault arc based on the KNN algorithm, as shown in fig. 3, including:

and the photovoltaic assembly is configured to generate direct current through photovoltaic power generation.

A fault arc generator configured to simulate the generation of a fault arc phenomenon. The fault arc generator consists of a fixed electrode and a movable electrode, and the circuit conditions such as insulation aging or connection looseness in the analog circuit are realized by the movement of the movable electrode, so that fault arc is generated, and the fault arc is conveniently researched.

And the current sampling module is configured to collect normal current data and arc fault current data. The sampling rate of the current sampling module is 200kHz, and according to the Nyquist law, current data with the frequency range of 0-100kHz can be acquired. The current sampling module takes 2000 data as one sample every 10ms and transmits the sample to the control component of the inverter.

An inverter, comprising: the photovoltaic direct current fault arc detection device comprises a control component and a conversion component, wherein the conversion component is configured to convert direct current generated by the photovoltaic component into alternating current, and the control component is configured to execute the photovoltaic direct current fault arc detection method based on the KNN algorithm.

And a power grid configured as a system load.

The foregoing detailed description of the embodiments is merely illustrative of the general principles of the present application and should not be taken in any way as limiting the scope of the invention. Any other embodiments developed in accordance with the present application without inventive effort are within the scope of the present application for those skilled in the art.

Claims

1. The method for detecting the photovoltaic direct current fault arc based on the KNN algorithm is characterized by comprising the following steps of:

constructing an improved KNN classifier;

inputting data to be detected into the improved KNN classifier to obtain a judging result;

the improved KNN classifier comprises:

an acquisition module configured to acquire normal current data and arc fault current data;

the calculation module is configured to calculate characteristic values according to the normal current data and the arc fault current data, and generate a training set and a testing set; the characteristic value packageThe method comprises the following steps: current limit c ₁ Standard deviation c ₂ Component c of DC ₃ And one to twenty harmonics c ₄ -c ₂₃ Three-layer wavelet packet transformation of eight frequency band energies c ₂₄ -c ₃₁ Normalized power spectrum c ₃₂ ；

The training module is configured to determine a k value and initialize a feature weight value to be 1;

sequentially calculating Euclidean distance between each data in the test set and the data in the training set;

selecting k nearest data, and judging the data category of the input test set by a voting method; the data category is that normal current data is classified into a first category, and arc fault current data is classified into a second category;

calculating the overall error rate of the test set;

and if the error rate is lower than a preset threshold value, after training is finished, obtaining the improved KNN classifier.

2. The method for detecting a photovoltaic direct current fault arc based on a KNN algorithm according to claim 1, wherein the step of calculating the eigenvalue comprises: current limit c ₁ The calculation formula of (2) is as follows:

c ₁ ＝i _max -i _min

3. The method for detecting a photovoltaic direct current fault arc based on a KNN algorithm according to claim 1, wherein the step of calculating the eigenvalue comprises: standard deviation c ₂ The calculation formula of (2) is as follows:

4. The method for detecting a photovoltaic direct current fault arc based on a KNN algorithm according to claim 1, wherein the step of calculating the eigenvalue comprises: DC component c ₃ And one to twenty harmonics c ₄ -c ₂₃ Extracting frequency domain information of signals through FFT, taking 5KHz as fundamental frequency, and calculating the formula as follows:

c _j ＝|FFT(i ₁ ,i ₂ …i _N )| _50*(j-3) ,3≤j≤23

5. The method for detecting a photovoltaic direct current fault arc based on a KNN algorithm according to claim 1, wherein the step of calculating the eigenvalue comprises: three-layer wavelet packet decomposition is used to obtain eight frequency band energy c of three-layer wavelet packet transformation ₂₄ -c ₃₁ The calculation formula is as follows:

c ₂₄ ＝∫|AAA3| ² dt

c ₂₅ ＝∫|AAD3| ² dt

c ₂₆ ＝∫|ADA3| ² dt

c ₂₇ ＝∫|ADD3| ² dt

c ₂₈ ＝∫|DAA3| ² dt

c ₂₉ ＝∫|DAD3| ² dt

c ₃₀ ＝∫|DDA3| ² dt

c ₃₁ ＝∫|DDD3| ² dt

6. The method for detecting a photovoltaic direct current fault arc based on a KNN algorithm according to claim 5, wherein the step of decomposing the three-layer wavelet packet comprises:

g(k)＝(-1) ^k h(1-k)

after the first layer D1 passes through the low-pass filter coefficient g (k), obtaining a characteristic signal of which the second layer DA2 is in a frequency band of 50-75 kHz;

after the first layer D1 passes through a high-pass filter coefficient h (k), obtaining a characteristic signal of which the second layer DD2 is a 75-100kHz frequency band;

7. The method for detecting a photovoltaic direct current fault arc based on a KNN algorithm according to claim 1, wherein the step of calculating the eigenvalue comprises: normalized power spectrum c ₃₂ The calculation formula of (2) is as follows:

wherein T is time, and i is normalized current value.

8. The method for detecting a photovoltaic direct current fault arc based on a KNN algorithm according to claim 1, wherein the step of sequentially calculating the euclidean distance between each data in the test set and the data in the training set comprises:

wherein ,

9. The method for detecting a photovoltaic direct current fault arc based on a KNN algorithm according to claim 1, wherein the step of calculating the error rate of the whole test set comprises:

giving an initial value of 0 to the error times;

judging the types of the samples in the test set by using a voting method, and if errors are judged, increasing the error times by 1;

and traversing the whole test set, comparing the error times with the number of samples, and calculating the whole error rate of the test set.

10. The utility model provides a photovoltaic direct current fault arc detecting system based on KNN algorithm which characterized in that includes:

the photovoltaic module is configured to generate direct current through photovoltaic power generation;

a fault arc generator configured to simulate the generation of a fault arc phenomenon;

the current sampling module is configured to collect normal current data and arc fault current data;

an inverter, comprising: a control assembly and a conversion assembly, the conversion assembly being configured to convert direct current generated by the photovoltaic assembly into alternating current, the control assembly being configured to perform a KNN algorithm-based photovoltaic direct current fault arc detection method according to any one of claims 1-9;

and a power grid configured as a system load.