CN115290596A

CN115290596A - FCN-ACGAN data enhancement-based hidden dangerous goods identification method and equipment

Info

Publication number: CN115290596A
Application number: CN202210928339.2A
Authority: CN
Inventors: 肖红; 朱畅
Original assignee: Guangdong University of Technology
Current assignee: Guangdong University of Technology
Priority date: 2022-08-03
Filing date: 2022-08-03
Publication date: 2022-11-04

Abstract

The invention discloses a hidden dangerous goods identification method and equipment based on FCN-ACGAN data enhancement, relating to the technical field of goods detection and comprising the following steps: the method comprises the steps of preprocessing real terahertz spectrum data of an article acquired in advance to obtain a real sample, generating a simulation sample by using a pre-trained FCN-ACGAN network model, training a pre-constructed initial ResNet-LSTM classification model according to the real sample and the simulation sample to obtain an optimal classification model, classifying the terahertz time-domain spectrum data by using the optimal classification model, and determining the attribute and the type of the detected article according to a classification result.

Description

A method and device for identifying hidden dangerous goods based on FCN-ACGAN data enhancement

技术领域technical field

本发明涉及物品识别技术领域，尤其涉及通过对太赫兹时域光谱进行分析，对物品进行隐匿危险品进行识别，具体涉及一种基于FCN-ACGAN数据增强的隐匿危险品识别方法及设备。The present invention relates to the technical field of item identification, in particular to the identification of hidden dangerous goods by analyzing terahertz time-domain spectra, and in particular to a method and equipment for identifying hidden dangerous goods based on FCN-ACGAN data enhancement.

背景技术Background technique

太赫兹波因其波段具有指纹谱性而被广泛用于物质检测领域，用太赫兹波对各类危险品进行识别可以实现无损检测，随着人工智能的快速发展，利用深度学习对太赫兹时域光谱数据进行训练，利用训练得到的模型进行无损检测也越来越普遍，但是，深度学习产生的模型的准确率极其依赖训练太赫兹时域光谱数据的数据量，训练数据不足会降低模型的普适性，因此需要对太赫兹时域光谱数据进行增强。Terahertz waves are widely used in the field of material detection because of their fingerprint spectrum. Using terahertz waves to identify various dangerous goods can achieve non-destructive testing. With the rapid development of artificial intelligence, deep learning is used to analyze terahertz time It is becoming more and more common to use the trained model for non-destructive testing. However, the accuracy of the model generated by deep learning is extremely dependent on the amount of training terahertz time-domain spectral data. Insufficient training data will reduce the performance of the model. Therefore, it is necessary to enhance the terahertz time-domain spectral data.

现已有专家学者对太赫兹时域光谱数据的增强进行了研究，主要是通过深度学习的时序数据增强方法中效果显著的生成对抗网络(GAN)对原始光谱数据进行扩充，基于WGAN(Wasserstein GAN)的新一代人工智能小样本数据增强方法，首先将原始样本划分为训练集和测试集样本，采用训练集样本训练GAN后生成模拟样本数据，扩增训练集样本规模；然后，使用模拟样本训练得到分类器；最后，使用测试集样本测试分类器的分类效果。Some experts and scholars have conducted research on the enhancement of terahertz time-domain spectral data, mainly through the effective generative confrontation network (GAN) in the time-series data enhancement method of deep learning to expand the original spectral data, based on WGAN (Wasserstein GAN ) new generation of artificial intelligence small sample data enhancement method, first divide the original sample into training set and test set samples, use training set samples to train GAN and generate simulated sample data, and expand the size of training set samples; then, use simulated samples to train Get the classifier; finally, use the test set samples to test the classification effect of the classifier.

但利用生成对抗网络(GAN)进行太赫兹时域光谱数据增强也存在一定的局限性，传统GAN模型难以对时序数据进行建模，存在训练不稳定、梯度消失、模式过度自由且易崩溃的问题，导致最终生成的数据特征信息缺失、代表性差，进一步，利用特征信息缺失的数据训练得到的分类器分类效果难以满足检测需求，导致根据分类结果进行物品识别的识别精度低。However, the use of generative confrontation network (GAN) to enhance terahertz time-domain spectral data also has certain limitations. The traditional GAN model is difficult to model time-series data, and there are problems of unstable training, gradient disappearance, excessive freedom of mode and easy collapse. , leading to the lack of characteristic information and poor representativeness of the final generated data. Furthermore, the classification effect of the classifier obtained by training the data with missing characteristic information is difficult to meet the detection requirements, resulting in low recognition accuracy for item recognition based on the classification results.

发明内容Contents of the invention

本发明提供了一种基于FCN-ACGAN数据增强的隐匿危险品识别方法及设备，用于解决现有数据分类方法中，由于训练数据不足导致训练得到的分类模型分类效果差，进一步导致物品识别精度低的技术问题。The present invention provides a hidden dangerous article identification method and equipment based on FCN-ACGAN data enhancement, which is used to solve the problem of poor classification effect of the classification model obtained by training due to insufficient training data in the existing data classification method, which further leads to poor article identification accuracy Low technical issues.

本发明提供了一种基于FCN-ACGAN数据增强的太赫兹时域光谱隐匿危险品识别方法，方法包括：The present invention provides a terahertz time-domain spectrum hidden dangerous goods identification method based on FCN-ACGAN data enhancement, the method comprising:

对预先获取的真实光谱数据进行预处理，得到真实样本；Preprocess the pre-acquired real spectral data to obtain real samples;

利用预先训练好的FCN-ACGAN网络模型生成模拟样本；其中，所述FCN-ACGAN网络模型包含生成器和判别器，所述生成器和所述判别器分别包含若干全连接层；训练所述FCN-ACGAN网络模型的步骤如下：Utilize the pre-trained FCN-ACGAN network model to generate simulated samples; wherein, the FCN-ACGAN network model includes a generator and a discriminator, and the generator and the discriminator respectively include several fully connected layers; train the FCN - The steps of the ACGAN network model are as follows:

步骤S1、利用所述真实样本对所述判别器进行预训练，得到初级判别器；Step S1, using the real samples to pre-train the discriminator to obtain a primary discriminator;

步骤S2、利用所述生成器生成初级模拟样本；Step S2, using the generator to generate primary simulation samples;

步骤S3、将所述初级模拟样本与所述真实样本混合，得到训练样本；Step S3, mixing the primary simulation samples with the real samples to obtain training samples;

步骤S4、利用所述训练样本对所述初级判别器和所述生成器进行实际训练，基于RMSProp优化器分别对所述初级判别器的网络参数和所述生成器的网络参数进行更新，判断所述FCN-ACGAN网络模型是否达到纳什均衡，若是，则停止训练，得到训练好的FCN-ACGAN网络模型，若否，则返回步骤S2；Step S4, using the training samples to actually train the primary discriminator and the generator, update the network parameters of the primary discriminator and the generator based on the RMSProp optimizer, and judge the Describe whether the FCN-ACGAN network model reaches Nash equilibrium, if so, then stop training, obtain the trained FCN-ACGAN network model, if not, then return to step S2;

根据所述真实样本和所述模拟样本对预先构建的初始ResNet-LSTM分类模型进行训练，得到最优分类模型；Training the pre-built initial ResNet-LSTM classification model according to the real sample and the simulated sample to obtain an optimal classification model;

利用所述最优分类模型对所述太赫兹时域光谱数据进行分类，根据分类结果确定隐匿危险品。The optimal classification model is used to classify the terahertz time-domain spectral data, and the hidden dangerous goods are determined according to the classification result.

优选的，所述根据所述真实样本和所述模拟样本对预先构建的初始ResNet-LSTM分类模型进行训练，得到最优分类模型具体为：Preferably, the pre-built initial ResNet-LSTM classification model is trained according to the real sample and the simulated sample, and the optimal classification model obtained is specifically:

将所述真实样本与所述模拟样本混合得到扩展样本，并将所述扩展样本分为预训练样本和实际训练样本；mixing the real sample with the simulated sample to obtain an extended sample, and dividing the extended sample into a pre-training sample and an actual training sample;

构建初始ResNet-LSTM分类模型，利用所述预训练样本对所述初始ResNet-LSTM分类模型进行预训练，得到所述初始ResNet-LSTM分类模型的超参数，基于所述超参数和所述实际训练样本对所述初始ResNet-LSTM分类模型进行实际训练，当所述初始ResNet-LSTM分类模型的模型误差满足预设误差阈值时，结束实际训练，得到最优分类模型。Construct an initial ResNet-LSTM classification model, use the pre-training samples to pre-train the initial ResNet-LSTM classification model, obtain the hyperparameters of the initial ResNet-LSTM classification model, based on the hyperparameters and the actual training The sample performs actual training on the initial ResNet-LSTM classification model, and when the model error of the initial ResNet-LSTM classification model meets the preset error threshold, the actual training ends and the optimal classification model is obtained.

优选的，所述利用所述生成器生成初级模拟样本具体为：Preferably, the use of the generator to generate primary simulation samples is specifically:

建立所述真实数据与潜在空间的随机噪声之间的映射关系，所述生成器基于所述映射关系生成模拟样本，其中所述模拟样本包含类别标签。A mapping relationship between the real data and random noise in the latent space is established, and the generator generates simulation samples based on the mapping relationship, wherein the simulation samples include category labels.

优选的，所述基于RMSProp优化器分别对所述初级判别器的网络参数和所述生成器的网络参数进行更新具体为：Preferably, the RMSProp-based optimizer updates the network parameters of the primary discriminator and the network parameters of the generator respectively as follows:

保持所述生成器的网络参数不变，获取所述初级判别器的网络损失值，基于RMSProp优化器和所述初级判别器的网络损失值对所述初级判别器进行更新，当所述初级判别器的更新次数满足预设第一更新阈值时，保持所述初级判别器的网络参数不变，获取所述生成器的网络损失值，基于RMSProp优化器和所述生成器的网络损失值对所述生成器进行更新，直至所述生成器的更新次数满足预设第二更新阈值。Keeping the network parameters of the generator unchanged, obtaining the network loss value of the primary discriminator, updating the primary discriminator based on the RMSProp optimizer and the network loss value of the primary discriminator, when the primary discriminator When the number of updates of the discriminator satisfies the preset first update threshold, the network parameters of the primary discriminator are kept unchanged, the network loss value of the generator is obtained, and the network loss value of the generator is based on the RMSProp optimizer and the generator. The generator is updated until the number of updates of the generator meets a preset second update threshold.

优选的，所述生成器包括：输入模块、Dense全连接层、Tanh激活函数层和输出模块。Preferably, the generator includes: an input module, a Dense fully connected layer, a Tanh activation function layer and an output module.

优选的，所述判别器包括：输入模块、Dense全连接层、ReLU激活函数层和softmax分类器。Preferably, the discriminator includes: an input module, a Dense fully connected layer, a ReLU activation function layer and a softmax classifier.

优选的，在步骤S2之前还包括：利用所述真实样本对初始生成器进行预训练，得到训练后的生成器。Preferably, before step S2, the method further includes: using the real samples to pre-train the initial generator to obtain a trained generator.

优选的，所述对预先获取的真实光谱数据进行预处理，得到真实样本具体为：Preferably, the preprocessing of the pre-acquired real spectral data to obtain the real sample is specifically:

对预先获取的真实光谱数据进行数据清洗，得到第一初始样本；Perform data cleaning on the pre-acquired real spectral data to obtain the first initial sample;

对所述第一样本进行缺失值补充，得到第二初始样本；performing missing value supplementation on the first sample to obtain a second initial sample;

对所述第二初始样本进行归一化处理，得到真实样本。Perform normalization processing on the second initial samples to obtain real samples.

优选的，所述分类结果包括第一分类结果和第二分类结果，所述根据分类结果确定隐匿危险品具体为：Preferably, the classification results include a first classification result and a second classification result, and the determination of hidden dangerous goods according to the classification results is specifically:

判断所述分类结果的类型，当所述分类结果为第一分类结果时，则判定被检测物品为非隐匿危险品，当所述分类结果为第二分类结果时，则判定被检测物品为隐匿危险品，并将所述第二分类结果与预先建立的隐匿危险品的光谱特征数据库进行比对，根据对比结果判断所述隐匿危险品的种类。Judging the type of the classification result, when the classification result is the first classification result, it is determined that the detected item is a non-concealed dangerous product, and when the classification result is the second classification result, it is determined that the detected item is hidden dangerous goods, and comparing the second classification result with a pre-established spectral feature database of hidden dangerous goods, and judging the type of hidden dangerous goods according to the comparison results.

本发明还提供了一种电子设备，其特征在于，包括存储器及处理器，所述存储器中储存有计算机程序，所述计算机程序被所述处理器执行时，使得所述处理器执行前述的数据分类方法的步骤。The present invention also provides an electronic device, which is characterized in that it includes a memory and a processor, and a computer program is stored in the memory, and when the computer program is executed by the processor, the processor executes the aforementioned data The steps of the classification method.

从以上技术方案可以看出，本发明具有以下优点：As can be seen from the above technical solutions, the present invention has the following advantages:

本发明提供的一种基于FCN-ACGAN数据增强的隐匿危险品识别方法及设备，基于RMSProp优化器和预先获取的被检测物品的真实样本，对包含若干全连接层的初级判别器和生成器进行训练和更新，得到达到纳什均衡的FCN-ACGAN模型。其中，全连接层可以帮助FCN-ACGAN模型学习真实数据间的动态特性、提高生成器生成数据的质量，采用RMSProp优化器对判别器和生成器的网络参数进行更新，可以有效消除更新过程中的梯度差异导致的抖动，加快FCN-ACGAN模型收敛过程。进一步，利用训练好的FCN-ACGAN网络模型中的生成器创建模拟样本，混合模拟样本和真实样本对初始ResNet-LSTM分类模型进行训练，得到满足数据分类标准的最优分类模型，利用最优分类模型实现对被检测物品的太赫兹时域光谱数据的分类，最后根据分类结果识别被检测物品的属性及种类。本发明提供的数据分类方法，采用FCN-ACGAN模型创建模拟样本，实现训练样本扩展，解决了现有数据分类方法中，由于训练数据不足导致训练得到的分类模型分类效果差，进一步导致物品识别精度低的技术问题。The present invention provides a method and device for identifying hidden dangerous goods based on FCN-ACGAN data enhancement. Based on the RMSProp optimizer and the pre-acquired real samples of the detected items, the primary discriminator and generator including several fully connected layers are processed. After training and updating, the FCN-ACGAN model that achieves Nash equilibrium is obtained. Among them, the fully connected layer can help the FCN-ACGAN model learn the dynamic characteristics between real data and improve the quality of the data generated by the generator. Using the RMSProp optimizer to update the network parameters of the discriminator and generator can effectively eliminate the update process. The jitter caused by the gradient difference speeds up the convergence process of the FCN-ACGAN model. Further, use the generator in the trained FCN-ACGAN network model to create simulated samples, mix simulated samples and real samples to train the initial ResNet-LSTM classification model, and obtain the optimal classification model that meets the data classification standards. The model realizes the classification of the terahertz time-domain spectral data of the detected items, and finally identifies the attributes and types of the detected items according to the classification results. The data classification method provided by the present invention adopts the FCN-ACGAN model to create simulation samples, realizes the expansion of training samples, and solves the problem that in the existing data classification methods, the classification effect of the classification model obtained by training is poor due to insufficient training data, which further leads to the accuracy of item recognition Low technical issues.

附图说明Description of drawings

为了更清楚地说明本发明实施例或现有技术中的技术方案，下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍，显而易见地，下面描述中的附图仅仅是本发明的一些实施例，对于本领域普通技术人员来讲，在不付出创造性劳动性的前提下，还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present invention or the prior art, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the accompanying drawings in the following description are only These are some embodiments of the present invention. For those skilled in the art, other drawings can also be obtained according to these drawings without any creative effort.

图1为本发明实施方式提供的一种基于FCN-ACGAN数据增强的隐匿危险品识别方法的方法流程图；Fig. 1 is a method flowchart of a method for identifying hidden dangerous goods based on FCN-ACGAN data enhancement provided by an embodiment of the present invention;

图2为本发明实施方式提供的FCN-ACGAN模型的网络结构图；Fig. 2 is a network structure diagram of the FCN-ACGAN model provided by the embodiment of the present invention;

图3为本发明实施方式提供的生成器的网络结构图；FIG. 3 is a network structure diagram of a generator provided by an embodiment of the present invention;

图4为本发明实施方式提供的判别器的网络结构图。FIG. 4 is a network structure diagram of a discriminator provided by an embodiment of the present invention.

具体实施方式Detailed ways

本发明实施例提供了一种基于FCN-ACGAN数据增强的隐匿危险品识别方法及设备，通过对包含若干全连接层的初级判别器和生成器进行训练和更新，得到达到纳什均衡的FCN-ACGAN网络模型，利用FCN-ACGAN网络模型中的生成器创建模拟样本，实现训练样本扩展，进一步利用扩展样本对初始ResNet-LSTM分类模型进行训练得到满足数据分类标准的最优分类模型，解决了现有数据分类方法中，由于训练数据不足导致训练得到的分类模型分类效果差，进一步导致物品识别精度低的技术问题。The embodiment of the present invention provides a method and device for identifying hidden dangerous goods based on FCN-ACGAN data enhancement. By training and updating the primary discriminator and generator including several fully connected layers, an FCN-ACGAN that achieves Nash equilibrium is obtained. Network model, use the generator in the FCN-ACGAN network model to create simulation samples, realize the expansion of training samples, and further use the extended samples to train the initial ResNet-LSTM classification model to obtain the optimal classification model that meets the data classification standards, which solves the existing problems In the data classification method, due to insufficient training data, the classification effect of the trained classification model is poor, which further leads to the technical problem of low accuracy of item recognition.

为使得本发明的发明目的、特征、优点能够更加的明显和易懂，下面将结合本发明实施例中的附图，对本发明实施例中的技术方案进行清楚、完整地描述，显然，下面所描述的实施例仅仅是本发明一部分实施例，而非全部的实施例。基于本发明中的实施例，本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例，都属于本发明保护的范围。In order to make the purpose, features and advantages of the present invention more obvious and understandable, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the following The described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

深度学习网络模型通常是通过将数据集的一般特征提取出来，将特征作为预测某一类结果的特性，通过利用数据集对构建的模型进行训练，使模型输出的结果尽可能接近上述特性。当训练数据较少时，这种训练方式得到的模型往往会出现局限性：当在模型上使用训练集数据进行结果预测时，预测结果较好，而在模型上使用测试集或验证集数据进行结果预测时，模型的错误率较高，且泛化能力低。为避免上述问题，深度学习算法往往需要获取较多数量的训练数据，通过增加训练数据的数量来避免模型出现过拟合的现象。The deep learning network model usually extracts the general characteristics of the data set, uses the characteristics as the characteristics of predicting a certain type of results, and uses the data set to train the constructed model, so that the output results of the model are as close as possible to the above characteristics. When the training data is small, the model obtained by this training method often has limitations: when the training set data is used on the model to predict the result, the prediction result is better, and the test set or verification set data is used on the model. When predicting the results, the error rate of the model is high and the generalization ability is low. In order to avoid the above problems, deep learning algorithms often need to obtain a large amount of training data, and avoid over-fitting of the model by increasing the amount of training data.

太赫兹时域光谱技术是利用太赫兹脉冲在样品表面发生反射或者透射，分别测得通过样品前后的参考信号和测量信号，然后通过快速傅里叶变换(FFT)将采集到的时域信号变换到频域，得到两个频域光谱，最后通过对频域数据进行处理就可以提取出被测样品的折射率、消光系数和吸收系数等光学参数，通过分析相关数据，就可以实现物品检测(物品分类)。现有技术通常通过将太赫兹时域光谱数据作为训练样本建立深度学习网络模型，然后利用模型进行物品检测，但在太赫兹时域光谱隐匿危险品检测领域中，对于不同危险品的太赫兹时域光谱数据的获取通常采用人工标记，往往需要耗费大量人工和时间成本，且获得的数据量无法覆盖实际物品检测场景，训练数据不足会导致深度学习分类模型识别准确率和普适性低，进一步导致依据分类结果进行物品属性的识别率低。Terahertz time-domain spectroscopy technology uses terahertz pulses to reflect or transmit on the surface of the sample, respectively measure the reference signal and measurement signal before and after passing through the sample, and then transform the collected time-domain signal through fast Fourier transform (FFT) In the frequency domain, two frequency domain spectra are obtained. Finally, by processing the frequency domain data, the optical parameters such as the refractive index, extinction coefficient and absorption coefficient of the tested sample can be extracted. By analyzing the relevant data, the object detection can be realized ( item category). Existing technologies usually use terahertz time-domain spectral data as training samples to establish a deep learning network model, and then use the model to detect objects. The acquisition of domain spectral data is usually manually marked, which often requires a lot of labor and time costs, and the amount of data obtained cannot cover the actual object detection scene. Insufficient training data will lead to low recognition accuracy and universality of the deep learning classification model. As a result, the recognition rate of item attributes based on classification results is low.

有鉴于此，本申请提供了一种基于FCN-ACGAN数据增强的隐匿危险品识别方法，请参阅图1，方法包括：In view of this, this application provides a method for identifying hidden dangerous goods based on FCN-ACGAN data enhancement, please refer to Figure 1, the method includes:

100、对预先获取的真实光谱数据进行预处理，得到真实样本。100. Perform preprocessing on the pre-acquired real spectral data to obtain real samples.

可以理解的是，光谱数据采集的过程中，不可避免地会存在噪声，对被检测物品的原始太赫兹时域光谱数据(下述统称数据，不再赘述)进行预处理，可以提高数据的表达能力，使得数据特征更明显。It is understandable that in the process of spectral data collection, there will inevitably be noise. Preprocessing the original terahertz time-domain spectral data of the detected item (hereinafter collectively referred to as data, will not be described in detail) can improve the expression of data. Ability to make data features more obvious.

200、利用预先训练好的FCN-ACGAN网络模型生成模拟样本。200. Generate a simulation sample by using a pre-trained FCN-ACGAN network model.

通过样本数据对构建的模型进行训练，使模型的精度尽可能高，这是模型训练的理想结果。然而，当训练数据较少时，这种训练方式得到的模型往往会出现在模型上使用训练集数据进行结果预测时，预测结果较好，而在模型上使用测试集或验证集数据进行结果预测时，模型的错误率较高。为避免上述问题，模型的建立往往需要获取较多数量的训练数据，通过增加训练数据的数量来避免模型出现过拟合的现象。当原始数据量较少时，可以通过创建样本的方式来扩展数据量。The constructed model is trained with sample data to make the accuracy of the model as high as possible, which is the ideal result of model training. However, when the training data is small, the model obtained by this training method often appears that when the model uses the training set data to predict the result, the prediction result is better, and the test set or verification set data is used on the model to predict the result. , the error rate of the model is high. In order to avoid the above problems, the establishment of the model often needs to obtain a large amount of training data, and the phenomenon of over-fitting of the model can be avoided by increasing the amount of training data. When the amount of original data is small, the amount of data can be expanded by creating samples.

可以理解的是，拥有足够数量的训练数据时，可以训练得到的精度较高的模型，但若训练数据本身的品质较差，且训练数据不足时，训练得到的模型通常精度较差，或无法满足使用要求，因此，既要确保训练数据足量，同时也要确保足量的训练数据具有较好的品质。It is understandable that when there is a sufficient amount of training data, a model with high accuracy can be trained, but if the quality of the training data itself is poor and the training data is insufficient, the trained model usually has poor accuracy or cannot To meet the requirements of use, therefore, it is necessary to ensure that the training data is sufficient, and at the same time ensure that the sufficient training data has good quality.

当数据量较小时，通过数据增强可以获得增加样本总量，但若优先通过数据增强获得样本总量，再对总的数据进行处理提高数据品质，则会增加处理的工作量，因此，本实施方式优先对原始数据进行预处理，再基于原始数据，利用预先训练好的FCN-ACGAN网络模型生成模拟样本实现数样本扩展。When the amount of data is small, the total number of samples can be increased through data enhancement, but if the total number of samples is obtained first through data enhancement, and then the total data is processed to improve the data quality, the processing workload will increase. Therefore, this implementation The method firstly preprocesses the original data, and then based on the original data, uses the pre-trained FCN-ACGAN network model to generate simulation samples to achieve number sample expansion.

300、根据所述真实样本和所述生成样本对预先构建的初始ResNet-LSTM分类模型进行训练，得到最优分类模型。300. Train the pre-built initial ResNet-LSTM classification model according to the real samples and the generated samples to obtain an optimal classification model.

根据步骤200可以获得满足训练需要的模拟样本数量，将模拟样本与原始样本混合实现训练样本扩展，利用扩展样本对ResNet-LSTM分类模型进行训练，当模型误差满足预设误差阈值时，停止训练，得到最优分类模型。According to step 200, the number of simulated samples that meet the training needs can be obtained, the simulated samples are mixed with the original samples to realize the expansion of the training samples, and the ResNet-LSTM classification model is trained using the expanded samples. When the model error meets the preset error threshold, the training is stopped. Get the best classification model.

400、利用所述最优分类模型对所述太赫兹时域光谱数据进行分类，根据分类结果确定隐匿危险品。400. Use the optimal classification model to classify the terahertz time-domain spectral data, and determine hidden dangerous goods according to the classification result.

优选的，本实施方式中，将分类结果分为两种：第一分类结果和第二分类结果，其中，第一分类结果表示被检测的物品不具有危险性，第二分类结果表示被检测的物品中隐匿有危险品Preferably, in this embodiment, the classification results are divided into two types: the first classification result and the second classification result, wherein the first classification result indicates that the detected item is not dangerous, and the second classification result indicates that the detected item Dangerous goods hidden in the item

所述根据分类结果确定隐匿危险品具体如下：The details of determining hidden dangerous goods based on classification results are as follows:

首先判断所述分类结果的类型，当所述分类结果为第一分类结果时，则判定被检测物品为非隐匿危险品，当所述分类结果为第二分类结果时，则判定被检测物品为隐匿危险品，并将所述第二分类结果与预先建立的隐匿危险品的光谱特征数据库进行比对，根据对比结果判断所述隐匿危险品的种类。First judge the type of the classification result. When the classification result is the first classification result, it is determined that the detected item is a non-concealed dangerous product. When the classification result is the second classification result, it is determined that the detected item is Concealing the dangerous goods, and comparing the second classification result with a pre-established spectral feature database of the hidden dangerous goods, and judging the type of the hidden dangerous goods according to the comparison result.

可以理解的是，不同物品的太赫兹光谱数据不同，将物品的太赫兹光谱特征和已建立的各种危险品的光谱特征数据库进行比对，可以判断被检测的物品是否为危险品以及是何种危险品。It is understandable that the terahertz spectral data of different items are different. By comparing the terahertz spectral features of the items with the established spectral feature databases of various dangerous goods, it is possible to judge whether the detected items are dangerous goods and what kind of goods they are. dangerous goods.

本发明提供的数据分类方法，利用预先训练好的FCN-ACGAN网络模型实现数样本扩展，进一步利用扩展样本对初始ResNet-LSTM分类模型进行训练得到满足数据分类标准的最优分类模型，然后利用最优分类模型对被检测物品的太赫兹时域光谱数据进行分类，根据分类结果确定隐匿危险品，解决了现有数据分类方法中，由于训练数据不足导致训练得到的分类模型分类效果差，进一步导致依据分类结果进行物品属性的识别率低的技术问题。The data classification method provided by the present invention utilizes the pre-trained FCN-ACGAN network model to realize number sample expansion, and further uses the expanded samples to train the initial ResNet-LSTM classification model to obtain the optimal classification model that meets the data classification standard, and then utilizes the most The optimal classification model classifies the terahertz time-domain spectral data of the detected items, and determines the hidden dangerous goods according to the classification results, which solves the problem of poor classification effect of the trained classification model due to insufficient training data in the existing data classification methods, which further leads to The technical problem of low recognition rate of item attributes based on classification results.

在前述实施方式的基础上，本申请提供了另一个优选的实施方式，步骤100具体可以通过下述方式实现：On the basis of the foregoing embodiments, the present application provides another preferred embodiment, and step 100 can be specifically implemented in the following manner:

对预先获取的真实光谱数据进行数据清洗，得到第一初始样本，然后对所述第一样本进行缺失值补充，得到第二初始样本，最后对所述第二初始样本进行归一化处理，得到真实样本。performing data cleaning on the pre-acquired real spectral data to obtain a first initial sample, then supplementing the first sample with missing values to obtain a second initial sample, and finally performing normalization processing on the second initial sample, get real samples.

可以理解的是，在对数据进行缺失值补充前，先对数据进行清洗，可以去除数据集中的噪声数据和无关数据，提高数据质量，避免处理无用数据增加工作量，进一步地，通过对清洗后的数据进行缺失值补充，可以实现数据修复，降低偏倚风险，提高样本代表性，进一步对修复后的数据进行标准化和归一化操作，使得数据特征相对一致，排除突出特征对模型训练的影响。It is understandable that before supplementing the data with missing values, the data is cleaned first, which can remove noise data and irrelevant data in the data set, improve data quality, and avoid increasing the workload of processing useless data. Further, by cleaning Complementing the missing values of the missing data can realize data repair, reduce the risk of bias, improve sample representativeness, and further standardize and normalize the repaired data, so that the data features are relatively consistent, and the impact of outstanding features on model training can be eliminated.

在前述实施方式的基础上，本申请提供了另一个优选的实施方式，步骤200中，FCN-ACGAN网络模型包含生成器和判别器，生成器和判别器分别包含若干全连接层，其中，FCN-ACGAN网络模型的构建的步骤如下：On the basis of the aforementioned embodiments, the present application provides another preferred embodiment. In step 200, the FCN-ACGAN network model includes a generator and a discriminator, and the generator and the discriminator respectively include several fully connected layers. Among them, the FCN -The steps of building the ACGAN network model are as follows:

步骤S2、利用所述生成器创建初级生成样本；Step S2, using the generator to create a primary generation sample;

步骤S3、将所述初级生成样本与所述真实样本混合，得到混合样本；Step S3, mixing the primary generated sample with the real sample to obtain a mixed sample;

步骤S4、利用所述混合样本对所述初级判别器和所述生成器进行实际训练，基于RMSProp优化器分别对所述初级判别器和所述生成器的网络参数进行更新，判断所述FCN-ACGAN网络模型是否达到纳什均衡，若是，停止训练，得到训练好的FCN-ACGAN网络模型，若否，返回步骤S2；Step S4, using the mixed samples to actually train the primary discriminator and the generator, update the network parameters of the primary discriminator and the generator based on the RMSProp optimizer, and judge that the FCN- Whether the ACGAN network model reaches the Nash equilibrium, if so, stop the training, and obtain the trained FCN-ACGAN network model, if not, return to step S2;

其中，在步骤S2之前还包括：利用所述真实样本对初始生成器进行预训练，得到训练后的生成器。Wherein, before step S2, it also includes: using the real samples to pre-train the initial generator to obtain the trained generator.

参见图2，其中，初始FCN-ACGAN模型包含判别器(网络)和生成器(网络)，不同于传统的GAN网络，本实施方式的FCN-ACGAN模型中，生成器和判别器中分别加入全连接层，通过全连接层可以帮助FCN-ACGAN模型学习真实数据间的动态特性、提高生成器生成数据的质量、增强判别器鉴定数据真伪的能力。请参阅图3和图4，图3为本实施方式提供的生成器的一种网络结构图，图4为本实施方式提供的判别器的一种网络结构图。Referring to Fig. 2, wherein, the initial FCN-ACGAN model includes a discriminator (network) and a generator (network), which is different from a traditional GAN network. In the FCN-ACGAN model of this embodiment, a full The connection layer, through the fully connected layer, can help the FCN-ACGAN model learn the dynamic characteristics between real data, improve the quality of the data generated by the generator, and enhance the ability of the discriminator to identify the authenticity of the data. Please refer to FIG. 3 and FIG. 4 , FIG. 3 is a network structure diagram of a generator provided in this embodiment, and FIG. 4 is a network structure diagram of a discriminator provided in this embodiment.

其中，生成器由1个输入模块、5个Dense全连接层、4个Tanh激活函数层、和1个输出模块组成。生成器的输入为随机噪声和标签数据，输出为“模拟”的样本。Among them, the generator consists of 1 input module, 5 Dense fully connected layers, 4 Tanh activation function layers, and 1 output module. The input to the generator is random noise and labeled data, and the output is a "simulated" sample.

判别器由1个输入模块、5个Dense全连接层、3个ReLU激活函数层和1个softmax分类器组成。判别器的输入包含生成器的输出的“模拟”样本和真实样本，输出为带有标签类型的判别结果。The discriminator consists of 1 input module, 5 Dense fully connected layers, 3 ReLU activation function layers and 1 softmax classifier. The input of the discriminator contains "simulated" samples and real samples of the output of the generator, and the output is the discriminative result with the label type.

可以理解的是，没有经过学习(训练)的初始生成器在一开始不清楚真实数据的“模样”，直接利用未经训练的初始生成器创建数据，创建出的数据的分布与真实数据的分布的差异性较大，若直接将初始生成器生成的数据与真实数据混合，然后将混合的数据送入判别器中进行模型训练，会增加模型的训练过程。因此，为加快训练进程，需要先让初始生成器进行模仿学习，使初始生成器具备一定的模仿能力，然后再利用训练完成的生成器创建的数据对判别器进行训练。It is understandable that the initial generator that has not been learned (trained) does not know the "appearance" of the real data at the beginning, and directly uses the untrained initial generator to create data, and the distribution of the created data is consistent with the distribution of real data If the data generated by the initial generator is directly mixed with the real data, and then the mixed data is sent to the discriminator for model training, the training process of the model will be increased. Therefore, in order to speed up the training process, it is necessary to let the initial generator perform imitation learning first, so that the initial generator has a certain imitation ability, and then use the data created by the trained generator to train the discriminator.

同样的，若直接将生成器生成的数据与真实数据混合，然后将混合的数据送入判别器中进行模型训练，由于判别器在一开始不清楚真实数据的“模样”，混合数据进入判别器后，判别器也无法做出准确的判断，只能通过多次更新才能提高判别器对真假数据区分的能力，且在更新过程中，生成器也在不断更新，博弈过程导致判别器的判别能力增长缓慢，因此，直接混合真假数据对模型进行训练会增加模型的训练过程。为加快FCN-ACGAN模型的训练进程，本实施方式在利用混合数据进行模型训练之前，优先使用真实数据对判别器进行预训练，使判别器在早期就具备真假数据的区分能力。Similarly, if the data generated by the generator is directly mixed with the real data, and then the mixed data is sent to the discriminator for model training, since the discriminator does not know the "appearance" of the real data at the beginning, the mixed data enters the discriminator Finally, the discriminator cannot make accurate judgments, and the ability of the discriminator to distinguish between true and false data can only be improved through multiple updates. The ability grows slowly, so directly mixing real and fake data to train the model will increase the training process of the model. In order to speed up the training process of the FCN-ACGAN model, before using mixed data for model training in this embodiment, the discriminator is pre-trained with real data first, so that the discriminator has the ability to distinguish between real and fake data at an early stage.

在实际训练阶段，生成器利用潜在空间的随机噪声建立与真实数据分布的映射关系，生成带有标签的若干模拟样本，然后将若干模拟样本与真实样本混合得到训练数据，并将训练数据输入已经经过预训练后的判别器中。判别器利用训练数据进行训练，获取判别器网络损失值，并根据判别器的网络损失值和RMSProp优化器对判别器的网络参数进行更新，每更新一次，判别器就会变得更“聪明”。当判别器的更新次数满足第一更新阈值时，保持判别器的参数不变，获取生成器的网络损失值，并根据生成器的网络损失值和RMSProp优化器对生成器的网络参数进行更新，同样，每更新一次，生成器也会变得更“聪明”，直至所述生成器的更新次数满足预设第二更新阈值。上述过程不断循环，生成器不断生成与真实样本更相似的新的模拟样本，并利用新的模拟样本和原始数据对判别器进行训练，判别器不断更新提升判别能力，同时生成器也不断更新，生成器与判别器交替更新，直至整个FCN-ACGAN模型达到纳什均衡。其中，更新过程中，生成器与判别器的学习率为预设的定值，本实施方式中，不对生成器与判别器的更新频次做具体限定，本领域技术人员可以根据需要进行设定。In the actual training stage, the generator uses the random noise in the latent space to establish a mapping relationship with the real data distribution, generates several simulated samples with labels, and then mixes several simulated samples with real samples to obtain training data, and inputs the training data into the In the discriminator after pre-training. The discriminator uses the training data for training, obtains the network loss value of the discriminator, and updates the network parameters of the discriminator according to the network loss value of the discriminator and the RMSProp optimizer. Every time it is updated, the discriminator will become more "smart". . When the number of updates of the discriminator meets the first update threshold, keep the parameters of the discriminator unchanged, obtain the network loss value of the generator, and update the network parameters of the generator according to the network loss value of the generator and the RMSProp optimizer, Similarly, the generator will become "smarter" every time it is updated, until the number of updates of the generator meets the preset second update threshold. The above process is repeated continuously. The generator continuously generates new simulated samples that are more similar to real samples, and uses the new simulated samples and original data to train the discriminator. The discriminator is constantly updated to improve the discrimination ability, and the generator is also constantly updated. The generator and the discriminator are updated alternately until the entire FCN-ACGAN model reaches the Nash equilibrium. Wherein, during the updating process, the learning rate of the generator and the discriminator is a preset value. In this embodiment, there is no specific limitation on the update frequency of the generator and the discriminator, and those skilled in the art can set it as needed.

当FCN-ACGAN模型达到纳什均衡后，意味着FCN-ACGAN模型中的生成器生成的模拟样本已经可以“欺骗”判别器了，可以理解为，生成器生成的模拟样本已经满足作为训练数据的要求，模拟样本的表达能力与真实数据接近。When the FCN-ACGAN model reaches the Nash equilibrium, it means that the simulated samples generated by the generator in the FCN-ACGAN model can already "deceive" the discriminator. It can be understood that the simulated samples generated by the generator have met the requirements for training data , the expression ability of the simulated samples is close to that of the real data.

上述实施方式中，通过在生成器网络和判别器网络中分别加入全连接层，可以帮助FCN-ACGAN模型学习真实数据间的动态特性、提高生成器生成数据的质量、增强判别器鉴定数据真伪的能力，使得生成器能够生成与真实数据更为相似的模拟样本，同时，在生成器网络和判别器的更新过程中，采用RMSProp优化器对判别器和生成器的网络参数进行更新，可以有效消除更新过程中的梯度差异导致的抖动，加快模型收敛过程。In the above embodiment, by adding a fully connected layer to the generator network and the discriminator network, it can help the FCN-ACGAN model learn the dynamic characteristics between real data, improve the quality of the data generated by the generator, and enhance the discriminator to identify the authenticity of the data. The ability enables the generator to generate simulated samples that are more similar to the real data. At the same time, in the update process of the generator network and the discriminator, the RMSProp optimizer is used to update the network parameters of the discriminator and the generator, which can effectively Eliminate the jitter caused by the gradient difference during the update process, and speed up the model convergence process.

在前述实施方式的基础上，本申请提供了另一个优选的实施方式，步骤300具体可以通过下述方式实现：On the basis of the foregoing embodiments, the present application provides another preferred embodiment, and step 300 can be specifically implemented in the following manner:

当通过生成器获取足够的模拟样本后，将模拟样本与真实样本混合，得到扩展样本，利用扩展样本对ResNet-LSTM分类模型进行训练，当模型误差满足预设阈值时，停止训练，得到最优分类模型，其中，混合样本分为训练样本和实际训练样本。When enough simulated samples are obtained through the generator, the simulated samples are mixed with real samples to obtain extended samples, and the extended samples are used to train the ResNet-LSTM classification model. When the model error meets the preset threshold, the training is stopped and the optimal model is obtained. Classification model, where the mixed samples are divided into training samples and actual training samples.

进一步地，利用混合样本对ResNet-LSTM分类模型进行训练具体包括：构建初始ResNet-LSTM分类模型，利用预训练样本对所述初始ResNet-LSTM分类模型进行预训练，得到所述初始ResNet-LSTM分类模型的超参数，基于所述超参数和所述实际训练样本对所述初始ResNet-LSTM分类模型进行实际训练，当所述初始ResNet-LSTM分类模型的模型误差满足误差阈值时，结束实际训练，得到最优分类模型。Further, using mixed samples to train the ResNet-LSTM classification model specifically includes: constructing an initial ResNet-LSTM classification model, using pre-trained samples to pre-train the initial ResNet-LSTM classification model, and obtaining the initial ResNet-LSTM classification model. The hyperparameters of the model, based on the hyperparameters and the actual training samples, the initial ResNet-LSTM classification model is actually trained, and when the model error of the initial ResNet-LSTM classification model meets the error threshold, the actual training is ended, Get the best classification model.

进一步地，可以利用所述最优分类模型对太赫兹时域光谱数据进行分类。Further, the optimal classification model can be used to classify the terahertz time-domain spectral data.

为验证基于FCN-ACGAN数据增强的太赫兹时域光谱隐匿危险品识别方法的可行性，本实施方式提供一种基于前述最优分类模型的危险品识别准确率验证示例。In order to verify the feasibility of the FCN-ACGAN data-enhanced terahertz time-domain spectrum hidden dangerous goods identification method, this embodiment provides an example of the verification accuracy of dangerous goods identification based on the aforementioned optimal classification model.

选取若干数量经过预处理的真实样本、FCN-ACGAN创建的模拟样本和由FCN-ACGAN创建的模拟样本和真实样本混合得到的扩展样本。其中，真实样本、模拟样本和扩展样本的数量一致。将原始样本、模拟样本和扩展样本分别输入ResNet-LSTM分类模型进行分类识别，分类结果如表1所示。由表1可知，FCN-ACGAN生成的模拟样本和真实样本在ResNet-LSTM分类模的表现基本一致，侧面验证了本申请提供的FCN-ACGAN模型生成的模拟样本不仅能够捕获原始数据的有效特征，还能生成与真实数据特征相适应的新样本。分类模型对由FCN-ACGAN创建的模拟样本和真实样本混合得到的扩展样本的识别率为99.42％，对真实样本的识别率为98.33％，相比于真实样本，采用扩展样本训练得到的模型的识别准确率提高了1.09％，同时提高了根据分类结果进行物品识别的识别精度，验证了基于FCN-ACGAN数据增强的太赫兹时域光谱隐匿危险品识别方法的可行性和有效性。Select a number of preprocessed real samples, simulated samples created by FCN-ACGAN, and extended samples mixed with simulated samples created by FCN-ACGAN and real samples. Among them, the numbers of real samples, simulated samples and extended samples are the same. The original samples, simulated samples and extended samples were respectively input into the ResNet-LSTM classification model for classification and recognition. The classification results are shown in Table 1. It can be seen from Table 1 that the performance of the simulated samples generated by FCN-ACGAN and the real samples in the ResNet-LSTM classification model are basically the same, which verifies that the simulated samples generated by the FCN-ACGAN model provided by this application can not only capture the effective features of the original data, It is also possible to generate new samples that fit the characteristics of the real data. The classification model has a recognition rate of 99.42% for the extended samples obtained by mixing simulated samples and real samples created by FCN-ACGAN, and a recognition rate of 98.33% for real samples. The recognition accuracy rate is increased by 1.09%, and the recognition accuracy of item recognition based on the classification results is improved, which verifies the feasibility and effectiveness of the terahertz time-domain spectrum hidden dangerous goods recognition method based on FCN-ACGAN data enhancement.

表1结合FCN-ACGAN和ResNet-LSTM识别算法性能表Table 1 Combined FCN-ACGAN and ResNet-LSTM recognition algorithm performance table

本发明是基于FCN-ACGAN数据增强的太赫兹时域光谱隐匿危险品识别方法，基于FCN-ACGAN模型对数据集进行扩充后能有效改善因数据不足导致地过拟合问题，并提高分类模型训练后的识别准确率，从而使隐匿危险品识别的准确率更高。The present invention is based on the FCN-ACGAN data enhanced terahertz time-domain spectrum hidden dangerous goods identification method. After expanding the data set based on the FCN-ACGAN model, it can effectively improve the over-fitting problem caused by insufficient data and improve the classification model training. The final identification accuracy rate, so that the accuracy rate of hidden dangerous goods identification is higher.

本申请还提供了一种电子设备，设备包括存储器和处理器，所述存储器中储存有计算机程序，所述计算机程序被所述处理器执行时，使得所述处理器执行前述数据分类方法的步骤。The present application also provides an electronic device, the device includes a memory and a processor, the memory stores a computer program, and when the computer program is executed by the processor, the processor executes the steps of the aforementioned data classification method .

所属领域的技术人员可以清楚地了解到，为描述的方便和简洁，上述描述的系统，装置和单元的具体工作过程，可以参考前述方法实施例中的对应过程，在此不再赘述。Those skilled in the art can clearly understand that for the convenience and brevity of the description, the specific working process of the above-described system, device and unit can refer to the corresponding process in the foregoing method embodiment, which will not be repeated here.

在本申请所提供的几个实施例中，应该理解到，所揭露的系统，装置和方法，可以通过其他的方式实现。例如，以上所描述的装置实施例仅仅是示意性的，例如，所述单元的划分，仅仅为一种逻辑功能划分，实际实现时可以有另外的划分方式，例如多个单元或组件可以结合或者可以集成到另一个系统，或一些特征可以忽略，或不执行。另一点，所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口，装置或单元的间接耦合或通信连接，可以是电性，机械或其他的形式。In the several embodiments provided in this application, it should be understood that the disclosed systems, devices and methods may be implemented in other ways. For example, the device embodiments described above are only illustrative. For example, the division of the units is only a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components can be combined or May be integrated into another system, or some features may be ignored, or not implemented. In another point, the mutual coupling or direct coupling or communication connection shown or discussed may be through some interfaces, and the indirect coupling or communication connection of devices or units may be in electrical, mechanical or other forms.

所述作为分离部件说明的单元可以是或者也可以不是物理上分开的，作为单元显示的部件可以是或者也可以不是物理单元，即可以位于一个地方，或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and the components displayed as units may or may not be physical units, that is, they may be located in one place, or may be distributed to multiple network units. Part or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.

另外，在本发明各个实施例中的各功能单元可以集成在一个处理单元中，也可以是各个单元单独物理存在，也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现，也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit. The above-mentioned integrated units can be implemented in the form of hardware or in the form of software functional units.

所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时，可以存储在一个计算机可读取存储介质中。基于这样的理解，本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来，该计算机软件产品存储在一个存储介质中，包括若干指令用以使得一台计算机设备(可以是个人计算机，服务器，或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括：U盘、移动硬盘、只读存储器(ROM，Read-OnlyMemory)、随机存取存储器(RAM，Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质。If the integrated unit is realized in the form of a software function unit and sold or used as an independent product, it can be stored in a computer-readable storage medium. Based on such an understanding, the essence of the technical solution of the present invention or the part that contributes to the prior art or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium , including several instructions to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the method described in each embodiment of the present invention. The aforementioned storage medium includes: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disk or optical disk, and other media that can store program codes.

以上所述，以上实施例仅用以说明本发明的技术方案，而非对其限制；尽管参照前述实施例对本发明进行了详细的说明，本领域的普通技术人员应当理解：其依然可以对前述各实施例所记载的技术方案进行修改，或者对其中部分技术特征进行等同替换；而这些修改或者替换，并不使相应技术方案的本质脱离本发明各实施例技术方案的精神和范围。As mentioned above, the above embodiments are only used to illustrate the technical solutions of the present invention, rather than to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: it can still understand the foregoing The technical solutions recorded in each embodiment are modified, or some of the technical features are replaced equivalently; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the spirit and scope of the technical solutions of the various embodiments of the present invention.

Claims

1. A terahertz time-domain spectrum concealed hazardous article identification method based on FCN-ACGAN data enhancement is characterized by comprising the following steps:

preprocessing pre-acquired real terahertz spectrum data to obtain a real sample;

generating a simulation sample by using a pre-trained FCN-ACGAN network model; the FCN-ACGAN network model comprises a generator and a discriminator, wherein the generator and the discriminator respectively comprise a plurality of full connection layers; the steps of training the FCN-ACGAN network model are as follows:

s1, pre-training the discriminator by using the real sample to obtain a primary discriminator;

s2, generating a primary simulation sample by using the generator;

s3, mixing the primary simulation sample with the real sample to obtain a training sample;

s4, actually training the primary arbiter and the generator by using the training samples, respectively updating the network parameters of the primary arbiter and the network parameters of the generator based on a RMSProp optimizer, judging whether the FCN-ACGAN network model reaches Nash equilibrium, if so, stopping training to obtain a trained FCN-ACGAN network model, and if not, returning to the S2;

training a pre-constructed initial ResNet-LSTM classification model according to the real sample and the simulation sample to obtain an optimal classification model;

and classifying the terahertz time-domain spectral data by using the optimal classification model, and determining hidden dangerous goods according to a classification result.

2. The method for identifying the terahertz time-domain spectrum concealed hazardous articles based on FCN-ACGAN data enhancement as claimed in claim 1, wherein the training of the pre-constructed initial ResNet-LSTM classification model according to the real samples and the simulation samples to obtain the optimal classification model specifically comprises:

mixing the real sample and the simulation sample to obtain an extended sample, and dividing the extended sample into a pre-training sample and an actual training sample;

constructing an initial ResNet-LSTM classification model, utilizing the pre-training sample to pre-train the initial ResNet-LSTM classification model to obtain a super parameter of the initial ResNet-LSTM classification model, actually training the initial ResNet-LSTM classification model based on the super parameter and the actual training sample, and ending the actual training when the model error of the initial ResNet-LSTM classification model meets a preset error threshold to obtain an optimal classification model.

3. The method for identifying the terahertz time-domain spectroscopy concealed hazardous articles based on FCN-ACGAN data enhancement as claimed in claim 2, wherein the generating of the primary simulation sample by the generator specifically comprises:

establishing a mapping relationship between the real data and random noise of a potential space, and generating a simulation sample based on the mapping relationship, wherein the simulation sample contains a class label.

4. The method for identifying the terahertz time-domain spectroscopy concealed hazardous article based on FCN-ACGAN data enhancement as claimed in claim 3, wherein the updating of the network parameters of the primary discriminator and the generator respectively based on the RMSProp optimizer comprises:

keeping the network parameters of the generator unchanged, acquiring a network loss value of the primary discriminator, updating the primary discriminator based on the RMSProp optimizer and the network loss value of the primary discriminator, keeping the network parameters of the primary discriminator unchanged when the updating times of the primary discriminator meet a preset first updating threshold, acquiring the network loss value of the generator, and updating the generator based on the RMSProp optimizer and the network loss value of the generator until the updating times of the generator meet a preset second updating threshold.

5. The FCN-ACGAN data enhancement based terahertz time-domain spectroscopy concealed hazardous article identification method according to claim 4, wherein the generator comprises: the device comprises an input module, a Dense full-connection layer, a Tanh activation function layer and an output module.

6. The FCN-ACGAN data enhancement based terahertz time-domain spectroscopy concealed hazardous article identification method according to claim 5, wherein the discriminator comprises: the device comprises an input module, a Dense full connection layer, a ReLU activation function layer and a softmax classifier.

7. The method for identifying the concealed hazardous article based on the FCN-ACGAN data enhancement terahertz time-domain spectroscopy as claimed in claim 6, further comprising, before step S2: and pre-training the initial generator by using the real sample to obtain a trained generator.

8. The method for identifying the terahertz time-domain spectrum concealed hazardous article based on FCN-ACGAN data enhancement as claimed in claim 1, wherein the preprocessing is performed on the pre-acquired real spectrum data to obtain the real sample specifically as follows:

performing data cleaning on pre-acquired real spectrum data to obtain a first initial sample;

supplementing missing values to the first sample to obtain a second initial sample;

and carrying out normalization processing on the second initial sample to obtain a real sample.

9. The method for identifying the concealed hazardous article based on the FCN-ACGAN data enhancement terahertz time-domain spectroscopy as claimed in claim 1, wherein the classification result comprises a first classification result and a second classification result, and the determining the concealed hazardous article according to the classification result specifically comprises:

and judging the type of the classification result, judging that the detected object is a non-hidden dangerous article when the classification result is a first classification result, judging that the detected object is a hidden dangerous article when the classification result is a second classification result, comparing the second classification result with a pre-established spectral feature database of the hidden dangerous article, and judging the type of the hidden dangerous article according to the comparison result.

10. An electronic device, comprising a memory and a processor, wherein the memory has stored thereon a computer program, which, when executed by the processor, causes the processor to carry out the steps of the data classification method according to any one of claims 1 to 9.