Multi-dimensional Feature Fusion Modulation Classification System Based on Self-training Network

Jingpeng Gao¹⁴,
Yi Lu¹⁴,
Lu Gao¹⁵ &
Liangxi Shen¹⁴

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 11902)

Included in the following conference series:

International Conference on Image and Graphics

Abstract

A single feature extraction method cannot fully characterize radar signals at low SNR, and large-scale deep learning networks cannot cope with the small sample sizes typical of radar signals. To address both problems, this paper proposes a multi-dimensional feature fusion modulation classification system that classifies nine types of radar signal: CW, BPSK, LFM, COSTAS, FRANK, T1, T2, T3 and T4. A small self-training network automatically extracts the time-frequency features of the radar signal. Following the idea of multi-dimensional feature fusion, the time-frequency entropy feature, the higher-order statistics feature and the network-extracted feature are normalized and fused by non-negative matrix factorization (NMF), which improves the classification performance of the proposed system at low SNR. Simulation results show that the recognition rate of the proposed system is 78% at −3 dB, a significant improvement over the traditional method.

1 Introduction

Electronic warfare has become an indispensable part of modern warfare and is key to gaining information superiority over the whole battlefield. As the electromagnetic environment grows increasingly complex, classifying the modulation of radar emitter signals has become an urgent problem [1].

With the deployment of new radar systems, the modulation modes of radar emitter signals are becoming more complex and the signal features keep changing. Hence traditional modulation classification methods cannot meet the requirements of present-day electronic reconnaissance [2]. Traditional feature analysis methods mostly focus on features in a single domain while ignoring the others, so they cannot effectively extract the modulation features of the signal, which degrades electronic reconnaissance [3]. In [4], a deep belief network was used to automatically extract feature parameters from large-sample data. The extracted features are of high dimension, which makes the computational complexity of the system high, and a deep network is prone to vanishing and exploding gradients. In [5], the LeNet-5 network was used for document recognition and achieved good results. LeNet-5 is a small network whose parameters can be trained adaptively to suit the data at hand. Moreover, higher-order statistics (HOS) features improve anti-noise performance [6], Renyi entropy features reflect the energy concentration of the signal [7], and fusing them helps to improve the recognition rate at low SNR.

Based on a self-training network, this paper proposes a multi-dimensional feature fusion modulation classification system. The system uses the self-training network to extract the time-frequency features of the radar signal, fuses the network-extracted features with the HOS and Renyi entropy features, and finally feeds the fused features into an extreme learning machine (ELM) for classification.

The rest of this paper is organized as follows. Section 2 establishes the system and signal models. Section 3 introduces the radar signal processing. Section 4 presents the feature extraction methods. Section 5 describes the classifier. Simulation results and analysis are given in Sect. 6. Finally, conclusions are drawn in Sect. 7.

2 Model of System and Signal

To realize multi-dimensional feature fusion modulation classification based on a self-training network, five stages are needed: time-frequency transform, image preprocessing, feature extraction, feature fusion and feature classification. First, the pseudo Wigner-Ville distribution (PWVD) is used to transform radar signals into time-frequency images. Second, the time-frequency images are preprocessed before feature extraction. Third, LeNet-5, a small self-training network, is used to extract features. Fourth, to express the information of the radar signals more comprehensively, the network feature, the Renyi entropy feature and the HOS feature are fused by non-negative matrix factorization (NMF). Finally, an ELM performs the classification. The structure of the proposed system is shown in Fig. 1.

To implement the proposed algorithm, radar signals must be generated to train the network parameters. The unified model of the radar signal is as follows.

$$ s\left( t \right) = A\left( t \right)\exp \left[ {j\left( {2\pi f_{0} t + c\left( t \right) + \varphi_{0} } \right)} \right] $$

(1)

where $ A\left( t \right) $ is the amplitude function, $ f_{0} $ is the carrier frequency, $ c\left( t \right) $ is the phase function and $ \varphi_{0} $ is the initial phase.
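As a rough illustration of Eq. (1), the sketch below samples the unified model for two of the nine signal types. The carrier frequency, sampling frequency and number of points are the values given in Sect. 6; the constant amplitude, the two phase functions and the LFM bandwidth are assumptions made for this example only, since the detailed models belong to Table 1.

```python
import numpy as np

def radar_signal(phase_fn, f0=10e6, fs=32e6, n=512, amp=1.0, phi0=0.0):
    """Sample s(t) = A * exp(j * (2*pi*f0*t + c(t) + phi0)) with constant A."""
    t = np.arange(n) / fs
    s = amp * np.exp(1j * (2 * np.pi * f0 * t + phase_fn(t) + phi0))
    return s, t

# Hypothetical phase functions c(t); Table 1 defines the ones actually used.
def cw_phase(t):
    return np.zeros_like(t)                      # no intra-pulse modulation

def lfm_phase(t, bandwidth=4e6):                 # bandwidth is an assumed value
    duration = t[-1] + (t[1] - t[0])
    return np.pi * (bandwidth / duration) * t**2  # linear frequency sweep

s_cw, t = radar_signal(cw_phase)
s_lfm, _ = radar_signal(lfm_phase)
```

Any other modulation can be plugged in by passing a different `phase_fn`, which keeps the generator independent of the specific signal type.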

The nine classical types of radar signal are CW, BPSK, LFM, COSTAS, FRANK, T1, T2, T3 and T4; their detailed models are shown in Table 1.

Table 1. Models of nine types of radar signal.

3 Signal Processing

Because a radar signal is non-stationary, traditional methods cause signal aliasing, which leads to a low recognition rate. Therefore, a time-frequency transform is adopted.

3.1 Time-Frequency Transform

Wavelet transform and short-time Fourier transform (STFT) are two common time-frequency transforms, but the wavelet transform is sensitive to noise and the STFT can only deal with stationary signals [8, 9]. The Wigner-Ville distribution (WVD) has good time-frequency concentration, and the PWVD further enhances this concentration, which is helpful for classification [10].

$$ P_{x} (t,f) = \int_{ - \infty }^{ + \infty } {h(\tau )} x\left( {t + \tau /2} \right)x^{*} \left( {t - \tau /2} \right)e^{ - j2\pi f\tau } d\tau $$

(2)

where $ h\left( \tau \right) $ is the window function and $ x^{*} $ denotes the complex conjugate of $ x $. Figure 2 shows the time-frequency images of the different signals obtained by PWVD without noise.
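A discrete version of the PWVD can be sketched as below. This is only an illustration, not the authors' implementation: the Hamming lag window, its length and the FFT size are assumptions. Because integer lags stand in for $ \tau /2 $, frequency bin $ k $ corresponds to $ k f_{s} /(2 \cdot \text{n\_fft}) $.

```python
import numpy as np

def pwvd(x, win_len=63, n_fft=128):
    """Discrete pseudo Wigner-Ville distribution.

    Rows are frequency bins, columns are time samples.
    win_len must be odd and no larger than n_fft.
    """
    x = np.asarray(x, dtype=complex)
    n = len(x)
    half = win_len // 2
    h = np.hamming(win_len)                    # lag window h(tau): an assumed choice
    tfr = np.zeros((n_fft, n))
    for t in range(n):
        m_max = min(t, n - 1 - t, half)        # keep t+m and t-m inside the signal
        m = np.arange(-m_max, m_max + 1)
        kernel = np.zeros(n_fft, dtype=complex)
        kernel[m % n_fft] = h[half + m] * x[t + m] * np.conj(x[t - m])
        tfr[:, t] = np.fft.fft(kernel).real    # kernel is Hermitian in m, so the FFT is real
    return tfr

# A 10 MHz tone sampled at 32 MHz concentrates in bin 2 * 10/32 * 128 = 80.
tone = np.exp(1j * 2 * np.pi * 10e6 * np.arange(512) / 32e6)
tfr = pwvd(tone)
```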

3.2 Image Preprocessing

With the development of artificial intelligence, neural networks are being applied more and more widely. To apply LeNet-5 to radar signals, the time-frequency images need to be preprocessed before being fed into the network. Image preprocessing suppresses noise and reduces the computational complexity of the LeNet-5 network. Most image preprocessing algorithms operate on grayscale or binary images. In grayscale preprocessing, the brightness of the original image is expressed as a gray level, which converts the color image into grayscale format.

The time-frequency image can be represented by an $ M \times N $ matrix, and the brightness of each pixel can be calculated by the grayscale formula.

$$ {{\boldsymbol{I}}_{fg}} = 0.3{{\boldsymbol{R}}_{fg}} + 0.59\,{{\boldsymbol{G}}_{fg}} + 0.11{{\boldsymbol{B}}_{fg}} $$

(3)

where $ f,g $ index the pixels of the image, $ 0 < f \le M,0 < g \le N $, and $ {\boldsymbol{R}} $, $ {\boldsymbol{G}} $ and $ {\boldsymbol{B}} $ are the red, green and blue channel values.

Because the dynamic range of the gray values differs between the time-frequency images of different signals, the range of gray values would affect the classification. To reduce the effect of this imbalance on classification, the gray values need to be normalized.

$$ {\hat{\boldsymbol{I}}}_{fg} = \frac{{\boldsymbol{I}}_{fg} - {\bar{\boldsymbol{I}}}}{\sqrt {\frac{1}{MN - 1}\sum\limits_{f = 1}^{M} {\sum\limits_{g = 1}^{N} {\left( {{\boldsymbol{I}}_{fg} - {\bar{\boldsymbol{I}}}} \right)^{2} } } } } $$

(4)

where $ {\bar{\boldsymbol{I}}} $ is the mean gray value.

To further enhance the signal and to reduce both the influence of noise and the amount of data, the image can then be binarized.

$$ \boldsymbol{P}_{fg} = \left\{ {\begin{array}{*{20}l} 1 \hfill & {\boldsymbol{I}_{fg} \ge \partial } \hfill \\ 0 \hfill & {\boldsymbol{I}_{fg} < \partial } \hfill \\ \end{array} } \right. $$

(5)

where $ \partial $ is the binarization threshold, which is set to 0.4 in this paper.
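The normalization of Eq. (4) and the binarization of Eq. (5) can be sketched in a few lines. The example starts from an image that is already grayscale. Applying the 0.4 threshold to the normalized values, and returning zeros for a constant image, are assumptions of this sketch rather than details stated in the text.

```python
import numpy as np

def normalize_gray(gray):
    """Eq. (4): subtract the mean and divide by the sample standard deviation."""
    gray = np.asarray(gray, dtype=float)
    std = gray.std(ddof=1)                 # ddof=1 gives the 1/(MN - 1) factor
    if std == 0:                           # constant image: assumed convention
        return np.zeros_like(gray)
    return (gray - gray.mean()) / std

def binarize(image, threshold=0.4):
    """Eq. (5): 1 where the pixel reaches the threshold, 0 elsewhere."""
    return (np.asarray(image) >= threshold).astype(np.uint8)

gray = np.array([[0.0, 1.0], [2.0, 3.0]])
normalized = normalize_gray(gray)
binary = binarize(normalized)              # assumption: threshold the normalized image
```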

4 Feature Extraction

Feature extraction is an essential part of radar signal classification. This paper proposes a multi-dimensional feature extraction method, which enables the extracted features to represent the signal information more comprehensively.

4.1 LeNet-5

LeNet-5, proposed in 1998, is the most representative of the early neural networks. It was initially used for document recognition. Its structure is simple, so it is suitable for training with small sample sizes. Applying LeNet-5 to feature extraction makes the extracted radar signal information more comprehensive, which improves the reliability of the system at low SNR. The structure of the LeNet-5 network is shown in Fig. 3.

After LeNet-5 has been trained, it can be used to extract features. This paper uses principal component analysis (PCA) and kernel principal component analysis (KPCA) to reduce the network-extracted features. Specifically, PCA treats the extracted features as a data matrix F, whose covariance matrix can be written as R = FF^T and decomposed as

$$ {\varvec{R}} = {\varvec{UAU}}^{T} $$

(6)

where A is the diagonal matrix of eigenvalues of the covariance matrix and U is the corresponding eigenvector matrix.

$$ {\varvec{P}} = {{\varvec{U}}^T}{\varvec{F}} = {\left[ {{p_1},{p_2}, \cdots {p_K}} \right]^T} $$

(7)

where $ p_{i} $ is the $ i $th principal component of the extracted feature matrix. The first $ K $ principal components are chosen as the feature matrix.
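The PCA steps of Eqs. (6) and (7) can be sketched as follows, with F stored as features by samples so that $ {\varvec{U}}^{T} {\varvec{F}} $ is well defined. Centering the data before forming R is an assumption of this sketch; the text does not state it.

```python
import numpy as np

def pca_reduce(F, k):
    """Return the first k principal components (k x samples) and all eigenvalues."""
    F = np.asarray(F, dtype=float)
    Fc = F - F.mean(axis=1, keepdims=True)   # centering: assumed, not stated in the text
    R = Fc @ Fc.T                            # covariance up to a scale factor, R = F F^T
    eigvals, U = np.linalg.eigh(R)           # R = U A U^T, eigenvalues in ascending order
    order = np.argsort(eigvals)[::-1]        # reorder to descending
    eigvals, U = eigvals[order], U[:, order]
    P = U.T @ Fc                             # Eq. (7)
    return P[:k], eigvals

rng = np.random.default_rng(0)
features = rng.normal(size=(6, 50))          # toy data: 6 features, 50 samples
P, eigvals = pca_reduce(features, k=2)
```

The rows of `P` are mutually uncorrelated, and the squared norm of each row equals the corresponding eigenvalue, which is what makes the eigenvalues usable as a measure of each component's share.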

The difference between PCA and KPCA is that KPCA first maps the feature matrix to a high-dimensional feature space through a nonlinear mapping. Each sample $ f_{c} $ of the feature matrix F is mapped by $ \Phi $ to $ \Phi \left( {f_{c} } \right) $, and the covariance matrix becomes $ \varvec{R} = \frac{1}{M}\sum\limits_{c = 1}^{M} {\Phi \left( {f_{c} } \right)}\Phi \left( {f_{c} } \right)^{T} $. The eigenvalue $ \lambda_{c} $ and eigenvector $ \mu_{c} $ are obtained from the following equation.

$$ \varvec{R}\mu_{c} = \lambda_{c} \mu_{c} $$

(8)

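Each eigenvector $ \mu_{c} $ can be represented as a linear combination of the mapped samples $ \Phi \left( {f_{s} } \right) $.

$$ \mu_{c} = \sum\limits_{s = 1}^{M} {a_{s} }\Phi \left( {f_{s} } \right) $$

(9)

Substituting (9) into (8) gives

$$ \lambda_{c} \sum\limits_{s = 1}^{M} {a_{s} }\Phi \left( {f_{s} } \right) = \frac{1}{M}\sum\limits_{r = 1}^{M} {\Phi \left( {f_{r} } \right)\Phi \left( {f_{r} } \right)^{T} } \sum\limits_{s = 1}^{M} {a_{s} }\Phi \left( {f_{s} } \right) $$

(10)

where $ a = \left( {a_{1} ,a_{2} , \ldots ,a_{M} } \right)^{T} $ is the vector of linear combination coefficients. A kernel function can be defined as follows.

$$ K_{cs} = K\left( {f_{c} ,f_{s} } \right) =\Phi \left( {f_{c} } \right)^{T}\Phi \left( {f_{s} } \right) $$

(11)

Left-multiplying (10) by $ \Phi \left( {f_{q} } \right)^{T} $ for $ q = 1, \ldots ,M $ and using (11), the eigenvalue problem reduces to

$$ M\lambda_{c} a = Ka $$

(12)

The $ k $th kernel principal component of a sample $ f $ obtained through the KPCA mapping is

$$ p_{k} = \mu_{k}^{T}\Phi \left( f \right) = \sum\limits_{s = 1}^{M} {a_{s} K\left( {f_{s} ,f} \right)} $$

(13)

where $ a_{s} $ are the coefficients of the $ k $th eigenvector. The first $ k $ principal components are chosen as the feature matrix.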
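A compact KPCA sketch is given below. The text does not specify the kernel, so the RBF kernel and its width `gamma` are assumptions, as is the centering of the kernel matrix in feature space. The eigenvectors of the kernel matrix supply the combination coefficients, and projecting the training samples amounts to multiplying the kernel matrix by those coefficients.

```python
import numpy as np

def kpca_reduce(F, k, gamma=0.5):
    """Return the first k kernel principal components (k x M) of the M columns of F."""
    X = np.asarray(F, dtype=float).T                  # one sample per row
    sq = np.sum(X**2, axis=1)
    dist2 = np.maximum(sq[:, None] + sq[None, :] - 2.0 * X @ X.T, 0.0)
    K = np.exp(-gamma * dist2)                        # RBF kernel: an assumed choice
    M = K.shape[0]
    J = np.full((M, M), 1.0 / M)
    Kc = K - J @ K - K @ J + J @ K @ J                # center the data in feature space
    eigvals, A = np.linalg.eigh(Kc)                   # eigenvectors are the coefficient vectors a
    top = np.argsort(eigvals)[::-1][:k]               # k largest eigenvalues
    eigvals, A = eigvals[top], A[:, top]
    A = A / np.sqrt(eigvals)                          # scale so each feature-space eigenvector has unit norm
    return (Kc @ A).T, eigvals                        # projections of the training samples

rng = np.random.default_rng(0)
features = rng.normal(size=(5, 30))                   # toy data: 5 features, 30 samples
P, eigvals = kpca_reduce(features, k=3)
```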

4.2 Renyi Entropy

The more regular the time-frequency distribution of the signal, the less information it contains and the smaller the entropy. When the signal components are cluttered, the signal contains more information and the entropy increases. The Renyi entropy of the time-frequency image can be expressed as

$$ R^{\alpha } = \frac{1}{1 - \alpha }\log_{2} \iint {P_{x}^{\alpha } }(t,f)dtdf $$

(14)

Entropies of different orders reflect the signal features well, so this paper takes the Renyi entropies of orders $ \alpha = 3, 5, 7, 9 $ and $ 11 $ as signal features.
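On a discrete time-frequency image, the double integral in Eq. (14) becomes a sum over pixels. The sketch below first normalizes the image to unit sum, which is an assumption of this example. A uniform 4 × 4 image then has entropy $ \log_{2} 16 = 4 $ for every order, while an image with a single non-zero pixel has entropy 0.

```python
import numpy as np

def renyi_entropy(tfd, alpha):
    """Discrete Renyi entropy of order alpha (alpha != 1) of a time-frequency image."""
    p = np.asarray(tfd, dtype=float)
    p = p / p.sum()                       # unit-sum normalization: assumed
    # If the distribution has negative values, the sum can be <= 0 and log2 is undefined.
    return np.log2(np.sum(p**alpha)) / (1.0 - alpha)

def renyi_features(tfd, orders=(3, 5, 7, 9, 11)):
    """Entropy feature vector for the five orders used in the paper."""
    return np.array([renyi_entropy(tfd, a) for a in orders])

uniform = np.ones((4, 4))
peaked = np.zeros((4, 4))
peaked[1, 2] = 1.0
```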

4.3 Higher-Order Statistic

HOS can express the essential features of the signal well, which improves the robustness of the system. The kurtosis and margin of the signal are extracted in the time domain. The mean of the time-domain signal $ x\left( t \right) $ is

$$ \bar{X} = \frac{1}{N}\sum\limits_{i = 1}^{N} {x_{i} } \left( t \right) $$

(15)

where $ N $ is the number of samples of $ x\left( t \right) $. The mean-square value can be expressed as

$$ X_{rms}^{2} = \frac{1}{N}\sum\limits_{i = 1}^{N} {x_{i}^{2} } \left( t \right) $$

(16)

Thus, the margin of signal is

$$ C_{e} = \frac{{X_{rms} }}{{\bar{X}}} $$

(17)

The kurtosis of signal is

$$ C_{q} = \frac{{\frac{1}{N}\sum\limits_{i = 1}^{N} {\left( {\left| {x_{i} } \right| - \bar{X}} \right)^{4} } }}{{X_{rms}^{4} }} $$

(18)

The time-domain signal can be transformed into the frequency-domain signal $ X\left( f \right) $ by the Fourier transform. The kurtosis and margin of the spectrum are then extracted in the same way, giving the spectral kurtosis and spectral margin features.
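The four HOS features can be sketched as follows. Two assumptions are made here and should be checked against the authors' definitions: the mean is taken over the magnitudes $ \left| {x_{i} } \right| $ so that the ratios stay well defined for zero-mean or complex signals, and the kurtosis is computed as the fourth central moment of the magnitudes divided by $ X_{rms}^{4} $.

```python
import numpy as np

def margin_and_kurtosis(x):
    """Margin (rms / mean) and kurtosis of the magnitudes of x."""
    a = np.abs(np.asarray(x))
    mean = a.mean()                                  # mean of |x|: assumed
    rms = np.sqrt(np.mean(a**2))
    margin = rms / mean
    kurtosis = np.mean((a - mean) ** 4) / rms**4     # fourth central moment over rms^4
    return margin, kurtosis

def hos_features(x):
    """Time-domain and spectral margin/kurtosis as one 4-element feature vector."""
    spectrum = np.fft.fft(x)
    return np.array([*margin_and_kurtosis(x), *margin_and_kurtosis(spectrum)])

features = hos_features([1.0, -1.0, 1.0, -1.0])
```

For the alternating test signal, the magnitudes are constant in time (margin 1, kurtosis 0) while the spectrum has a single non-zero bin, so the spectral features are markedly larger.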

4.4 Feature Fusion

This paper applies the self-training network LeNet-5 to extract features and reduces the extracted features by PCA and KPCA. In addition, the Renyi entropy and HOS features are extracted as supplements to make the feature description more accurate. However, the value ranges of features obtained by different extraction methods differ, which may greatly affect the accuracy of the subsequent classifier. It is therefore necessary to normalize each set of extracted features separately.

$$ {\varvec{T}} = \left[ {{{\varvec{T}}_{{\varvec{PCA}}}},{{\varvec{T}}_{{\varvec{KPCA}}}},{{\varvec{T}}_{{\varvec{HOS}}}},{{\varvec{T}}_{{\varvec{Renyi}}}}} \right] $$

(19)

where T_PCA is the normalized feature reduced by PCA, T_KPCA is the normalized feature reduced by KPCA, T_HOS is the normalized HOS feature and T_Renyi is the normalized Renyi entropy feature. After the features have been normalized, NMF is used to fuse them, which reduces the redundant information among the features. NMF can be expressed as

$$ \mathop {\hbox{min} }\limits_{{\varvec{W,H}}} f\left( {\varvec{W,H}} \right) = \frac{1}{2}\sum\limits_{i = 1}^{m} {\sum\limits_{j = 1}^{n} {\left( {\varvec{T}_{ij} - \left( {\varvec{WH}} \right)_{ij} } \right)^{2} } } $$

(20)

where $ m \times n $ is the size of T, and W and H are the two non-negative factor matrices. W is the fused feature matrix, which is then sent to the classifier.
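One common way to minimize the objective in Eq. (20) is the multiplicative update rule of Lee and Seung, sketched below. The text does not say which solver the authors used, so this choice, the rank, the iteration count and the random initialization are all assumptions. T must be non-negative, for example after scaling each feature block to [0, 1].

```python
import numpy as np

def nmf(T, rank, n_iter=500, eps=1e-9, seed=0):
    """Factor a non-negative T (m x n) into W (m x rank) and H (rank x n)."""
    T = np.asarray(T, dtype=float)
    rng = np.random.default_rng(seed)
    m, n = T.shape
    W = rng.random((m, rank)) + eps
    H = rng.random((rank, n)) + eps
    for _ in range(n_iter):
        # Multiplicative updates keep W and H non-negative and do not
        # increase the squared reconstruction error.
        H *= (W.T @ T) / (W.T @ W @ H + eps)
        W *= (T @ H.T) / (W @ H @ H.T + eps)
    return W, H

rng = np.random.default_rng(1)
T = rng.random((8, 2)) @ rng.random((2, 6))   # toy non-negative matrix of rank 2
W, H = nmf(T, rank=2)                         # W plays the role of the fused features
```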

5 Feature Classification

ELM overcomes shortcomings of traditional neural networks such as slow training, a tendency to fall into local optima and sensitivity to the learning rate [11]. During training, ELM randomly generates the connection weights between the input layer and the hidden layer as well as the thresholds of the hidden layer. These need no adjustment during training; the globally optimal solution is obtained once the number of hidden-layer neurons is set.

The mathematical expression of ELM is

$$ v_{k} = \omega^{T} g\left( {W_{in} u_{k} + b} \right),k = 1,2, \cdots ,N $$

(21)

where $ v_{k} $ is the output vector, $ \omega $ is the output weight, $ g $ is the activation function, $ W_{in} $ is the input weight, $ u_{k} $ is the input vector, $ b $ is the bias of the hidden layer and $ N $ is the number of samples.

During training, $ W_{in} $ and $ b $ are randomly initialized and then kept fixed; the only parameter that needs to be trained is $ \omega $. It is calculated as follows.

$$ \omega = \varvec{H}^{{\mathbf{ + }}} \varvec{I} $$

(22)

where $ \varvec{H}^{{\mathbf{ + }}} $ is the Moore-Penrose generalized inverse of the hidden-layer output matrix H, which can be expanded as

$$ \varvec{H} = \left[ {\begin{array}{*{20}c} {g\left( {W_{in} u_{1} + b_{1} } \right)} & \cdots & {g\left( {W_{in} u_{1} + b_{n} } \right)} \\ \vdots & \ddots & \vdots \\ {g\left( {W_{in} u_{N} + b_{1} } \right)} & \cdots & {g\left( {W_{in} u_{N} + b_{n} } \right)} \\ \end{array} } \right]_{N \times n} $$

(23)

The expected output matrix I is

$$ \varvec{I} = \left( {I_{1} ,I_{2} , \cdots ,I_{N} } \right)^{T} $$

(24)

Thus, training the ELM is a simple linear regression. Once $ \omega $ has been found, training is finished.
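The training rule of Eq. (22) fits in a few lines of NumPy. In the sketch below, the sigmoid activation, the Gaussian initialization, the one-hot encoding of the expected outputs and the number of hidden neurons are assumptions chosen for illustration.

```python
import numpy as np

def _hidden(U, W_in, b):
    """Hidden-layer output matrix H with a sigmoid activation (assumed choice of g)."""
    return 1.0 / (1.0 + np.exp(-(U @ W_in + b)))

def elm_train(U, labels, n_hidden=20, seed=0):
    """U: samples x features, labels: integer class indices."""
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)
    W_in = rng.normal(size=(U.shape[1], n_hidden))   # random input weights, never trained
    b = rng.normal(size=n_hidden)                    # random hidden biases, never trained
    H = _hidden(U, W_in, b)
    I = np.eye(int(labels.max()) + 1)[labels]        # one-hot expected outputs
    omega = np.linalg.pinv(H) @ I                    # Eq. (22): omega = H^+ I
    return W_in, b, omega

def elm_predict(U, W_in, b, omega):
    return np.argmax(_hidden(U, W_in, b) @ omega, axis=1)

# Toy usage: two well-separated clusters.
rng = np.random.default_rng(0)
U = np.vstack([rng.normal(-2, 0.5, (20, 2)), rng.normal(2, 0.5, (20, 2))])
y = np.array([0] * 20 + [1] * 20)
pred = elm_predict(U, *elm_train(U, y))
```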

6 Simulation Results and Analysis

To verify the feasibility of the proposed algorithm, the radar signal parameters are set as follows. The sampling frequency is $ f_{s} = 32\,\text{MHz} $, the number of sampling points is $ N = 512 $, the noise is white Gaussian noise, the pulse width is 10 μs and the carrier frequency is 10 MHz. According to the signal models in Table 1, we randomly generate nine types of radar signal at SNRs from −3 dB to 6 dB and apply the time-frequency transform to them. For each SNR, we randomly choose 100 signals of each type. In addition, we generate nine types of noise-free radar signal to train the LeNet-5 network; 300 signals of each type are randomly chosen for time-frequency transformation and image preprocessing.
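Noisy test signals at a prescribed SNR can be produced as in the sketch below, which scales complex white Gaussian noise to the measured signal power. The helper name `add_awgn` and the use of complex noise are assumptions of this example.

```python
import numpy as np

def add_awgn(signal, snr_db, rng=None):
    """Add complex white Gaussian noise so that the result has the requested SNR."""
    rng = np.random.default_rng() if rng is None else rng
    signal = np.asarray(signal, dtype=complex)
    p_signal = np.mean(np.abs(signal) ** 2)
    p_noise = p_signal / 10 ** (snr_db / 10)
    # Split the noise power equally between the real and imaginary parts.
    noise = np.sqrt(p_noise / 2) * (
        rng.standard_normal(signal.shape) + 1j * rng.standard_normal(signal.shape)
    )
    return signal + noise

clean = np.exp(1j * 2 * np.pi * 0.1 * np.arange(100000))
noisy = add_awgn(clean, snr_db=-3.0, rng=np.random.default_rng(0))
measured = 10 * np.log10(
    np.mean(np.abs(clean) ** 2) / np.mean(np.abs(noisy - clean) ** 2)
)
```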

Figure 4 shows the proportion of the total accounted for by each component at 0 dB. In both PCA and KPCA, as the component index increases, the proportion of each individual component decreases while the cumulative proportion increases. In PCA, the first 20 components account for 90% of the total, which is enough to represent most of the feature information. The components after the 20th account for ever smaller proportions and are easily disturbed by noise, so they are ignored. PCA therefore yields 20 features. Similarly, the first 12 components account for 90% of the total in KPCA, so we choose the first 12 KPCA components as the feature matrix. If the features are taken directly from the C5 convolutional layer, 4096 features are obtained, whereas only 32 features (20 from PCA and 12 from KPCA) remain after dimension reduction, which greatly reduces the computational complexity.
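The 90% criterion used to pick the number of components can be written as a small helper. The function name and the toy eigenvalues are illustrative only; the eigenvalues would come from the PCA or KPCA decomposition.

```python
import numpy as np

def n_components_for(eigvals, target=0.90):
    """Smallest number of leading components whose cumulative share reaches target."""
    eigvals = np.sort(np.asarray(eigvals, dtype=float))[::-1]
    cumulative = np.cumsum(eigvals / eigvals.sum())
    # Index of the first cumulative share that is >= target, converted to a count.
    return int(np.searchsorted(cumulative, target) + 1)

# Shares 0.70, 0.25, 0.05 -> cumulative 0.70, 0.95, 1.00 -> two components.
count = n_components_for([7.0, 2.5, 0.5])
```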

Figure 5 shows the recognition rate curves of the training set and test set of the proposed algorithm at SNRs from −3 dB to 6 dB. Here the number of PCA components is 20, the number of KPCA components is 12, each type of signal is randomly divided into a training set and a test set in a 7:3 ratio, and the results are averaged over 500 repetitions.

Figure 5 shows that the recognition rates of both the training set and the test set increase with the SNR. The recognition rate of the training set reaches 91% at 0 dB, which indicates that the proposed algorithm has a good recognition rate. Moreover, the recognition rate of the test set follows a trend similar to that of the training set, and the difference between them is small, which indicates that the classifier neither over-fits nor under-fits. Figure 5 thus shows that the proposed algorithm is well suited to radar signal modulation classification.

Figure 6 shows that the recognition rates of the test set for all four methods increase with the SNR. Moreover, the recognition rate of the algorithm based on LeNet-5 + HOS + Renyi entropy is higher than that of LeNet-5 alone, which shows that the feature fusion in the proposed algorithm improves the recognition rate. The recognition rate of the proposed algorithm levels off above 4 dB; it is 96% at 4 dB and reaches 78% at the low SNR of −3 dB. In addition, at low SNR the LeNet-5 extracted feature clearly performs worse than the Renyi entropy feature. This is mainly because the signal is chaotic under the influence of noise at low SNR; the Renyi entropy feature captures the energy concentration of the signal, which effectively improves anti-noise performance, and fusing the HOS feature further enhances reliability. However, as the SNR increases, the influence of noise on the signal gradually decreases and the ability of the Renyi entropy feature to characterize the modulation pattern weakens. LeNet-5, by contrast, can then extract signal features completely and accurately, which makes its recognition rate better than that of Renyi entropy above 3 dB. Overall, although fusing the LeNet-5 extracted feature with the HOS and Renyi entropy features introduces a small amount of cumulative error, the fused feature expresses the modulation information more comprehensively and the performance is further improved.

7 Conclusion

This paper proposes a multi-dimensional feature fusion modulation classification algorithm based on a self-training network. The algorithm applies the LeNet-5 network to extract modulation features automatically, which addresses two problems: traditional algorithms extract features incompletely, and deep neural networks are unsuitable for small-sample training. The network-extracted feature is fused with the Renyi entropy and HOS features by NMF to increase the recognition rate. Simulation results show that the recognition rate of modulation classification based on the fused feature is better than that of the other classification algorithms compared, and that the algorithm performs well at low SNR.

References

1. Dudczyk, J., Kawalec, A.: Specific emitter identification based on graphical representation of the distribution of radar signal parameters. Bull. Polish Acad. Sci. Tech. Sci. 63(2), 391–396 (2015)
2. Guo, Q., Nan, P., Zhang, X., et al.: Recognition of radar emitter signals based on SVD and AF main ridge slice. J. Commun. Netw. 17(5), 491–498 (2015)
3. Ma, J., Huang, G., Zuo, W., et al.: Robust radar waveform recognition algorithm based on random projections and sparse classification. IET Radar Sonar Navig. 8(4), 290–296 (2014)
4. Wang, X., Zhou, Y., Zhou, D., et al.: Research on low probability of intercept radar signal recognition using deep belief network and bispectra diagonal slice. J. Electron. Inf. Technol. 38(11), 2972–2976 (2016)
5. Lecun, Y.L., Bottou, L., Bengio, Y., et al.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998)
6. Wei, S.: Feature space analysis of modulation classification using very high-order statistics. IEEE Commun. Lett. 17(9), 1688–1691 (2013)
7. Aguiar, V., Guedes, I., Pedrosa, I.A.: Tsallis, Rényi, and Shannon entropies for time-dependent mesoscopic RLC circuits. Prog. Theor. Exp. Phys. 2015(11), 113A01 (2015)
8. Samiee, K., Kovacs, P., Gabbouj, M.: Epileptic seizure classification of EEG time-series using rational discrete short-time Fourier transform. IEEE Trans. Biomed. Eng. 62(2), 541–552 (2015)
9. Huynh, Q.Q., Cooper, L.N., Intrator, N., et al.: Classification of underwater mammals using feature extraction based on time-frequency analysis and BCM theory. IEEE Trans. Signal Process. 46(5), 1202–1207 (2016)
10. Wang, Y., Wu, X., Li, W., et al.: Analysis of micro-Doppler signatures of vibration targets using EMD and SPWVD. Neurocomputing 171(C), 48–56 (2016)
11. Ding, S., Xu, X., Nie, R.: Extreme learning machine and its applications. Neural Comput. Appl. 25(3–4), 549–556 (2014)

Acknowledgements

This paper is funded by the International Exchange Program of Harbin Engineering University for Innovation-oriented Talents Cultivation, the National Natural Science Foundation of China (61701134), China Shipbuilding Industry Corporation 722 Research Institute Fund Project (KY10800170051), Equipment Pre-research Fund (61404150101), SAST2017-068 and the Fundamental Research Funds for the Central Universities (HEUCFG201832).

Author information

Authors and Affiliations

College of Information and Communication Engineering, Harbin Engineering University, Harbin, China
Jingpeng Gao, Yi Lu & Liangxi Shen
National Key Laboratory of Science and Technology on Test Physics and Numerical Mathematics, Beijing Institute of Space Long March Vehicle, Beijing, China
Lu Gao

Corresponding author

Correspondence to Yi Lu.

Editor information

Editors and Affiliations

Beijing Jiaotong University, Beijing, China
Yao Zhao
The Australian National University, Canberra, Australia
Nick Barnes
Peking University, Peking, China
Baoquan Chen
The Technical University of Munich, München, Bayern, Germany
Rüdiger Westermann
Zhejiang University, Hangzhou, China
Xiangwei Kong
Beijing Jiaotong University, Beijing, China
Chunyu Lin

Cite this paper

Gao, J., Lu, Y., Gao, L., Shen, L. (2019). Multi-dimensional Feature Fusion Modulation Classification System Based on Self-training Network. In: Zhao, Y., Barnes, N., Chen, B., Westermann, R., Kong, X., Lin, C. (eds) Image and Graphics. ICIG 2019. Lecture Notes in Computer Science(), vol 11902. Springer, Cham. https://doi.org/10.1007/978-3-030-34110-7_52

DOI: https://doi.org/10.1007/978-3-030-34110-7_52
Published: 28 November 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-34109-1
Online ISBN: 978-3-030-34110-7
eBook Packages: Computer Science, Computer Science (R0)