1. Introduction
By simultaneously using spatial information and temporal information, the space–time adaptive processing (STAP) method can improve the clutter suppression and moving target detection performance for airborne radar [
1,
2]. However, to ensure that the loss of the output signal-to-clutter-plus-noise ratio (SCNR) does not exceed 3 dB compared to the optimal case, the number of independent identically distributed (IID) training range cells required by conventional STAP methods is at least twice the system degrees of freedom (DOF) [
3]. In practice, non-ideal factors, e.g., a non-uniform ground/sea environment, non-stationary clutter features, complicated platform movements, and array amplitude/phase errors, often make this condition difficult to meet [
4,
5,
6].
To reduce the requirement of IID training range cells, dimension-reduced STAP methods, rank-reduced STAP methods, direct-data-domain STAP methods, and SR-based STAP methods have been proposed [
7,
8,
9,
10]. Among these methods, SR-STAP methods can achieve a high-resolution estimation of the clutter spectrum using a small number of IID training range cells. However, most SR algorithms, e.g., the focal under-determined system solver (FOCUSS), alternating direction method of multipliers (ADMM), and fast-converging sparse Bayesian learning (FCSBL) algorithm [
11,
12,
13] require lots of iterations to obtain the convergent solution, leading to high computational costs, especially when the problem dimension is high. In addition, in different clutter environments, the appropriate parameter settings are also a difficult problem for SR-STAP methods. Unreasonable parameter settings will affect the convergence speed and accuracy of SR algorithms. More importantly, various non-ideal factors in practical applications will deteriorate the clutter sparsity and make the SR signal model inaccurate, resulting in significant interferences deviating from the clutter ridge in the space–time domain and thus degrading the clutter suppression performance of SR-STAP methods. These problems limit the applications of SR-STAP methods in practice [
14,
15,
16].
Recently, the deep neural network (DNN)-based deep learning (DL) technique has been developed and applied to various fields. After proper and sufficient training, DNN can obtain a powerful nonlinear transform capacity for many data-processing or feature-mapping problems [
17,
18,
19,
20]. In addition, after offline training, DNN only needs forward propagation to complete its operations, thus enjoying a high online computing efficiency. These two properties of DNN can help to solve the above-mentioned problems of SR-STAP methods. For example, a STAP method based on convolutional DNN (CNN) is proposed in [
21], which uses the nonlinear image enhancement capability of CNN to realize the high-accuracy reconstruction of the clutter spectrum from its low-accuracy counterpart. It is shown in [
21] that, compared to some typical SR-STAP methods, the CNN-based STAP method can obtain a higher clutter-suppression performance with lower computational costs.
Unlike classical data-driven-only DNNs, deep unfolding (DU)-based neural networks combine the data-driven method with the model-driven method [
22,
23,
24,
25]. In DU-Net, a specific iterative algorithm (e.g., an iterative SR algorithm) with given iterations is unfolded into a DNN with the same number of layers, then the parameters involved in this algorithm are optimized by data learning. In other words, the DU-Net is constructed based on the model of an iterative algorithm. Compared to data-driven DNNs, DU-Nets have the advantage of interpretability and compared to model-driven algorithms, DU-Nets have the advantages of convergence speed and accuracy. Hence, DU-Nets also have the capability to solve the problems of SR-STAP methods. For example, the ADMM algorithm is unfolded into a DNN in [
26] for the joint estimation of the clutter spectrum and array error parameters. It is shown in [
26] that compared to some typical SR-STAP methods, the DU-Net-based STAP method can improve the clutter suppression performance and reduce the computing complexity.
However, although showing promising potential, DNN-based and DU-Net-based STAP methods have some essential problems that need to be solved. For the STAP methods using the nonlinear image enhancement capability of DNNs, the clutter spectrum estimation performance largely depends on the quality of the input data [
27], which cannot be guaranteed using conventional spectrum estimation methods, e.g., the Fourier transform method used in [
26]. For the DU-Net-based STAP methods that use the SR algorithm and the DNN method jointly, the performance will be seriously degraded when the clutter sparsity is damaged by the non-ideal practical factors. In addition, for both the DNN-based and DU-Net-based STAP methods, it is usually difficult to construct sufficient and complete input-label paired datasets for supervised training in an unknown environment.
To solve these problems, a DU-CG-STAP method with unsupervised learning is proposed in this paper, which cascades a DU-Net and a DNN. The DU-Net, named as 2DMA-Net, is used to process the raw radar data to estimate the clutter spectrum. It is constructed by unfolding a space–time 2D-decoupled SR algorithm with the multiple-measurement vector (MMV) model, i.e., the 2D-MMV-ADMM algorithm. Similar to [
28], the self-supervised learning method based on raw radar data is adopted by 2DMA-Net. The DNN, named a cycle-consistent adversarial network (CycleGAN) [
29], is used to process the clutter spectrum obtained by 2DMA-Net to filter out the interferences caused by non-ideal factors. It is acting as a nonlinear image enhancement process with an unsupervised training method based on an input-label unpaired dataset. By using DU-Net and DNN simultaneously, the DU-CG-STAP method can realize a fast and accurate estimation of the clutter spectrum and thus achieve a high clutter suppression and target detection performance for airborne radar.
To summarize, the main contributions of this paper are as follows:
- (1)
To reduce the complexity of solving the SR-STAP model for estimating the clutter spectrum, the MMV-ADMM algorithm is space–time 2D-decoupled. To optimize the iteration parameters of 2D-MMV-ADMM, the 2DMA-Net is constructed. To train 2DMA-Net, the L1 regularization loss function and the mean squared error (MSE) loss function are combined, and thus, with only raw radar data, the self-supervised training method is implemented.
- (2)
To solve the performance degradation problem of SR-STAP under non-ideal conditions, the clutter spectrum obtained using 2DMA-Net is processed using CycleGAN. The generator of CycleGAN maps the low-accuracy clutter spectrum into a high-accuracy domain to adaptively extract the clutter features and thus suppress the interferences caused by non-ideal factors. With an unpaired dataset, CycleGAN is trained based on the adversarial criterion and the cycle-consistency criterion.
- (3)
To generate an accurate clutter spectrum with low complexity, 2DMA-Net and CycleGAN are cascaded to form the DU-CG-STAP method. With raw radar data and theoretical clutter spectrum as the unpaired dataset, the DU-CG-STAP is trained in an unsupervised way.
The rest of this paper is organized as follows.
Section 2 establishes the signal model and briefly introduces the SR-STAP method.
Section 3 introduces the processing framework, network structure, dataset construction, and training methods of DU-CG-STAP in detail.
Section 4 verifies the performance and advantages of the proposed method via various simulations.
Section 5 draws conclusions and discusses future work.
3. DU-CG-STAP
Given the clutter spectrum SR estimation model in Equations (12) or (13), the performance of SR-STAP methods mainly depends on the adopted SR algorithm. Although many effective SR algorithms have been proposed, they have some common problems, e.g., parameter-setting difficulty and high computational complexity. In addition, in practical applications, various non-ideal factors will deteriorate the clutter sparsity and make the SR estimation model inaccurate, resulting in significant interferences deviating from the clutter ridge in the space–time domain and degrading the clutter suppression performance of SR-STAP methods. To solve these problems, a new STAP method, i.e., DU-CG-STAP, is proposed.
The main idea of DU-CG-STAP is to combine an SR-based DU-Net with an image-enhancement DNN. The SR-based DU-Net, namely 2DMA-Net, is used to obtain the clutter spectrum quickly from the raw radar data without parameter tuning. The image-enhancement DNN, namely CycleGAN, is used to process the clutter spectrum obtained by 2DMA-Net to generate an accurate and high-resolution counterpart.
The processing framework of the DU-CG-STAP method is shown in
Figure 2. It realizes the nonlinear transform from the raw radar data
to the clutter spectrum
, i.e.,
. The key to this method is the DU-CG network, where (1) the 2DMA-Net module is a solving network for the problem in Equation (13) with the network parameter as
and the output as the clutter spectrum estimation
; (2) the transform module
completes the single-channel processing of the spectrum
in the range dimension to obtain
; (3) the normalization module
normalizes the clutter spectrum
to obtain
as the input of G
YZ; (4) the generator G
YZ of CycleGAN is the clutter spectrum enhancement network with the parameter as
and the output as the normalized clutter spectrum estimation
; (5) the restoration module
obtains
, i.e., the final output of the DU-CG network.
To summarize, the procedure of the DU-CG-STAP method is as follows.
Step 1. Implement the offline training of the DU-CG network (including 2DMA-Net and CycleGAN).
Step 2. Input the raw radar data into the trained DU-CG network to obtain the clutter spectrum estimation.
Step 3. Calculate the CNCM and the space–time weighting vector and then conduct clutter suppression and moving target detection.
In the following, the network structure, dataset construction method, and network training method of DU-CG will be introduced in detail.
3.1. Network Structure
3.1.1. 2DMA-Net
Since the
norm is a discontinuous function, the complexity of directly solving Equation (13) is quite high. Thus, Equation (13) is usually solved by transforming it into an
convex optimization problem, expressed as
Introducing an auxiliary variable
, Equation (17) can be transformed into
where
denotes the regularization factor.
The augmented Lagrange function of Equation (18) is given by
where
denotes the inner product,
denotes the Lagrange multiplier, and
denotes the quadratic penalty factor.
Given an initial value
, the MMV-ADMM algorithm solves Equation (19) by solving the following three sub-problems alternately with
iterations.
where
,
, and
denote the estimation of
,
, and
in the
kth iteration (
), respectively.
The solutions of Equation (20) can be expressed as [
12,
30]
where
,
,
,
, and
is the iteration step size.
It can be seen from Equation (21) that the MMV-ADMM algorithm needs multiple matrix multiplications in each iteration, causing a high computing complexity. To improve the computing speed, the space–time 2D-decoupling process is implemented.
Firstly, the signal matrix
, noise matrix
, and clutter spectrum matrix
are space–time 2D-decoupled and transformed to the three-dimensional (3D) tensor form as
,
, and
, respectively. Then, corresponding to the space–time dictionary
, the spatial dictionary
and the temporal dictionary
in the 3D tensor form are constructed. At last, the radar-received signal tensor is expressed as
where
denotes the batch multiplication of multiple 3D tensors. For batch multiplication, the matrix slice of each tensor is taken from the third dimension for matrix multiplication. When the third-dimension size of a tensor is one, the batch multiplication takes the same matrix slice each time. For example, the batch multiplication of tensors
,
, and
can be simply expressed as
.
Based on Equation (22) and the batch multiplication process, the 2D-MMV-ADMM algorithm can be obtained from Equation (21), expressed as
where tensors
,
, and
are the space–time 2D-decoupled forms of
,
, and
, respectively.
Given the regularization factor , the quadratic penalty factor , and the iteration step in advance, the 2D-MMV-ADMM algorithm can obtain the clutter spectrum estimation as . Then, the CNCM and the space–time weighting vector can be calculated according to Equations (15) and (16). It can be seen from Equations (21) and (23) that by using the number of multiplications in a single iteration as the indicator, the complexities of the MMV-ADMM algorithm and its space–time 2D-decoupled version are, respectively, and . Compared to the MMV-ADMM algorithm, the 2D-MMV-ADMM algorithm can significantly reduce the computational complexity.
However, in practical applications, the parameter setting for 2D-MMV-ADMM is usually difficult. Unreasonable parameter settings will affect the convergence performance, resulting in high computational complexity and low clutter spectrum estimation accuracy. To solve this problem, based on the idea of DU, the 2D-MMV-ADMM algorithm with
K iterations is unfolded into a
K-layer neural network, i.e., 2DMA-Net, as shown in
Figure 3. The data-learning approach is used to obtain the optimal parameters for 2D-MMV-ADMM.
The input, output, and parameters of 2DMA-Net are the signal tensor
, the clutter spectrum estimation
, and
, respectively. The output of the
k-th layer of 2DMA-Net is the Lagrange multiplier
, the auxiliary variable
, and the clutter spectrum
. With operations similar to Equation (23), the nonlinear function
can be expressed as
2DMA-Net is driven by both data training and the theoretical model, hence having the advantages of data adaptability and model interpretability. With optimized network parameters, 2DMA-Net can achieve a higher convergence performance than the 2D-MMV-ADMM algorithm, thus reducing the computing complexity and improving the performance for estimating the clutter spectrum.
3.1.2. CycleGAN
In practice, non-ideal factors will reduce the clutter spectrum estimation accuracy of 2DMA-Net, resulting in some interferences in the space–time domain. To solve this problem, CycleGAN is used as an image-enhancement mapping tool to process the low-accuracy clutter spectrum output of 2DMA-Net. The processing framework of CycleGAN is shown in
Figure 4, where the unpaired low-accuracy clutter spectrum
and high-accuracy clutter spectrum
are both the input data. There are two generators of CycleGAN, G
YZ and G
ZY, where G
YZ maps the low-accuracy clutter spectrum
into the high-accuracy domain to obtain
and G
ZY maps the high-accuracy clutter spectrum
into the low-accuracy domain to obtain
. Discriminators D
Y and D
Z improve the mapping capability of the generators continuously in an adversarial mechanism. After training, the generator G
YZ of CycleGAN has the high-accuracy mapping capability for the low-accuracy clutter spectrum.
The network structures of the generator and discriminator of the CycleGAN used in this study are shown in
Figure 5, where sigmoid(·) denotes the sigmoid function, Tanh(·) denotes the hyperbolic tangent function, Conv2d denotes the 2D convolution process with the convolution kernel dimension as
,
denotes the number of input channels,
denotes the length and width of the convolution kernel,
denotes the number of convolution kernels (i.e., the number of output channels), Residual Block denotes the cascade of two Conv2d layers, and unlike Conv2d that implements the image down-sampling process, ConvTranspose2d implements the image up-sampling process to expand the image size. It should be noted that to better conduct clutter spectrum enhancement tasks and maintain low computational complexity, some appropriate modifications are made to the original network structures of CycleGAN given in [
29].
3.2. Dataset Construction
Compared to supervised learning, without using paired training data, unsupervised learning and self-supervised learning can acquire a large number of training data at a low cost. In this study, 2DMA-Net uses a self-supervised learning method and CycleGAN uses an unsupervised learning method, for which the training dataset is constructed with the following three steps.
First, some parameters of the airborne radar system, i.e., signal wavelength , pulse repetition interval , ULA element number , element spacing , CPI pulse number , and the training range cell number , are fixed. In addition, it is set that each range cell consists of clutter patches that are uniformly distributed in the azimuth angle range . The noise power is fixed to and the scattering coefficients of clutter patches obey a complex Gaussian distribution with the amplitude determined by the clutter-to-noise ratio (CNR).
Then, some intermediate parameters are calculated. With c as the speed of light, the maximum unambiguous range is calculated as . Given the space–time frequency range and and the grid number and , the spatial dictionary and the temporal dictionary are, respectively, constructed.
Finally, to mimic complicated scenarios, other parameters used to obtain the raw radar data are assumed to be uniformly randomly distributed within specified ranges, i.e., the airplane height , the airplane velocity , the non-side-looking angle , the detection range , the ICM , the array element amplitude error , the array element phase error , and the CNR .
According to the above settings, P different scenarios with random parameters are simulated and the raw radar data () corresponding to each scenario are generated based on Equation (1) and used as the input data for 2DMA-Net. After training 2DMA-Net with the self-supervised method as presented in the following subsection, the low-accuracy clutter spectrum is generated corresponding to each set of raw radar data and used as the input data for CycleGAN.
To train the CycleGAN with the unsupervised method,
P different scenarios are simulated with random parameters. Meanwhile, the theoretical CNCM is calculated for each scenario based on Equation (4), where no array amplitude/phase error is contained and thus the spatial weighting vector is fixed as
. Then, based on the minimum variance distortionless response (MVDR) algorithm [
31], the high-accuracy clutter spectrum
corresponding to each scenario is generated and is also used as the input data for CycleGAN.
In Step 2, the generated dataset for 2DMA-Net is
and the generated dataset for CycleGAN is
. As shown in
Figure 6, in this step, the generated datasets are divided into training datasets and validation datasets according to a certain proportion, with the sizes as
and
, respectively.
3.3. Network Training
3.3.1. 2DMA-Net
In most existing DU-Nets, the supervised training method is used, i.e., the output label for each input data is prepared for network training. However, for airborne radar STAP applications, the output label of 2DMA-Net is difficult to obtain as no exact clutter spectrum is available for each set of input raw radar data. A possible solution is to apply the 2D-MMV-ADMM algorithm with fixed manual-tuned parameters and sufficient iterations to solve Equation (22) to obtain the clutter spectrum estimation as the output label for 2DMA-Net. However, with fixed parameters this method cannot guarantee the estimation accuracy for different inputs, resulting in the distortion of output labels. In addition, to obtain convergence, this method needs a lot of iterations, resulting in high computing costs. To solve these problems, the self-supervised training method is adopted by 2DMA-Net without preparing output labels.
With the output clutter spectrum estimation
of 2DMA-Net for each set of input raw radar data
, the clutter data can be reconstructed as
. Then, the following network loss function is defined for the self-supervised training of 2DMA-Net.
where
,
,
and
are the matrix forms of the tensors
and
, and α is a constant.
It should be noted that in Equation (25), two functions are combined to define the network loss function of 2DMA-Net. The first function (i.e., MSE loss function) is used to ensure the estimation accuracy of the clutter spectrum with the consideration that the more accurate the estimation of , the smaller the difference between () and (). The second function (i.e., L1 regularization loss function) is used to improve the sparsity of the clutter spectrum estimation. If only the MSE loss function is used, the clutter spectrum estimation results may be quite different from the sparse solution of the SR-STAP problem. If only the L1 regularization loss function is used, as the clutter sparsity and the SR estimation model will be seriously damaged by the non-ideal factors (e.g., ICM, element amplitude/phase error, and low CNR), the performance of 2DMA-Net may degrade a lot with significant interferences in the space–time domain. Hence, by using a balancing coefficient α, the MSE and L1 regularization combined loss function is used by 2DMV-Net for network training to achieve a high clutter spectrum estimation performance.
Based on the loss function given in Equation (25), with the parameters of each 2DMA-Net layer initialized as
, the optimal parameters of 2DMA-Net
can be obtained via the back-propagation method [
32,
33], expressed as
3.3.2. CycleGAN
For DNNs using the supervised training method, a paired dataset is required. For airborne radar STAP applications, the practical clutter environment is usually unknown in advance, resulting in difficulties for proper dataset construction. To solve this problem, an unsupervised training method is adopted by CycleGAN. To realize the mutual mapping between the clutter spectra in the low-accuracy domain and the high-accuracy domain via the unpaired dataset, CycleGAN conducts the unsupervised training based on an adversarial criterion and cycle-consistency criterion, which are detailed as follows.
- (1)
Adversarial training
Consider that the generator G
YZ can accurately map the low-accuracy clutter spectrum
to the high-accuracy domain to obtain
(namely the fake high-accuracy clutter spectrum). Then, it will be difficult for the discriminator D
Z to distinguish
from the true high-accuracy clutter spectrum dataset
. The adversarial training process will continuously improve the discriminating capability of D
Z on the fake and true spectrum and based on the feedback of D
Z, G
YZ will continuously improve its high-accuracy mapping capability on the low-accuracy clutter spectrum. Thus, the following loss function is defined for the generator G
YZ and the discriminator D
Z, expressed as
where
denotes expectation and
and
denote the probabilities that the true and fake high-accuracy clutter spectra can be correctly discriminated by D
Z, respectively.
Similarly, the following loss function is defined for the generator G
ZY and the discriminator D
Y, expressed as
where
and
denote the probability that the true and fake low-accuracy clutter spectra can be correctly discriminated by D
Y, respectively.
The training process based on the adversarial criterion optimizes the generators GYZ/GZY and the discriminators DY/DZ simultaneously, expressed as and , i.e., the generators and discriminators will oppositely minimize and maximize the same loss function.
- (2)
Cycle-consistency training
The goal of adversarial training is to make it difficult for DZ to discriminate from the true high-accuracy clutter spectrum dataset . However, it cannot guarantee that and correspond to the same situation. For example, the low-accuracy clutter spectrum in the side-looking case may be transformed by GYZ into a high-accuracy clutter spectrum in the non-side-looking case. In other words, the adversarial training process only forces to belong to the high-accuracy domain but cannot ensure that is the real desired high-accuracy clutter spectrum counterpart of .
Based on the cycle-consistency criterion, if
can be recovered to the original data by G
YZ and G
ZY successively, i.e.,
, it can guarantee
and
correspond to the same situation. Similarly, for
, it has
. Hence, the following loss function is defined for the cycle-consistency training, expressed as
- (3)
Full training
To ensure the mapping and correspondence of the clutter spectrum at the same time, the full training process is conducted. Combining the adversarial loss function and the cycle-consistency loss function with their importance balanced by a coefficient
, the full loss function for CycleGAN is defined as
Then, by using the Glorot method [
34,
35] for initialization, the optimal network parameters of CycleGAN can be obtained via the back-propagation method, expressed as
4. Experiment Results
In this section, the performance of the proposed DU-CG-STAP method is verified and compared with three typical SR-STAP methods, i.e., MMV-FOCUSS-STAP, MMV-FCSBL-STAP, and MMV-ADMM-STAP, via various simulations with the parameters shown in
Table 1, which are set according to their typical values [
13,
15,
21].
In MMV-FOCUSS-STAP, the number of iterations is set as 200 and the sparsity parameter is set as 0.2. In MMV-FCSBL-STAP, the number of iterations is set as 30 and the noise variance is initialized as 10−5. In MMV-FCSBL-STAP, the parameters are set as , , , and . In the self-supervised training of 2DMA-Net, the coefficient of the L1 regularization loss function, the number of network layers, the initial learning rate, and the training epoch are set as , , 10−4, and 500, and the parameters of each layer are initialized as . In the unsupervised training of CycleGAN, the coefficient of the cycle-consistency loss, the initial learning rate, and the training epoch are set as , 2 × 10−5, and 500, respectively.
4.1. Network Convergence Analysis
In this subsection, the convergence results of network training are presented.
Figure 7a shows the combined loss of 2DMA-Net, the cycle-consistency loss, and the full loss of CycleGAN during the training process. It can be seen that the losses decrease gradually and remain unchanged from about 200 epochs, demonstrating the favorable convergence performance of 2DMA-Net and CycleGAN.
Figure 7b shows the discrimination probability curves of the discriminator D
Z on the true spectrum and the fake spectrum. The discrimination probability of 1 indicates that the discrimination results are true and the discrimination probability of 0 indicates that the discrimination results are fake. It can be seen that the discrimination probability of D
Z simultaneously increases to 1 on the true spectrum and decreases to 0 on the fake spectrum. The increasing capacity of the discriminator D
Z to distinguish between the true and fake spectrums indicates the increasing capacity of the generator G
YZ to map the low-accuracy clutter spectrum to the high-accuracy clutter spectrum, hence increasing the following CNCM estimation accuracy.
4.2. Clutter Spectrum Estimation
In this subsection, the clutter spectrum estimation results of the proposed DU-CG network are presented under different situations. For comparison, the results obtained via MMV-FOCUSS, MMV-FCSBL, and MMV-ADMM are also shown. As a reference, the MVDR clutter spectrum is calculated based on the theoretical CNCM.
First,
Figure 8 shows the estimation results using different methods in the ideal case, i.e., the case with the clutter ridge slope as 1, non-side-looking angle as 0, and no ICM or element amplitude/phase error. It can be seen that as the clutter has a high sparsity in the ideal case, these methods can all estimate the clutter spectrum accurately. As a module of DU-CG, the results obtained using 2DMA-Net have relatively low accuracy where the clutter ridge is broadened. However, as the clutter feature is clearly achieved, based on the output of 2DMA-Net, the CycleGAN in DU-CG can successfully obtain a high-accuracy clutter spectrum estimation.
Then,
Figure 9 and
Figure 10 show the estimation results in the non-ideal cases, where the clutter ridge slope is changed to 1.34 and the non-side-looking angle is changed to 16.5°, respectively. It can be seen that, as the clutter sparsity deteriorates in these two cases, the estimation accuracy of typical MMV-SR algorithms degrades significantly. The clutter ridges obtained by these algorithms are broadened and some significant interferences deviating from the clutter ridges are generated. 2DMA-Net can obtain the low-accuracy clutter spectrum estimation with clear clutter features, and thus, based on the output of 2DMA-Net, a high-accuracy clutter spectrum estimation can be obtained by CycleGAN, which is consistent with the reference. These results demonstrate that in the non-ideal cases, typical MMV-SR algorithms are seriously affected by the deteriorated clutter sparsity, whereas the proposed model-driven and data-driven DU-CG network can effectively overcome this problem and adaptively extract the clutter feature to obtain the high-accuracy clutter spectrum estimation.
Furthermore,
Figure 11 and
Figure 12 show the estimation results of DU-CG in the other two non-ideal cases, where the ICM is set as 0.5 m/s and the element amplitude/phase error is set as 0.14/3.83°. It can be seen that in the presence of ICM, the clutter is broadened along the Doppler dimension due to the temporal decorrelation problem, leading to the damaged clutter sparsity. Hence, typical MMV-SR algorithms will have decreased clutter spectrum estimation accuracy. In the presence of an array element amplitude/phase error, as the SR estimation model and the clutter sparsity are both damaged, the performance of typical MMV-SR algorithms degrades significantly. However, although the performance of 2DMA-Net also degrades in these two cases, the clutter feature is maintained. Then, as CycleGAN can reduce the width of the clutter ridge and suppress the discrete interferences in the space–time domain, a high-accuracy clutter spectrum closest to the reference can still be obtained by the DU-CG network.
Finally, all the above-mentioned non-ideal factors are considered, giving the results shown in
Figure 13, where the clutter ridge slope is 0.67, the non-side-looking angle is 15.50°, the ICM is 0.24 m/s, and the array element amplitude/phase error is 0.10/4°. The results show that in such a complicated case, the performance of typical MMV-SR algorithms degrades significantly, the clutter ridge feature distorts severely, and a lot of false peaks appear in the space–time domain. As the proposed DU-CG network can adaptively acquire the clutter features and filter out the interferences caused by non-ideal factors, an accurate estimation of the clutter spectrum is still achieved.
4.3. Clutter Suppression Performance
In this subsection, the clutter suppression performance of different STAP methods is compared using the SCNR loss as the indicator. Keeping the spatial frequency of the target as 0 and linearly varying its normalized Doppler frequency in the range
, the obtained results are shown in
Figure 14, where the subfigures (a)–(f), respectively, correspond to
Figure 8,
Figure 9,
Figure 10,
Figure 11,
Figure 12 and
Figure 13.
The comparison in
Figure 14a shows that, in the ideal case, the MMV-FOCUSS-STAP method and MMV-FCSBL-STAP method can achieve the best clutter suppression performance, whereas the proposed DU-CG-STAP method can obtain slightly worse suboptimal performance, which is better than the MMV-ADMM-STAP method. The comparisons in
Figure 14b,c show that the clutter suppression performance of typical MMV-SR-STAP methods degrades with the clutter sparsity deterioration, which is manifested by the broadened notch in the zero-Doppler region and the false notches deviating from the clutter ridge. The proposed DU-CG-STAP method can obtain a narrower clutter suppression notch and avoid false notches. The comparisons in
Figure 14d,e show that in the presence of ICM and array element amplitude/phase error, typical MMV-SR-STAP methods have significant SCNR losses in almost the entire Doppler frequency range, hence they will suppress not only the clutter but also the target, resulting in low target-detection performance. The proposed DU-CG-STAP method can form an effective suppression notch for the clutter and maintain the power for the target, hence it has a higher performance. The comparison in
Figure 14f shows that under conditions with all considered non-ideal factors, compared to typical MMV-SR-STAP methods, the proposed DU-CG-STAP method can still obtain a high clutter suppression performance that is close to the theoretical optimal STAP.
4.4. Computational Complexity Analysis
In this subsection, the computational complexity of the DU-CG network is analyzed and compared with MMV-FOCUSS, MMV-FCSBL, and MMV-ADMM. It should be noted that when applied, the computational complexity of DU-CG is the sum of those of 2DMA-Net and the generator G
YZ in CycleGAN. With the only difference in the iterative parameters, the computations of 2DMA-Net and the 2D-MMV-ADMM algorithm are the same. Thus, with the same number of network layers and iterations, 2DMA-Net and the 2D-MMV-ADMM algorithm will have the same computational complexity. Using the multiplication numbers as the indictor, the computational complexities of different algorithms are given in
Table 2.
According to
Table 2, the computational complexities of different algorithms under different conditions are shown in
Figure 15.
Figure 15a corresponds to the conditions of
,
, and the number of algorithm iterations or network layers
varying from 20 to 50 with a step of 5.
Figure 15b corresponds to the conditions of
varying from 5 to 30 with a step of 5 and the number of algorithm iterations or network layers
of MMV-FOCUSS, MMV-FCSBL, MMV-ADMM, and the DU-CG network as 200, 30, 2000, and 30 (which are determined considering their convergence performance). The comparisons show that the computational complexities of 2DMA-Net and the generator G
YX in CycleGAN are much lower than the other methods. Hence, the proposed DU-CG network can always obtain a faster convergence speed under different conditions.
4.5. Rationality of DU-CG-STAP
In the CycleGAN of DU-CG-STAP, the discriminator DZ learns the features of the high-accuracy clutter spectrum with the capacity to discriminate the true and fake spectra, which is continuously improved based on the unpaired dataset. At the same time, the generator GYZ is committed to mapping the low-accuracy clutter spectrum into the high-accuracy domain. If the low-accuracy clutter spectrum is provided with poor quality, it will be difficult for CycleGAN to extract the clutter features and complete the high-accuracy reconstruction task in the unsupervised training process. Hence, to illustrate the rationality of the DU-CG-STAP processing framework, the following results are provided.
In the proposed processing framework, the low-accuracy clutter spectrum dataset is generated by the self-supervised trained 2DMA-Net and used as the input data for CycleGAN. If the low-accuracy clutter spectrum dataset is generated by some low-accuracy and low-resolution methods, the high-resolution clutter spectrum reconstruction performance of CycleGAN will seriously degrade. For example, with the low-accuracy clutter spectrum obtained using the Fourier transform and the MVDR methods that were conducted on the raw radar data, the low-accuracy and high-accuracy mapping results of CycleGAN under different conditions are obtained and shown in
Figure 16a,b, where the same network scale and training process with the proposed method are used.
It can be seen in
Figure 16a that due to the high sidelobes in the Fourier clutter spectrum, the generator G
YZ of CycleGAN incorrectly extracts many high values, resulting in significant distortions of the clutter features. It can be seen in
Figure 16b that with a small amount of training range cells, the clutter ridge obtained by the MVDR algorithm broadens and some noises exist in the clutter spectrum. Hence, even though the generator G
YZ can extract the clutter features, it cannot effectively reduce the clutter ridge width and suppress the noisy spectrum component.
On the contrary, by generating the low-accuracy clutter spectrum dataset via the MMV-ADMM algorithm and the trained 2DMA-Net, the low-accuracy and high-accuracy mapping results of CycleGAN are obtained and shown in
Figure 16c,d. Since the clutter spectra obtained by these two approaches have no obvious sidelobe/noises and the features of the clutter ridge are clear, CycleGAN can filter out the interferences caused by non-ideal factors in the unsupervised training process so as to complete the high-accuracy clutter spectrum reconstruction task.