1. Introduction
Hyperspectral remote sensing technology has developed significantly over the past two decades [
1]. However, due to the low spatial resolution and the complex background except for the effects of the atmosphere, the mixed pixels are a common phenomenon in the hyperspectral image. As a result, when mixing occurs, it is not possible to determine the material directly at the pixel level. Nevertheless, thanks to the exceptional spectral resolution, hyperspectral imaging offers a great promise in mapping the abundance of particular species at the subpixel level and the species are called endmembers. Through hyperspectral unmixing (HU), we can identify the distinct signatures in a scene and its abundance fractions in each pixel in order to enable applications such as precision classification and target detection at the sub-pixel level as well as risk prevention and response [
2].
The unmixing procedure comprises two main steps, endmember determination, and abundance estimation. It is a common knowledge that the accuracy of the unmixing result depends highly on the accuracy and completeness of the extracted endmembers. The most important source of error in unmixing can be classified into three aspects including nonlinearities (NL) effects, mis-modelling effects (MEs) or outliers, and endmember spectral variability (ESV) [
3]. NL is associated with multiple scattering effect that appears in presence of terrain relief and/or trees. Mis-modelling effects result from the presence of some physical phenomena such as NL or ESV. The propagated errors in the signal processing chain such as the wrong estimation of endmember number or their spectra can also result in bad abundance estimates. Additionally, the choice of endmember number drastically alters the representation of the image scene and, therefore, it is a crucial step for the endmember identification and the subsequent abundance estimation [
3,
4,
5]. The last one is spectral variability. In this paper, we focus on reducing spectral variability’s effect on the unmixing result. The spectral variability of endmembers can be caused by variable illumination and environmental, atmospheric, material aging, object contamination, and other conditions [
6,
7,
8]. The major source of ESV results from variation due to illumination conditions and changing atmospheric conditions. Changes in illumination can result from a variation in topography and surface roughness that leads to varying levels of shadows and brightly lit regions. Atmospheric gases have strong absorption features or scattering characteristics that modify measures and spectral signatures when they affect the downward and upward transmittance of radiation from the sun to the surface measured and from the ground surface to the hyperspectral sensor. The radiance or reflectance of a material can also significantly change because of the intrinsic variability of a material due to the variation of a hidden parameter (e.g., concentration of chlorophyll in vegetation). Additionally, multiple scattering and intimate mixing contribute to the material’s radiance or reflectance in a nonlinear fashion [
9,
10]. There are two types of spectral variability. These include the variability within an endmember class called intra-class variability and the similarity among the endmember class called interclass variability [
11]. In particular, the accuracy of subpixel abundance estimation linearly decreases with intra-class variability. This is reasonable since the higher the intra-class variability, the higher the likelihood that the actual spectral features of the endmembers in a specific pixel deviate from the fixed endmembers used in the mixing model. On the other hand, the similarity among the endmembers (e.g., crops and weeds) results in a high correlation among the endmember spectra, which, in turn, leads to an unstable inverse matrix and a dramatic drop in the estimation accuracy [
7]. Therefore, to circumvent this problem, reducing or eliminating the spectral variability of endmembers is an effective approach for improving the accuracy of unmixing.
Currently, automated endmember extraction algorithms such as vertex component analysis or minimum volume simplex analysis [
12,
13] fail to cope with the effect of spectral variability. In addition, quite recently, considerable attention has been paid to dealing with spectral variability so that the accuracy of sub-pixel abundance estimations can be improved significantly [
14]. To reduce the effect of spectral variability on the unmixing result, there are five basic principles [
7] including (1) the use of multiple endmembers of each component during an iterative mixture analysis procedure. One of the first attempts to address spectral variability was made by Roberts who proposed a multiple endmember spectral mixture analysis [
15]. It allowed the types and number of endmembers to vary on a per-pixel basis. Although iterative mixture analysis cycles produced good results, the computational complexity of the approach is a major drawback when dealing with hyperspectral data [
7]. Furthermore, the premise of these endmember variability reduction techniques is the availability of a spectral library containing representative instances of all endmembers present within the scene. As such, the library should be able to model the spectral variability of the endmembers. Therefore, an automated extraction of image-based endmember bundles and a new extended linear mixing model are subsequently proposed to address spectral variability [
14,
16]. (2) Another principle includes the selection of feature bands [
17]. An empirical spectral feature selection technique is Stable Zone Unmixing proposed by Somers. In this approach, the wavelength sensitivity towards spectral variability was assessed using the InStability Index (ISI) [
8]. However, it did not consider the high correlation among adjacent wavebands in hyperspectral imagery. (3) A third principle includes the spectral weighting of bands. Instead of assuming equally important effects in each band, Chang proposed a method of spectral weighting to emphasize the effect of differences among spectral bands and results demonstrated that it can perform better than traditional unweighted approaches [
18]. (4) The spectral transformation is another feature. Spectral transformations have been proposed to reduce the effect of endmember variability such as Derivative spectral Unmixing (DSU) [
19] and Normalized Spectral Mixture Analysis (NSMA) [
20]. The derivative can be used from the first derivative to the fourth one. Nevertheless, derivative analysis errors tend to increase with high-frequency noise. (5) The fifth principle involves the use of radiative transfer models. The spectral model uses a radioactive transfer model to generate endmember libraries while referencing a radiative transfer model is usually designed to solve a specific scene. Thouvenin et al. indicated that the spectral variability can be represented using a perturbed linear mixing model (PLMM) where the spectral variability is explained by an additive perturbation term for each endmember [
5]. However, this model lacks physical meaning. On the other hand, Hong et al. then introduced a spectral variability dictionary to account for any residuals that cannot be explained by the linear mixing model [
21]. Given the analysis and comparison of all the methods mentioned above, a method combining band selection and spectral weighted matrix needs to be proposed to reduce the effect of spectral variability so that unmixing accuracy can be enhanced.
This paper intends to present an alternative unmixing method to address the ground component or endmember similarity problem. In this paper, we propose a compact but accurate hyperspectral unmixing method. To reduce the spectral variability’s effect on unmixing results, this method induces a weighed matrix in the approximation problem, which can be incorporated with the algorithm of sparsity. In addition, to improve the performance in ill-conditioned data and also reduce the risk of getting stuck in local minima in non-convex alternating minimization computations, a hierarchical sparsity unmixing method can be feasible for addressing endmember variability in the hyperspectral image. Lastly, we can obtain a more accurate endmember matrix as well as abundance map.
The research is presented in five sections.
Section 2 is the theoretical background and the proposed method.
Section 3 provides data acquisition for the experiment.
Section 4 and
Section 5 report the experimental results and discussion and give suggestions for future research, respectively. Finally,
Section 6 concludes the paper.
4. Experimental Results and Analysis
To illustrate the accuracy of unmixing results, spectral measurements are used in this paper. The results are evaluated using the spectral angle distance (SAD), the abundance angle distance (AAD), and the root mean squared error (RMSE).
SAD is defined by the equation below.
where
mi denotes the ith endmember spectrum, 〈
mi,
mj〉 denotes the inner product of the two spectra, and
denotes the vector magnitude. It is used to compare the similarity of the original pure endmember signatures and the estimated ones.
AAD is used on condition that the abundance fractions are known as prior knowledge. It measures the similarity between original abundance fractions (
si) and the estimated one (
), which is formulated in the equation below.
RMSE is denoted by
where
N denotes the total number of pixels, l denotes the total number of spectral bands, and
denotes the error of the
ith row and the
jth column between the original and simulated image data. It is used to evaluate the abundance estimates’ error. Since the model errors are likely to have a normal distribution rather than a uniform distribution, the RMSE is a good metric to present model performance [
34].
4.1. Simulated Data Experiments
Providing the sub-pixel fraction abundance map of the image’s components is nearly impossible, expensive, and time-consuming. However, the simulated data makes it possible to preset the endmember set and the abundance fraction map as prior knowledge.
In this paper, several scenarios and patterns have been designed to evaluate the performance of the proposed method. In addition, different signal-to-noise ratios have been employed to generate the simulated images.
4.1.1. Synthetic Data Experimental Results
In the first experiment, the endmembers are rice, corn, and weed and the SNR is set to 20. The layer number is 5 and the maximum iteration times in each layer is 1000. Furthermore, we set
Figure 8 shows the unmixing results of the proposed method and the non-weighted L
1/2 sparsity constrained NMF (L
1/2S-NMF) method [
31]. Among them, the first column is the true endmembers spectra and the second column is the estimated endmembers spectra by the proposed method. The third column is the original abundance map and the last two columns are the estimated abundance map by the proposed method and comparison method, respectively. The abundance fraction map is denoted using a color bar showing color scale. The blue color shows the smallest fraction near zero and the red color shows the highest fraction near one. The variation of the color bar is consistent with the natural spectral wavelength change. Through the comparison, it can be seen that the proposed algorithm performs better than the comparative method especially in the weed abundance fraction estimation. This is because the similarity among the crops and weed is difficult to distinguish by the traditional method, which the crops and weed cannot be the vertices of any convex simultaneously.
4.1.2. Influence of SNR
The second experiment presented there aims at evaluating the performance of the methods with different SNR. The SNR varies from 20 to 60. The endmember number is 6.
Figure 9 is the SAD and RMSE accuracy assessment results of the proposed algorithm, NMF, and L
1/2S-NMF. Since the scenarios and patterns of the simulated datasets are randomly generated, it is a good approach to evaluate the goodness and stability of unmixing results by different methods.
From
Figure 9, with the increase of the SNR, there is also a decrease of the estimated RMSE errors for all methods while the proposed method shows clear advantages. It should be noted that in
Figure 9a, there is a small increase of SAD where the SNR is 50. That is due to the fact that the simulated image abundance is a random series that may produce highly mixed data. However, under this condition, the proposed method still outperforms more than other approaches. Therefore, combining intra-class scatter matrix information with hierarchical sparsity constrained NMF, the proposed algorithm can obtain a good performance in hyperspectral unmixing.
4.1.3. Influence of Layer Number and Endmember Number
To verify how the layer and endmember number affect the proposed method in hyperspectral unmixing accuracy, we experiment with various layers and endmembers under the same fixed initial condition. In this experiment, the endmember number is set from 4 to 12 and the number of layers varies from 1 to 16. It should be noted that the endmember number is known as prior knowledge and the other parameters such as SNR and the total number of iterations keep invariant than previously mentioned. The initial endmember and abundance matrix are extracted by VCA-FCLS, which are the input of the proposed method with different layers. Therefore, with the fixed dataset and initial input parameters, the influence of layers and endmember number on unmixing results can be observed. It is worth mentioning that, when the layer number is one, it can be approximately regarded as the single layer sparsity-based NMF method. We also make a comparison with L
1/2S-NMF. The parameter of L
1/2S-NMF is set the same as mentioned in the literature [
31]. The results are shown in
Table 1. The best accuracy results with different endmember numbers are in bold fonts. From the table, it can be inferred that the best choice of the layer number setting is approximately linear with the endmember number. The AAD accuracy is increased with the layer number. It is true that the error of the hierarchical system may be accumulated layer by layer. However, the AAD error can still be controlled to a degree. The proposed method still outperforms when the method is compared against various endmember numbers.
4.1.4. Influence of Iterations
In this experiment, we intend to find the number of iterations’ effect on final unmixing accuracy. We still fix the initial endmember set. The endmember and layer number are set to 8 and 10, respectively. The iteration in each layer varies from 100 to 3000. Additionally, the other parameters such as lambda or SNR keep invariant. The accuracy results are shown in
Table 2. As expected, the best performance regarding of AAD and SAD is obtained with the larger iteration number. Note that there exists a balance between the maximum layer number and accuracy. When the number of iterations ranges from 100 to 200, the accuracy increases drastically. When the number ranges from 400 to 1000, the accuracy increases linearly. Whereas when the iteration number is larger than 1000, the rate of convergence slows down and becomes flat. This is due to the fact that the algorithm converges gradually and has reached the stopping criteria before getting the maximum iteration number. Generally, the number of iterations in each layer is set to 2000.
4.1.5. Time-Consuming with Layer Number
Finally, to conclude this section, we emphasize the computation complexity of the hierarchical sparsity unmixing method considered in this study. For the proposed method, the computational complexity is approximately given by the product of the computational complexity of each layer and the number of iterations. In this experiment, there are 58 × 58 pixels and the endmember number and SNR are set to 8 and 25, respectively. Furthermore, the regularization parameters and the whole iterations keep invariant. The time-consuming results versus the layer number is shown in
Table 3. Through the results, it can be concluded that, as layer numbers increase, better performance in saving time is shown. This is due to the hierarchical strategy breaking the large matrix into two parts. In the traditional NMF with single layer, the computational cost in one iteration is
. However, in the hierarchical strategy, the computational cost in one iteration of the second or latter layer is
. Additionally, the value of p is less than L. Consequently, it can be inferred that the performance of the hierarchical algorithm in the saving time-consuming process is substantially better than the traditional single layer NMF.
To summarize, the results mentioned above confirm the satisfactory performance of the proposed method in terms of unmixing quality and computational time. The best results are obtained with a proper layer number. Additionally, the proposed method is robust to the different conditions that can affect hyperspectral images.
4.2. Real Hyperspectral Data Experiments
In this section, we use two different scenarios to verify the effectiveness of the proposed method. The images are collected in Jiangxi captured by Pika XC2 imaging of Resonon. We choose 90 pixels with 30 pixels for each endmember directly from the image. Then an average of the observations in each endmember class is computed, which serves as the reference endmember spectra. Additionally, each endmember spectral class can be served as a weighted matrix to reduce endmember variability effect. The size of the test image is 400 × 400 × 372.
Figure 6 shows the other study site of Jiangxi. There are tidal flats, tree, and water in the scene. From the image, it should be noted that on the top right of the spectrum, the shadow among the tree canopy is similar to the water spectrum to some degree. However, in the near-infrared wavelength, the shadow spectrum reserves the property of red edge, which is unique for plants. In other words, except for the similarity of interclass endmember spectra, there also exists intra-class spectral variability. Therefore, the spectral variability of both interclass and intra-class increase the difficulty of hyperspectral unmixing. In this paper, we use the intra-class scatter matrix weighted constraint to decrease the intra-class endmember variability and enlarge the interclass spectral variability. The endmember and layer number is set as 3 and 5, respectively. The iteration in each layer is set to 1000.
Figure 10 is the unmixing results of the proposed method and the traditional method. The compared methods are geometric-based VCA with fully constrained least square (FCLS) [
33], minimum volume constrained non-negative matrix factorization (MVC-NMF) [
35], and sparsity constrained non-negative matrix factorization (L
1/2S-NMF) [
31]. Since the weeds and crops have highly similar diagnostic spectral features, the rice, weed, and corn in
Figure 10 are not the vertices of any convex set. Therefore, they cannot be endmember sets simultaneously in the geometric simplex [
22]. This is why the traditional unmixing method such as VCA-FCLS and MVC-NMF cannot distinguish them effectively. Note that in
Figure 10c,d, the wrong estimation of weed and corn is good evidence to illustrate this phenomenon. On the other hand, the proposed method takes the intra-class endmember variability into account and through the difference within endmember spectral, we make a weighting for each band. Furthermore, a hierarchical sparsity strategy is also used to search the optimal global solution. Then we can obtain a better result than the traditional one especially in distinguishing the corn and weed.
From
Figure 11, it can be seen that, in the tree canopy and beach abundance fraction maps, the proposed method estimates better than the traditional method, which is more close to reality. On the water abundance map, all the methods have little error that the shadow of the tree canopy shows the higher faction in these maps. This is due to the fact that the shadow spectra show lower magnitude, which is similar to the water feature. However, it is worth mentioning that the proposed method has a better ability to circumvent the shadow effect than the traditional approach to distinguish these spectra.
Table 4 and
Table 5 illustrate the mean SAD and RMSE accuracy of different algorithms. The lower SAD and RMSE shows the better estimation of reconstructed data. Both the results show that the proposed method can obtain the best result of all.
Based on the real hyperspectral image captured by Pika XC2 imaging of Resonon, the most of the endmember variability can be interpreted by a variation in illumination, which is mainly located in vegetation areas and among trees. Experimental results show that it is feasible to combine the spectral weighted matrix with hierarchical strategy in reducing the spectral variability effect on unmixing accuracy. Furthermore, the proposed method also outperforms the compared methods.
4.3. Potential Applications
In this experiment, we use the classic hyperspectral image, named Data 3, to illustrate the performance of unmixing methods. The image is captured by Airborne Visible-Near infrared Imaging Spectrometer (AVIRIS) hyperspectral sensor, which is shown in
Figure 12. It is located at Cuprite, Nevada USA. The original data has 512 × 614-pixel with 224 bands. The wavelength ranges from 400 nm to 2500 nm, full width at half maximum of 10 nm, and the spatial resolution is 20 m. We use the sub-image data with size 250 × 191. After removing low SNR and water-vapor absorption bands (1–2, 104–113, 148–167, and 221–224), 188 bands are remained. The situation is located in Cuprite Nevada, which is a mineralogical site that has been established as a reference site for hyperspectral and other remote sensing instruments [
36,
37,
38]. Cuprite is an arid and sparsely vegetated place along the California-Nevada border near Death Valley. It contains sulfates (K-Alunite, Jarosite, etc.), Kaolinite group clays (Kaolinite, Dickite, etc.), Carbonates (Calcite, Calcite+montmorillonite), Clays (Na-Montmorillonite), and other minerals. Since the spectral variability matrix is hard to obtain from the image, we define the weighted matrix as the unit matrix. According to the reference [
31], we set endmember and layer number as 10 and 8, respectively, and part of unmixing results are shown in
Figure 13. The blue solid lines illustrate the original spectra from the USGS library and the orange dotted lines are spectra extracted by the proposed method. Clearly, the endmember spectra extracted by the proposed method are in good accordance with USGS library.
Table 6 and
Table 7 show the SAD and RMSE accuracy generated by the proposed method and compared ones, which are L
1/2S-NMF and VCA, respectively. It can be seen that the proposed method can extract better endmember and abundance estimation results versus the traditional approaches.
Results from Cuprite data confirms that the proposed method can extract different endmember signatures in the observed scenes accurately without weighted spectral variability matrix.
5. Discussion
As summarized in published research [
39,
40], hyperspectral unmixing approaches can be categorized into geometric, statistical, and sparse regression-based algorithms. To some degree, the proposed method is closely linked to the minimum volume method through its geometric interpretation, which is easy to understand. It is intuitive because if the volume of simplex becomes large, the observed pixels become less sparse [
31]. Additionally, through the abundance sparsity constraint, the abundance maps can be more accurately recovered.
From the experimental results, several issues should be noted. First, endmember spectra from the field can be controlled well and measured accurately, but they may not match those in the image due to differences in sensors, atmospheric effects, and illumination conditions. This is the reason why it is impossible to obtain the same high accuracy result of simulated data and real hyperspectral data. Furthermore, linear mixing model excludes the nonlinear effects caused by the complex canopy structures and possible multi-reflection effects. The combination of the intra-class scatter matrix endmember spectra in a single analysis improved the accuracy of similar terrain covering endmember spectral extraction and abundance estimation in each scenario.
Second, for all scenarios and all noise levels, the traditional linear unmixing methods provided the least accurate abundance fraction estimates. As commonly accepted, this is due to the high similarity between the weed and crops spectra, which is evident from
Figure 7 and intra-class spectral variability. This results in unstable model inversion. Using a weighted matrix makes it possible to enlarge the separation between endmember spectra and shrinks the variances of residuals. On the other hand, the selection of lambda is also an open theoretical issue. In the research [
41], it sets the lambda value based on the sparseness criteria, which is highly dependent on the sparsity of the material abundances. In addition, the method is widely used in other unmixing methods [
31]. However, in this paper, we refer to the research [
32] to define the value of lambda. Additionally, through the experimental results, it outperformed traditional sparsity criteria in the hierarchical sparsity constraint unmixing method.
Third, it is an open theoretical issue to prove mathematically or explain more rigorously why the hierarchical or multilayer distributed NMF system results in considerable improvement in performance and reduces the risk of getting stuck in local minima. An intuitive explanation is that the multilayer system provides a sparse representation of basis matrices
Mi. In other words, even a true basis endmember matrix is not sparse, it can still be represented by a product of a sparse factor. Furthermore, experiments have shown that using hierarchical sparsity strategy can considerably improve the performance especially if a problem is ill-conditioned or data are badly scaled and projected gradient algorithms are used [
32].
Fourth, the fittest layer number to obtain the best accuracy results needs to be further researched. Take the
Section 4.1.3 simulated experiment with specific 10 endmembers as an example. We have conducted the experiments at least 120 times with the weighed matrix as the unit matrix. In each synthetic data, the best SAD accuracy with layer number was counted. From the result, it indicated that the fittest layer number for the SAD result might be 12, which has got 24 times among the 120 kinds of synthetic data. Then it might be 22, 30, and 16. All of them are larger than the endmember number. It seems that the fittest layer number are more likely larger than the endmember. However, it should be noted that when the layer number is too big, it may raise errors in abundance estimation, which is mentioned in
Section 4.1.3. On the contrary, we also conducted the similar experiment with eight endmembers. And compared with 10 endmembers, there is a big probability to get the best results in eight and five layers, which are equal to or a somewhat smaller than the endmember number. The other relationship between the endmember number and the layer number is analogous to this condition. Furthermore, the results of the other layer number are also acceptable even though they are not the optimal solution. Therefore, when it comes to determining the layer number, a value ranging from 0.5 to 2 times of the endmember number may be a good choice.
Fifth, when it comes to selecting a methodology to apply, an important aspect that end-users always take into account is how feasible it is to quickly apply the methodology, how many input parameters are involved, how difficult it is to set them to obtain appropriate results, how much prior knowledge is needed in order to run each technique, and what the main caveats are when running the algorithms [
7]. This paper uses the field or laboratory spectra to estimate the similarity endmembers’ abundance and the model is easy to understand. If it is hard to obtain field spectra for establishing a weighted matrix of endmember variability, this information could be directly extracted from the image. Through the visual observation and prior knowledge, we can choose the specific area of the image to extract spectra as a reference spectral matrix. Note that the endmembers extracted from the image have an advantage over library or field endmembers because they are collected under nearly the same conditions as the image. In this way, we can both obtain the reference endmember matrix and the weighted matrix. Furthermore, the input parameters are also simple to implement. The effectiveness of the proposed method has been verified by the simulated and real hyperspectral image and the application can be expanded to monitor glacier or other applications. Superior to the supervised algorithm, the proposed method only uses the partial intra-class endmember similarity information to make a weighting on each band, which will enlarge the separation of similar spectra and to obtain better unmixing results. For another, if it is hard to obtain the intra-class endmember similarity information especially if there’s no prior knowledge at all. Yet, we can reset the weighted matrix as the unit matrix. Furthermore, the hierarchical non-negative matrix factorization has been proven effective and applied on signal processing and other areas [
32]. Therefore, the proposed method can be applied on the other dataset. In addition, we intend to further stimulate efforts to merge the different conceptual approaches—iterative mixture analysis cycles, feature selection, spectral transformations, and spectral model—to reduce endmember variability effect and enhance unmixing accuracy.
6. Conclusions
Spectral variability has been under-researched in recent years. Experimental data were used to familiarize readers with the positive effects that endmember spectral variability reduction techniques can have on subpixel fraction estimate accuracy. However, in some environments or for some applications, similarities of different endmembers of interest provides an additional difficulty for obtaining accurate estimation or classification results. The weighted matrix makes it possible to emphasize the different bands in distinguishing similar endmembers. In addition, five basic principles aiming to reduce the spectral variability’s effect should also be considered. The integration of different methods is a prerequisite for the effective mitigation of endmember spectral variability.
This research took spectral variability into consideration to unmix hyperspectral images. We used field-measured or lab-measured spectra as prior knowledge and computed the within-class scatter matrix as a weighted matrix of the image. In contrast with previous supervised approaches, the proposed method only used the prior knowledge as each band’s weighted matrix rather than a fixed endmember input. In addition, the hierarchical sparsity is also considered and aims to obtain better unmixing results. Therefore, it is feasible to examine the suitability of unmixing results by comparing the different methods.
To verify the proposed algorithm, experiments with simulated and real-world hyperspectral image data were conducted. The spectral curves and abundance map results confirmed the effectiveness of hyperspectral unmixing (HU) particularly in the presence of noise corruption, low purity levels, and similar endmember conditions. Considering the RMSE, SAD, and AAD of the obtained abundance maps, the proposed HWSU method outperformed VCA-FCLS, MVC-NMF, and L1/2S-NMF and was compatible with the simulated and in situ observations.