Nothing Special   »   [go: up one dir, main page]

Next Article in Journal
Monitoring Wheat Fusarium Head Blight Using Unmanned Aerial Vehicle Hyperspectral Imagery
Previous Article in Journal
Technical Solution Discussion for Key Challenges of Operational Convolutional Neural Network-Based Building-Damage Assessment from Satellite Imagery: Perspective from Benchmark xBD Dataset
You seem to have javascript disabled. Please note that many of the page functionalities won't work as expected without javascript enabled.
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

An Unsupervised Urban Extent Extraction Method from NPP-VIIRS Nighttime Light Data

1
School of Earth Sciences, Zhejiang University, Hangzhou 310027, China
2
Zhejiang Provincial Key Laboratory of Geographic Information Science, Hangzhou 310028, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2020, 12(22), 3810; https://doi.org/10.3390/rs12223810
Submission received: 13 September 2020 / Revised: 2 November 2020 / Accepted: 17 November 2020 / Published: 20 November 2020
(This article belongs to the Section Urban Remote Sensing)
Figure 1
<p>Study area: (<b>A</b>) The spatial distribution of 25 evaluation areas; (<b>B</b>) 1 km grid population data of China in 2015; (<b>C</b>) 1km grid gross domestic product (GDP) data of China in 2015; (<b>D</b>) The 2015 NPP-VIIRS NTL data of China; (<b>E</b>) The 2015 normalized difference vegetation index (NDVI) data of China; (<b>F</b>) The nighttime land surface temperature (LST) data of China in 2015.</p> ">
Figure 2
<p>The workflow of the proposed method.</p> ">
Figure 3
<p>Average silhouette coefficient versus number of clusters.</p> ">
Figure 4
<p>Comparison of NTL image filtering results: median filtering and mean filtering.</p> ">
Figure 5
<p>Eight filtering direction templates.</p> ">
Figure 6
<p>Extraction accuracy of urban extent under different neighborhood sizes: on the left is the different neighborhood sizes of the first stage (fixed neighborhood radius 4 in the second stage); on the right is the different neighborhood sizes of the second stage (fixed neighborhood radius 2 in the second stage).</p> ">
Figure 7
<p>Urban extent extraction results: (<b>a</b>) Landsat8 OLI Images of selected cities, (<b>b</b>) VIIRS NTL images of selected city, (<b>c</b>) the urban extent extracted using our method, (<b>d</b>) the urban extent extracted using local optimal threshold (LOT), (<b>e</b>) the urban extent extracted using INNL-SVM.</p> ">
Figure 8
<p>Comparing the urban pattern and spatial details between our results and other products (<b>a</b>) Landsat8 OLI Images (30 m), (<b>b</b>) the global artificial impervious area (GAIA) (30 m), (<b>c</b>) our results (500 m), (<b>d</b>) the global urban land based on a normalized urban areas composite index (NUACI-based GUL) (30 m), (<b>e</b>) the MODIS 500 (500 m), (<b>f</b>) the global urban expansion product based on a fully convolutional network (FCN) (500 m).</p> ">
Figure 9
<p>Spatial Comparison of the results different methods for Beijing: (<b>a</b>) the inner part of urban; (<b>b</b>) the edge district of urban; the yellow box is the area zooming in of the google image.</p> ">
Figure 10
<p>Urban expansion from 2013 to 2018 for Shanghai and Lanzhou.</p> ">
Figure 11
<p>Growth of urban areas from 2013 to 2018.</p> ">
Figure 12
<p>The spatial distribution of nighttime light intensity in Lhasa and Urumqi (yellow and red boxes indicate areas zoomed in column 3 and 4, respectively).</p> ">
Figure 13
<p>Analysis of overestimation urban area for Ningbo: (black box is the overestimation area zoomed in column 1, which was indicated by “A” in text).</p> ">
Figure 14
<p>The Sensitivity analysis of parameters: (<b>a</b>–<b>c</b>) were the results of all parameter combinations; (<b>d</b>) was the results withR2 = 4; (<b>e</b>) was the results with R1 = 2; (<b>f</b>) was the results with N = 1.</p> ">
Review Reports Versions Notes

Abstract

:
An accelerating trend of global urbanization accompanying various environmental and urban issues makes frequently urban mapping. Nighttime light data (NTL) has shown great advantages in urban mapping at regional and global scales over long time series because of its appropriate spatial and temporal resolution, free access, and global coverage. However, the existing urban extent extraction methods based on nighttime light data rely on auxiliary data and training samples, which require labor and time for data preparation, leading to the difficulty to extract urban extent at a large scale. This study seeks to develop an unsupervised method to extract urban extent from nighttime light data rapidly and accurately without ancillary data. The clustering algorithm is applied to segment urban areas from the background and multi-scale spatial context constraints are utilized to reduce errors arising from the low brightness areas and increase detail information in urban edge district. Firstly, the urban edge district is detected using spatial context constrained clustering, and the NTL image is divided into urban interior district, urban edge district and non-urban interior district. Secondly, the urban edge pixels are classified by an adaptive direction filtering clustering. Finally, the full urban extent is obtained by merging the urban inner pixels and the urban pixels in urban edge district. The proposed method was validated using the urban extents of 25 Chinese cities, obtained by Landsat8 images and compared with two common methods, the local-optimized threshold method (LOT) and the integrated night light, normalized vegetation index, and surface temperature support vector machine classification method (INNL-SVM). The Kappa coefficient ranged from 0.687 to 0.829 with an average of 0.7686 (1.80% higher than LOT and 4.88% higher than INNL-SVM). The results in this study show that the proposed method is a reliable and efficient method for extracting urban extent with high accuracy and simple operation. These imply the significant potential for urban mapping and urban expansion research at regional and global scales automatically and accurately.

1. Introduction

Urbanization is speeding up the consumption of natural resources, energy, and altering land use and cover, which is closely related to almost all aspects of global change [1,2]. The rapid expansion of urban areas has brought about various environmental and urban issues, such as deforestation, farmland decrease, ecological damage, air pollution, water shortage, climate change, traffic congestion, massive and chaotic urban settlements and mismatched infrastructure in recent years [3,4]. Accurate and consistent information on the distribution and extent of urban areas is fundamental to understanding urbanization dynamics, evaluating the impacts of urbanization on environments, and addressing the issues mentioned above [5,6]. How to accurately and timely update urban extent has become a compelling topic.
Remote sensing technology provides an effective and alternative approach to detect urban extent for repeated surveys of urban growth [7,8]. Many methods and analysis have carried out in urban mapping and urbanization using remote sensing. Nevertheless, there are still some limitations. Of particular concern is how to obtain urban extent at large scale effectively and timely [9]. Extensive research has shown that the nighttime light (NTL) data, capturing information on human activities, can be an effective, economic, and straightforward data source for extracting urban extent [10,11,12,13]. NTL data has advantages in urban extent extraction at large spatial scales or high temporal frequencies [14], because it has appropriate spatial and temporal resolution, free access and global coverage [15]. The most commonly used NTL data is from the Defense Meteorological Satellite Program—Operational Linescan System (DMSP-OLS), which has a relatively long time series released from 1992 to 2013 [16]. A considerable research has grown up around the way to obtain urban extent using DMSP-OLS NTL data. An alternative NTL data provided by the Visible Infrared Imaging Radiometer Suite Day/Night Band carried by the Suomi National Polar-orbiting Partnership (NPP-VIIRS) is available from April 2012 to present [17]. It has a better performance with finer spatial resolution (500 m), lower radiance detection limits, wider radiometric range, and on-board calibration, which significantly reduces the saturation and over-glow problem inherent to DMSP-OLS NTL data [18]. Shi et al. [19] evaluated the potential of NPP-VIIRS NTL data for extracting urban extent and found that the urban extent extracted from NPP-VIIRS NTL data has higher spatial accuracies than that extracted from DMSP-OLS NTL data. In recent years, there has been an increase interest in extracting urban extent using NPP-VIIRS NTL data. However, the research on methods has tended to focus on evaluating the applicability of the methods designed for DMSP-OLS NTL data to NPP-VIIRS NTL data [20], rather than developing methods based on the characteristics of NPP-VIIRS NTL data.
The vast majority of previous research on urban extent extraction methods based on NTL data has focused on threshold techniques and supervised classification techniques. The threshold-based method is most popular because of its simplicity [17]. It selects the appropriate threshold with the auxiliary information, and set pixels greater than the threshold as urban areas and the rest as nonurban areas according the distribution law of the nighttime light radiation [19]. The way of determining the threshold value categorized into three types including empiricism-based [21], mutation-based [11], and reference-based [22,23] methods. The empirical threshold method requires certain expert knowledge or experience to determine the threshold, which is easy to be subjective, resulting in poor accuracy and reliability. The mutation threshold method believes that the real urban areas is composed of complete patches, and the threshold point is the mutation point of patch rupture. The obvious drawback is that it is not suitable for complex urban structures, such as polycentric cities. The reference threshold method utilized statistical data or higher resolution remote sensing images as reference data [19] and takes the pixel value with the minimum urban area difference between the extracted results and the reference data as the threshold for a city. Among them, the threshold method based on high-resolution remote sensing images has better accuracy and reliability, and is widely used in urban extent extraction. It is not appropriate to apply a global threshold for large areas, because thresholds of urban extent extraction varies with socioeconomic development levels [24]. The local optimal threshold (LOT) method has been used in recent studies, which estimates optimal thresholds for each city or tile at a regional scale based on the relationships between nightlight magnitude in NTL data and urban morphology interpreted from high-resolution images [9,18,24,25]. Nevertheless, these relationships and parameters change across regions and vary by time, which requires a large amount of work on interpreting from high resolution images [14]. Unlike the threshold-based method, the supervised classification method regards the process of urban extraction from NTL data as the problem of classifying urban and non-urban pixels [26]. The supervised machine learning has been applied to classify urban and non-urban pixels in NTL data with training set [27]. Cao et al. [26] developed a support vector machine (SVM)-based to extract urban areas from NTL data achieving comparable results to the LOT method. Xu et al. [28] demonstrated that ANN could provide an effective and accurate alternative in extracting urban built-up areas from NTL. Liu et al. [29] explored the effectiveness of commonly used machine learning methods for urban extent extraction from NTL data, including random forest (RF), gradient boosting machine (GBM), neural network (NN), and their ensemble (ESB), and the results showed that these machine learning methods can achieve similar high accuracies. He et al. [30] proposed a fully convolutional network (FCN) to extract urban land using NTL data and employed it to detect global urban expansion from 1992–2016. The supervised classification method avoids the cumbersome process of threshold selection and raises the automaticity to extract urban extent from NTL data [26]. However, it is difficult to label enough urban pixels. The extracted results using the supervised classification are sensitive to the quality of samples [31].
The previous methods rely highly on the auxiliary data or urban training samples, which largely increases the uncertainty of the results and limits its application in practice, especially for NPP-VIIRS NTL data with higher spatial resolution and radiometric resolution. We attempted to develop an automatic method based on unsupervised methods to improve the efficiency of NTL data in urban mapping. The new method, designed for NPP-VIIRS NTL data, assigned a label to a pixel-based on the light intensity of the pixel and the pixels that are spatially close to it. Clustering is an unsupervised learning method widely used for classification and segmentation of remote sensing images, which was used in this study [32]. Twenty-five cities with different levels of the natural environment and economic development in China were selected as evaluation areas. The urban extent derived from the Landsat8 OLI image was used as the reference for accuracy assessment, and the most commonly used methods the local-optimized threshold method (LOT) and the integrated night light, normalized vegetation index, and surface temperature support vector machine classification method (INNL-SVM) [33] were used to compare, the validity, reliability, and robustness of our method are verified.

2. Study Area and Data

2.1. Study Area

This study took China as the experimental area because China has experienced rapid urbanization in the last four decades and it has a vast territory with diverse natural conditions and an uneven geographical pattern of socioeconomic development (Figure 1). To evaluate the effectiveness and robustness of the proposed method for urban extent extraction using NPP-VIIRS NTL data, 25 representative cities, located from east to west and from north to south across China, were selected as evaluation areas. The populations of these selected cities varied from 220 thousand to over 21 million and the gross domestic product (GDP) ranged from approximately 20 billion RMB to over 2400 billion RMB (Table 1). These cities, with various urban sizes, diverse physical environments, and different economic and social development levels, can assess the applicability of the proposed urban extent extraction method for different urbanization levels well.

2.2. Data

(1)
Remote Sensing Data
The Earth Observations Group (EOG) is producing a version 1 suite of average radiance composite images using nighttime data from the Visible Infrared Imaging Radiometer Suite (VIIRS) Day/Night Band (DNB). The version 1 series of monthly composites and annual composites is now publicly available through the Payne Institute for Public Policy at the Colorado School of Mines (https://payneinstitute.mines.edu/eog/). The products are 15 arc-second geographic grids, spanning the globe from 75N to 65S in latitude with six tiles. The DN unit is nanoWatts · cm 2 · sr 1 . The geographic coordinate system is WGS-1984. Prior to averaging, the NTL data is filtered to exclude data impacted by stray light, lightning, lunar illumination, and cloud-cover. The monthly composites have not been filtered to screen out lights from aurora, fires, boats, and other temporal lights, while the annual composites have layers with additional separation, removing temporal lights and background (non-light) values. We downloaded the 2015 annual NPP-VIIRS NTL composite data and made an outlier removal process to filter out moonlight reflected from water by setting water pixels value to zero.
The Geospatial Data Cloud website (http://www.gscloud.cn/) provides Moderate-resolution Imaging Spectroradiometer (MODIS) daily, 5-days, 10-days, and monthly composite images of normalized difference vegetation index (NDVI) and land surface temperature (LST) covering the whole of China. The monthly composite images of NDVI (MODND1M) of China in 2015 were obtained and processed by maximum value composition (MVC) to create a yearly maximal NDVI image [33]. We also applied the MVC method to the monthly nighttime LST (MODLT1M.LTN) product of China from January to December 2015 to create a yearly maximal LST image.
The 2015 Landsat8 OLI images covering 25 cities with less than 1% cloud cover were obtained from the Geospatial Data Cloud website. Radiation calibration, atmospheric correction, mosaic, clipping, and other pre-processing operations were performed on Landsat8 OLI images of each city with ENVI. The FLAASH (Fast Line-of-sight Atmospheric Analysis of Spectral Hypercubes) tool was applied to atmospheric correction. The impervious surface was initially classified using maximum likelihood method. Then the actual urban extent was acquired by visually interpreted with the help of 2015 google history images at the fifteenth layer. The spatial resolution of google history images in the 15th layer is 3.66 m, which we downloaded by 91 Weitu (http://www.91weitu.com).
All images were resampled to a spatial resolution of 500m on the Albers Conical Equal Area projection.
(2)
Auxiliary Data
The administrative boundaries of Chinese cities were applied to partition the research areas into cities. The images were cutting out by the extent of each city’s administrative boundary. The water mask images excluded the water pixels from the nighttime light images to remove the effect of water on night light. The administrative boundaries and water mask were obtained from the Resource and Environment Data Cloud Platform website (http://resdc.cn/). The 1 km grid population dataset [34] and 1 km grid GDP dataset [35] of China in 2015 were from the Resource and Environment Data Cloud Platform. The statistical data of Table 1 was from the China City Statistical Yearbook 2016. The China City Statistical Yearbook 2016 covered the main socio-economic statistical data of cities at all levels for 2015.
(3)
Other Global Urban Data Products
Recently, there are many global or national urban/impervious surface products released. We chose four existing products for comparison purpose, including the annual maps of global artificial impervious area (GAIA) [36], the high-resolution multi-temporal mapping of global urban land based on a normalized urban areas composite index (NUACI-based GUL) [37], the global urban expansion product based on a fully convolutional network (FCN) [30] and the MCD12Q1.006 MODIS Land Cover Type Yearly Global 500 m product (MODIS500). The MODIS500 were obtained from Google Earth Engine. We extracted the “urban and built-up lands” type for comparison. Because the 2015 urban mapping data is not available from the FCN, we replaced with 2016 FCN data. The spatial resolution of the GAIA and the NUACI-based GUL was 30m. The original spatial resolution was maintained in urban spatial comparison and area calculation. All urban products were resampled to a spatial resolution of 500 m on the Albers Conical Equal Area projection for accuracy assessment.

3. Method

The NTL data records the light intensity emitted by objects on the ground in the night, helping to distinguish bright urban areas from dim background [2]. The key to extracting urban extent based on NTL data is to identify urban areas according to the spatial distribution pattern of nighttime light intensity. The clustering method, widely used for classification and segmentation of remote sensing images as an unsupervised approach, makes inferences from input data without referring to known, or labeled, outcomes [38]. This study applied the clustering method based on pixels to extract urban extent from NTL data. The clustering method regarded input pixels as an out-of-order dataset and ignored the spatial context information of pixels. This resulted in the clustering results that were sensitive to noise and outliers. In addition, adjacent pixels in an NTL image were interdependent, and a large part of NTL radiation of a pixel came from the surroundings. Therefore, the spatial context information of each pixel was considered to improve the classification accuracy of urban pixels. There was a roughly negative correlation between nighttime light intensity and distance from the urban center. It was easy to distinguish between urban inner pixels and non-urban inner pixels because of their distinct brightness differences. However, it was difficult to classify urban pixels in urban edge district because of the complex spatial distribution of light intensity and the large light intensity range in urban edge district. Therefore, extracting urban areas separately by urban inner pixels and urban edge pixels was more efficient and suitable.
Therefore, we proposed an automatic method for extracting urban extent from NTL data based on spatial context constraints and clustering algorithm. We first segmented the NTL image into urban interior district, urban edge district and non-urban interior district by a spatial context constrained clustering algorithm. We then classified urban pixels in urban edge district by developing a direction-based spatial context constrained clustering algorithm. Finally, the full urban extent was obtained by merging the urban inner pixels and the urban pixels in urban edge district. In addition, we assessed the performance of our method by evaluating the accuracy of the extracted results and comparing it with two commonly used urban extent extraction methods based on NTL data. The workflow of the proposed method is shown in Figure 2.

3.1. The Spatial Context Constrained Clustering Algorithm

K-means clustering is a simple and popular unsupervised machine learning algorithm that partitions data points into k clusters by allocating each point to the cluster with the nearest cluster centroid. The silhouette method plays a significant role in k-means clustering evaluation and help to find the optimum number of clusters [39]. The Silhouette Coefficient is calculated using the mean intra-cluster distance and the mean nearest-cluster distance for each point data in the silhouette method. A higher average Silhouette Coefficient indicates a better clustering effect. Interestingly, when performing K-Means clustering on NTL pixels for different cities, the average Silhouette coefficient always has the highest value when the number of clusters is 2 (Figure 3). Therefore, in this study, we used the K-Means clustering algorithm to classify the NTL pixels into two groups: a low-NTL light intensity group and a high-NTL light intensity group. We assigned pixels in the high- NTL light intensity group are urban pixels and pixels in the low-NTL light intensity group are non-urban pixels.
A sliding window was used to define the surrounding neighborhood. Take the pixels in the window of ( 2 × R + 1 ) × ( 2 × R + 1 ) with radius R, centered on the pixel ( x , y ) as the spatial context of the pixel ( x , y ) . We applied the filtering algorithms to integrate the spatial context information to correct, enrich and constrains the current pixel. The adopted filtering algorithms could be mean filtering, median filtering, and other filtering methods, which were determined according to the spatial distribution of light intensity. Then input the filtered pixels into the K-means algorithm for clustering to get the urban pixels. The loss function of the spatial context constrained clustering algorithm was defined as follows.
E   =   j = 1 K = 2 p ( x , y ) C j F ( x , y )   μ j 2 ,
μ j =   1 d j p ( x , y ) C j F ( x , y ) ,
F ( x , y ) = f ( { N ( x , y ) } ) .
In Equations (1)–(3), E is the loss function; K is the total number of clusters, this study specified K as 2; C j is the collection of jth cluster; d j and μ j is the number of pixels and the center of the jth cluster, respectively; f is the adopted filtering algorithm and F ( x , y ) is the filtered result of pixel in ( x , y ) ; { N ( x , y ) } represents the set of all pixels in the sliding window centered on the pixel ( x , y ) .

3.2. Urban Edge District Detection

We first grouped the NTL pixels into two clusters and regarded the cluster with high light intensity as potential urban ( P u r b ) areas and the other as potential non-urban ( P n o n ) areas. Due to the low light intensity of water, green spaces, and inactive areas at night within urban areas, it was easy to be mistaken as non-urban pixels. We applied the median filter to smooth areas in the urban inner. The median filter is a well-known nonlinear filter with good performance for removing noise, especially for some specific noise types such as “Gaussian”, “random”, and “salt and pepper” noises. It sorts all the pixel values from the surrounding neighborhood and replaces the current pixel value with the middle value. Compared to the mean filter, the median filter is less affected by the background and better preserves edges while removing noise (Figure 4). The filtered pixels were grouped into P u r b areas and P n o n areas with k-means.
The spatial distribution pattern of nighttime light intensity corresponds to the urban spatial pattern. In NTL images, the urban core areas are brighter than the surrounding areas, while the countryside and background are dim. Thus, pixels with relatively low light intensity in P u r b areas were more likely to be in urban edge district; similarly, pixels with relatively high light intensity in P n o n areas were more likely to be in urban edge district. The mean value and standard deviation of P u r b areas were calculated to define the threshold t2. P u r b areas were divided into urban inner areas and edge areas (Equations (4) and (6)). Similarly, P n o n areas were divided into non-urban inner areas and edge areas by the threshold t1 defined by its mean and standard deviation (Equations (5) and (6)). The urban edge district was obtained by merging the edge areas of P u r b areas and P n o n areas (Equation (6)).
P u r b a n   i n n e r = p i x e l s   i n   P u r b   , p i x e l   v a l u e >   t 2  
P n o n u r b a n   i n n e r = p i x e l s   i n   P n o n   , p i x e l   v a l u e   < t 1
P e d g e   a r e a s = { p i x e l s   i n   P u r b   , p i x e l   v a l u e t 2 p i x e l s   i n   P n o n   , p i x e l   v a l u e t 1
t 1 =   P n o n u r b _ m e a n + N × P n o n u r b _ s t d
t 2 =   P u r b _ m e a n N × P u r b _ s t d
In Equations (4)–(8), P u r b _ m e a n and P u r b _ s t d is the mean value and the standard deviation of P u r b areas, respectively; P n o n u r b _ m e a n and P n o n u r b _ s t d is the mean value and the standard deviation of P n o n areas, respectively; P u r b a n   i n n e r and P n o n u r b a n   i n n e r is the pixels within the urban inner areas and the non-urban inner areas, respectively; P e d g e   a r e a s is the pixels of the urban edge district; t1 and t2 are the segment thresholds for P n o n areas and P u r b areas, respectively, which are determined by the mean value, the standard deviation of the clusters. N is a coefficient, usually 1.

3.3. Urban Pixels Recognition in the Urban Edge District

The urban and non-urban pixels at the edge district have similar and confusing nighttime light intensity, and the urban edge pixels contain more complex meanings and neighborhood spatial heterogeneity than the non-edge pixels. It cannot simply treat the urban edge pixels as noise or one class and requires mining deep spatial information in a larger neighborhood space. Besides, the spatial distribution of light intensity in the edge district presents a certain directionality. We first used an adaptive directional filter algorithm to integrate a larger spatial neighborhood information and filter interference information at the same time, which could enhance the discrimination of urban and non-urban pixels. We then clustered the filtered result into two groups with K-Means. Finally, pixels in a high-intensity group were classified as urban pixels.
Direction filter replaced the center pixel value with the average value of the pixels in the considered direction within the window. We considered eight directions including east, south, west, north, southeast, northeast, southwest, and northwest. The eight direction templates were showed as Figure 5. The filtering direction discrimination function was as Equations (9) and (10). S i was the standard deviation of all available pixel values in the i-th direction template centered on the current pixel. The filtering direction was satisfied Equations (9), which was selected the direction template with the minimum standard deviation.
Dir = min { S 1 ,   S 2 , S 3 , , S 8 } ,
S i =   j = 1 r + 1 ( p j   p ¯ i ) 2 r ,

3.4. Accuracy Evaluation

According to the previous studies, the most popular way of conducting the accuracy evaluation is to compare the extracted urban extent with those derived from Landsat 8 OLI images [19,26,33]. Therefore, we used urban extent obtained from Landsat8 OLI images as the reference for accuracy evaluation. We defined urban areas as human settlements and functionally-related regions that have high density of roads, buildings, shops, and restaurants that attract people, including suburban regions closely related to urban cores. Firstly, we classified the impervious surface from Landsat8 OLI images and produced discrete and bounded patches having a high percentage of impervious surface. Then the coarse urban extent was derived through recognizing and adjusting those patches under visually interpreting Landsat8 OLI images. For those areas that were not easy to recognize, we interpreted with the help of google high-resolution images and POIs to interpret, and got the finer urban extent that was regarded as the actual urban extent.
The confusion matrix was calculated by comparing the urban extent extracted from the NTL data with the reference data pixel by pixel. Quantified the performance of the urban extraction methods with four accuracy indexes, the overall accuracy (OA), the kappa coefficient, the omission error (OE) and the commission error (CE). The specific calculation formula of evaluation index is as follows.
OA is the proportion of correctly classified pixels to the total number of pixels, including correctly classified urban pixels and correctly classified non-urban pixels.
O A = U r b a n t r u e + N o n _ U r b a n t r u e N ,
Kappa coefficient indicates the consistency between the extracted results and the reference data, and statistically reflects the extent to which the classification results are better than the random classification results. Pe is the proportion of misinterpretations caused by accidental factors.
Ka =   O A P e 1 P e ,
Pe =   U r b a n p r e × U r b a n r e a l + N o n _ U r b a n p r e × N o n _ U r b a n r e a l N × N ,
OE is the proportion of prediction errors in city pixels, reflecting the underestimation of extracted results. The producer’s accuracy (PA) is the recognition rate of the actual urban pixels. The sum of OE and PA is 1.
O E = N o n _ U r b a n f a l s e U r b a n t r u e + N o n _ U r b a n f a l s e ,
CE is the proportion of prediction errors in the pixels predicted as cities, reflecting the overestimation of the extracted results. The user’s accuracy (UA) is the correct proportion of the predicted urban pixels. The sum of CE and UA is 1.
C E = U r b a n f a l s e U r b a n t r u e + U r b a n f a l s e ,
N is the total number of the input pixels. U r b a n r e a l and N o n _ U r b a n r e a l are the number of urban pixels and the number of non-urban pixels in reference data, respectively. U r b a n p r e is the number of urban pixels in extracted result, and N o n _ U r b a n p r e is the number of non-urban pixels in extracted result. U r b a n t r u e is the number of pixels that are the urban pixels in the extracted result and are also the urban pixels in the reference data. U r b a n f a l s e is the number of urban pixels in the extracted result which are non-urban pixels in the reference data. N o n _ U r b a n f a l s e is the number of non-urban pixels in extracted result which are urban pixels in the reference data.

4. Results

4.1. Selection of Neighborhood Size

We tested various neighborhood size for urban edge district detection and urban edge pixels classification (Figure 6). We found that the larger the neighborhood size for urban edge district detection, the lower PA and the higher UA of the extracted results were, and the Kappa coefficient reached the highest value in a certain range. It was similar with the neighborhood size for urban edge pixels classification. By comparing the filtering and clustering effects with different neighborhood sizes, we used 5 × 5 window size to detect urban edge district and 9 × 9 window size to classify urban edge pixels.

4.2. Accuracy and Comparison

The urban extent for 25 cities were extracted and compared using the proposed method, the LOT method, and the INNL-SVM method. The urban extent extraction process of the LOT method and the INNL-SVM method were referred to Dou et al. [33]. Then we calculated the confusion matrix of these three methods between the extracted results and the actual urban extent extracted from Landsat8 OLI, respectively. OA, kappa coefficient, OE, and CE of three methods were as shown in Table 2. The average OA of our method, LOT and INNL-SVM, were 0.9625, 0.9622, and 0.9571, and the average kappa coefficients were 0.7679, 0.7544, and 0.7304, respectively. The method had the highest average kappa coefficient and the second highest average of OA very close to LOT. As a whole, the three methods had high classification accuracy. Since there was a large number of non-urban background pixels in the input data, which can greatly improve the value of OA, we mainly compared the kappa coefficients of the three methods. The lowest kappa coefficients of our method, LOT and INNL-SVM were 0.6871, 0.6645, and 0.6083, respectively. Kappa coefficient over 0.61 means that the results is highly consistent with the reference data. Therefore, the extracted results of our method had high reliability. The kappa coefficients of the extracted results by our method were mostly higher than LOT and INNL-SVM in 25 cities, which indicated that our method had strong robustness and better performance. We used OE and CE to evaluate the underestimation and overestimation of the three methods, respectively. The higher value of OE, the more serious of underestimation, and the higher value of CE, the more serious of overestimation. For most cities, the OE and CE of the three methods were between 0.1 and 0.35. The results of LOT had similar OE and CE in most cities. The OE and CE of INNL-SVM varied greatly in different places. It meant that the performance of INNL-SVM was greatly affected by different regionally environment. The OE of our method was less than CE in most cities, except in Hohhot, Lanzhou, Lhasa, and Shenyang. Our method had OE greater than CE in these four cities. This was because human activities in these four cities were mainly concentrated urban interior areas, while urban fringe areas were more prone to underestimation. In most cities, the OE of our method was smaller than that of LOT and INNL-SVM, while CE was generally between LOT and INNL-SVM.
The urban extent extraction results of seven representative cities using three methods were illustrated in Figure 7. These cities range from mega cities (e.g., Beijing, Shanghai, and Guangzhou), middle-size cities (e.g., Wuhan) to small cities (e.g., Lanzhou, Urumqi, and Kunming). In order to clearly present the difference between the extracted results and the reference data, the common parts, overestimated parts and underestimation parts were showed in red, green, and blue, respectively. The complete urban extent result was composed of the common parts (red pixels) and the overestimated parts (green pixels). The visual comparison revealed that the urban extent extracted from NTL data by these three methods match the reference data well. Nevertheless, all the urban extent extracted by three methods existed some overestimation. This may partly be explained by the inconsistency between NTL data and Landsat 8 images in expressing the urban and the low resolution of NTL data. In addition, there were some problems with the extracted results of LOT and INNL-SVM. The results of LOT and INNL-SVM had many holes in urban inner areas and many pixels at urban edge district were mistaken as non-urban pixels. These problems were improved in our method. The urban extent extracted by our method had high spatial continuity and higher consistency with the reference data at urban edge district. Overall, these results indicate that our method performs well in extracting urban extent from NTL data, enhances the internal spatial continuity of the extracted results, and improves the underestimation of the edge pixels.

4.3. Comparison with Other Products

We also compared our results in the year 2015 with the existing global urban datasets, the GAIA (2015), the MODIS500 (2015), the NUACI-based GUL (2015), the FCN (2016). Figure 8 shows the comparisons in seven representative cities, in which GAIA and NUAIC-based GUL had 30 m spatial resolution and our result, MODIS500 and FCN had 500 m spatial resolution. When visually compared with the referenced urban extent and Landsat8 OLI images, our results and FNC provided a good delineation of urban extent, while GAIA and NUACI-based GUL demonstrate the finest spatial details of urban land. The urban spatial pattern of these five dataset MODIS500 had relatively strong similarity. MODIS500 overestimated urban land for cities such as Beijing, while had more omission errors for the cities of Urumqi and Kunming. We resampled these urban datasets for the seven representative cities to a spatial resolution of 500 m on the Albers Conical Equal Area projection and calculated urban pixels and the kappa coefficient (Table 3). The Kappa value of our method and FCN ranged from 0.7 to 0.8, while that of GAIA, NUACI-based GUL, and MODIS500 spanned 0.5 to 0.7. GAIA had the highest urban area for all cities because it is an artificial impervious surface product, not only covering urban areas but also non-urban areas.

5. Discussion

5.1. The Proposed Method Can Effectively Extract Urban Extent from NPP-VIIRS NTL

This study proposed an automatic and reliable approach to extract urban extent from NTL data quickly and accurately. Different neighborhood sizes of spatial context constraints are utilized to characterize the edge and interior of the urban areas and reduce the effects of noise and abnormal light intensity. The spatial context constrains can enhance the signals of urban areas and weaken the influence of non-urban areas light. The major difference from previous studies is that we attempt to develop a simple method without the help of ancillary data and transform urban extent extraction as an urban and non-urban pixels-based clustering problem, which could avoid the tedious procedure of threshold or sample selection. The experimental results prove that the proposed method is reliable and robust, which can extract the urban extent of different areas with different regional environment and social and economic development level accurately. The kappa ranged from 0.687 to 0.829 with an average of 0.768 (1.80% higher than LOT and 4.88% higher than INNL-SVM). The extraction results of the proposed method demonstrate great integrity and connectivity, which are consistent with the reference data. Comparison with LOT and INNL-SVM showed that the proposed method could guarantee high extraction accuracy, while increasing detail information of areas in edge district (Figure 9a) and reducing errors affected by low brightness of areas in urban inner (Figure 9b). Among three popular methods, LOT requires ancillary data to determine optimal thresholds and the extraction results of INNL-SVM are highly rely on sample data. However, for large-scale studies, considerable time and labor are needed to obtain high-accuracy ancillary data and sample data, which limits the practicability of LOT and INNL-SVM. The proposed method automatically recognizes urban extent depending on NTL data itself, without referring to other data, increasing its efficiency for extracting urban extent at large scales or long-term temporal scales. These prove that the proposed method has extensive applicability in the field of urban extent extraction.
In addition, we applied our method to extracting urban extent in the years 2013 and 2018 for the seven representative cities to demonstrate the potential of our method in urban dynamic analysis. We selected the GAIA to comparison because of its appropriate time coverage and relatively high resolution. Figure 10 and Figure 11 showed urban expansion of Shanghai and Lanzhou from 2013 to 2018. The urban extent extracted using our method in the years 2013, 2015, and 2018 revealed the urban growth trace and the growth regions of our method matched to those of GAIA to certain degrees considering the difference between imperviousness- and NTL- indicated urbanization.

5.2. The Disadvantages of the Proposed Method

There were some overestimation and underestimation of our extraction results. For example, Lhasa and Hohhot had relatively high OE and Ningbo and Guangzhou had relatively high CE. We compared Lhasa and Urumqi because they both were in west China, while Urumqi had much lower OE and CE. The brightness distribution in Urumqi was relatively uniform and urban areas were noticeably brighter than non-urban areas in urban edge district, while the brightness distribution in Lhasa showed obviously spatial aggregation (Figure 12). The great brightness difference between urban center and the surrounding areas resulted in mistaking many urban areas for non-urban areas. Observation of incorrect urban areas illustrated that the inconsistent description of urban areas between NTL data and Landsat images was one of the main causes of overestimating the urban areas. In Landsat, area A in Figure 13 was regarded as a non-urban area because of its low percentages of impervious surfaces, while NTL identified it as an urban area because of its relatively high brightness. From the google image, there are several roads through area A and some high-density building areas are embedded in contiguous farmland (Figure 13). It seems that there is a close relationship between area A and the main urban areas.
In general, the proposed method has the following shortcomings. It is based solely on NTL data result in some misclassification. The overflow effect of NTL data and the strong light from non-human active area facilities (such as city periphery, airport, oil field) may cause the overestimation of urban areas. The overpass time of VIIRS is near 1:30 a.m., which leads to underestimation in places with declining brightness in midnight. In addition, the proposed method uses a simple way to represent the spatial context information and classify urban pixels, which is not sufficient for complex areas. In the area of rapid urban development and continuous emergence of small towns, it is more likely to exist overestimation between cities and towns. In the area where human activities are mainly concentrated in the center part, it is easy to be underestimated in the edge district. Besides, this study remains narrow in focus dealing only with the NPP-VIIRS data. The developed method is not suitable for other NTL products due to the different of spatial resolution, radiation resolution and process procedures. To extract urban extent from other NTL products such as DMSP-OLS and Luojia-01, the appropriate clustering method based on spatial context constraints should be designed according to the characteristics of the NTL product. Future research might focus on developing the unsupervised urban extent extraction model with better spatial features representation capabilities to adapt to different NTL products and improve the accuracy of complex environmental areas. Consider using convolution neural network methods to enhance the ability of spatial information expression and integrating other data sources (such as the human activity data) to descript urban areas more comprehensively.

5.3. The Sensitivity Analysis of Parameters

The algorithm of the proposed method was presented in Supplementary Table S1, where there are three free parameters: R1, R2, and N. The parameters R1 and R2 are used to determined the filtering window radius for step 1 and step 2 (Figure 2), respectively. The window size of step 1 is ( 2 × R 1 + 1 ) × ( 2 × R 1 + 1 ) , and the window size of step 2 is ( 2 × R 2 + 1 ) × ( 2 × R 2 + 1 ) . The parameter N is used to specify the threshold value for urban edge district detection.
A sensitivity analysis was performed for R1, R2, and N to examine how changes in these parameters impact urban extent extraction accuracy. We use the average kappa coefficient of 25 experiment cities to represent the extraction accuracy. Figure 14 showed that the accuracy kept in the range of 0.68 to 0.8 with fewer outliers, which indicated that no matter how the parameters change, our results keep a relative high accuracy. However, parameters also can have important impacts on the accuracy. The accuracy increased with the increase of N, and it achieved the highest value when N was 1. For R2, the accuracy had the highest value at 4 or 5. For R1, the results had the highest value at 2. Besides, we found that these parameters have little influence on each other.

6. Conclusions

This study set out to develop an unsupervised method for urban extent extraction from NTL data, avoiding the time-consume preparation of ancillary or labelled data. The proposed method applies the pixel-based clustering algorithm to distinguish urban areas from background. Multi-scale spatial context constraints are utilized to integrate surrounding information of pixels, to increase the distinction between urban pixels and nonurban pixels and correct those areas with abnormal light intensity. The experiments confirmed that it has high accuracy and can extract urban extent with different regional environments and socioeconomic development levels (the kappa of the proposed method ranged from 0.687 to 0.829, while the kappa range of LOT and INNL-SVM was 0.664 to 0.811 and 0.608 to 0.821, respectively). The overlay between extraction results and reference data demonstrates that the urban extent extracted by the proposed method had greater spatial integrity and connectivity in urban inner and lower omission errors in urban edge district. The method can delineate urban extent accurately and automatically, without the time-consuming and laborious process of threshold selection and training dataset construction, which saves a large amount of time and labor. These indicate the broad application prospects of the proposed method in large-scale urban research. In addition, the results in this study also proved the feasibility of applying unsupervised methods for urban extent extraction, which provides new inspiration for urban extent extraction based on NTL data and has a number of important implications for future practice.

Supplementary Materials

The following are available online at https://www.mdpi.com/2072-4292/12/22/3810/s1.

Author Contributions

Conceptualization, F.Z.; Funding acquisition, F.Z., Z.D. and R.L.; Methodology, X.C.; Project administration, R.L.; Writing—original draft, X.C.; Writing—review & editing, F.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Key R&D Program of China (2018YFB0505000, 2017YFB0503604), National Natural Science Foundation of China (41671391, 41922043, 41871287).

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Foley, J.A.; DeFries, R.; Asner, G.P.; Barford, C.; Bonan, G.; Carpenter, S.R.; Chapin, F.S.; Coe, M.T.; Daily, G.C.; Gibbs, H.K.; et al. Global Consequences of Land Use. Science 2005, 309, 570–574. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  2. Li, K.; Chen, Y. A Genetic Algorithm-Based Urban Cluster Automatic Threshold Method by Combining VIIRS DNB, NDVI, and NDBI to Monitor Urbanization. Remote Sens. 2018, 10, 277. [Google Scholar] [CrossRef] [Green Version]
  3. Park, S.; Jeon, S.; Kim, S.; Choi, C. Prediction and comparison of urban growth by land suitability index mapping using GIS and RS in South Korea. Landsc. Urban Plan. 2011, 99, 104–114. [Google Scholar] [CrossRef]
  4. He, C.; Liu, Z.; Tian, J.; Ma, Q. Urban expansion dynamics and natural habitat loss in China: A multiscale landscape perspective. Glob. Chang. Biol. 2014, 20, 2886–2902. [Google Scholar] [CrossRef] [PubMed]
  5. Seto, K.C.; Fragkias, M.; Güneralp, B.; Reilly, M.K. A Meta-Analysis of Global Urban Land Expansion. PLoS ONE 2011, 6, e23777. [Google Scholar] [CrossRef]
  6. Schneider, A.; Friedl, M.A.; Potere, D. Mapping global urban areas using MODIS 500-m data: New methods and datasets based on ‘urban ecoregions’. Remote Sens. Environ. 2010, 114, 1733–1746. [Google Scholar] [CrossRef]
  7. Liu, X.; Ning, X.; Wang, H.; Wang, C.; Zhang, H.; Meng, J. A Rapid and Automated Urban Boundary Extraction Method Based on Nighttime Light Data in China. Remote Sens. 2019, 11, 1126. [Google Scholar] [CrossRef] [Green Version]
  8. Hu, X.; Weng, Q. Estimating impervious surfaces from medium spatial resolution imagery using the self-organizing map and multi-layer perceptron neural networks. Remote Sens. Environ. 2009, 113, 2089–2102. [Google Scholar] [CrossRef]
  9. Zhou, Y.; Smith, S.J.; Elvidge, C.D.; Zhao, K.; Thomson, A.M.; Imhoff, M.L. A cluster-based method to map urban area from DMSP/OLS nightlights. Remote Sens. Environ. 2014, 147, 173–185. [Google Scholar] [CrossRef]
  10. Elvidge, C.D.; Baugh, K.E.; Kihn, E.A.; Kroehl, H.W.; Davis, E.R. Mapping city lights with nighttime data from the DMSP Operational Linescan System. Photogramm. Eng. Remote Sens. 1997, 63, 727–734. [Google Scholar]
  11. Imhoff, M.L.; Lawrence, W.T.; Stutzer, D.C.; Elvidge, C.D. A technique for using composite DMSP/OLS “City Lights” satellite data to map urban area. Remote Sens. Environ. 1997, 61, 361–370. [Google Scholar] [CrossRef]
  12. Elvidge, C.D.; Safran, J.; Tuttle, B.; Sutton, P.; Cinzano, P.; Pettit, D.R.; Arvesen, J.; Small, C. Potential for global mapping of development via a nightsat mission. GeoJournal 2007, 69, 45–53. [Google Scholar] [CrossRef]
  13. Elvidge, C.D.; Erwin, E.H.; Baugh, K.E.; Ziskin, D.; Tuttle, B.T.; Ghosh, T.; Sutton, P.C. Overview of DMSP nightime lights and future possibilities. In Proceedings of the 2009 Joint Urban Remote Sensing Event, Shanghai, China, 20–22 May 2009; Institute of Electrical and Electronics Engineers (IEEE): New York, NY, USA, 2009; pp. 1–5. [Google Scholar]
  14. Hu, X.; Qian, Y.; Pickett, S.; Zhou, W. Urban mapping needs up-to-date approaches to provide diverse perspectives of current urbanization: A novel attempt to map urban areas with nighttime light data. Landsc. Urban Plan. 2020, 195, 103709. [Google Scholar] [CrossRef]
  15. Zhou, Y.; Smith, S.J.; Zhao, K.; Imhoff, M.L.; Thomson, A.M.; Bond-Lamberty, B.; Asrar, G.R.; Zhang, X.; He, C.; Elvidge, C.D. A global map of urban extent from nightlights. Environ. Res. Lett. 2015, 10, 54011. [Google Scholar] [CrossRef]
  16. Zhou, Y.; Li, X.; Asrar, G.R.; Smith, S.J.; Imhoff, M. A global record of annual urban dynamics (1992–2013) from nighttime lights. Remote Sens. Environ. 2018, 219, 206–220. [Google Scholar] [CrossRef]
  17. Zhang, Q.; Wang, P.; Chen, H.; Huang, Q.; Jiang, H.; Zhang, Z.; Zhang, Y.; Luo, X.; Sun, S. A novel method for urban area extraction from VIIRS DNB and MODIS NDVI data: A case study of Chinese cities. Int. J. Remote Sens. 2017, 38, 6094–6109. [Google Scholar] [CrossRef]
  18. Ma, W.; Li, P. An Object Similarity-Based Thresholding Method for Urban Area Mapping from Visible Infrared Imaging Radiometer Suite Day/Night Band (VIIRS DNB) Data. Remote Sens. 2018, 10, 263. [Google Scholar] [CrossRef] [Green Version]
  19. Shi, K.; Huang, C.; Yu, B.; Yin, B.; Huang, Y.; Wu, J. Evaluation of NPP-VIIRS night-time light composite data for extracting built-up urban areas. Remote Sens. Lett. 2014, 5, 358–366. [Google Scholar] [CrossRef]
  20. Yu, B.; Tang, M.; Wu, Q.; Yang, C.; Deng, S.; Shi, K.; Peng, C.; Wu, J.; Chen, Z. Urban Built-Up Area Extraction From Log- Transformed NPP-VIIRS Nighttime Light Composite Data. IEEE Geosci. Remote Sens. Lett. 2018, 15, 1279–1283. [Google Scholar] [CrossRef]
  21. Milesi, C.; Elvidge, C.D.; Nemani, R.R.; Running, S.W. Assessing the impact of urban land development on net primary productivity in the southeastern United States. Remote Sens. Environ. 2003, 86, 401–410. [Google Scholar] [CrossRef]
  22. Henderson, M.; Yeh, E.T.; Gong, P.; Elvidge, C.; Baugh, K. Validation of urban boundaries derived from global night-time satellite imagery. Int. J. Remote Sens. 2003, 24, 595–609. [Google Scholar] [CrossRef]
  23. He, C.; Shi, P.; Li, J.; Chen, J.; Pan, Y.; Li, J.; Zhuo, L.; Ichinose, T. Restoring urbanization process in China in the 1990s by using non-radiance-calibrated DMSP/OLS nighttime light imagery and statistical data. Chin. Sci. Bull. 2006, 51, 1614–1620. [Google Scholar] [CrossRef]
  24. Xie, Y.; Weng, Q. Updating urban extents with nighttime light imagery by using an object-based thresholding method. Remote Sens. Environ. 2016, 187, 1–13. [Google Scholar] [CrossRef]
  25. Zhai, W.; Han, B.; Cheng, C. Evaluation of Luojia 1–01 Nighttime Light Imagery for Built-Up Urban Area Extraction: A Case Study of 16 Cities in China. IEEE Geosci. Remote Sens. Lett. 2019, 17, 1802–1806. [Google Scholar] [CrossRef]
  26. Cao, X.; Chen, J.; Imura, H.; Higashi, O. A SVM-based method to extract urban areas from DMSP-OLS and SPOT VGT data. Remote Sens. Environ. 2009, 113, 2205–2209. [Google Scholar] [CrossRef]
  27. Jing, W.; Yang, Y.; Yue, X.; Zhao, X. Mapping Urban Areas with Integration of DMSP/OLS Nighttime Light and MODIS Data Using Machine Learning Techniques. Remote Sens. 2015, 7, 12419–12439. [Google Scholar] [CrossRef] [Green Version]
  28. Xu, T.; Coco, G.; Gao, J. Extraction of urban built-up areas from nighttime lights using artificial neural network. Geocarto Int. 2019, 35, 1049–1066. [Google Scholar] [CrossRef]
  29. Liu, X.; De Sherbinin, A.; Zhan, Y. Mapping Urban Extent at Large Spatial Scales Using Machine Learning Methods with VIIRS Nighttime Light and MODIS Daytime NDVI Data. Remote Sens. 2019, 11, 1247. [Google Scholar] [CrossRef] [Green Version]
  30. He, C.; Liu, Z.; Gou, S.; Zhang, Q.; Zhang, J.; Xu, L. Detecting global urban expansion over the last three decades using a fully convolutional network. Environ. Res. Lett. 2019, 14, 034008. [Google Scholar] [CrossRef]
  31. Ma, X.; Tong, X.; Liu, S.; Luo, X.; Xie, H.; Li, C. Optimized Sample Selection in SVM Classification by Combining with DMSP-OLS, Landsat NDVI and GlobeLand30 Products for Extracting Urban Built-Up Areas. Remote Sens. 2017, 9, 236. [Google Scholar] [CrossRef] [Green Version]
  32. Li, M.; Xu, L.; Gao, S.; Xu, N.; Yan, B. Adaptive Segmentation of Remote Sensing Images Based on Global Spatial Information. Sensors 2019, 19, 2385. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  33. Dou, Y.; Liu, Z.; He, C.; Yue, H. Urban Land Extraction Using VIIRS Nighttime Light Data: An Evaluation of Three Popular Methods. Remote Sens. 2017, 9, 175. [Google Scholar] [CrossRef] [Green Version]
  34. Xu, X.L. China Population Spatial Distribution Kilometer Grid Dataset; Data Registration and Publishing System of Resource and Environmental Science Data Center of the Chinese Academy of Sciences, 2017; Available online: http://www.resdc.cn/DOI (accessed on 24 May 2020).
  35. Xu, X.L. China GDP Spatial Distribution Kilometer Grid Dataset; Data Registration and Publishing System of Resource and Environmental Science Data Center of the Chinese Academy of Sciences, 2017; Available online: http://www.resdc.cn/DOI (accessed on 24 May 2020).
  36. Gong, P.; Li, X.; Wang, J.; Bai, Y.; Chen, B.; Hu, T.; Liu, X.; Xu, B.; Yang, J.; Zhang, W.; et al. Annual maps of global artificial impervious area (GAIA) between 1985 and 2018. Remote Sens. Environ. 2020, 236, 111510. [Google Scholar] [CrossRef]
  37. Liu, X.; Hu, G.; Chen, Y.; Li, X.; Xu, X.; Li, S.; Pei, F.; Wang, S. High-spatiotemporal multi-temporal mapping of global urban land using Landsat images based on the Google Earth Engine Platform. Remote Sens. Environ. 2018, 209, 227–239. [Google Scholar] [CrossRef]
  38. Chandana, B.S.; Phanendra, M.L.; Babu, S.W. Remote Sensing Image Classification Based on Clustering Algorithms; Confianzit Org: Kerala, India, 2013. [Google Scholar]
  39. Dinh, D.-T.; Fujinami, T.; Huynh, V.-N. Estimating the Optimal Number of Clusters in Categorical Data Clustering by Silhouette Coefficient; Springer: Singapore, 2019; pp. 1–17. [Google Scholar]
Figure 1. Study area: (A) The spatial distribution of 25 evaluation areas; (B) 1 km grid population data of China in 2015; (C) 1km grid gross domestic product (GDP) data of China in 2015; (D) The 2015 NPP-VIIRS NTL data of China; (E) The 2015 normalized difference vegetation index (NDVI) data of China; (F) The nighttime land surface temperature (LST) data of China in 2015.
Figure 1. Study area: (A) The spatial distribution of 25 evaluation areas; (B) 1 km grid population data of China in 2015; (C) 1km grid gross domestic product (GDP) data of China in 2015; (D) The 2015 NPP-VIIRS NTL data of China; (E) The 2015 normalized difference vegetation index (NDVI) data of China; (F) The nighttime land surface temperature (LST) data of China in 2015.
Remotesensing 12 03810 g001
Figure 2. The workflow of the proposed method.
Figure 2. The workflow of the proposed method.
Remotesensing 12 03810 g002
Figure 3. Average silhouette coefficient versus number of clusters.
Figure 3. Average silhouette coefficient versus number of clusters.
Remotesensing 12 03810 g003
Figure 4. Comparison of NTL image filtering results: median filtering and mean filtering.
Figure 4. Comparison of NTL image filtering results: median filtering and mean filtering.
Remotesensing 12 03810 g004
Figure 5. Eight filtering direction templates.
Figure 5. Eight filtering direction templates.
Remotesensing 12 03810 g005
Figure 6. Extraction accuracy of urban extent under different neighborhood sizes: on the left is the different neighborhood sizes of the first stage (fixed neighborhood radius 4 in the second stage); on the right is the different neighborhood sizes of the second stage (fixed neighborhood radius 2 in the second stage).
Figure 6. Extraction accuracy of urban extent under different neighborhood sizes: on the left is the different neighborhood sizes of the first stage (fixed neighborhood radius 4 in the second stage); on the right is the different neighborhood sizes of the second stage (fixed neighborhood radius 2 in the second stage).
Remotesensing 12 03810 g006
Figure 7. Urban extent extraction results: (a) Landsat8 OLI Images of selected cities, (b) VIIRS NTL images of selected city, (c) the urban extent extracted using our method, (d) the urban extent extracted using local optimal threshold (LOT), (e) the urban extent extracted using INNL-SVM.
Figure 7. Urban extent extraction results: (a) Landsat8 OLI Images of selected cities, (b) VIIRS NTL images of selected city, (c) the urban extent extracted using our method, (d) the urban extent extracted using local optimal threshold (LOT), (e) the urban extent extracted using INNL-SVM.
Remotesensing 12 03810 g007
Figure 8. Comparing the urban pattern and spatial details between our results and other products (a) Landsat8 OLI Images (30 m), (b) the global artificial impervious area (GAIA) (30 m), (c) our results (500 m), (d) the global urban land based on a normalized urban areas composite index (NUACI-based GUL) (30 m), (e) the MODIS 500 (500 m), (f) the global urban expansion product based on a fully convolutional network (FCN) (500 m).
Figure 8. Comparing the urban pattern and spatial details between our results and other products (a) Landsat8 OLI Images (30 m), (b) the global artificial impervious area (GAIA) (30 m), (c) our results (500 m), (d) the global urban land based on a normalized urban areas composite index (NUACI-based GUL) (30 m), (e) the MODIS 500 (500 m), (f) the global urban expansion product based on a fully convolutional network (FCN) (500 m).
Remotesensing 12 03810 g008
Figure 9. Spatial Comparison of the results different methods for Beijing: (a) the inner part of urban; (b) the edge district of urban; the yellow box is the area zooming in of the google image.
Figure 9. Spatial Comparison of the results different methods for Beijing: (a) the inner part of urban; (b) the edge district of urban; the yellow box is the area zooming in of the google image.
Remotesensing 12 03810 g009
Figure 10. Urban expansion from 2013 to 2018 for Shanghai and Lanzhou.
Figure 10. Urban expansion from 2013 to 2018 for Shanghai and Lanzhou.
Remotesensing 12 03810 g010
Figure 11. Growth of urban areas from 2013 to 2018.
Figure 11. Growth of urban areas from 2013 to 2018.
Remotesensing 12 03810 g011
Figure 12. The spatial distribution of nighttime light intensity in Lhasa and Urumqi (yellow and red boxes indicate areas zoomed in column 3 and 4, respectively).
Figure 12. The spatial distribution of nighttime light intensity in Lhasa and Urumqi (yellow and red boxes indicate areas zoomed in column 3 and 4, respectively).
Remotesensing 12 03810 g012
Figure 13. Analysis of overestimation urban area for Ningbo: (black box is the overestimation area zoomed in column 1, which was indicated by “A” in text).
Figure 13. Analysis of overestimation urban area for Ningbo: (black box is the overestimation area zoomed in column 1, which was indicated by “A” in text).
Remotesensing 12 03810 g013
Figure 14. The Sensitivity analysis of parameters: (ac) were the results of all parameter combinations; (d) was the results withR2 = 4; (e) was the results with R1 = 2; (f) was the results with N = 1.
Figure 14. The Sensitivity analysis of parameters: (ac) were the results of all parameter combinations; (d) was the results withR2 = 4; (e) was the results with R1 = 2; (f) was the results with N = 1.
Remotesensing 12 03810 g014
Table 1. The population (Pop) and GDP for Districts under City of the selected cities for 2015 (“Districts under City” includes all the city districts, not including counties and the city at lower levels).
Table 1. The population (Pop) and GDP for Districts under City of the selected cities for 2015 (“Districts under City” includes all the city districts, not including counties and the city at lower levels).
CityPop (Million Persons)GDP (Billion RMB)CityPop (Million Persons)GDP (Billion RMB)
Beijing13.392301.46Shanghai13.732483.84
Hohhot1.29230.09Nanjing6.51972.08
Tianjin10.221653.82Hangzhou5.16872.20
Qingdao3.72597.71Ningbo2.31487.72
Shenyang5.29589.12Jiaxing0.8787.09
Jilin1.82141.38Jinhua0.9664.57
Xian6.04513.64Yiwu0.77104.51
Urumqi2.61261.01Wuhan5.15880.60
Lanzhou2.05174.15Zhengzhou3.39408.04
Lhasa0.2219.69Guangzhou8.481810.04
Chongqing21.271320.63Shenzhen3.441750.29
Chengdu6.93846.00Xiamen2.07346.60
Kunming2.78307.29
Table 2. Accuracy assessments of the three methods of extracting urban extent by city.
Table 2. Accuracy assessments of the three methods of extracting urban extent by city.
CityOur MethodLOTINNL-SVM
OAKappaOECEOAKappaOECEOAKappaOECE
AVG0.96250.76790.17140.23730.96220.75440.22090.22540.95710.73040.22550.2413
STD0.02640.03840.07200.07000.02610.04030.04510.04430.03080.05770.11410.0969
Beijing0.95630.79780.13970.21180.95220.77040.20200.20300.95230.75390.28080.1469
Hohhot0.99150.75730.34090.09850.99140.78580.21170.20780.99170.77940.27170.1520
Jinhua0.97360.70650.20980.33840.97270.66870.31700.31700.96850.67650.17470.4032
Shenyang0.97410.75890.24230.21190.97400.76200.22420.22420.95380.67820.06400.4389
Jilin0.99630.75730.22720.25410.99660.76870.22960.22960.99680.78600.20670.2180
Xian0.97340.79610.13850.23500.97330.78370.20160.20250.97650.80220.21990.1476
Urumqi0.99050.82630.16060.17680.98980.81130.18350.18350.98940.82110.09080.2423
Lanzhou0.99450.82560.20290.13780.99250.78480.15900.25770.99060.74930.13420.3322
Tianjin0.94960.77490.17150.21950.94500.74760.22110.22090.91110.66130.11900.4032
Qingdao0.96850.72360.16690.33410.96830.68930.29400.29400.96600.64190.38900.2834
Shanghai0.91650.78790.08400.21670.91250.76500.17690.17690.90340.75330.11820.2360
Nanjing0.94600.78010.13600.23500.94580.76740.20130.20130.93930.75140.16810.2540
Hangzhou0.96770.74430.16570.29960.96740.71940.26270.26370.96610.67070.39420.2031
Ningbo0.95250.70820.16580.34500.95530.69120.28450.28450.95450.65370.39110.2358
Jiaxing0.95540.68710.25600.31880.95410.66450.31070.31070.95370.65540.33260.3062
Yiwu0.93360.75360.14870.25790.92910.72780.20640.25290.92460.73260.11840.3050
Wuhan0.96070.78100.14920.24020.95970.76330.21430.21460.94670.60830.50810.1070
Zhengzhou0.96070.78070.13520.25170.96040.76340.21480.21480.96330.74980.33610.0859
Guangzhou0.91490.73280.03360.34130.93370.74790.22970.19520.92120.73750.09860.3053
Shenzhen0.90920.79400.12780.14880.90820.79060.14070.14230.89210.74460.23460.1139
Xiamen0.93500.80090.13280.18240.92830.77510.18010.18010.92530.75550.24420.1475
Lhasa0.99840.73830.35550.13380.99850.78360.21560.21560.99820.77330.14690.2913
Chongqing0.99210.75310.16230.30950.99240.73760.25840.25860.99190.74310.18550.3098
Chengdu0.96050.80360.08580.24720.96320.79980.17920.18020.96130.77840.24430.1506
Kunming0.99130.82880.14560.18680.98960.79070.20370.20410.98990.80390.16600.2143
Table 3. Comparing our results in the year 2015 with other products.
Table 3. Comparing our results in the year 2015 with other products.
CityOur MethodGAIANUACI-BasedMODIS500FCNLandsat
AreaKappaAreaKappaAreaKappaAreaKappaAreaKappaArea
Beijing84430.797812,9350.583410,2010.701214,0850.626977120.79827736
Shanghai10,7370.787911,1550.607567440.606712,3320.749410,6300.81199181
Guangzhou68470.732863930.612443060.616561540.713754900.74404667
Wuhan36100.781041630.575928830.685037190.706542550.72143224
Lanzhou8200.825613090.637610850.672518250.530812420.7083887
Urumqi16060.826329630.60038490.439615210.772215780.81371575
Kunming22590.828830160.652419200.616836850.567735190.69672150
Note: “Area” means the number of urban pixels in spatial resolution 500 m, and “Landsat” means the urban extent obtained from Landsat.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Chen, X.; Zhang, F.; Du, Z.; Liu, R. An Unsupervised Urban Extent Extraction Method from NPP-VIIRS Nighttime Light Data. Remote Sens. 2020, 12, 3810. https://doi.org/10.3390/rs12223810

AMA Style

Chen X, Zhang F, Du Z, Liu R. An Unsupervised Urban Extent Extraction Method from NPP-VIIRS Nighttime Light Data. Remote Sensing. 2020; 12(22):3810. https://doi.org/10.3390/rs12223810

Chicago/Turabian Style

Chen, Xiuxiu, Feng Zhang, Zhenhong Du, and Renyi Liu. 2020. "An Unsupervised Urban Extent Extraction Method from NPP-VIIRS Nighttime Light Data" Remote Sensing 12, no. 22: 3810. https://doi.org/10.3390/rs12223810

APA Style

Chen, X., Zhang, F., Du, Z., & Liu, R. (2020). An Unsupervised Urban Extent Extraction Method from NPP-VIIRS Nighttime Light Data. Remote Sensing, 12(22), 3810. https://doi.org/10.3390/rs12223810

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop