Article
Open access
Published: 28 September 2023

A high-resolution canopy height model of the Earth

Nature Ecology & Evolution volume 7, pages 1778–1789 (2023)Cite this article

38k Accesses
132 Altmetric
Metrics details

Subjects

A Publisher Correction to this article was published on 19 February 2024

This article has been updated

Abstract

The worldwide variation in vegetation height is fundamental to the global carbon cycle and central to the functioning of ecosystems and their biodiversity. Geospatially explicit and, ideally, highly resolved information is required to manage terrestrial ecosystems, mitigate climate change and prevent biodiversity loss. Here we present a comprehensive global canopy height map at 10 m ground sampling distance for the year 2020. We have developed a probabilistic deep learning model that fuses sparse height data from the Global Ecosystem Dynamics Investigation (GEDI) space-borne LiDAR mission with dense optical satellite images from Sentinel-2. This model retrieves canopy-top height from Sentinel-2 images anywhere on Earth and quantifies the uncertainty in these estimates. Our approach improves the retrieval of tall canopies with typically high carbon stocks. According to our map, only 5% of the global landmass is covered by trees taller than 30 m. Further, we find that only 34% of these tall canopies are located within protected areas. Thus, the approach can serve ongoing efforts in forest conservation and has the potential to foster advances in climate, carbon and biodiversity modelling.

A biomass map of the Brazilian Amazon from multisource remote sensing

Article Open access 30 September 2023

An unexpectedly large count of trees in the West African Sahara and Sahel

Article 14 October 2020

Nation-wide mapping of tree-level aboveground carbon stocks in Rwanda

Article Open access 22 December 2022

Main

As our society depends on a multitude of terrestrial ecosystem services¹, the conservation of Earth’s forests has become a priority on the global political agenda². To ensure sustainable development through biodiversity conservation and climate change mitigation, the United Nations have formulated global forest goals that include maintaining and enhancing global carbon stocks and increasing forest cover by 3% between 2017 and 2030². Yet global demand for commodities is driving deforestation, impeding progress towards these ambitious goals³. Earth observation and satellite remote sensing play a key role in this context, as they provide the data to monitor the quality of forested area at global scale⁴. However, to measure progress in terms of carbon and biodiversity conservation, novel approaches are needed that go beyond detecting forest cover and can provide consistent information about morphological traits predictive of carbon stock and biodiversity⁵ at global scale. One key vegetation characteristic is canopy height^5,6.

Mapping canopy height in a consistent fashion at global scale is key to understand terrestrial ecosystem functions, which are dominated by vegetation height and vegetation structure⁷. Canopy-top height is an important indicator of biomass and the associated, global aboveground carbon stock⁸. At high spatial resolution, canopy height models (CHMs) directly characterize habitat heterogeneity⁹, which is why canopy height has been ranked as a high-priority biodiversity variable to be observed from space⁵. Furthermore, forests buffer microclimate temperatures under the canopy¹⁰. While it has been shown that in the tropics, higher canopies provide a stronger dampening effect on microclimate extremes¹¹, targeted studies are needed to see if such relationships also hold true at global scale¹⁰. Thus a homogeneous high-resolution CHM has the potential to advance the modelling of climate impact on terrestrial ecosystems and may assist forest management to bolster microclimate buffering as a mitigation service to protect biodiversity under a warmer climate¹⁰.

Given forests’ central relevance to life on our planet, several new space missions have been developed to measure vegetation structure and biomass. A key mission is the Global Ecosystem Dynamics Investigation (GEDI) operated by NASA (National Aeronautics and Space Administration), which has been collecting full-waveform LiDAR data explicitly for the purpose of measuring vertical forest structure globally, between 51.6° N and S (ref. ¹²). GEDI has unique potential to advance our understanding of the global carbon stock, but its geographical range, and also its spatial and temporal resolutions, are limited. The mission, initially planned to last for two years, collected four years of data from April 2019 to March 2023. The instrument will be stored on the International Space Station and is expected to continue collecting data in fall 2024. Independent of these interruptions, GEDI is a sampling mission expected to cover, at most, 4% of the land surface. By design, the collected samples sparsely cover the surface of the Earth, which restricts the resolution of gridded mission products to 1 km cells¹². In contrast, satellite missions such as Sentinel-2 or Landsat, which have been designed for a broader range of Earth observation needs, deliver freely accessible archives of optical images that are not as tailored to vegetation structure but offer longer-term global coverage at high spatial and temporal resolution. Sensor fusion between GEDI and multi-spectral optical imagery has the potential to overcome the limitations of each individual data source¹³.

In this work, we describe a deep learning approach to map canopy-top height globally with high resolution, using publicly available optical satellite images as input. We deploy that approach to compute a global canopy-top height product with 10-m ground sampling distance (GSD), based on Sentinel-2 optical images for the year 2020. That global map and the underlying source code and trained models are made publicly available to support conservation efforts and science in disciplines such as climate, carbon and biodiversity modelling. The map can be explored interactively in this browser application: nlang.users.earthengine.app/view/global-canopy-height-2020.

However, estimating forest characteristics such as canopy height or biomass from optical images is a challenging task¹⁴, as the physical relationships between spectral signatures and vertical forest structure are complex and not well understood¹⁵. Given the vast amount of data collected by the GEDI mission, we circumvent this lack of mechanistic understanding by harnessing supervised machine learning, in particular end-to-end deep learning. From millions of data examples, our model learns to extract patterns and features from raw satellite images that are predictive of high-resolution vegetation structure. By fusing GEDI observations (that is, RH98, the relative height at which 98% of the energy has been returned) with Sentinel-2 images, our approach enhances the spatial and temporal resolution of the CHM and extends its geographic range to the sub-Arctic and Arctic regions outside of GEDI’s coverage. While retrieval of vegetation parameters with deep learning has been demonstrated regionally and up to country scale^16,17,18,19, we scale it up and process the entire global landmass. This step presents a technical challenge but is crucial to enable operational deployment and ensure consistent, globally homogeneous data.

Results and discussion

Deep learning approach

Deep learning is revolutionizing fields ranging from medicine²⁰ to weather forecasting²¹ and has great potential to advance environmental monitoring^22,23, but its application to global remote sensing is technically challenging due to the large data volume^22,24. Cloud platforms such as Google Earth Engine²⁵ simplify the analysis of satellite data but provide a limited set of traditional machine learning tools that depend on manual feature design. To use them, one must sacrifice some flexibility in terms of methods in return for easy access to large data archives and compute power. In particular, canopy height estimation with existing standard tools tends to struggle with the underestimation of tall canopies, as the height estimates saturate around 25 to 30 m (refs. ^26,27,28). This is a fairly severe limitation in regions dominated by tall canopies, such as tropical forests, and deteriorates downstream carbon stock estimation, because tall trees have especially high biomass⁶. A further restriction of prior large-scale CHM projects is that they rely on local calibration, which hampers their use in locations without nearby reference data^27,28. Technically, existing mapping schemes aggregate reflectance data over time but perform pure pixel-to-pixel mapping without regard to local context and image texture.

Going global

Here we extend previous regional deep learning methods^16,17,18 to a global scale. These methods have been shown to mitigate the saturation of tall canopies by exploiting texture while not depending on temporal features¹⁶. Previous work has demonstrated that without the ability to learn spatial features, the performance drops substantially especially for the tall canopies¹⁶. In more detail, our approach employs an ensemble²⁹ of deep, (fully) convolutional neural networks (CNN), each of which takes as input a Sentinel-2 optical image and transforms it into a dense canopy height map with the same GSD of 10 m (ref. ¹⁶) (Fig. 1a). Our unified, global model is trained with sparse supervision, using reference heights at globally distributed GEDI footprints derived from the raw waveforms³⁰. A dataset of 600 million samples is constructed by extracting Sentinel-2 image patches of 15 × 15 pixels around every GEDI footprint acquired between April and August in 2019 or 2020. The sparse GEDI data is rasterized to the Sentinel-2 10-m grid by setting the pixel corresponding to the centre of each GEDI footprint to the associated footprint height. In this way, during model training, one can optimize the loss function with respect to (w.r.t.) the model parameters only at valid reference pixels (Fig. 1a); whereas during map generation, the CNN model will nevertheless output a height prediction for every input pixel. To evaluate the model globally, we split the collected dataset at the level of Sentinel-2 tiles. Of the 100 km × 100 km regions defined by the Sentinel-2 tiling, 20% are held out for validation and the remaining 80% are used to train the model (the validation regions and the associated estimation errors are shown in Extended Data Fig. 1).

**Fig. 1: Model overview and global model evaluation on held-out GEDI reference data.**

An important goal of our work is low estimation error for tall vegetation because it indicates potentially high carbon stocks. To that end, we extend the CNN model in three ways (Fig. 1b). First, we equip the model with the ability to learn geographical priors³¹ by feeding it geographical coordinates (in a suitable cyclic encoding) as additional input channels (Fig. 1a). Second, we employ a fine tuning strategy where the sample loss is re-weighted inversely proportional to the sample frequency per 1-m height interval so as to counter the bias in the reference data towards low canopies (which reflects the long-tailed worldwide height distribution, where low vegetation dominates and high values are comparatively rare). Finally, we train an ensemble of CNNs and aggregate estimates from repeated Sentinel-2 observations of the same location, which reduces the underestimation of tall canopies even further. The combination of all three measures yields the best performance. The average root mean square error (aRMSE) of the height estimates, balanced across all 5-m height intervals, is 7.3 m, and the average mean error (aME, that is, bias) is −1.8 m (Fig. 1b). The root mean square error (RMSE) over all validation samples (without height balancing) is 6.0 m, with a bias of 1.3 m (Fig. 1c). The latter is due to a slight overestimation of low canopy heights and is the price we pay for improving the performance on tall canopies (Fig. 1b).

A biome-level analysis based on the 14 biomes defined by The Nature Conservancy (www.nature.org) shows how the bias varies across biomes (Fig. 1d and Extended Data Fig. 2). The model is able to correctly identify bare soil in deserts with zero height, with marginal error and no bias. The bias is also low in montane, temperate and tropical grasslands and in Mediterranean and tropical dry broadleaf forest, but higher in flooded grasslands. The most severe overestimation, on average ≈ 2.5 m, is observed for mangroves, tundra and tropical coniferous forests. The highest spread of residuals is observed in tropical and temperate coniferous forests and in the tundra, where we note that the latter is substantially underrepresented in the dataset, as GEDI’s range does not extend beyond 51.6° N. Furthermore, the GEDI reference data in these tundra regions (southern part of Kamchatka Mountain and Forest Tundra and Trans-Baikal Bald Mountain Tundra) appears rather noisy and contaminated with a notable number of outliers (Extended Data Fig. 2).

Comparison to existing canopy height estimates

We compare our estimates with a state-of-the-art global-scale canopy-top height map (henceforth referred to as UMD) that has been derived by combining GEDI data (RH95) with Landsat image composites²⁷. This UMD map relies on local model fitting and is created by combining the results of multiple regional models, which means it is not available beyond the GEDI coverage north of 51.6° latitude. For a fair comparison the UMD map with 30 m GSD is re-projected to the same Sentinel-2 10-m grid. Overall, our map reduces the underestimation bias from − 7.1 m to − 1.7 m w.r.t. the hold-out GEDI validation data (in total, 87 million footprints) when averaging the bias across height intervals (Extended Data Fig. 3a). The UMD map underestimates the reference data over the entire height range starting from 5 m canopy height, whereas the underestimation increases for canopy heights >30 m. While the UMD map has negligible bias for low vegetation <5 m, our map tends to overestimate some of the vegetation <5 m. Furthermore, our map has an overestimation bias of ≈ 2 m for heights ranging from 5 to 20 m and a bias of <1 m from 20 to 35 m. Starting from vegetation 35 m tall upwards, the negative bias grows with canopy-top height but is substantially lower compared to the UMD map. The bias also varies across biomes (Extended Data Fig. 3b). In most of the cases where the UMD map tends to underestimate the GEDI reference height, our map tends to overestimate. Exceptions where our map has a low bias of <1 m are: tropical dry broadleaf, tropical grassland, temperate grassland, flooded grasslands, montane grassland and Mediterranean forests. It is worth noting that our global model did not see these validation regions (each 100 × 100 km) during training; in contrast, the UMD approach fits a local model for each region.

Evaluation with independent airborne LiDAR

In addition, we compare our final map (and the UMD map) with independent reference data from two airborne LiDAR sources: (1) NASA’s Land, Vegetation, and Ice Sensor (LVIS) airborne LiDAR campaigns³², which were designed to deliver canopy-top heights comparable to GEDI¹² and (2) GEDI-like canopy-top height retrieved from high-resolution canopy height models derived from small-footprint airborne laser scanning (ALS) campaigns in Europe³³. We report error metrics within 24 Sentinel-2 tile regions (12 each for LVIS and ALS) from 11 countries in North and Central America and Europe (Extended Data Table 1) covering a diverse range of vegetation heights and biomes (Extended Data Fig. 4). In nine out of the 12 regions with UMD map data, our map yields lower random error (RMSE and mean absolute error (MAE)) and bias (mean error). While the UMD map underestimates the airborne LiDAR data in all regions, our map tends to overestimate the reference data in most, but not all, cases. The strongest differences are observed in regions with high average canopy height (that is 28–36 m in the United States (Oregon) and Gabon) where the UMD map has a bias ranging from −19 to −23% w.r.t. the average height, and our map yields a bias of −4% to −13% and 7% to 10%. In the rare cases where the underestimation bias of UMD map is lower than the overestimation bias of ours, we observe qualitatively that our map captures structure within high vegetation, where the UMD map saturates (for example, Netherlands; Extended Data Fig. 7c). Further qualitative comparisons against LVIS and ALS data are presented in Extended Data Fig. 7a–f. Comparing the mean error over regions within the GEDI coverage reveals that our map outperforms the UMD map on all error metrics. The UMD map yields an RMSE of 9.1 m and a bias of −4.5 m (−25.2%) and our map an RMSE of 7.9 m and a bias of 1.7 m (16.2%) (Extended Data Table 2).

Our approach allows us to map beyond the northernmost latitude with GEDI data for which we have 12 regions with independent airborne LiDAR data (Extended Data Table 1). Here we find a mean RMSE of 5.3 m and a bias of 0.5 m (19.4%) (Extended Data Table 2). In the northernmost region in Finland at 70° N latitude, the dominant vegetation structure is captured in our map with an RMSE of 3.0 m and a bias of 0.5 m (17.1%) (for example, Extended Data Fig. 8a). In other words, our estimates agree well with independent LVIS and ALS data, even outside the geographic range of the GEDI training data (qualitative examples in Extended Data Fig. 8a–f).

Modelling predictive uncertainty

Whereas deep learning models often exhibit high predictive skill and produce estimates with low overall error, the uncertainty of those estimates is typically not known or unrealistically low (that is, the model is over-confident)³⁴. But reliable uncertainty quantification is crucial to inform downstream investigations and decisions based on the map content⁸; for example, it can indicate which estimates are too uncertain and should be disregarded³⁰. To afford users of our CHM a trustworthy, spatially explicit estimate of the map’s uncertainty, we integrate probabilistic deep learning techniques. These methods are specifically designed to quantify also the predictive uncertainty, taking into account, on the one hand, the inevitable noise in the input data and, on the other hand, the uncertainty of the model parameters resulting from the lack of reference data for certain conditions³⁵. In particular, we independently train five deep CNNs that have identical structure but are initialized with different random weights. The spread of the predictions made by such a model ensemble²⁹ for the same pixel is an effective way to estimate model uncertainty (also known as epistemic uncertainty), even with small ensemble size³⁶. Each individual CNN is trained by maximizing the Gaussian likelihood rather than minimizing the more widely used squared error. Consequently, each CNN outputs not only a point estimate per pixel but also an associated variance that quantifies the uncertainty of that estimate (a.k.a. its aleatoric uncertainty)³⁵.

During inference, we process images from ten different dates (satellite overpasses) within a year at every location to obtain full coverage and exploit redundancy for pixels with multiple cloud-free observations. Each image is processed with a randomly selected CNN within the ensemble, which reduces computational overhead and can be interpreted as natural test-time augmentation, known to improve the calibration of uncertainty estimates with deep ensembles³⁷.

Finally, we use the estimated aleatoric uncertainties to merge redundant predictions from different imaging dates by weighted averaging proportional to the inverse variance. While inverse-variance weighting is known to yield the lowest expected error³⁸, we observe a deterioration of the uncertainty calibration for low values (<4 m standard deviation in Extended Data Fig. 5a). We also note that uncertainty calibration varies per biome (Extended Data Fig. 5c), so it may be advisable to re-calibrate in post-processing depending on the intended application and region of interest. Despite these observations, the estimated predictive uncertainty correlates well with the empirical estimation error and can therefore be used to filter out inaccurate predictions, thus lowering the overall error at the cost of reduced completeness (Extended Data Fig. 5b). For example, by filtering out the 20% most uncertain canopy height estimates, overall RMSE is reduced by 13% (from 6.0 m to 5.2 m) and the bias is reduced by 23% (from 1.3 m to 1.0 m).

Interestingly, the tendency to overestimate some of the low vegetation <5 m can also be removed by using our estimated uncertainty to filter out, for example, the 20% of validation points with the highest relative standard deviation (‘ETH (ours) 80%’ in Extended Data Fig. 3a). Here we follow the filtering protocol proposed in previous work using an adaptive threshold depending on the predicted canopy height to preserve the full canopy height range³⁰. The ability to identify erroneous estimates based on the predictive uncertainty is a unique characteristic of our proposed methodology and allows us to reduce random errors and biases (Extended Data Fig. 3).

Global canopy height map

The model has been deployed globally on the Sentinel-2 image archive for the year 2020 to produce a global map of canopy-top height. To cover the global landmass ( ≈ 1.3 × 10¹² pixels at the GSD of Sentinel-2), a total of ≈ 160 terabytes of Sentinel-2 image data are selected for processing. This required ≈ 27,000 graphics processing unit (GPU) hours ( ≈ 3 GPU years) of computation time, parallelized on a high-performance cluster to obtain the global map in ten days real time.

The new dense canopy height product at 10-m GSD makes it possible to gain insights into the fine-grained distribution of canopy heights anywhere on Earth and the associated uncertainty (Fig. 2). Three example locations (A–C in Fig. 2) demonstrate the level of canopy detail that the map reveals, ranging from harvesting patterns from forestry in Washington state, United States (A), through gallery forests along permanent rivers and ground water in the forest–savannah of northern Cameroon (B), to dense tropical broadleaf forest in Borneo, Malaysia (C).

**Fig. 2: Global canopy height map for the year 2020.**

Also at large scale, the predictive uncertainty is positively correlated with the estimated canopy height (Fig. 2b). Still, some regions such as Alaska, Yukon (northwestern Canada) and Tibet exhibit high predictive uncertainty, which cannot be explained only by the canopy height itself. The two former lie outside of the GEDI coverage, so the higher uncertainty is probably due to local characteristics that the model has not encountered during training. The latter indicates that also within the GEDI range, there are environments that are more challenging for the model, for example, due to globally rare ecosystem characteristics not encountered elsewhere. Ultimately, all three regions might be affected by frequent cloud cover (and snow cover), limiting the number of repeated observations. Qualitative examples with high uncertainty, but reasonable canopy-top height estimates, for Alaska are presented in Extended Data Fig. 8e,f.

Our new dataset enables a full, worldwide assessment of coverage of the global landmass with vegetation. Doing this for a range of thresholds recovers an estimate of the global canopy height distribution (for the year 2020; Fig. 3a and Extended Data Fig. 6a). We find that an area of 53.6 × 10⁶ km² (41% of the global landmass) is covered by vegetation with >5 m canopy height, 39.6 × 10⁶ km² (30%) by vegetation >10 m and 6.7 × 10⁶ km² (5%) by vegetation >30 m (Fig. 3b).

**Fig. 3: Global canopy height distributions of the entire landmass, protected areas and biomes.**

We see that protected areas (according to the World Database on Protected Areas, WDPA³⁹) tend to contain higher vegetation compared to the global average (Fig. 3a). Furthermore, 34% of all canopies >30 m fall into protected areas (Fig. 3a). Extended Data Fig. 6b provides examples of protected areas that show good agreement with mapped canopy height patterns. This analysis highlights the relevance of the new dataset for ecological and conservation purposes. For instance, canopy height and its spatial homogeneity can serve as an ecological indicator to identify forest areas with high integrity and conservation value. That task requires both dense area coverage at reasonable resolution and a high saturation level to locate very tall vegetation.

Finally, our new map makes it possible to analyse the exhaustive distributions of canopy heights at the biome level, revealing characteristic frequency distributions and trends within different types of terrestrial ecosystem (Fig. 3b). While, for instance, the canopy heights of tropical moist broadleaf forests follow a bimodal distribution with a mode >30 m, mangroves have a unimodal distribution with a large spread and heights ranging up to >40 m. Notably, our model has learned to predict reasonable canopy heights in tundra regions, despite scarce and noisy reference data for that ecosystem.

Discussion

The evaluation with hold-out GEDI reference data and the comparison with independent airborne LiDAR data show that the presented approach yields a new, carefully quality-assessed state-of-the-art global map product that includes quantitative uncertainty estimates. But the retrieval of canopy height from optical satellite images remains a challenging task and has its limitations.

Limitations and map quality

We note that despite the nominal 10-m ground sampling distance of our global map, the effective spatial resolution at which individual vegetation features can be identified is lower. As a consequence of the GEDI reference data used to train the model, each map pixel effectively indicates the largest canopy-top height within a GEDI footprint ( ≈ 25 m diameter) centred at the pixel. Two subtle reasons further impact the effective resolution, compared to a map learned from dense, local reference data (for example, airborne LiDAR¹⁶): the sparse supervision means that the model never explicitly sees high-frequency variations of the canopy height between adjacent pixels, and misalignments caused by the geolocation uncertainty (15–20 m) of the GEDI version 1 data^12,40,41 introduce further noise. While at present, we do not see this as severely limiting the utility of the map, in the future one could consider extending the method with techniques for guided super resolution⁴² to better preserve small features visible in the raw Sentinel-2 images, such as canopy gaps. Furthermore, using the latest GEDI release version with improved geolocation accuracy⁴¹ may improve the measured performance.

Regarding map quality, besides minor artefacts in regions with persistent cloud cover, we observe tiling artifacts at high latitudes in the northern hemisphere. The systematic inconsistencies at tile borders point at degradation of the absolute height estimates, possibly caused by a lack of training data for particular, locally unique vegetation structures. Interestingly, it appears that a notable part of these errors are constant offsets between the tiles.

Whereas our approach substantially reduces the error for tall canopies representing potentially high carbon stocks, we observe a tendency to overestimate some areas with very low canopy heights (<5 m). However, future downstream applications may use additional land cover masks or predictive uncertainty to identify and filter these biased estimates. The modelled spatially explicit uncertainty can be used to identify erroneous estimates. On a global scale, we observe that some regions are subject to overall high predictive uncertainty including tropical regions such as Papua New Guinea, but also regions in northern latitudes, for example, Alaska. This shows the importance of modelling the predictive uncertainty to allow transparent and informed use of the canopy height map. It also indicates that there are limits when using optical images as the only predictor for canopy height estimation and that future work could explore the combination with additional predictive environmental data to resolve ambiguities in visual observations.

Potential applications

In the context of the GEDI mission goals¹², our presented canopy height map may be used to fill the gaps in the gridded products at 1-km resolution where no GEDI tracks are available^43,44. According to the Global Forest Resources Assessment 2020⁴⁵, 31% of the global land area is covered by forests. Whereas our new global canopy height map can contribute to global forest cover estimates, such a forest definition relies not solely on canopy height, but also on connectivity and includes areas with trees likely to reach a certain height. Therefore, these derivations require more in situ data and threshold calibration. Furthermore, there are at least two major downstream applications that the new high-resolution canopy height dataset can help to advance at global scale, namely biomass and biodiversity modelling. Furthermore, our model can support monitoring of forest disturbances. Canopy-top height is a key indicator to study the global aboveground carbon stock stored in the form of biomass⁸. On a local scale, we compare our canopy height map with dense aboveground carbon density data⁴⁶ that was produced by a targeted airborne LiDAR campaign in Sabah, northern Borneo⁴⁷ (Fig. 4a). We observe that for natural tropical forests, the spatial patterns agree well and that our canopy height estimates from Sentinel-2 are predictive of carbon density even in tropical regions, with canopy heights up to 65 m. Notably, the relationship between carbon and canopy height is sensitive up to ≈ 60 m. We note that although it is technically possible to map biomass at 10-m ground sampling distance, this may not be meaningful in regions such as the tropics, with dense vegetation where single tree crowns may exceed a 10-m pixel. It is rather recommended to model biomass at coarser spatial resolutions (for example 0.25 ha) suitable to capture the variation of dense vegetation areas. Nevertheless, high-resolution canopy height data has great potential to improve global biomass estimates by providing descriptive statistics of the vegetation structure within local neighbourhoods.

We further demonstrate that our model can be deployed annually to map canopy height change over time, for example, to derive changes in carbon stock and estimate carbon emissions caused by global land-use changes, at present mainly deforestation⁴⁸. Annual canopy height maps are computed for a region in northern California where wildfires have destroyed large areas in 2020 (Fig. 4b). Our automated change map corresponds well with the mapped fire extent from the California Department of Forestry and Fire Protection (www.fire.ca.gov), while at the same time the annual maps are consistent in areas not affected by the fires, where no changes are expected. While the sensitivity of detectable changes such as annual growth might be limited by the model accuracy and remains to be evaluated (for example, with repeated GEDI shots or LiDAR campaigns), such high-resolution change data may potentially help to reduce the high uncertainty of emissions estimates that are reported in the annual Global Carbon Budget⁴⁸. It is worth mentioning that the presented approach yields reliable estimates based on single cloud-free Sentinel-2 images. Thus, its potential for monitoring changes in canopy height is not limited to annual maps but to the availability of cloud-free images that are taken at least every five days globally. This high update frequency makes it relevant for, for example, real-time deforestation alert systems, even in regions with frequent cloud cover.

A second line of potential applications includes biodiversity modelling as the high spatial resolution of the canopy height model brings about the possibility to characterize habitat structure and vegetation heterogeneity, known to promote a number of ecosystem services⁴⁹ and to be predictive of biodiversity^50,51. The relationship between heterogeneity and species diversity is founded in niche theory^50,51, which suggests that heterogeneous areas provide more ecological niches for different species to co-exist. Our dense map makes it possible to study second-order homogeneity⁹ (which is not easily possible with sparse data like GEDI) and down to a length scale of 10 to 20 m.

Technically, our dense high-resolution map makes it a lot easier for scientists to intersect sparse sample data, for example, field plots, with canopy height. To make full use of scarce field data in biomass or biodiversity research, dense complementary maps are a lot more useful: when pairing sparse field samples with other sparsely sampled data, the chances of finding enough overlap are exceedingly low; whereas pairing them with low-resolution maps risks biases due to the large-scale difference and associated spatial averaging.

Conclusion

We have developed a deep learning method for canopy height retrieval from Sentinel-2 optical images. That method has made it possible to produce a globally consistent, dense canopy-top height map of the entire Earth, with 10-m ground sampling distance. Besides Sentinel-2, the GEDI LiDAR mission also plays a key role as the source of sparse but uniquely well-distributed reference data at global scale. Compared to previous work that maps canopy height at global scales²⁷, our model substantially reduces the overall underestimation bias for tall canopies. Our model does not require local calibration and can therefore be deployed anywhere on Earth, including regions outside the GEDI coverage. Moreover, it also delivers spatially explicit estimates for the predictive uncertainties of the retrieved canopy heights. As our method, once trained, needs only image data, maps can be updated annually, opening up the possibility to track the progress of commitments made under the United Nation’s global forest goals to enhance carbon stock and forest cover by 2030². At the same time, the longer the GEDI mission will collect on-orbit data, the denser the reference data for our approach will become, which can be expected to diminish the predictive uncertainty and improve the effective resolution of its estimates.

As a possible future extension, our model could be extended to map other vegetation characteristics¹⁷ at global scale. In particular, it appears feasible to densely map biomass by retraining with GEDI L4A biomass data⁸ or by adding additional data from planned future space missions⁵².

Whereas deep learning technology for remote sensing is continuously being refined by focusing on improved performance at regional scale, its operational utility has been limited by the fact that it often could not be scaled up to global coverage. Our work demonstrates that if one has a way of collecting globally well-distributed reference data, modern deep learning can be scaled up and employed for global vegetation analysis from satellite images. We hope that our example may serve as a foundation on which new, scalable approaches can be built that harness the potential of deep learning for global environmental monitoring and that it inspires machine learning researchers to contribute to environmental and conservation challenges.

Methods

Data

This work builds on data from two ongoing space missions: the Copernicus Sentinel-2 mission operated by the European Space Agency (ESA) and NASA’s Global Ecosystem Dynamics Investigation (GEDI). The Sentinel-2 multi-spectral sensor delivers optical images covering the global landmass with a revisit time of, at most, five days on the equator. We use the atmospherically corrected L2A product, consisting of 12 bands ranging from the visible and near infrared to the short wave infrared. While the visible and near infrared bands have 10-m GSD, the other bands have a 20-m or 60-m GSD. For our purposes, all bands are upsampled with cubic interpolation to obtain a data cube with 10-m ground sampling distance. The GEDI mission is a space-based full-waveform LiDAR mounted on the International Space Station and measures vertical forest structure at sample locations with a 25-m footprint, distributed between 51.6° N and S. We use footprint-level canopy-top height data derived from these waveforms as sparse reference data^30,53. The canopy-top height is defined as RH98, the relative height at which 98% of the energy has been returned, and was derived from GEDI L1B version 1 waveforms⁵⁴ collected between April and August in the years 2019 and 2020.

To train the deep learning model, a global training dataset has been constructed within the GEDI range by combining the GEDI data and the Sentinel-2 imagery. For every Sentinel-2 tile, we select the image with the least cloud coverage between May and September 2020. Thus, the model is trained to be invariant against phenological changes within this period but is not designed to be robust outside of this period for regions experiencing high seasonality. Ultimately, the annual maps are computed on Sentinel-2 images from the same period. Image patches of 15 × 15 pixels (that is, 150 m × 150 m on the ground) are extracted from these images at every GEDI footprint location. Therefore, the GEDI data are rastered to the Sentinel-2 pixel grid by setting the canopy height reference value of the pixel that corresponds to the centre of the GEDI footprint. Locations for which the image patch is cloudy or snow covered are filtered out from the dataset. To correct noise injected by the geolocation uncertainty of the GEDI version 1 data⁴¹, we use the Sentinel-2 L2A scene classification and assign 0 m canopy height to footprints located in the categories ‘not vegetated’ or ‘water’. This procedure also addresses the slight positive bias due to slope in the GEDI reference data³⁰. Overall, the resulting dataset contains 600 × 10⁶ samples globally distributed within the GEDI range. All samples located within 20% of the Sentinel-2 tiles in that range (each 100 × 100 km) are set aside as validation data (Extended Data Fig. 1).

A second evaluation is carried out w.r.t. canopy-top heights independently derived from airborne LiDAR data from two sources. This includes NASA’s LVIS. LVIS is a large-footprint full-waveform LiDAR from which the LVIS L2 height metric RH98 is rastered to the Sentinel-2 10-m grid. The second source is high-resolution canopy height models (1-m GSD) derived from small-footprint ALS campaigns³³. To derive a comparable ‘GEDI-like’ canopy-top height metric within the 25-m footprint, we first run a circular max-pooling filter with a 12-m radius at the 1-m GSD resolution with a stride of 1 pixel (that is 1 m) before we resample the canopy height models to the Sentinel-2 10-m GSD (Supplementary Fig. 5 illustration). This processing is necessary as the maximum canopy height depends on the footprint size and avoids the comparison of systematically different canopy height metrics. Locations of the LVIS and ALS data are visualized in Extended Data Fig. 4.

Deep fully convolutional neural network

Our model is based on the fully convolutional neural network architecture proposed in prior work¹⁶. The architecture employs a series of residual blocks with separable convolutions⁵⁵ without any downsampling within the network. The sequence of learnable 3 × 3 convolutional filters is able to extract not only spectral but also textural features. To speed up the model for deployment at global scale, we reduce its size, setting the number of blocks to eight and the number of filters per block to 256. This speeds up the forward pass by a factor of ≈ 17 compared to the original, larger model. In our tests, the smaller version did not cause higher errors in an early phase of training. When trained long enough, a larger model with higher capacity may be able to reach lower prediction errors, but the higher computational cost of inference would limit its utility for repeated, operational use. The model takes the 12 bands from Sentinel-2 L2A product and the cyclic encoded geographical coordinates per pixel as input for a total of 15 input channels. Its outputs are two channels with the same spatial dimension as the input, one for the mean height and one for its variance (Fig. 1). Because the architecture is fully convolutional, it can process arbitrarily sized input image patches, which is useful when deploying at large scale.

Model training with sparse supervision

Formally, canopy height retrieval is a pixel-wise regression task. We train the regression model end to end in supervised fashion, which means that the model learns to transform raw image data into spectral and textural features predictive of canopy height, and there is no need to manually design feature extractors (Supplementary Fig. 3). We train the convolutional neural network with sparse supervision, that is, by selectively minimizing the loss (equation (1)) only at pixel locations for which there is a GEDI reference value. Before feeding the 15-channel data cube to the CNN, each channel is normalized to be standard normal, using the channel statistics from the training set. The reference canopy heights are normalized in the same way, a common pre-processing step to improve the numerical stability of the training. Each neural network is trained for 5,000,000 iterations with a batch size of 64, using the Adam optimizer⁵⁶. The base learning rate is initially set to 0.0001 and then reduced by factor 0.1 after 2,000,000 iterations and again after 3,500,000 iterations. This schedule was found to stabilize the uncertainty estimation.

Modelling the predictive uncertainty

Modelling uncertainty in deep neural networks is challenging due to their strong nonlinearity but crucial to build trustworthy models. The approach followed in this work accounts for two sources of uncertainty, namely the data (aleatoric) and the model (epistemic) uncertainty³⁵. The uncertainty in the data, resulting from noise in the input and reference data, is modelled by minimizing the Gaussian negative log likelihood (equation (1)) as a loss function³⁵. This corresponds to independently representing the model output at every pixel i as a conditional Gaussian probability distribution over possible canopy heights, given the input data, and estimating the mean $\hat{\mu }$ and variance ${\hat{\sigma }}^{2}$ of that distribution.

$${{{{\mathcal{L}}}}}_\mathrm{NLL}=\frac{1}{N}\mathop{\sum }\limits_{i=1}^{N}\frac{{\left(\,{\hat{\mu} }({x}_{i})-{y}_{i}\right)}^{2}}{2{\hat{\sigma }}^{2}({x}_{i})}+\frac{1}{2}\log {\hat{\sigma }}^{2}({x}_{i}).$$

(1)

To account for the model uncertainty, which in high-capacity neural network models can be interpreted as the model’s lack of knowledge about patterns not adequately represented in the training data, we train an ensemble²⁹ of five CNNs from scratch, that is, each time starting the training from a different randomly initialized set of model weights (learnable parameters). At inference time, we process images from T different acquisition dates (here T = 10) for every location to obtain full coverage and to exploit redundancy in the case of repeated cloud-free observations of a pixel. Each image is processed with one CNN picked randomly from the ensemble. This procedure incurs no additional computational cost compared to processing all images with the same CNN. It can be interpreted as a natural variant of test-time augmentation, which has been demonstrated to improve the calibration of uncertainty estimates from deep ensembles in the domain of computer vision³⁷. Finally, the per-image estimates are merged into a final map by averaging with inverse-variance weighting (equation (3)). If the variance estimates of all ensemble members are well calibrated, this results in the lowest expected error³⁸. Thus the variance of the final per-pixel estimate is computed with the weighted version of the law of total variance (equation (4))³⁵. For readability we omit the pixel index i.

$${\hat{p}}_{t}=\frac{1/{\hat{\sigma }}_{t}^{2}}{\mathop{\sum }\nolimits_{{j = 1}}^{\quad \, \, {T}}1/{{\hat{\sigma }}_{j}}^{2}},$$

(2)

$${\hat{y}}=\mathop{\sum }\limits_{t=1}^{T}{\hat{p}}_{t}{\hat{\mu }}_{t},$$

(3)

$${{{\rm{Var}}}}(\,{\hat{y}})=\mathop{\sum }\limits_{t=1}^{T}{\hat{p}}_{t}{\hat{\mu }}_{t}^{2}-{\left(\mathop{\sum }\limits_{t = 1}^{T}{\hat{p}}_{t}{\hat{\mu }}_{t}\right)}^{2}+\mathop{\sum }\limits_{t=1}^{T}{\hat{p}}_{t}{\hat{\sigma }}_{t}^{2},$$

(4)

Correction for imbalanced height distribution

We find that the underestimation bias on tall canopies is partially due to the imbalanced distribution of reference labels (and canopy heights overall), where large height values occur rarely. To mitigate it, we fine tune the converged model with a cost-sensitive version of the loss function. A softened version of inverse sample-frequency weighting is used to re-weight the influence of individual samples on the loss (equation (5)). To establish the frequency distribution of the continuous canopy height values in the training, we bin them into 1-m height intervals and in each of the resulting K bins count the number of samples N_k. Empirically, for our task, the moderated reweighting with the square root of the inverse frequency works better (leaving all other hyper-parameters unchanged). Moreover, we do not fine tune all model parameters but only the final regression layer that computes mean height (Fig. 1a). We observe that the uncertainty calibration is preserved when fine tuning only the regression weights for the mean (‘S2 + geo balanced: mean’ in Extended Data Fig. 5a), whereas fine tuning also the regression of the variance decalibrates the uncertainty estimation (‘S2 + geo balanced: mean&var’). The fine tuning is run for 750,000 iterations per network.

$${q}_{i}=\frac{\sqrt{1/{N}_{k,i\in k}}}{\mathop{\sum }\nolimits_{j = 1}^{K}\sqrt{1/{N}_{j}}}$$

(5)

Evaluation metrics

Several metrics are employed to measure prediction performance: the RMSE (equation (6)) of the predicted heights, their MAE (equation (7)) and their mean error (ME, equation (8)). The latter quantifies systematic height bias, where a negative ME indicates underestimation, that is, predictions that are systematically lower than the reference values.

$${{{\rm{RMSE}}}}=\sqrt{\frac{1}{N}\mathop{\sum }\limits_{i=1}^{N}{\left({\hat{y}}_{i}-{y}_{i}\right)}^{2}}$$

(6)

$${{{\rm{MAE}}}}=\frac{1}{N}\mathop{\sum }\limits_{i=1}^{N}|\, {\hat{y}}_{i}-{y}_{i}|$$

(7)

$${{{\rm{ME}}}}=\frac{1}{N}\mathop{\sum }\limits_{i=1}^{N}\left({\hat{y}}_{i}-{y}_{i}\right)$$

(8)

We also report balanced versions of these metrics, where the respective error is computed separately in each 5-m height interval and then averaged across all intervals. They are abbreviated as aRMSE, aMAE and aME (Fig. 1b).

The bias analyses with the independent airborne LiDAR data include the normalized mean error (NME, equation (7)) in percentage, where the mean error is divided by the average of the reference values $\bar{y}$:

$${{{\rm{NME}}}}=\frac{100}{\bar{y}N}\mathop{\sum }\limits_{i=1}^{N}\left({\hat{y}}_{i}-{y}_{i}\right)$$

(9)

For the estimated predictive uncertainties, there are, by definition, no reference values. A common scheme to evaluate their calibration is to produce calibration plots^34,57 that show how well the uncertainties correlate with the empirical error. As this correlation holds only in expectation, both the uncertainties and the empirical errors at the test samples must be binned into K equally sized intervals. In each bin B_k, the average of the predicted uncertainties is then compared against the actual average deviation between the predicted height and the reference data. On the basis of the calibration plots, it is further possible to derive a scalar error metric for the uncertainty calibration, the uncertainty calibration error (UCE) (equation (10))⁵⁷. Again, we additionally report a balanced version, the average uncertainty calibration error (AUCE) (equation (11)), where each bin has the same weight independent of the number N_k of samples in it.

$${{{\rm{UCE}}}}=\mathop{\sum }\limits_{k=1}^{K}\frac{{N}_{k}}{N}| \mathrm{err}({B}_{k})-\mathrm{uncert}({B}_{k})|$$

(10)

$${{{\rm{AUCE}}}}=\frac{1}{K}\mathop{\sum }\limits_{k=1}^{K}| \mathrm{err}({B}_{k})-\mathrm{uncert}({B}_{k})|$$

(11)

In our case err(B_k) represents the RMSE of the samples falling into bin B_k, and the bin uncertainty uncert(B_k) is defined as the root mean variance (RMV):

$${{{\rm{RMV}}}}=\sqrt{\frac{1}{{N}_{k}}\mathop{\sum}\limits_{i\in {B}_{k}}{\hat{u}}_{i}},$$

(12)

with ${\hat{u}}_{i}={\hat{\sigma}}_{i}^{2}$ when evaluating the calibration of a single CNN and ${\hat{u}}_{i}={{{\rm{Var}}}}(\,{\hat{y}}_{i})$ when evaluating the calibration of the ensemble. We refer to the RMV as the predictive standard deviation in our calibration plots (Extended Data Fig. 5a,c).

Global map computation

Sentinel-2 imagery is organized in 100 km × 100 km tiles; a total of 18,011 tiles cover the entire landmass of the Earth, excluding Antarctica. However, depending on the ground tracks of the satellites, some tiles are covered by multiple orbits, whereas, in general, no more than two orbits are needed to get full coverage. To optimize computational overhead, we select the relevant orbits per tile by using those with the smallest number of empty pixels, according to the metadata. For every tile and relevant orbit, the ten images with least cloud cover between May and September 2020 are selected for processing.

While it only takes ≈ 2 minutes to process a single image tile with the CNN on a GeForce RTX 2080 Ti GPU, downloading the image from the Amazon Web Service S3 bucket takes about 1 minute, and loading the data into memory takes about 4 minutes. To process a full tile with all ten images per orbit takes between 1 and 2.5 hours, depending on the number of relevant orbits (one or two).

We apply only minimal post-processing and mask out built-up areas, snow, ice and permanent water bodies according to the ESA WorldCover classification⁵⁸, setting their canopy height to ‘no data’ (value: 255). The canopy height product is released in the form of 3° × 3° tiles in geographic longitude/latitude, following the format of the recent ESA WorldCover product. This choice shall simplify the integration of our map into existing workflows and the intersection of the two products. Note that the statistics in the present paper were not computed from those tiles but in Gall–Peters orthographic equal-area projection with 10-m GSD for exact correspondence between pixel counts and surface areas.

Energy and carbon emissions footprint

The presented map has been computed on a GPU cluster located in Switzerland. Carbon accounting for electricity is a complex endeavour, due to differences in how electricity is produced and distributed. To put the power consumption needed to produce global maps with our method into context, we estimate carbon emissions for two scenarios, where the computation is run on Amazon Web Services (AWS) in two different locations: European Union (Stockholm) and United States East (Ohio). With ≈ 250 W to power one of our GPUs, we get a total energy consumption of 250 W × 27,000 h = 6,750 kWh for the global map. The conversion to emissions highly depends on the carbon efficiency of the local power grid. For European Union (Stockholm), we obtain an estimated 338 kg CO₂-equivalent, whereas for United States East (Ohio), we obtain 3,848 kg CO₂-equivalent, a difference by a factor >10. Whereas the former is comparable to driving an average car from Los Angeles to San Francisco and back (1,360 km), the latter corresponds to a round trip from Seattle (United States) to San José, Costa Rica (15,500 km). These estimates were conducted using the Machine Learning Impact calculator (ref. ⁵⁹). For the carbon footprint of the current map (not produced with AWS), we estimate ≈ 729 kg CO₂-equivalent, using an average of 108 g CO₂-equivalent kWh⁻¹ for Switzerland, as reported for the year 2017⁶⁰.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

A summary of all links to data, browser application and code can be found on the project page at langnico.github.io/globalcanopyheight. Global map: the global canopy height map for 2020 is available for download at https://doi.org/10.3929/ethz-b-000609802. Individual tiles can be downloaded at langnico.github.io/globalcanopyheight/assets/tile_index.html. The map is also available on the Google Earth Engine (GEE assets on project page). The global map can be explored interactively in this browser application: nlang.users.earthengine.app/view/global-canopy-height-2020. GEDI footprint data: sparse footprint-level RH98 estimates used as reference data for developing the presented model are available on Zenodo at https://doi.org/10.5281/zenodo.7737946 (ref. ⁵³), which is the filtered version of the full orbit predictions for 2019, https://doi.org/10.5281/zenodo.5704852 (ref. ⁶¹), and 2020, https://doi.org/10.5281/zenodo.7737869 (ref. ⁶²). Training and validation datasets: the global training and validation dataset with image patches and rastered reference data is available at https://doi.org/10.3929/ethz-b-000609845. The rastered airborne LiDAR canopy height models from LVIS and ALS used for validation are available at https://doi.org/10.5281/zenodo.7885699. ESA WorldCover: a derived version of the original ESA WorldCover 10-m 2020 v100⁵⁸ re-projected to the Sentinel-2 UTM Tiling Grid for the global land surface is available at https://doi.org/10.5281/zenodo.7888150 (ref. ⁶³).

Code availability

The source code and the trained models are available via Github, github.com/langnico/global-canopy-height-model.

Change history

19 February 2024
A Correction to this paper has been published: https://doi.org/10.1038/s41559-024-02371-2

References

Manning, P. et al. Redefining ecosystem multifunctionality. Nat. Ecol. Evol. 2, 427–436 (2018).
Article PubMed Google Scholar
United Nations Strategic Plan for Forests 2017–2030. (United Nations); https://www.un.org/esa/forests/documents/un-strategic-plan-for-forests-2030/index.html (2017).
Hoang, N. T. & Kanemoto, K. Mapping the deforestation footprint of nations reveals growing threat to tropical forests. Nat. Ecol. Evol. 5, 845–853 (2021).
Article PubMed Google Scholar
Hansen, M. C. et al. High-resolution global maps of 21st-century forest cover change. Science 342, 850–853 (2013).
Article CAS PubMed Google Scholar
Skidmore, A. K. et al. Priority list of biodiversity metrics to observe from space. Nat. Ecol. Evol. 5, 896–906 (2021).
Article PubMed Google Scholar
Jucker, T. et al. Allometric equations for integrating remote sensing imagery into forest monitoring programmes. Glob. Change Biol. 23, 177–190 (2017).
Article Google Scholar
Migliavacca, M. et al. The three major axes of terrestrial ecosystem function. Nature 598, 468–472 (2021).
Article CAS PubMed PubMed Central Google Scholar
Duncanson, L. et al. Aboveground biomass density models for NASA’s Global Ecosystem Dynamics Investigation (GEDI) lidar mission. Remote Sens. Environ. 270, 112845 (2022).
Article Google Scholar
Tuanmu, M.-N. & Jetz, W. A global, remote sensing-based characterization of terrestrial habitat heterogeneity for biodiversity and ecosystem modelling. Glob. Ecol. Biogeogr. 24, 1329–1339 (2015).
Article Google Scholar
De Frenne, P. et al. Global buffering of temperatures under forest canopies. Nat. Ecol. Evol. 3, 744–749 (2019).
Article PubMed Google Scholar
Jucker, T. et al. Canopy structure and topography jointly constrain the microclimate of human-modified tropical landscapes. Glob. Change Biol. 24, 5243–5258 (2018).
Article Google Scholar
Dubayah, R. et al. The global ecosystem dynamics investigation: high-resolution laser ranging of the Earth’s forests and topography. Sci. Remote Sens. 1, 100002 (2020).
Article Google Scholar
Valbuena, R. et al. Standardizing ecosystem morphological traits from 3D information sources. Trends Ecol. Evol. 35, 656–667 (2020).
Article CAS PubMed Google Scholar
Gibbs, H. K., Brown, S., Niles, J. O. & Foley, J. A. Monitoring and estimating tropical forest carbon stocks: making redd a reality. Environ. Res. Lett. 2, 045023 (2007).
Article Google Scholar
Rodríguez-Veiga, P., Wheeler, J., Louis, V., Tansey, K. & Balzter, H. Quantifying forest biomass carbon stocks from space. Curr. For. Rep. 3, 1–18 (2017).
Article Google Scholar
Lang, N., Schindler, K. & Wegner, J. D. Country-wide high-resolution vegetation height mapping with Sentinel-2. Remote Sens. Environ. 233, 111347 (2019).
Article Google Scholar
Becker, A. et al. Country-wide retrieval of forest structure from optical and SAR satellite imagery with deep ensembles. Preprint at https://doi.org/10.48550/arXiv.2111.13154 (2021).
Lang, N., Schindler, K. & Wegner, J. D. High carbon stock mapping at large scale with optical satellite imagery and spaceborne LIDAR. Preprint at https://doi.org/10.48550/arXiv.2107.07431 (2021).
Rodríguez, A. C., D’Aronco, S., Schindler, K. & Wegner, J. D. Mapping oil palm density at country scale: an active learning approach. Remote Sens. Environ. 261, 112479 (2021).
Article Google Scholar
Jumper, J. et al. Highly accurate protein structure prediction with alphafold. Nature 596, 583–589 (2021).
Article CAS PubMed PubMed Central Google Scholar
Ravuri, S. et al. Skilful precipitation nowcasting using deep generative models of radar. Nature 597, 672–677 (2021).
Article CAS PubMed PubMed Central Google Scholar
Reichstein, M. et al. Deep learning and process understanding for data-driven earth system science. Nature 566, 195–204 (2019).
Article CAS PubMed Google Scholar
Tuia, D. et al. Perspectives in machine learning for wildlife conservation. Nat. Commun. 13, 792 (2022).
Article CAS PubMed PubMed Central Google Scholar
Kattenborn, T., Leitloff, J., Schiefer, F. & Hinz, S. Review on convolutional neural networks (CNN) in vegetation remote sensing. ISPRS J. Photogramm. Remote Sens. 173, 24–49 (2021).
Article Google Scholar
Gorelick, N. et al. Google Earth Engine: planetary-scale geospatial analysis for everyone. Remote Sens. Environ. 202, 18–27 (2017).
Article Google Scholar
Hansen, M. C. et al. Mapping tree height distributions in sub-Saharan Africa using Landsat 7 and 8 data. Remote Sens. Environ. 185, 221–232 (2016).
Article Google Scholar
Potapov, P. et al. Mapping global forest canopy height through integration of GEDI and Landsat data. Remote Sens. Environ. 253, 112165 (2021).
Article Google Scholar
Healey, S. P., Yang, Z., Gorelick, N. & Ilyushchenko, S. Highly local model calibration with a new GEDI LiDAR asset on Google Earth Engine reduces Landsat forest height signal saturation. Remote Sens. 12, 2840 (2020).
Article Google Scholar
Lakshminarayanan, B., Pritzel, A. & Blundell, C. Simple and scalable predictive uncertainty estimation using deep ensembles. In Proc. 31st International Conference on Neural Information Processing Systems 6405–6416 (Curran Associates, Inc., Red Hook, 2017).
Lang, N. et al. Global canopy height regression and uncertainty estimation from GEDI LIDAR waveforms with deep ensembles. Remote Sens. Environ. 268, 112760 (2022).
Article Google Scholar
Tang, K., Paluri, M., Fei-Fei, L., Fergus, R. & Bourdev, L. Improving image classification with location context. In Proc. IEEE International Conference on Computer Vision 1008–1016 (IEEE Computer Society, Los Alamitos, 2015).
Blair, J. Processing of NASA LVIS elevation and canopy (LGE, LCE and LGW) data products, version 1.0. NASA https://lvis.gsfc.nasa.gov (2018).
Liu, S. et al. The overlooked contribution of trees outside forests to tree cover and woody biomass across Europe. Preprint at Research Square https://doi.org/10.21203/rs.3.rs-2573442/v1 (2023).
Guo, C., Pleiss, G., Sun, Y. & Weinberger, K.Q. On calibration of modern neural networks. In Proc. 34th International Conference on Machine Learning 1321–1330 (ML Research Press, 2017).
Kendall, A. & Gal, Y. What uncertainties do we need in Bayesian deep learning for computer vision? In Proc. 31st International Conference on Neural Information Processing Systems 5580–5590 (Curran Associates, Inc., Red Hook, 2017).
Ovadia, Y. et al. Can you trust your model’s uncertainty? Evaluating predictive uncertainty under dataset shift. In Proc. 33rd Conference on Neural Information Processing Systems 13991–14002 (Curran Associates, Inc., Red Hook, 2019).
Ashukha, A., Lyzhov, A., Molchanov, D. & Vetrov, D. Pitfalls of in-domain uncertainty estimation and ensembling in deep learning. In Proc. 8th International Conference on Learning Representations (Curran Associates, Inc., Red Hook, 2020); https://openreview.net/forum?id=BJxI5gHKDr, https://dblp.org/rec/conf/iclr/AshukhaLMV20.bib
Strutz, T. Data Fitting and Uncertainty: A Practical Introduction to Weighted Least Squares and Beyond (Vieweg and Teubner, 2010).
Protected Planet: The World Database on Protected Areas (WDPA) (UNEP-WCMC & IUCN, 2021); https://www.protectedplanet.net/en
Roy, D. P., Kashongwe, H. B. & Armston, J. The impact of geolocation uncertainty on GEDI tropical forest canopy height estimation and change monitoring. Sci. Remote Sens. 4, 100024 (2021).
Article Google Scholar
Tang, H. et al. Evaluating and mitigating the impact of systematic geolocation error on canopy height measurement performance of gedi. Remote Sens. Environ. 291, 113571 (2023).
Article Google Scholar
De Lutio, R., D’Aronco, S., Wegner, J. D. & Schindler, K. Guided super-resolution as pixel-to-pixel transformation. In Proc. IEEE/CVF International Conference on Computer Vision 8828–8836 (IEEE Computer Society, Los Alamitos, 2019).
Dubayah, R. et al. GEDI launches a new era of biomass inference from space. Environ. Res. Lett. 17, 095001 (2022).
Article Google Scholar
Dubayah, R. et al. GEDI L3 Gridded Land Surface Metrics Version 1 (ORNL DAAC, 2021); https://doi.org/10.3334/ORNLDAAC/1865
Global Forest Resources Assessment 2020: Main Report (FAO, 2020); https://doi.org/10.4060/ca9825en
Asner, G. P., Brodrick, P. G. & Heckler, J. Global airborne observatory: forest canopy height and carbon stocks for Sabah, Borneo Malaysia. Zenodo https://doi.org/10.5281/zenodo.4549461 (2021).
Asner, G. P. et al. Mapped aboveground carbon stocks to advance forest conservation and recovery in Malaysian Borneo. Biol. Conserv. 217, 289–310 (2018).
Article Google Scholar
Friedlingstein, P. et al. Global carbon budget 2021. Earth Syst. Sci. Data Discuss. 14, 1917–2005 (2022).
Article Google Scholar
Felipe-Lucia, M. R. et al. Multiple forest attributes underpin the supply of multiple ecosystem services. Nat. Commun. 9, 4839 (2018).
Article PubMed PubMed Central Google Scholar
MacArthur, R. H. & MacArthur, J. W. On bird species diversity. Ecology 42, 594–598 (1961).
Article Google Scholar
Tews, J. et al. Animal species diversity driven by habitat heterogeneity/diversity: the importance of keystone structures. J. Biogeogr. 31, 79–92 (2004).
Article Google Scholar
Le Toan, T. et al. The biomass mission: objectives and requirements. In IGARSS 2018–2018 IEEE International Geoscience and Remote Sensing Symposium 8563–8566 (IEEE, Piscataway, 2018).
Lang, N. et al. Filtered canopy top height estimates from GEDI LIDAR waveforms for 2019 and 2020. Zenodo https://doi.org/10.5281/zenodo.7737946 (2023).
Dubayah, R. et al. GEDI L1B Geolocated Waveform Data Global Footprint Level V001 (NASA, 2020).
Chollet, F. Xception: deep learning with depthwise separable convolutions. In Proc. of the IEEE Conference on Computer Vision and Pattern Recognition 1800–1807 (IEEE Computer Society, Los Alamitos, 2017).
Kingma, D. P. & Ba, J. Adam: a method for stochastic optimization. In Proc. 3rd International Conference on Learning Representations (Eds. Bengio, Y. & LeCun, Y.) (Curran Associates, Inc., Red Hook, 2015).
Laves, M.-H., Ihler, S., Fast, J. F., Kahrs, L. A. & Ortmaier, T. Well-calibrated regression uncertainty in medical imaging with deep learning. In Proc. 3rd Conference on Medical Imaging with Deep Learning 393–412 (ML Research Press, 2020).
Zanaga, D. et al. ESA WorldCover 10 m 2020 v100. Zenodo https://doi.org/10.5281/zenodo.5571936 (2021).
Lacoste, A., Luccioni, A., Schmidt, V. & Dandres, T. Quantifying the carbon emissions of machine learning. Preprint at https://doi.org/10.48550/arXiv.1910.09700 (2019).
Rüdisüli, M., Romano, E., Eggimann, S. & Patel, M. K. Decarbonization strategies for Switzerland considering embedded greenhouse gas emissions in electricity imports. Energy Policy 162, 112794 (2022).
Article Google Scholar
Lang, N. et al. Global canopy top height estimates from GEDI LIDAR waveforms for 2019. Zenodo https://doi.org/10.5281/zenodo.5704852 (2021).
Lang, N. et al. Global canopy top height estimates from GEDI LIDAR waveforms for 2020. Zenodo https://doi.org/10.5281/zenodo.7737869 (2023).
Lang, N., Schindler, K. & Wegner, J. D. ESA WorldCover 10 m 2020 v100 reprojected to the Sentinel-2 UTM tiling grid. Zenodo https://doi.org/10.5281/zenodo.7888150 (2023).

Download references

Acknowledgements

The project received funding from Barry Callebaut Sourcing AG as part of a research project agreement. The Sentinel-2 data access for the computation of the global map was funded by AWS Cloud credits for research, courtesy of P. Gehler. We thank Y. Sica for sharing a rastered version of the WDPA data and S. Liu and M. Brandt for sharing the rastered ALS canopy height models in Europe. We thank N. Gorelick and S. Ilyushchenko for providing the resources to make our global map available on the Google Earth Engine. We greatly appreciate the open data policies of the ESA Copernicus program, the NASA GEDI mission and the LVIS campaigns. N.L. acknowledges support by the research grant DeReEco (grant number 34306) from Villum Foundation and by the Pioneer Centre for AI, Danish National Research Foundation grant number P1.

Funding

Open access funding provided by Swiss Federal Institute of Technology Zurich.

Author information

Authors and Affiliations

EcoVision Lab, Photogrammetry and Remote Sensing, ETH Zürich, Zürich, Switzerland
Nico Lang, Konrad Schindler & Jan Dirk Wegner
Department of Computer Science, University of Copenhagen, Copenhagen, Denmark
Nico Lang
Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT, USA
Walter Jetz
Institute for Computational Science, University of Zurich, Zürich, Switzerland
Jan Dirk Wegner

Authors

Nico Lang
View author publications
You can also search for this author in PubMed Google Scholar
Walter Jetz
View author publications
You can also search for this author in PubMed Google Scholar
Konrad Schindler
View author publications
You can also search for this author in PubMed Google Scholar
Jan Dirk Wegner
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

N.L. developed the code and carried out the experiments under the guidance of J.D.W. and K.S. W.J. gave suggestions and feedback on the global analyses. All authors contributed to the article and the analyses of the results.

Corresponding authors

Correspondence to Nico Lang or Jan Dirk Wegner.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Ecology & Evolution thanks the anonymous reviewers for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Extended data

Extended Data Fig. 1 Geographical error analysis on held-out GEDI validation data at 1 degree resolution ( ≈ 111 km on the equator).

a) Root mean square error (RMSE). b) Mean absolute error (MAE). c) Mean error (ME), where negative ME means an underestimation bias when the predictions are lower than the reference values.

Extended Data Fig. 2 Biome-level evaluation with GEDI reference data.

Biome-level confusion plots showing the relationship between GEDI reference data and the estimated canopy top height from Sentinel-2.

Extended Data Fig. 3 Comparison of canopy top height estimates from ETH (ours) and UMD against the hold-out GEDI validation data.

a) Residual analysis w.r.t. GEDI reference canopy height. The boxplot shows the median, the quartiles, and the 10th and 90th percentiles (n=87,406,779 “UMD" and “ETH (ours)"; n=69,925,423 “ETH (ours) 80%"). b) Residual analysis w.r.t. biomes defined by The Nature Conservancy. Negative residuals indicate that estimates are lower than reference values. “ETH (ours) 80%" is a filtered version of our estimates where the 20% of estimates with the highest relative standard deviation are removed. This filtering follows the protocol proposed in previous work using an adaptive threshold depending on the predicted canopy height to preserve the full canopy height range³⁰. The distributions over the full height range are compared in Supplementary Information Fig. S2. The boxplot shows the median, the quartiles, and the 10th and 90th percentiles (n=87,406,779 “UMD" and “ETH (ours)"; n=69,925,423 “ETH (ours) 80%").

Extended Data Fig. 4 Locations of independent airborne LIDAR campaigns.

Independent airborne LIDAR data from 24 regions including 12 regions with LVIS LIDAR and 12 regions with small-footprint airborne laser scanning (ALS) campaigns.

Extended Data Fig. 5 Evaluation of the estimated uncertainty.

a) Calibration plot showing the relationship between the estimated predictive uncertainty and the empirical error. b) Improvement of overall error metrics when filtering out the most uncertain canopy height predictions with the help of the estimated predictive uncertainty. c) Biome-level calibration plots, showing the relationship between the estimated predictive uncertainty and the empirical error w.r.t. GEDI reference data.

Extended Data Fig. 6 Protected area analysis according to the World Database on Protected Areas (WDPA)39.

a) Cumulative area covered by vegetation above a given height in protected areas and unprotected areas. The sum at height 0 equals the area of the global landmass (excluding Antarctica). b) Examples where the dense canopy height map reveals the spatial patterns of protected areas. Left: “Devil’s Staircase Wilderness" containing federally protected old-growth forest stands in the Oregon Coast Range. Center: “Ulu Temburong National Park" in Brunei, Borneo, established in 1991. Right: Protected areas in Ghana that indicate the strong impact of protection measures on the growing vegetation.

Extended Data Fig. 7 Qualitative examples comparing UMD and ETH (ours) against GEDI-like canopy top height derived from small-footprint airborne laser scanning (ALS) campaigns (a-c) and LVIS LIDAR (d-f).

a) Switzerland (from tile region 32TMT with average height 14.1m). Our map with RMSE: 8.0m, bias: 1.5 m (10.5%) outperforms the UMD map with RMSE: 10.1m, bias: -5.4 m (-37.4%). b) Spain (from tile region 30SWH with low average height 6.0m). Our map with RMSE: 4.7m, bias: 0.5 m (7.7%) outperforms the UMD map with RMSE: 5.2m, bias: -3.3 m (-54.9%). c) Netherlands (from tile region 31UFT with average height 6.6m). Our map with RMSE: 6.4m, bias: 3.9 m (59.9%) yields higher error than the UMD map with RMSE: 5.9m, bias: -2.2 m (-37.5%). In both b) and c), the UMD map underestimates high vegetation and misses small structures in low vegetation areas. d) Gabon, Lopé National Park (from the tile region 32MQE with the highest average height of 36.2 m). Our map with RMSE: 8.9m, bias: -4.8 m (-13.3%) outperforms the UMD map with RMSE: 11.4m, bias: -7.9 m (-21.8%). e) US, Oregon (from tile region 10TER with an average height of 28.6m). Our map with RMSE: 9.6m, bias: 2.9m, (10.0%) outperforms the UMD map with RMSE: 12.0 m, bias: -5.4 m (-19.3%). f) Costa Rica (from tile region 16PHS with average height 16.7m). Our map with RMSE: 9.2m, bias: 5.9 m (35.4%) yields a higher error than the UMD map with RMSE: 8.1m, bias: -3.0 m (-18.0%).

Extended Data Fig. 8 Qualitative examples beyond GEDI coverage (that is north of 51.6^∘ latitude) against GEDI-like canopy top height derived from small-footprint airborne laser scanning (ALS) campaigns (a-c) and LVIS LIDAR (d-f).

a) Finland (from tile region 35WNT with average height 3.0m). Our map yields an RMSE: 3.0 m and bias: 0.5 m (17.1%). b) Finland (from tile region 35VNL with average height 15.3m). Our map yields an RMSE: 5.7 m and bias: -2.6 m (-17.2%). c) Wales (from tile region 30UVD with average height 3.3m). An error case with an RMSE: 7.3 m and bias: 5.7 m (171.1%). Our map overestimates the ALS reference data, especially in low vegetation areas. d) Canada (from tile region 08WNA with average height 4.3m). Our map yields an RMSE: 2.8 m and bias: 0.7 m (16.4%). e) US, Alaska (from tile region 06VXR with average height 9.3m). Our map yields an RMSE: 6.4 m and bias: 2.7 m (29.2%). f) US, Alaska (from tile region 06WVT with average height 6.8m). Our map yields an RMSE: 5.3 m and bias: 0.6 m (8.7%). Both e) and f) show high predictive uncertainty, yet the canopy top height estimates are reasonable.

Extended Data Table 1 Tile-level evaluation with independent airborne LIDAR data

Full size table

Extended Data Table 2 Averaged evaluation with independent airborne LIDAR data

Full size table

Supplementary information

Supplementary Information

Supplementary Figs. 1–5.

Reporting Summary

Peer Review File

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Lang, N., Jetz, W., Schindler, K. et al. A high-resolution canopy height model of the Earth. Nat Ecol Evol 7, 1778–1789 (2023). https://doi.org/10.1038/s41559-023-02206-6

Download citation

Received: 09 June 2023
Accepted: 24 August 2023
Published: 28 September 2023
Issue Date: November 2023
DOI: https://doi.org/10.1038/s41559-023-02206-6

This article is cited by

Grassland vertical height heterogeneity predicts flower and bee diversity: an UAV photogrammetric approach
- Michele Torresani
- Duccio Rocchini
- David Kleijn
Scientific Reports (2024)
High-resolution sensors and deep learning models for tree resource monitoring
- Martin Brandt
- Jerome Chave
- Christian Igel
Nature Reviews Electrical Engineering (2024)
Uncertainty quantification for probabilistic machine learning in earth observation using conformal prediction
- Geethen Singh
- Glenn Moncrieff
- Tamara B. Robinson
Scientific Reports (2024)
Characterizing the structural complexity of the Earth’s forests with spaceborne lidar
- Tiago de Conto
- John Armston
- Ralph Dubayah
Nature Communications (2024)
Global high-resolution total water storage anomalies from self-supervised data assimilation using deep learning algorithms
- Junyang Gou
- Benedikt Soja
Nature Water (2024)

Subjects

Abstract

Similar content being viewed by others

Main

Results and discussion

Deep learning approach

Going global

Comparison to existing canopy height estimates

Evaluation with independent airborne LiDAR

Modelling predictive uncertainty

Global canopy height map

Discussion

Limitations and map quality

Potential applications

Conclusion

Methods

Data

Deep fully convolutional neural network

Model training with sparse supervision

Modelling the predictive uncertainty

Correction for imbalanced height distribution

Evaluation metrics

Global map computation

Energy and carbon emissions footprint

Reporting summary

Data availability

Code availability

Change history

19 February 2024

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Extended data

Extended Data Fig. 6 Protected area analysis according to the World Database on Protected Areas (WDPA)39.

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links