Identifying and Quantifying Pixel-Level Uncertainty among Major Satellite Derived Global Land Cover Products

Accurate global land cover (GLC), as a key input for scientific communities, is important for a wide variety of applications. In order to understand the current suitability and limitation of GLC products, the discrepancy and pixel-level uncertainty in major GLC products in three epochs are assessed in this study by using an integrated uncertainty index (IUI) that combines the thematic uncertainty and local classification accuracy uncertainty. The results show that the overall spatial agreements (Ao values) between GLC products are lower than 58%, and the total areas of forests are very consistent in major GLC products, but significant differences are found in different forest classes. The misclassification among different forest classes and mosaic types can account for about 20% of the total disagreements. The mean IUI almost reaches 0.5, and high uncertainty mostly occurs in transition zones and heterogeneous areas across the world. Further efforts are needed to make in the land cover classifications in areas with high uncertainty. Designing a classification scheme for climate models, with explicit definitions of land cover classes in the threshold of common attributes, is urgently needed. Information of the pixel-level uncertainty in major GLC products not only give important implications for the specific application, but also provide a quite important basis for land cover fusion.


Introduction
Land cover plays an important role in the biogeochemistry of earth system, which influences the carbon, moisture, energy, and momentum exchanges between the land and atmosphere (Findell et al., 2007;Sterling et al., 2013). As an essential variable considered by the Global Climate Observing System (GCOS), land cover is used in science and policy applications, such as the climate change, food security, biodiversity conservation, ecosystem assessment, and hydrological and dynamic vegetation modeling.
A number of global land cover (GLC) products have been produced based on the remote sensing data in response to the need for land cover information and dy-namics in the past decades (Hansen et al., 2000;Bartholomé and Belward, 2005;Friedl et al., 2010;Gong et al., 2013;Tateishi et al., 2014;Chen et al., 2015;Defourny et al., 2016), which have been widely used in various scientific studies. Many applications have benefited from using satellite derived land cover products as an input variable in the model for improving the model performance. However, the quality of these products has a strong impact on the final model output, which has significant effects on these applications (Ge et al., 2007;Sertel et al., 2010;Santos-Alamillos et al., 2015;Madhusoodhanan et al., 2017). Thus, accurate GLC products are vital for global applications, and associated information of the accuracy and bias is required for better employing GLC products in scientific studies.
Although many efforts have been made to provide valuable quality information of GLC products, it is still unable to fulfill requirements of different users in providing a guide for data usage. For instance, the cross validation with reference data provides only overall and specific accuracy estimations for the entire products, which could not represent the spatial variations (Foody, 2002). In terms of the spatial agreement, the inter comparison with the existing land cover products does not provide information on the comparative accuracy of these products. Recently, several studies have estimated the accuracy for different applications (Fritz and See, 2008;Gao and Jia, 2013;Tsendbazar et al., 2016), analyzed the thematic similarity (Ahlqvist, 2005;Neumann et al., 2007;Pérez-Hoyos et al., 2012), and evaluated the spatial variations in classification accuracy (Foody, 2005;See et al., 2015;Tsendbazar et al., 2015;Comber et al., 2017;Yang et al., 2017). The thematic analysis indicates the discrepancies in the definition of land cover class and positional errors among products, which are often ignored when the classification scheme is translated. The spatial variation assessment informs the detailed spatial variability in the classification accuracy, which is crucial for scientific applications. However, no previous studies, to our knowledge, have evaluated the uncertainty of current GLC products with the combination of both the thematic similarity and spatial variation.
In this study, we develop a methodology to identify the spatial distribution of uncertainty among GLC products that combines both the thematic uncertainty and local classification accuracy uncertainty. First, the thematic uncertainty index (TUI) is calculated based on the thematic similarity analysis, which accommodates both the partial definition overlaps between classes and positional errors by the per-pixel comparison. Second, the local classification accuracy uncertainty index (AUI) is calculated by using the reference dataset via a geostatistical model. Finally, the integrated uncertainty index (IUI) is calculated based on the combination of the thematic uncertainty and local classification accuracy uncertainty to quantify the pixel-level uncertainty of GLC products, offering new insights to understand the spatial distribution of disagreement and implication for both the users and producers of GLC products.

GLC products
The major GLC products derived from satellite data for three epochs (2000-2005, 2008-2010, and 2013-2015) are used in this study, including the GLC2000 dataset from the European Commission's Joint Research Center, land cover dataset based on the Moderate Resolution Imaging Spectroradiometer (MODIS) data, GLC map (GlobCover) from European Space Agency (ESA), GLC by National Mapping Organizations (GLCNMO) dataset from the International Steering Committee for Global Mapping, and Climate Change Initiative Land Cover dataset (CCI-LC) from ESA. These products are developed by different national or international initiatives for different scientific purposes, with diverse classification characteristics, which are widely used as a baseline for GLC conditions and dynamics over the past decades. Characteristics of these different products are summarized in Table 1.
We clipped all the GLC products to the overlap extent of the globe, projected, and co-registered them to the same projection and spatial resolution to reduce errors in geo-registration. Since the resolution of the products ranges from 300 m to 1 km, the resampling strategies for the three epochs are not the same, e.g., the products are resampled to a resolution of 1 km for 2000-2005 and 500 m for periods of both 2008-2010 and 2013-2015 by using the nearest neighbor rule.
These GLC products have different classification schemes (Table 1). In order to quantify the uncertainty among these products for climate models, legends associated with each GLC product are translated into a common scheme with 14 land cover classes (Table 2), by us- AUGUST 2020 ing the Land Cover Classification System (LCCS) legend translation protocols developed by Herold et al. (2008).

Reference datasets
Several existing GLC reference datasets are used in this paper, including the consolidated GLC2000 reference dataset (Schultz et al., 2015), GLCNMO2008 training dataset (Tateishi et al., 2011), consolidated GlobCo-ver2005 reference dataset (Bicheron et al., 2008), System for Terrestrial Ecosystem Parameterization (STEP) reference dataset (Friedl et al., 2010), Visible Infrared Imaging Radiometer Suite (VIIRS) Surface Type validation database (Olofsson et al., 2012), GLC Ground Truth (GLCGT) database, Global Forest Resources Assessment (Global FRA) reference (Potapov et al., 2011), Geo-Wiki validations (Fritz et al., 2009), and a common validation sample set for the Finer Resolution Observation and Monitoring (FROM) of GLC project (Zhao et al., 2014) and Global Flux sites, which are generated by different international organizations through enormous interpretation efforts. All these reference datasets are publicly accessible through the Global Observation of Forest Cover and Land Dynamics (GOFC-GOLD) reference data portal, Geo-Wiki, and Internet, and can be freely used by any researcher. It should be noted that time periods for these reference datasets are discrepant, but errors in the processes of land cover changing over   time can be neglected compared to misclassification errors as most of the reference points and scientific sites are located in homogeneous and stable areas. The class of land cover at the centroid of sample sites is considered as the reference land cover class. Legends of all the reference datasets mentioned above are translated into a 14-class common scheme listed in Table 2. Totally, 17,361 reference sites within the globe are adopted in this study. The spatial distributions and corresponding size of each reference dataset as well as reference land cover classes are shown in Fig. 1.

Comparison analysis
In this study, the area and spatial comparisons are applied. The area comparison shows the total area differences of individual land cover class in the common scheme among different GLC products. The spatial comparison is based on the per-pixel comparison among GLC products, and the overall spatial agreement (A o ) as well as class-specific spatial agreement (A s ) values are Geo-Wiki (823) GLC2000 (832) GLCGT (331) GLCNMO (2088) GlobCover2005 (184) Global FRA (975) GlobalCrowdSource (2026) GlobalFluxSites (508) STEP ( Gao, H., G. S. Jia, and Y. Fu calculated for quantifying the differences of spatial patterns for each class by using the following equations. where X i is the number of pixels of class i in GLC product X, and Y i is the number of pixels of class i in GLC product Y. XY ii is the number of pixels of class i in both GLC products X and Y. N is the number of land cover classes (14 in this study), which is used to calculate A o , and M is the total number of pixels in the entire global area.

Thematic uncertainty
The thematic similarity is defined as the affinity between different land cover classes based on a set of common land cover attributes from the LCCS translation in previous studies (Ahlqvist, 2005;Neumann et al., 2007;Fritz and See, 2008;Pérez-Hoyos et al., 2012). It allows us to examine the agreement between different land cover products with different classification schemes. The overlap property is firstly computed for each separate categorical or continuous attribute, and then those values for all the common considered attributes are combined together to yield an overlap metric, ranging from 0 (unrelated classes) to 1 (coincidence of all attributes). The thematic similarity indicates the degree of agreement between two land cover classes, and the thematic uncertainty represents the discrepancies in land cover definitions and positional errors. The thematic uncertainty of GLC products can be obtained through the following steps.
Step 1: Following the method described by Pérez-Hoyos et al. (2012), land cover classes of GLC products are translated into LCCS and a set of land cover attributes are defined, and the overlap property is calculated for each separate common attribute. Then, the overlap property values for all common attributes are aggregating into a single quantity by using the following equations to indicate the thematic similarity for any pair of GLC products.
The overlap metric is a similarity score of the attributes that two land cover classes have in common, ranging from 0 (total disagreement between the two land cover classes) to 1 (the two land cover classes are con-sidered to be identical). O ij is the overlap metric, W k represents the weight of each common attribute (each attribute is assumed to have the same contribution to the overlap metric), and O k (C i , X j ) is the overlap property for each common attribute between classes i and j.
Step 2: TUI is defined as below.
where a ts is the accuracy, which is the difference value between the true and mean thematic similarities, and p ts is the precision, which is the standard deviation (STD) of thematic similarities. The True ts is set as 1. For a given pixel, if all attributes of the two land cover classes are the same, the thematic similarity is 1, otherwise, it is computed for any pair of GLC products, so there are several thematic similarities of pair-combinations for a given pixel. For example, there are totally six pair-combinations for the four GLC products, and thus, six thematic similarities are obtained for a given pixel. Mean ts and STD ts are the mean value and STD of the six thematic similarities for a given pixel in the same epoch, respectively.

Local classification accuracy uncertainty
The accuracy of GLC products is often expressed in terms of the global accuracy, without informing about the spatial variation in the classification accuracy. The local classification accuracy denotes the local probability that a specific GLC product is correct, and it can be modeled with substantial GLC reference datasets to assess the local classification accuracy of global-scale products in previous studies (See et al., 2015;Tsendbazar et al., 2015;Comber et al., 2017). We followed the method elaborated by Tsendbazar et al. (2015) for calculating the local classification accuracy in this work. The local classification accuracy uncertainty of GLC products is calculated through the following two steps.
Step 1: Correspondences between GLC products and reference data are indicator-coded. If the land cover class of a reference site matched with that of a GLC product, an indicator code of 1 is assigned to the reference site, otherwise, an indicator code of 0 is given to that site. Then, the spatial autocorrelation of the indicator-coded data is analyzed by using indicator semivariograms. More detailed information can be found in . Semivariogram examples of the spatial correspondence for GLC in 2008-2010, and the used fitted models for the indicator kriging, are demonstrated in Fig. 2.
Finally, the local classification accuracy is generated by indicator kriging for each GLC product. The value of local classification accuracy ranges from 0 to 1, which denotes the local probability that a specific GLC product is correct. Here, a correct land cover class has a value of 1, while a probably wrong land cover class has a value of less than 1.
Step 2: AUI is defined as follows.
where True la is set as 1, and Mean la and STD la are the mean value and STD of the local classification accuracy among all GLC products in the same epoch, respectively.

Integrated uncertainty of GLC products
In this study, an IUI, which combines both the thematic uncertainty and local classification accuracy uncertainty, is defined as below.
where IUI is the GLC integrated uncertainty index, TUI is the thematic uncertainty index, and AUI is the local classification accuracy uncertainty index. W i and W j de-note the weights of TUI and AUI, respectively. Although in this study identical weights are assigned to TUI and AUI (W i and W j are equal to 0.5), different weights can also be applied to reveal the relative importance of the two uncertainty indexes in terms of land cover characters.

Areal comparisons
The comparison of total percent area for 14 aggregated classes of major GLC products in three epochs is shown in Fig. 3. The analysis excluding Antarctica shows that the grassland and barren or sparsely vegetated are dominant cover types over the globe. Differences among the GLC products are significant. Major discrepancies occur in all the aggregated land cover classes except for the evergreen broadleaf forest, croplands, snow and ice, and inland water. Surprisingly, the urban and built-up with distinct spectral-temporal signals has a relative higher discrepancy among the products. The coefficient of variance (CV) values of the total percent area for each aggregated class in major GLC products in the same epoch (red line in Fig. 3) range from 2.0% to 88.4%, 2.8% to 81.9%, and 8.7% to 77% for the three epochs, respectively. The relative small val- AUGUST 2020 ues are found in the evergreen broadleaf forest (2.0%, 2.8%, and 8.7%), snow and ice (10.8%, 11.9%, and 14.2%), inland water (22.7%, 19.2%, and 27.6%), and croplands (26.7%, 23.8%, and 19.8%) for all the three epochs, representing the high classification agreement in these land cover classes. While CV values for the deciduous needleleaf forest (88.4%, 81.9%, and 77.0%), evergreen needleleaf forest (62.3%, 61.0%, and 59.2%), de- (1 is evergreen needleleaf forest, 2 is evergreen broadleaf forest, 3 is deciduous needleleaf forest, 4 is deciduous broadleaf forest, 5 is mixed forest, 6 is shrublands, 7 is grassland, 8 is croplands, 9 is permanent wetlands, 10 is urban and built-up, 11 is cropland/natural vegetation mosaic, 12 is snow and ice, 13 is barren or sparsely vegetated, and 14 is inland water).

812
Journal of Meteorological Research Interestingly, when five classes of forests in all GLC products are aggregated into forests, the CV values (10.7%, 12.3%, and 14.4%) decrease sharply compared to forests with the plant functional type. This phenomenon reveals that the total percent area of forests in different products are very close to each other, but significant discrepancies occur in different forests, which can be explained by the difference in the definitions of "forest" in different classification schemes, and confusion among different forest types.

Per-pixel comparisons
The A o and 14 A s values for any pair of GLC products conducted by the per-pixel comparisons for the three epochs are shown in Fig. 4. The A o values between GLC products are 35.9%-57.8%, 36.1%-51.6%, and 46.8%-48.8% for the three epochs, respectively. The highest A o of 57.8% is found between the CCI-LC 2001 and GLC2000, which is attributed to the fact that the GLC2000 product is used as an alternative global or regional reference for CCI-LC. The secondary high value of 51.6% is found between the CCI-LC 2010 and Glob-Cover2009, which are both produced by using the LCCS scheme from ESA. The A o values between other GLC products in the three epochs are all lower than 50%, which is consistent with the findings of Yang et al. (2017). The lowest spatial agreements found between the GlobCover2005 and MODIS2001 only account for 35.9% as a result of differences in classification schemes and data collection time.
When the five types of forest classes and cropland/natural vegetation mosaic in all GLC products are aggregated into forests and croplands, A 0 values between GLC products are 50.8%-66.0%, 51.8%-66.1%, and 60.6%-66.9% for the three epochs, which have respectively increased by 8.2%-14.1%, 14.5%-15.7%, and 13.8%-18.1% than the results in a detailed scheme for the three epochs. The misclassification among the forests of different plant functional types and mosaic types can account for approximately 20% of disagreements between the GLC products. Naturally, the spatial agreements will decrease sharply when they are estimated based on schemes that contain detailed classes. The analysis indicates that improving the classification accuracy of different plant functional type forests is the key point for GLC products, and mosaic classes are the main factors causing discrepancies among these GLC products.

Comprehensive analysis between area and per-pixel comparisons
A comprehensive analysis on the results between area and per-pixel comparisons is performed. The results show that the aggregated land cover classes with high consistency in the total area among the major GLC products also have high A o values (Figs. 3, 4), such as the evergreen broadleaf forest, croplands, urban and built-up, snow and ice, barren or sparsely vegetated, and inland water. Similarly, A o of aggregated land cover classes with poor consistency in the total area among different products is found to be poor (Figs. 3, 4), such as the evergreen needleleaf forest, deciduous needleleaf forest, mixed forest, shrublands, permanent wetlands, and cropland/natural vegetation mosaic. No aggregated land cover classes with high consistency in the total area among different products have low A o . In other word, central/core land cover classes were correctly identified in all major GLC products, while land cover classes distributing along edges and transition zones were misclassified. It is non-trivial to discriminate clear boundaries between the major ecologically similar land cover classes.
Furthermore, area consistencies and A o values of specific land cover classes vary greatly among the major AUGUST 2020 GLC products, such as the deciduous needleleaf forest, deciduous broadleaf forest, shrublands, etc. (Figs. 3, 4). For the deciduous needleleaf forest, the total areas among GLC2000, GlobCover, and CCI-LC are relatively consistent, meanwhile the corresponding A o is also higher than that in other products. Similar cases also occur in specific land cover classes, such as the deciduous broadleaf forest, mixed forest, shrublands, permanent wetlands, and barren or sparsely vegetated class. It could be largely explained with the LCCS used in GLC2000, GlobCover, and CCI-LC. In addition, the same agency mainly responsible for generating these three land cover   is evergreen needleleaf forest, A 2 is evergreen broadleaf forest, A 3 is deciduous needleleaf forest, A 4 is deciduous broadleaf forest, A 5 is mixed forest, A 6 is shrublands, A 7 is grassland, A 8 is croplands, A 9 is permanent wetlands, A 10 is urban and builtup, A 11 is cropland/natural vegetation mosaic, A 12 is snow and ice, A 13 is barren or sparsely vegetated, and A 14 is inland).

Journal of Meteorological Research
Volume 34 products is also likely to have contributed to this distinguishing characteristic. It means that different land cover definitions used in classification schemes for the major GLC products are likely to make great contributions to the discrepancy.

Thematic uncertainty analysis
The spatial thematic uncertainty of the major GLC products based on their original schemes for the three epochs is depicted in Fig. 5, with values typically ranging from 0 (low uncertainty) to 1 (high uncertainty). The mean thematic similarity and thematic similarity STD are also shown in Fig. 5 for better understanding the quality of GLC products. Mean thematic uncertainties of GLC products in the three epochs are 0.49, 0.46, and 0.43, respectively. The mean thematic similarity with STD of GLC products in the three epochs are 0.58 ± 0.24, 0.60 ± 0.21, and 0.61 ± 0.16, respectively. In terms of the thematic uncertainty, mean thematic similarity, and their STD in classification schemes, their spatial patterns in the three epochs are very similar over the entire globe. Generally, areas with low thematic uncertainty also have high mean thematic similarity and low STD of thematic similarity. Low thematic uncertainties are found in Greenland, Sahara Desert, Arabian Peninsula, Taklimakan Desert, tropical rainforest regions (the Amazon, Congo River, and Southeast Asia), and main arable land (North China Plain, India, southern Brazil, Southeast Argentina, and northern central United States), whereas the Arctic Circle, central Asia, Australia, southern Africa, Sahel, western North America, and eastern South America generally present high thematic uncertainties.
Different GLC products are produced with different classification schemes. As presented in Table 1, the LCCS scheme has more classes than IGBP. Definitions among the classification schemes do not match well. The definition of common attributes varies significantly, which are recognized as the main cause of the thematic uncertainty, leading to confusions in the spatial distribution and therefore to inaccurate classifications. Many inconsistencies are evident among the thresholds for common attributes. For example, the definition of the density cover attribute in forest classes varies markedly in different classification schemes. The tree height attribute also has different definitions depending on the classification schemes for different GLC products. Moreover, the leaf type and leaf phenology attributes among the forest types are also very important factors for the thematic uncertainty.
Another example is the density cover attribute in barren and sparsely vegetated areas. Thresholds for the density of cover separating the barren and sparsely veget-  ated areas (i.e., < 10% for MODIS, 1%-20% for GLC2000 and GLCNMO, < 15% for GlobCover and CCI-LC, and 0-4% for the barren) vary among different products, thus resulting in significant disagreements. Therefore, this is one reason why high thematic uncertainty is generally observed in transition zones of the major ecosystems across the world. The sparse vegetation, bare area, open shrubland, and grassland are always confused with each other among different land cover products. One important reason is the overlap among their density cover attributes. Figure 6 shows the local classification accuracy uncertainty in the existing GLC products during the three epochs. Local classification accuracy uncertainties among different GLC products are 0.47, 0.44, and 0.39 in the three epochs, respectively. The mean local classification accuracy with STD of GLC products in the three epochs are 0.57 ± 0.17, 0.60 ± 0.16, and 0.64 ± 0.13, respectively. Patterns of the local classification accuracy uncertainty, mean local classification accuracy, and their STD in the three epochs are generally similar to those over the entire globe, and areas with low local classifica-tion accuracy uncertainty also have high local classification accuracy and low STD of the local classification accuracy. As shown in Fig. 6, areas with high uncertainty, analyzed based on the aggregated scheme, are found in the central Asia, Australia, northern Canada, northern Russia, eastern and southern Africa, Sahel, eastern and southern parts of South America, and southern part of China. On the other hand, low uncertainty occurs in Greenland, Sahara Desert, Arabian Peninsula, Taklimakan Desert, tropical rainforest regions, and main arable land. It is very similar to the thematic uncertainty pattern. For example, the major land cover of reference data is the shrubland in eastern Africa, shrubland, and barren or sparsely vegetated class in southern Africa during 2000-2005. While the major land cover for these areas is the shrubland or cropland in CCI-LC, grassland in MODIS, grassland, barren or sparsely vegetated class in GLC2000, shrubland or cropland in GLCNMO, and grassland, barren or sparsely vegetated class in GlobCover. For other regions in northern Canada, the major land cover class is barren or sparsely vegetated, but it is classified as the grassland in CCI-LC, shrubland in MODIS, barren or sparsely vegetated in GLC2000, and grassland or shrubland in GLCNMO. Additionally, reference data show the shrubland or cropland/natural vegetation mos-  In general, confusions among the shrubland, grassland, and cropland/natural vegetation mosaic are always serious, and their local accuracies are relatively low due to similar temporal and spectral features. Hence, those areas should be paid more attention for GLC producers to generate high-quality products.

Integrated uncertainty analysis
The integrated uncertainty among GLC products to quantify the total classification uncertainty, which combines both the thematic uncertainty and local classification accuracy uncertainty, is shown in Fig. 7. Mean values of the spatial integrated uncertainty among GLC products are 0.48, 0.45, and 0.41 in the three epochs, respectively. The spatial pattern of the integrated uncertainty among GLC products is very similar to that of the thematic uncertainty and local classification accuracy uncertainty. High uncertainty occurs in northern Russia, northern part of North America, western and central America, eastern Brazil and Argentina, Australia, central Asia, southern part of China, eastern and southern parts of Africa, and Sahel. While low uncertainty is observed in Greenland, Sahara Desert, Arabian Peninsula, Taklimakan Desert, tropical rainforest regions, and the main arable land over the world in the three epochs. Obviously, transition zones of the major ecosystems across the world and areas with high landscape complexity, e.g., mountainous areas or hilly regions, have relatively high integrated uncertainty. It is not surprising that the transition zones and landscape heterogeneity have been regarded as major factors influencing the GLC accuracy. AUGUST 2020 On the one hand, similar temporal and spectral features of land cover classes in transition zones largely contribute to disagreements in this area. On the other hand, due to the complicated spectral characteristics, it is difficult for classification on the land cover in areas with high landscape heterogeneity.
Moreover, the mean thematic similarity and local classification accuracy are used to identify specific areas for validating the integrated uncertainty. Pixels can be considered as the uncertain area, if a pixel shows high thematic similarity among GLC products but low local classification accuracy. STD of the mean thematic similarity and mean local classification accuracy are selected as the threshold for identifying areas with high thematic similarity but low local classification accuracy (Fig. 7). It can be seen that the identified areas with high thematic similarity but low local classification accuracy are all captured in the integrated uncertainty among GLC products. These areas should be paid more attention for both GLC product producers and users in the future.
The integrated uncertainty contains the thematic uncertainty, which presents discrepancies in land cover definitions and positional errors. Despite of the high local classification accuracy uncertainty in some areas, the wrong classification is not vital, which is just a confusion between similar land cover classes. However, compared with the local classification accuracy uncertainty, the integrated uncertainty would decrease by combining the thematic uncertainty together. For example, the main land cover is found to be the deciduous needleleaf forest with evergreen needleleaf forest in northern Russia over 2008-2010, but the area is classified as the mixed forest in MODIS, deciduous needleleaf forest in GlobCover, mixed forest or deciduous needleleaf forest in GLCNMO, and evergreen needleleaf forest or mixed forest in CCI-LC, respectively. In terms of the classification accuracy, it is wrong, but for the thematic uncertainty, these land cover classes have some similarity. The integrated uncertainty has well captured these characteristics among the GLC products. In fact, these types of the uncertainty can be found in many areas in the integrated uncertainty figure. Therefore, the integrated uncertainty in this study provides more implications for understanding the uncertainty among the existing GLC products.

Conclusions and discussion
Study on the pixel-level uncertainty is essential for understanding the suitability and limitation of GLC products, as well as the analysis on the propagation of this kind of uncertainty in various models, particularly for areas and classes with high uncertainties. In this study, both area and per-pixel comparisons are conducted among the existing GLC products for identifying the bias of land cover products. This paper has also outlined a method that provides the pixel-level uncertainty combining both the thematic uncertainty (indicating discrepancies in land cover definitions and positional errors) and local classification accuracy uncertainty (depicting the spatial variability of classification accuracy). Based on the major existing GLC products, this methodology is performed in three epochs.
Comparisons show that, for all the three epochs, the total percent area of each land cover class among different GLC products varies significantly. Major discrepancies are found in the deciduous needleleaf forest, evergreen needleleaf forest, deciduous broadleaf forest, and cropland/natural vegetation mosaic, whereas the evergreen broadleaf forest, croplands, snow and ice, and inland water are with high consistency. This may be the result of the good spectral discrimination of the evergreen broadleaf forest, snow and ice, inland water, great pixel homogeneity, and relative stable distribution of croplands. Total areas of the forest are very consistent for the existing GLC products, but significant differences are observed in different forest classes. A o values between GLC products are 35.9%-57.8%, 36.1%-51.6%, and 46.8%-48.8% for the three epochs, respectively. The highest A o of 57.8% is found between CCI-LC 2001 and GLC2000, followed by that between CCI-LC 2010 and GlobCover2009 (51.9%), and the lowest value (35.9%) occurs between GlobCover2005 and MODIS2001. Meanwhile, A o values between the other GLC products are all lower than 50%. A s exhibits significantly different characteristics. Classes with high spatial agreement are the snow and ice, evergreen needleleaf forest, barren or sparsely vegetated, inland water, and cropland, while the cropland/natural vegetation mosaic, deciduous needleleaf forest, and permanent wetlands have extremely low spatial agreements. This may be partly associated to the mixed class pixels with the high intra-class variability and fragmented land cover, which has a negative influence on the classification accuracy of land cover. Mean spatial agreements of other land cover classes are lower than 50%. When the five classes of forests are aggregated into forests, A o values of the products increase by 8.2%-18.1% for the three epochs. The misclassification among different forest classes and mosaic types can account for about 20% of the total disagreement. A key point of improving the quality of GLC products is to en-hance the accuracy of forest types and mosaic classes.
The spatial uncertainty analysis has further shown that the mean thematic uncertainty, local classification accuracy uncertainty, and integrated uncertainty all nearly reach 0.5 for the three epochs. The results show that different definitions of common attributes among classification schemes with GLC products are recognized as the main factor of the thematic uncertainty, leading to a confusion and inaccurate classification. The local classification accuracy uncertainty demonstrates where the relatively high confidence on classification accuracy exists, and to what degree it is. Generally, areas with the ecologically similar land cover classes tend to have low values of local accuracy due to the low separability of spectral-temporal signal features. Most of the high uncertainty occurs in transition zones of major ecosystems across the world and areas with high complexity in the landscape. Compared with the previous studies, the integrated uncertainty in this work, which contains both the thematic uncertainty and local classification accuracy uncertainty, provides more implications for understanding the uncertainty among the existing GLC products. The distribution of the integrated uncertainty is very similar to that of Yu et al. (2018). The spatial distributions of the difficult to map regions (DMRs) identified by Yu et al. (2018) are the most consistent with high slope variance areas attached with shrubland-related or grassland-related land cover classes or biomes. Therefore, DMRs or areas with high integrated uncertainty in our study should be taken as focus areas to improve the accuracy of GLC products.
Significantly, the difference in classification schemes is one of the major factors driving disagreements among the GLC products (Herold et al., 2006(Herold et al., , 2008. Scientists have highlighted the harmonization and adoption of common classification schemes. In our study, a common classification scheme with 14 aggregated land cover classes are used for area and per-pixel comparisons. Classes of the evergreen broadleaf forest, cropland, urban and built-up, snow and ice, and inland water have the relative high consistency of total areas and A o values. Interestingly, it is found that these aggregated land cover classes have explicit definitions in different classification systems. For instance, the evergreen broadleaf forest, urban and built-up, snow and ice, and inland water with high A o values are the only class in the classification system of GLC products. The classification system cross-working process for the GLC product comparison may induce some uncertainty. It is shown that designing a common classification scheme for different application goals, which gives explicit definitions of land cover classes in the thresholds of common attributes, is still necessary and exigent. Moreover, according to the previous studies, enhancing the ability to discriminate the ecologically and spectrally similar land cover classes and improving the classification accuracy for transition zones and heterogeneous landscapes are key approaches in a short term. Recently, the emerging novel data such as hyperspectral data and crucial attribute data, and methods such as artificial intelligence as well as deep learning, have provided a new opportunity for improving the classification accuracy of land cover. There are also some limitations with our method. Firstly, it is considered that the thematic uncertainty and local classification accuracy uncertainty are of equal importance for GLC products in this study. However, the integrated uncertainty information has more implications for specific applications. Different weights can be applied to evaluate the relative importance of the two properties, regarding to the suitability of this method for specific applications. For example, if the local classification accuracy uncertainty is regarded as the extremely crucial factor affecting the results in an application, and the harmonization could solve the difference between land cover classes, the influence of thematic uncertainty can be ignored. Then the weight for the thematic uncertainty can be assigned with a low value, while a relatively high value is assigned to the local classification accuracy uncertainty. Secondly, the thematic uncertainty is implemented based on different classification schemes with common attributes from the viewpoint of product producers. The result is universal for common applications, but is not fit for specific applications, such as climate modeling. In order to obtain the optimal uncertainty sources for specific applications, the thematic uncertainty could be calculated based on special attributes that affect the application outcomes, as done in the previous studies (Fritz and See, 2008;Gao and Jia, 2013;Tsendbazar et al., 2016). In addition, the local classification accuracy uncertainty is achieved with both the GLC products and reference data based on the aggregated scheme. However, the classification harmonization process of land cover products and reference datasets can induce new uncertainties. Using enough credible reference data for the local classification accuracy uncertainty analysis with the classification scheme of products can provide more accurate results. The integrated uncertainty could not only enhance the understanding of the quality of GLC products, but also provide a very important basis for the land cover fusion. Thus, our next step is to generate the integrated GLC products with high quality based on information of the integrated uncertainty.