Evaluation of Daily Precipitation Product in China from the CMA Global Atmospheric Interim Reanalysis

The China Meteorological Administration (CMA) recently produced a CMA Global Atmospheric Interim Reanalysis (CRAI) dataset for the years 2007–2016. A comprehensive evaluation of the ability of CRAI to capture the spatiotemporal variability of observed precipitation, in terms of both mean states and extreme indicators over China, is performed. Comparisons are made with other current reanalysis datasets, namely, the ECMWF interim reanalysis (ERAI), Japanese 55-yr reanalysis (JRA55), NCEP Climate Forecast System Reanalysis (CFSR), and NASA Modern-Era Retrospective analysis for Research and Applications version 2 (MERRA2), as well as NCEP Climate Prediction Center (CPC) observations. The results show that, for daily variations of rainfall during warm seasons in eastern China, CRAI and CFSR overestimate the precipitation of the main rain belt, while the overestimation is confined to the area south of 25°N in JRA55 but north of 24°N in MERRA2; whereas ERAI tends to underestimate the precipitation in most regions of eastern China. Two extreme metrics, the total amount of precipitation on days where daily precipitation exceeds the 95th percentile (R95pTOT) and the number of consecutive dry days (CDDs) in one month, are examined to assess the performance of reanalysis datasets. In terms of extreme events, CRAI, ERAI, and JRA55 tend to underestimate the R95pTOT in most of eastern China, whereas more frequent extreme rainfall can be found in most regions of China in both CFSR and MERRA2; and all of the reanalyses underestimate the CDDs. Among the reanalysis products, CRAI and JRA55 show better agreement with the observed R95pTOT than the other datasets, with fewer biases, higher correlation coefficients, and much more similar linear trend patterns, while ERAI stands out in better capturing the amount and temporal variations of the observed CDDs.


Introduction
China is the most populated nation (Piao et al., 2010) and one of the fastest-growing economies in the world (Hubacek et al., 2007). It is characterized by complex topography and heterogeneous climate (Gao et al., 2008). Since China is experiencing rapid industrialization, urbanization, growing agricultural demand, and environmental degradation, a variety of problems have challenged the management and utilization of China's water resources (Varis and Vakkilainen, 2001). Therefore, reliable, long-term, and relatively high-resolution precipitation datasets are essential for natural process modeling, hydrometeorological analysis and forecasting, and monitoring of climatic variations and changes (Kucera et al., 2013;Kirschbaum et al., 2017).
A rain gauge is a mechanical and simple ground-based measurement tool for rainfall, and provides highly accurate precipitation datasets for various climatological and hydrological applications (Kidd, 2001). However, the distribution of rain gauges is uneven across the country. The ground-based measurement networks in China are mainly distributed in southeastern and central China, while the spatial distribution of stations in other regions are relatively sparse. Additionally, observations are prone to severe underestimation of precipitation, which is amplified in cases of solid precipitation and over mountainous areas (Rasmussen et al., 2012;Isotta et al., 2015). With advanced infrared and microwave instruments, satellite observations make up for these deficiencies by providing coverage that is more temporally complete and spatially homogeneous for vast areas of the globe (Kidd and Levizzani, 2011). However, satellite-related datasets have limitations in terms of their short history and their retrieval approaches, their relative insensitivity to light rainfall events, and their tendency to fail over snow-and ice-covered surfaces, making them susceptible to systematic biases (Ferraro, 1997;Dai et al., 2007;Ebert et al., 2007;Kidd and Levizzani, 2011).
Precipitation estimates from atmospheric reanalysis data with good spatial and temporal continuity provide a potential alternative in regions where conventional in situ precipitation measurements are not readily available. However, because reanalysis data contain uncertainties resulting from the forecast model, data assimilation, and data sources used, it is fundmental to evaluate the quality of reanalysis products in representing weather and climate variations (Trenberth and Guillemot, 1998;Lin et al., 2014). In China, many studies have assessed the performance of reanalysis data in reproducing the diurnal cycle, interannual variation, climatology, and long-term trend of observed precipitation (e.g., Dai et al., 2011;Wang and Zeng, 2012;Chen et al., 2014;Lin et al., 2014). For example, through a preliminary comparison with observational data, Zhao and Fu (2006) found that ECMWF 40-yr reanalysis (ERA-40) and NCEP/NCAR reanalysis 2 (NCEP-2) were able to reflect the temporal and spatial distribution of precipitation but showed regional variation. Ma et al. (2009) evaluated precipitation from ERA-40, NCEP-1, NCEP-2, Climate Prediction Center (CPC) Merged Analysis of Precipitation version 1 (CMAP-1), CMAP-2, and Global Precipitation Climatology Project version 2 (GPCP-2) with ground-based measurements in China and concluded that CMAP-1 and GPCP-2 generally had better correspondence with adjusted observational precipitation. Chen et al. (2014) found that four reanalyses [Japanese 55-yr reanalysis (JRA55), ECMWF interim reanalysis (ERAI), NCEP Climate Forecast System Reanalysis (CFSR), and NASA Modern-Era Retrospective analysis for Research and Applications (MERRA)] reproduced well the rainfall diurnal cycle over East Asia in terms of the contrast over largescale terrain, the evolution during summer, and its interannual variability.
Additionally, some comparisons of extreme precipitation according to reanalysis data have been carried out in previous studies from a global perspective and for several regions, including China. For instance, Sillmann et al. (2013) highlighted the large spread in absolute values of precipitation extremes between different reanalysis products, comparable to the spread between different climate models from CMIP5. Donat et al. (2014) found that the extreme precipitation patterns and time series from reanalyses showed lower agreement with observations than for extreme temperatures, but generally still correlated significantly. However, some spatial variations have not been considered in China, and further analyses are needed, especially in assessing the performance of the China Meteorological Administration (CMA) Global Atmospheric Interim Reanalysis (CRAI) against that of previous reanalyses.
Recently, the CMA released its first reanalysis, called the CMA 40-yr Global Reanalysis (abbreviated to CRA-40). It was designed to provide global land surface information from as early as 1979 that includes ground temperature, soil moisture, precipitation, etc. Subsequently, a 10-yr interim product (i.e., CRAI), with a horizontal resolution of approximately 34 km and temporal resolution of 6 h, has been produced. The present study comprehensively assesses the ability of CRAI to capture the observed mean state and spatiotemporal variability of precipitation, as well as extreme precipitation indicators, over China. More specifically, comparisons with observations are provided, along with an examination of how well different reanalyses agree with each other, and a determination of whether there are significant regional or seasonal variations in the discrepancies between the models.
Following this introduction, Section 2 describes the observational and reanalysis datasets employed in the study. Comparisons of the characteristics of precipitation from 2007 to 2016, including the climatology of daily precipitation and extreme indicators as well as the related atmospheric circulation features, are presented in Section 3. Finally, Section 4 summarizes our findings and draws the conclusions.

Datasets and methods
This study utilizes the daily precipitation from the following reanalysis datasets: CRAI, ERAI (Dee et al., 2011), JRA55 (Kobayashi et al., 2015), CFSR (Saha et al., 2010), and MERRA2 (Reichle et al., 2017) (see Table S1 in the online supplementary material for further details on these reanalysis datasets). The real-time precipitation dataset derived from the NCEP's CPC (Xie et al., 2010(Xie et al., ), covering 2007(Xie et al., -2016, is also used, as an observational reference for the evaluation. This product is a gauge-based analysis of daily precipitation constructed over the global land areas, at a 0.5° × 0.5° spatial resolution. It is used as a baseline for evaluation in this study because it combines all ground-based information sources. There are also other observational products available for regions of interest, such as Global Precipitation Climatology Center (GPCC) gauge data (Rudolf et al., 2010) and the East Asia daily analysis data (Xie et al., 2007); however, these products do not cover the entirety of 2007-2016.
Two indicators are used in this study owing to their usefulness in representing dry or wet conditions (Alexander et al., 2006;Moberg et al., 2006;Zhang et al., 2011). The indicator for wet conditions is R95pTOT, which denotes the monthly amount of precipitation when daily precipitation is greater than the 95th percentile of daily precipitation (R95p); while the index for dry conditions is CDD, which is the maximum number of consecutive dry days (CDDs) when daily rainfall amounts are less than 1 mm (in units of days per month). Owing to the lack of observational data on vertical wind speed and specific humidity, we only analyze the water vapor flux from the five reanalyses to give a general explanation for the difference in precipitation between CRAI and the other four reanalyses. The water vapor flux is calculated as an integral over the atmospheric column for the eastward and northward components retrieved from 20 pressure levels between 300 and 1000 hPa (Trenberth, 1991;Zhou, 2003). A description of the calculation of the moisture flux and its divergence (Chen, 1985) is given in the online supplemental material.
All the precipitation datasets have been converted from subdaily to daily timescales (mm day −1 ). Both the gauge-based and reanalysis precipitation products have been interpolated to common grid cells with a horizontal resolution of 0.5° × 0.5° for comparison. It should be noted that, given the difference in horizontal resolution between the different grids, some information might be lost in the re-gridding. To assess the performances of the reanalyses, the bias, relative bias, root-mean-square error (RMSE), Pearson correlation coefficient, and empirical orthogonal function (EOF) analysis are used as statistical metrics in this study (Chen et al., 2013;Zhao and Yatagai, 2014;Guo et al. 2016). The details for calculating these statistics are provided in Table 1.

Spatial distribution
First, we briefly compare the climatological precipitation characteristics of China among the reanalyses. More detailed comparisons, as well as validations of the mean precipitation of China, can be found in Su et al. (1999) and Ma et al. (2009). Overall, the precipitation distribution in China is characterized by a northwest-to-southeast increase in the annual and half-year mean precipitation (Ding and Chan, 2005;Gao et al., 2006). All of the reanalysis products capture this spatial pattern (figure omitted), and the pattern correlations are approximately 0.9 (Table 2). Figure 1 displays the 10-yr mean differences of daily precipitation between each of the reanalysis products and the CPC precipitation for the annual, warm half-year (April-September), and cold half-year (October-March) periods from 2007 to 2016. The figure illustrates that the bias is much greater in warm seasons than in cold seasons, and is spatially greater in southern areas than in northern parts, especially in the northeast. Given the association with the East Asian monsoon (Zhou et al., 2010), the warm seasons and southern regions correspond to high precipitation amounts (Shen et al., 2010), resulting in large differences (Luo et al., 2013;Sun et al., Range [−∞, +∞], best value = 0 Pearson correlation coefficient (R) Range [−1, 1], best value = 1 Root-mean-square error (RMSE) Li, C. X., T. B. Zhao, C. X. Shi, et al.      (Table 2), the precipitation amounts from ERAI have a lower bias (0.09-0.18 mm day −1 ) but correlate slightly less with spatial patterns against the observational data (R = 0.82-0.85) than do those of the other reanalyses (R = 0.82-0.91). The bias in MERRA2 exceeds 1.54 and 0.45 mm day −1 for the national average in the warm and cold seasons, respectively. The larger discrepancies in MERRA2 are partly driven by the large seasonal bias across the southern regions. Figure 2 illustrates the temporal correlation coefficients (R) of the 10-yr mean daily precipitation between the CPC observational data and the reanalyses for warm half-year and cold half-year periods. This figure shows that performances of the five reanalysis datasets at representing temporal variations of daily precipitation are better in the eastern half of the country, especially the northeastern portions, than in the western half of the country, and are better for the cold half-year than for the warm half-year. Elsewhere, correlation coefficients are mostly lower than 0.15 over Northwest Tibetan Plateau (TP), where precipitation amounts are already small. This might be due to large uncertainties in both the gaugebased analysis and the reanalyses. Among the five reanalysis products, JRA55 also stands out in capturing the temporal variations in precipitation in both the warm and cold seasons, with a national average R of 0.69 and 0.75, respectively.

Temporal variation
In this section, we analyze the fields of climatological daily precipitation by calculating the time series of the 10-yr (2007-2016) mean daily precipitation for the 366 calendar days for all grids. We first compare the national average time series of observations with the five reanalyses based on statistics constructed from 5-day running means using daily estimates (Fig. 3)  lyses, both the spatial correlation and relative bias are smaller for the warm season than for the cold season. In contrast, it can be seen that the RMSE has higher values during the warm season. Since the relative bias (RMSE) will often have lower (higher) values when precipitation is higher, it is important to state that these results cannot be attributed to temporal differences, but do provide a diagnosis regarding performances of the products. CRAI, ERAI, and JRA55 closely align with the observations in China for all three statistics. The RMSE values are quite similar for each of them (Fig. 3c), while ERAI has better bias characteristics (Fig. 3b) and JRA55 has consistently higher correlations throughout most of the 12-month comparison period (Fig. 3a).
The daily precipitation rate is classified into four grades according to the criteria defined by the CMA (Committee for the Verification of Terms in Atmospheric Sciences, 2009): light (1.0-9.9 mm day −1 ), moderate (10.0-24.9 mm day −1 ), heavy (25.0-49.9 mm day −1 ), and extreme (≥ 50.0 mm day −1 ) precipitation. Here, we examine the frequency of daily precipitation occurrence to understand how well the reanalysis products match the observed daily precipitation. The distribution of daily precipitation rates among the observational data and reanalyses from all of the grid points within China from 2007 to 2016 is depicted as a histogram in Fig. 4. For precipitation intensities between 1.0 and 18 mm day −1 , all of the reanalysis products have the higher frequency of rainfall occurrence compared to the observational data. For precipitation rates higher than 25 mm day −1 , however, three of the five reanalysis products (CRAI, ERAI, and JRA55) detect the lower frequency of rainfall occurrence compared to the observational data, indicating a suppression of heavy and extreme precipitation. Additionally, it is found that the overestimation from CRAI, ERAI, and JRA55 ( Fig. 1) is mainly caused by the overestimation of the light and moderate grades. Meanwhile, both CFSR and MERRA2 tend to overestimate the observed precipitation in all categories from light to extreme ranges. Note that CRAI most closely matches the observed distribution of precipitation rates for the 8-25 mm day −1 range, suggesting that the best representation of moderate precipitation is found in CRAI.
The evolution of the warm-season precipitation belt from the south to the north in eastern China is also investigated. Figure 5 shows the time-latitude cross-sec-       pecially in the southern portions of South China during the entire period and in the lower reaches of the Yangtze River before late July.

EOF analysis of warm half-year precipitation
To assess the seasonal and intraseasonal variability of precipitation as represented in the reanalyses and observational data, we apply an EOF analysis. Using the results from the EOF analysis of the 10-yr mean seasonal cycle from 1 April to 30 September (Figs. 6 and 7), the spatial characteristics, including the shape, orientation and location of the rainfall area, of daily rainfall in China are investigated. The first two leading EOF modes of the five reanalyses and the observational data account for 41.9%, 32.9%, 35.6%, 26.9%, 27.4%, and 30.0% of the total variance, respectively.
It can be seen that positive anomalies are dominant in the first EOF mode (EOF1), particularly in the northeast and southwest of China, whereas negative anomalies are  Fig. 7. As in Fig. 6, but for the second EOF mode (EOF2).

Journal of Meteorological Research
Volume 34 confined to South China (Fig. 6d). Associated with temporal coefficients (Fig. 6e), this mode corresponds to above-normal rainfall in South China before early-June and after August, which refers to the onset of the preflood and post-flood seasons of South China, respectively. Overall, the agreement is good between the reanalyses and observational data (pattern correlation coefficients: 0.49-0.81; PCs correlated at 0.95-0.99), but regional differences exist. For instance, compared to the observational data, both CRAI and CFSR depict an opposite sign of the seasonal precipitation variability in South China, showing in-phase changes across the whole of China. The second EOF mode (EOF2) features a "positivenegative-positive" meridional pattern in eastern China, but displays a dipole pattern in western China (Fig. 7d). Associated with the temporal coefficient curve (Fig. 7e), we can see that the main abundant rainfall areas are located in southern, northeastern, and northwestern China, whereas the deficient rainfall areas are located in southwestern and northern China before late June. Hereafter, an opposite sign is found, depicting a north-south migration of the precipitation largely modulated by the monsoon circulations. The corresponding modes from all of the reanalyses capture these features reasonably well (pattern correlation coefficients: 0.49-0.81; PCs correlated at 0.87-0.96), but slightly underestimate the variability in southwestern and northeastern China (Figs. 7a-c). Moreover, there is a substantial overestimation of the seasonal variability in the middle and lower basins of the Yellow River in CRAI and CFSR (Figs. 7b, e). Overall, among the reanalyses, JRA55 outperforms the other four reanalysis products in capturing the structure and variability of precipitation in warm seasons in eastern China.

Comparison of extreme events
In this section, we evaluate the performance of reanalyses in capturing the behavior of extreme precipitation FEBRUARY 2020 events in China. Figure 8 shows the R95p of daily precipitation from 2007 to 2016. Here, R95p is the 95th percentile of daily precipitation on wet days (days with daily precipitation ≥ 1 mm) and is used for defining extreme precipitation amounts. The maximum values of annual R95p for observations exceeding 26 mm day −1 are located along the southeast coast and over the lower reaches of the Yangtze River (Fig. 8a). Meanwhile, these values range between 18 and 26 mm day −1 in most parts of eastern China. Moreover, they decrease to 8-14 mm day −1 in most parts of central and eastern Inner Mongolia, the east of Northwest China, and Tibet, and become less than 6 mm day −1 in the west of Northwest China (Zhai et al., 2005). In general, the reanalyses capture a spatial distribution of R95p similar to that of the observational data, with values decreasing from south to north and east to west (figure omitted). However, CRAI, ERAI, and JRA55 tend to noticeably underestimate the high-value percentile indices (R95p ≥ 18 mm day −1 ) for humid re-gions in most parts of eastern China (Figs. 8b-d), where the annual precipitation amount (P) is more than 800 mm (Chen and Sun, 2015). By contrast, an overestimation of R95p is found in the southwestern China in ERAI. Moreover, overestimation extends from northwestern to southeastern China in CFSR (Fig. 8e) and throughout the entire country in MERRA2 (Fig. 8f). The mean differences (bias), temporal correlations, and linear trends of R95pTOT are evaluated in Figs. 9-11, respectively. Since the daily precipitation in parts of northwestern China is below the threshold (R95p), blank spaces occur on the monthly correlation and trend analysis map. R95pTOT, derived from the observational data, shows a similar distribution to R95p (Fig. 8a), with more R95pTOT in Southeast China and less in Northwest China (Fig. 9a). The R95pTOT is underestimated by CRAI, ERAI, and JRA55 in southeastern China, indicating fewer instances of extreme precipitation there. However, the R95pTOT is generally overestimated by 128

Journal of Meteorological Research
Volume 34 CFSR, with the exception of Northeast China (Fig. 9e). For MERRA2, consistent overestimation is found in China, and the maximum positive biases are located in southern China (Fig. 9f), which is similar to that of CFSR. That is, more frequent heavy rainfall can be found in most regions of China in both CFSR and MERRA2. These findings confirm the results shown in the histogram (Fig. 4), i.e., extreme precipitation is underestimated by CRAI, ERAI, and JRA55 but overestimated by CFSR and MERRA2. With respect to the correlation coefficients between the reanalyses and the observational data for R95pTOT, high correlations are also observed in the southern and eastern regions of the country, where rain gauge networks are much denser and where extreme heavy rainfall events occur more frequently. In contrast, except for several small regions (e.g., in the northwestern corner of the country), the temporal variations of R95pTOT are poorly reproduced in western China. The corresponding correlation coefficients are close to zero or even negative (Fig. 10). In terms of linear trends, the observational data show that the monthly R95pTOT increases in Southeast and Northeast China and along the southern edge of the TP. All of the reanalyses can reproduce some features of R95pTOT change, with the pattern correlation coefficient of the trend ranging from 0.16 to 0.46. Nevertheless, the drying trend in the southwest from ERAI (Fig. 11c) is opposite to that of the observational data (Fig. 11a), while the wetting trend along the southern edge of the TP and Southeast China from both CFSR (Fig. 11e) and MERRA2 (Fig. 11f) is much more obvious than that seen in the observational data. 18 are significant at the 95% confidence level with a sample number of 120. White shading indicates missing data.

FEBRUARY 2020
The observed minimum CDD is below 8 days, occurring mainly in the Sichuan basin and increasing both southward and northward, while high CDD values can be seen in southern Xinjiang and northern TP, approaching 26 days per month ( Fig. 12a; Duan et al., 2017). This distribution can be reproduced by all of the reanalyses (figure omitted); however, CDD is generally underestimated, especially for both the northern and southern edges of the TP (Figs. 12b-f). In the reanalysis products, compared to the observational data, the indication is that dry spells are shorter in most regions of China, with a nationwide average bias of −3.29 to −1.28 days (Table 3). Meanwhile, the reanalyses show a better similarity to the observational data, with higher correlation coefficients in the eastern portions of the country but lower correlation coefficients in TP regions (Fig. 13). During the past 10 years, a significant decrease in observed CDDs is apparent in Southeast, North, and Northeast China, as well as along the western edge of the TP and in northern Xinjiang, with small positive values in the rest of the regions (Fig. 14a). CRAI, JRA55, and MERRA2 reproduce the wetting trend, with pattern correlation coefficients of 0.22-0.32 (Figs. 14b, d, f). However, a significant increase in CDD occurs over the TP in CFSR (Fig. 14e) and extends to northeastern and central China in ERAI (Fig. 14c).
Overall, in terms of R95pTOT, CRAI, ERAI, and JRA55 (CFSR and MERRA2) exhibit a smaller (larger) amount of extreme precipitation with relatively strong correlations with observational data in the southern and eastern portions of China, where the landfall of typhoons and the seasonal migration of monsoons (Meiyu) introduce abundant rainfall. Moreover, CRAI and JRA55 Although an opposite sign in the nationwide average trend is found in ERAI [0.09 versus −0.08 days (10 yr) −1 ], it also performs better in depicting the spatial distribution of observed trends, with the pattern correlation coefficient of 0.39 (Fig. 14c).

Comparison of moisture flux
A possible explanation for the above mentioned biases might be related to the prevailing circulation. The  Fig. 9, but for the monthly CDD (days).

Summary and discussion
In this study, we evaluate the capability of CRAI to capture the observed mean and spatiotemporal variability of precipitation, as well as extreme precipitation features, in China. The intercomparisons of reanalysis pre- cipitation between CRAI, ERAI, JRA55, CFSR, and MERRA2, as well as the comparisons against observations, are performed for a 10-yr period from 2007 to 2016. The results show that the spatial characteristics, including the shape, orientation, and location of the precipitation area, as well as the seasonal and intraseasonal variations of precipitation, are generally reproduced by the five reanalyses. Furthermore, the performances of the reanalysis products vary for different regions and different precipitation regimes, with better performance in wet regions and for cold seasons. Among the reanalysis products, ERAI provides the best data on the magnitude of the 10-yr mean precipitation, while JRA55 exhibits the temporal variations and spatial patterns of precipitation closest to those of the observation data. Overall, the biases of the seasonal precipitation between the CRAI and other four reanalyses could be explained by the largescale circulation and moisture fields.
For daily variations, all the reanalyses perform reasonably well in depicting the exact timing and location of rainfall bands, as well as the seasonal migration of precipitation bands, during the warm seasons in eastern China. However, both CFSR and CRAI exhibit stronger precipitation in the rain belt during the whole period, whereas the overestimation is confined to the south of 25°N in JRA55 but to the north of 24°N in MERRA2. For ERAI, an underestimation is found in most regions of eastern China, especially in the southern portions of South China during the whole period and in the lower reaches of the Yangtze River before late-July. An EOF analysis shows that the reanalysis products are reproduced well by the spatiotemporal evolution of the observed daily precipitation for most of China during the warm season. Among the reanalyses, JRA55 outperforms the other four reanalysis products in capturing the structure and variability of precipitation in warm seasons For extreme events, CRAI, ERAI, and JRA55 tend to underestimate the extreme precipitation amounts (R95pTOT), with relatively strong correlations with the observational data across most of eastern China, where the landfall of typhoons and the seasonal migration of monsoons (Meiyu) introduce abundant rainfall. In contrast, more frequent heavy rainfall can be found in most regions of China in both CFSR and MERRA2. Meanwhile, all of the reanalyses underestimate the CDDs, with fairly low correlations in the dry and arid regions of western China (P < 200 mm), where frequent droughts occur. Among them, CRAI and JRA55 show better agreement with the observed R95pTOT data than do the other products, with fewer biases, higher correlation coefficients, and much more similar linear trend patterns. Additionally, ERAI stands out in capturing the amount and temporal variations in the observed CDD data.
In general, CRAI, ERAI, and JRA55 tend to overestimate light and moderate grades of precipitation but underestimate heavy and extreme precipitation compared to the CPC observational data. Meanwhile, a bias of too much precipitation in all categories from light to extreme ranges is presented in both CFSR and MERRA2. Moreover, CRAI agrees best with the observed distribution of precipitation rates for the 8-25 mm day −1 range in China. These results suggest that CRAI is potentially applicable for studying the large-scale daily variability of precipitation in China, whereas it should be used with caution when monitoring heavy and extreme precipitation events in semi-humid (400 ≤ P < 800 mm) and humid areas or the dry spells associated with droughts in arid and semi-arid (200 ≤ P < 400 mm) areas of China. The results presented here suggest that various reanalysis products should be combined for the study of weather and climate, since no reanalysis product is superior to any other in terms of local-scale precipitation at daily timescales (Kidd and Huffman, 2011). Therefore, the application of reanalysis data for climate and hydrological studies should be performed carefully, and bias correction strategies are necessary for model initiation (Trenberth and Guillemot, 1998;Berg et al., 2003;Decker et al., 2012).
It is worth noting that these results are heavily dependent on the reliability of observations. However, there are many uncertainties in observed datasets, stemming from the quality and/or consistency of the underlying station data to the choices made within a chosen gridding/interpolation method (parametric uncertainty), and the network selection and analytical framework (structural un-certainty) (Yin et al., 2015). These uncertainties generally influence both the magnitude and trend of extreme precipitation (Hofstra et al., 2010). Hence, further analysis might be needed to test the robustness of the results using different observations. Moreover, how to improve the results is also not deeply dealt with in this study. Nevertheless, this study represents a comprehensive evaluation of the capability of the latest reanalysis products to capture the observed spatiotemporal variability of precipitation in China, including extreme precipitation events. The work provides an important reference for future climatic applications, including statistical flood frequency analysis, water resource planning, design, and system operations. In general, the CRAI precipitation data are applicable and interpretable. Further studies will compare CRAI with other observational data, such as gauge-observed daily precipitation records from a dense national network of > 2400 gauges, for clarifying the overall performance of the reanalysis. In addition, it would be desirable to carry out longer reanalyses by comparing the 40yr product (CRA-40) with the recently released ERA5 reanalysis in the future work. More detailed analysis of the accuracy of daily data of CRAI will further explore the possible causes for the biases, in order to provide program developers with additional information that could lead to improvements.