A Multivariable Approach for Estimating Soil Moisture from Microwave Radiation Imager (MWRI)

Accurate measurements of soil moisture are beneficial to our understanding of hydrological processes in the earth system. A multivariable approach using the random forest (RF) machine learning technique is proposed to estimate the soil moisture from Microwave Radiation Imager (MWRI) onboard Fengyun-3C satellite. In this study, Soil Moisture Operational Products System (SMOPS) products disseminated from NOAA are used as a truth to train the algorithm with the input of MWRI brightness temperatures (TBs) at 10.65, 18.7, 23.8, 36.5, and 89.0 GHz, TB polarization ratios (PRs) at 10.65, 18.7, and 23.8 GHz, height in digital elevation model (DEM), and soil porosity. The retrieved soil moisture is also validated against the independent SMOPS data, and the correlation coefficient is about 0.8 and mean bias is 0.002 m3 m−3 over the period from 1 August 2017 to 31 May 2019. Our retrieval of soil moisture also has a higher correlation with ECMWF ERA5 soil moisture data than the MWRI operational products. In the western part of China, the spatial distribution of MWRI soil moisture is much improved, compared to the MWRI operational products.


Introduction
Soil moisture controls the evapotranspiration (ET) process of land surfaces and plays an irreplaceable role in the interactions between water, energy, and carbon cycles over land (Albergel et al., 2012;Cao et al., 2019;Green et al., 2019;Liao et al., 2019;Rudd et al., 2019;Seager et al., 2019;Zhang et al., 2019). In the natural environment, soil moisture varies greatly with soil properties (i.e., porosity, texture, density, and structure), surface roughness, topography, land cover, land temperature, rainfall, and ET. Therefore, spatial-temporal heterogeneity is the major characteristic of soil moisture. Previous studies have shown the impact of soil moisture on atmospheric variables at various scales (Xu et al., 2012;Zhang et al., 2013;Parrens et al., 2014;Ruosteenoja et al., 2018;Pangaluru et al., 2019). Today, in addition to the point measurements of soil moisture from in situ probes, microwave remote sensing is widely used to determine large-scale surface soil moisture, typically in the low frequencies from 1 to 10 GHz (Wagner et al., 2007). In microwave remote sensing, both active and passive techniques have been deployed for making the soil moisture measurements. For example, Fengyun-3C (FY-3C) Microwave Radiation Imager (MWRI) measures the thermal emission from the surface, commonly expressed as brightness temperatures (TBs). These TBs are used to estimate the surface parameters such as soil temperature, soil moisture, and surface emissivity (Schmugge and Jackson, 1994).
Many studies have demonstrated that the L-band technology (frequency f = 1-2 GHz, wavelength λ = 30-15 cm) can be a better choice for soil moisture remote sensing since its radiation can better penetrate through the vegetation canopy than other shorter microwave wavelengths at X-/C-band and also the contribution from the atmosphere is negligible (Wigneron et al., 2003). Soil Moisture and Ocean Salinity (SMOS) is the first satellite mission dedicated to making global observations of soil moisture. It was launched on 2 November 2009. The SMOS satellite carries an L-band radiometer to derive the global soil moisture every three days with a designed accuracy of 0.04 m 3 m −3 at a spatial resolution of about 50 km (Kerr et al., 2001;Kerr, 2007). Launched in January 2015, Soil Moisture Active Passive (SMAP) mission utilizes L-band radar and radiometer instruments together to monitor surface soil moisture and distinguish frozen from thawed soils.
However, global soil moisture products can also be derived by using higher microwave frequencies (Wagner et al., 2007). For instance, MWRI sensors are carried on three Fengyun satellites (FY-3B/-3C/-3D), and their Xband (10.65 GHz) observations are routinely used to retrieve global surface volumetric soil moisture (VSM) product. Since the launch of the FY-3B satellite in November 2010, MWRI data have been available for a variety of applications. Operational applications and climate change researches can benefit from the VSM products derived from this long-term, continuous, stable X-band dataset (Wang et al., 2010;Yang et al., 2011;Yang Z. D. et al., 2012). However, compared with other satellite VSM products, there is a relatively large error (Parinussa et al., 2014) in the released MWRI VSM product (L2) by the National Satellite Meteorological Center (NSMC) of China Meteorological Administration (CMA).
In general, the observations at X-band are not ideal for soil moisture retrieval, e.g., the estimated accuracy is about 0.06 cm 3 cm −3 for the Advanced Microwave Scanning Radiometer (AMSR)-E/-2 X-band observations (Du et al., 2017), because of the reduced penetration in vegetation and the increased effects of various noises from surface thermodynamic temperature, snow cover, topography, surface roughness, soil properties, and the intervening atmosphere (Choudhury et al., 1982;Jackson and Schmugge, 1991;Njoku and Entekhabi, 1996;Wigneron et al., 2003). Therefore, an improvement to VSM retrieval from MWRI is highly designed in accounting for the multivariable nonlinear relationship. The multivariable effects on surface VSM retrieval may be better addressed through a machine learning algorithm such as the random forest (RF). Unlike a traditional linear regression fit technique, the RF method allows many input variables or parameters used for nonlinear regression and can estimate the contributions on VSM estimating from each input parameter.
In this study, in addition to the X-band observations, multifrequency, multipolarization measurements from MWRI, vegetation leaf area index (LAI), surface temperature, topography [height in digital elevation model (DEM)] data, and statistical soil porosity map are considered as the input parameters of the RF training model. The effects from each input parameter are evaluated by the mean square errors (MSEs) and the node impurity derived from the RF model. Hence, the major factors of VSM retrieval for building the RF training model could be determined. The scores of the VSM derived from different RF models with various input parameters are evaluated and the final retrieval accuracy is derived through comparing with various soil moisture products including the in situ data.

MWRI products
MWRI L1 products from FY-3C are collected for nearly two years from 1 August 2017 to 31 May 2019 for estimating VSM. Since the vegetation penetration is limited for passive X-band remote sensing, we focused on the MWRI data that have a good dynamic range for measuring the soil moisture. The observations are typically associated with the seven surface types used in this study. The measurements from other surfaces such as water, forest, closed shrublands, permanent wetlands, urban, and snow/ice surfaces are excluded in our training. The International Geosphere-Biosphere Programme (IGBP) data acquired in MWRI L1 dataset are used for surface classification.
MWRI measures TBs at five frequencies (10.65, 18.7, 23.8, 36.5, and 89.0 GHz) with both horizontal and vertical polarizations (TB H and TB V ) at a fixed viewing angle (53.4°). The radio frequency interference (RFI) effects from artificial sources on X-band (10.65 GHz) observations are removed by using the spectral difference method (Li et al., 2004). In addition to the multichannel TBs, the corresponding polarization indices and DEM data in the MWRI L1 products are also used as the RF model input.
Currently, the daily operational product of MWRI L2 VSM is derived by using an algorithm based on the parameterized surface emission model provided by Shi et al. (2006), and the vegetation effects are corrected by using AUGUST 2020 ancillary vegetation data, e.g., normalized difference vegetation index (NDVI) and vegetation water content (VWC) data. MWRI L2 VSM product is used as a baseline in this study to evaluate the improvement of the retrieval accuracy by using a multivariable method based on the RF training model. Both MWRI L1 and L2 products are resampled to the same 0.25-degree grids for intercomparison using the nearest-neighbor search algorithm. MWRI L1 products are missing for seven days from 25 to 31 March 2018, while MWRI L2 products have more missing data on the dates 24 March and 17 September 2018.

ECMWF land surface data
In order to analyze the effects on MWRI VSM retrieval from various land surface parameters, we use the low vegetation LAI, dewpoint temperature and air temperature at a height of 2 m, skin temperature, and soil temperature for the top 7 cm derived from ECMWF ERA5 reanalysis dataset at 0000 UTC as the input parameters of the RF training model. Moreover, the statistical global porosity map (0.25-degree grids) provided by European Space Agency Climate Change Initiative (ESA-CCI) project is also contained in the RF model input to represent the variabilities of soil texture for a global scale. On the other hand, the VSM products for the top 7 cm from ECMWF ERA5 are used to validate the estimated VSM by using the RF model.

Soil Moisture Operational Products System (SMOPS) and other satellite VSM products
The National Environmental Satellite, Data, and Information Service (NESDIS) SMOPS daily blended satellite VSM products for the top 5 cm are used as the learning objectives in the RF training model. SMOPS merges almost all available VSM products derived from L-/C-/X-band, including from SMAP, SMOS, Global Precipitation Measurement (GPM), Global Change Observation Mission-Water 1 (GCOM-W1), and Meteorological Operational (MetOp)-A/-B satellite missions. SMOPS has much better spatial and temporal coverages than sparse in situ VSM observations for a global scale, and it has been verified by using the in situ VSM observations from various networks (Liu et al., 2016). SMOPS is a very valuable data source for training the MWRI machine learning VSM algorithm. Thus, daily SMOPS VSM products are also collected for validation over the same time period of the MWRI operational products that we used in this study.

In situ soil moisture observations
The daily mean in situ VSM observations at 5-cm depth from 12 stations in Natural Resources Conservation Service-Soil Climate Analysis Network (NRCS-SCAN) over the American continent (Schaefer and Paetzold, 2000) are used to validate the estimated VSM using RF models from 1 August 2017 to 31 May 2019.

Measurement physics
The TB p measured by a spaceborne microwave radiometer is given by Schmugge and Jackson (1994): , where the superscript p denotes polarization (V or H); TB atm_up is the upwelling TB from the atmosphere; Γ atm is the atmospheric transmittance from surface to the top of atmosphere, which is related to the atmospheric opacity along the viewing path; TB atm_down represents the downwelling TB from the atmosphere, which is partly reflected depending on the surface reflectivity ( ); is the TB from the surface emission. At the MWRI 10.65-GHz frequency, the atmospheric contributions to the measured TB p at clear-sky conditions are often negligible, i.e., TB p ≈ , but at higher frequencies, and for precipitating clouds, the effects of atmospheric emission and scattering are required to separate from the observed TB p . For a homogenous vegetated surface, the vegetation layer attenuates the radiation emitted from soil and emits the radiation at its own emission. In general, where the first term is the soil emission attenuated by vegetation layer: is the soil emissivity, Γ is the vegetation transmissivity related to the vegetation optical thickness, and TB soil is the soil TB; the second term represents the sum of the direct vegetation emission and the reflected vegetation emission by soil surface: ω is the single scattering albedo of the vegetation surface, r is the soil reflectivity, and TB veg is the vegetation temperature. Approximately, TB soil and TB veg are assumed equal, and . Equation (2) has been used for low vegetation surfaces at frequencies up to X-band, in which vegetation transmissivity can be approximated as: where VWC is the vegetation water content (kg m −2 ), θ is the incident angle, and coefficient depends on the fre-quency, vegetation type, VWC, polarization, θ, crop phenology, and especially the vertical structure of the canopies (Ulaby and Wilson, 1985;Jackson and Schmugge, 1991;Wigneron et al., 2003).
In addition to the complexity of the vegetation effects, the soil reflectivity also varies with many factors. For a smooth soil, can be calculated from the soil permittivity related to soil moisture and the incidence angle (θ) by using Fresnel equations. But in general, the effects of surface roughness and soil texture should also be taken into account (Wang and Choudhury, 1981;Wang, 1983). It is difficult to build a physical parameterized model to cover all the possible situations over the global lands. For example, although the roughness and vegetation effects are considered in the parametric model used for deriving MWRI L2 VSM products, the retrieval results are still suboptimal for many conditions. This may be due to using only X-band MWRI observations. Hence, the corresponding factors related to vegetation, surface temperature, roughness, and soil properties are all required in the RF model input to deal with such a complex multivariable issue for estimating VSM from MWRI data.

RF training model input
Because multifrequency and multipolarization TB measurements have different sensitivities to vegetation and roughness than soil moisture, it is possible to correct the vegetation and roughness effects on soil moisture retrieval by combining various frequency and polarization TB observations together (Njoku and Entekhabi, 1996). There is a large polarization difference (PD = TB V − TB H ) from bare soil emission ( ), but the difference is reduced with increasing vegetation biomass and/or vegetation cover density. This property has been commonly used in the vegetation correction of the soil moisture retrieval from passive microwave radiometers (Owe et al., 2001;Paloscia et al., 2001). The most common index is the TB polarization ratio (PR), defined as: Since the PR index is less sensitive to the effects of surface temperature than the PD index, it is adopted in this study. PR is less affected by soil moisture and more affected by vegetation and roughness with increasing frequency. This is the physical basis for a multivariable approach by combining multifrequency TBs and PRs as the input of the RF training model to distinguish soil moisture contribution from that of surface temperature and vegetation. Since the DEM data are provided in the MWRI L1 dataset and the statistical soil porosity map is available, we can simply use them as the input paramet-ers to represent the changes in topography and soil texture for a global scale. Moreover, the VSM retrieval derived from MWRI is very robust and fairly independent by using TBs, PRs, DEM, and soil porosity as the RF training model input.
On the other hand, since the proposed multivariable method (RF model) in this study is expected to assess the vegetation and surface temperature effects on soil moisture retrieval, LAI and soil/air/skin temperature reanalysis from the ECMWF ERA5 are added to the model input together with the independent input parameters to perform dependent experiments. The VSM estimates derived from the dependent experiments are compared with those from the independent experiments.

RF training model experiments
First, a representative training dataset is needed to build the RF training model for VSM estimating. Here, MWRI observations from four random days (31 January, 30 April, 31 July, and 31 October 2018) are used to represent the seasonal differences. In addition, to account for the spatial heterogeneity of vegetation covers and soil properties at a global scale, we also select the training samples from MWRI 0.25-degree grid global data to cover the spatial variations. Totally, 642,408 samples are used as our training dataset.
Four RF models are trained through the R package "RandomForest" (Liaw and Wiener, 2002), using the same training dataset and the same number of decision trees (480) but various model inputs. EXP1 is a dependent experiment containing all available 22 parameters: 17 independent parameters from MWRI (multifrequency TBs, PRs, and DEM) and the statistical soil porosity map, and 5 dependent parameters from ECMWF ERA5 reanalysis (low vegetation LAI, dewpoint temperature and air temperature at a height of 2 m, skin temperature, and soil temperature for the top 7 cm). Different from EXP1, EXP2 is an independent experiment with only the 17 independent input parameters in EXP1. From our analysis (see Section 4.1), we remove the less important input parameters from MWRI (PRs from 36.5 and 89.0 GHz) in EXP1 and EXP2 to perform EXP3 and EXP4, respectively. In EXP3, we only use LAI and soil temperature from ECMWF to represent the variations in vegetation and surface temperature, excluding the dewpoint, air, and skin temperatures from ECMWF in EXP1 input. Finally, EXP3 is a dependent experiment with 17 input parameters, while EXP4 is an independent one with 15 input parameters. The detailed flowchart for the optimal RF model decision is shown in Fig. 1.

Performance of the MWRI L2 VSM products
Since the performance of the MWRI L2 VSM products compared with other satellite VSM products is rarely presented in the past, an intuitive comparison is presented first in Fig. 2 between MWRI L2 and four daily gridded VSM products derived from SMAP, SMOS, AMSR2 on GCOM-W1, and GPM Microwave Imager (GMI), respectively, for the focused land covers from 1 August 2017 to 31 May 2019. The number of daily available VSM estimates from each satellite is different but with a similar trend across time.
The daily coefficient of determination (R 2 ) scores between MWRI L2 and SMAP are much higher than the scores by comparing MWRI L2 with the other satellite products. This is positive to use MWRI observations to estimate VSM. But it is noticed that the range of the R 2 scores between MWRI L2 and SMAP is large from 0.1 to 0.6, and the average value of the R 2 scores is 0.44 over the whole time period. Moreover, the average values of the R 2 scores are only 0.25, 0.09, and 0.09 for MWRI L2 compared with SMOS, GMI, and AMSR2, respectively. On the other hand, the unbiased root mean square difference (ubRMSD) scores are relatively large. During the whole time period, the smallest average value of ubRMSD is 0.10 m 3 m −3 from the comparison between MWRI L2 VSM and SMAP, while the average ubRMSD values are all around 0.13 m 3 m −3 for MWRI L2 VSM compared with the other three satellite VSM products. Therefore, we need to increase the stability of the MWRI VSM retrieval and reduce the values of the ubRMSD scores.

RF training dataset and models
The training dataset used in this work is simply selected from four random days to represent the four seasons in a year, but it includes the variations in global soil texture and land cover using a high-density spatial sampling (0.25 degree). Figure 3 presents the statistical distributions of three variables in the training dataset, including land cover type, SMOPS product (learning objective), and TB observation from the 10.65 GHz horizontal channel (TB10H) typically used for VSM retrieval [e.g., National Snow and Ice Data Center (NSIDC) AMSR-E/-2 VSM product]. In Fig. 3 Fig. 3 are only from 4-day observations, it approximately represents the annual distributions for a global scale. Based on the above training dataset, EXP1 and EXP2 models are trained by using the RF method with different input parameters. The contribution of each input in each model is evaluated by two factors: the percentage for increasing the MSE and the node impurity. The higher percentage for increasing MSE or the larger node purity value (meaning impurity), the larger contribution of the input parameter on VSM retrieval. Although DEM and soil porosity are ancillary data, not observed by MWRI, both of them are very important for increasing MSE and node impurity shown in Fig. 4. In EXP1 with all avail-able input parameters, soil porosity is the most important parameter for increasing node impurity and the second important parameter for increasing MSE, while DEM is the top one for increasing MSE and the sixth for increasing node impurity. The following are vegetation and temperature from ECMWF and low frequency TBs and PRs from MWRI. It is similar in EXP2, without the contributions from ECMWF reanalysis data, DEM rises to the third one for increasing node impurity. From both EXP1 and EXP2 models, we consider that PRs from 36.5 and 89.0 GHz are the two least important input parameters in increasing both MSE and node impurity and can be removed from the input parameters. This is similar to that only low frequency PRs (from 6.9, 10.6, and 18.9 GHz of AMSR-E) are used in Njoku et al. (2003). Here, PR at 18.7 GHz is less important than that at 23.8 GHz in both EXP1 and EXP2 models. Figure 5 displays an example of the estimated VSM from the independent EXP4 using the least input para-

Journal of Meteorological Research
Volume 34 SMOPS products than MWRI L2 products. Due to the MWRI data gap and the limited land covers for estimat-ing MWRI VSM, the number of the available EXP4 estimates is smaller than that of SMOPS, but the number of the public MWRI L2 VSM products is even less, especially over the Tibetan Plateau and Northern Hemisphere high latitudes. The spatial distribution of MWRI soil moisture estimates is much improved in EXP4.
For an intercomparison, SMOPS and ECMWF ERA5 are used as the benchmark, respectively, for the whole time period from 1 August 2017 to 31 May 2019. Daily global scores of the MWRI VSM retrievals from EXP1, EXP2, EXP3, EXP4, and MWRI L2 are calculated only using the data where the VSM estimates from all data sources are available. Hence, the data size is the same for the four experiments and MWRI L2 product. Because of the missing data in SMOPS, the amount of SMOPS data is slightly smaller than that of ECMWF ERA5 data. We can see the dynamic change of the available data size in Fig. 6. The amount of data is much larger in summer and smaller in winter. Due to the soil freezing and snow/ice cover problems in MWRI L2 VSM products, missing data are much more common in winter.
From the daily mean bias scores between various MWRI VSM and SMOPS in Fig. 7, all of the four experiments are close and around zero, while for MWRI L2 the daily mean bias ranges between −0.025 and 0.025 m 3 m −3 . Since SMOPS is the learning objective in the RF models, the bias between SMOPS and the model derived value is very small. But note that only 4-day SMOPS data are used in the training data, and most of the model estimates (656 of 660 days) are independent. This means that our multivariable method can produce MWRI VSM estimates with a really small bias related to SMOPS. However, based on the independent data source, ECM-WF ERA5, the mean bias values of our four experiments are all below zero, displaying a negative bias, as low as −0.025 m 3 m −3 . The mean bias scores of the MWRI L2 AUGUST 2020 Zhang, S. B., F. Z. Weng, and W. Yao products are also worse, with a larger variation from −0.050 to 0.025 m 3 m −3 . In Fig. 8, the VSM retrievals from all four experiments generally present a good agreement with SMOPS: R 2 is not lower than 0.55, ubRMSD does not exceed 0.05 m 3 m −3 , and p-values are all smaller than 0.01. A significant improvement is found in the retrievals derived from our proposed multivariable method, compared with the MWRI L2 VSM products (R 2 below 0.45 and ubRMSD around 0.11 m 3 m −3 ). Among the four experiments, with the help of the ancillary LAI and temperature data, R 2 scores in EXP1 and EXP3 are relatively higher than those in EXP2 and EXP4: the average value of R 2 scores is 0.66 (0.65) in EXP1 (EXP3) and 0.63 in both EXP2 and EXP4. It presents the enhancement in estimating MWRI VSM with vegetation and temperature information. Moreover, the difference between EXP2 and EXP4 is neglected and thus this proves the 15 input parameters in EXP4 enough for training the independent RF model. For dependent experiments, EXP1 is slightly better than EXP3. There is no doubt that the best R 2 and ubRMSD scores are from the four random days (31 January, 30 April, 31 July, and 31 October 2018) used for constructing the training dataset.
We also compare the MWRI retrievals in this study and the MWRI L2 products with the independent ECM-WF ERA5 reanalysis. An improvement for estimating MWRI VSM using our method is also found in R 2 scores but not significant in ubRMSD scores. The average value of R 2 scores during the whole time period from 1 August 2017 to 31 May 2019 is 0.68 (0.64) in EXP1 (EXP3) and 0.62 in both EXP2 and EXP4, while for MWRI L2, it is only 0.42. Moreover, the difference between EXP1 and EXP2/EXP4 is obvious. But note that the ancillary data used in EXP1 are also from ECMWF ERA5, which are not independent and might increase the effects of vegetation and temperature on VSM retrieval. On the other hand, the average value of ubRMSD scores is around 0.11 m 3 m −3 for all MWRI experiments and around 0.12 m 3 m −3 for the MWRI L2 products, which is very close and much larger than that based on SMOPS. Figure 8 shows a slight improvement in MWRI experiments in spring and summer.
The detail scores for various land covers are shown in Tables 1 and 2 using SMOPS and ECMWF ERA5 as the benchmark, respectively. The largest percentage of land covers is from open shrublands (IGBP 7). Moreover, all scores (R 2 , bias, and ubRMSD) are good in open shrublands (Table 1). Besides, R 2 scores in grasslands (IGBP 10) and ubRMSD scores in barrens (IGBP 16) are the best in all land covers. While the lowest R 2 values and the largest bias values are both from croplands/natural vegetation mosaic lands (IGBP 14), and the largest ubRMSD values are from woody savannas (IGBP 8). But  note the number of the daily estimates in croplands/natural vegetation mosaic lands (IGBP 14) is relatively small due to the limited global coverage. In Table 2, taking ECMWF ERA5 as the benchmark, R 2 and bias scores in woody savannas (IGBP 8), croplands/natural vegetation mosaic lands (IGBP 14), and barrens (IGBP 16) are relatively low in all land covers, but ubRMSD scores in barrens (IGBP 16) are the best (around 0.05 m 3 m −3 ) and much better than others (from 0.08 to 0.11 m 3 m −3 in our experiments). From the above results, we conclude the retrieval accuracies for croplands/natural vegetation mosaic lands and woody savannas are the worst, and the res-   Since the scores between MWRI L2 VSM and other satellites VSM are shown in Fig. 2, we also present the scores between EXP4 VSM (the independent estimates) and the same benchmarks in Fig. 9. The average ubRMSD score between MWRI VSM and AMSR2 is significantly improved from 0.13 (L2) to 0.07 m 3 m −3 (EXP4), while for comparing with SMAP, GMI, and SMOS, the average ubRMSD values are also reduced to 0.08, 0.10, and 0.11 m 3 m −3 , respectively. An improvement is also found in the R 2 scores. The highest one (0.51) is from the comparison between EXP4 and SMAP,     the second one (0.29) is from the comparison with SMOS, while for comparing with AMSR2 and GMI, R 2 scores are 0.17 and 0.14.
In situ VSM observations (daily mean values) at 5 cm from the NRCS-SCAN are used to further evaluate the performance of the EXP4 VSM estimates derived by our RF model and the improvement compared with the MWRI L2 VSM products. Since the VSM estimates are made at a 0.25-degree grid resolution, not all in situ stations are suitable for validating. The VSM observations from 12 stations from 1 August 2017 to 31 May 2019 are selected to assess the correlations with the SMOPS products, MWRI L2 VSM products, and EXP4 estimates, respectively. The scores from the 12 stations are shown in Table 3. The correlation coefficient (R) scores between scaled EXP4 estimates and scaled in situ observations range from 0.4 to 0.7, which are significantly better than the R scores (from −0.2 to 0.5) of scaled MWRI L2 VSM products, and slightly worse than the R scores (from 0.5 to 0.8) of scaled SMOPS products. The mean values of the ubRMSD scores from the 12 stations are 0.87, 0.98, and 1.35 m 3 m −3 for scaled SMOPS products, scaled EXP4 estimates, and scaled MWRI L2 VSM products, respectively. Similar to R scores, the performance of ubRMSD scores of EXP4 estimates is much better than that of MWRI L2 products but slightly worse than that of SMOPS products.
Scaled EXP4 VSM estimates from three typical stations in Table 3 are compared in Fig. 10 with three scaled VSM time series: in situ VSM observations at a depth of 5 cm, MWRI L2 VSM products, and SMOPS products. EXP4 estimates are significantly closer to the in situ observations than the MWRI L2 VSM products. Moreover, the available MWRI L2 VSM products (N values in Table 3) are obviously less than the EXP4 estimates. Most of the missing data in MWRI L2 VSM are much more common in winter, shown in Fig. 10. It proves that more MWRI VSM estimates can be obtained and better performance of MWRI VSM can be reached by using our RF model.

Discussion
In using machine learning methods, it is very important to make sure the spatial-temporal representation of training sample space. According to the characteristics of the surface soil moisture, we emphasized the spatial sampling at 0.25-degree grids but the temporal sampling was sparse, i.e., only four days in a year were used. Therefore, we evaluate whether the sample space is big enough by comparing the difference among 4, 8, and 12 random days used in a whole year with the same input parameters in EXP4. The statistical scores are shown in Fig. 11. The 4-day experiment is the same as EXP4, and the 4 random days are 31 January, 30 April, 31 July, and 31 October 2018; for the 8-day experiment, the 8 random days are 15 January, 31 January, 15 April, 30 April, 15 July, 31 July, 15 October, and 31 October 2018; while for the 12-day experiment, the 12 random days are 31 January, 28 February, 24 March, 30 April, 31 May, 30 June, 31 July, 31 August, 30 September, 31 October, 30 November, and 31 December 2018. Since MWRI L2 VSM products are missing on 24 March, the VSM estimates on this day are not shown in Fig. 11.
Compared with SMOPS products, the statistical scores are slightly better by using more random days: the average R 2 = 0.67, 0.65, and 0.63, and the average ubRMSD = 0.042, 0.043, and 0.044 m 3 m −3 for 12, 8, and 4 days, respectively, while there is no improvement in scores compared with ECMWF ERA5 products. It is clearly shown that more random days cannot significantly improve the scores. However, using more days of MWRI data in the training can consume huge computing re- sources. In our current computing environment, 12 days of MWRI data in the training require 9 days to complete the calculation.

Conclusions and future perspectives
Based on the MWRI measurement physics, a multivariable approach using the RF machine learning technique is proposed to estimate MWRI VSM for improving retrieval accuracy. Moreover, the VSM estimates derived from different RF models are compared with the validations from SMOPS or ECMWF ERA5 VSM products during the same time period from 1 August 2017 to 31 May 2019, to evaluate the differences in retrieval accuracy due to the various model input parameters. From the MWRI experiments, the independent input parameters from all 10-channel TBs and three PRs at 10.65, 18.7, and 23.8 GHz, DEM, and soil porosity are determined as the significant important ones. Based on these model input parameters, the independent MWRI retrievals are in good agreement with the SMOPS products: R 2 = 0.63, ubRMSD = 0.044 m 3 m −3 , and mean bias = 0.002 m 3 m −3 for a global scale. This positive result shows the possibility of using the RF training model to estimate MWRI VSM independently.
It should be noted that there is a risk of data gap from the present to 2025, for continuing the low frequency microwave observations, i.e., from GCOM-W1 AMSR2 and GMI on GPM. Meanwhile, MWRI observations from FY-3 series satellites can be a beneficial supplement to ensure data integrity and increase the data density, but more work should be investigated to make an operational RF model for releasing daily MWRI VSM products with high accuracy. This is useful to promote the reasonable use of global satellite resources and to speed up the development of national satellite product applications.

744
Journal of Meteorological Research Volume 34 First, vegetation and land surface temperature are useful for MWRI VSM retrieval, but we need to find good proxies of them from Fengyun satellite observations or other satellites with the least spatial and temporal influences. Second, the amount of the missing data is significantly large in MWRI L2 VSM products. It is needed to assess the possibility of increasing the available VSM estimates based on the multivariable machine learning technique. Once the training model is constructed, we can obtain the MWRI VSM estimates using the MWRI observations, but we need to validate the retrieval accuracy especially over the many places where the soil exper-   AUGUST 2020 Zhang, S. B., F. Z. Weng, and W. Yao iences freezing and thawing. We will further explore the ways of selecting the training data samples with the best representation of the MWRI observations and the VSM observations.