Verification of High-Resolution Medium-Range Precipitation ... - MDPI

6 downloads 0 Views 12MB Size Report
Mar 13, 2018 - Precipitation Forecasts from Global Environmental .... issues, some forecasts (about 15%) are missing from the website within the study period but the ... The west part of Inner Mongolia and Xinjiang is less than 200 mm.
atmosphere Article

Verification of High-Resolution Medium-Range Precipitation Forecasts from Global Environmental Multiscale Model over China during 2009–2013 Huating Xu 1 1 2

*

ID

, Zhiyong Wu 1, *

ID

, Lifeng Luo 2

ID

and Hai He 1

College of Hydrology and Water Resources, Hohai University, Nanjing 210098, China; [email protected] (H.X.); [email protected] (H.H.) Department of Geography, Environment and Spatial Science, Michigan State University, East Lansing, MI 48824, USA; [email protected] Correspondence: [email protected]; Tel.: +86-025-8378-7743

Received: 9 December 2017; Accepted: 3 March 2018; Published: 13 March 2018

Abstract: Accurate and timely precipitation forecasts are a key factor for improving hydrological forecasts. Therefore, it is fundamental to evaluate the skill of Numerical Weather Prediction (NWP) for precipitation forecasting. In this study, the Global Environmental Multi-scale (GEM) model, which is widely used around Canada, was chosen as the high-resolution medium-term prediction model. Based on the forecast precipitation with the resolution of 0.24◦ and taking regional differences into consideration, the study explored the forecasting skill of GEM in nine drought sub-regions around China. Spatially, GEM performs better in East and South China than in the inland areas. Temporally, the model is able to produce more precise precipitation during flood periods (summer and autumn) compared with the non-flood season (winter and spring). The forecasting skill variability differs with regions, lead time and season. For different precipitation categories, GEM for trace rainfall and little rainfall performs much better than moderate rainfall and above. Overall, compared with other prediction systems, GEM is applicable for the 0–96 h forecast, especially for the East and South China in flood season, but improvement for the prediction of heavy and storm rainfall and for the inland areas should be focused on as well. Keywords: global environmental multi-scale; precipitation verification; high-resolution forecast; medium-term prediction; China

1. Introduction Timely and skillful precipitation forecasts are important for decision-making when dealing with meteorological and hydrological hazards such as floods and droughts. Reservoir operators can benefit from skillful precipitation forecasts for effective flood control, while during droughts farmers and water resources managers can also utilize precipitation forecasts to determine irrigation schedules for more effective drought mitigation. To this end, Quantitative Precipitation Forecast (QPF) in the short-term (up to 72 h in lead time) and medium-range (up to 15 days) are now available from a number of Numerical Weather Prediction (NWP) models around the world. However, a QPF, ignoring the uncertainty of precipitation occurrence, induces an illusion of certainty. Hence, if the forecast is inaccurate, the consequences can be grave, both in terms of economic and social losses. Consequently, Probabilistic Quantitative Precipitation Forecast (PQPF) based on probabilistic prediction has been developed and it offers many advantages [1] over QPF as it allows the total uncertainty related to the occurrence of a future event to be quantified. Much research has been conducted to verify and evaluate probabilistic forecasts [2–6]. Furthermore, stochastic models have been proposed for precipitation forecasting [7–10] but for flood prediction and drought detection Atmosphere 2018, 9, 104; doi:10.3390/atmos9030104

www.mdpi.com/journal/atmosphere

Atmosphere 2018, 9, 104

2 of 20

purposes, QPF is still more common and familiar than probabilistic forecasts [11]. In particular, the availability of increasingly more powerful computers and improved model physics now allow for higher resolution global models to be used in operational weather forecasting [12]. However, higher spatial resolution does not necessarily mean higher accuracy in model forecasts, which is imperative for assessing the accuracy and skill of these model forecasts before they can be properly used in real world applications [13,14]. The NWP model forecast products that are currently being widely used include the forecasts from the National Centers for Environmental Prediction (NCEP) Global Forecast System (GFS), the European Centre for Medium-range Weather Forecast (ECMWF) Integrated Forecast System (IFS), Japan Meteorological Agency (JMA), the UK Met Office and the Canadian Meteorological Center (CMC). Verification and evaluations of QPF from these models not only identify and quantify the model deficiency, which can help to guide future model improvements, but also provide the baseline for users to determine the usefulness of forecast information in a specific region [15–17]. Many previous studies have contributed to this verification effort by using in-situ observations and satellite remote sensing (e.g., [16,18–20]). Their results all indicated that model forecast skills vary with both space and time, thus for any specific region a set of metrics need to be used to develop a more comprehensive understanding of the forecast performance. The Global Environmental Multi-scale (GEM) model, developed by the Recherche en Prévision Numérique (RPN), Meteorological Research Branch (MRB) and the Canadian Meteorological Centre (CMC), is a NWP and data assimilation system currently used in Canada [21]. The GEM model is a global spectrum model that can run with variable resolution over a global domain such that high resolution is focused over an area of interest. The variable resolution approach is used for global data assimilation and medium-range weather forecasting, as well as regional data assimilation cycles and short-term prediction [21,22]. In 2006, the spatial resolution of the GEM model in mid-latitude regions increased from 100 km to 33 km and the numbers of vertical levels increased to 58 [23]. Bélair et al. [24] found that the forecast skill improved with the increased resolution. Zadra et al. [25] conducted an in-depth study on the regional climate forecasts in Canada and concluded that the GEM model is capable of predicting accurately. Markovic et al. [23] compared the forecast skill of the GEM model with ERA-40 reanalysis in North America and the tropical Pacific-East Indian Ocean and showed that the skills in seasonal average and variation are similar and reliable. Currently, the model’s spatial resolution has been increased to 0.24◦ (about 25 km) and its forecast now goes out to 10 days. The GEM model is one of the most widely used and ideal NWP models in North America. However, its performance and skill has not been assessed in China. Geographically, China is located at a similar latitude to North America, has a similar climate and there are mountains and plateaus in both regions, especially in the central and western parts. The GEM model has proven suitable for mountainous areas [26]. Additionally, GEM is also one of the adopted NWP models in international projects and more and more meteorological and hydrological applications are used or based on the GEM model [22,26]. Therefore, it is necessary to investigate its forecast merits and disadvantages in China, in order to understand its usefulness and potential applications in China. So far, few studies [22] have looked at the model’s forecast skill and potential application in China. Our study attempts to fill the gap by examining the GEM model’s precipitation forecasts between 2009 and 2013. To place the results in perspective, this study also compares the forecast performance from GEM with that from the NCEP GFS model. The overall objectives of the study are: (1) to examine the forecast skill of the GEM model at a high resolution for a 6-day lead time in different regions around China and (2) to understand the spatial and temporal variations of model skills with both season and lead time.

Atmosphere 2018, 9, 104

3 of 20

2. Data and Method 2.1. Data Observations of daily precipitation are obtained from the China Meteorological Scientific Data Sharing Website [27] for verification. The dataset is based on daily precipitation of 2419 stations around China and using Optimal Interpolation (OI) to get daily precipitation data at the resolution of 0.25◦ . Shen and Xiong [28] used the automated weather stations’ (AWS) data to check the quality of the interpolated dataset and showed that the dataset can be considered a reliable and high-quality precipitation dataset over Mainland China. The GEM model forecasts are regridded to 0.25◦ to match the grid of observations in this study. The medium-range precipitation forecasts from the high-resolution GEM model are obtained from Meteorological Service of Canada [29] for the period of 2009–2013. Model forecasts are released every 3 h and each forecast contains quantitative hourly precipitation with lead times up to 240 h. However, this study only focused on the first 144 h as the skills are generally higher at shorter lead times. The forecasts cover the entire globe with a spatial resolution of 0.24◦ . Due to some technological issues, some forecasts (about 15%) are missing from the website within the study period but the remaining data still provide a large sample size for a meaningful and robust verification. The total observations are 1826 and for the same periods, the total number of 24 h accumulative precipitation forecasts for all lead times is up to 1400. It is very important to test the representation of the observations and forecasts. There are many methods to calculate the effective degree of freedom of correlation coefficient [30–33]. In the study, we found that the correlation coefficient has passed the significant test with the confidence interval of 0.01, which illustrates that it is practical to use the data from 2009–2013 to validate the forecast skill of GEM. 2.2. Study Domain The climate across Mainland China varies significantly due to the complex landscape with drastic changes in topography from the east to the west. Therefore, the methods used to divide sub-regions in China were proposed in terms of spatial differences [34,35]. In this study, to better detect the geographical distribution of the model skill and to distinguish the differences of average annual precipitation, climate and landscape among regions, the study domain is divided into nine drought sub-regions following [36] (Figure 1). The 9 regions are: (I) Northeast, (II) North, (III) East, (IV) South, (V) Northwest, (VI) Southwest, (VII) Inner Mongolia, (VIII) Xinjiang and (IX) Tibet. As shown in Figure 1, the annual precipitation varies significantly between regions but more or less similar within each region. Precipitation is abundant in the southeast part of the domain and then gradually decreases towards northwest. The west part of Inner Mongolia and Xinjiang is less than 200 mm. Wu et al. [37] analyzed spatial and temporal characteristics of drought in these nine sub-regions, the results demonstrated that the precipitation and evaporation in each region differs from each other quite significantly, so it is necessary to differentiate them and treat them separately in the verification. 2.3. Evaluation Metrics We chose several metrics to quantify and summarize the precipitation forecast skills. Two categorical scores, threat score (TS) and bias score (BS) were used to measure GEM model’s forecast skill for different precipitation intensity. We first classified 24-h cumulative precipitation (P24h ) into five precipitation categories according to the Quality Verification Method for Short- and Medium-Term Weather Forecast proposed by China Meteorological Administration. The forecast periods in the study are 0–24 h (P1), 24–48 h (P2), 48–72 h (P3), 72–96 h (P4), 96–120 h (P5) and 120–144 h (P6). The categories are trace of rainfall (P24h ≥ 0.1 mm), little rainfall (0.1 mm ≤ P24h < 10 mm), moderate rainfall (10 mm ≤ P24h < 25 mm), heavy rainfall (25 mm ≤ P24h < 50 mm) and storm rainfall (P24h ≥ 50 mm) [38]. This categorization has been widely used in China for categorical forecast verification for precipitation [39–41]. For each category, we first determine if the P24 h forecast and

use the data from 2009–2013 to validate the forecast skill of GEM. 2.2. Study Domain The climate across Mainland China varies significantly due to the complex landscape with drastic changes in topography from the east to the west. Therefore, the methods used to divide4 subAtmosphere 2018, 9, 104 of 20 regions in China were proposed in terms of spatial differences [34,35]. In this study, to better detect the geographical distribution of the model skill and to distinguish the differences of average annual corresponding observations at a given among grid fallregions, into thisthe category. A contingency tableinto is then precipitation, climate and landscape study domain is divided nineproduced drought to summarize the number of counts for each of the four possible outcomes for all forecasts the sub-regions following [36] (Figure 1). The 9 regions are: (I) Northeast, (II) North, (III) East, during (IV) South, study period and(VI) for Southwest, all grids within the 9 regions (Table 1). Please noteTibet. that a As contingency (V) Northwest, (VII)each InnerofMongolia, (VIII) Xinjiang and (IX) shown in table is created for each precipitation category and for each lead time. Based on Table 1, TS and BS are Figure 1, the annual precipitation varies significantly between regions but more or less similar within calculated using the Formulas (1) and (2) below: each region. Precipitation is abundant in the southeast part of the domain and then gradually decreases towards northwest. The west part of Inner Mongolia and Xinjiang is less than 200 mm. Wu TS = Ncharacteristics A/(NA + NB + ), (1) et al. [37] analyzed spatial and temporal ofNC drought in these nine sub-regions, the results demonstrated that the precipitation and evaporation in each region differs from each other BS = ( N A + NB)/( N A + NC ), (2) quite significantly, so it is necessary to differentiate them and treat them separately in the verification.

Figure 1. 1. The The nine nine sub-regions sub-regions of of mainland mainland China China used used in in this this study study and and average average annual annual precipitation precipitation Figure ◦ based on on gridded griddedobservations observationsat atthe theresolution resolutionof of0.25 0.25°from from2009 2009to to2013. 2013. based Table 1. The contingency table. Observation

Forecast

Yes No

Yes NA NC

No NB ND

The TS score measures the degree of coincidence between observations and forecasts and the BS represents the overall ratio between forecast and observation for the occurrence of rainfall for the given category and it reflects the bias in rainfall area for that category. We also use correlation coefficient (R), relative mean absolute error (RMAE) and mean bias error (MBE) to quantify the forecast skills. The R measures the degree of correlation between forecasts and observations while the RMAE reflects the percentage of the average absolute difference between forecasts and observations in average observations and MBE measures the differences between average forecasts and observations. Their calculation is shown in Equations (3)–(5): R= q

∑iN=0 (Oi − O)(Fi − F ) q , 2 2 ∑iN=0 (Oi − O) ∑iN=0 (Fi − F )

(3)

Atmosphere 2018, 9, 104

5 of 20

RMAE =

∑iN=0 | Fi − Oi | × 100%, ∑iN=0 Oi

MBE =

(4)

∑iN=0 (Fi − Oi ) , N

(5)

where, N is the total number of 0.25◦ grids in the study domain, Oi and Fi (i = 1, 2, . . . , N) represent the observed precipitation and forecast precipitation for each grid and O and F are the average observations and forecasts respectively. Previous studies often use root mean square error (RMSE) [42–44] to evaluate the bias of forecasts against observations. However, RMSE is not independent of bias, including both systematic errors and the variable portion of the error field, the RMSE cannot distinguish systematic and random errors [43–45]. The great variation in observations is usually regarded as the error between forecasts to the observations. So, in the study, we utilized the alpha index (AI), which defines the intensity of systematic error and quantifies the forecast skill taking the random errors into consideration [42,46]. The calculation of AI is shown in Equation (6): AI =

2  ∑iN=0 (Fi − F ) − (Oi − O) 2

∑iN=0 ( Fi − F) + ∑iN=0 (Oi − O)

2

,

(6)

AI = 0 represents that the random error is small and F and O correspond with each other after revision. AI = 1 indicates that forecasts and observations are not consistent with high random error, while AI = 2 means small random error and negative coefficient between forecasts and observations. Therefore, when the model is physically reliable, AI should be less than 1. Another metric we used in the study is ETS (equitable threat score) [43,47]. The ETS is similar with TS but without randomness of precipitation. The calculation of ETS is shown in Equations (7) and (8): NA − E ETS = , (7) N A + NB + NC − E E=

(NA + NC) × (NA + NB) . N A + NB + NC + ND

(8)

3. Results 3.1. Spatial Distribution of Forecast Skill 3.1.1. Forecast Skill of Precipitation Categories In the study, the spatial TS and BS were used to measure the forecast skill for different precipitation categories and lead times in each sub-region. Figure 2 shows the multi-year average TS and BS for different forecast periods in the nine sub-regions from 2009 to 2013. We can see that TS in different sub-regions decreases with the increase of lead time and the increase of precipitation intensity, while BS increases instead and the variation characteristics of these two metrics correspond to previous studies [16,40,48–50]. For TS in different sub-regions, the results show that for different precipitation categories and forecast periods, the scores in Southwest (VI) and East (III) are higher than that in other regions. Furthermore, TS in Southwest (VI) for the trace of rainfall for different lead times are greater than 0.4. TS in Xinjiang (VIII) are all less than 0.2, the lowest in all sub-regions, with 0.15–0.19 and 0.14–0.16 for the trace of rainfall and little rainfall forecast. The variation degree of TS in different forecast periods varies in different sub-regions. In Xinjiang (VIII), Tibet (IX) and Southwest (VI), the variation in TS is less than 0.04. While in Northeast (I), North (II) and Inner Mongolia (VII), the TS variations are above 0.1 from the trace of rainfall to moderate rainfall forecast and that for heavy and extreme rainfall forecast are around 0.05.

Atmosphere 2018, 9, 104 Atmosphere 2018, 9, x FOR PEER REVIEW

6 of 20 6 of 20

Figure 2. average (a) TS score)score) and (b) BS (bias score) of nine sub-regions over China Figure 2. The The average (a) (threat TS (threat and (b) BS (biasvalue score) value of nine sub-regions for different periods and precipitation categories categories from 2009from to 2013 over China forforecast different forecast periods and precipitation 2009 of to GEM 2013 of(Global GEM Environmental Multi-scale Model). The (I) to (IX) here represent Northeast, North, East, South, (Global Environmental Multi-scale Model). The (I) to (IX) here represent Northeast, North, East, Northwest, Southwest, Southwest,Inner Inner Mongolia, Xinjiang respectively the torepresent P6 here Northwest, Mongolia, Xinjiang and and TibetTibet respectively and theand P1 to P6P1 here represent the periods 6 forecast periods 0–24 to the 6 forecast from 0–24 from to 120–144 h. 120–144 h.

For BS in the same forecast period, the value for trace of rainfall and little rainfall are greater For BS in the same forecast period, the value for trace of rainfall and little rainfall are greater than 1, indicating that the forecasts area of the GEM model is larger than observations. The score in than 1, indicating that the forecasts area of the GEM model is larger than observations. The score Northwest (V) and Xinjiang (VIII) are the highest while that in South (IV), Southwest (VI) and East in Northwest (V) and Xinjiang (VIII) are the highest while that in South (IV), Southwest (VI) and (III) are the lowest, indicating a more skillful performance in the latter regions. And the BS are almost East (III) are the lowest, indicating a more skillful performance in the latter regions. And the BS the same from moderate to extreme rainfall forecast. For different forecast periods, BS in North (II), are almost the same from moderate to extreme rainfall forecast. For different forecast periods, BS in East (III) and South (IV) are greater than 1 except for the extreme rainfall forecast in the period 0–24 North (II), East (III) and South (IV) are greater than 1 except for the extreme rainfall forecast in the h. In Northwest (V), Tibet (IX) and Southwest (VI), BS are below 1 for extreme rainfall forecast. period 0–24 h. In Northwest (V), Tibet (IX) and Southwest (VI), BS are below 1 for extreme rainfall Statistically speaking, the BS in Northwest (V) and Southwest (VI) are greater than 0.5 while that in forecast. Statistically speaking, the BS in Northwest (V) and Southwest (VI) are greater than 0.5 while Tibet is less than 0.5. In Xinjiang (VIII), the BS for heavy and extreme rainfall are less than 1. In that in Tibet is less than 0.5. In Xinjiang (VIII), the BS for heavy and extreme rainfall are less than 1. particular, BS for extreme rainfall is almost 0, suggesting that there is an obvious failed report in In particular, BS for extreme rainfall is almost 0, suggesting that there is an obvious failed report in heavy rainfall of the GEM model, especially for the extreme rainfall forecast. heavy rainfall of the GEM model, especially for the extreme rainfall forecast. 3.1.2. Spatial Spatial Distribution Distribution Characteristics Characteristics 3.1.2. In this this part, ofof RR and RMAE to In part, we we calculated calculated the thetemporal temporalaverage averageofofeach eachgrid gridaround aroundChina China and RMAE directly understand the spatial characteristics of forecast skills with a lead time of 144 h. The to directly understand the spatial characteristics of forecast skills with a lead time of 144 h. characteristics of theofforecast periods from 72–96 h and hfrom h are similar to thosetoofthose 48–72ofh The characteristics the forecast periods from 72–96 and96–120 from 96–120 h are similar and 96–120 h respectively, so only the results for the other four forecast periods are presented in this 48–72 h and 96–120 h respectively, so only the results for the other four forecast periods are presented section. in this section. We can can see see from from Figure Figure 3a 3a that that there there is is aa significant significant positive positive correlation correlation between between corresponding corresponding We observations and forecasts at different forecast periods and the R values decrease regionally with the observations and forecasts at different forecast periods and the R values decrease regionally with extension of forecast period. In particular, the the variations of Rofin central and southern parts of the extension of forecast period. In particular, variations R the in the central and southern parts Xinjiang (VIII), northern and central part of Tibet (IX) and the eastern part of Northwest (V) are of Xinjiang (VIII), northern and central part of Tibet (IX) and the eastern part of Northwest (V) are relatively small small (below (below 0.5). 0.5). In variation is more significant, significant, where where the the R R value value is is relatively In other other areas, areas, the the variation is more greater than 0.5, even up to 0.8 in some areas, for the forecast period 0–24 h and 24–48 h and less than 0.5 or even 0.2 when the lead time is longer than 72 h.

Atmosphere 2018, 9, 104

7 of 20

greater than 0.5, even up to 0.8 in some areas, for the forecast period 0–24 h and 24–48 h and less than 7 of 20 is longer than 72 h.

Atmosphere 2018, 9, xwhen FOR PEER REVIEW 0.5 or even 0.2 the lead time

Figure 3. The spatial distribution of R (correlation coefficient) between observations and the GEM Figure 3. The spatial distribution of R (correlation coefficient) between observations and the GEM forecasts forecastsfor fordifferent differentforecast forecastperiods periodsofofthe themulti-year multi-yearaverage averagefor for(a) (a)all allthe theprecipitation, precipitation,(b) (b)the the precipitation in flood season, (c) the precipitation in the non-flood season and (d) the moderate rainfall precipitation in flood season, (c) the precipitation in the non-flood season and (d) the moderate rainfall and andabove. above.P1, P1,P2, P2,P3 P3and andP6 P6here hererepresent representthe theforecast forecastperiods periodsofof0–24, 0–24,24–48, 24–48,48–72 48–72and and120–144 120–144h.h. The color of white represents negative correlation between observations and forecasts. The color of white represents negative correlation between observations and forecasts.

Comparing Figure 3b with Figure 3c, the R values between the flood season and the non-flood Comparing Figure 3b with Figure 3c, the R values between the flood season and the non-flood season in South (IV), East (III) and Southwest (VI) are similar. The values in Xinjiang (VIII), northern season in South (IV), East (III) and Southwest (VI) are similar. The values in Xinjiang (VIII), northern Tibet (IX), Inner Mongolia (VII) and northern part of Northeast (I) are from 0 to 0.2. The R in the Tibet (IX), Inner Mongolia (VII) and northern part of Northeast (I) are from 0 to 0.2. The R in the junction among Xinjiang (VIII), Northwest (V) and Inner Mongolia (VII) is almost 0 in the non-flood junction among Xinjiang (VIII), Northwest (V) and Inner Mongolia (VII) is almost 0 in the non-flood season and the proportion for 120–144 h is the smallest. In different forecast periods, the correlation season and the proportion for 120–144 h is the smallest. In different forecast periods, the correlation coefficient of flood season was higher in 24–48 h and 48–72 h than that in other periods and for the coefficient of flood season was higher in 24–48 h and 48–72 h than that in other periods and for the same forecast period, the values of flood and non-flood season were lower than that of annual same forecast period, the values of flood and non-flood season were lower than that of annual average. average. For the moderate rainfall forecast and above (Figure 3d), the R is much less than that in Figure 3a, For the moderate rainfall forecast and above (Figure 3d), the R is much less than that in Figure which is below 0.4 around China. The variation in different forecast periods is relatively milder 3a, which is below 0.4 around China. The variation in different forecast periods is relatively milder than Figure 3a–c. There are some high values of R in Xinjiang (VIII), Northwest(V), Tibet (IX) than Figure 3a–c. There are some high values of R in Xinjiang (VIII), Northwest(V), Tibet (IX) and and Inner Mongolia(VII), while R in the southwestern part of Xinjiang (VIII), northern Tibet (IX), Inner Mongolia(VII), while R in the southwestern part of Xinjiang (VIII), northern Tibet (IX), northern northern Northwest (V) and midwestern Inner Mongolia (VII) are still negative for all the forecast Northwest (V) and midwestern Inner Mongolia (VII) are still negative for all the forecast periods, periods, with insignificant differences in area proportion, indicating a relative poor correlation between with insignificant differences in area proportion, indicating a relative poor correlation between observations and forecasts in these sub-regions. observations and forecasts in these sub-regions. As shown in Figure 4, the spatial distribution for average RMAE around the year (Figure 4a) and that for the flood season (Figure 4b) are similar. In the 0–24 h forecast period, the RMAE values in Mideastern parts is below 100%. And RMAE in the flood season is slightly higher than that of the multi-year average. Meanwhile, the RMAE is about 130% when the lead time is longer than 72 h. For

Atmosphere 2018, 9, 104

8 of 20

As shown in Figure 4, the spatial distribution for average RMAE around the year (Figure 4a) and that for the flood season (Figure 4b) are similar. In the 0–24 h forecast period, the RMAE values in Atmosphere 2018,parts 9, x FOR REVIEW 20 Mideastern is PEER below 100%. And RMAE in the flood season is slightly higher than that 8ofofthe multi-year average. Meanwhile, the RMAE is about 130% when the lead time is longer than 72 h. all periods, the RMAE values in Xinjiang (VIII)(VIII) and and TibetTibet (IX) (IX) are higher thanthan 130%. In Forthe allforecast the forecast periods, the RMAE values in Xinjiang are higher 130%. particular, RMAE is up to 250% in the south of Xinjiang (VIII), slightly increasing in area with the In particular, RMAE is up to 250% in the south of Xinjiang (VIII), slightly increasing in area with the increase increasein inlead leadtime. time.And Andthe thearea areawhere wherethe thevalue valueisislower lowerthan than250% 250%in inflood floodseason seasonisissmaller smallerthan than the multi-year average. the multi-year average.

Figure Figure4.4.The Thespatial spatialdistribution distributionof ofRMAE RMAE(relative (relativemean meanabsolute absoluteerror) error)between betweenobservations observationsand and the theGEM GEMforecasts forecastsfor fordifferent differentforecast forecastperiods periodsofofthe themulti-year multi-yearaverage averagefor for(a) (a)all allthe theprecipitation, precipitation, (b) (b)the theprecipitation precipitationin inflood floodseason, season,(c) (c)the theprecipitation precipitationin innon-flood non-floodseason seasonand and(d) (d)the themoderate moderate rainfalland andabove. above.P1, P1,P2, P2,P3 P3and andP6 P6here hererepresent representthe theforecast forecastperiods periodsof of0–24, 0–24,24–48, 24–48,48–72 48–72and and rainfall 120–144h. h. 120–144

With With the the extension extensionof of the the forecast forecast period, period,the thearea area where where the the RMAE RMAE in in the the non-flood non-flood season season (Figure 4c) reaches 250% extends northeastwards to Northeast (I), westwards to the central part (Figure 4c) reaches 250% extends northeastwards to Northeast (I), westwards to the central partof of Southwest Southwest(VI), (VI),southeastwards southeastwardsto tosoutheastern southeasternNorthwest Northwest(V) (V)from frommidsouthern midsouthernpart partof ofXinjiang Xinjiang (VIII), Tibet (IX), (IX),midwest midwest Northwest (V) west and part westofpart Inner Mongolia (VII), gradually (VIII), Tibet of of Northwest (V) and InnerofMongolia (VII), gradually increasing. increasing. In the Northeast (I), North (II), East (III) and South (IV) regions, the RMAE in the non-flood In the Northeast (I),than North East and South (IV)time regions, the RMAE non-flood season is slightly lower that(II), in (a) and(III) (b) when the lead is shorter than 48 in h. the For most areas, season is slightly lower than that in (a) and (b) when the lead time is shorter than 48 h. For most areas, the value is below 50% for the forecasts 0–24 h. When the lead time is longer than 48 h, the forecast the value is below season 50% foristhe forecasts h. When the lead time longer than 48 h, the forecast skill in non-flood similar to the0–24 multi-year average and theisflood season. skill in non-flood season is similar to the multi-year average and the flood season. From Figure 4d, the RAME for moderate rainfall forecast and above is not greater than 100%, the maximum value area range increases with the extension of forecast period, still occurring in the junction of Xinjiang (VIII), Tibet (IX) and Northwest (V) and when the lead time is longer than 72 h, the RMAE in the other regions is higher than 60%. In the same forecast period, the RMAE for moderate rainfall and above is lower than that in multi-year average (a), the flood season (b) and the

Atmosphere 2018, 9, 104

9 of 20

From Figure 4d, the RAME for moderate rainfall forecast and above is not greater than 100%, the maximum value area range increases with the extension of forecast period, still occurring in the junction of Xinjiang (VIII), Tibet (IX) and Northwest (V) and when the lead time is longer than 72 h, the RMAE in the other regions is higher than 60%. In the same forecast period, the RMAE for moderate rainfall and above is lower than that in multi-year average (a), the flood season (b) and the non-flood Atmosphere 2018, 9, x FOR PEER REVIEW 9 of 20 season (c). When the lead time is shorter than 48 h, the low value range appeared in the midsouthern partsmidsouthern of Northeastparts (I), central Inner Mongolia central Northwest (V) andNorthwest midwestern of of Northeast (I), central(VII), Inner Mongolia (VII), central (V)regions and East midwestern (III). regions of East (III). 3.2. Temporal Distribution of Forecast 3.2. Temporal Distribution of ForecastSkill Skill 3.2.1.3.2.1. Monthly Variations Monthly Variations Figure 5 shows monthlyseries seriesfor forR, R, AI, AI, MBE around China. It can that that Figure 5 shows thethe monthly MBEand andRMAE RMAE around China. It be canseen be seen the monthly variation of each metricisissimilar similar for for all periods. the monthly variation of each metric all different differentforecast forecast periods.

Figure 5. The monthly average for (alphaindex), index),(c)(c) MBE (mean error) andRMAE (d) RMAE Figure 5. The monthly average for(a) (a)R, R,(b) (b) AI AI (alpha MBE (mean biasbias error) and (d) between observations and the for different differentforecast forecast periods from to 2013 between observations and theGEM GEMforecasts forecasts for periods from 20092009 to 2013 over over China. to here P6 here represent the6 6forecast forecastperiods periods from h. h. China. P1 toP1P6 represent the from0–24 0–24toto120–144 120–144

From Figure 5a, it can be seen that the monthly R decreases with the extension of the forecast From Figure 5a, it can be seen that the monthly R decreases with the extension of the forecast period and has no significant differences. The highest values are in May (lead time is shorter than 120 period has no differences. The are in (lead time is shorter than h) and and June (thesignificant forecast period is 120–144 h) highest and the values lowest ones areMay in January and December. 120 h) and June when (the forecast period is 120–144 h) and theh,lowest ones R are January and December. Statistically, the forecast period is 0–24 h and 24–48 the monthly in in May and June is above Statistically, when the forecast period is 0–24 h and 24–48 h, the monthly R in May and June is above 0.46, the highest value reaches 0.59 and 0.55 respectively, while the values in January and December 0.46, are the0.49/0.48 highest and value reaches andtwo 0.55 respectively, the values in January December 0.46/0.47. R 0.59 in these periods increasewhile from January to May and thenand gradually decrease to September andRthen slightly increase until a sharpfrom decrease in December. are 0.49/0.48 and 0.46/0.47. in these two periods increase January to May and then gradually the lead and time then is longer than increase 48 h, the until monthly R is less than 0.5, highest in February decrease When to September slightly a sharp decrease in the December. and November while the lowest in summer. For these four forecast periods, R obviously When the lead time is longer than 48 h, the monthly R is less than 0.5, the highest in increases February and from January to February and decreases from March to August and then again increases to November November while the lowest in summer. For these four forecast periods, R obviously increases from and then decreases significantly in December. The monthly variation of R indicates that the related January to February and decreases from March to August and then again increases to November and degree between forecasts and observed precipitation of the GEM model in summer is slightly worse then than decreases that insignificantly other seasons.in December. The monthly variation of R indicates that the related degree It can be seen from Figure 5b that the monthly AI is less than 1 for different forecast periods, indicating that the GEM model can predict the precipitation at the lead time of 144 h to a certain degree. In all forecast periods, AI increases with the forecast period, which suggests that the random

Atmosphere 2018, 9, 104

10 of 20

between forecasts and observed precipitation of the GEM model in summer is slightly worse than that in other seasons. It can be seen from Figure 5b that the monthly AI is less than 1 for different forecast periods, indicating that the GEM model can predict the precipitation at the lead time of 144 h to a certain degree. In all forecast periods, AI increases with the forecast period, which suggests that the random error of the model increases as well. In the same forecast period, the monthly differences are not significant. Comparatively speaking, the lowest value appears in May and the highest one is in winter. The index value in summer is a little lower than that in spring and fall, whose values are similar. It can be seen from Figure 5c that as the lead time extends, the monthly MBE increases. Except for the negative MBE (−0.14 and −0.04) in July and August for the forecast period 0–24 h, the monthly values are not less than 0, indicating that the forecasts of GEM are slightly higher than the observations. For all the forecast periods, the MBE is the highest in April and lowest in July. And when the lead time is longer than 24 h, MBE in July is below 0. The monthly series of MBE is similar to that of R, lowest in summer, highest in spring and winter is a little higher than autumn. Figure 5d shows that the RMAE is higher than 100% for all months except June and July. And the RMAE from June to September is less than 150%. For all the forecast periods, the values of RMAE increase as the lead time increases. When the lead time is shorter than 96 h, the monthly increase between two adjacent forecast periods is almost the same and the smallest increase is in the summer, about 10% to 15%, while the most obvious increase is in winter with 120%. When the lead time prolongs to 96 h, the RMAE in November obviously increases, the value of for 96–120 h and 120–144 h is 1355% and 1883% respectively. The monthly variation characteristics of RMAE are similar with that R and MBE. To further understand the temporal predictive ability of the GEM model for different precipitation categories, the TS and BS values were used. As shown in Figure 6, the change in monthly TS becomes more consistent for all forecast periods from moderate rainfall to the trace of rainfall. For the trace of rainfall and little rainfall forecast, the TS from June to August (summer) are higher than those in other months and the lowest are in January and December (winter). The values in spring and autumn are about the same. The TS of moderate rainfall forecast is the highest in May and the lowest also in January and December. The forecast skills are almost the same in spring, summer and winter, which are a little more desirable than that in winter. The monthly changes in heavy and extreme rainfall forecast are obvious. For heavy rainfall forecast, the forecast skill for different forecast periods are similar. In general, TS in summer and autumn are higher than that in winter and spring in the 0–24 h and 24–48 h. When the lead time is longer than 72 h, the monthly TS are almost the same, lower than 0.1. The monthly changes of TS of the extreme rainfall forecast do not have consistency. The value in summer for 0–24 h is around 0.1 while others are less than 0.1, which indicates that the GEM model has poor forecast ability for heavy and extreme rainfall. Figure 7 illustrates the monthly variation of multi-year average of BS. The monthly variations of BS in the trace of rainfall and little rainfall forecast are consistent with each other. Statistically, BS in the trace of rainfall in summer is less than 2, indicating that the predicted area by GEM is below twice more than the measured area. Meanwhile, BS in winter is higher than 7, with an obvious vacancy forecast. The monthly BS values for moderate rainfall are all greater than 1, while that in May to September are below 2. The monthly variations differ in different forecast periods. For the forecast period 0–24 h, the monthly change is small from 1.1 to 2.5. For the lead time from 24–96 h, the monthly series are similar. The values in January are the highest and then slightly decrease in the following two months and after that gradually decrease from April to August then slowly increase until the next January. When the lead time is longer than 96 h, the monthly change process is similar to that of 24–96 h but there is a sharp increase from November to December. For heavy rainfall, BS from June to September are about 1, which indicates that the heavy rainfall forecast area of GEM in summer is almost identical with observations. When the lead time is longer than 24 h, BS in spring and winter are higher than 1, especially in February and March for 120–144 h, indicating GEM overestimates the

Atmosphere 2018, 9, 104

11 of 20

heavy rainfall area in these two seasons when the lead time is longer than 24 h. For extreme rainfall forecasts, the monthly variations are more complicated. For 0–48 h forecasts, monthly BS are not greater than2018, 1, except April in 24–48 h forecast, suggesting that the GEM model underestimates Atmosphere 9, x FORfor PEER REVIEW 11 of 20the precipitation area. For other four forecast periods, BS in January to March are lower than 1 and that in thattoinJune Aprilare to around June are1.around BS for July to December are above 1, especially forlead the time lead time April BS for1.July to December are above 1, especially for the longer longer than 48 h, BS in November and December are up to 5. than 48 h, BS in November and December are up to 5.

Figure 6. 6. The monthly and precipitation precipitationcategories categories(a–e (a–erespect: respect: trace Figure The monthlyaverage averageTS TSfor fordifferent different periods and trace of,of, little, moderate, China of ofGEM GEMfrom from2009 2009toto2013. 2013. here little, moderate,heavy heavyand andextreme extremerainfall) rainfall) over China P1P1 to to P6P6 here represent thethe 6 forecast represent 6 forecastperiods periodsfrom from0–24 0–24to to120–144 120–144 h. h.

Figure 7. 7. The monthly and precipitation precipitationcategories categories(a–e (a–erespect: respect: trace Figure The monthlyaverage averageBS BSfor fordifferent different periods periods and trace of,of, little, moderate, heavy and extreme rainfall) over China of GEM from 2009 to 2013. P1 to P6 here little, moderate, heavy and extreme rainfall) over China of GEM from 2009 to 2013. P1 to P6 here represent thethe 6 forecast represent 6 forecastperiods periodsfrom from0–24 0–24to to120–144 120–144 h. h.

3.2.2. Forecast Skill in Flood and Non-Flood Season Due to the uneven distribution of precipitation during a year in China, the precipitation in flood season is more abundant than that in non-flood season and the corresponding weather conditions

(November to April). The multi-year average TS and BS values in flood season and non-flood season are calculated respectively to compare the effect of GEM model in these two periods. Figures 8 and 9 show the multi-year average TS and BS for different forecast periods in the nine sub-regions of flood and non-flood season from 2009 to 2013. From Figure 8, it can be seen that the TS value 2018, in the flood season is higher than that in the non-flood season, i.e., the forecast skill12 ofofthe Atmosphere 9, 104 20 GEM model in flood season is better than the non-flood season. But there are also differences in the prediction skill in different sub-regions in these two periods. In general, the TS differences of the trace 3.2.2. Forecast Flood and Non-Flood of rainfall andSkill littleinrainfall are greater thanSeason that of the other three precipitation categories. For the traceDue of rainfall and little rainfall forecast, the difference forecast skill is most significant in to the uneven distribution of precipitation duringof a year in China, thethe precipitation in flood Northwest (V),abundant Tibet (IX) than and Southwest (VI). On average, between the flood and nonseason is more that in non-flood season andthe thedifferences corresponding weather conditions are flooddifferent. season in the threethe regions are greater 0.3 for theare trace of rainfall. In two Northwest also Therefore, numerical modelthan forecast skills different in these periods.(V) In and the Tibet (IX), the differences little into rainfall above while inand Southwest the study, we divided the studyof period floodforecast season are (from May 0.2, to October) non-flood(VI), season difference istoabout 0.12. Inmulti-year Northeast average (I) and Inner Mongolia (VII), the differences trace of rainfall (November April). The TS and BS values in flood season andfor non-flood season are calculated both aboutrespectively 0.25, while the differences the little rainfall are in 0.07–0.13 andperiods. 0.09–0.16, respectively. are to compare theofeffect of GEM model these two The difference in South (IV) formulti-year trace of rainfall is about anddifferent almost the same periods for littlein rainfall. In Figures 8 and 9 show the average TS and0.15 BS for forecast the nine East (III), theofprediction is similar for from trace 2009 of rainfall andFrom littleFigure rainfall8,forecast. sub-regions flood and skill non-flood season to 2013. it can be For seenmoderate that the andvalue above forecast skill between allthe regions is similar between these two periods TS inforecasts, the floodthe season is higher than thatinin non-flood season, i.e., the forecast skill of and the there model is little in forecast ability for heavythan rainfall. GEM flood season is better the non-flood season. But there are also differences in the From skill Figure 9, it can be seen that the in flood season isInlower thanthe that non-flood of season and prediction in different sub-regions in BS these two periods. general, TSindifferences the trace therainfall difference these twoare periods forthan the trace ofthe rainfall higher than other precipitation categories. of and for little rainfall greater that of otheristhree precipitation categories. For the trace In rainfall particular, trace of rainfall and little rainfall forecast, the difference Northwest (V) in is the highest. of andfor little rainfall forecast, the difference of forecast skill is thein most significant Northwest Next comes the difference in the Inner Mongolia (VII) and Tibet (IX), followed by that in the Northeast (V), Tibet (IX) and Southwest (VI). On average, the differences between the flood and non-flood season (I) the andthree Southwest (VI), 20. In North South (IV)ofand Xinjiang the difference about(IX), 10, in regions areabout greater than 0.3 (II), for the trace rainfall. In(VIII), Northwest (V) andisTibet while the difference in rainfall Xinjiangforecast (VIII) isare obviously higher that of the(VI), otherthe two sub-regions. The the differences of little above 0.2, whilethan in Southwest difference is about differences in East (III) for all precipitation categories the least among the of nine regions, from 0.12. In Northeast (I) and Inner Mongolia (VII), theare differences for trace rainfall areranging both about 3.6 towhile 10.6. For rainfall Xinjiang as well as0.09–0.16, the extreme rainfall forecast in all sub0.25, the heavy differences of forecast the littleinrainfall are(VIII) 0.07–0.13 and respectively. The difference regions fortrace the North (II), East (III) and (IV), the BSsame valuefor is less 1, suggesting that in Southexcept (IV) for of rainfall is about 0.15South and almost the littlethan rainfall. In East (III), missing report skill also occurs. Meanwhile, therainfall BS values Innerrainfall Mongolia (VII), Northwest (V),and Southwest the prediction is similar for trace of andinlittle forecast. For moderate above (VI) and Xinjiang (VIII)skill for between non-floodinseason are about 0, suggesting thetwo GEM modeland canthere hardly forecasts, the forecast all regions is similar betweenthat these periods is predict the extreme rainfall in these regions all the year around. little forecast ability for heavy rainfall.

Figure Figure 8. 8. The The average average TS TS value value of of nine nine sub-regions sub-regions over over China China in in (a) (a) flood flood and and (b) (b) non-flood non-flood season season for from 2009 to to 2013 of of GEM forecasts. TheThe (I) (I) to for different differentforecast forecastperiods periodsand andprecipitation precipitationcategories categories from 2009 2013 GEM forecasts. (IX) here represent Northeast, North, East, South, Northwest, Southwest, Inner Mongolia, Xinjiang and Tibet respectively and the P1 to P6 here represent the 6 forecast periods from 0–24 to 120–144 h.

Atmosphere 2018, 9, x FOR PEER REVIEW

13 of 20

to (IX)9,here Atmosphere 2018, 104 represent Northeast, North, East, South, Northwest, Southwest, Inner Mongolia, Xinjiang and Tibet respectively and the P1 to P6 here represent the 6 forecast periods from 0–24 to 120–144 h.

13 of 20

Figure 9. average The average BS value ninesub-regions sub-regions over over China and (b)(b) non-flood season Figure 9. The BS value of of nine Chinainin(a) (a)flood flood and non-flood season for different forecast periods and precipitation categories from 2009 to 2013 of GEM forecasts. The (I)(I) to for different forecast periods and precipitation categories from 2009 to 2013 of GEM forecasts. The to (IX) here represent Northeast, North, East, South, Northwest, Southwest, Inner Mongolia, Xinjiang (IX) here represent Northeast, North, East, South, Northwest, Southwest, Inner Mongolia, Xinjiang and Tibet respectively theP6P1here to P6represent here represent 6 forecast periods from 0–24 120–144h. h. Tibet and respectively and the and P1 to the 6 the forecast periods from 0–24 toto 120–144

4. Discussion

From Figure 9, it can be seen that the BS in flood season is lower than that in non-flood season 4.1. difference Performance for Comparison withperiods GFS and the these two for the trace of rainfall is higher than other precipitation categories.The Inmedium-range particular, formodel, trace of rainfall little rainfall the difference in Northwest GEM used and in the study, has a forecast, higher resolution than many other (V) ismodels the highest. Next comes the difference in the Inner on Mongolia (VII) andof Tibet followed and the model developers have been working the improvement the (IX), forecast skill. by that in the Northeast (I) and Southwest (VI),that about 20. In North (II),not South (IV) andmore Xinjiang (VIII), Nevertheless, previous studies have shown high resolution does always mean skillful forecasts. is The resolution in thethe study is muchin higher than (VIII) that inis[24]. They compared the forecast the difference about 10, while difference Xinjiang obviously higher than that of the the new GEMThe with the resolution of 33(III) km with that of the old GEM model with resolution other skill twoof sub-regions. differences in East for all precipitation categories arethe the least among of 100 km. The results show that GEM with higher resolution can better agree with the observations the nine regions, ranging from 3.6 to 10.6. For heavy rainfall forecast in Xinjiang (VIII) as well as the over North America. They analyzed the ETS and BS of the GEM model with the lead time of 120 h extreme rainfall forecast in all sub-regions except for the North (II), East (III) and South (IV), the BS for different precipitation categories, using the 0–24 h, 48–72 h and 96–120 h accumulative value is less than 1, suggesting that missing report also occurs. Meanwhile, the BS values in Inner precipitation. MongoliaTo (VII), Northwest Southwest (VI) andperforms Xinjiangbetter (VIII)infor non-flood season are about further recognize(V), whether the GEM model China at a higher resolution, 0, suggesting that the the GEM model predict theand extreme rainfall in these10regions all the we also compared skills using can ETS hardly and BS over China North America. Figure shows the year around. ETS and BS over China for different precipitation categories. The comparison of these two scores demonstrates that the ETS over China is similar to that over North America, while the BS over China

4. Discussion is much higher than that in North America. Therefore, the degrees of coincidence between forecasts and observations are similar in China and North America but the vacancy rate of the model in China

4.1. Performance Comparison withmodel GFS of 33 km in North America. It is perhaps caused by the is much higher than the

parameterization of GEM, which is necessary propose in China. The medium-range model, GEM used intothe study,improvement has a higher resolution than many other In the study, the comparison between the forecast skill of GEM and GFS, with the resolution of models and the model developers have been working on the improvement of the forecast skill. 1°, was also conducted for the lead time of 96 h in 2011 to further investigate the GEM forecast skill. Nevertheless, previous studies have shown that high resolution does not always mean more skillful Figure 11 shows the TS and BS for GEM and GFS. In general, the forecast skills for the two models forecasts. The resolution in the study is much higher than that in [24]. They compared the forecast are almost similar as well as some differences for TS and the BS values for these two models are skill of the new GEM with the resolution of 33 km with that of the old GEM model with the resolution different. For TS, we can see that for the forecast period 0–24 h, the GEM model is a little better than of 100 km. The results show that GEM with higher resolution can better agree with the observations over North America. They analyzed the ETS and BS of the GEM model with the lead time of 120 h for different precipitation categories, using the 0–24 h, 48–72 h and 96–120 h accumulative precipitation. To further recognize whether the GEM model performs better in China at a higher resolution, we also compared the skills using ETS and BS over China and North America. Figure 10 shows the ETS and BS over China for different precipitation categories. The comparison of these two scores

Atmosphere 2018, 9, 104

14 of 20

demonstrates that the ETS over China is similar to that over North America, while the BS over China is much higher than that in North America. Therefore, the degrees of coincidence between forecasts and observations are similar in China and North America but the vacancy rate of the model in China is much higher than the model of 33 km in North America. It is perhaps caused by the parameterization of GEM, which is necessary to propose improvement in China. In the study, the comparison between the forecast skill of GEM and GFS, with the resolution of 1◦ , was also conducted for the lead time of 96 h in 2011 to further investigate the GEM forecast skill. Figure 11 shows the TS and BS for GEM and GFS. In general, the forecast skills for the two models are almost similar as well as some differences for TS and the BS values for these two models 14 are different. Atmosphere 2018, 9, x FOR PEER REVIEW of 20 Atmosphere 2018, 9, x FOR PEERforecast REVIEW period 0–24 h, the GEM model is a little better than GFS. 14 of 20For the For TS, we can see that for the GFS. For the other periods,isthe GEM model is slightly skillfulrainfall in moderate rainfall other forecast periods, theforecast GEM model slightly more skillful in more moderate forecast and above, GFS. For the other forecast periods, the GEM model is slightly more skillful in moderate rainfall forecast and above, while in trace of rainfall and little rainfall forecast, the GFS model is a little better. while inforecast trace ofand rainfall and little rainfall forecast, the GFS model is a little better. For the BS score, above, while in trace of rainfall and little rainfall forecast, the GFS model is a little better. For the BS score, GFS is much better than GEM. There are many vacancy forecasts for both models GFS is much better than GEM. There are many vacancy forecasts for both models but the forecasts For the BS score, GFS is much better than GEM. There are many vacancy forecasts for both models area but the forecasts area of GEM is much larger than that of GFS. Therefore, compared to the GFS model, but the forecasts ofthat GEMof is GFS. much Therefore, larger than that of GFS. Therefore, compared to the GEM GFS model, of GEM the is much largerisarea than compared to thethe GFS model, model is of GEM model of similar consistency with the observations while forecast areathe is dramatically the GEM model is ofthe similar consistencywhile with the observations while the forecast arealarger. is dramatically similar consistency with observations the forecast area is dramatically larger. Thelarger. investigation into into the the spatial distribution R and RAME, shown in Figures 12 The investigation spatial distribution differences differences ofof R and RAME, shown in Figures 12 The investigation into the spatial distribution differences of R and RAME, shown in Figures 12 13, between GEM and GFS were alsoconducted. conducted. The difference is defined as theasmulti-year and 13, and between GEM and GFS were also The difference is defined the multi-year and 13, between GEM and GFS were also conducted. The difference is defined as the multi-year R/RMAE of GEM minus thatofofGFS GFS of each grid. From Figure 12 we that these average average R/RMAE valuevalue of GEM minus that grid. From Figure 12can wesee can see that these average R/RMAE value of GEM minus that of GFSof of each each grid. From Figure 12 we can see that these two models have similar forecast skill over most parts of China for all the forecast periods. The area two models have similar forecast skillskill over most Chinafor for forecast periods. The area two models have similar forecast over mostparts parts of China allall thethe forecast periods. The area where the R of GEM is greater than that of GFS decreases with the increase of lead time. The obvious where the R of is GEM is greater than that of GFSdecreases decreases with the increase of lead time.time. The obvious where the R of GEM greater than that of GFS with the increase of lead The differences occur around Inner Mongolia (VII), Northwest (V), mid-Xinjiang (VIII) and Northernobvious differencesaround occur around Inner Mongolia (VII), Northwest mid-Xinjiang(VIII) (VIII) and and Northern Tibet differences Inner Mongolia (VII), (V),(V), mid-Xinjiang Tibetoccur (IX). But for the RMAE (Figure 13), the Northwest values of GEM are much higher than that of Northern GFS and Tibet (IX). But for the RMAE (Figure 13), the values of GEM are much higher than that of GFS and lead time, area of higher value are of GEM than GFS increases dramatically. The (IX). Butwith for the theincrease RMAE in (Figure 13),the the values of GEM much higher than that of GFS and with the with the increase in lead time, the area of higher value of GEM than GFS increases dramatically. The value for the the difference is in thevalue southeastern Tibet (IX)GFS and increases the junctiondramatically. between Southwest increaselowest in lead time, area of higher of GEM than The lowest value for the difference is in the southeastern Tibet (IX) and the junction between Southwest lowest (VI), South (IV) and East (III). Therefore, for the forecasts of 0–24 h, the GEM model has a more value for(VI), theSouth difference is East in the southeastern (IX) and the junction between Southwest (IV) and (III). Therefore, for Tibet the forecasts of 0–24 h, the GEM model has a more (VI), desirable forecast skill than the GFS model but with the extension of lead time, the GEM’s skill desirable forecast skill than the for GFSthe model but with the extension of lead time, has the GEM’s skill South (IV) and East (III). Therefore, forecasts of 0–24 h, the GEM model a more desirable decreases. forecast decreases. skill than the GFS model but with the extension of lead time, the GEM’s skill decreases.

Figure 10. The results of (a) ETS (equitable threat score) and (b) BS over China of the different Figure 10. The10.results of (a) (equitable and(b)(b) over China the different Figure The results of ETS (a) ETS (equitablethreat threat score) score) and BSBS over China of theofdifferent precipitation categories of GEM. P1 to P6 here represent the 6 forecast periods from 0–24 to 120–144 h. precipitation categories of GEM. P1 to here the6 6forecast forecast periods 0–24 to 120–144 h. precipitation categories of GEM. P1 P6 to P6 hererepresent represent the periods fromfrom 0–24 to 120–144 h.

Figure 11. The comparison of (a) TS and (b) BS of GEM and GFS (Global Forecast System) for the lead

Figure 11. The comparison (a) and TS and (b) BSofofGEM GEM and Forecast System) for thefor lead Figure 11. comparison (a)ofrepresent TS (b) andGFS GFS(Global (Global Forecast System) the lead timeThe of 0–96 h. P1 to P4of here the BS 4 forecast periods from 0–24 to 72–96 h. time of 0–96 h. P1 to P4 here represent the 4 forecast periods from 0–24 to 72–96 h. time of 0–96 h. P1 to P4 here represent the 4 forecast periods from 0–24 to 72–96 h.

Atmosphere 2018, 9, 104 Atmosphere 2018, 9, x FOR PEER REVIEW Atmosphere 2018, 9, x FOR PEER REVIEW

15 of 20 15 of 20 15 of 20

Figure 12. The multi-year average spatial distribution of differences of R between GEM and GFS Figure 12. The multi-year average spatial distribution of differences of R between GEM and GFS Figure 12. multi-year average of hdifferences ofhere R between GEM GFS forecasts forThe different forecast periodspatial for the distribution lead time of 96 in 2011 (a–d represent the 4and forecast forecasts for different forecast period for the lead time of 96 h in 2011 (a–d here represent the 4 forecast forecasts for different forecast period for the lead time of 96 h in 2011 (a–d here represent the 4 forecast periods from 0–24 to 72–96 h). periods from 0–24 to 72–96 h). periods from 0–24 to 72–96 h).

Figure 13. The multi-year average spatial distribution of differences of RMAE between GEM and GFS Figure 13. The multi-year average spatial distribution of 96 differences RMAE GEM GFS forecasts forecast period for the lead time of h in 2011 of (a–d here between represent the 4and forecast Figure 13. for Thedifferent multi-year average spatial distribution of differences of RMAE between GEM and GFS forecasts for different forecast period for the lead time of 96 h in 2011 (a–d here represent the 4 forecast periods for from 0–24 toforecast 72–96 h).period for the lead time of 96 h in 2011 (a–d here represent the 4 forecast forecasts different periods from 0–24 to 72–96 h). periods from 0–24 to 72–96 h).

Figure 14 shows the RMAE differences between GEM and GFS in nine sub-regions, to directly Figure 14 shows the RMAE differences GEM and GFS in(except nine sub-regions, to directly obtain the spatial average difference. We canbetween see that in 8 sub-regions Tibet (IX)), the RMAE obtain the spatial average difference. We can see that in 8 sub-regions (except Tibet (IX)), the RMAE

Atmosphere 2018, 9, 104

16 of 20

Atmosphere 2018, x FOR PEER REVIEWdifferences Figure 149,shows the RMAE

16 of 20 between GEM and GFS in nine sub-regions, to directly obtain the spatial average difference. We can see that in 8 sub-regions (except Tibet (IX)), the RMAE differencesrange rangefrom from−−40% and 40%, 40%, especially especiallyin inEast East(III), (III),the thedifference differencevalues valuesare are−−13.6%, −4.1%, differences 40% and 13.6%, − 4.1%, 0 and 4.2% for four forecast periods, respectively; while in Tibet (IX), RMAE of GEM is up to 80% 0 and 4.2% for four forecast periods, respectively; while in Tibet (IX), RMAE of GEM is up to 80% larger larger than that of GFS in the forecast period 48–72 h. Meanwhile, for forecast period 0–24 h, the than that of GFS in the forecast period 48–72 h. Meanwhile, for forecast period 0–24 h, the RMAE RMAE GEM is than smaller than GFS. In South (IV), Inner Mongolia and(VIII), Xinjiang (VIII), GEM of GEMof is smaller GFS. In South (IV), Inner Mongolia (VII) and (VII) Xinjiang GEM performs performs better in the lead time of 0–96 h and in East (III), GEM is more skillful except in 72–96 h. We better in the lead time of 0–96 h and in East (III), GEM is more skillful except in 72–96 h. We also also conducted the regionally differences ofnine R in sub-regions nine sub-regions (results are not shown conducted the regionally differences of R in (results are not shown here),here), almostalmost all of all of the differences are 0, which indicates that both of the forecasts of these two models correspond the differences are 0, which indicates that both of the forecasts of these two models correspond well well the withobservations the observations regionally. with regionally.

Figure 14. 14. The Theregional regionaldifference differenceofofRMAE RMAE nine sub-regions between GEM to Figure in in nine sub-regions between GEM andand GFS.GFS. The The (I) to(I)(IX) (IX) here represent Northeast, North, South, Northwest, Southwest, Mongolia, Xinjiang here represent Northeast, North, East, East, South, Northwest, Southwest, InnerInner Mongolia, Xinjiang and and Tibet respectively andP1the P4 represent here represent 4 forecast periods from to 0–24 to 72–96 h. Tibet respectively and the toP1 P4 to here the 4 the forecast periods from 0–24 72–96 h.

4.2. Overview Overview 4.2. We can can see see from from the the results results that that the the GEM GEM model model performs performs better better in in East East and and Central Central China, China, We especially for the flood season in South (IV), East (III) and North (I). However, in Xinjiang (VIII), Inner especially for the flood season in South (IV), East (III) and North (I). However, in Xinjiang (VIII), Inner Mongolia(VII) (VII)and andTibet Tibet (IX), forecast ability is a little On the one during hand, during flood Mongolia (IX), thethe forecast ability is a little poor.poor. On the one hand, flood season, season, more precipitation by occurs monsoon occurs with compared withseason, non-flood season, the more precipitation caused bycaused monsoon compared non-flood the precipitation precipitationisdistribution is in maybe morethe in parameterization line with the parameterization So as to the distribution maybe more line with of the model.ofSothe as model. to the comparison comparison between south theOn inland area.hand, On the hand, the greater precipitation between south coast and the coast inlandand area. the other theother greater precipitation variability [51] variability [51] in Xinjiang (VIII), Inner Mongolia (VII) and Tibet (IX) may cause that the dynamic in Xinjiang (VIII), Inner Mongolia (VII) and Tibet (IX) may cause that the dynamic model and model and parameterization of always GEM cannot always reflect the realistic atmospheric motions. parameterization of GEM cannot fully reflect thefully realistic atmospheric motions. Meanwhile, one Meanwhile, onefor of the the model reasonsperforms for the model East, South China and Central China is of the reasons betterperforms in East, better South inand Central is topography. topography. et al.out [52]that pointed out that the ofdistribution precipitation is affected by Ren et al. [52]Ren pointed the complexity of complexity precipitation is distribution affected by topography to topography to some degree and it is not easy to reflect the high-latitude area atmospheric motions. some degree and it is not easy to reflect the high-latitude area atmospheric motions. Another reason is Another reason observations is that the gridded observations used better in thein study better in East, that the gridded we used in the study we perform East,perform South and Central ChinaSouth [28], andthere Central Chinastations [28], for moreproviding stations ina more these convincing regions, providing a more convincing for are more inthere these are regions, observation, while in other observation, while in other regions, the observations are not insufficient. And the results are regions, the observations are not insufficient. And the results are coincident with previous research coincident with previous research using other NWPs both at home and abroad [15–19,39–42,50]. For using other NWPs both at home and abroad [15–19,39–42,50]. For different categories, the forecast different categories, the forecast skills forare heavy and storm are worse thanprecipitation. that of trace skills for heavy and storm precipitation worse than thatprecipitation of trace of rainfall or little of rainfall reason that occurrence of heavy storm rainfall related The reasonorislittle that precipitation. occurrence ofThe heavy and isstorm rainfall is related to and mesoscale weatheris system to mesoscale weather system and the mechanism and parameterization cannot reflect the real and the mechanism and parameterization cannot reflect the real distribution information effectively. distribution information effectively. The results are also coincident with that in Canada [24]. And the The results are also coincident with that in Canada [24]. And the relatively poor forecast performance relatively poorstorm forecast performance for heavy andinstorm a common drawback in NWP for heavy and rainfall is a common drawback NWPrainfall modelsis[42]. models [42]. Mcbride and Ebert [53] pointed out that the precipitation threshold used affected the verification of forecast skill. Though the precipitation categories used in the study are with lower and upper threshold and the forecast is more detailed and strict compared with the previous studies, the forecast skill of the GEM model is similar to or better than that of other models [18,54], which fully explains

Atmosphere 2018, 9, 104

17 of 20

Mcbride and Ebert [53] pointed out that the precipitation threshold used affected the verification of forecast skill. Though the precipitation categories used in the study are with lower and upper threshold and the forecast is more detailed and strict compared with the previous studies, the forecast skill of the GEM model is similar to or better than that of other models [18,54], which fully explains the application of the GEM model in China. However, the GEM model with a higher resolution than GFS does not present a much better performance instead, which motivates an in-depth comparison between the mechanisms of GEM and GFS, to further understand the reasons and then propose the method to improve the forecast skill of GEM with high-resolution. Meanwhile, in the study, we only focused on the forecast skill of the GEM model, paying no attention to the investigation into the inner mechanism. The dynamic model and parameterization of GEM, affected by precipitation mechanism, topography as well as precipitation scale, are key factors in model performance. Therefore, more improvement needs to be proposed in the future study, such as the characteristics of performance variability, the reasons for the difference of the performance variability regionally, the relationship between precipitation variability and performance variability, etc. The results we obtained in the study is a solid foundation to adopt the GEM model in China. For example, in terms of hydrological prediction, we can use the forecasts, coupled with hydrological models to predict extreme hydrological events, such as floods and droughts. Wu has conducted a similar research and it shown that the flood forecasts is performed both in meteorological and discharge prediction [22]. Besides, it is also a tool to use GEM in soil moisture detection and prediction to help make a more convincing strategy in ahead of time. However, we also need to pay more attention to the improvement of forecasts for the forecast skill does not always enhance with the increasing of resolution. And how about the prediction performance with the increasing resolution as well in the next coming steps. 5. Conclusions In recent years, with the development of NWP, the precipitation forecast skill also gradually increases, which is beneficial to prolong the lead time of quantitative precipitation forecasts. The study is to evaluate the forecast skill of the medium-range GEM model with high resolution in China, with the lead time of 144 h. Based on the 24 h accumulative observations, the study explored the temporal and spatial skill of the model, as well as for the flood and non-flood season. The results show that the forecast skill divers obviously temporally and spatially. Generally, with the increase in precipitation categories and lead time, the forecast ability of the GEM model declines dramatically. The GEM model often provides a larger precipitation area than observations, especially estimating whether precipitation occurs or there is little rainfall. And in spatial, the forecast skill decreases northwestwards from Southeast China. In detail, for the trace of rainfall and little rainfall, the forecast skill in Southwest and East varies more obviously with lead time than other sub-regions, especially for Xinjiang. For moderate rainfall and above, the forecast skill is poor to some degree, especially in western China, such as Xinjiang and Tibet. For different forecast periods, the model is relatively more skillful for 0–96 h lead time forecast in Northeast, North, East and South and for 96–144 lead time, the skill is not ideally enough. Meanwhile, the forecast skill in flood season (summer and autumn) is better than that in non-flood season (spring and winter). However, the performance of these two periods in South, East and Southwest are similar. The monthly variance tendency for each forecast period are similar and the variance of performance difference is of significance with rainfall area bias. The results here indicate it is appropriate to use the GEM model in China to some degree, both in weather forecast and meteorological and hydrological prediction in the future. Acknowledgments: This work is supported by the National Key Research and Development Project (2017YFC1502403), the National Science Foundation of China (Grant Nos. 51579065, 51779071), the Fundamental Research Funds for the Central Universities (Grant No. 2017B10514), Central Public Welfare Operations for Basic

Atmosphere 2018, 9, 104

18 of 20

Scientific Research of Research Institute (Grant No. HKY-JBYW-2017-12) and Water Science and Technology Project of Jiangsu Province (2017007). Author Contributions: H.X. and Z.W. conceived and designed the experiments; H.X. performed the experiments; H.X. and H.H. analyzed the data; Z.W. and L.L. contributed reagents/materials/analysis tools; H.X. and L.L. wrote the paper. All authors read and approved the final manuscript. Conflicts of Interest: The authors declare that they have no conflict of interests. The founding sponsors had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript and in the decision to publish the results.

References 1. 2. 3.

4.

5. 6. 7. 8. 9. 10. 11. 12. 13. 14. 15. 16. 17. 18.

19.

Krzysztofowicz, R. The case for probabilistic forecasting in hydrology. J. Hydrol. 2001, 249, 2–9. [CrossRef] Biondi, D.; Luca, D.L.D. Performance assessment of a Bayesian Forecasting System (BFS) for real-time flood forecasting. J. Hydrol. 2013, 479, 51–63. [CrossRef] Liu, D.; Li, X.; Guo, S.; Dan, R.; Chen, H. Using a Bayesian Probabilistic Forecasting Model to Analyze the Uncertainty in Real-Time Dynamic Control of the Flood Limiting Water Level for Reservoir Operation. J. Hydrol. Eng. 2015, 20, 4014036. [CrossRef] Ciupak, M.; Ozga-Zielinski, B.; Adamowski, J.; Quilty, J.; Khalil, B. The application of Dynamic Linear Bayesian Models in hydrological forecasting: Varying Coefficient Regression and Discount Weighted Regression. J. Hydrol. 2015, 530, 762–784. [CrossRef] Reggiani, P.; Weerts, A.H. Probabilistic Quantitative Precipitation Forecast for Flood Prediction: An Application. J. Hydrometeorol. 2008, 9, 76–95. [CrossRef] Wilks, D.S. Statistical Methods in the Atmospheric Sciences; Elsevier: Amsterdam, The Netherlands, 2005; pp. 183–197, ISBN 978-0-12-751966-1. French, M.N.; Bras, R.L.; Krajewski, W.F. A Monte Carlo study of rainfall forecasting with a stochastic model. Stoch. Hydrol. Hydraul. 1992, 6, 27–45. [CrossRef] Versace, P.; Sirangelo, B.; Luca, D.L.D. A space-time generator for rainfall nowcasting: the PRAISEST model. Hydrol. Earth Syst. Sci. Discuss. 2009, 13, 441–452. [CrossRef] Metta, S.; Hardenberg, J.V.; Ferraris, L.; Rebora, N.; Provenzale, A. Precipitation nowcasting by a spectral-based nonlinear stochastic model. J. Hydrometeorol. 2010, 10, 1285–1297. [CrossRef] Nicholls, N.; Dyer, T.G.J. Comment on the paper ‘On the application of some stochastic models to precipitation forecasting’ by T. G. J. Dyer. Q. J. R. Meteorol. Soc. 2010, 103, 177–189. [CrossRef] Laio, F.; Tamea, S. Verification tools for probabilistic forecasts of continuous hydrological variables. Hydrol. Earth Syst. Sci. 2007, 11, 1267–1277. [CrossRef] Kumar, P.; Kishtawal, C.M.; Pal, P.K. Skill of regional and global model forecast over Indian region. Theor. Appl. Climatol. 2016, 123, 629–636. [CrossRef] Mass, C.F.; Ovens, D.; Westrick, K.; Colle, B.A. Does Increasing Horizontal Resolution Produce More Skillful Forecasts? Bull. Am. Meteorol. Soc. 2002, 83, 407–430. [CrossRef] Roberts, N. Assessing the spatial and temporal variation in the skill of precipitation forecasts from an NWP model. Meteorol. Appl. 2008, 15, 163–169. [CrossRef] Chakraborty, A. The Skill of ECMWF Medium-Range Forecasts during the Year of Tropical Convection 2008. Mon. Weather Rev. 2010, 138, 3787–3805. [CrossRef] Durai, V.R.; Bhowmik, S.K.R. Prediction of Indian summer monsoon in short to medium range time scale with high resolution global forecast system (GFS) T574 and T382. Clim. Dyn. 2013, 42, 339. [CrossRef] Pan, L.J.; Xue, C.F.; Zhang, H.F.; Wang, J.P.; Yao, J. Comparison of Three Verification Methods for High-Resolution Grid Precipitation Forecast. Clim. Environ. Res. 2017. (In Chinese) Meng, Y.J.; Wu, H.B.; Wang, L.; Zhang, P.P. Evaluation of Quantitative Precipitation Estimation of Numerical Weather Prediction Models in Wuhan Region during Main Flood Season of 2007. Torrential Rain Disasters 2008, 12, 73–77. [CrossRef] Novak, D.R.; Bailey, C.; Brill, K.F.; Burke, P.; Hogsett, W.A.; Rausch, R.; Schichtel, M. Precipitation and Temperature Forecast Performance at the Weather Prediction Center. Weather Forecast. 2014, 29, 489–504. [CrossRef]

Atmosphere 2018, 9, 104

20.

21. 22. 23.

24.

25. 26.

27. 28. 29. 30. 31. 32. 33. 34. 35.

36.

37. 38. 39. 40. 41. 42.

19 of 20

Liu, Y.; Duan, Q.Y.; Zhao, L.N.; Ye, A.Z.; Tao, Y.M.; Miao, C.Y.; Mu, X.; Schaake, J.C. Evaluating the predictive skill of post-processed NCEP GFS ensemble precipitation forecasts in China’s Huai river basin. Hydrol. Process. 2013, 27, 57–74. [CrossRef] Cote, J. The operational CMC-MRB Global Environmental Multiscale (GEM) Model. Part I. Design consideration and formulation. Mon. Weather Rev. 1998, 126, 1373–1395. [CrossRef] Wu, J.; Lu, G.H.; Wu, Z.Y. Flood forecasts based on multi-model ensemble precipitation forecasting using a coupled atmospheric-hydrological modeling system. Nat. Hazards 2014, 74, 325–340. [CrossRef] Markovic, M.; Lin, H.; Winger, K. Simulating Global and North American Climate Using the Global Environmental Multiscale Model with a Variable-Resolution Modeling Approach. Mon. Weather Rev. 2010, 138, 3967–3987. [CrossRef] Bélair, S.; Roch, M.; Leduc, A.M.; Vaillancourt, P.A.; Laroche, S.; Mailhot, J. Medium-Range Quantitative Precipitation Forecasts from Canada’s New 33-km Deterministic Global Operational System. Weather Forecast. 2009, 24, 690. [CrossRef] Zadra, A.; Caya, D.; Côté, J.; Dugas, B.; Jones, C.; Laprise, R.; Winger, K.; Caron, L.P. The next Canadian Regional Climate Model. Phys. Can. 2008, 64, 75–83. Erfani, A.; Méthot, A.; Goodson, R.; Bélair, S.; Yeh, K.S.; Côté, J.; Moffet, R. Synoptic and mesoscale study of a severe convective outbreak with the nonhydrostatic Global Environmental Multiscale (GEM) model. Meteorol. Atmos. Phys. 2003, 82, 31–53. [CrossRef] China Meteorological Scientific Data Sharing Website. Available online: http://data.cma.cn/ (accessed on 6 June 2014). Shen, Y.; Xiong, A.Y. Validation and comparison of a new gauge-based precipitation analysis over mainland China. Int. J. Climatol. 2016, 36, 252–265. [CrossRef] GDPS data in GRIB2 format: 25 km. Available online: https://weather.gc.ca/grib/grib2_glb_25km_e.html (accessed on 31 December 2013). Zhu, Z.W.; Li, T. Extended-range forecasting of Chinese summer surface air temperature and heat waves. Clim. Dyn. 2017, 1–15. [CrossRef] Zhu, Z.W.; Li, T.; Bai, L.; Gao, J.Y. Extended-range forecast for the temporal distribution of clustering tropical cyclogenesis over the western North Pacific. Theor. Appl. Climatol. 2016, 130, 1–13. [CrossRef] Zhu, Z.W.; Li, T. Statistical extended-range forecast of winter surface air temperature and extremely cold days over China. Q. J. R. Meteorol. Soc. 2017, 143, 1528–1538. [CrossRef] Zhu, Z.W.; Li, T.; Hsu, P.C.; He, J.H. A spatial-temporal projection model for extended-range forecast in the tropics. Clim. Dyn. 2015, 45, 1085–1098. [CrossRef] Zhu, Z.W.; Li, T. The statistical extended-range (10–30-day) forecast of summer rainfall anomalies over the entire China. Clim. Dyn. 2016, 48, 1–16. [CrossRef] Zhu, Z.W.; Chen, S.J.; Yuan, K.; Chen, Y.N.; Gao, S.; Hua, Z.F. Empirical Subseasonal Prediction of Summer Rainfall Anomalies over the Middle and Lower Reaches of the Yangtze River Basin Based on Atmospheric Intraseasonal Oscillation. Atmosphere 2017, 8, 185. [CrossRef] Office of State Flood Control and Drought Relief Headquarters and Nanjing Institute of Hydrology and Water Resources. China Flood and Drought Disaster; China Water Power Press: Beijing, China, 1997; p. 569, ISBN 7-80124-422-2. (In Chinese) Wu, Z.Y.; Lu, G.H.; Wen, L.; Lin, C.A. Reconstructing and analyzing China’s fifty-nine year (1951–2009) drought history using hydrological model simulation. Hydrol. Earth Syst. Sci. 2011, 8, 2881–2894. [CrossRef] Zhang, Z.M.; Zhang, C.C.; Hou, H.Y.; Wang, Y.Z. Application of Grade of Precipitation (GB/T 28592-2012). Modern Agric. Sci. Technol. 2016, 19, 223. (In Chinese) Lian, L.; Lin, K.P.; Huang, H.H. Comparison on Precipitation Forecast Effects of Several Numerical Prediction Models in Guangxi. J. Meteorol. Res. Appl. 2012. [CrossRef] Zhao, N.K.; Wan, S.Y.; Qi, M.H. Precipitation verification of three models during rainy season in Yunnan province. J. Meteorol. Environ. 2015, 5, 39–44. (In Chinese) Gao, S.Y.; Sun, L.Q. Verification of JMH0 s NWP on Rainfall Forecast in Dandong, Liaoning Province. Meteorological 2006, 32, 79–83. [CrossRef] Dong, Y.; Liu, S.D.; Wang, D.H.; Zhao, Y.F. Assessment on Forecasting Skills of GFS Model for Two Persistent Rainfalls over Southern China. Meteorol. Mon. 2015. [CrossRef]

Atmosphere 2018, 9, 104

43. 44. 45. 46. 47. 48.

49. 50. 51. 52. 53. 54.

20 of 20

Colle, B.A.; Mass, C.F.; Westrick, K.J. MM5 Precipitation Verification over the Pacific Northwest during the 1997 99 Cool Seasons. Weather Forecast. 2000, 15, 730–744. [CrossRef] Eerola, K. Twenty-One Years of Verification from the HIRLAM NWP System. Weather Forecast. 2012, 28, 270–285. [CrossRef] White, B.G.; Paegle, J.; Steenburgh, W.J.; Horel, J.D.; Swanson, R.T.; Cook, L.K.; Onton, D.J.; Miles, J.G. Short-Term Forecast Validation of Six Models. Weather Forecast. 1998, 14, 84–108. [CrossRef] Koh, T.Y.; Ng, J.S. Improved diagnostics for NWP verification in the tropics. J. Geophys. Res. Atmos. 2009, 114, 1192. [CrossRef] Wang, Y.; Yan, Z.H. Effect of Different Verification Schemes on Precipitation Verification and Assessment Conclusion. Meteorol. Mon. 2007, 33, 53–61. [CrossRef] Hayashi, S.; Aranami, K.; Saito, K. Statistical Verification of Short Term NWP by NHM and WRF-ARW with 20 km Horizontal Resolution around Japan and Southeast Asia. Sci. Online Lett. Atmos. Sola 2008, 4, 133–136. [CrossRef] Chen, C.J.; Jun, L.I.; Wang, M.H. Verification of precipitation forecast using an operational numerical model during flooding season of 2013 in the middle area of China. J. Meteorol. Environ. 2015, 2, 1–8. (In Chinese) Olson, D.A.; Junker, N.W.; Korty, B. Evaluation of 33 Years of Quantitative Precipitation Forecasting at the NMC. Weather Forecast. 1995, 10, 498–511. [CrossRef] Zhou, T.J.; Qian, Y.F. An experimental study on the effects of topography on numerical prediction. Sci. Atmos. Sin. 1996. [CrossRef] Ren, G.Y.; Zhan, Y.J.; Ren, Y.Y.; Chen, Y.; Wang, T.; Liu, Y.X.; Sun, X.B. Spatial and temporal patterns of precipitation variability over mainland China: I: Climatology. Adv. Water Sci. 2015, 26, 299–310. [CrossRef] Mcbride, J.L.; Ebert, E.E. Verification of Quantitative Precipitation Forecasts from Operational Numerical Weather Prediction Models over Australia. Weather Forecast. 1999, 15, 103–121. [CrossRef] Li, J.; Wang, M.H.; Gong, Y.; Li, C.; Li, W.J. Precipitation Verifications to an Ensemble Prediction System Based on AREM. Torrential Rain Disasters 2010. [CrossRef] © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).