the run at version 4.4 on the Cray T3E will be denoted âC44â. 2.2 Description of comparison techniques. To show the effects of model spinup, timeseries of the ...
A comparison between HadCM3 integrations for COAPEC using Beowulf (UM version 4.5) and Cray T3E (UM version 4.4)
Alan Iwi & Bryan Lawrence Rutherford Laboratory July 2004
Executive Summary This report documents a comparison between two long integrations of HadCM3 which have been performed for COAPEC: one on a Cray T3E using Unified Model version 4.4, the other on a Beowulf cluster using UM version 4.5. A detailed comparison of difference plots between the two models shows a number of statistically significant differences in some fields, but these are small in magnitude, and both models are judged to be equally valid for scientific use for climate studies. Data are available from both runs at the NCAS/British Atmospheric Data Centre, and can be supplied to the research community.
1 Introduction The atmospheric science community has expressed concern as to whether Beowulf clusters are suitable platforms for climate model integrations based on the Met Office Unified Model (UM). (The issue as to whether the Beowulf clusters are in general suitable for atmospheric modelling has long been resolved.) There are three particular issues: Whether 64-bit integrations of fully coupled models are stable and are comparable with integrations on more traditional architectures, whether 32-bit integrations of any version of the Unified Model remain stable and are comparable with integrations on more traditional architectures, and more specifically, in the latter case, even if the atmosphere integrations are reliable, will the ocean integrations be stable? Of course these quesitons are actually not specific to Beowulf clusters and need to be answered for any new architecture (and model version). This report addresses the first of these concerns for one instance; a fully coupled climate version of the UM - HadCM3, by comparison of two 64-bit integrations:
1
A 100-year subset of a 1000-year integration, performed on a Cray T3E within the Met Office, using UM version 4.4 A 500-year integration, performed on a Beowulf cluster at Rutherford Laboratory, using UM version 4.5. Both integrations were performed for the COAPEC programme, and the output of both is available in the COAPEC section of the NCAS/British Atmosphere Data Centre. Obviously there were significant differences both between the hardware and software configurations of the models, so the comparison starts from the premise that the two models are different, so the questions are: 1. How different are the two climates? 2. Is the climate of the Beowulf simulation realistic and suitable for scientific use? The remainder of this document consists of four sections: section 2 outlines a more detailed comparison of the model configurations and the methodology used; section 3 gives the main results from this detailed comparison; section 4 addresses the first question above: “how different are the two simulations?”, based on a more detailed case-study of one of the quantities shown in section 3; section 5 summarises and draws some conclusions about the overall usability of the Beowulf simulations.
2 Description of models and comparison methods 2.1 Description of models The model runs described in this report were both time-slice experiments of the fully coupled climate model, HadCM3, with greenhouse gas forcing held constant at pre-industrial levels, and without the use of air-sea heat flux correction (although a salinity flux correction was applied). In both experiments, all calculations were performed using 64-bit data types. 2.1.1 Differences in model code Although the two model runs can both be described as integrations of HadCM3, they were run using model code taken from different stages of development of the Unified Model. UM version 4.4 was used for the Cray T3E integration, whereas version 4.5 was used on the Beowulf cluster. In fact, these model version numbers oversimplify the situation somewhat. In reality, there are frequent minor changes to the model code: the integration at 4.4 contained a number of code modifications which later became a standard part of version 4.5; likewise the integration at 4.5 contained further code modifications developed after the release of 4.5. Most of these code changes consist either of bug fixes or of optimisations. In a few cases the changes introduce wholly new physics schemes, but often these are only enabled if the user specifically selects the new scheme as an alternative. In summary: there is a long list of code changes between the two models, although in terms of model evolution they are very close. Amongst the list of changes are a few salient physics scheme differences between the model integrations: in the atmosphere, updated spectral coefficients for longwave radiation, a 3-dimensional CO field, and an accurate treatment of precipitation phase change; in the ocean, the Griffies isopycnal diffusion scheme replacing the older Redi scheme, and a new parameterisation of Mediterranean and Hudson bay outflow.
2.1.2 Model initialisation There are also differences in model initialisation: both integrations were performed from initial conditions supplied by the Met Office from previous integrations of HadCM3, but the version 4.5 run used a run dated 2789, and the version
2
4.4 a run dated 1849. While these dates are arbitrary (as external parameters such as greenhouse gases are constant), they do reflect a greater level of spin-up equilibration in the version 4.5 initialisation. This difference was in part due to the fact that it was not originally intended to run the V4.5 simulation for as long, and it was thought the better spun-up start would result in a better quality simulation. Given the other significant differences between the simulations it was thought this would not be a problem, but it did lead to a systematic offset between the two simulations as discussed below. 2.1.3 Nomenclature In order to reflect the fact that the integrations being compared in this report differed not only in computer hardware platform, but also in the model setup, the following labels will be used: the run at version 4.5 on the Beowulf cluster will be denoted “B45” the run at version 4.4 on the Cray T3E will be denoted “C44”
2.2 Description of comparison techniques To show the effects of model spinup, timeseries of the following annually averaged quantities were calculated: surface temperature: global mean, area-weighted average over all gridboxes surface temperature and precipitation: unweighted average over a British Isles
rectangle of gridboxes covering the
ocean root-mean-square horizontal velocity on various levels: this is calculated from monthly mean total ocean velocity (sum of barotropic and baroclinic components), then an area-weighted global horizontal average over all ocean gridboxes ( ) is taken at each level, regardless of ocean basin, then annually averaged and the square-root taken.
This report mostly presents anomaly fields, calculated and presented as B45 – C44, to show detailed differences between the simulations. For the atmosphere, the set of diagnostics was chosen (as far as available) based on those used by the “validation note” tests which are performed within the Met Office (although that specific test suite was not available for technical reasons1 ). Mostly these fields are seasonal averages for DJF and JJA, although a few fields for MAM/SON are also included. Additionally some representative fields are shown for the ocean; some ocean fields which have little interseasonal variability are shown as annual rather than seasonal means. For all anomaly calculations, an associated significance level was calculated. This was performed separately at each grid point to be plotted (after any spatial and seasonal averaging), using the composite multi-annual timeseries 2 for the given season. Anomalies locally significant at the 95% confidence level using a two-tailed -test are indicated on the plots. The statistical implications of the use of this technique are further discussed in section 4.
3 Detailed comparison of model runs This section comprises a discussion of many of the differences seen between the runs. It will be seen that most of them are minor, and attributable to either the initial conditions or the physics changes between the models. 1 It
has not been ported from the Cray; neither does it support the NetCDF file format to which the output files had been converted. all cases, the maximum number of available complete periods was used for any composite; for example, in C44 (100 years January– December) the JJA composites have 100 members, but the DJF composites, and the annual mean composites (which were constructed from seasonal means, hence run from December to November), each have 99 members. 2 In
3
Run B45
Run C44 o
C
Surface temperature: Global mean 14.6 14.4 14.2 14.0 13.8
Surface temperature: mean over "British Isles" box (12 gridpoints)
o
C
12 11 10 9 8
mg m-2 s-1
Precipitation: mean over "British Isles" box (12 gridpoints) 36 34 32 30 28 26 24 22
cm s-1
ocean RMS horizontal velocity at depth 5m 20 19 18 17 16
cm s-1
ocean RMS horizontal velocity at depth 447m 6.0 5.8 5.6 5.4 5.2
cm s-1
ocean RMS horizontal velocity at depth 2116m 1.95 1.90 1.85 1.80 1.75
ocean RMS horizontal velocity at depth 4577m cm s-1
1.20 1.10 1.00 0.90 0
100 Years (C44)
0
100
4
200
300 Years (B45)
Figure 1: Timeseries of selected quantities from the model integrations (see text)
400
500
3.1 Timeseries Figure 1 shows timeseries plots from the integrations for the quantities described in section 2.2 (using identical scales for C44 and B45 in both time and independent variable axes). These timeseries suggest an overall broad similarity between the results from runs B45 and C44 in the selected diagnostics. Average values are similar, although global mean surface temperature is approximately 0.1 K warmer in C44 compared to B45. Interannual variability of the various diagnostics shown has similar magnitude in the two integrations, and also similar temporal structure (i.e. high- versus low-frequency variability). The timeseries also suggest that, at least as regards the selected diagnostics, integration B45 is well equilibrated without obvious drifts. Integration C44 (based on less spun-up initial conditions, as discussed) is largely well equilibrated, although there is a drift evident in the deep ocean. This confirms the appropriateness of using the entire run duration as an averaging period in the anomaly plots shown in later figures (which do not extend down to ocean levels where the drift is seen in C44). There is actually also a small bug in the C44 run which results in atmospheric mass loss, and a gradual drift downwards in the surface pressure, as can be seen in figure 2. The rate of decrease (initially hPA decade, though slightly non-linear with time) results in a mean surface pressure anomaly of 1.7 hPA between the two simulations, attributable to the fact that B45 was initialised from a C44 run after approximately 1000 years of mass-loss.
C44
B45 982.7
984.3
982.6
984.2
982.5
hPa
984.4
984.1
982.4 0
50 100 years
0
50 100 150 200 250 300 350 400 450 500 years
Figure 2: Annual-mean global-mean surface pressure from the two integrations
3.2 Atmospheric dynamical fields In all the figures to follow, the quantity shown is the anomaly of the named field, with the sign B45 minus C44, which is also the meaning given in the discussion to the sign of the anomaly. Solid contours denote positive anomalies; dashed contours negative anomalies; faint dotted lines denote the zero contour. The labelling “[c=. . . ]” in the plot titles denotes the contour interval. Shaded regions correspond to the 5% statistical significance as previously described. For brevity, where possible we have chosen to highlight differences by choosing examples from one (rather than both) of the solsticial seasons. As explained above, there is a surface pressure difference between the simulations which is attributable to a (minor) bug in the C44 simulation. While the pressure difference ( hPa) is small, it could have a slight effect on the location of the hybrid - levels which would probably be most obvious in comparisons near orography. In any case, because pressure gradients are more important dynamically than the absolute value, the pressure anomaly fields are shown with the global mean removed (see figure 3). While this is relatively featureless, a few anomalies (of order 0.5–1 hPa) exist, most notably a positive anomaly in the North Atlantic in spring/summer, a persistent positive anomaly over the Himalayas, and various anomalies in the southern ocean. In addition, the American mountain chains (Rockies, Andes) show significant seasonally consistent differences. We have not yet attempted to ascertain as to whether that can be attributed to the surface pressure difference or other processes.
The global mean geopotential height (not shown) shows a systematic negative anomaly of m with height and latitude which is consistent with the pressure difference. When the global mean is removed (figure 4), a weak Rossby wave train in the mid-latitude is significant in DJF, but not so significant in JJA.
5
surface pressure (global mean subtracted), DJF (hPa) [c=0.5]
surface pressure (global mean subtracted), JJA (hPa) [c=0.5]
-0.5 -0.5
-0.5
-0
.5
0.5
-0.5
-0.5
0.5
-0.5
0.5
0.5
0.5
-0.5
0.5 0.5 0.5
0.5
surface pressure (global mean subtracted), MAM (hPa) [c=0.5]
surface pressure (global mean subtracted), SON (hPa) [c=0.5] 0.5
0.5
0.5
0.5
0.5
-0.5
0.5
-1.0
-0.5
-0.5 -0.5
Figure 3: Surface pressure differences for all seasons (after subtraction of global mean).
6
geopotential height at 500hPa (global mean subtracted), DJF (m) [c=1] geopotential height at 500hPa (global mean subtracted), JJA (m) [c=1]
-5
-1
1
2 -2
-2
-1
4 3
4
-2 -3
-2
-6 -54 -3-2 -1 1
-1
-1 -1
-1 2
-1
-2 1
-6
-5 -1 -2 -3 -4 2 4 3 1
-4 -3
-2 -1
-4 -3 --12
-3 -2
1
-2 -2
-1
21
2
-1
2
1
2
-6
-7 -6
-1
-2
3
1
3
2 1 -1
-1
-2
-8
-5 -6
1
-8 -10 -11 -12
3
2
-9
1 2
-3
-1
-6
-7
1
-1
-2
-2
1
-4
-2
1
-3 -4
-3 -4 -5 -6 -7
-1
-3
-6
-3
-4
-5
1
-1 1
1
1
1
1 2
1
-1
1
3
1
3 4 5
6
2
1
6
-4 -5
-1
1
-3
3
2
-1
-4
4
2
2
-3
-1
1 2
7
2
6
0
-2
1
4
4
1
-1
2
10
1
1 -1-2 3 --4-56 -
-1
1
4 5
2
-2
3
-2 -4 3 -5 -9 7
-8
2
2 3
-6
7
-3
7 6 5 4 3 2
1
6 5 4 3
3 4 5 7 8 9
-2
6
1
2 3 4 5 6 7
3 4
4 6 5
3
-1
2
2
1
2 1
-1
-2 -1
3
3
-1
2
-1
Figure 4: Geopotential anomalies at 500 hPa (after subtraction of global mean)
zonal wind (zonal mean), DJF (m s-1) [c=0.1]
temperature (zonal mean), DJF (K) [c=0.1] 0.40
0.50
0.20 0.30
-0.10
-0.10
-0.1
0
-0.20
-0.20
0.10
10 0.10
10
0.10
0.30
0.10
0.20
-0.10
100
-0.20
0.10
Pressure (hPa)
0.10 0
0.2
0.1 0
-0.20
-0.1 0
0.10
0
-0.10
0.1
-60
-30
0 30 Latitude (degrees)
60
90
1000 -90
-60
-30
0 30 Latitude (degrees)
Figure 5: Zonal-mean zonal wind and temperature anomalies for DJF
7
0
0.20
-0.10
1000 -90
-0.2
0.10
-0.10
Pressure (hPa)
0
.1
-0
100
0
-0.3
60
90
Upper-air winds and temperatures show minimal anomalies, none significant in the zonal mean (e.g. figure 5). An examination of the longitudinal mean structures (not shown) shows weak significant features associated with the Rossby-wave train anomaly seen already and nothing else of signficance. temperature at 1.5m, DJF (K) [c=0.5]
temperature at 1.5m, JJA (K) [c=0.5]
-0.5
-0.5
-0.5 .5 -0
-0.5
-0.5
-0.5
-0.5
-1.0
-0 .5 1.0
-0.5
-0
.5 .5
-1.0
-0
-0.5 -1.0
5
0.
-1.5-1.0
-1.0 -0.5
-1.0
-0.5
Figure 6: 1.5m temperature anomalies The near-surface air temperature (figure 6) is very slightly different throughout the globe (but especially over the oceans), with a persistent anomaly in the North Atlantic, and a winter anomaly above the polar oceanic region. Again these are probably consistent with the global mean air-mass difference associated with the initialisation. Near-surface zonal-wind anomalies are rather uninteresting and are not shown. There is a region of the tropical Atlantic in which m s throughout the year, see figure 7). the meridional wind is persistently different (of order
!"
#%$
3.3 Humidity, Cloud, and Precipitation There are significant physics changes between V4.4 and V4.5 of the UM, and so it is important not to over-interpret differences in the water and cloud variables. There are large differences in the relative humidity in the troposphere (e.g. see figure 8 for DJF), but these are not significant. Differences in the stratosphere are signficant, but small — of order 1–3% of relative humidity. Similar results are found for the specific humidity (not shown), with the largest differences in the middle stratosphere where the anomalies are of order of 30% of the background value. However, because there is no methane oxidation in these models, the stratospheric water budget is rather unphysical anyway (the specific humidity decreases with height instead of increasing). Zonally averaged cloud water and vertically resolved layer and convective cloud fraction do not show significant differences between the simulations (zonal mean convective cloud fraction anomalies, not shown, are less than 1% nearly everywhere). However, an examination of the geographical distribution of clouds does show signficant differences (e.g. JJA: figure 9, where the North Atlantic cloud anomaly may be associated with the surface temperature anomaly seen in figure 6). The geographical anomalies in cloud distribution are associated with precipitation differences in the DJF season (figure 10), although not as obviously in the JJA season. A comparison of the convective precipitation with the large-scale
8
meridional wind at 10m, DJF (m s-1) [c=0.2]
meridional wind at 10m, JJA (m s-1) [c=0.2]
0.2
0.
2
0.2
-0.2
0.2
0.2
-0.2
-0.2
-0.2 -0.2 0.2
0.2
-0.2
0. 2
-0
.2
0.2
Figure 7: Meridional wind anomaly at 10m.
relative humidity (zonal mean), DJF [c=0.1]
relative humidity (zonal mean), DJF (x1000) [c=5] 10 30
20
5
10
35
5
15
10
25
10
0.10
15
Pressure (hPa)
0
20 10
0.2
21 1 020 85 0
0.50
0.4 0 0
5 3025
0.3
215
20 10
30 35
-10
-60
25
-80 -75 -85 -90 -11 -10 -1-95 -1 -10 -1 015 5 0 2520
-30
20 1510 5
5
-10 -5 -15 -25 -20 -40-35 -30 -5-6 5-500-45 -7 -70-6 -8 5050 5 -9-8
0.2
0.500 0.4 0.30 0.20
-0.020 -0.3
0.10
0
90
0 30 Latitude (degrees)
30 25
60
40 45 5055 60 65 70 75 80 85 9590 100
0 30 Latitude (degrees)
100 -90
25 30 35
-0.20
100 105110 125 120 115130 140 135 155 150 145 170 165 185 160 180 200 175 215 210 190 230 220 205 225 245 0195 24235
10 15 20
0.30
-30
10
-60
-0.40
-0.10
0.20 0.10 -0.10
0.
0.10
0.20 0.10 -0-0.10 .20
0.10
0.10
-0.10
-0. -0.20 10
5
0
0.70
-0.10
0
0.10
5
0 0.3 0.40 0.50 0.60
-0.1
0.10
0.10
0.20 0.1
1000 -90
-5
0.20
0
20 25 30 35 40 45 50 55 60 65 70 75 80 85 90 5 9 110050 0 111115250 1121305305 1144150550 115610560 1177910895 1 105 2
15
20
0 0.1
0.1 0
0.30
-0.10
-0.1 0
Pressure (hPa)
-0.20 -0.1
10
5
0.20
100
60
550556065 35 440
90
Figure 8: Relative humidity anomaly in DJF, with tropospheric contour intervals and stratospheric contour intervals (Left and right panels respectively).
9
layer cloud fraction on level (zonal mean), JJA (x1000) [c=1]
total cloud fraction, JJA (x1000) [c=10] 10
-10
-10
20
3
10
30 1
100
-10
10
1
2
10
10
-10
4
-1
-1
10
10
-3
-2
10
-1
23
5
-4
-10 -10
0
10
10
10
-1
10
10
-1
10
-1
1
-1
1
10
-10
10
-1
-2
Model hybrid level (= p / hPa if psurf=1000hPa)
10
10
1
-1
1
1
1
10 -10
-10
-2
1
-2 1 -
1
10
-60
-30
0 30 Latitude (degrees)
1
60
-1
-10
54
3
465
2
87
132
-1
1
-1
-1
-2
1000 -90
1 -1 -2-3
90
-10
10
Figure 9: JJA Anomaly comparison between zonal mean layer cloud fraction and geographical distribution of total cloud fraction precipitation (figure 11) shows that the differences in precipitation in the tropical Atlantic are due entirely to convective precipitation (which suggests that the cloud differences in this region are also convective, despite the small zonal mean differences in convective cloud). These differences in precipitation also lead to a significant difference in soil moisture in the Brazilian region of South America (not shown). Figure 12 shows that the boundary layer height anomalies are of order a few tens of metres, and the main differences are consistent with the temperature and convection anomalies already noted. 3.3.1 Radiation As one might expect, the radiation anomalies are dominated by the effects of the cloud differences (e.g. outgoing radiation — figure 13). Although the surface downwelling radiation is not shown here, the patterns are consistent with what is seen in the top of atmosphere (outgoing) fluxes. The differences can be summarised as: Shortwave: increased cloudiness increases outgoing shortwave but decreases surface downward shortwave (due to scattering). Longwave: increased cloudiness increases surface downward longwave, but reduces outgoing longwave (OLR) (because clouds emit longwave both upward and downward, but by an amount which is less than the surfaceemitted OLR which they absorb, due to temperature differences).
10
total precip rate, DJF (mg m-2 s-1) [c=2]
total cloud fraction, DJF (x1000) [c=10] 10 10
10 -10
-10
-2
-2
-10
10
-10
-2
-2 2 -4
2
-10
2
68 24
10
4
-2 -6
-2
4
2
10
6
-2
-2 24 6
-2
4 -2 -2 -4
42
-2
-6
2
2 2
-2
10
-4
2
0 -1
-10
-4
4
-10
2
-4
10
2
10 20
-2
8
30
10 20
4
4
10 20
2
-2
-10
-6
0
2
10
0
-1 -2 -4
-1
2
2 -2
-2
10
-2
-10
10
10
-10
-10
10
10
Figure 10: DJF anomalies in geographical distribution of total cloud and precipitation
convective precip rate (rain+snow), DJF (mg m-2 s-1) [c=2]
large-scale precip rate (rain+snow), DJF (mg m-2 s-1) [c=2]
-2
--24
-4 -2
2
-2
2
4 6
-4
-2
-2
4
2
10
4
-2 42 6
-2 -4
4 2
2 -2
-6
2 2
-2
-2
-6
-2
-4
8
2
4
-2
2
-2
2
-2
-6
-2 -4
4
-4
4
-6
68 2
2
2
-2
2
Figure 11: Partition of DJF anomalies in precipitation between convective and large-scale precipitation (rain plus snow).
11
boundary layer height, DJF (m) [c=10]
boundary layer height, JJA (m) [c=10]
10
0
-1
10
-10
10
-20 -10
-10
-20 10
-1
0 -10
-10
10
-10
10 10
-20 -10
10
-10
-10
30 20
10
10
-10
-10
0
-1
10
10 10
10
-10
10
-10 10
10
-10 -10
-10
10
-10
-10 -20
0 210
0 102-10
-1
0
30
20
10
10
20
10
10
10
0
-1
-10
-10
10
Figure 12: Boundary layer height
12
-10 -20 -30
-10
outgoing LW at TOA, DJF (W m-2) [c=1]
outgoing LW at TOA, JJA (W m-2) [c=1]
-1
-1
-1
-1
-1 -1
-1
1
-1
-1
1
-1
1
-1
-1
1
1
2
1
-2
1
1
-1
-1
-1
1
1 1
-2
-1
-1 1
-2
-1
-2
3
1
1 2
3
-1
-1
1
2
-2
-2
1
2
-1
1 1 -1
2 1
1
-1
1
-4 -3 -2 -1
1
3
2
-1
-1
1
-5
1
-1 -2 -3
2
1
1
-2
2
-1
-1
1
1
1
-1 1
-1
-1
-1
-1
-1
-1
-1
-1
-1
-1
-1
-1
outgoing SW at TOA, DJF (W m-2) [c=1]
outgoing SW at TOA, JJA (W m-2) [c=1] 1
2
12
1
2 -2
1
-2
-1
-1
3
1 1
1 -1
2
1
1
21
1 1
1
1
-2
1
2
-2
-1
2
-1
-3
1
5
6
2
1
1
-1
1
2
-1
-2
-3
-2
-1
1
2
2
1
3
2
-2
2
1 -1
-1
2 2
-2
1
-1
1
-2
1
3
-1 1
-2
1 2
1
-1
1
1
3
3
12
-1 -2-3
1
2
-1
-3
-1
-2
2
1
-1
1
32 1
1
-1 2
-1
1
2
-2 -1
1
-2
1
-1 -2
-1
-1
1
-1
-1
1
2
5
1
1
1
1
-1
-1
1 -1
-1
-2
3
1
1
5
1
-2 1
-2 -1 1
3
1
2
-1
1
1
1
1
2
1
3
4
-1 -2
-1
12
-2
-1
1 2 3
-2
1
1
-1
1
-1
2 1
-2
1
5
5
-1
-1
-1
-1
1
2
2
6
1
1
12
-1
2
-1
4
1
1
1
-1
1
1
1
1
-1
2
12
-2
-1
1
-1 -2
-1
1
2
1
1
2 1
-1 -2 23
2
Figure 13: Outgoing longwave and shortwave radiation (top of atmosphere)
13
3.4 Air-sea fluxes surface zonal wind stress, JJA (N m-2 x 1000) [c=2]
surface meridional wind stress, JJA (N m-2 x 1000) [c=2]
-2
2
-4
2
-2
-2
2
4
-2
6
2
8
2
4
2
2 -2
2
-2
4
2
2
-2
4
2
-2 2
4
-2 -4 -6
-2
-4
-2
-2
-2
2
-2
-2
-2
-2
-4
2
-2
2
-4
-6
-2
2
-2
-4
2
2
-2
-2
2
-2
-2
-2
2
-6
-4
2
2
6
4
2 4
2
-2
-4
2
6
-2
2
-2
-2
2
4
2
-2
-2
4 2
2
-4-2
2
-2
-2
-2
2
-2
-6
2
-4
-2
42
2
-2
-8
-2
6
2
-2
2 2
2
4
4 -2
-2 -6-4
-2
2
-4
-2
2 -2
2
-2
-2
-4
2
-8
-2 2
2
2
2
-2 46 8
2
-2
42
-2 -2
-2
4
2
6
2
2
-2
2
4
4
-2
-8
4
-4 -6 2 -4
4 -1 -10
-4
8 46
-6 -12
2
-4
-4
2
6 24
6
4 6 8
2
-6
-4
6 2 -4-2
-2 2 4 6
4 -2
-2
4 -2
6
-2-6
24
2
2 -2 -4
-4
2
6
6
-2
-6-4
4 6
-2
2
2
4
2
2
-2
2
6
4
-2
-2 -4
-4
4
8
-4
2
-2 2
-2
-2
Figure 14: Surface wind stress anomalies for JJA. Figure 14 shows anomalous wind stresses for JJA. It can be seen that the differences are more significant in the meridional direction than the zonal direction, and are well correlated with the 10m anomalies in meridional wind (figure 7). The sea-air heat flux anomalies (figure 15; positive is upwards for B45 / downwards for C44) are also consistent with the stresses and the surface downward longwave radiation anomalies. The latter is expected since enhanced evaporation due to enhanced downwelling longwave gives a negative latent heat flux anomaly. Although the latent heat flux anomalies are larger over a greater area of the southern ocean the sensible heat flux is also well correlated.
3.5 Ocean fields As already seen in the timeseries in figure 1, the ocean shows small but readily detectable differences between the two integrations, which are evident from the start, and hence due at least in part to the differences in initial conditions. Given the long persistence of any initial anomalies, and the relatively small interannual variability, it would therefore be expected to find that even small anomalies in the deep ocean have high statistical significance. Figures 16 and 17 show anomalies in ocean potential temperature and salinity between the integrations, at the 5 m (i.e. top) and 447 m levels. Temperature anomalies are mostly small, except for some well-defined regions in the north Atlantic and around 40–50 S. The ocean temperatures are well correlated between the 447 m level and the top ocean level, as well as with surface air temperature (SAT, figure 6), and are doubtless a key factor in the SAT anomalies. The 5 m ocean temperature field also shows the effect of forcing of the ocean by the atmosphere (due to anomalies in wind stress and radiative fluxes), in that the larger area of non-significance at 5 m compared to 447 m is indicative of atmosphere-induced variability. B45 has generally fresher water than C44, with differences up to 0.5 PSU, mostly in the Atlantic. At depth, differences are smaller ( PSU) but as with potential temperature the statistical significance is high.
&
'"(
Figure 18 shows mixed-layer depth anomalies. There are differences of order tens of metres in winter mixed-layer depth, in the main regions where atmospheric fields show anomalies already discussed (north Atlantic, southern ocean); in the generally shallower mixed layer in summer, any anomalies are also correspondingly smaller.
14
sensible heat from sea, JJA (W m-2) [c=5]
-5
-5
-5
latent heat from sea, JJA (W m-2) [c=5]
-5 5
5
5 -5 -10
5
150
-5
5
5
-5
5
10
5
-5
-5
-5
Figure 15: Sea-air heat flux anomalies for JJA.
potential temperature at 5m, annual (degC) [c=0.2]
potential temperature at 447m, annual (degC) [c=0.2] -0.2
-0.6 -0.2
-0.2
-1.0
-1.0
-0.4 -0.2
.2 -0
.4
0.2
-0.2
-0
.4
-0.8
-0.6
-0.8
-0.2
-0
-0.2
-0.2
-0.2
0.4 0.2
-0.2 -0.4 -0.6
-0.4 -0.2
-0.2
-0.2
-0.2
-0.4 -0.2
0.2 0.4
0.2 0.4 -0
.2
-0.2
0.4
.2
-0
-0.2
-0.2
-0.2
Figure 16: Annual mean ocean potential temperature at 5m (left) and at 447m (right).
15
-0.2
-0.2
salinity at 5m, annual (PSU) [c=0.1]
salinity at 447m, annual (PSU) [c=0.1]
-0.10
-0.10
0
0.1
-0.10
-0.20
-0.10
-0.10
-0
.2
0
-0 .2 0
0 -0.50 -0.3-0.40
-0.10
-0.10
-0.10
0
-0.1
-0.10
-0
-0.1 0
-0.1
0
-0.10
.1
0
-0.1
.10
-0
0
-0.10
-0.10
-0.10
-0
.10
0
-0.1
-0.10
-0.10
-0.10
-0.1
0
-0.1
0
-0 .1 0
-0.1
0
-0.1
0
-0 .1 0
-0.10
-0 .1
0
-0.10 -0.10
Figure 17: Annual mean salinity at 5m (left) and at 447m (right).
mixed-layer depth, JJA (m) [c=10]
-20
-50
-30 -40
-10
mixed-layer depth, DJF (m) [c=10]
-10
10
10
-10
20
10
10
10
-
10
-10 -10
10 20 10
Figure 18: Mixed layer depth
16
10
10
-10
10
-10
ocean total u-velocity at 5m, annual (cm s-1) [c=1]
ocean total v-velocity at 5m, annual (cm s-1) [c=1] 1
-1 1
1
-1 1
-1
-1-2 1
1
-1
1 -2 1 34
-1
1
2-1
3
1
-1
1 1
-1 -1
2
1
1 1
1
11
-1
-1
1
1
1
-1
ocean total v-velocity at 447m, annual (cm s-1) [c=1]
1
-1
-1 1
ocean total u-velocity at 447m, annual (cm s-1) [c=1]
2
1
1
-1
1
Figure 19: Ocean horizontal velocity (top level and 447 m)
17
-1 1
1
-2
2
34
1
-1
4
1
1
1
23
-1
1
1
-11 -2
1
-1
-1
1
1
1
1
2
23
2
1
1
2 1 1 2
-2
2
1
1
1
-12
1 -1
-1
32
1
12
1 -1
-1 1 -2
2
2 1
4
-11 1
1
-1
1 -1 -1
1
-1
2
-1 -1
-1
3
34 5
3
1 2
1
1 -2-1 -3
1
-2
6
-2
1
15 4
1
2 1
1
43
5 2
2
2
-1
-1
-2 -1
1
1
1
1
2
2 3 1
13
1 2
32
1
-1
1
-4
23
-2 -1
1
1
-1 -1 1 -3 1 34 -2 2
-4
-1 -1
-1
1
1
1
1
1
1
-1
-1 1
Figure 19 shows horizontal velocity anomalies at the top level and at 447 m. The most notable features are a 1–2 cm s eastward anomaly in -velocity in the southern ocean and in the tropical Atlantic.
ocean vertical velocity at 243m, annual (mm s-1) [c=0.001]
ice thickness (gridbox mean), annual (m) [c=0.1] 0.10 0.20 0.10
0.20 0.10.20 0
#)$
0.10 0.20
0.10
0.10
0.10
0.10
0.10 0.20 0.10
Figure 20: Annual mean differences in ice thickness (left) and vertical velocity at 243m (right). Finally, although not related, ice depth and oceanic vertical velocity anomalies are displayed together in figure 20 for conciseness. The ice depth in B45 is approximately 0.2m deeper than in C44 in both polar regions throughout the year. The largest differences in oceanic vertical velcity at (243m) are around the continental boundaries (presumably related to small horizontal differences). There appears to be more rising (or less sinking) in the north Atlantic and southern oceans in B45, although the numbers are small.
18
4 Discussion The detailed comparison in the section 3 outlined a number of differences between the results of integrations B45 and C44. Here it is discussed whether these differences are statistically significant, and if so whether they are important.
4.1 Statistical significance of anomalies Although we have seen that many of the fields of anomalies between the two integrations have areas where the anomaly passes a -statistic significance test at the 5% level, it is difficult to deduce from this that these detected differences are truly statistically significant, as even under the null hypothesis of identical model climate, some proportion of the local significance tests will show differences to be significant at any given probability cutoff level. 3 Furthermore, because of the underlying model physics, any such locally significant differences are likely also to exist in a consistent way across a number of physically related fields, and to show some spatial correlation corresponding to the scale of the simulated features.
In principle, it is possible to establish significance with greater rigour by using a part of the dataset for exploratory comparison, in order to choose carefully a small number (say ) of statistical indices for formal hypothesis testing, and , use the remainder of the dataset to test the significance of these indices formally (each at probability level so that the overall probability of a false positive is no greater than ).
*
+-,.+ 0/1$3234
It is not the intention here to follow this methodology completely. However, as an example, let us consider how it would work in the case of one of the most substantial differences already noted, namely the difference in top-ofatmosphere outgoing shortwave radiative flux (hereafter OSR) in the North Atlantic in summer (JJA), associated with positive cloud cover anomalies (B45 minus C44). Let us suppose that in integration C44, the last 40 years were used for exploratory anomaly plots, and the first 40 years were then used for formal testing, and that in B45 the last and first 240 years respectively were used. (This gives an unused 20-year gap between the two periods, to help them to be reasonably independent even if there is some decadal variability.) The OSR anomaly plot in this case is shown in figure 21 (this is as for the OSR shown in figure 13, but as calculated using the sub-timeseries mentioned). In this case, the OSR anomaly pattern still shows an obvious significant anomaly over the North Atlantic. Although this does not itself establish that this feature would reasonably be chosen for formal testing (as anomaly plots for some other fields could appear to show features which are more important, when recalculated using these sub-timeseries), let us nonetheless suppose that a hypothesis test is to be performed on an index which is the average OSR value over the region bounded with a dashed box in figure 21 (unweighted average of 66 grid points).
#
#
#
This box-average JJA OSR has the following values in the formal-testing periods from the two integrations: 154.2 W m (with interannual standard deviation 4.5 W m ) in the first 240 years of B45, and 150.1 W m (with interannual s.d. 4.4 W m ) in the first 40 years of C44. This gives a 4.1 W m anomaly with an associated -statistic probability of .
50#16
#
#
This impressively low probability shows with virtual certainty that there is a difference in JJA OSR between B45 and
7
3 It is, in principle, possible to test instead for “field significance” on the overall difference field (or here, the overall set of heterogeneous difference fields) between the two integrations. This works as follows. If is the proportion of points which pass a local significance test when comparing two timeseries, then the fields will be significantly different at the 95% level if the value of is beyond the 95% point of the distribution of which is obtained using many randomly selected pairs of timeseries from a long control dataset. However, this requires a control dataset which is very much longer than the actual pair of timeseries to be compared, and even if most of the 500-year integration were used as the control, it is very questionable whether this condition would be achieved. (This is because to establish with any accuracy the 95% cutoff value of the distribution of , there must be more than just one or two points above this value; this suggests the need for at least of order 100 truly independent pairs of timeseries, each as long as the timeseries to be compared between integrations.) In any event, all the field significance would show is whether some difference exists between the two integrations (but without indicating that differences in any particular fields or regions are significant). Given the differences of model version, this in itself would not be a particularly informative result.
7
7
19
7
C44, and is robust even if this statistic were tested as part of a large set of candidate indices.
4.2 “Importance” of anomalies
#
Let us stay with the example of Atlantic OSR. We have shown reasonably conclusively that there is a statistically significant difference between the integrations. However, is an anomaly of 4.1 W m enough to be considered “important”? We have already seen that it is of similar magnitude to the interannual variability in the integrations. Here are comparisions with the magnitude of some other variations.
#
#
Spatial and temporal variability: OSR has a large annual cycle: in B45, the average OSR over this box drops from 154 W m in JJA to 84 W m in SON. It also has large spatial variations: see figure 22(a) for full fields (not anomalies) for JJA OSR from B45. These mean that a 4 W m difference in OSR is equivalent to a 1 offset in latitude or a 5-day offset in time.
#
&
Comparison with observational data: These integrations are forced with pre-industrial levels of CO , which limits the usefulness of comparisons with observations of recent climate. Nonetheless, a comparison with observations is useful for rough comparison of the magnitudes of between-model anomalies and model-data anomalies, even if the causes of model-data anomalies are difficult to separate into the effects of model limitations versus those of different forcings. Figure 22(b) shows the JJA average OSR for the period 1985-1989, derived from measurements from the NASA Earth Radiation Budget Satellite (ERBS). Although the patterns are broadly similar to those shown in figure 22(a), there are model-data anomalies of a few tens of W m in the Atlantic, so the model-model anomalies are comparatively small.
#
#
#
#
For the specific box under consideration, the area-average OSR is 137.1 W m , (c.f. 154.2 W m and 150.1 W m for the two models as above). The associated inter-annual standard deviation in the observations is 4.5 W m . Hence in this one particular variable the model appears to estimate variability correctly, even though models are in general known to underestimate variability; that said, the observed timeseries is a small sample, and too short a timeseries to contain decadal variability.)
#
Overall, the inter-model anomalies, although statistically significant, appear fairly small compared to other variability. Additionally, even though only one variable has been discussed in detail here, for the other fields shown in this report the magnitudes of the anomalies have also been similarly compared with the spatial variability in the full fields, and although the full fields are not shown here, it can be reported that the differences due to model-model anomalies are generally small compared to spatial variations.
20
outgoing SW at TOA anom (from sub-timeseries), JJA (W m-2) [c=1] 1
2
2
-1
-2
-3
2 1
1
1
2
-1
32
1 3
2
3
1
1
3
4
-1
-2
2 3
1
1
1
1
-1
-1
1
2
3 2 4
1
1
-3 -2
1 -1
1
1
1
2
1
-1
1
-1
1
1
1
1
1
1
1
-2
324
2
-1
1
-1
1
-3
-1
1 -2
-2
-1
2
2
1
1
-2
-2
1
-1
2
-3
-5 -3 -1 12 3
-4-3 -1
2
4 213
2
-2 -4
-2
1
-3
4 3
3
2
-2
12
-1
4
-2-1
-1
5
65
1
-1
12
-2
-3
1
-1
1 -1
5
-1
1 1
-2
1
-1
4 6 5
2
1
-2
12
1
-1
-2 -1
1 32
1
-1 -2
-1
1 -1
2
-1 -2
2
3
1
1 32
4
2
1
2
2
-1 -2-3
-3
321 1
-2-1 -3
1
1 23
-2
1
-1
-1
3 21
1
-1
-1
1
1
-1 -1
1
1
Figure 21: Top of atmosphere outgoing shortwave for JJA, as in figure 13, but calculated from subset timeseries, see text
(a) outgoing SW at TOA, JJA, full field from B45 (W m-2) [c=10] 220
200
200240 220 260
0
240
180
22
160
160 0
0
160
180
180
0
0
160
0
16 14
160 200
16
0
180
16
18
18
160
180
2000 18
(b) outgoing SW at TOA, JJA, from ERBS 1985-9 (W m-2) [c=10]
220
220
60
80
140
160
0
0 14
120
18
0 10 10 0
100
60
80
80 60
60
80
80
60
60
60 60
60
60
120
80 0
12
80
80
60
60
60
20
1 12000
0
0
10
0 12
60
40
100
80
80
80
60
80
60
140 12000 1
60 80
100
80
60
80
60
120
140
100 80
100
80
140
120
100
80 100
0 12
80
100 20 1410
100
80
00 1210
0
60
16
0
14
0 12 0 10
80
100
80
140
120
0
80
80
0 80
120 140
10
120 60
120
0
180 160 140 120 100
0
16
14
10
120
0
120
10
100 80 60
10 0 10
140
0 18 160
14012 0
140
1
01 14 20
140
200 180 160 140
200
1 40
60
40
20
Figure 22: Top of atmosphere outgoing shortwave for JJA, full fields (not anomalies): (a) from run B45, and (b) from ERBS observations
21
5 Conclusions Results from two integrations of HadCM3 have been compared: The first (C44) was a 100-year run performed on a Cray T3E using Unified Model version 4.4. The second (B45) was a 500-year run performed on a PC Beowulf Cluster using UM version 4.5. Both assumed pre-industrial CO concentrations, and were run at 64-bit precision.
A detailed comparison of model output from the two integrations has been undertaken. This has shown a few small but discernible differences between the two integrations, most notably slight differences in geopotential and surface pressure, stratospheric water vapour, and in a set of physically related atmospheric and oceanic fields in the Atlantic and Southern Ocean (cloud cover, radiation, precipitation, soil moisture, air-sea fluxes, ocean temperature and ocean zonal velocity). Although the statistical methods used have largely consisted of ad-hoc inspection of local significance levels, from a more careful case-study it appears that at least some of these differences are genuinely statistically significant. The fact that statistically significant differences exist between the B45 and C44 model integrations exist underlines the fact that when performing a controlled experiment, it is always highly inadvisable to “mix and match” control and perturbation runs from two different hardware platforms or model versions. (Here there is no separation of the effects of model version versus hardware platform, although a separate comparison of the same model on Beowulf and Cray also showed differences to be small.) However, the magnitude of the differences is small in comparison with typical levels or variability in the relevant fields (in the case of the pressure and geopotential anomalies, this is particularly evident when considering spatial derivatives which are more dynamically important than their absolute values), or they relate to stratospheric water values which are known to be unphysical anyway. Provided that the models are used appropriately (i.e. conducting control and perturbation experiments on the same platform), the evidence here strongly suggests that both combinations of model version and hardware platform examined here are valid for the uses expected from climate models. Any differences between them are relatively minor, and given the simulations are in any event for pre-industrial conditions, it is not possible here to use observational data as an arbiter of these minor differences.
22