Appendix I Applying the box-counting method to a ...

3 downloads 0 Views 1MB Size Report
California in the 2000s (second section in Table A1). We used those ..... Ojai, CA. Sacaton, AZ. El Paso, TX. Santa Barbara,. CA. Safford. Agricultural center, AZ.
1

Appendix I

2

Applying the box-counting method to a diffusion growth model

3

Introduction and methods

4

We tested the robustness of the box-counting method by applying it to an expansion

5

model with known scale-dependent pattern. We chose a simple diffusion growth model in

6

which the movement of the invasion front approaches asymptotically a constant speed VF as

7

time becomes sufficiently large (Andow et al. 1990). Our model assumes a radial expansion

8

over homogeneous environment. Therefore, decadal expansion rates are invariant across

9

scales smaller than VF and should equal D2/(D-1)2 in the Dth decade. At scales larger than VF,

10

the expansion rate should rapidly decline to zero, but will occasionally rise when the range of

11

the population breaks the boundary of those larger scales.

12

In this analysis, we set the constant VF as 186 km per decade and let the population

13

expand radially for 6 decades. We randomly sampled 50 points within the population range in

14

the first decade and then consequentially increased the number of random sampling points in

15

the following decades. The increase was proportional to the increase in range size between

16

decades. We then applied the box-counting method to those sampling points to calculate the

17

expansion rate for the 2nd – 6th decade. The estimation was done at the scale of 1, 2, 5, 10,

18

20, 50, 100, 200, 500, and 1000 km scale. We expected that the estimates at 1 – 100 km scale

19

match the theoretical rate of D2/(D-1)2; the rate at 200 km scale be slightly lower than the

20

theoretical rate because the scale is slightly beyond the VF; and the rate at 500 and 1000 km

21

be zero except for decades in which the invasion front move into new squares of 500 and

22

1000 km size.

23

Estimates from the box-counting method can be sensitive to the placement of the grid.

24

We shifted the grid from its original placement (population starts at the center of the grid) in

25

16 directions (π/8 radians apart) and three distances (600m, 26km, and 108 km) in each

26

direction. Those replacements allowed us to assess how much fluctuation in estimates of

27

expansion rate can be caused by grid placement. We then averaged the estimates over all the

28

placements to examine whether those mean estimates are more reliable in representing the

29

true expansion rate.

30

Results and discussion

31

The estimated expansion rates at 1 – 100 km scale generally match their theoretical

32

values (Fig. A1). Fluctuations of the estimates due to grid replacements become noticeable at

33

scales beyond 50 km and the degree of fluctuation increases with scale. The 49 different grid

34

placements only result in a few different estimates, suggesting many placements give

35

consistent estimates. Estimates in the second decade have stronger fluctuation than those in

36

other decades, probably because of the lower number of sampling points.

37

When fluctuations in estimates are large, taking the mean over all the placements

38

gives a more reliable expansion rate (closer to the theoretical value) than using an estimate by

39

a single grid placement.

40

As predicted, the expansion rate at 200 km is lower than those at smaller scales

41

because the invasion front moves at a speed slightly slower than 200 km per decade. At 500

42

and 1000 km scale, the expansion rate is either close to zero or surges dramatically. Since

43

those two scales are much beyond the invasion speed, a quantitative interpretation of their

44

expansion rates is not very meaningful. Any surge is a qualitative indication that the invasion

45

front moved into new 500 or 1000 km squares in that decade.

46

Overall, the box-counting method is capable of reproducing the expected scale

47

dependent or independent pattern of range expansion. Its estimates can be sensitive to grid

48

placement especially at larger scales. Averaging the estimates over multiple placements can

49

ease the sensitivity and produce more reliable results.

50 51

Figure A1. Expansion rates in a diffusion growth model estimated by the box-counting method at 1 –

52

1000 km scales. Dash lines indicate the theoretical values of the expansion rate at 1 – 100 km scale.

53

Each “+” indicates an estimated expansion rate by each grid placement. Circles are the mean over

54

estimates of all grid placements.

55

References

56

Andow, D. A. et al. 1990. Spread of invading organisms. - Landscape Ecol 4: 177–188.

57 58

Appendix II

59

Quantifying sampling bias by applying box-counting method to herbarium records of

60

native species

61

Introduction and methods

62

Herbarium sampling efforts may vary across spatial scales and between decades. This

63

variation will introduce bias to our multi-scale analysis. To correct for the potential

64

spatiotemporal sampling bias, we chose herbarium records of three native winter annual

65

species (Plantago patagonica, Lepidium lasiocarpum, and Chaenactis stevioides) collected

66

between the 1950s and the 2000s and applied the box-counting method to those records. We

67

shifted the grid placement as in the analysis for Sahara mustard and for the diffusion growth

68 69

model. These three species are commonly found in the southwestern North America. We

70

assumed that the distribution of those native species were relatively stable during this period.

71

Any deviation from zero expansion rate can indicate decadal difference in sampling efforts.

72

We estimated the “expansion” rates at 1, 2, 5, 10, 20, 50, 100, 200, and 500 km scale. We

73

expected that the rates were close to zero at the 500 km scale (i.e. no sampling bias at this

74

scale) if the distribution of the native species stayed stable. It is unlikely that herbarium

75

collectors would search for those plants in a 500 x 500 km region in one decade and

76

completely ignored that region in the following decade.

77

Results and discussion

78

The “expansion” rates at 1-100 km scales substantially deviated from zero in each

79

decade for all three species (Fig. A2a-c). Estimates by different grid placements were quite

80

consistent with each other. Surprisingly, at 500 km scale, range of C. stevioides showed

81

strong contraction in the 2000s and the range of L. lasiocarpum showed expansion and

82

contraction in the 1960s and the 1970s respectively. As explained, we consider those regional

83

scale expansion and contraction unlikely a result of sampling bias but more likely a reflection

84

of true regional scale range shift by these two species. This interpretation is further supported

85

by the fact that while one species showed regional scale range shift in one decade, the other

86

two species did not show the same trend.

87

To avoid using true range shifts as indicators of sampling bias, for each decade we

88

combined records of only species whose 500 km scale range remained stable. In particular,

89

we combined records of P. patagonica and C. stevioides to calculate the native “expansion”

90

rates in the 1960s and the 1970s, records of all three species to calculate the rates in the 1980s

91

and the 1990s, and records of P. patagonica and L. lasiocarpum to calculate the rates in the

92

2000s (Fig. A2d). At each scale, we averaged the estimates over all the grid placements and

93

used this mean as a quantitative indicator of sampling bias. We subtracted those native

94

“expansion” rates from those of Sahara mustard to calculate the corrected expansion rate of

95

this invasive species (Fig. 1c in main text).

96 97

Figure A2. Decadal “expansion” rates estimated by the box-counting method for three native winter

98

annual species (a) Plantago patagonica, (b) Chaenactis stevioides, and (c) Lepidium lasciocarpum

99

representing the variation in herbarium sampling efforts between decades. C. stevioides and L.

100

lasiocarpum show range shifts at the 500 km scale in some decades, which are unlikely a result of

101

change in sampling efforts at such a large scale. (d) We derived the combined “expansion” rates to

102

represent sampling bias by combining records of P. patagonica and C. stevioides for estimates in the

103

1960s and 1970s, records of all three species for estimates in the 1980s and 1990s, and records of P.

104

patagonica and L. lasiocarpum for estimates in the 2000s.

105 106

Appendix III

107

Inquiring distribution of Sahara mustard in North America and its native range To infer the historical and current distribution of Sahara mustard in North America,

108 109

we searched a large number of online herbarium databases covering the continent (Table A1)

110

and acquired all the collection data points wherever possible. Only four databases (indicated

111

by stars in Table A1) have records of Sahara mustard and the majority of the records came

112

from SEINet and CCH. In these two databases, the total number of all plant collections and of

113

Sahara mustard varied strongly among decades and among different states (Table A2). We

114

applied the box-counting method to those herbarium records to calculate decadal expansion

115

rates across multiple spatial scales. We also acquired records from invasive plant surveys conducted in Arizona and

116 117

California in the 2000s (second section in Table A1). We used those records, combined with

118

herbarium records, to build species distribution models for inferring the climatic niche of

119

Sahara mustard. To infer the distribution of Sahara mustard over its native range, we surveyed the

120 121

existing literature and inquired the Global Biodiversity Information Facility (GBIF) database.

122

Distribution inferred from each source is listed in Table A3.

123 124

Table A1. List of herbarium databases and invasive plant surveys inquired in this study. “*”

125

indicates a herbarium database in which records of Sahara mustard were used in this study. Region

Herbarium Database

Global *Global Biodiversity Information Facility (GBIF)

Southwest

*Southwest Environmental Information Network (SEINet) *Consortium of California Herbaria (CCH) *New Mexico Biodiversity Collections Consortium University of Texas Herbarium Botanical Research Institute of Texas Herbarium University of Oklahoma Vascular Plants Database University of Utah Garret Herbarium Intermountain Region Herbarium Network

Northwest

Consortium of Pacific Northwest Herbaria

Great Plains and

Great Plains Herbarium Network

Midwest

Black Hills State University Herbarium (South Dakota and Wyoming) Iowa State University Herbarium Wisconsin Botanical Information System University of Michigan Herbarium

Southeast

Southeast Regional Network of Expertise and Collections

Northeast

Consortium of Northeastern Herbaria

Region

Invasive plant surveys

Southwest

Southwest Exotic Plant Mapping Program (SWEMP) California Invasive Plant Council (Cal IPC) Saguaro National Park Survey Cameron Barrow’s study of Sahara mustard in Coachella Valley, California Author’s personal records

126

127

Table A2. The number of records of all plant collections obtained within the region between 100-121 degree west and 25-38 degree north

128

registered in the SEINet and CCH database. The number of herbarium collections of Sahara mustard is also listed in the table for comparison.

129 All

All

All

All

All

All

All

collections

collections

collections

collections

collections

collections

from CA

from CO

from Mexico from NV

from TX

from UT

(SEINet)

(SEINet)

(SEINet)

(SEINet)

(SEINet)

All collections collections Decade

Records of

from AZ & NM from CA

Total

(SEINet) (CCH)

Sahara mustard

(SEINet)

1920s

1

1930s

0

1940s

52110

32526

1524

1009

2611

1101

195

1924

93000

4

1950s

48085

29448

822

995

2779

118

64

925

83236

6

1960s

85038

76214

996

871

6265

1292

107

2133

172916

32

1970s

64130

95196

783

609

5609

1766

544

2090

170727

62

1980s

58924

81463

1359

2243

5287

1322

284

2573

153455

73

1990s

92220

122800

613

1940

2189

629

202

2379

222972

76

2000s

136487

171478

305

196

567

154

90

762

310039

279

130

Table A3. The distribution of Sahara mustard over its native range recorded in the literature and

131

GBIF database. Sources

Described distribution of Sahara mustard

Jalas 1996

Along the coastal area of Mediterranean Europe, including continental coast and islands of Spain, France, Italy, and Greece.

Zohary 1966 and Zohary et al.

In Egypt: Nile Delta, Nile Valley, western desert, oasis, northwest,

1980

northeast, Northern, Central and Southern Sinai. In Saudi Arabia: Hejaz, eastern Arabia, central Arabia. Bahrein. Kuwait. In Israel, Palestine and Jordan: Mediterranean Littorals, northern, western and central Negev Desert, Acco Plain, Sharon Plain, Philistean Plain, desert of Edom, Jordan Mts, East Jordan Desert, Southern Jordan Desert. In Syria and Lebanon: coastland, Lebanon Mts, Jebel Druze, Northern Mts. Northern, southern and eastern Cyprus. In Turkey: Western Anatolia, Mesopotamian Anatolia, Aegean Islands. In Iraq: mountain region, lower Mesopotamia, northern plains and foothills, western and southern desert. In Iran: northern, southwestern mts, and central, and southern Iran.

Townsend and Guest 1980

In Iraq: occasional in the steppe region. Common in southern sector of the desert region.

Miller and Cope 1996

Saudi Arabia, Southern Yemen, Oman, UAE, Qatar, Bahrain, Kuwait, S&W Europe, N Africa and SW Asia. On sand and gravel in deserts: 0 – 2400 m

Maire 1965

Coastal and interior dunes of North Afirca. Oasis in M’zab (Algeria) of northern Sahara. The High Plateau. Saharan Atlas Mountains range. Oasis in Ahaggar Mountains (Algeria).

Rechinger 1968

Western and southern Europe, North Africa, western Syria, Iraq, Anatolia, Cyprus, Iran and Armenia.

Jafri 1977

N. Africa, S. Europe, eastwards to Pakistan. Recorded collections in coastal Libya.

Battandier and Trabut 1888-90

Coastal Algeria, High Plateau, Sahara. Mediterranean region.

Global Biodiversity

Records found in the following countries:

Information Facility

Europe (Greece, Cyprus, France, Greece, Italy, Portugal, Spain,

(data.gbif.org)

Turkey) Africa (Algeria, Burkina Faso, Egypt, Libya, Morocco, Tunisia) Asia (Iraq, Israel, Jordan, Kuwait, Lebanon, Oman, Pakistan, Qatar, Saudi Arabia, Syria, United Arab Emirates)

132 133

References

134

Battandier, J. A. and L. Trabut. 1888-90. Flore de L’Algerie: contenant la description de toutes

135

les plantes singnalee’s fusqu a ce jour comme spontanees en Algerie et catalogue des

136

plants du Maroc, Dicotyledones. Librairie Adolphe Jourdan. Alger.

137

Jafri, S.M.H. 1977. Flora of Libya: Brassicaceae. 23. Al Faatech University. Tripoli.

138

Jalas, J., J. Suominen, and R. Lampinen. 1996. Atlas Florae Europaeae: Distribution of Vascular

139

Plants in Europe. The Committe for Mapping the Flora of Europe and Societas Biologica

140

Fennica Vanamo, Helsinki.

141

Maire, R. 1965. Flore de L'Afrique du Nord. Paul Lechevalier, Paris.

142

Miller, A. G. and T. A. Cope. 1996. Flora of the Arabian Peninsula and Socotra. Edinburgh

143

University Press, Edinburgh.

144

Rechinger, Karl. H (ed.). 1968. Flora Iranica: Flora des iranischen Hochlandes und der

145

umrahmenden Gebirge. Cruciferae. 57. Akademische Druck – u. Verlagsanstalt, Graz.

146

Townsend, C. C. and E. Guest. 1980. Flora of Iraq. Ministry of Agriculture and Agrarian

147

Reform, Baghdad.

148

Zohary, M. 1966. Flora Palaestina. The Israel Academy of Sciences and Humanities, Jerusalem.

149

Zohary, M., C. C. Heyn, and D. Hller. 1980. Conspectus Florae Orientalis: An Annotated

150

Catalogue of the Flora of the Middle East. The Israel Academy of Sciences and Humanities,

151

Jerusalem.

152 153

Appendix IV

154

Using MaxEnt to model the distribution of range-expanding species

155

The expansion of a non-native species means that its range does not reflect its stable

156

relationship with the invaded environments (Elith et al. 2010a). This lack of equilibrium presents

157

challenges for modeling potential distribution using data from current distribution.

158

Solutions to this problem include using less complex models and comparing models based on

159

different background samples (Elith et al. 2010a).

160 161

We reduced the complexity of our models in three ways. First, we used only four climatic variables that are most biologically relevant to our focal species. Second, we used only the hinge

162

and quadratic features in MaxEnt. Choosing the two features means that the modeled distribution

163

is constrained by the mean and variance of the given climatic variables, may have piecewise

164

linear response to any of them, but is not constrained by any interaction between them (Elith et

165

al. 2010b). Third, we increased the regularization parameter in MaxEnt (from the default value of

166

1 to 2.5) to reduce the complexity of the surface of fitted models. Hence, our models excluded

167

complicated detail response of species distribution to climate, which is more appropriate for

168

species that have formed a stable relationship with its environment (Elith et al. 2010a).

169

To further account for ongoing range expansion, we also allowed our models to provide

170

Sahara mustard with more potential space for expansion. We did so by adding models based on a

171

much larger background than those using our standard background. Our standard background

172

was a polygon that consists of the majority of southwestern North America. The enlarged

173

background was a rectangular region containing all lower 48 states of the U.S. and the entire

174

territory of Mexico. By choosing the standard background, we asked why Sahara mustard was

175

only found in certain areas of the Southwest given the spatial climatic variation within the region

176

and whether it could further expand in the Southwest. By choosing the enlarged background, we

177

asked why this species was only found in the Southwest given the climatic conditions across

178

North America and whether it could expand beyond the Southwest. To provide approximately

179

equal spatial density of sampling for our models, we drew 10,000 random samples from the

180

standard background and 25,000, from the enlarged background.

181 182

We found that models based on both backgrounds allow us to reach the same conclusion that 1) Sahara mustard in North America is restricted by its climatic envelope (Fig. A3) and 2) the

183

climate in the invaded range generally predicts the native distribution (Fig. A4). The model

184

predictions also allow us to infer the climatic range under which Sahara mustard is likely to be

185

present in both its invaded and native range (Fig. A5).

(a)

(b)

186 187 188 189

Figure A3. Distribution of Sahara mustard within its climatic niche in North America predicted by SDMs. We used background samples from (a) southwestern North America (SWNA) and (b) North America (NA) to build the models. For each background scenario, we trained two models: one using only

190 191 192 193 194

herbarium records and the other, herbarium and invasive plant survey records combined. We then derived an ensemble from the two models. Each ensemble shows the area predicted by both models (peach) and by each model alone (green or yellow). The maps also show the occurrence of Sahara mustard recorded by herbarium collections (red circles) and invasive plant surveys (blue triangles).

(a)

(b)

195 196 197 198 199 200 201

Figure A4. Projected distribution of Sahara mustard in its native continents projected by SDMs based on its invaded range in North America. We used background samples from (a) southwestern North America (SWNA) and (b) North America (NA) to build the models. The building of the models was the same as described in Fig. A3. Shaded areas represent its native range estimated from the literature and the GBIF records.

(a)

TEMPCOLDQ

TEMPWARMQ

RAINCOLDQ

RAINYEAR

202

(b)

TEMPCOLDQ

TEMPWARMQ

RAINCOLDQ

RAINYEAR

203 204 205 206 207 208 209

Figure 5A. Range of climatic variables in areas where Sahara mustard is predicted to be present in (a) North America and (b) its native continents by the SDMs. The four variables are mean temperature of the coldest quarter (TEMPCOLDQ), mean temperature of the warmest quarter (TEMPWARMQ), precipitation of the coldest quarter (RAINCOLDQ), and annual precipitation (RAINYEAR). Temperature values are shown as degree Celsius x 10 and precipitation values, millimeters. The SDMs use southwest North America as the background.

210 211 212

Climatic variables used in the model and their contribution None of the four climatic variables used in the two background regions were overly

213

correlated (if |R| > 0.85) with each other (Table A4); therefore we included all of them in each

214

model.

215

To understand which variable is most important in limiting the species’ distribution, we

216

evaluated the contribution of each variable to a model using MaxEnt’s built-in evaluation

217

algorithm (Table A5). Models based on SWNA background include summer temperature as the

218

most influential variable, whereas models based on NA background include annual precipitation

219

as the most influential variable. MaxEnt’s evaluation of each variable’s contribution to a model is

220

sensitive to correlation between variables (though the model itself is not). In our models, summer

221

temperature is correlated with winter temperature, and annual precipitation, with winter

222

precipitation (Table A4). Therefore, the results can only allow us to suggest that temperature

223

variation drives the species distribution within the Southwest, whereas precipitation is more

224

important in limiting its range to the Southwest.

225

Models based on the SWNA background predicted a smaller range than those based on the

226

NA background (Fig. A3). Since temperature is a more influential variable in SWNA

227

background models, this stronger climatic restriction suggests that Sahara mustard would have a

228

broader range in the Southwest if temperature were not a limiting factor. Given that regional

229

distributions are likely to shift following a changing global climate, Sahara mustard might be

230

predicted to expand particularly in response to elevated temperatures.

231

Table A4. Pearson coefficient of the four climatic variables used for building SDMs drawn from the

232

two background regions: a polygon that consists of the majority of southwestern region in North America

233

(SWNA) and a rectangular region containing all lower 48 states of the United States and the entire

234

territory of Mexico (NA). TEMPCOLDQ SWNA

NA

RAINCOLDQ

-0.1605

0.0523

RAINYEAR

-0.0470

TEMPWARMQ

0.7582

RAINCOLDQ SWNA

NA

0.2770

0.7382

0.7383

0.7257

-0.4071

-0.0943

RAINYEAR SWNA

NA

-0.4198

0.1275

TEMPCOLDQ

235 236

Table A5. The estimated influence of climatic variables on each SDM. To determine the “percent

237

contribution”, in each iteration of the training algorithm, the increase in regularized gain of the model is

238

added to the contribution of the corresponding variable, or subtracted from it if the gain is negative. To

239

determine the “permutation importance”, each variable was selected in turn, the values of that variable on

240

training presence and background data are randomly permuted. The model is re-evaluated on the

241

permuted data, and the resulting drop in training AUC (normalized to percentages) is shown.

242

Training Data Herbarium records Variable

Backg round SWN A

TEMPWARMQ TEMPCOLDQ RAINCOLDQ

Percent Permutation contribution importance

42.2 30.6 14.5

33.4 10.7 27.2

Herbarium and invasive plant survey records combined Percent Permutation Variable contribution importance TEMPWARMQ RAINYEAR TEMPCOLDQ

51.6 18.8 17.6

42.1 33.7 6.3

NA

RAINYEAR

12.6

28.7

RAINCOLDQ

12

17.8

RAINYEAR TEMPCOLDQ TEMPWARMQ RAINCOLDQ

41.9 30.7 19.1 8.4

71.8 14.5 4.1 9.6

RAINYEAR TEMPCOLDQ RAINCOLDQ TEMPWARMQ

61.1 27.8 6.5 4.6

67.7 18.6 6.4 7.3

243 244 245

Model validation

246

We tested our SDM models using 10-fold cross validation and used the test score of the Area

247

Under the receiver operating characteristic Curve (AUC) to judge whether each model performed

248

reasonably well (Table A6). An AUC of 0.5 means the model prediction is no better than

249

random, whereas a value closer to the maximum achievable AUC indicates better performance of

250

a model. The maximum achievable AUC is 1-a/2, where a is the prevalence of the species over

251

the sampled region (Phillips et al. 2006). This means that models trained by records of lower

252

prevalence (e.g. fewer records or larger background) will have a higher maximum achievable

253

AUC values. Therefore, models with higher AUC scores in the table are not necessarily better.

254

Unfortunately, using presence-only data means that the prevalence of a species is unknown (no

255

information on absence) and thus the maximum achievable AUC cannot be estimated. In our

256

study, we used the test AUC to affirm that each model performed reasonably well (AUC>0.5) but

257

did no judge how much the model deviates from its theoretical optimum. We note that MaxEnt

258

has been repeatedly shown to produce some of the most robust SDMs using presence-only data

259

when compared with other modeling methods (Elith et al. 2006; Phillips et al. 2006).

260

If a model passed the test, we trained the model used the entire dataset and accepted its

261

logistic output as the relative probability of the species’ presence in a spatial unit. We used the

262

logistic output threshold at which the model achieved maximum training sensitivity plus

263

specificity (minimizing the error rate for both positive and negative observations (Freeman &

264

Moisen 2008)) and treated any logistic value under this threshold as an indication of absence.

265 266

Table A6. The AUC score of the MaxEnt species distribution model based on two different backgrounds

267

(SWNA and NA) and using two different datasets: herbarium records only (H) and by combining

268

herbarium and invasive plant survey records (H+IS).

269

Background

Training Data H

H+IS

SWNA

0.903+0.016

0.879+0.006

NA

0.977+0.004

0.963+0.002

270 271

References

272

Elith, J., Graham, C.H., Anderson, R.P., Dudik, M., Ferrier, S., Guisan, A., et al. 2006. Novel methods

273

improve prediction of species’ distributions from occurrence data. Ecography, 29, 129–151.

274

Elith, J. et al. 2010a. The art of modelling range‐shifting species. - Methods in Ecology and Evolution 1: 330–

275

342.

276 277

Elith, J. et al. 2010b. A statistical explanation of MaxEnt for ecologists. - Diversity and Distributions 17: 43– 57.

278

Freeman, E. A. and Moisen, G. G. 2008. A comparison of the performance of threshold criteria for binary

279

classification in terms of predicted prevalence and kappa. - Ecological Modelling 217: 48–58.

280

Phillips, S.J., Anderson, R.P. & Schapire, R.E. 2006. Maximum entropy modeling of species geographic

281

282

distributions. Ecological Modelling, 190, 231–259.

283 284

Appendix V.

285

Examining the influence of decadal cold-season precipitation on the local expansion of

286

Sahara mustard

287

Introduction

288

As a winter annual plant, Sahara mustard experiences population boom in response to high

289

precipitation over the cold (late fall-winter-early spring) season. Moreover, precipitation in fall

290

and early winter may favor its growth more strongly than late cold-season precipitation because

291

early winter rainfall followed by later precipitation events provides a larger time window for the

292

species to outperform native annual plants (Barrows et al. 2009; Marushia et al. 2010).

293

Therefore, years of high (early) cold-season precipitation provide potential temporal niche

294

opportunities for its expansion. To examine whether such niche opportunities exist, we assessed

295

the relationship between the species’ local expansion and cold-season precipitation at the decadal

296

scale.

297

Methods

298

We located local expansion hotspots using the same box-counting method (see Method in

299

main text). We first evenly divided the southwestern North America into 100 km squares as our

300

focal cells. We then divided each focal cell into 1, 5 and 10 km square cells and calculated the

301

expansion rate within a focal cell (not adjusted for sampling effort) at those three local scales.

302

We chose the maximum expansion rate among the three as the expansion rate in a focal cell and

303

located cells that experienced high rates. We considered those focal cells as local expansion

304

hotspots in that decade (Fig. A6). The calculation was done for each of the five decades (the

305

1960s – the 2000s).

306

We evaluated the decadal cold-season precipitation in those hotspots using weather data

307

from the United States Historical Climatology Network. We acquired monthly precipitation data

308

from weather stations that are located either within or adjacent to the hotspots (Table A7). We do

309

not suggest that these stations faithfully represent the climate of each hotspot. Our interest is in

310

revealing the general trend of winter precipitation between decades, which we believe is

311

consistent over large spatial scales and therefore should be comparable between the hotspots and

312

their correspondent weather stations. For each station, we calculated the annual cold-season

313

precipitation as the mean monthly precipitation from October to April. We calculated its long-

314

term mean averaged over year 1929-2010. We then calculated the abnormality of decadal cold-

315

season precipitation (cold-season Ad) as the percent deviation of the decadal mean from the long-

316

term mean. For instance, a cold-season Ad of 0.2 means the cold-season precipitation averaged

317

over that decade is 20% higher than the long-term mean. We used Wilcox signed rank test to

318

determine whether the cold-season Ad averaged over those hotspots in each decade was

319

significantly above zero.

320

We also located focal cells that were hotspots in one decade but experienced negligible

321

expansion in the following decade. We considered those cells as local expansion coldspots for

322

that following decade and calculated cold-season Ad in those cells. We compared the cold-season

323

Ad averaged over hot- and coldspots of the same decade (Wilcox rank sum test) to examine

324

whether the former received significantly more rainfall. We only compared hot- and coldspots in

325

the 1970s – 1990s, the three decades in which similar number of hot- and coldspots were

326

detected.

327

Moreover, since high amount of rainfall early in the winter season may give Sahara mustard,

328

a species characterized by rapid phenology, an advantageous start, we performed the same

329

analyses using early cold-season Ad, namely the decadal abnormality of mean monthly

330

precipitation from October to December.

331

We repeated all the above analyses using precipitation data over the decade prior to any local

332

expansion to indicate whether a local expansion was a delayed response to historical conditions.

333

Finally we applied False Discovery Rate analysis (QVALUE in R package) to control the

334

increased chance of listing a false positive test in this multiple-hypotheses test.

335

Results

336

Neither the cold-season nor the early cold-season Ad in local expansion hotspots were

337

consistently positive across the five decades (Table A8). The cold-season Ad was significantly

338

positive in the 1970s, 80s and 90s, but not so in the 1960s and significantly negative in the

339

2000s. The early cold-season Ad was significantly positive in the 1960s and 80s, but not so in the

340

1970s and 90s, and significantly negative in the 2000s. Comparing early cold-season Ad between

341

hot- and coldspots, we found that those in hotspots were not significantly higher than in

342

coldspots in any decade.

343

Among the three decades (the 1960s, 70s and 2000s) in which Sahara mustard achieved the

344

most rapid local expansion (Fig. 1 in main text; Fig. A6), only the 1960s had local expansion

345

hotspots experiencing a substantial increase (+37.5%) in early cold-season precipitation but no

346

increase in cold-season precipitation (Table A8). The hotspots in the 1970s experienced a modest

347

increase (+5.9%) in cold-season precipitation, but the coldspots in the same decade received

348

similar amount of rainfall. The 2000s saw the highest rate of local expansion and the largest

349

number of hotspots. However, hotspots in this decade experienced a significant decline in cold-

350

season (-17.7%) and early cold-season precipitation (-19.1%).

351

Neither did results suggest that the local expansion was a delayed response to previous

352

decade’s high precipitation. Among the three decades (the 1970s, 90s and 2000s) in which either

353

cold-season or early cold-season Ad was significantly positive in the previous decade, none had

354

hotspots exceeding coldspots by their previous-decade cold-seaosn Ad (Table A8).

355 356

Discussion

357

Our results do not support the hypothesis that decadal variation in climate explains the

358

decadal change in local expansion of Sahara mustard. Neither the rapid local expansion nor the

359

difference between local expansion hotspots and coldspots can be consistently explained by

360

higher cold-season precipitation averaged over each decade or the previous decade.

361

The local population growth of Sahara mustard may respond to climatic fluctuation at a

362

much shorter time scale. One or two years of heavy seasonal rainfall is sufficient to trig a local

363

population boom of an annual species, strongly increasing the local occurrences recorded in a

364

decade, but not raising the average precipitation over the same decade. For example, herbarium

365

records of Sahara mustard in 2005 (78 records) make up more than a quarter of the 272 records

366

in the 2000s as a result of an extremely wet winter and spring in the 2004-2005. However, the

367

cold-season precipitation averaged over the 2000s is significantly below the long-term average.

368

Population data of higher temporal resolution is needed to investigate whether the local

369

expansion of this species tracks the climate at a much finer temporal scale.

370 371

References

372

Barrows, C. W. et al. 2009. Effects of an invasive plant on a desert sand dune landscape. -

373 374 375 376 377

Biological Invasions 11: 673–686. Marushia, R. G. et al. 2010. Phenology as a basis for management of exotic annual plants in desert invasions. - Journal of Applied Ecology 47: 1290–1299.

378

Table A7. Locations of weather stations in the United States Historical Climatology Network

379

that are either in or adjacent to local expansion hotspots and coldspots in each decade from 1960s

380

to 2000s. The weather data from those weather stations were used to calculate the cold-season

381

precipitation in each hot- or coldspot.

382 Decade

1960s

1970s

1980s

Ajo, AZ

Ajo, AZ

Buckeye, AZ

Buckeye, AZ Chandler Heights, AZ

Buckeye, AZ

Parker, AZ

Kingman, AZ

Yuma, AZ

Miami, AZ

Prescott, AZ

Tucson WFO, AZ

Prescott, AZ

Childs, AZ

Parker, AZ

Wickenburg, AZ

Blythe, CA

Roosevelt 1 WNW, AZ

Yuma, AZ

Chula Vista, CA

Tombstone, AZ

Blythe, CA

Cuyamaca, CA

Wickenburg, AZ

FT Valley (Flagstaff), AZ Grand Canyon NP, AZ Kingman, AZ

Indio, CA

Pasadena, CA

Blythe, CA

Miami, AZ

Redlands, CA Santa Barbara, CA Tustin Irvine RCH, CA Boulder City, NV

Redlands, CA

Brawley, CA

Parker, AZ

Indio, CA

Prescott, AZ

Roosevelt 1 WNW, AZ Sacaton, AZ Tucson WFO, AZ Brawley, CA Indio, CA Locations of weather stations that are in or adjacent to local expansion hotspots

Redlands, CA

El Paso, TX

1990s Chandler height, AZ Miami, AZ Pearce Sunsites, AZ

Needles, CA Ojai, CA Santa Barbara, CA Jornada Exp Range, NM Orogrande, NM NM State University (Las Cruces), NM

2000s Ajo, AZ Buckeye, AZ Chandler Heights, AZ

Roosevelt 1 WNW, AZ Sacaton, AZ Safford Agricultural center, AZ Tucson WFO, AZ Wickenburg, AZ Williams, AZ Blythe, CA Brawley, CA Cuyamaca, CA Indio, CA Needles, CA Newport Beach Harbar (Huntington Beach), CA Ojai, CA Pasadena, CA Santa Barbara, CA

Tustin Irvine RCH, CA Boulder City, NV Kanab, UT St George, UT Zion NP, UT Locations of weather stations that are in or adjacent to local expansion coldspots

Buckeye, AZ

Ajo, AZ

Buckeye, AZ

Pearce Sunsites, AZ

Chandler Heights, AZ

Kingman, AZ

Parker, AZ

Tombstone, AZ

Miami, AZ

Prescott, AZ

Tucson WFO, AZ

Parker, AZ

Wickenburg, AZ

Yuma, AZ

Yuma, AZ

Chula Vista, CA

Indio, CA Santa Barbara, CA Boulder City, NV El Paso, TX

Cuyamaca, CA

Roosevelt WNW, AZ

1

Tucson WFO, AZ Brawley, CA Indio, CA

383

Pasadena, CA Redlands, CA

Jornada Exp Range, NM Orogrande, NM NM State University (Las Cruces), NM

Table A8. Cold-season and early cold season Ad averaged over local expansion hotspots and coldspots in each decade. The top half of the table shows the mean decadal abnormality ( Ad ) of cold-season (Oct-Apr) and early cold-season (Oct-Dec) precipitation averaged over local expansion hotspots in each decade. The bottom half of the table shows the increase in Ad from local expansion coldspots to hotspots. “Current decade” rows show results derived from precipitation data over the same decade as the local expansion occurred, whereas “Previous decade” rows, over the decade prior to the local expansion. P-values were estimated by using Wilcox signed rank test (hotspots) and Wilcox rank sum test (hotspots vs. coldspots). Q-values were estimated by using QVALUE package in R. The q-value cut-off was set to 0.005, corresponding to a false discovery of 0.065 tests among the 13 tests that were significant. Ad that are significantly different from zero were highlighted in bold.

Current decade

Previous decade Current decade

Previous decade

Hotspots

Focal decade

Hotspots - Coldspots

384 385 386 387 388 389 390 391 392

1960s

1970s

1980s

1990s

2000s

Ad

P-value

q-value

Ad

P-value

q-value

Ad

P-value

q-value

Ad

P-value

q-value

Ad

P-value

q-value

cold-season

0.002

1

0.156

0.059

0.057

0.019

0.086

0.012

0.004

0.190

< 0.01

< 0.001

-0.177

< 0.01

< 0.001

early coldseason

0.375

< 0.01

< 0.001

0.076

0.340

0.074

0.236