Evaluating correlative and mechanistic niche ... - Wiley Online Library

56 downloads 0 Views 1MB Size Report
Jul 31, 2014 - SUNIL KUMAR,1,| LISA G. NEVEN,2 ..... above 8.38C (van Kirk and AliNiazee 1981), (2) ..... Kirk and AliNiazee (1982), who found that the.
Evaluating correlative and mechanistic niche models for assessing the risk of pest establishment SUNIL KUMAR,1,  LISA G. NEVEN,2

AND

WEE L. YEE2

1 Natural Resource Ecology Laboratory, Colorado State University, Fort Collins, Colorado 80523 USA United States Department of Agriculture, Agricultural Research Service, Yakima Agricultural Research Laboratory, 5230 Konnowac Pass Road, Wapato, Washington 98951 USA

2

Citation: Kumar, S., L. G. Neven, and W. L. Yee. 2014. Evaluating correlative and mechanistic niche models for assessing the risk of pest establishment. Ecosphere 5(7):86. http://dx.doi.org/10.1890/ES14-00050.1

Abstract. Insect pests pose a great threat to global food security. Improved methods for assessment of the risk of pest establishment are needed to enhance informed decision-making, to develop cost-effective pest management strategies, and to design quarantine policies for preventing the spread of pests. We evaluated the capabilities of a correlative and a process-based mechanistic niche model, and their combination, to assess the risk of pest establishment. The correlative model MaxEnt and the process-based mechanistic model CLIMEX were used to assess the risk of establishment of western cherry fruit fly, Rhagoletis indifferens Curran (Diptera: Tephritidae) in California. We integrated R. indifferens occurrence records and spatial environmental variables using MaxEnt to assess the potential risk of establishment of this pest. The CLIMEX model was developed using eco-physiological tolerances of R. indifferens. The predictive performance of the MaxEnt model improved by including the host species’ distribution and Ecoclimatic Index generated using the CLIMEX model. The best model predicted no risk for R. indifferens establishment in the Central Valley around the areas where sweet cherries are produced in California. Most of the high to very high risk areas for R. indifferens were predicted in northern parts of California and the Sierra Nevada Mountains, where the fly exists on its native host, bitter cherry [Prunus emarginata (Douglas) Eaton]. Precipitation of driest quarter, degree days with average temperatures 8.38C, degree days with average temperatures 58C, and mean diurnal range in temperature were the strongest predictors of R. indifferens distribution in western North America. We showed that the predictive power of correlative niche models can be improved by including outputs from the process-based mechanistic niche models. Overall results suggest that R. indifferens is unlikely to establish in the commercial cherry-growing areas in the Central Valley of California, largely because heat stress is too high and chilling requirement in those areas is not met. Key words: CLIMEX; insect pests; MaxEnt; niche modeling; presence-only model; Rhagoletis indifferens; species distribution models; sweet cherry; western cherry fruit fly. Received 11 February 2014; revised 7 May 2014; accepted 13 May 2014; final version received 12 June 2014; published 31 July 2014. Corresponding Editor: K. Haynes. Copyright: Ó 2014 Kumar et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. http://creativecommons.org/licenses/by/3.0/   E-mail: [email protected]

INTRODUCTION

economy and global food security. Resource managers need improved methods for assessment for the risk of pest establishment to enhance decision-making, developing cost-effective pest management strategies, and designing quaran-

Invasive alien insect pests cause enormous damage to the environment, human health and wildlife health, and pose a great threat to the v www.esajournals.org

1

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL.

tine policies for preventing the spread of invasive pests. One method for assessing the risk of pest establishment is through use of niche models that correlate known occurrences of species with environmental variables and predict species potential distributions (Peterson et al. 2011). The niche models are based on classical concept of ‘‘niche’’ in ecology, and model potential or realized distribution of a species depending on the modeling algorithm used (Jimenez-Valverde et al. 2008, Franklin 2009, Peterson et al. 2011). Niche models can be broadly classified into two groups: correlative models and process-based or mechanistic models (Dormann et al. 2012). The correlative niche models associate species occurrence data with spatial environmental layers of the study area and produce maps of probability of presence or relative environmental suitability for a species. The process-based or mechanistic niche models use species’ functional traits and physiological tolerances for model fitting (Kearney et al. 2010). The correlative models generally do not perform well when extrapolated to novel environments (Webber et al. 2011, Owens et al. 2013). However, the extrapolations can be improved by fitting these models using biologically informed, hypothesis-driven variables, and minimizing model complexity (Kumar et al. 2014). Correlative models can be fitted using existing species occurrence data from museums/herbaria (Kearney et al. 2010, Elith and Franklin 2013), whereas most process-based mechanistic models need detailed experimental data that may not be available for the target species (Dormann et al. 2012). Both types of models have been used in quantifying and mapping the potential risk of establishment of insect pests in areas outside their current distributional range (e.g., Lozier and Mills 2011, Ni et al. 2012, de Villiers et al. 2013). This approach has rarely been applied to temperate fruit flies such as R. indifferens. In this study we evaluated the capabilities of a correlative model MaxEnt and a process-based mechanistic model CLIMEX, and their combination, to assess the risk of pest establishment. The western cherry fruit fly, Rhagoletis indifferens Curran (Diptera: Tephritidae), is a major quarantine pest of sweet cherry, Prunus avium (L.) L., in the western U.S. and potentially could invade areas of the U.S. that currently do not have the fly. The fly is native to Washington, v www.esajournals.org

Oregon, Montana, Idaho, British Columbia in Canada and in northern California (Bush 1966). The fly moved into drier areas of Washington and Oregon after cherries were introduced into these areas in the 1850s and was first detected attacking cultivated cherries (sweet cherry and sour cherry, P. cerasus L.) in Oregon in the early 1900s (Wilson and Lovett 1913). While the fly is found in northern California (Mackie 1940, Frick et al. 1954, Blanc and Kiefer 1955) on its native host, bitter cherry, Prunus emarginata (Douglas) Eaton (Curran 1932), it has not been detected farther south in the commercial cherry growing regions of California. However, there are concerns that it may invade those areas in the future. The major production areas for commercial sweet cherries in California are located in the Central Valley from Lodi to Bakersfield (http://calcherry.com/about/history. cfm?CFID¼18447373&CFTOKEN¼91065340). The fly in California is apparently confined to native bitter cherry in the northern and southern parts of the state at high altitudes (Mackie 1940, Frick et al. 1954, Blanc and Kiefer 1955). If true, one hypothesis is that the lower-altitude areas in California where commercial cherries are grown are too warm for the flies, which require long periods of chilling to break diapause (Frick et al. 1954). Currently, for cherry growers in the Pacific Northwest of the U.S. to ship sweet cherries to California, they must maintain an active control program for R. indifferens that includes monitoring through trapping, adhering to a strict pesticide application schedule, shipping through California designated ‘Approved Shippers’, performing both ‘porch sampling’ (sampling of 1–5 lbs. of cherries from each load of cherries, depending on the size of the load, that is delivered to the warehouse) and sampling of packed fruit from all lots using the approved ‘brown sugar’ or ‘hot water’ screening procedure (Brown 2009), and be subjected to rigorous border inspection (CDFA, http://pi.cdfa.ca.gov/ pqm/manual/htm/305.htm). The penalty for any grower whose lot is found to contain one or more larvae is elimination of the grower from shipping cherries to California for the remainder of the shipping season. Also, packinghouse approval for shipping cherries to California can be revoked if the larval detection threshold is exceeded (i.e., 2

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL.

two or more larval detections from a production zone results in elimination of the grower from the program for the whole zone) (CDFA 2012). However, if the flies cannot survive or establish in the cherry-growing areas of California, then the stringent protocol for the shipment of cherries to California from areas where R. indifferens is known to occur can be relaxed. Since R. indifferens has never been reported in commercial sweet cherries in California, information documenting the unlikelihood of this pest being present or establishing in the commercial sweet cherry-growing regions may be beneficial in negotiating trade agreements with countries restricting the importation of this commodity due to concerns over this pest. The objectives in this study were to: (1) evaluate the capabilities of the correlative model MaxEnt and the process-based mechanistic niche model CLIMEX to assess the risk of R. indifferens establishment in the commercial cherry-growing areas of California; (2) identify environmental factors associated with R. indifferens distribution; (3) test whether including a host species’ distribution improves the niche models’ accuracy; and (4) test whether models of pest risk establishment can be improved by combining a correlative and a process-based model. To capture the entire environmental niche of R. indifferens, we developed models for western North America (its current distributional range) and extracted the maps for California.

et al. (1993: Map 68) using ArcGIS (ESRI 2006). A total of 154 records were obtained covering the entire range of the distribution of R. indifferens that included eight western U.S. states and southern British Columbia, Canada (Fig. 1A). Four duplicate occurrence records were removed and the remaining 150 spatially unique (one record per 5 3 5 km cell) records were used in spatial modeling and other analyses. A total of 862 occurrence records for the major native host, bitter cherry, were acquired from GBIF, Intermountain Region Herbarium Network (http:// intermountainbiota.org/portal/index.php), Calflora (http://www.calflora.org/) and E-Flora of British Columbia (http://www.geog.ubc.ca/ biodiversity/eflora/) (see Appendix: Fig. A1). The number of bitter cherry occurrences was reduced from 862 to 635 after we used ‘spatial filtering’ strategy to filter bitter cherry occurrence points to make sure that data points were at least 12.6 km apart and to achieve spatial independence (Veloz 2009). Moran’s I correlograms were generated to examine spatial autocorrelation in model residuals (1  predicted probability of presence; De Marco et al. 2008) using the ‘sp.correlogram’ function in the ‘‘spdep’’ package in R (R Development Core Team 2012; Appendix: Fig. A2).

Environmental data Twenty nine environmental variables were considered as potential predictors of R. indifferens distribution (Appendix: Table A1). These variables were chosen based on the fly’s biology and ecological requirements, and similar niche modeling studies on other fruit flies and insects (e.g., Li et al. 2009, De Meyer et al. 2010, Sambaraju et al. 2012). These variables included climatic, topographic, and species-specific phenology variables as well as human factors. Nineteen Bioclim variables were obtained from the WorldClim dataset (http://www.worldclim.org; Hijmans et al. 2005). Apart from these 19 variables, we also calculated three phenology variables for R. indifferens based on its lower development threshold and chilling requirement: (1) number of degree days with average temperatures at or above 8.38C (van Kirk and AliNiazee 1981), (2) number of degree days with average temperatures at or below 38C, and (3) at or below 58C (chilling requirement; Frick et al. 1954, van Kirk

METHODS Species occurrence data Presence records for R. indifferens were collected from published papers, reports, and books (Appendix: Fig. A1). Five occurrence records were obtained from the Global Biodiversity Information Facility (GBIF) website (http://data. gbif.org). Three presence records for Utah were collected from the Utah State University’s Cooperative Extension website (http://utahpests.usu. edu/ipm/files/uploads/PPTDocs/04sh-insectswcffcontrol.pdf ). Google Map (http://maps. google.com/) was used to record geographic coordinates of locations (using the ‘‘What’s here?’’ feature) where exact latitude and longitude data were not provided. We also digitized R. indifferens occurrence data published in Foote v www.esajournals.org

3

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL.

Fig. 1. (A) Occurrence of western cherry fruit fly R. indifferens, and potential environmental suitability for R. indifferens predicted by (B) MaxEnt_Env, (C) MaxEnt_EnvHost, (D) MaxEnt_EnvHostClimex, and (E) CLIMEX models in western North America and California. Environmental suitability predicted by MaxEnt models varied from 0 (lowest) to 1 (highest), and Ecoclimatic Index (EI) from CLIMEX model varied from 0 (unsuitable) to 100 (suitable; EI . 0).

v www.esajournals.org

4

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL.

and AliNiazee 1981, 1982), using monthly temperature data layers in ‘‘Raster Calculator’’ in ArcGIS (ESRI 2006). We also included direct solar radiation, elevation, and potential evapotranspiration (Appendix: Table A1). The probability of presence of the native host of R. indifferens, bitter cherry, was modeled using MaxEnt and included in the R. indifferens model. For bitter cherry modeling we also considered slope, aspect (converted into northness and eastness gradients), and growing degree days with average temperature .08C (Appendix: Table A1). Bioclimatic variables were obtained at ;5-km spatial resolution to account for potential spatial inaccuracies during digitization of presence records from the published maps (e.g., Foote et al. 1993). Other layers obtained at 1-km resolution were resampled to a ;5-km resolution to match with the 19 bioclimatic variables.

employed to evaluate the relative influence of different environmental predictors on R. indifferens distribution. The MaxEnt generated species’ ‘Response Curves’ that show the relationships between predicted probabilities of presence for a species and different environmental predictors were also examined. All environmental variables were examined for cross-correlations to address potential problems due to multicollinearity (Dormann et al. 2013). Only one variable from each set of the highly correlated predictors (Pearson correlation coefficient, r  0.75 or  0.75) was included in the model (Appendix: Table A3). The decision to include or exclude a highly correlated variable was made based on its biological relevance to R. indifferens, ease of interpretation, and its relative predictive power (based on the training gain in the preliminary MaxEnt model). For example, degree days with average temperature 8.38C and annual potential evapotranspiration were highly correlated (r ¼ 0.85, P , 0.0001), so we dropped the latter and included the former. The final MaxEnt model included only seven variables: degree days with average temperature 8.38C, degree days with average temperature 58C, mean diurnal range in temperature, mean temperature of wettest quarter, mean temperature of driest quarter, precipitation of driest quarter, and precipitation of coldest quarter (Appendix: Tables A1 and A2). Background selection and sampling bias.—The background extent in MaxEnt was defined based on the Biotic-Abiotic-Mobility (BAM) diagram, a framework suggested by Soberon and Peterson (2005); and included regions that have been accessible to R. indifferens since 1900s (Barve et al. 2011, Saupe et al. 2012; Appendix: Fig. A1). To account for potential sampling bias, 10,000 random background data points (or pseudoabsences) were drawn using a kernel density estimator (KDE) surface (see Appendix: Fig. A1). Background data points drawn in this way had the same sampling bias as the occurrence data and both biases are cancelled out in the modeling process. The KDE surface was generated using all the occurrence data points in Arc GIS using ‘‘kernel density’’ and ‘‘create spatially balanced points’’ tools in Arc Tools. Environmental variables’ values were extracted for presence and background points and models were trained

Modeling methods Correlative niche modeling: MaxEnt.—A number of correlative niche modeling algorithms are available for modeling potential distribution of a species (Franklin 2009). We used the most commonly employed correlative niche model, MaxEnt (version 3.3.3k; Phillips et al. 2006). MaxEnt is a presence-only method and recent studies on distribution modeling of insect pests and other species in different parts of the world have demonstrated its effectiveness (e.g., Kumar et al. 2009, Li et al. 2009, De Meyer et al. 2010). MaxEnt generates an estimate of the probability of presence (or relative environmental suitability) of a species that varies from 0 (lowest) to 1 (highest). We initially ran models with default settings in MaxEnt, which resulted in highly complex models and nonsensical species’ response curves (Appendix: Fig. A3). We also ran MaxEnt with different values of the regularization parameter (L ¼ 1.5 and 2.0) and left other settings at default, but that also resulted in quite a complex model (Appendix: Table A2). Therefore, we changed Maxent’s default settings and used only linear, quadratic, and product features to keep the models simple and to avoid overfitting. The ‘fade-by-clamping’ option was used to prevent extrapolations outside the environmental range of the training data (Owens et al. 2013). The ‘jackknife’ feature in MaxEnt was v www.esajournals.org

5

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL.

using ‘‘samples with data’’ (SWD) format in MaxEnt. The trained models were projected (using ‘‘projection’’ feature) to the entire range of fly occurrences (western North America; Fig.1B–E). Predicted maps for California were clipped from the full extent predictions (Fig. 2). Model selection.—We used Akaike’s Information Criteria (AIC) and the information-theoretic approach (Burnham and Anderson 2002) to evaluate multiple models and select the ‘‘best’’ models for R. indifferens and bitter cherry. We calculated AICc (AIC corrected for small sample sizes) for different Maxent models using ENMTools (Warren et al. 2010). The models were ranked by calculating differences in AICc values as DAICci ¼ (AICci  minimum AICc). The best model has DAICci ¼ 0 and only the models with DAICci  2 have substantial support (Burnham and Anderson 2002). The values of DAICc also helped us test whether inclusion of a particular variable improved the model. For example, we included host plant species data and Ecoclimatic Index generated by CLIMEX in the MaxEnt model and tested whether they improved the model. We evaluated seven different MaxEnt models: MaxEnt_Env (model with only environmental variables); MaxEnt_EnvDef (MaxEnt_Env with default settings), MaxEnt_EnvHost (MaxEnt_Env and host); MaxEnt_EnvHostClimex (MaxEnt_Env, host and CLIMEX outputs), MaxEnt_Climex (model with only CLIMEX outputs), MaxEnt_ClimexHost (MaxEnt_Climex and host), and MaxEnt_ClimexHostTopo (MaxEnt_Climex, host and topographic variables). Model evaluation and validation.—We used 80% of occurrence data for training (n ¼ 120 for R. indifferens, n ¼ 508 for bitter cherry) the MaxEnt models and withheld the remaining 20% (n ¼ 30 for R. indifferens, n ¼ 127 for bitter cherry) for independent validation of model performance. Rhagloetis indifferens models were also tested using independently collected fly presence data (n ¼ 51) for California from Dowell and Penrose (2012). We chose the commonly used metric AUC or area under the receiver operating characteristic (ROC) curve (Fielding and Bell 1997, Phillips et al. 2006) as one of the measures for evaluating model performance. The AUC is a thresholdindependent measure of a model’s ability to discriminate presence from absence (or background). It varies from 0 to 1; an AUC value of v www.esajournals.org

0.5 shows that model predictions are not better than random; values ,0.5 are worse than random; 0.5–0.7 indicates poor performance; 0.7–0.9, reasonable/moderate performance; and .0.9, high performance (Peterson et al. 2011). The 10-fold cross-validation procedure in Maxent was used on 80% training data and averaged test AUC values across the 10 replicates were reported. We also used Pearson correlation coefficient between observed presence-random background points and predicted probabilities of presence to evaluate MaxEnt models (Elith et al. 2006). Validation AUC and sensitivity (fraction of correctly predicted presences) values were calculated using above independent datasets. Test sensitivity was calculated at a 0% training omission rate (or Lowest Predicted Threshold; LPT), 2% training omission rate, and 5% training omission rate. Zero percent omission rate means 100% of the training presence locations fall inside the suitable areas, and 5% training omission rate means 5% of the training localities fall outside the suitable areas. More details about different threshold selection for presence-only models are discussed by Liu et al. (2013). Process-based niche modeling: CLIMEX.—We also used a process-based niche model CLIMEX 3.0 (Sutherst et al. 2007) to develop a mechanistic simulation model to estimate the climatic suitability for the establishment of R. indifferens in western North America. CLIMEX does not use species occurrence data but estimates suitable areas based on climatic conditions alone; it assumes that there are no limiting factors other than climate (Sutherst et al. 2007). The ‘‘Compare Locations’’ function in CLIMEX was used to develop the simulation model for R. indifferens that calculated an annual index of climatic suitability by combining growth index, stress indices, and stress interaction indices (Sutherst et al. 2007, Kriticos and Leriche 2010). The recently published CliMond CM10_ 1975H_V1 climatic dataset (Kriticos et al. 2012; available at http://www.climond.org) interpolated at 10 arc minute (;18 km) resolution was used for CLIMEX modeling. This dataset has longterm monthly climate means centered on 1975 for precipitation, maximum temperature, minimum temperature, and relative humidity at 0900 and 1500 hours. CLIMEX generates an index of climatic suitability for the species called the 6

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL.

Fig. 2. Potential environmental suitability for R. indifferens predicted by MaxEnt_Env model for California at (A) 0% training omission rate or Lowest Predicted Threshold (LPT), (B) 2% training omission rate, (C) 5% training omission rate, (D, E) CLIMEX, and (F) CLIMEX generated Heat Stress Index. Black dots represent R. indifferens occurrences.

v www.esajournals.org

7

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL.

Ecoclimatic Index (EI) that varies from 0 to 100; where 0 represents locations that are unfavorable for long-term survival of the species, and values close to 100 indicate areas that have optimal conditions for the species growth year round. Following Kriticos et al. (2003), EI values were classified into four arbitrary categories: unsuitable (EI ¼ 0), marginal (EI ¼ 1–5), moderately favorable (EI ¼ 6–25), and highly favorable (EI . 25). Localities with EI . 0 were interpreted as correctly predicted presences for sensitivity calculations. Fitting CLIMEX parameters.—Values for CLIMEX model parameters were defined based on published laboratory studies and phenological observations on physiological tolerances of R. indifferens and were iteratively adjusted until the simulated distribution matched the known distribution of R. indifferens in western North America (Table 1). 1. Degree-days per generation (PDD).—The degree-day (DD) accumulation requirement for the completion of one generation of R. indifferens was set to 1800 based on a 58C lower optimum temperature threshold for growth based on the phenological studies conducted by Jones et al. (1991) and Song et al. (2003). Jones et al. (1991) detected first average fly emergence (based on trap captures) at 573 6 19.0 DD (mean 6 SE) in Utah and 592 6 42.1 DD in Washington. Jones et al. (1991) and Song et al. (2003) detected last adult spring emergence between 1700 and 1800 degree-days at various locations in Utah and Washington state. 2. Temperature index.—The lower temperature threshold for growth (DV0) was set at 38C based on van Kirk and AliNiazee (1982), who found that diapause development rate of R. indifferens pupae after exposure to different cold temperatures was optimum at 38C. Lower optimum temperature for growth (DV1) was set at 58C based on studies conducted by Frick et al. (1954) and van Kirk and AliNiazee (1981). The upper optimum temperature for growth (DV2) and the upper temperature threshold for growth (DV3) were set at 258C and 288C, respectively, based on postdiapause developmental rate function published in Stark and AliNiazee (1982). 3. Moisture index.—The soil moisture (SM) index in CLIMEX model is used as a proxy for moisture availability. A hydrological model that v www.esajournals.org

uses rainfall and evapotranspiration is used to calculate the weekly soil moisture balance for determining population growth. A value of SM ¼ 0 indicates no soil moisture; SM ¼ 0.5 indicates soil moisture content is 50% of field capacity; SM ¼ 1 indicates that the soil moisture content is 100% of capacity; and SM . 1.0 indicates the possibility of excessive amounts of rainfall and soil moisture (Sutherst et al. 2007). The initial soil moisture parameters for R. indifferens (SM0, SM1, SM2, and SM3; Table 1) were set based on Yee (2013) and the values were iteratively adjusted to fit the known distribution of the fly. Yee (2013) investigated the effects of different levels of soil saturation (0–76%) on R. indifferens emergence and mortality. The fly was tolerant of wide range of soil moisture conditions but his results showed emergence of higher percentage of deformed/ unhealthy flies at lower levels of soil saturation. 4. Cold stress.—The temperature threshold for cold stress (TTCS) was set to 68C based on van Kirk and AliNiazee (1982), who found that the diapause development was slow at 0 and 38C. The cold stress affected R. indifferens distribution in southern parts of British Columbia, Canada. The cold stress accumulation rate (THCS) was set to 0.001 week1 to fit the fly’s distribution in British Columbia. 5. Heat stress.—The heat stress affects the distribution of R. indifferens in the southwestern United States. The temperature threshold for heat stress was set at 288C because the upper threshold is close to 308C (Jones et al. 1991). The heat stress accumulation rate (THHS) was set to 0.009 week1 to match the fly’s current known distribution in the south-western U.S. 6. Dry stress.—The soil moisture threshold for dry stress (SMDS) and dry stress accumulation rate (HDS) constrained the fly’s distribution in south-eastern parts of California and southern Arizona and New Mexico. These parameters were iteratively adjusted to match the fly’s absence from these areas. 7. Wet stress.—High soil moisture appears to limit the distribution of R. indifferens in the Olympic Peninsula in western Washington, Vancouver Island, and the coastal areas of western Canada. Therefore, wet stress parameters were accordingly adjusted to ensure absence of the fly from these areas. 8

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL. Table 1. Parameters used for Rhagoletis indifferens distribution modeling using CLIMEX. Values were chosen based on Frick et al. (1954), Stark and AliNiazee (1982), van Kirk and AliNiazee (1982), Jones et al. (1991), Song et al. (2003), and Yee (2013). Parameter Temperature index (TI) DV0 DV1 DV2 DV3 PDD Moisture index (MI) SM0 SM1 SM2 SM3 Cold stress (CS) TTCS THCS Heat stress (HS) TTHS THHS Dry stress (DS) SMDS HDS Wet stress (WS) SMWS HWS

Description

Value

Lower temperature threshold for growth Lower optimum temperature for growth Upper optimum temperature for growth Upper temperature threshold for growth Number of degree-days above DV0 needed to complete one generation

38C 58C 258C 288C 1800

Lower Lower Upper Upper

soil moisture threshold optimum soil moisture optimum soil moisture soil moisture threshold

0.0  0.10  0.60  0.76 

Temperature threshold for cold stress Cold stress accumulation rate

6.08C 0.001 week1

Temperature threshold for heat stress Heat stress accumulation rate

288C 0.01 week1

Soil moisture threshold for dry stress Dry stress accumulation rate

0.0  0.0 week1

Soil moisture threshold for wet stress Wet stress accumulation rate

0.76  0.001 week1

  Threshold expressed as a proportion of soil moisture holding capacity (0 ¼ oven dry, and 1 ¼ field capacity (saturation)). Values .1.0 indicate the possibility of excessive amounts of rainfall and soil moisture.

America (AUC ¼ 0.800) and California (AUC ¼ 0.931) (Table 3). This model also had higher sensitivity values across all three omission rates (Table 3) for both test datasets. CLIMEX model predicted 81% of the R. indifferens localities correctly for western North America and 84% for California (Table 3).

RESULTS Predictive performance of different models All seven MaxEnt models evaluated for R. indifferens risk of establishment in western North America performed better than random with training AUC values greater than 0.50 (Table 2). Average AUC values based on 10-fold cross validation varied from 0.70 to 0.77. Predictions of all models were also significantly and positively correlated with R. indifferens presence-background data (Table 2). The best model had an AICc value of 2732.5 (lowest) and included nine predictors comprising seven environmental variables, host species, and Ecoclimatic Index from the CLIMEX model and other factors (MaxEnt_EnvHostClimex; Table 2). The MaxEnt_Env model with default settings and model with CLIMEX variables alone performed worst and had higher AICc values (MaxEnt_EnvDef, and MaxEnt_Climex; Table 2). When tested using independent datasets from western North America and California, the model with only seven environmental variables (MaxEnt_Env) performed the best with highest validation AUC values for both western North v www.esajournals.org

Predicted potential risk of R. indifferens establishment Predictions from the three best MaxEnt models (MaxEnt_Env, MaxEnt_EnvHost, and MaxEnt_EnvHostClimex) and the CLIMEX model matched closely with the R. indifferens current distribution in western North America (Fig. 1). For example, the MaxEnt and CLIMEX models correctly predicted R. indifferens occurrences in southern parts of British Columbia, Canada, central Washington, western Montana (near Flathead Lake), Colorado, and northern California (Fig. 1). Both CLIMEX and MaxEnt models also predicted potential distribution of R. indifferens in Nevada, central Montana, Nevada and Arizona. However, the fly has never been reported from these areas. The CLIMEX model missed R. indifferens localities in the Yakima 9

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL. Table 2. Summary of Rhagoletis indifferens model selection and evaluation. Model description

Model

Model selection

Model evaluation

Parameters No. in MaxEnt variables model AICc DAICc Rank Test AUC

Variables

MaxEnt_Env

Degreeday8.3, degreeday5, bio2, bio8, bio9, bio17, bio19 MaxEnt_EnvDef Degreeday8.3, degreeday5, bio2, (default settings) bio8, bio9, bio17, bio19 MaxEnt_EnvHost MaxEnt_Env, bitter cherry MaxEnt_EnvHostClimex MaxEnt_Env, bitter cherry, Climex_EI MaxEnt_Climex Climex_EI, Climex_HS MaxEnt_ClimexHost Climex_EI, bitter cherry MaxEnt_ClimexHostTopo Climex_EI, bitter cherry, elevation CLIMEX only Temperature, soil moisture

r

7

16

2757.1

24.6

4

0.754

0.107

7

45

2845.5 113.0

6

0.750

0.126

8 9

20 29

2750.8 2732.5

18.3 0.0

3 1

0.768 0.759

0.116 0.124

2 2 3

5 4 8

2844.0 111.5 2804.2 71.7 2747.3 14.8

6 5 2

0.708 0.695 0.722

0.071 0.072 0.082

...

...

...

...

...

...

...

Notes: AICc is the Akaike’s Information Criterion corrected for small sample size; AUC is area under the ROC curve; r is Pearson correlation coefficient; Climex_EI and Climex_HS are eco-climatic index and heat stress index generated by CLIMEX model, respectively. All correlations were significant (P , 0.0001). MaxEnt_Env is MaxEnt model with only environmental variables; MaxEnt_EnvDef is MaxEnt_Env with default settings; MaxEnt_EnvHost is MaxEnt_Env and host; MaxEnt_EnvHostClimex is MaxEnt_Env, host and CLIMEX outputs; MaxEnt_Climex is model with only CLIMEX outputs; MaxEnt_ClimexHost is MaxEnt_Climex and host; and MaxEnt_ClimexHostTopo is MaxEnt_Climex, host and topographic variables.

Valley in central Washington (Fig. 1E), whereas MaxEnt model predicted those correctly (Fig. 1B– D). All models predicted no risk for R. indifferens establishment in the Central Valley around the areas where sweet cherries are produced and in the Mojave Desert and other parts of southern California (Fig. 2). Most of the high to very high risk areas for R. indifferens were predicted in the northern parts of California (Klamath Mountains in the Pacific Coast Range and the Cascade

Range) and Sierra Nevada Mountains (Figs. 1 and 2), where native populations of the fly exist in bitter cherry (Mackie 1940, Dowell and Penrose 2012). Some patches of medium to high risk were predicted in Mount San Jacinto and the San Bernardino Mountains (Figs. 1 and 2), which agrees with Frick et al. (1954). The CLIMEX model missed several R. indifferens occurrences in northern California (Fig. 1D, E). The CLIMEX model indicated that high Heat Stress in the Central Valley of California potentially makes it

Table 3. Summary of Rhagoletis indifferens model validation using AUC and sensitivity metrics. Sensitivity using 20% withheld data (n ¼ 30) from western North America

AUC

20% withheld 0% data from Dowell and training western Penrose (2012) omission North America California data or LPT

Model MaxEnt_Env MaxEnt_Env (default settings) MaxEnt_EnvHost MaxEnt_EnvHostClimex MaxEnt_Climex MaxEnt_ClimexHost MaxEnt_ClimexHostTopo

2% training omission

5% training omission

Sensitivity using Dowell and Penrose (2012) data (n ¼ 51) from California 0% training 2% 5% omission training training or LPT omission omission

0.800 0.798

0.931 0.872

1.0 1.0

1.0 1.0

1.0 0.93

1.0 0.98

1.0 0.98

0.98 0.94

0.783 0.793 0.654 0.680 0.710

0.928 0.927 0.805 0.820 0.781

1.0 1.0 0.97 0.97 0.93

1.0 1.0 0.97 0.97 0.93

0.93 0.93 0.97 0.93 0.93

1.0 1.0 1.0 1.0 1.0

1.0 1.0 1.0 1.0 1.0

0.98 1.0 0.96 1.0 1.0

Notes: Sensitivity is the percentage of correctly predicted presences and varies from 0 to 1.0; a value of 1.0 indicates 100% correctly predicted presences. LPT is lowest predicted threshold; 2% training omission means that 2% of training locations (i.e., R. indifferens presences) fell outside the predicted suitable area. For the CLIMEX only model, sensitivity was 0.87 for 20% test data (0.81 for all data) and 0.84 using the Dowell and Penrose (2012) data. Training omission rates do not apply to CLIMEX model and locations with Ecoclimatic Index (EI) . 0 were interpreted as correctly predicted presences.

v www.esajournals.org

10

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL. Table 4. Average percent contribution of different environmental variables to MaxEnt models; values were averaged across 10 replicate runs. Variable Precipitation of driest quarter (bio17; mm) No. degree days with average temp  8.38C No. degree days with average temp  58C Mean diurnal range in temp (bio2; 8C) Mean temp of driest quarter (bio9; 8C) Precipitation of coldest quarter (bio19; mm) Mean temp of wettest quarter (bio8; 8C) Bitter cherry CLIMEX Eco-climatic index CLIMEX Heat stress index Elevation (m)

MaxEnt_ MaxEnt_ MaxEnt_ MaxEnt_ MaxEnt_ MaxEnt_ Env EnvHost EnvHostClimex Climex ClimexHost ClimexHostTopo 37.7 28.3 13.8 9.5 4.5 4.2 2.1 ... ... ... ...

28.0 17.1 14.1 8.0 4.3 2.1 4.5 21.9 ... ... ...

18.6 13.2 11.5 3.9 3.8 11.8 4.8 8.5 23.9 ... ...

... ... ... ... ... ... ... ... 59.5 40.5 ...

... ... ... ... ... ... ... 25.4 74.6 ... ...

... ... ... ... ... ... ... 22.1 57.9 ... 20.0

Note: The abbreviation ‘‘temp’’ is temperature.

unsuitable for the establishment of R. indifferens (Fig. 1F).

was 75 mm (Fig. 4A) and degree days at average temperature of 8.38C was 1300 (Fig. 4B). The probability of R. indifferens presence was high when the degree days at average temperature of 58C was close to zero and was highest around 600 (Fig. 4C). The probability of R. indifferens presence increased with the increased probability of bitter cherry presence (Fig. 4D).

Factors influencing R. indifferens distribution Precipitation of the driest quarter and degree days at average temperature 8.38C were the strongest predictors of R. indifferens presence in the MaxEnt_Env model, with average percent contributions of 38 and 28 percent, respectively (Table 4). Other top predictors of the R. indifferens distribution in this model were degree days at average temperature 58C and mean diurnal range in temperature (Table 4). The average percent contributions of environmental variables changed after considering native host bitter cherry and CLIMEX outputs in MaxEnt models (Table 4). For example, percent contribution of precipitation of the driest quarter and degree days at average temperature 8.38C decreased in MaxEnt_EnvHost and MaxEnt_EnvHostClimex models (Table 4). Degree days at average temperature 8.38C also had the highest ‘training gain’ (Fig. 3A) and ‘AUC values (Fig. 3B) when used in isolation, which means it had the most useful information for predicting the distribution of R. indifferens. Mean temperature of the driest quarter had the lowest training gain and AUC values (Fig. 3) in MaxEnt_Env model but we kept it in the model because it lowered the AICc values and improved the model (Appendix: Table A2). Rhagoletis indifferens response to precipitation of the driest quarter and degree days at average temperature 8.38C was unimodal (Fig. 4A, B). The probability of R. indifferens presence was highest when precipitation of the driest quarter v www.esajournals.org

DISCUSSION In this study we evaluated the capabilities of the correlative niche model MaxEnt and the process-based mechanistic niche model CLIMEX to assess the risk of R. indifferens establishment in the commercial cherry-growing areas of California. We also tested (1) whether including a host species distribution improved the niche models’ accuracy, and (2) whether models of pest risk establishment can be improved by combining a correlative and a process-based model. Our results showed that both correlative and process-based mechanistic niche models did an excellent job in estimating the current known occurrences of R. indifferens in western North America (Fig. 1). The MaxEnt model was more accurate (i.e., had very low omission errors; Table 3) than the CLIMEX model which could be because of the differences in spatial resolutions (;5 km versus ;18 km), and the type and time period of climate datasets (WorldClim versus CliMond) used in these models. We also found that including the distribution of a native host plant species improved the models of pest risk establishment. Models were further improved when the CLIMEX generated Ecoclimatic Index 11

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL.

Fig. 3. Relative importance of the environmental variables based on the Jackknife test. The figure shows each environmental variable’s contribution to (A) ‘training gain’, and (B) ‘AUC’; both are measures of model’s predictive ability. Degree days with average temperature 8.38C had the highest training gain and AUC when used in isolation which means it had the most useful information for predicting R. indifferens distribution. Values shown are averages of 10 replicate runs.

tially univoltine (with a very small second generation) and has a near obligatory diapause (Frick et al. 1954, van Kirk and AliNiazee 1982). The significance of this very small second generation (0.3–1.1%) and its potential impact on population composition has not been determined. However, since it takes approximately 3 to 5 weeks for this small segment of the population to emerge (Frick et al. 1954; L. Neven, unpublished data), there is a very low probability that host fruit would be available to support any potential offspring from this second generation (Frick et al. 1954) in California cherry-growing regions. Previous reports indicate that at least 20 weeks of chilling below 58C is necessary to

was used as a predictor in the correlative MaxEnt model. Finally, our study showed that R. indifferens is unlikely to establish in the commercial cherry-growing areas in the Central Valley of California, largely because heat stress is too high and chilling requirement in those areas are not met. The major production areas for commercial sweet cherries in California are located in the Central Valley. The sweet cherry season in this region spans from mid-May to around 20 June. Sweet cherry harvest in Oregon and Washington generally begins in mid to late June, with the latest varieties being harvested into early August (Long et al. 2007). Rhagoletis indifferens is essenv www.esajournals.org

12

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL.

Fig. 4. Relationships between four strongest environmental predictors and R. indifferens probability of presence: (A) precipitation of driest quarter (bio17; mm), (B) degree days with average temperatures  8.38C, (C) degree days with average temperatures  58C, and (D) probability of bitter cherry presence. Each of these curves is based on different MaxEnt models that included only the corresponding variable.

complete the chilling requirement for this species (Frick et al. 1954, van Kirk and AliNiazee 1981) (Table 1), which is not met in the Central Valley of California (Fig. 2F). This can result in either asynchronous spring/summer emergence or no emergence at all from diapausing individuals. More research, similar to that of Johnson et al. (2011) on olive fruit fly (Bactrocera oleae (Rossi )), is needed to determine the effects of heat stress (or high temperatures) on R. indifferens. Our study identified precipitation of the driest quarter, degree days at average temperature 8.38C, degree days at average temperature 58C, and mean diurnal range in temperatures as the strongest predictors of R. indifferens presence in western North America. In the Pacific Northwest, R. indifferens is native to the coastal forest and ponderosa pine ecosystems, where its v www.esajournals.org

ancestral host bitter cherry is commonly found (Yee and Goughnour 2008, Yee et al. 2011). Establishment of the fly in the desert sagebrush ecosystem of central Washington and Oregon was possible only because of irrigation and planting of monocultures of sweet cherries in this habitat. Bitter cherry is not found in sagebrush habitat but along its margins (Yee 2008). In contrast to the situation in the Pacific Northwest, in California, irrigation and planting of monocultures apparently are insufficient for establishment of R. indifferens due to lack of sufficient low temperatures.

Caveats and uncertainties Results from this study should be interpreted with caution, keeping in mind the uncertainties associated with different niche models. Different 13

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL.

niche modeling algorithms have different assumptions and limitations and may produce slightly different predictions (Elith et al. 2006, Kumar et al. 2009, Taylor and Kumar 2012). For example, they may be affected by sampling bias (Syfert et al. 2013), sample size (Wisz et al. 2008), multicollinearity (Dormann et al. 2013), and spatial autocorrelation (Veloz 2009) and do not include biotic interactions automatically. Performance of these models may also depend on species characteristics, spatial resolution and extent of the study area, and choice of predictor variables (Guisan et al. 2007a, b). The models may also be affected by the way background data points are selected (Phillips 2008, Phillips et al. 2009, VanDerWal et al. 2009). We addressed some of these by (1) using more than one niche model, (2) reducing the number of variables by assessing cross-correlations, (3) examining spatial autocorrelation and sampling bias before modeling, (4) including the fly’s native host plant distribution, (5) including species-specific phenology variables, and (6) drawing background points using a kernel density estimator (KDE) surface. CLIMEX is a semi-quantitative model (Sutherst et al. 2007) which calculates a Growth Index (GI) based on user defined parameters. The model parameters, if not known, are inferred by fitting the simulated distribution to known geographical distribution of the species. A user starts with a template (e.g., temperate or tropical) depending on the species of interest and then estimates the parameters. The majority of the parameters for R. indifferens (Table 1) were based on published laboratory studies and phenological observations but some were iteratively estimated which may have uncertainties associated with them (Taylor and Kumar 2012). This uncertainty is not specific to CLIMEX model as correlative niche models are also subject to parameter uncertainty (Barry and Elith 2006). The CLIMEX model is considered a process-based mechanistic niche model (Kriticos and Leriche 2010); however, it can also be a correlative model if species-specific phenological thresholds are not used in defining model parameters, but instead parameters are estimated using species’ known occurrences (Elith 2014). In this study, CLIMEX was more process-based than MaxEnt, but not entirely process-based and had some correlative component. None of the models included future climate v www.esajournals.org

change scenarios; both CLIMEX and MaxEnt included only current climatic variables. Future studies should investigate how changing climate might affect R. indifferens environmental suitability in the next 50 to 100 years in the Central Valley of California because studies have shown that an insect pest’s phenology may shift as a result of climate change (Hodgson et al. 2011). However, an increase in temperature as predicted by the general circulation models (IPCC 2007) will further decrease the possibility of R. indifferens establishing in the Central Valley of California.

Conclusions, recommendations and management implications This study shows the usefulness of correlative and process-based mechanistic niche models in assessing the risk of pest establishment. The approach presented here can be used to assess pest risk establishment at regional or global levels; for example, R. indifferens models developed in this study were projected to eight tropical countries that are current or potential fresh sweet cherry markets (Kumar et al. 2014). We showed how the predictive power of correlative niche models can be improved by including outputs from the process-based mechanistic niche models. We also showed that correlative niche models such as MaxEnt, if used carefully, can provide excellent estimates of likelihood of pest risk establishment. While estimating the risk of pest establishment using MaxEnt (or other correlative niche models), we suggest that (1) default settings in MaxEnt not be used (especially with small sample sizes) because they often result in overfitting and generate poorer estimates (Merow et al. 2013; Table 2; Appendix: Table A2); (2) sampling bias in occurrence data always be accounted for (Syfert et al. 2013); (3) background points be drawn from the areas that have been accessible to the species over a given period of time (Barve et al. 2011, Saupe et al. 2012; Appendix: Fig. A1); (4) species’ response curves be critically examined (Appendix: Fig. A3); (5) wherever possible, species’ interactions be considered by including host species; and (6) more than one niche model be tested. Although the quarantine procedures for sweet cherry shipments from the Pacific Northwest to 14

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL.

California were originally established 30 years ago to prevent the accidental introduction of this pest into commercial California cherry-producing areas, our model indicates that these precautions are unnecessary due to the unsuitability of this region for R. indifferens. Given the presence of R. indifferens in northern California, one would expect that if the climatic conditions are favorable for R. indifferens to establish in commercial sweet cherries, as is the case for Oregon and Washington, then establishment would have already occurred. Our model indicates that this is not very likely and that the rigid protocol concerning the shipment to California of cherries from areas where R. indifferens is known to occur might be reconsidered. Dowell and Penrose (2012) monitored for the presence of R. indifferens in commercial sweet cherry orchards in the Central Valley. They found no R. indifferens in the commercial sweet cherry growing region, and attributed the lack of presence to adhering to a strict, 30 year, quarantine procedure along with the asynchrony of fruit availability when comparing host use on sweet cherry and bitter cherry without consideration of the effects of climate on the potential distribution of this pest. Since R. indifferens was first reported in commercial cherries in the early 1900s, it is more than probable that infested cherries were shipped to the Central Valley of California from multiple sources prior to the establishment of the quarantine barrier but the fly was unable to establish due to adverse climatic conditions. The model generated in this study can also benefit sweet cherry growers exporting to areas climatically similar or warmer. It demonstrates that R. indifferens cannot or is highly unlikely to survive in those areas. Thus, where trade restrictions imposed on U.S.-produced sweet cherries exist due to concerns over R. indifferens, an argument can be made that restrictions should be eased. California exports of cherries to countries with concerns about R. indifferens also should not be affected, even if access to those countries is based on maintenance of a fly free area through the currently strict exterior quarantines. This research provides supporting documentation that relaxation of the quarantine will not represent a measurable increase in risk. v www.esajournals.org

ACKNOWLEDGMENTS This research was funded by a grant through the Washington Tree Fruit Research Commission (WTFRC) from the Foreign Agricultural Service of the USDA. We thank Mike Willett (Northwest Horticultural Council), Kathy Coffey (WTFRC), and Shelly Watkins (USDA-ARS) for providing occurrence data and assistance during the project. We thank Robert Dowell and Colleen Murphy-Vierra from the California Department of Food and Agriculture for providing R. indifferens survey data for model validation. We also thank the Natural Resource Ecology Laboratory at Colorado State University, and Yakima Agricultural Research Laboratory, USDA-ARS, Wapato, Washington, for providing the logistical support. We are grateful to two anonymous reviewers whose comments improved the manuscript.

LITERATURE CITED AliNiazee, M. T. 1978. The western cherry fruit fly Rhagoletis indifferens (Diptera: Tephritidae): Part 3 developing a management program by utilizing attractant traps as monitoring devices. Canadian Entomologist 110:1133–1140. Banham, F. L. 1971. Native hosts of western cherry fruit fly (Diptera: Tephritidae) in the Okanagan Valley of British Columbia. Journal of the Entomological Society of British Columbia 68:29–32. Banham, F. L. 1973. An evaluation of traps for the western cherry fruit fly (Diptera: Tephritidae). Evaluation of traps for the western cherry fruit fly (Diptera: Tephritidae) [Rhagoletis indifferens]. Journal of the Entomological Society of British Columbia 70:13–16. Barry, S., and J. Elith. 2006. Error and uncertainty in habitat models. Journal of Applied Ecology 43:413– 423. Barve, N., V. Barve, A. Jimenez-Valverde, A. LiraNoriega, S. P. Maher, A. T. Peterson, J. Soberon, and F. Villalobos. 2011. The crucial role of the accessible area in ecological niche modeling and species distribution modeling. Ecological Modelling 222:1810–1819. Blanc, F. L., and H. H. Kiefer. 1955. The cherry fruit fly in North America. California Department of Agriculture Bulletin 44:77–88. Brown, S. S. 2009. Master permit for the shipment of cherry fruit from Washington to California or Taiwan. Permit No. QC 783. California Department of Food & Agriculture, Plant Health and Pest Prevention Services, Sacramento, California, USA. Burnham, K. P., and D. R. Anderson. 2002. Model selection and multimodel inference: A practical

15

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL. Waltham, Massachusetts, USA. Elith, J., C. H. Graham, R. P. Anderson, M. Dudik, S. Ferrier, A. Guisan, R. J. Hijmans, F. Huettmann, J. R. Leathwick, A. Lehmann, J. Li, L. G. Lohmann, B. A. Loiselle, G. Manion, C. Moritz, M. Nakamura, Y. Nakazawa, J. M. Overton, A. T. Peterson, S. J. Phillips, K. Richardson, R. Scachetti-Pereira, R. E. Schapire, J. Soberon, S. Williams, M. S. Wisz, and N. E. Zimmermann. 2006. Novel methods improve prediction of species’ distributions from occurrence data. Ecography 29:129–151. ESRI. 2006. ArcGIS 9.2 desktop. ESRI, Redlands, California, USA. Fielding, A. H., and J. F. Bell. 1997. A review of methods for the assessment of prediction errors in conservation presence/absence models. Environmental Conservation 24:38–49. Foote, R. H., F. L. Blanc, and A. L. Norrbom. 1993. Handbook of the fruit flies (Diptera: Tephritidae) of America north of Mexico. Comstock, Ithaca, New York, USA. Franklin, J. 2009. Mapping species distributions: spatial inference and prediction. Cambridge University Press, Cambridge, UK. Frick, K. E., H. G. Simkover, and H. S. Telford. 1954. Bionomics of the cherry fruit flies in eastern Washington. Washington Agricultural Experiment Stations Technical Bulletin 13. Guisan, A., C. H. Graham, J. Elith, F. Huettmann, and the NCEAS Species Distribution Modelling Group. 2007a. Sensitivity of predictive species distribution models to change in grain size. Diversity and Distributions 13:332–340. Guisan, A., N. E. Zimmermann, J. Elith, C. H. Graham, S. Phillips, and A. T. Peterson. 2007b. What matters for predicting the occurrences of trees: Techniques, data, or species’ characteristics? Ecological Monographs 77:615–630. Hijmans, R. J., S. E. Cameron, J. L. Parra, P. G. Jones, and A. Jarvis. 2005. Very high resolution interpolated climate surfaces for global land areas. International Journal of Climatology 25:1965–1978. Hodgson, J. A., C. D. Thomas, T. H. Oliver, B. J. Anderson, T. M. Brereton, and E. E. Crone. 2011. Predicting insect phenology across space and time. Global Change Biology 17:1289–1300. IPCC. 2007. Climate change 2007: synthesis report. Contribution of Working Groups I, II and III to the Fourth Assessment Report of the Intergovernmental Panel on Climate Change. IPCC, Geneva, Switzerland. Jimenez-Valverde, A., J. M. Lobo, and J. Hortal. 2008. Not as good as they seem: the importance of concepts in species distribution modelling. Diversity and Distributions 14:885–890.

information-theoretic approach. Second edition. Springer, New York, New York, USA. Bush, G. L. 1966. The taxonomy, cytology, and evolution of the genus Rhagoletis in North America (Diptera, Tephritidae). Bulletin of the Museum of Comparative Zoology 134:431–562. CDFA [California Department of Food and Agriculture]. 2012. Master permit for shipments of commercially produced cherry fruit from states of Idaho, Montana, Oregon, Washington and Utah to California for domestic use or export. QC permit No. 1281. Appendix B. Curran, C. H. 1932. New North American Diptera with notes on others. American Museum Novitates 526:1–13. De Marco, P., J. A. F. Diniz, and L. M. Bini. 2008. Spatial analysis improves species distribution modelling during range expansion. Biology Letters 4:577–580. De Meyer, M., M. P. Robertson, M. W. Mansell, S. Ekesi, K. Tsuruta, W. Mwaiko, J. F. Vayssieres, and A. T. Peterson. 2010. Ecological niche and potential geographic distribution of the invasive fruit fly Bactrocera invadens (Diptera, Tephritidae). Bulletin of Entomological Research 100:35–48. de Villiers, M., V. Hattingh, and D. J. Kriticos. 2013. Combining field phenological observations with distribution data to model the potential distribution of the fruit fly Ceratitis rosa Karsch (Diptera: Tephritidae). Bulletin of Entomological Research 103:60–73. Dormann, C. F., J. Elith, S. Bacher, C. Buchmann, G. Carl, G. Carre, J. R. G. Marquez, B. Gruber, B. Lafourcade, P. J. Leitao, T. Munkemuller, C. McClean, P. E. Osborne, B. Reineking, B. Schroder, A. K. Skidmore, D. Zurell, and S. Lautenbach. 2013. Collinearity: a review of methods to deal with it and a simulation study evaluating their performance. Ecography 36:27–46. Dormann, C. F., S. J. Schymanski, J. Cabral, I. Chuine, C. Graham, F. Hartig, M. Kearney, X. Morin, C. Romermann, B. Schroder, and A. Singer. 2012. Correlation and process in species distribution models: bridging a dichotomy. Journal of Biogeography 39:2119–2131. Dowell, R. V., and R. L. Penrose. 2012. Distribution and phenology of Rhagoletis fausta (Osten Sacken 1877) and Rhagoletis indifferens Curren 1932 (Diptera: Tephritidae) in California. Pan-Pacific Entomologist 88:130–150. Elith, J. 2014. Predicting distributions of invasive species. http://arxiv.org/abs/1312.0851 Elith, J., and J. Franklin. 2013. Species distribution modelling. Pages 692–705 in S. Levin, editor. Encyclopedia of biodiversity. Academic Press,

v www.esajournals.org

16

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL. Lozier, J. D., and N. J. Mills. 2011. Predicting the potential invasive range of light brown apple moth (Epiphyas postvittana) using biologically informed and correlative species distribution models. Biological Invasions 13:2409–2421. Mackie, D. B. 1940. Cherry fruit flies in California (a preliminary note). California State Department of Agriculture Bulletin 29:157. Maxwell, S. A., G. Rasic, and N. Keyghobadi. 2009. Characterization of microsatellite loci for the western cherry fruit fly, Rhagoletis indifferens (Diptera: Tephritidae). Molecular Ecology Resources 9:1025–1028. Merow, C., M. J. Smith, and J. A. Silander. 2013. A practical guide to MaxEnt for modeling species’ distributions: what it does, and why inputs and settings matter. Ecography 36:1058–1069. Messina, F. J. 1990. Components of host choice by 2 Rhagoletis species (Diptera: Tephritidae) in Utah. Journal of the Kansas Entomological Society 63:80– 87. Ni, W. L., Z. H. Li, H. J. Chen, F. H. Wan, W. W. Qu, Z. Zhang, and D. J. Kriticos. 2012. Including climate change in pest risk assessment: the peach fruit fly, Bactrocera zonata (Diptera: Tephritidae). Bulletin of Entomological Research 102:173–183. Owens, H. L., L. P. Campbell, L. L. Dornak, E. E. Saupe, N. Barve, J. Soberon, K. Ingenloff, A. LiraNoriega, C. M. Hensz, C. E. Myers, and A. T. Peterson. 2013. Constraints on interpretation of ecological niche models by limited environmental ranges on calibration areas. Ecological Modelling 263:10–18. Peterson, A. T., J. Soberon, R. G. Pearson, R. P. Anderson, E. Martinez-Meyer, M. Nakamura, and M. B. Araujo. 2011. Ecological niches and geographic distributions. Princeton University Press, Princeton, New Jersey, USA. Phillips, S. J. 2008. Transferability, sample selection bias and background data in presence-only modelling: a response to Peterson et al. (2007). Ecography 31:272–278. Phillips, S. J., R. P. Anderson, and R. E. Schapire. 2006. Maximum entropy modeling of species geographic distributions. Ecological Modelling 190:231–259. Phillips, S. J., M. Dudik, J. Elith, C. H. Graham, A. Lehmann, J. Leathwick, and S. Ferrier. 2009. Sample selection bias and presence-only distribution models: implications for background and pseudo-absence data. Ecological Applications 19:181–197. R Development Core Team. 2012. R: a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. Sambaraju, K. R., A. L. Carroll, J. Zhu, K. Stahl, R. D.

Johnson, M. W., X. G. Wang, H. Nadel, S. B. Opp, K. Lynn-Patterson, J. Stewart-Leslie, and K. M. Daane. 2011. High temperature affects olive fruit fly populations in California’s Central Valley. California Agriculture 65:29–33. Jones, V. P., D. G. Alston, J. F. Brunner, D. W. Davis, and M. D. Shelton. 1991. Phenology of the western cherry fruit-fly (Diptera: Tephritidae) in Utah and Washington. Annals of the Entomological Society of America 84:488–492. Kearney, M. R., B. A. Wintle, and W. P. Porter. 2010. Correlative and mechanistic models of species distribution provide congruent forecasts under climate change. Conservation Letters 3:203–213. Kriticos, D. J., and A. Leriche. 2010. The effects of climate data precision on fitting and projecting species niche models. Ecography 33:115–127. Kriticos, D. J., R. W. Sutherst, J. R. Brown, S. W. Adkins, and G. F. Maywald. 2003. Climate change and the potential distribution of an invasive alien plant: Acacia nilotica ssp indica in Australia. Journal of Applied Ecology 40:111–124. Kriticos, D. J., B. L. Webber, A. Leriche, N. Ota, I. Macadam, J. Bathols, and J. K. Scott. 2012. CliMond: global high-resolution historical and future scenario climate surfaces for bioclimatic modelling. Methods in Ecology and Evolution 3:53–64. Kroening, M. K., B. C. Kondratieff, and E. E. Nelson. 1989. New host and distributional records for Rhagoletis in Colorado. Southwestern Entomologist 14:147–152. Kumar, S., L. G. Neven, and W. L. Yee. 2014. Assessing the potential for establishment of western cherry fruit fly using ecological niche modeling. Journal of Economic Entomology 107(3):1032–1044. Kumar, S., S. A. Spaulding, T. J. Stohlgren, K. A. Hermann, T. S. Schmidt, and L. L. Bahls. 2009. Potential habitat distribution for the freshwater diatom Didymosphenia geminata in the continental US. Frontiers in Ecology and the Environment 7:415–420. Li, B. N., J. Ma, X. N. Hu, H. J. Liu, and R. J. Zhang. 2009. Potential geographical distributions of the fruit flies Ceratitis capitata, Ceratitis cosyra, and Ceratitis rosa in China. Journal of Economic Entomology 102:1781–1790. Liu, C., M. White, and G. Newell. 2013. Selecting thresholds for the prediction of species occurrence with presence-only data. Journal of Biogeography 40:778–789. Long, L. E., M. Whiting, and R. Nu˜nez-Elisea. 2007. Sweet cherry cultivars for the fresh market. PNW 604. http://extension.oregonstate.edu/catalog/pdf/ pnw/pnw604.pdf

v www.esajournals.org

17

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL. van Kirk, J. R., and M. T. AliNiazee. 1982. Diapause development in the western cherry fruit fly, Rhagoletis indifferens Curran (Diptera, Tephritidae). Zeitschrift fu¨r Angewandte Entomologie 93:440– 445. Veloz, S. D. 2009. Spatially autocorrelated sampling falsely inflates measures of accuracy for presenceonly niche models. Journal of Biogeography 36:2290–2299. Warren, D. L., R. E. Glor, and M. Turelli. 2010. ENMTools: a toolbox for comparative studies of environmental niche models. Ecography 33:607– 611. Webber, B. L., C. J. Yates, D. C. Le Maitre, J. K. Scott, D. J. Kriticos, N. Ota, A. McNeill, J. J. Le Roux, and G. F. Midgley. 2011. Modelling horses for novel climate courses: insights from projecting potential distributions of native and alien Australian acacias with correlative and mechanistic models. Diversity and Distributions 17:978–1000. Wilson, H. F. and A. L. Lovett. 1913. Miscellaneous insect pests of orchard and garden. Oregon Agricultural Experimental Station Biennial Crop Pest and Horticultural Report (1911–1912). Wisz, M. S., R. J. Hijmans, J. Li, A. T. Peterson, C. H. Graham, A. Guisan, and NCEAS Predicting Species Distributions Working Group. 2008. Effects of sample size on the performance of species distribution models. Diversity and Distributions 14:763– 773. Yee, W. L. 2005. Seasonal distributions of eggs and larvae of Rhagoletis indifferens Curran (Diptera : Tephritidae) in cherries. Journal of Entomological Science 40:158–166. Yee, W. L. 2006. GF-120, Nu-Lure, and Mazoferm effects on feeding responses and infestations of western cherry fruit fly (Diptera : Tephritidae). Journal of Agricultural and Urban Entomology 23:125–140. Yee, W. L. 2008. Host plant use by apple maggot, western cherry fruit fly, and other Rhagoletis species (Diptera: Tephritidae) in central Washington state. Pan-Pacific Entomologist 84:163–178. Yee, W. L. 2013. Soil moisture and relative humidity effects during postdiapause on the emergence of western cherry fruit fly (Diptera: Tephritidae). Canadian Entomologist 145:317–326. Yee, W. L., and D. G. Alston. 2006. Effects of spinosad, spinosad bait, and chloronicotinyl insecticides on mortality and control of adult and larval western cherry fruit fly (Diptera : Tephritidae). Journal of Economic Entomology 99:1722–1732. Yee, W. L., and R. B. Goughnour. 2008. Host plant use by and new host records of apple maggot, western cherry fruit fly, and other Rhagoletis species

Moore, and B. H. Aukema. 2012. Climate change could alter the distribution of mountain pine beetle outbreaks in western Canada. Ecography 35:211– 223. Saupe, E. E., V. Barve, C. E. Myers, J. Sobero´n, N. Barve, C. M. Hensz, A. T. Peterson, H. L. Owens, and A. Lira-Noriega. 2012. Variation in niche and distribution model performance: The need for a priori assessment of key causal factors. Ecological Modelling 237-238:11–22. Senger, S. E., R. Tyson, B. D. Roitberg, H. M. A. Thistlewood, A. S. Harestad, and M. T. Chandler. 2009. Influence of habitat structure and resource availability on the movements of Rhagoletis indifferens (Diptera: Tephritidae). Environmental Entomology 38:823–835. Soberon, J., and A. T. Peterson. 2005. Interpretation of models of fundamental ecological niches and species’ distributional areas. Biodiversity Informatics 2:1–10. Song, Y., L. B. Coop, M. Omeg, and H. Riedl. 2003. Development of a phenology model for predicting western cherry fruit fly, Rhagoletis indifferens Curran (Diptera: Tephritidae), emergence in the mid Columbia area of the western United States. Journal of Asia-Pacific Entomology 6:187–192. Stark, S. B., and M. T. AliNiazee. 1982. Model of postdiapause development in the western cherry fruitfly (Diptera, Tephritidae). Environmental Entomology 11:471–474. Sutherst, R. W., G. F. Maywald, and D. J. Kriticos. 2007. CLIMEX Version 3: user’s guide: http://www. hearne.com.au/attachments/ClimexUserGuide3. pdf Syfert, M. M., M. J. Smith, and D. A. Coomes. 2013. The effects of sampling bias and model complexity on the predictive performance of MaxEnt species distribution models. PLoS ONE 8:e55158. Taylor, S., and L. Kumar. 2012. Sensitivity Analysis of CLIMEX Parameters in Modelling Potential Distribution of Lantana camara L. PLoS ONE 7. Trabucco, A., and R. J. Zomer. 2009. Global aridity index (global-aridity) and global potential evapotranspiration (global-PET) geospatial database. CGIAR Consortium for Spatial Information. http://www.csi.cgiar.org VanDerWal, J., L. P. Shoo, C. Graham, and S. E. William. 2009. Selecting pseudo-absence data for presence-only distribution modeling: How far should you stray from what you know? Ecological Modelling 220:589–594. van Kirk, J. R., and M. T. AliNiazee. 1981. Determining low-temperature threshold for pupal development of the western cherry fruit fly for use in phenology models. Environmental Entomology 10:968–971.

v www.esajournals.org

18

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL. (Diptera: Tephritidae) in western Washington state. Pan-Pacific Entomologist 84:179–193. Yee, W. L., R. B. Goughnour, and J. L. Feder. 2011. Differences in body size and egg loads of Rhagoletis indifferens (Diptera: Tephritidae) from introduced and native cherries. Environmental Entomology 40:1353–1362. Yee, W. L., H. M. A. Thistlewood, and M. W. Klaus.

2010. Infestation of apricot by Rhagoletis indifferens Curran (Diptera: Tephritidae) in Washington state and British Columbia. Pan-Pacific Entomologist 86:100–103. Zwick, R. W., U. Kiigemagi, and G. J. Fields. 1977. Residues of Dimethoate and Dimethoxon on sweet cherries following air carrier application. Journal of Agricultural and Food Chemistry 25:937–940.

SUPPLEMENTAL MATERIAL APPENDIX Table A1. Environmental variables considered in the MaxEnt model for Rhagoletis indifferens and bitter cherry Prunus emarginata. General statistics for the environmental variables to show fly’s bioclimatic profile were calculated based on 150 unique fly occurrence records.

Environmental variable 2

No. degree days with average temp  8.38C 2 No. degree days with average temp  58C 1 Mean diurnal range in temp (bio2; 8C) 1 Mean temp of wettest quarter (bio8; 8C) 1 Mean temp of driest quarter (bio9; 8C) 1 Precipitation of driest quarter (bio17; mm) 1 Precipitation of coldest quarter (bio19; mm) 2 Total solar radiation (direct þ diffuse; WH/m2) 1 Max temp of warmest month (bio5; 8C) 1 Min temp of coldest month (bio6; 8C) 1 Mean temp of coldest quarter (bio11; 8C) 1 Mean annual precipitation (bio12; mm) 1 Precipitation seasonality (CV) (bio15) 1 Precipitation of warmest quarter (bio18; mm) Slope (degrees) 2 Direct solar radiation (WH/m2) 1 Annual mean temp (bio1; 8C) 1 Isothermality (bio3) 1 Temp seasonality (SD 3 100) (bio4) 1 Temp annual range (bio7; 8C) 1 Mean temp of warmest quarter (bio10; 8C) 1 Precipitation of wettest month (bio13; mm) 1 Precipitation of driest month (bio14; mm) 1 Precipitation of wettest quarter (bio16; mm) 2 No. degree days with min temp 38C 3 Potential evapotranspiration (Annual; mm) 2 No. degree days with average temp . 08C 4 Heat index from CLIMEX 4 Eco-climatic index from CLIMEX 5 Bitter cherry (probability of presence) 1 Elevation (m) 2 Eastness 2 Northness

Included in Included in bitter cherry WCFF model model Yes Yes Yes Yes Yes Yes Yes No No No No No No No No No No No No No No No No No No No No No No No No No No

No No No Yes No No No Yes Yes Yes Yes Yes Yes Yes Yes No No No No No No No No No No No No No No ... No No No

Mean

SD

Min

Max

1,135.1 356.8 186.0 2,205.0 527.2 379.6 0.0 1,390.0 13.2 2.1 7.7 17.6 3.3 4.6 4.2 18.1 15.7 5.1 1.8 23.0 64.0 33.2 22.0 197.0 393.4 275.9 28.0 1,256.0 1,346,216.0 102,180.2 1,208,901.5 1,688,262.9 27.4 3.2 19.1 34.3 4.5 3.6 12.6 4.8 0.9 3.1 5.4 9.0 909.3 555.0 182.0 2,687.0 55.0 19.1 18.0 80.0 74.6 39.4 24.0 197.0 1.8 1.3 0.1 6.8 1,000,349.0 74,277.6 900,170.9 1,256,095.0 9.0 2.0 3.2 14.2 41.4 5.6 29.0 55.0 6,531.1 1,455.3 2,249.0 9,305.0 31.9 5.6 14.3 42.0 17.5 2.3 10.9 23.0 149.5 100.4 29.0 478.0 14.3 9.1 4.0 54.0 420.7 284.9 78.0 1,336.0 310.8 293.8 0.0 1057.0 27.4 64.8 0.0 603.3 3234.3 624.8 1441.0 5033.3 27.4 64.8 0.0 603.3 20.8 12.1 0.0 57.3 0.45 0.15 0.09 0.72 817.9 599.4 4.0 2,549.0 0.0 0.4 0.9 0.9 0.0 0.4 0.9 0.9

Notes: Abbreviations are: bio, the ‘BIOCLIM’ variables from the WorldClim dataset (see Hijmans et al. [2005] for more details); temp, temperature; SD, standard deviation; max, maximum; min, minimum; WCFF, western cherry fruit fly (Rhagoletis indifferens). Data sources (appearing as superscripts in column 1) are: 1, Bioclim variables and elevation from WolrdClim dataset (http://www.worldclim.org/); 2, generated using Arc GIS; 3, potential evapotranspiration: Trabucco and Zomer (2009); 4, generated using CLIMEX model; 5, generated using MaxEnt model.

v www.esajournals.org

19

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL. Table A2. Summary of Rhagoletis indifferens environment-only (MaxEnt_Env) model selection using AICc. See Table A1 for variables’ full names. Model no. 1 2 3 4 5 6 7 8 9 10

Variables Degree bio9, Degree bio9, Degree bio9, Degree Degree Degree Degree bio9, Degree Degree Degree

No. variables

No. parameters

AICc

DAICc

Model rank

7

16

2757.1

0.0

1

7

45

2845.5

88.5

7

7

23

2826.9

69.8

5

6 5 4 9

14 12 11 27

2822.3 2773.4 2797.5 2851.0

65.2 16.4 40.4 94.0

4 2 3 8

3 3 2

7 8 4

2842.9 2888.6 2930.3

85.8 131.5 173.2

6 9 10

days8.3, degree days5, bio17, bio2, bio19, bio8 days8.3, degree days5, bio17, bio2, bio19, bio8 (default settings) days8.3, degree days5, bio17, bio2, bio19, bio8 (default settings; Regularization ¼ 2.0) days8.3, degree days5, bio17, bio2, bio19, bio9 days8.3, degree days5, bio17, bio2, bio19 days8.3, degree days5, bio17, bio2 days8.3, degree days5, bio17, bio2, bio19, bio15, bio8, elevation days8.3, degree days5, bio17 days8.3, degree days5, bio14 days8.3, degree days5

Table A3. Cross-correlations among environmental predictor variables included in the final Rhagoletis indifferens models. Variable No. degree days with average temp  8.38C (dd8.3) No. degree days with average temp  58C (dd5.0) Mean diurnal range in temp (bio2; 8C) Mean temp of wettest quarter (bio8; 8C) Mean temp of driest quarter (bio9; 8C) Precipitation of driest quarter (bio17; mm) Precipitation of coldest quarter (bio19; mm) Bitter cherry (Bitchr) CLIMEX Eco-climatic index (EI) CLIMEX Heat stress index (HSI) Elevation (m) (Elev)

dd8.3 dd5.0

bio2

bio8

bio9

bio17

bio19 Bitchr

EI

HSI

1.0 0.65 0.39 0.68 0.64 0.62 0.28 0.39 0.23 0.85 0.54

1.0 0.18 1.0 0.51 0.29 1.0 0.73 0.27 0.20 1.0 0.43 0.72 0.41 0.45 1.0 0.26 0.53 0.12 0.04 0.55 1.0 0.12 0.13 0.30 0.06 0.11 0.46 1.0 0.07 0.09 0.19 0.05 0.15 0.13 0.28 1.0 0.43 0.35 0.53 0.50 0.48 0.26 0.44 0.40 1.0 0.75 0.23 0.32 0.56 0.15 0.24 0.08 0.01 0.34

Notes: The correlations were significant at alpha ¼ 0.05 unless otherwise stated. Significance levels were adjusted using Bonferroni correction for multiple comparisons. The abbreviation ‘‘temp’’ is temperature.

v www.esajournals.org

20

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL.

Fig. A1. Distribution of (A) Rhagoletis indifferens and (B) bitter cherry occurrence data in the study area. Background points for (C) R. indifferens and (D) bitter cherry were selected using a kernel density estimator (KDE) surface. Presence records for R. indifferens were collected from Banham (1971, 1973), Zwick et al. (1977), AliNiazee (1978), Kroening et al. (1989), Messina (1990), Jones et al. (1991), Yee (2005, 2006, 2008), Yee and Alston (2006), Yee and Goughnour (2008), Maxwell et al. (2009), Senger et al. (2009), Yee et al. (2010, 2011), and Dowell and Penrose (2012).

v www.esajournals.org

21

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL.

Fig. A2. Moran’s I correlograms for (A) Rhagoletis indifferens, and (B) bitter cherry MaxEnt model residuals. Open diamonds represent nonsignificance, and closed diamonds indicate significance (two-tailed test; P  0.05) for positive spatial autocorrelation adjusted using progressive Bonferroni correction for multiple comparisons.

v www.esajournals.org

22

July 2014 v Volume 5(7) v Article 86

KUMAR ET AL.

Fig. A3. Effects of MaxEnt’s default settings on model complexity, model predictions and response curves. (A) MaxEnt_Env model (default settings) had seven variables, 45 parameters, and AICc value of 2845.54; (B) MaxEnt_Env model (Simple settings) had seven variables, 16 parameters, and AICc value of 2732.51. Simple settings included only Linear, Quadratic and Product features. Default settings resulted in very complex response curves (C, E), and Simple settings resulted in simple non-linear response curves (D, F). Simple settings model had higher accuracy than default settings model (see Tables 2 and 3).

v www.esajournals.org

23

July 2014 v Volume 5(7) v Article 86