Vibrio parahaemolyticus Isolates from ... - Mary Ann Liebert, Inc.

4 downloads 0 Views 262KB Size Report
Vibrio parahaemolyticus Isolates from Southeastern. Chinese Coast Are Genetically Diverse with Circulation of Clonal Complex 3 Strains Since 2002. Ying Yu,1 ...
FOODBORNE PATHOGENS AND DISEASE Volume 8, Number 11, 2011 ª Mary Ann Liebert, Inc. DOI: 10.1089/fpd.2011.0865

Vibrio parahaemolyticus Isolates from Southeastern Chinese Coast Are Genetically Diverse with Circulation of Clonal Complex 3 Strains Since 2002 Ying Yu,1,* Weizhao Hu,1,* Beibei Wu,2 Peipei Zhang,1 Jianshun Chen,1 Shuna Wang,1 and Weihuan Fang1

Abstract

Multilocus sequence typing (MLST) was used to examine the clonal relationship and genetic diversity of 71 Vibrio parahaemolyticus isolates from clinical and seafood-related sources in southeastern Chinese coast between 2002 and 2009. The tested isolates fell into 61 sequence types (STs). Of 17 clinical isolates, 7 belonged to ST3 of the pandemic clonal complex 3, with 3 strains isolated in 2002. Although there was no apparent clonal relationship found between clinical strains and those from seafood-related sources positive with pathogenic markers, there were clonal relationships between clinical strains from this study and those from environmental sources in other parts of China. Phylogenetic analysis showed that strains of 112 STs (61 STs from this study and 51 retrieved from PUBMLST database covering different continents) could be divided into four branches. The vast majority of our isolates and those from other countries were genetically diverse and clustered into two major branches of mixed distribution (of geographic origins and sample sources), whereas five STs representing six isolates split as two minor branches because of divergence of their recA genes, which had 80%–82% nucleotide identity to typical V. parahaemolyticus strains and 73.3%–76.9% identity to the CDS24 of a Vibrio sp. plasmid p23023, indicating that the recA gene might have recombined by lateral gene transfer. This was further supported by a high ratio of recombination to mutation (3.038) for recA. In conclusion, MLST with fully extractable database is a powerful system for analysis of clonal relationship for strains of a particular region in a national or global scale as well as between clinical and environmental or food-related strains.

Introduction

V

ibrio parahaemolyticus is one of the leading causes of human foodborne gastroenteritis in China (31.1% of foodborne outbreaks reported between 1991 and 2001) (Liu et al., 2004). In the mid 1990s, its O3:K6 serotype was identified as a cause of seasonal outbreak in Asia and North America (Okuda et al., 1997; Chiou et al., 2000; DePaola et al., 2003; Martinez-Urtaza et al., 2004; Nair et al., 2007). The O3:K6 pandemic clone also existed in China (Vongxay et al., 2008). From epidemiological perspectives, it is important to understand the genetic relatedness of isolates of this particular bacterium from clinical sources and those from seafood or environmental settings. It is also critical to understand whether there is any particular clonal group or complexes circulating in a particular country or region. Apparently, a good genetic fingerprinting technique is required to fulfill these purposes.

A number of genetic fingerprinting techniques have been used to examine the relatedness of several bacterial species such as pulse-field gel electrophoresis (PFGE), ribotyping, and multilocus sequence typing (MLST) (Maiden et al., 1998; Kai et al., 2008; Pavlic and Griffiths, 2009). MLST has been found suitable for a variety of bacterial species (Nallapareddy et al., 2002; Chen et al., 2006; Foley et al., 2006). Chowdhury et al. (2004) approached MLST based on four genes on chromosome I for subtyping of pandemic strains of V. parahaemolyticus. Gonzalez-Escalona et al. (2008) used seven house-keeping genes to examine the genetic diversity of V. parahaemolyticus. Sixty-two sequence types (STs) were identified, and three major clonal complexes (CCs) were found, two for isolates from the Pacific and Gulf coasts of the United States and a third one contained strains belonging to the pandemic CC3 (Gonzalez-Escalona et al., 2008). The CC3 strains have worldwide distribution (Okuda et al., 1997; Chiou et al., 2000;

1 Zhejiang University Institute of Preventive Veterinary Medicine and Zhejiang Provincial Key Laboratory of Preventive Veterinary Medicine, Hangzhou, China. 2 Zhejiang Provincial Center for Disease Control, Hangzhou, China. *These authors equally contributed to this work.

1169

1170 DePaola et al., 2003; Martinez-Urtaza et al., 2004, 2005; Nair et al., 2007; Ansaruzzaman et al., 2008). Several papers reported the sero-diversity and genetic structure of some clinical and environmental isolates of V. parahaemolyticus. Chao et al. (2009) reported that the pandemic clonal serovars of their clinical isolates from 2005–2008 included O3:K6, O1:KUT, O1:K25, O1:K26, and O4:K68, with O3:K6 being the dominant serovar. Yan et al. (2011) used an extended MLST scheme [having four gene loci identical to MLST by Gonzalez-Escalona et al. (2008)] on 174 global strains, including 7 clinical strains between 2003 and 2007 and 33 environmental isolates between 2006 and 2007 from mainland China. They found three major clonal groups corresponding to the groups of pre-1996 old-O3:K6 strains CC2, post-1996 pandemic strains (CC3), and nonclinical isolates (CC5), respectively. One of the CC3 strains in that paper was a clinical strain isolated in the southern Chinese province of Guangxi in 2003. However, genetic diversity and clonal relationship of V. parahaemolyticus strains from southeastern Chinese coast remain uncharacterized over a wider time period. In this study, we used MLST to examine the genetic population structure of selected V. parahaemolyticus strains from clinical and seafood-related sources between 2002 and 2009 and to analyze the clonal relationship of Chinese isolates to those from other parts of the world. Materials and Methods Bacterial strains, cultures, and DNA extraction Seventy-one V. parahaemolyticus isolates in our strain collection (658 strains, mostly from seafood-related samples) from 2002 to 2009 were selected. There were 17 isolates from clinical sources, including 10 from Zhejiang Center for Disease Control and Prevention (Zhejiang CDC), China, and 7 from Hangzhou CDC, Zhejiang, China. Fifty-three isolates were from seafoodrelated sources (seafood and their environments) in the southeastern Chinese coastal provinces of Zhejiang and Fujian. These seafood-related isolates were selected based on the presence of major or putative virulence genes such as tdh or trh (Vongxay et al., 2008) or those representing type 3 secretion systems (T3SS) or putative T6SS by polymerase chain reaction (PCR) typing (Table 1 and Supplementary Table S1; Supplementary Data are available online at www.liebertonline.com/ fpd). The reference strain BJ1997 was purchased from China General Microbiological Culture Collection Center. The bacterial strains were revived from - 80C stock and cultured at 37C at 250 rpm overnight in brain heart infusion broth (Oxoid) with 3% NaCl. One milliliter of each culture was centrifuged at 12,000 g for 1 min and then resuspended in 100 lL distilled water and 100 lL of 2 · TZ (2% Triton X-100, 2.5 mg/mL NaN3, 1 M Tris-HCl [pH 8.0]). The suspension was heated in a boiling water bath for 10 min, then left on ice for 10 min, and subjected to centrifugation at 12,000 g for 1 min. The supernatant solution containing DNA template was either immediately used or kept at - 20C for later use. PCR amplification For direct sequencing of the PCR products of seven loci as described in PUBMLST (http://pubmlst.org/ vparahaemolyticus/) or by Gonzalez-Escalona et al. (2008), the primers were redesigned in conserved regions of at least

YU ET AL. 70 bp length beyond the target gene fragments (Supplementary Table S1). The primers were synthesized by Invitrogen Co. Ltd. The PCR conditions were essentially the same as those specified in the PUBMLST database. The PCR products were purified using PCR clean kit (Axygen) and subjected to sequencing in an ABI-3700 automated DNA sequencer (Invitrogen). Sequencing quality was visually checked and cut to the exact length of each locus in the PUBMLST database, in which the sequences of new STs were deposited. Identification of CCs, groups, and singletons The eBURST program v3.0 (Feil et al., 2004) was used to subdivide 201 STs (representing 386 strains) into groups of related isolates and CCs. These included 61 STs (71 strains) from this study and all 140 STs (315 strains) available from PUBMLST database as of December 2009 (representing strains from Bangladesh, other parts of China, Chile, Ecuador, India, Japan, Korea, Mozambique, Norway, Peru, Spain, Thailand, and United States). Of the 386 strains, 185 were from environmental sources (47.9%). We used the most stringent group definition that STs share identical alleles at six or seven of the MLST loci with at least one other member of the group. If a group contains closely related STs, which have diversified from founding genotype, the group is considered as a CC. Any two STs differing form each other at a single locus were defined as single-locus variant (SLV) and two STs differing form each other at two loci as double-locus variant (DLV). The ST that has the largest number of SLVs in a CC is considered as the founding genotype. The statistical confidences for the founding genotype were assessed using 1000 bootstrap resamplings. STs that do not belong to any groups are called singletons (Feil et al., 2004). Determination of allelic profiles and STs The trace files of new alleles were submitted to the curator for verification. For each gene, new alleles were given with arbitrary allelic numbers. Combination of allelic numbers for a particular isolate constituted allelic profile for that isolate. Each unique allelic profile was assigned an ST number by the curator. Information of the allelic profiles and STs can be found on http://pubmlst.org/vparahaemolyticus/. Genetic diversity and phylogenetic analyses Genetic diversity statistics were computed on sequences of individual loci or on concatenated sequences of each isolate by DNASP v5.0 (Librado et al., 2009). The statistics include the number of polymorphic sites, percentage of variable nucleotides sites, nucleotide diversity, number of alleles, and pairwise ratios of nonsynonymous substitutions to synonymous substitutions (dN/dS). The r/m parameter, calculated using ClonalFrame (Didelot and Falush, 2007), is the ratio of probabilities that a given site is altered through recombination and mutation. Recombination events were tested using RDP3.44 (Martin et al., 2010). MEGA v4.1 (Tamura et al., 2007) was used to construct the minimum-evolution (ME) tree of concatenated sequences of 112 STs (including 52 STs unique from this study out of a total of 61, 9 STs from this study that shared the STs from other countries in the database, and 51 STs from different continents that formed CCs or groups as analyzed by eBURST) (Supplementary Table S2). Statistical confidence of

VIBRIO PARAHAEMOLYTICUS CLONAL COMPLEX 3 STRAINS IN CHINA

1171

Table 1. Sources of 71 Vibrio parahaemolyticus Isolates and Their Characteristics Virulence gene a

Strain

ST

Allelic profiles

BJ1997 HM15 HM18 HM23 ZJ3 ZJ17 HM12 Huzhou0810 ZJ2 HM17 HM4 F22 HM6 HM13 KP9 KP14 L16 L17 L23 L25 L28 L37 L48 L66 L67 L70 L95 L128 L138 ZS99 MJ3 C5-1 MJ42 XS65 XS81 XS86 YH2 YH18 YH21 YH24 YH27 YH47 YH60 ZJ15 ZJ6N ZJ9N ZS6-1 ZS6-3 ZS41 ZS44 ZS46 ZS76 ZS94 L7 MJ24 HM16 MJ37 MJ40 HM1 C3-2

1 3 3 3 3 3 3 3 8 120 120 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 163 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179 180 181 182 183 184 186 187 189 201 202 203 204

5, 52, 27, 13, 17, 25, 10 3, 4, 19, 4, 29, 4, 22 3, 4, 19, 4, 29, 4, 22 3, 4, 19, 4, 29, 4, 22 3, 4, 19, 4, 29, 4, 22 3, 4, 19, 4, 29, 4, 22 3, 4, 19, 4, 29, 4, 22 3, 4, 19, 4, 29, 4, 22 28, 4, 82, 88, 63, 69, 1 60, 108, 86, 98, 18, 45, 51 60, 108, 86, 98, 18, 45, 51 64, 77, 58, 62, 4, 62, 48 42, 81, 59, 63, 28, 7, 52 65, 43, 33, 64, 30, 5, 23 3, 79, 31, 13, 18, 4, 51 68, 76, 57, 65, 50, 70, 53 19, 74, 61, 68, 48, 11, 26 3, 82, 62, 69, 30, 7, 23 67, 83, 31, 70, 47, 73, 13 6, 6, 3, 17, 11, 35, 54 35, 50, 63, 27, 49, 46, 26 11, 75, 64, 67, 26, 7, 50 28, 73, 65, 71, 27, 52, 1 3, 71, 66, 73, 26, 27, 23 71, 13, 67, 77, 21, 69, 55 12, 58, 68, 78, 39, 37, 49 70, 78, 31, 81, 49, 74, 62 28, 17, 21, 79, 20, 23, 24 47, 58, 53, 19, 50, 37, 26 72, 96, 70, 80, 18, 75, 9 72, 96, 70, 80, 18, 75, 9 72, 96, 70, 80, 18, 75, 9 25, 89, 72, 76, 55, 72, 54 76, 88, 31, 13, 53, 45, 13 83, 97, 73, 83, 4, 77, 58 42, 93, 74, 50, 26, 78, 57 19, 87, 76, 19, 61, 79, 56 51, 4, 77, 67, 60, 8, 33 82, 85, 78, 19, 60, 69, 24 81, 84, 75, 84, 63, 26, 47 80, 95, 80, 74, 4, 67, 61 77, 104, 3, 85, 65, 10, 57 84, 103, 79, 82, 62, 46, 60 78, 4, 31, 90, 21, 11, 12 5, 4, 25, 4, 64, 4, 22 79, 43, 31, 86, 21, 11, 12 81, 102, 83, 86, 59, 80, 48 81, 106, 81, 86, 50, 80, 48 89, 105, 15, 89, 57, 11, 57 91, 101, 84, 91, 58, 78, 33 90, 100, 31, 29, 66, 11, 9 87, 99, 97, 87, 67, 5, 54 19, 87, 31, 19, 61, 79, 56 31, 121, 60, 66, 52, 62, 23 75, 120, 71, 13, 56, 37, 29 11, 48, 3, 48, 26, 48, 26 74, 94, 89, 82, 23, 37, 23 73, 90, 98, 66, 4, 76, 26 13, 10, 19, 27, 28, 7, 21 19, 133, 103, 21, 37, 103, 74

Source

Year

Serovar

CGMCC Hangzhou CDC Hangzhou CDC Hangzhou CDC Zhejiang CDC Zhejiang CDC Hangzhou CDC Zhejiang CDC Zhejiang CDC Hangzhou CDC Hangzhou CDC Oyster, Fujian Hangzhou CDC Hangzhou CDC Clam, Fujian Fish, Fujian Shrimp, Wenzhou Shrimp, Wenzhou Environ, Wenzhou Environ, Wenzhou Shrimp, Wenzhou Environ, Wenzhou Environ, Wenzhou Environ, Wenzhou Environ, Wenzhou Fish, Wenzhou Environ, Wenzhou Shell, Wenzhou Environ, Wenzhou Environ, Zhoushan Shrimp, Zhoushan Environ, Zhoushan Environ, ZhouShan Fish, Xiangshan Environ, Xiangshan Shellfish, Xiangshan Environ, Yuhuan Environ, Yuhuan Snail, Yuhuan Environ, Yuhuan Environ, Yuhuan Environ, Yuhuan Clam, Yuhuan Zhejiang CDC Zhejiang CDC Zhejiang CDC Environ, Zhoushan Environ, Zhoushan Environ, Zhoushan Environ, Zhoushan Environ, Zhoushan Environ, Zhoushan Environ, Zhoushan Shrimp, Wenzhou Clam, Zhoushan Hangzhou CDC Environ, Zhoushan Clam, Zhoushan Hangzhou CDC Environ, Zhoushan

1997 2002 2002 2002 2003 2003 2006 2008 2003 2002 2006 2005 2006 2002 2004 2004 2007 2007 2007 2007 2007 2007 2007 2007 2007 2007 2007 2007 2007 2007 2008 2009 2008 2007 2007 2007 2008 2008 2008 2008 2008 2008 2008 2003 2003 2003 2007 2007 2007 2007 2007 2007 2007 2007 2008 2002 2008 2008 2006 2009

ND O3:K6 O4:K8 O3:K6 O1:KUT O3:K6 O3:K6 ND ND O4:K8 O3:K6 O1:K41 O3:K6 O3:K6 ND ND ND ND ND ND ND ND ND ND ND ND ND ND ND ND ND ND ND ND ND ND ND ND ND ND ND ND ND ND O4:K68 04:K8 ND ND ND ND ND ND ND ND ND O4:K8 ND ND O3:K6 ND

KP tdh trh ureC T3SS2 T6SS1 + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +

+ + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + + +

+ + + + + + + + + + + + + + + + + + + + -

+ + + + + -

+ + + + + + + + + + + + + + + + + + + + + -

+ + + + + + + + + + + + + + + + + + + + + + + + -

(continued)

1172

YU ET AL. Table 1. (Continued) Virulence gene

Strain

ST

Allelic profilesa

Source

Year

Serovar

C41-2 C49-1 L140 ZS49 ZS62 L64 L84 KP34 L100 YH4 C56-2

205 206 207 208 208 209 210 211 212 213 214

105, 134, 78, 19, 26, 102, 75 31, 131, 60, 78, 52, 62, 13 47, 91, 105, 46, 79, 45, 26 88, 4, 104, 92, 60, 8, 26 88, 4, 104, 92, 60, 8, 26 66, 72, 107, 72, 21, 94, 32 7, 70, 106, 75, 4, 5, 59 3, 4, 108, 4, 51, 4, 22 69, 92, 69, 114, 54, 71, 24 19, 86, 3, 113, 31, 68, 23 104, 132, 102, 115, 26, 62, 73

Environ, Zhoushan Environ, Zhoushan Environ, Wenzhou Shrimp, Zhoushan Shrimp, Zhoushan Fish, Wenzhou Shell, Wenzhou Clam, Fujian Environ, Wenzhou Environ, Yuhuan Environ, Zhoushan

2009 2009 2007 2007 2007 2007 2007 2004 2007 2008 2009

ND ND ND ND ND ND ND O3:K6 ND ND ND

KP tdh trh ureC T3SS2 T6SS1 + + + + + + + + + + +

+ + + + + + + + + + +

+ + + -

-

+ -

+ -

a

Ordered as dnaE, gyrB, recA, dtdS, pntA, pyrC, and tnaA. ND, not done.

the nodes in the ME tree was assessed by bootstrap interior branch test (500 replicates). Results Allelic profiles Of the 71 isolates, there were 61 STs (Table 1). Nearly 10% of the isolates belonged to ST3, all being clinical isolates collected from 2002 to 2008. Three environmental/seafood isolates (ZS99, MJ3, and C5-1) from samples in Zhoushan (north coast of Zhejiang province) fell into ST163, and two isolates from shrimps fell into ST208. ST120 included two clinical strains (HM4 and HM17). The remaining 57 STs were represented by 57 isolates, including 8 clinical strains, 1 reference strain BJ1997, and 48 seafood-related isolates. We did not find clonally related STs between clinical strains and the seafoodrelated isolates that tested positive for pathogenic markers from our strain collection. However, there were clonal relationships with ST3, ST120, and ST189 between clinical strains from this study and those from environmental sources in other parts of China (Supplementary Table S2). Analysis of CCs, groups, and singletons The eBURST analysis divided 201 STs into 4 CCs, 18 groups, and 141 singletons (Supplementary Fig. S1). Two STs from this study (ST3, ST161) were contained in two CCs (CC3 and CC49), whereas seven other STs (ST1, ST189, ST120, ST203, ST168, ST184, and ST153) were in six groups (groups 1–6) (Supplementary Table S2). ST3 covered seven clinical isolates from this study (41%, 7/17; Table 1) and belonged to the ancestral CC3, which were joined by ST2, ST27, ST42, ST51, ST71, ST72, and ST192 (Supplementary Fig. S1). Of 60 STs containing 206 strains, there were only 15 STs that all contained clinical strains (n = 131 strains), mostly in CC3 (109 strains, 83.2%) or its seven SLVs (7 STs, n = 7), and groups 1 (3 STs, n = 10), 4 (2 STs, n = 3), and 15 (2 STs, n = 2). The STs in other groups or CCs were either from environmental sources or a mixture of clinical and environmental sources (Supplementary Table S2). The clinical strains in the CC3 or its SLVs were from 12 countries including China, covering a 14-year span from 1996 to 2009. Multiple serotypes were found in CC3 (O1:K25, O1:KUT, O3:K6, O3:K68, O3:Kuk, O4:K8, O4:K68,

O5:K68). CC49 was comprised of four STs (ST49, ST53, ST70, and ST161), represented by four environmental isolates from United States, Norway, Chile, and China (Supplementary Table S2), with ST49 being the founding genotype (Supplementary Fig. S1). The ST211 isolate was from clam with O3:K6 serotype and appeared to be a DLV of ST3 for its differences in the loci recA and dtdS (Table 1). Phylogenetic analysis Figure 1 shows that the 112 STs formed four branches. The vast majority of the strains from our laboratory were in the two main branches together with those from other parts of China or other countries. There were six strains (belonging to five different STs) branched off as two different clusters: ST207, ST209, and ST211 as one cluster, and ST208 and ST210 as the other. The recA gene from these strains was diversified (80%–82% nucleotide identity) from typical V. parahaemolyticus strains, but had 82%– 84% identity to V. fortis (AJ842423), V. tubiashii (AJ842522), and V. halioticoli (AJ842431) (Table 2). Cluster analysis by recA alone gave the same structure (Fig. 1, insert), indicating that outlining of these strains was mainly due to recA diversification, most probably by recombination as shown by an r/m ratio of 3.038 (Table 3). The recA locus of the 71 isolates tested did show six unique events of recombination as detected by the RDP program. The tree revealed two major differences with eBURST analysis (Supplementary Fig. S1 vs. Fig. 1). ST71 and ST72 that belonged to the same CC3 were clustered in the ME tree with ST91 and ST96 of group 15, which was far from CC3 in eBURST. ST88 and ST189, though being in the same group 2 by eBURST, were in different clusters corresponding to group 14 (ST74 and ST85) and group 6 (ST9 and ST153), respectively, in the tree. ST211 (strain KP34) was DLV of CC3, which was placed in the outer branch because of its divergent recA sequence (Fig. 1, inset). Nucleotide diversity and recombination The number of polymorphic sites of the loci tested were from 28 ( pyrC, tnaA) to 255 (recA) and the number of alleles ranged from 32 (tnaA) and 52 (gyrB) in the seven loci of 71 V. parahaemolyticus isolates (Table 3). The percentage of variable sites and nucleotide diversity per site were greatest in recA (34.9% and 0.0564, respectively), followed by dtdS (15.1% and 0.0278).

VIBRIO PARAHAEMOLYTICUS CLONAL COMPLEX 3 STRAINS IN CHINA 49 99

1173

Group8(ST24,ST60,ST102)

55

Group11(ST25,ST26) 99

81

Group16(ST100,ST101) 99 Group12(ST30,ST31)

62 99

Group18(ST117,ST118)

63

99

ST152 ST163

4093

#

ST182

80

ST149 #

89

ST201

90

ST180

87

#

ST204 99

8794 47

Group17(ST109,ST110)

CC34(ST33,ST34,ST35,ST77,ST95) 99

ST154

98

#

ST183 99

7518

Group3(ST120,ST188)

ST165

85 5

Group9(ST18,ST98) 99 Group5(ST168,ST184)

99 64

ST162 74 94

ST213 ST173

79

ST186

50

#

ST206

99

ST156 #

FIG. 1. The minimum evolution tree of 112 STs based on concatenated sequences (representing all STs from this study, and those from different continents that formed clonal complexes or groups as shown by eBURST analysis; Supplementary Table S2). The inset is the tree of recA loci from the 112 STs, showing that the divergence of the recA loci in these five STs from this study contributed to the branching of the major tree. #Close relationship between two STs; *close relationship between STs in this study and those of clonal complexes or groups; C, STs in minimum-evolution tree divergent from eBURST results; =, the symbol of varying size means that those STs are clustered in the same branch as opposed to the neighboring one and is used to save spaces for other STs with more pronounced divergence.

ST160

67 91

ST71

99 80

ST72 Group15(ST91,ST96) 99

ST155

91

88

#

52

ST169

99

ST172

7489

28

ST175 #

ST177

99

ST151

95

ST209

ST166

99

ST157 #

ST148

4370

0.02

ST167 ST189

49

*

79

Group6(ST9,ST153) 99 98

99

Group10(ST21,ST59)

99

CC36(ST36,ST37,ST38,ST39) 97 99 Group1(ST1,ST81,ST82)

62

ST187

85

ST164 3

Group13(ST41,ST199) 99

99

CC49(ST49,ST53,ST70,ST161)

72

*

ST171

43

ST174

96 18

ST214

92

Group7(ST4,ST50,ST90) 99

ST8 ST158

85

#

ST170

74

ST150 ST178

99 63 99

#

ST179

63

ST205

91

ST145 ST181

84

ST159 *

71 97 55 92 40

Group4(ST17,ST203) 99 Group14(ST74,ST85)

ST88 ST212 ST146

90

ST147

90 91

CC3(ST2,ST3,ST27,ST42,ST51,ST192,exclude ST71,ST72) 96

ST211 ST207

94

#

ST209

99

ST208 # 99

0.005

ST210 ST208

ST202

75

ST207

99

#

42 95

83

ST211

96

97

ST210

1174

YU ET AL. Table 2. recA Gene Identities of Six Vibrio parahaemolyticus Strains to Other (Vibrio) Species Blast results

Isolates (ST)

Location in Location in query isolates reference strain

L140 (ST207)

11–727 2–726

71–788 2–726

ZS49, ZS62 (ST20)

11–729

116–834

56–729

56–729

1–714

61–774

2–726

2–726

11–729

116–834

56–729

56–729

1–727 11–615

61–788 25–629

L64 (ST209)

L84 (ST210)

KP34 (ST211)

Identities

Gaps

Species (accession number)

613/722 (84%) 9/722 (1%) Vibrio halioticoli (AJ842431) 601/736 (81%) 22/736 (2%) Vibrio parahaemolyticus (EU051550) 600/730 (82%) 22/730 (3%) Photobacterium mandapamensis (DQ648411) 554/685 (80%) 22/685 (3%) Vibrio parahaemolyticus (EU051529) 609/726 (83%) 24/726 (3%) Vibrio cincinnatiensis (AJ842398) 605/734 (82%) 18/734 (2%) Vibrio parahaemolyticus (EU051550) 601/731 (82%) 24/731 (3%) Photobacterium mandapamensis (DQ648411) 554/686 (80%) 24/686 (3%) Vibrio parahaemolyticus (EU051529) 619/735 (84%) 15/735 (2%) Vibrio fortis (AJ842423) 507/614 (82%) 18/614 (2%) Vibrio parahaemolyticus (EF601744)

Total score

E value

721 592

0.0 3.0e - 169

608

1.0e - 170

520

1.0e - 147

671

0.0

625

3.0e - 179

608

1.0e - 170

514

6.0e - 146

701 525

0.0 3.0e - 149

ST, sequence type.

The other loci had variable sites of 5.7%–9.1%. The low ratio of dN/dS suggests that negative selection was dominant in these house-keeping genes. Of the seven loci, six had the average per site r/m ratio between 0.67 and 1.23, indicating that recombination and mutation did not differ significantly on diversification of these loci. With recA, however, recombination appeared to play a much greater role than mutation in the generation of genetic heterogeneity in the strains tested, as evidenced by its average r/m ratio of 3.038 (Table 3). Discussion One of the major findings in this study was that seven clinical isolates belonging to ST3 linked with the pandemic CC3 (Supplementary Fig. S1) were also existent in China as early as in the year 2002, a year earlier than we previously reported (Vongxay et al., 2008) and as recently reported by Yan et al. (2011). However, it does not exclude the possibility that such strains could have emerged in China before 2002,

as was the case in Japan and Taiwan in 1996 (Okuda et al., 1997; Chiou et al., 2000). The CC3 strains, represented by pandemic O3:K6 serotype, were first found in India in 1996 and then in other Asian countries, the American continent, Europe, and even Africa (Martinez-Urtaza et al., 2005; Nair et al., 2007; Ansaruzzaman et al., 2008). However, in this study, no clonally related STs were identified between the clinical strains and seafood-related isolates that were TDH positive (Table 1). The number of thermostable direct hemolysin (TDH)-positive strains in this study seems to be low (about 8% from our strain collection). The low proportion of TDH-positive strains in our collection could be due to the difficulty in isolating TDH-producing colonies on agar plates as a result of relatively low proportion of TDH-producing V. parahaemolyticus to total V. parahaemolyticus in some samples (DePaola et al., 2000). However, there were clonal relationships with ST3, ST120, and ST189 between clinical strains from this study and those from environmental sources in other parts of China (Supplementary Table S2).

Table 3. Genetic Diversity of the Seven Loci of 71 Vibrio parahaemolyticus Isolates

Chromosome I

II

Concatenated

Fragment No. of No. of polymorphic % Variable Locus size (bp) alleles sites sites dnaE gyrB recA dtdS pntA pyrC tnaA

557 592 729 458 430 493 423 3682

44 52 48 45 37 34 32 61

46 54 255 69 31 28 28 511

8.26 9.12 34.98 15.07 7.2 5.68 6.62 13.88

dN/dS

Nucleotide diversity (per site) Pi (SD)

r/ma

0.0286 0.0055 0.0275 0.0052 0.0373 0.0401 0.0069 0.0219

0.01183 0.01391 0.05644 0.02780 0.01106 0.01170 0.01268 0.01991

0.7553 0.6763 3.0382 1.2321 0.7723 0.7932 0.9297 0.8297

(0.00083) (0.00057) (0.01358) (0.00356) (0.00072) (0.00085) (0.00096) (0.00301)

Recombination eventsb 0 0 6 0 0 0 0

a Based on all 71 isolates tested in this study. The ratio of probabilities that a given site is altered through recombination and mutation represents a measure of how important the effect of recombination is in the diversification of the sample relative to mutation. b recA recombination events using RDP3.44 (Martin et al., 2010).

VIBRIO PARAHAEMOLYTICUS CLONAL COMPLEX 3 STRAINS IN CHINA Nair et al. (2007) have indicated that there are currently 21 serotypes reported as having the same genetic lineage of pandemic O3:K6 as shown by identical ribotyping or PFGE profiles. A diverse set of serotypes belong to the same genetic lineage of ST3 in CC3 (Supplementary Table S2), suggesting that there might be horizontal transfer of genes associated with O or K antigen synthesis among V. parahaemolyticus strains in the environments. This was proposed as the mechanism of serotype diversification of V. cholerae with identical recA ST (Colin et al., 2000). Although the average per site r/m ratios were low (0.67–1.23) on six of the seven loci examined, diversification of the recA gene was three times more likely because of recombination than mutation (Table 3). RDP analysis on the recA loci of 71 isolates showed six unique recombination events, whereas there were none for the other six loci. This result seems to be contradictory to earlier studies that showed that recombination was more likely than mutation in genetic diversification of V. parahaemolyticus (Gonzalez-Escalona et al., 2008; Yan et al., 2011). Prediction results may vary with different software using different algorithms (Stumpf and McVean, 2003; Vos and Didelot, 2009), thus making comparison of data from the literature difficult. The ClonalFrame model estimates the relative probabilities that a nucleotide is changed as the result of recombination relative to point mutation, taking into account of fragment length and nucleotide diversity (Guttman and Dykhuizen, 1994; Didelot and Falush, 2007). Of the four branches in the phylogenetic tree, the vast majority were in the two major branches, in which the STs from this study were interspersed with those from other countries, indicating that these Chinese strains do not cluster by geographic origins but are as diverse as those from other countries. There were five STs in this study that fell off the major branches (Fig. 1). This divergence was largely attributable to high numbers of polymorphic sites (Table 3) in recA, as this locus alone gave the same phylogenetic structure as the branch in the major tree (Fig. 1 insert). The recA genes from strains of these STs were divergent from ‘‘typical’’ V. parahaemolyticus strains (Table 3), but had 82%–84% identity to V. fortis, V. tubiashii, or V. halioticoli and, more strikingly, 73.3%–76.9% identity to the CDS24 of a Vibrio sp. plasmid p23023 (GenBank Accession No. CP000755.1) (Hazen et al., 2007). These results appear to indicate that the recA gene could have been recombined by lateral gene transfer and/or gene conversion. Such possibility was suggested with the 16S rRNA genes of different strains of V. parahaemolyticus (Gonza´lez-Escalona et al., 2005). In conclusion, the present study reveals emergence of pandemic CC3 strains as early as in the year 2002 in China. The recA gene of some Chinese V. parahaemolyticus isolates was found divergent from a majority of the strains examined in this study most probably by recombination. Application of the MLST scheme to more V. parahaemolyticus strains and by different laboratories would facilitate a global picture of the epidemiology and genetic population structure of this seafood-borne pathogen. Acknowledgments This research was funded in part by the Key Project of National Science and Technology Pillar Program (2009BADB9B01) and the Natural Science Foundation of China (30571436 to W.F. and 30700605 to B.W.). The authors thank Dr. N. Gonzalez-

1175

Escalona at the Center for Food Safety and Applied Nutrition, Food and Drug Administration, for communicating with the sequence verification. The authors appreciate Dr. Lingling MEI at Zhejiang Provincial Center for Disease Control, Hangzhou, Zhejiang, for kindly providing the clinical strains. Disclosure Statement No competing financial interests exist. References Ansaruzzaman M, Chowdhury A, Bhuiyan NA, Sultana M, Safa A, Lucas M, von Seidlein L, Barreto A, Chaignat C-L, Sack DA, Clemens JD, Nair GB, Choi SY, Jeon YS, Lee JH, Lee HR, Chun J, and Kim DW. Characteristics of a pandemic clone of O3: K6 and O4: K68 Vibrio parahaemolyticus isolated in Beira, Mozambique. J Med Microbiol 2008;57:1502–1507. Chao G, Jiao X, Zhou X, Yang Z, Huang J, Pan Z, Zhou L, and Qian X. Serodiversity, pandemic O3:K6 clone, molecular typing, and antibiotic susceptibility of foodborne and clinical Vibrio parahaemolyticus isolates in Jiangsu, China. Foodborne Pathog Dis 2009;6:1021–1028. Chen Y, Zhang W, and Knabel SJ. Multi-virulence-locus sequence typing clarifies epidemiology of recent listeriosis outbreaks in the United States. J Clin Microbiol 2006;43:5291–5294. Chiou CS, Hsu SY, Chiu SI, Wang TK, and Chao CS. Vibrio parahaemolyticus serovar O3:K6 as cause of unusually high incidence of food-borne disease outbreaks in Taiwan from 1996 to 1999. J Clin Microbiol 2000;38:4621–4625. Chowdhury NR, Stine OC, Morris JG, and Nair GB. Assessment of evolution of pandemic Vibrio parahaemolyticus by multilocus sequence typing. J Clin Microbiol 2004;42:1280–1282. Colin SO, Sozhamannan S, Gou Q, Zheng SQ, Morris JG Jr., and Johnson JA. Phylogeny of Vibrio cholerae based on recA sequence. Infect Immun 2000;68:7180–7185. DePaola A, Kaysner CA, Bowers J, and Cook DW. Environmental investigations of Vibrio parahaemolyticus in oyster after outbreaks in Washington, Texas, and New York (1997 and 1998). Appl Environ Microbiol 2000;66:4649–4654. DePaola A, Ulaszek J, Kaysner CA, Tenge BJ, Nordstrom JL, Wells J, Puhr N, and Gendel SM. Molecular, serological, and virulence characteristics of Vibrio parahaemolyticus isolated from environmental, food, and clinical sources in North America and Asia. Appl Environ Microbiol 2003;69: 3999–4005. Didelot X and Falush D. Inference of bacterial microevolution using multilocus sequence data. Genetics 2007;175:1251–1266. Feil EJ, Li BC, Aanensen DM, Hanage WP, and Spratt BG. eBURST: inferring patterns of evolutionary descent among clusters of related bacterial genotypes from multilocus sequence typing data. J Bacteriol 2004;186:1518–1530. Foley SL, White DG, McDermott PF, Walker RD, Rhodes B, Fedorka-Cray PJ, Simjee S, and Zhao S. Comparison of subtyping methods for differentiating Salmonella enterica serovar Typhimurium isolates obtained from food animal sources. J Clin Microbiol 2006;44:3569–3577. Gonzalez-Escalona N, Martinez-Urtaza J, Romero J, Espejo RT, Jaykus LA, and DePaola A. Determination of molecular phylogenetics of Vibrio parahaemolyticus strains by multilocus xsequence typing. J Bacteriol 2008;190:2831–2840. Gonza´lez-Escalona N, Romero J, and Espejo T. Polymorphism and gene conversion of the 16S rRNA genes in the multiple rRNA operons of Vibrio parahaemolyticus. FEMS Microbiol Lett 2005;164:213–219.

1176 Guttman DS and Dykhuizen DE. Clonal divergence in Escherichia coli as a result of recombination, not mutation. Science 1994;266:1380–1383. Hazen TH, Wu DY, Eisen JA, and Sobecky PA. Sequence characterization and comparative analysis of three plasmids isolated from environmental Vibrio spp. Appl Environ Microbiol 2007;73:7703–7710. Kai MK, et al. Vibrio parahaemolyticus PulseNet PFGE protocol working group. Evaluation and validation of a PulseNet standardized pulsed-field gel electrophoresis protocol for subtyping Vibrio parahaemolyticus: an international multicenter collaborative study. J Clin Microbiol 2008,46:2766–2773. Librado P and Rozas J. DnaSP v5: A software for comprehensive analysis of DNA polymorphism data. Bioinformatics 2009;25: 1541–1542. Liu X, Chen Y, Wang X, and Ji R. Foodborne disease outbreaks in China from 1992 to 2001 national foodborne disease surveillance system. Wei Sheng Yan Jiu 2004;33:725–727. Maiden MC, Bygraves JA, Feil E, Morelli G, Russell JE, Urwin R, Zhang Q, Zhou J, Zurth K, Caugant DA, Feavers IM, Achtman M, and Spratt BG. Multilocus sequence typing: a portable approach to the identification of clones within populations of pathogenic microorganisms. Proc Natl Acad Sci U S A 1998; 95:3140–3145. Martin DP, Lemey P, Lott M, Moulton V, Posada D, and Lefeuvre P. RDP3: a flexible and fast computer program for analyzing recombination. Bioinformatics 2010;26:2462–2463. Martinez-Urtaza J, Lozano-Leon A, DePaola A, Ishibashi M, Shimada K, Nishibuchi M, and Liebana E. Characterization of pathogenic Vibrio parahaemolyticus isolates from clinical sources in Spain and comparison with Asian and North American pandemic isolates. J Clin Microbiol 2004;42:4672–4678. Martinez-Urtaza J, Simental L, Velasco D, DePaola A, Ishibashi M, Nakaguchi Y, Nishibuchi M, Carrera-Flores D, ReyAlvarez C, and Pousa A. Pandemic vibrio parahaemolyticus 03:K6, Europe. Emerg Infect Dis 2005;11:1319–1320. Nair GB, Ramamurthy T, Bhattacharya SK, Dutta B, Takeda Y, and Sack DA. Global dissemination of Vibrio parahaemolyticus serotype O3:K6 and its serovariants. Clin Microbiol Rev 2007;20:39–48.

YU ET AL. Nallapareddy SR, Duh RW, Singh KV, and Murray BE. Molecular typing of selected Enterococcus faecalis isolates: pilot study using multilocus sequence typing and pulsed-field gel electrophoresis. J Clin Microbiol 2002;40:868–876. Okuda J, Ishibashi M, Hayakawa E, Nishino T, Takeda Y, Mukhopadhyay AK, Garg S, Bhattacharya SK, Nair GB, and Nishibuchi M. Emergence of a unique O3:K6 clone of Vibrio parahaemolyticus in Calcutta, India, and isolation of strains from the same clonal group from Southeast Asian travelers arriving in Japan. J Clin Microbiol 1997;35:3150–3155. Pavlic M and Griffiths MW. Principles, applications, and limitations of automated ribotyping as a rapid method in food safety. Foodborne Pathog Dis 2009;6:1047–1055. Stumpf MP and McVean GA. Estimating recombination rates from population-genetic data. Nature Rev Genet 2003;4: 959–968. Tamura K, Dudley J, Nei M, and Kumar S. MEGA4: Molecular Evolutionary Genetics Analysis (MEGA) software version 4.0. Mol Biol Evol 2007;24:1596–1599. Vongxay K, Pan ZJ, Zhang XF, Wang SN, Cheng SY, Mei LL, Xu C, and Fang WH. Occurrence of pandemic clones of Vibrio parahaemolyticus isolates from seafood and clinical samples in a Chinese coastal province. Foodborne Pathog Dis 2008;5:127–134. Vos M and Didelot X. A comparison of homologous recombination rates in bacteria and archaea. ISME J 2009;3:199–208. Yan YF, Cui YJ, Han HH, Xiao X, Wong HC, Tan YF, Guo ZB, Liu XM, Yang RF, and Zhou DS. Extended MLST-based population genetics and phylogeny of Vibrio parahaemolyticus with high levels of recombination. Int J Food Micriobiol 2011; 145:106–112.

Address correspondence to: Weihuan Fang, Ph.D. Zhejiang University Institute of Preventive Veterinary Medicine Hangzhou, Zhejiang 310029 China E-mail: [email protected]