JOURNAL OF BACTERIOLOGY, Apr. 2005, p. 2638–2650 0021-9193/05/$08.00⫹0 doi:10.1128/JB.187.8.2638–2650.2005 Copyright © 2005, American Society for Microbiology. All Rights Reserved.
Vol. 187, No. 8
Diversity of Genome Structure in Salmonella enterica Serovar Typhi Populations† Sushma Kothapalli,1 Satheesh Nair,1 Suneetha Alokam,1 Tikki Pang,2 Rasik Khakhria,3‡ David Woodward,3 Wendy Johnson,4 Bruce A. D. Stocker,5§ Kenneth E. Sanderson,1* and Shu-Lin Liu1,6,7 Department of Biological Sciences1 and Department of Microbiology and Infectious Diseases,6 University of Calgary, Calgary, and Bacteriology and Enteric Diseases Program, National Microbiology Laboratory, Health Canada,3 and Cangene Corporation,4 Winnipeg, Manitoba, Canada; Research Policy and Cooperation, World Health Organization, Geneva, Switzerland2; Department of Medical Microbiology, Stanford University, Stanford, California5; and Department of Microbiology Peking University School of Basic Medical Sciences, Beijing, China7 Received 24 September 2004/Accepted 6 January 2005
The genomes of most strains of Salmonella and Escherichia coli are highly conserved. In contrast, all 136 wild-type strains of Salmonella enterica serovar Typhi analyzed by partial digestion with I-CeuI (an endonuclease which cuts within the rrn operons) and pulsed-field gel electrophoresis and by PCR have rearrangements due to homologous recombination between the rrn operons leading to inversions and translocations. Recombination between rrn operons in culture is known to be equally frequent in S. enterica serovar Typhi and S. enterica serovar Typhimurium; thus, the recombinants in S. enterica serovar Typhi, but not those in S. enterica serovar Typhimurium, are able to survive in nature. However, even in S. enterica serovar Typhi the need for genome balance and the need for gene dosage impose limits on rearrangements. Of 100 strains of genome types 1 to 6, 72 were only 25.5 kb off genome balance (the relative lengths of the replichores during bidirectional replication from oriC to the termination of replication [Ter]), while 28 strains were less balanced (41 kb off balance), indicating that the survival of the best-balanced strains was greater. In addition, the need for appropriate gene dosage apparently selected against rearrangements which moved genes from their accustomed distance from oriC. Although rearrangements involving the seven rrn operons are very common in S. enterica serovar Typhi, other duplicated regions, such as the 25 IS200 elements, are very rarely involved in rearrangements. Large deletions and insertions in the genome are uncommon, except for deletions of Salmonella pathogenicity island 7 (usually 134 kb) from fragment I-CeuI-G and 40-kb insertions, possibly a prophage, in fragment I-CeuI-E. The phage types were determined, and the origins of the phage types appeared to be independent of the origins of the genome types.
S. enterica serovar Typhi is more homogeneous than most serovars of Salmonella. Using multilocus enzyme electrophoresis, Reeves et al. (54) and Selander et al. (58) showed that S. enterica serovar Typhi strains constitute only one or two clones that are widely separated from the other serovars in subspecies I. Membrane protein profiles (15, 17) and plasmids (42) show homogeneity. Multilocus sequence typing of housekeeping genes has suggested that S. enterica serovar Typhi evolved only about 50,000 years ago from other Salmonella serovars (30). The orders of orthologous genes in Escherichia coli K-12 and S. enterica serovar Typhimurium LT2 are almost identical, although the genera diverged about 100 to 160 million years ago (38, 48, 59, 62). Within the genus Salmonella, the gene order of S. enterica serovar Paratyphi B (33) and the gene order of S. enterica serovar Enteritidis (36) are very similar to the gene order of S. enterica serovar Typhimurium LT2, and chromosomes are also conserved in 17 independent strains of S. enterica serovar Typhimurium (39). During growth in laboratory culture, duplications of segments of the chromosome occur at high frequencies (10⫺2 to 10⫺5) (4, 23), and some inversions, especially those with endpoints in the rrn operons, are common (22). Such remarkable conservation of the chromosome during evolution, in spite of the high frequency of
Salmonella enterica serovar Typhi is host restricted, for it grows only in humans, where it causes typhoid enteric fever (13, 51). The annual global incidence of typhoid fever is estimated to be 21.6 million cases, with more than 220,000 deaths (10). The emergence of antibiotic-resistant strains (8) and the increased incidence of typhoid fever in human immunodeficiency virus type 1-infected persons are causes for concern. The genus Salmonella is separated into two species and more than 2,500 serovars (52) on the basis of the somatic and flagellar antigens. Many of the serovars, such as S. enterica serovar Typhimurium, are host generalists, growing in many different animal species and humans and causing gastroenteritis.
* Corresponding author. Mailing address for Kenneth E. Sanderson: Department of Biological Sciences, University of Calgary, Calgary T2N 1N4, Canada. Phone: (403) 220-6792. Fax: (403) 289-9311. E-mail:
[email protected]. Mailing address for Shu-Lin Liu: Department of Microbiology and Infectious Diseases, University of Calgary, Calgary T2N 1N4, Canada. Phone: (403) 220-3799. Fax: (403) 270-2772. E-mail:
[email protected]. † Supplemental material for this article may be found at http://jb .asm.org/. ‡ Present address: 32 Amberwood Crescent, Nepean, Ontario, Canada. § Deceased. 2638
GENOME STRUCTURE IN S. ENTERICA SEROVAR TYPHI
VOL. 187, 2005
rearrangements in culture, may have resulted from strong selective pressures that selectively removed rearranged genomes. Pulsed-field gel electrophoresis (PFGE) permits rapid construction of genomic maps (16); partial digestion with the endonuclease I-CeuI shows the number and locations of the rrl genes for 23S rRNA and the order of adjacent fragments (34) (the rrn skeleton). I-CeuI, which is encoded by a class I mobile intron in the rrl gene for the large-subunit rRNA (23S-rRNA) in the chloroplast DNA of Chlamydomonas eugamatos (44), digests a 19-bp sequence in all seven rrl genes of enteric bacteria (43). The rrn skeleton is highly conserved in enteric bacteria, so related strains usually yield identical fingerprints (39). Surprisingly, in view of the homogeneity in many properties, independent wild-type strains of S. enterica serovar Typhi show significant genomic rearrangements. The I-CeuI fragments of S. enterica serovar Typhi strains were shown by PFGE to be in many different orders, called genome types (40), that are mediated through recombination between the seven rrn genes that code for rRNA. Partial I-CeuI digestion can determine the order but not the orientation of the I-CeuI fragments. Vi phage typing has high discriminatory power for subdivision of strains of S. enterica serovar Typhi, and the method of Craigie and Felix (9) has allowed over 100 different phage types to be recognized (49). Vi phage absorbs to the Vi (virulence) exopolysaccharide and adapts itself to the last strain in which it was propagated. In this study we determined the phage types of strains to determine if they are independent of or correlated with genome types. In this study we used PFGE with I-CeuI to determine the genome types of a set of strains of S. enterica serovar Typhi which had been assembled from a variety of sources; we then determined the orientation of the I-CeuI fragments using PCR. In addition, we determined the Vi phage types and the flagellar antigens of the strains. The sizes of the seven fragments (the I-CeuI fingerprint) are indistinguishable in most strains, so they are characteristic of the species, but the sizes of a few fragments are increased or decreased due to insertions or deletions. We show here that the phage type is largely independent of the genome type. The need for chromosome balance between the two replichores and for maintaining appropriate gene dosage appears to restrict the range of genomic rearrangements. MATERIALS AND METHODS Bacterial strains and cultivation conditions. The S. enterica serovar Typhi strains and their sources are shown in Table 1. All strains were maintained in 15% glycerol at ⫺70°C in the collection of the Salmonella Genetic Stock Center (www.ucalgary.ca/⬃kesander), and single-colony isolates were isolated prior to use. The strains were grown at 37°C in Luria-Bertani medium; solid media contained 1.5% agar. Enzymes and chemicals. Endonucleases were obtained from New England Biolabs (AvrII [⫽ BlnI], I-CeuI, and SpeI) and Boehringer-Mannheim (XbaI). Taq polymerase and deoxynucleoside triphosphates were obtained from Amersham. Other chemicals, including Luria-Bertani medium and agarose, were obtained from Sigma Chemicals. Endonuclease digestion and PFGE methods. Preparation of high-molecularweight genomic DNA, endonuclease cleavage of DNA in agarose blocks, separation of the DNA fragments by PFGE, and double-digestion techniques were performed as described previously (32, 35, 37). For digestion by I-CeuI, including partial digestion, we used the methods described previously (39). Primers. The primers were designed based upon the sequence of DNA flanking each of the seven rrn operons in S. enterica serovar Typhi CT18 (50) (Gen Bank accession no. NC_003198) by using the Primer3 program (http:
2639
//frodo.wi.mit.edu/cgi-bin/primer3/primer3_www.cgi/), and they were synthesized by the University Core DNA Services (Health Sciences Centre, University of Calgary). The sequences of the primers used in this study and their locations relative to rrn operons are shown in Table S1 in the supplemental material. The locations of the primers in the genomes of S. enterica serovar Typhimurium LT2 and S. enterica serovar Typhi CT18 are shown in Fig. 1. These primers were used in different combinations to amplify the rrn operons. PCR amplification and agarose gel electrophoresis. Chromosomal DNA used in PCR was isolated with a Wizard genomic DNA purification kit (Promega) used in accordance with the manufacturer’s instructions. Each PCR was carried out by using a HotStart storage and reaction tube (Gordon Technologies Inc.) in an Eppendorf gradient thermal cycler. In each 50-l (total volume) PCR mixture, 20 l was the lower mixture and 30 l was the upper mixture. The lower mixture contained 250 ng of template DNA, 1 l of each primer (0.4 M), and 2 l of deoxynucleoside triphosphates (0.4 mM) (the concentrations of the components of the lower mixture were calculated based on the 50-l reaction mixture), and the final volume was adjusted with doubledistilled water. Denaturation was done at 90°C for 30 s to melt the wax pellet. After the mixtures were cooled to room temperature, the reactions were initiated by addition of 5 l of 1⫻ PCR buffer, 3 l of MgCl2 (1.5 mM), and 2.5 U of Taq DNA polymerase (0.25 l), and 21.75 l of double-distilled water was added (upper mixture) (the concentrations of the components of the upper mixture were calculated based on the 50-l reaction mixture). PCR amplification was performed with 30 cycles of denaturation at 96°C for 1 min, annealing at 57°C for 1 min, and extension at 72°C for 10 min, followed by a final extension at 72°C for 10 min. The PCR product was electrophoresed at 65 V on a 1% agarose gel in 0.5⫻ Tris-borate-EDTA buffer (45 mM Tris, 45 mM boric acid, 10 mM EDTA [pH 8]) with 1 g of ethidium bromide per ml. Following electrophoresis the gel was photographed under UV light. Computer methods. The individual fragment sizes were estimated by using the S. enterica serovar Typhi CT18 genome sequence (50) (GenBank accession no. NC_003198) and the BLAST Program produced by National Center for Biotechnology Information, Bethesda, Md. (www.ncbi.nlm.nih.gov/BLAST). Phage typing and serotyping. Bacteriophage typing was performed at the National Microbiology Laboratory (formerly the Laboratory Centre for Disease Control), Health Canada, as reported previously by Khakhria et al. (29), by using the methods and scheme described by Anderson and Williams (3) and the Vi phage of S. enterica serovar Typhi. Serotyping was performed at the National Microbiology Laboratory, Health Canada, by the methods described in a report on the Kauffmann-White scheme (53).
RESULTS Partial digestion by I-CeuI in S. enterica serovar Typhi. Partial digestion yielded the seven bands expected from complete digestion plus other bands resulting from a failure to cleave between adjacent fragments; representative data are shown in Fig. 2. All strains produced seven fragments (fragments A to G), which ranged from 44 to about 2,400 kb long, and usually the lengths were indistinguishable in different strains (with the exception of fragment G in strain SARB64 [Fig. 2, lane 2], which was about 130 kb smaller than the normal fragment G). However, the fragments resulting from partial digestion were different in different strains, indicating that the fragments in the chromosome are in different orders. For example, the following partial digestion bands were observed in Fig. 2, lanes 1 and 6: DF, EF, and DEF. These data indicate that the order is EFD. The order of these three fragments is different in other strains. For example, the partial digestion bands DF, DE, and DEF were observed in Fig. 2, lane 4; these data indicate that the order is fragment FDE. The order of many of the other fragments could be determined from the same gel. Some of the expected partial digestion fragments could not be recognized in other lanes because the gel was overloaded; in order to determine the order of all
2640
KOTHAPALLI ET AL.
J. BACTERIOL.
TABLE 1. Phenotypic and genotypic characteristics of S. enterica serovar Typhi strains used in this study Straina
SGSC no.b
Source (reference)c
26.001 26.003 26.004
3124 3126 3127
NML NML NML
26.005
3128
NML
26.006 26.007 26.008 26.009 26.010 26.011 26.012 26.015 26.016 26.017 26.018 26.019 26.020 26.021 26.022 26.023 26.024 26.027 26.028 26.029 26.03 26.031 26.032 26.033 26.034 26.035 26.037 26.038 26.04 26.041
3129 3130 3131 3132 3133 3134 3135 3138 3139 3140 3141 3142 3143 3144 3145 3146 3147 3150 3151 3152 3153 3154 3155 3156 3157 3158 3160 3161 3163 3164
NML NML NML NML NML NML NML NML NML NML NML NML NML NML NML NML NML NML NML NML NML NML NML NML NML NML NML NML NML NML
26.042
3165
NML
26.043 26.044 26.045 26.047 26.048 26.049 26.05 26.051 26.054 26.055 26.056 25.035 25.036 25.037 25.039 25.04
3166 3167 3168 3170 3171 3172 3173 3174 3177 3178 3179 2667 2668 2669 2671 2672
NML NML NML NML NML NML NML NML NML NML NML NML NML NML NML NML
25.041 25.042 ST60 ST24A ST143 ST145 ST308 ST1002 ST168 ST309 ST1106 ST495 In14 In15 3123 3125 T189 T202
2673 2674 2770 2771 2773 2774 2775 2776 2777 2779 2780 2781 2782 2783 3184 3185 3187 3189
NML NML Pang Pang Pang Pang Pang Pang Pang Pang Pang Pang Pang Pang Pang Pang Pang Pang
Locality of isolation
Manitoba Alberta British Columbia (India) British Columbia (India) British Columbia British Columbia British Columbia Manitoba Manitoba Alberta (Kenya) Manitoba (India) British Columbia British Columbia British Columbia British Columbia Alberta British Columbia British Columbia Alberta Alberta Ontario Quebec Quebec Quebec Quebec Quebec Quebec Quebec Quebec Quebec British Columbia British Columbia British Columbia Alberta (El Salvador) Alberta (California) Alberta British Columbia British Columbia British Columbia British Columbia British Columbia Alberta British Columbia Quebec Quebec Quebec Manitoba Alberta British Columbia British Columbia British Columbia (Guatamala) Alberta Ontario Malaysia Malaysia Malaysia Malaysia Malaysia Malaysia Malaysia Malaysia Malaysia Malaysia Indonesia Indonesia Chile Chile Thailand Thailand
Year of isolation
Flagellar antigend
Phage typee
Fragment order in genomef
PCR orderg
Genome typeh
Altered 1-Ceu-I fragment(s)i
⫺ ⫺ ⫺
E1 UT(Vi-ve) I⫹IV
BCFEDG BCDEFG BCEFDG
A⫺C⫹ A⫺C⫹ A⫹C⫹
2 1 3
G, ⫺130 (SP17⫺)
⫹
⫺
E1
BCFEDG
A⫺C⫹
2
1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994
⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹
⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺
UT(Vi-ve) A D2 B1 E1 B2 O J1 DVS B1 B2 A Atypical O I⫹IV UT(Vi-ve) E1 46 DVS UT(Vi-ve) I⫹IV Atypical I⫹IV Atypical I⫹IV I⫹IV I⫹IV E1 M3 B3
GFCEDB BCEFDG BCDFEG GFCEDB BCFEDG BCEFDG BCFDEG BCEFDG BCFDEG BCDFEG BCEFDG BCEDFG BCEDFG BCFDEG BCEFDG BCEFDG BCFEDG BCFEDG BCFDEG BCEFDG BCEFDG GDCEFB GECFDB BCEFDG BCEFDG BCEFDG BCFEDG BFCEDG GDCEFB BCEFDG
A⫺C⫹ A⫺C⫹ A⫹C⫹ A⫹C⫺ A⫺C⫹ A⫺C⫹ A⫹C⫹ A⫹C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫹C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫹C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫺ A⫹C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫹C⫹ A⫺C⫹ A⫹C⫹
16 3 4 16 2 3 6 3 6 4 3 5 5 6 3 3 2 2 6 3 3 19 24 3 3 3 2 14 19 3
G, ⫹50 B, ⫹80
1994
⫹
⫺
B2
BCEFDG
A⫺C⫹
3
E, ⫹40
1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1994 1993 1993 1993 1993
⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹
⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺
DVS F4 D8 A variant B1 B1 I⫹IV DVS E1 Atypical F1 F1 E1 D5 O E1
BCEFDG BCFEDG BCEFDG BCEFDG BCFEDG GCEDFB GCFEDG BCDEFG BCFEDG BCFEDG GECDFB BDCFEG BCEFDG BCEFDG BCFDEG BCDFEG
A⫹C⫹ A⫺C⫹ A⫹C⫹ A⫹C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫺ A⫺C⫹ A⫺C⫺ A⫹C⫹ A⫹C⫹
3 2 3 3 2 11 2 1 2 2 23 18 3 3 2 4
B, ⫹80 B, ⫹90
1993 1993 1986 1986 1994 1994 1987 1987 1987 1987 1987 1987 1994 1994 1983 1983 1990 1990
⫹ ⫹ ⫹ ⫹ ⫹ ⫺ ⫹ ⫹ ⫹ ⫹
⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺
⫹ ⫹ ⫹
⫺ ⫺ ⫺ ⫺ ⫺ ⫺
BCEFDG BCFEDG BCFEDG BCEFDG BCDFEG BCEFDG BCEFDG GCEFDB BCEFDG BCEFDG BCDFEG BCEFDG BCEFDG BCEFDG BCEFDG BCEFDG GCEDFB BCEFDG
A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫹C⫹ A⫹C⫹ A⫹C⫹ A⫹C⫹ A⫺C⫹ A⫺C⫹ A⫹C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫹C⫹ A⫹C⫹ A⫺C⫹ A⫹C⫹
3 2 2 3 4 3 3 9 3 3 4 3 3 3
⫹ ⫹ ⫹
B2 E1 C4 DVS D2 I⫹IV E1 E1 UT(Vi-ve) E1 D1 B1 Atypical D2 rough 46 N UT(Vi-ve)
d
J
1994 1993 1993
⫹ ⫹ ⫹
1994
3 11 3
B, ⫹88
B, ⫹80
B, ⫹80 B, ⫹15 G, ⫹80
E, ⫹40; B, ⫹20 G, ⫹20 E, ⫹30 E, ⫹40; G, ⫺70 B, ⫹15 E, ⫹40
G, ⫺130 (SP17⫺)
Continued on following page
GENOME STRUCTURE IN S. ENTERICA SEROVAR TYPHI
VOL. 187, 2005
2641
TABLE 1—Continued Straina
SGSC no.b
Source (reference)c
Locality of isolation
Thailand Indonesia Indonesia Indonesia Papua New Guinea Papua New Guinea Papua New Guinea
Year of isolation
Flagellar antigend
Fragment order in genomef
PCR orderg
Genome typeh
UT(Vi-ve) 53 A C3 D2
BCDEFG BDCEFG GCEFDB BCEFDG BCFEDG
A⫺C⫹ A⫹C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹
1 17 9 3 2
⫺
D2
BCFEDG
A⫺C⫹
2
⫹
⫺
D2
BCEFDG
A⫹C⫹
2
⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹
⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺
I⫹IV A A M1 UT(Vi-ve) UT(Vi-ve) A G1 K1 T A UT(Vi-ve) B2 M1 DVS A O I⫹IV 46
BDCFEG GCDEFB GCDEFB BFCDEG BFDCEG BCEFDG BCFDEG BECFDG BCFDEG BCEFDG CBEFDG BDCEFG BCEFDG ECBFDG BCEFDG BCFEDG BCFDEG BCFDEG BFECDG
A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫹C⫹ A⫹C⫺ A⫹C⫹ A⫺C⫹ A⫺C⫹ A⫹C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫹C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹
18 7 7 13 23 3 6 22 6 3 25 19 3 26 3 2 6 6 19
⫹ ⫹
⫺ ⫺
E1 E1
⫹
⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫹ ⫹
E2 UT(Vi-ve) A D9 M1 46 E1 I-IV UT(Vi-ve)
GCEFDB GCEFDB BCEFDG GCDEFB BFCEDG BCDEFG GDCEFB BCEFDG BCEFDG BCFEDG BCFEDG BCEFDG BCEFDG
A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫹C⫹ A⫺C⫹ A⫹C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹
9 9 3 7 14 1 19 3 3 2 2 3 3
d
J
1990 1992 1992 1992 1994
⫹ ⫹ ⫹ ⫹ ⫹
⫺ ⫺ ⫺ ⫺ ⫺
1994
⫹
1994
Phage typee
T104 In4 In20 In24 PNG30
3188 3190 3191 3192 3193
Pang Pang Pang Pang Pang
PNG31
3194
Pang
PNG32
3195
Pang
ST1 CC6 CC7 382-82 9032-85 1707-81 1196-74 3434-73 3137-73 3815-73 SARB63 SARB64 SA4825 PL27566 PL45838 PL73203 57639-199 Lysin⫹SA4 ISP1820 (ATCC 55047) Ty2 200Ty 403Ty 541Ty R1637 R1962 R1167 R136 R2101 R70 R196 414Ty 415Ty
2728 3198 3199 2664 2663 2661 2662 2658 2660 2659 2520 2521 2655 2990 2991 2992 2682 2683 2406
Pang Ho Ho CDC CDC CDC CDC CDC CDC CDC Selander Selander ProvLab ProvLab ProvLab ProvLab Hone (25)
Chile
1983
2408 2272 2273 2758 2692 2693 2694 2695 2696 2697 2698 3212 3213
Hone (25) Stocker Stocker Stocker ProvLab ProvLab ProvLab ProvLab ProvLab ProvLab ProvLab Stocker Stocker
USSR
1918
Australia The Netherlands
1981 1982
⫹ ⫹ ⫹ ⫹ ⫹ ⫺ ⫺
416Ty 417Ty 418Ty 419Ty 420Ty 421Ty
3214 3215 3216 3217 3218 3219
Stocker Stocker Stocker Stocker Stocker Stocker
Japan New Caledonia The Netherlands The Netherlands Japan France
1982 1982 1988 1988 1982 1984
⫺ ⫺ ⫺ ⫹ ⫹ ⫹
⫹ ⫹ ⫹ ⫺ ⫺ ⫺
UT(Vi-ve) HIV I⫹IV I⫹IV UT(Vi-ve) UT(Vi-ve)
BCEFDG BECFDG BCEFDG BCEFDG BCEFDG BCEFDG
A⫺C⫹ A⫺C⫺ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹
3 22 3 3 3 3
422Ty 423Ty 424Ty 425Ty 444Ty 445Ty 446Ty 447Ty 701Ty 702Ty TYT1668 TYT1669
3220 3221 3222 3223 3224 3225 3226 3227 3485 3486 3487 3488
Stocker Stocker Stocker Stocker Stocker Stocker Stocker Stocker Stocker Stocker Mora Mora
The Netherlands Australia The Netherlands
1988 1981 1988
⫹ ⫺ ⫹ ⫺ ⫹
⫺ ⫹ ⫺ ⫹ ⫺
I⫹IV I⫹IV I⫹IV I⫹IV I⫹IV
⫺ ⫹
⫹ ⫺
I⫹IV I⫹IV
Chile Chile
⫹ ⫹
⫺ ⫺
M1 UT(Vi-ve)
BCEFDG BCEFDG BCEFDG BCEFDG BCEFDG BCEFDG BCEFDG BCEFDG CEFBDG BCEFDG BECDFG BCFDEG
A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫹C⫹ A⫹C⫹
3 3 3 3 3 3 3 3 27 3 21 6
TYT1670 TYT1671 TYT1672 TYT1673 TYT1674 TYT1675 TYT1676 TYT1677 ST318 CT18
3489 3490 3491 3492 3493 3494 3495 3496 2778 4072
Mora Mora Mora Mora Mora Mora Mora Mora Pang Sanger Centre (50)
Chile Chile Chile Chile Chile Chile Chile Chile Malaysia Vietnam
⫹
⫺
46
⫹ ⫹ ⫹ ⫹ ⫹ ⫹ ⫹
⫺ ⫺ ⫺ ⫺ ⫺ ⫺ ⫺
E1 F8 E1 E1 E1 F8 Atypical
BCEFDG BCEFDG GECDFB BCEFDG GECDFB GECDFB GECDFB BCEDFG BCEFDG
A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫺C⫹ A⫹C⫹
3 3 23 3 23 23 23 5 3 9
Thailand Thailand Marshall Islands Taiwan Liberia Mexico Peru India Unknown Dakar, Senegal Dakar, Senegal Calgary, Canada
1988 1988 1993 1994 1995 1990
Altered 1-Ceu-I fragment(s)i
G, ⫺130 (SP17⫺) B, ⫹80 E, ⫹40
G, ⫺130 (SP17⫺)
G, ⫹15 B, ⫹80
G, ⫺130 (SP17⫺) E, ⫹40 B, ⫹20 B, ⫹80
G, ⫺130 G, ⫺130 (SP17⫺) E, ⫹40
E, ⫹40 E, ⫹40; G, ⫺130 (SP17⫺) E, ⫹40 E, ⫹40; G, ⫺130 E, ⫹40 E, ⫹40 E, ⫹40 E, ⫹40; G, ⫺130 (SP17⫺) E, ⫹40 E, ⫹40 E, ⫹40 E, ⫹40 E, ⫹40 E, ⫹40 E, ⫹40 E, ⫹40 E, ⫹40 E, ⫹40 G, ⫺170 (part of SP17 deleted)
G, ⫹80
Continued on following page
2642
KOTHAPALLI ET AL.
J. BACTERIOL. TABLE 1—Continued
a
Designation used in the laboratory which provided the strain. b Strain number used at the Salmonella Genetic Stock Centre. c NML, National Microbiology Laboratory, Winnipeg, Canada (formerly Laboratory Centre for Disease Control) (R. Khakhria and David Woodward); Pang, T. Pang, Research Policy and Cooperation, World Health Organization, Geneva, Switzerland; Ho, M. Ho, Department of Microbiology and Infectious Diseases, University of Calgary; CDC, Centers for Disease Control (J. J. Farmer) (54); Selander, Robert K. Selander, Pennsylvania State University (part of SARB set, Salmonella Reference B) (6); ProvLab, Southern Alberta Provincial Lab (C. Anand); Hone, D. Hone, Center for Vaccine Development, Baltimore, Md. (25) (H238.2 is ISP1820 with aroD1013, and H251.1 is Ty2 with aroC101); Stocker, B. A. D. Stocker, Stanford University; Mora, G. Mora, University of Chile, Santiago, Chile. d Flagellar antigens d and j are two alternative states of the phase 1 antigen. e Phage typing with Vi typing phage II of Craigie and Felix (9) was done at the Laboratory Centre for Disease Control in Ottawa, Canada. f Fragment order was determined by partial digestion of DNA with endonuclease I-CeuI and separation of the fragments by PFGE; the data are the order of I-CeuI fragments. g Order and orientation of the A and C fragments as determined by PCR. A plus sign indicates that the fragment is in the normal (uninverted) orientation, and a minus sign indicates that the fragment is in the inverted orientation. h Specific order of I-CeuI fragments, as shown in Fig. 4. i Altered I-CeuI fragments and their sizes (in kilobases) are indicated. All of the strains of S. enterica serovar Typhi yielded seven fragments following complete digestion with I-CeuI, and in most strains the sizes of these fragments were indistinguishable from the sizes observed in strain Ty2 (12) and CT18 (50). In certain strains the sizes of the fragments differed by the amounts indicated; larger sizes are indicated by a plus sign, and smaller sizes are indicated by a minus sign. Deletion of the I-Ceul-G fragment resulted in loss of all or part of Salmonella pathogenicity island 7 (SPI7), a 134-kb island which has the viaB operon for Vi exopolysaccharide (47). The sizes of all of the fragments not listed are not distinguishable from the sizes of the normal fragments.
fragments, several gels with different loading and electrophoresis conditions were run if necessary. A set of 136 strains of S. enterica serovar Typhi which were assembled from a variety of sources (Table 1) was analyzed by partial digestion with I-CeuI, as shown in Fig. 2. Many different arrangements of I-CeuI fragments were detected; these arrangements apparently resulted from inversions and translocations following recombination between rrn operons, as illustrated in Fig. 3. Figure 4 shows I-CeuI fragments A, B, C, D, E, F, and G as a contiguous block arranged in different orders; 27 different genome types are shown, some of which were not detected in the 136 strains. This linear unit is joined at both ends through fragment A to produce a circular chromosome, as shown in Fig. 1. Genome types 1 to 6 had all possible rearrangements of the three small fragments, fragments D, E, and F; all of these rearrangements were detected, although some were much more frequent than others. Genome type 1 (I-CeuI-ABCDE FG) is most common in the enteric bacteria; e.g., it has been observed for S. enterica serovar Typhimurium LT2 (39), for 17 wild-type strains of S. enterica serovar Typhimurium (41), for S. enterica serovar Enteritidis (36), and for S. enterica serovar Paratyphi B (33), as well as for the following strains of E. coli whose complete sequences have been deposited in the GenBank database: E. coli K-12, E. coli CFT073, and E. coli O157:H7 (strains EDL933 and Sakai). However, only 4 of 136 strains of S. enterica serovar Typhi are genome type 1. Genome type 3 (I-CeuI-BCEFDG) is by far the most common genome type, represented by 59 strains. Translocation of the fragments to new locations could result from deletion of a fragment due to homologous recombination between rrn operons, followed by insertion of the fragment in a different rrn operon (Fig. 3). Genome types 7 to 12 involve the same arrangements of the D, E, and F fragments, but fragments B and G are inverted, presumably due to a crossover between rrnD and rrnE; these types are less common than genome types 1 to 6. Strain Ty2, which is widely used as a wild-type strain, is a genome type 9 organism; the detailed genomic cleavage map for the enzymes XbaI, BlnI, SpeI, and I-CeuI for Ty2 (38), the partial I-CeuI digestion data (this study), and the complete nucleotide sequence (12) all confirm the same order. Genome types 13 to 16 are types in which I-CeuI-F has been translocated to a position
to the left of I-CeuI-C; genome types 17 to 20 and 21 to 24 represent equivalent translocations of I-CeuI-D, and I-CeuI-E, respectively. Genome types 13 to 24 are uncommon, and some of these types were not detected in the sample studied, although they might be found in a larger sample. All combinations in which I-CeuI-B and -G are adjacent to fragment A are represented in genome types 1 to 24; genome types 25 to 27 are three of the rarely encountered types in which a different fragment is adjacent to fragment A. PCR to detect chromosomal rearrangements in independent wild-type strains of S. enterica serovar Typhi. Partial digestion with I-CeuI determines the order of fragments, but not their orientation. The orientation of rrn operons limits the types of rearrangements which can be formed. The chromosome is composed of two replichores, and replication begins at oriC and proceeds bidirectionally to the termination of replication (Ter). For example, in S. enterica serovar Typhimurium LT2 (Fig. 1A) replichore 1 contains rrnCABEH and replichore 2 contains rrnDG, and all of these genes are oriented so that they are transcribed from oriC toward Ter. Homologous recombination between rrn operons to produce inversions or translocations can occur only between rrn operons in the same orientation; thus, the orientation of fragments which have two similarly oriented rrn operons at their ends, such as I-CeuI-B, -D, -E, -F, and -G, can be predicted. However, I-CeuI-C (which contains oriC) has two rrn operons at its ends transcribed away from oriC, and I-CeuI A (which contains Ter) has the two rrn operons at its ends transcribed toward Ter. Thus, both fragments can be inverted by recombination, which can occur between rrn operons at their ends (Fig. 1 and 3); therefore, the orientation of these fragments cannot be predicted by PFGE methods. Recombination between other rrn operons in different replichores results in inversions (Fig. 3). Therefore, we used PCR analysis (as first described by Helm and Maloy [20]) to confirm the order of the seven I-CeuI fragments analyzed by PFGE and to determine their orientations. All 14 primers were designed from the S. enterica serovar Typhi CT18 sequence (Fig. 1B; see Table S1 in the supplemental material) and are located inside the genes which are adjacent to each of the seven rrn operons on either side. These primers were used in different pairwise combinations, based on the order of I-CeuI fragments predicted from PFGE analysis
VOL. 187, 2005
GENOME STRUCTURE IN S. ENTERICA SEROVAR TYPHI
FIG. 1. Location and order of the seven I-CeuI fragments on the chromosome (the rrn skeleton). oriC is the site of initiation of bidirectional replication; Ter is the termination site. The numbers with arrows represent the different primer combinations used to amplify the seven rrn operons (indicated by arrows outside the circles). The numbers outside the circles indicate the sizes of the I-CeuI fragments (in kilobases) based on the previously published sequences (45, 50). (A) S. enterica serovar Typhimurium LT2. (B) S. enterica serovar Typhi CT18.
(Fig. 2); appropriate pairwise combinations resulted in ca. 6-kb amplicons containing the rrn operons. Since the orientations of fragments A and C were unknown, two primer combinations might work for these two fragments. PFGE indicated that the order of I-CeuI fragments for S. enterica serovar Typhi 26.047 was ABCEFDG (genome type 3) (Table 1). Template DNA from this strain gave successful amplification with the following primer pairs, indicating that specific I-CeuI fragments are adjacent: primers 3 and 4 (AB), primers 15 and 12 (BC), primers 14 and 13 (CE), primers 6 and 5 (EF), primers 10 and 9 (FD), primers 16 and 11 (DG), and primers 8 and 7 (GA) (Fig. 5A). Primer pairs 7-4 and 3-8, as well as primer pairs 15-14 and 12-13, would have produced an amplicon if the A and C fragments, respectively, were inverted; all these combinations failed. This indicates that both fragment A and fragment C are in the orientation present in most strains of Salmonella and E. coli. This was illustrated by the locations of the pro and his genes in I-CeuI-A and the location of oriC in fragment C at the clockwise end; we called these orientations A⫹C⫹ (Fig. 5A).
2643
FIG. 2. Partial digestion of DNA of strains of S. enterica serovar Typhi with endonuclease I-CeuI, separation by PFGE, and staining with ethidium bromide. The gel is shown on the left. The fragments are shown on the right, and the inferred composition and sizes (in kilobases) are indicated. Lane 1, strain SARB63 (fragment order, CBEFDG; genome type 25); lane 2, SARB64 (fragment order, BDCEFG; genome type 19); lane 3, 27566 (fragment order, ECBFDG; genome type 26); lane 4, SA4864 (fragment order, BCFDEG; genome type 6); lane 5, SA4665 (fragment order, GFCEDB; genome type 16); lane 6, Ty2 (fragment order, GCEFDB; genome type 9).
When the same primer pairs were used with genomic DNA of S. enterica serovar Typhi 425Ty as the template (genome type 3) (Table 1), seven PCR amplicons were again produced (Fig. 5B). Primer pairs for amplification of the rrn operons between the following fragments produced the same pattern that was observed with S. enterica serovar Typhi 26.047 (Fig. 5A): B and C, C and E, E and F, F and D, and D and G. However, primer pairs 7-4 and 3-8 yielded PCR amplicons, whereas primer pairs 3-4 and 7-8 did not, indicating that the A fragment was inverted. The lack of amplicons for primer pairs 15-14 and 12-13 indicated that fragment C was in the normal orientation (uninverted); this structure is represented by A⫺C⫹ (Fig. 5B). The order and orientation of the seven rrn operons were determined in the same way by using template DNA from all 136 wild-type strains of S. enterica serovar Typhi previously tested by PFGE. In all cases the template DNA yielded seven PCR amplicons, and the order agreed with the order of fragments determined by PFGE and with the orientation of fragments B, D, E, F, and G (inferred from the polarity of the rrn operons). The genome type and orientation of fragments A and C are indicated for each strain in Table 1 and are summarized in Fig. 4, which shows the number of strains for each
2644
KOTHAPALLI ET AL.
FIG. 3. Proposed model of genomic rearrangements due to homologous recombination between rrn operons resulting in inversions or translocations. oriC is indicated by a shaded circle in fragment C (which corresponds to I-CeuI-C), and Ter is indicated by a shaded square in fragment A (I-CeuI-A). pro (proline requirement) and his (histidine requirement) indicate the positions of standard genes. (A) Both the A and C fragments are in the normal, uninverted orientation (A⫹C⫹), and the fragment order is I-CeuI-ABCDEFG. (B and C) Inversion. Recombination between rrnH and rrnG results in inversion of fragment A. (D to F) Translocation. Recombination between rrnC and rrnA deletes fragment D, which is reinserted by recombination with rrnE; this results in translocation to produce the fragment order ABCEFDG.
genome type and for each of the four orientations of fragments A and C. Fragment I-CeuI-A was frequently inverted, and I-CeuI-C was rarely inverted. Thus, strains of S. enterica serovar Typhi showed many different rearrangements of the gene segments between rrn operons (the I-CeuI fragments). Normally, we did not use all possible pairwise combinations of primers for testing each of the strains; we used only the primer pairs predicted by PFGE data to be effective. However, in a few cases we tested all possible combinations, and only those primer pairs predicted by the PFGE results were effective (data not shown). Changes in I-CeuI fragment lengths. PFGE data indicated that the lengths of I-CeuI fragments are highly conserved, for all seven sizes were indistinguishable from the sizes observed in strain Ty2 (38) for 86 of the 136 strains, as shown in Table 1. PFGE did not detect changes in fragment I-CeuI-C (517 kb), I-CeuI-D (134 kb), or I-CeuI-F (42 kb) in any of the strains (Table 2), although changes of a few kilobases should have
J. BACTERIOL.
been detectable by PFGE. Our methods could not detect changes in the large I-CeuI-A fragment (2,422 kb). The 57 strains with detectable changes included many types (Table 2). The following numbers of insertions were detected: 13 strains had 20- to 90-kb insertions in I-CeuI-B; 25 strains had 20- to 40-kb insertions in I-CeuI-E; and six strains had 15- to 80-kb insertions in I-CeuI-G. The only strains with a fragment with deletions were 12 strains with deletions in I-CeuI-G. These deletions were shown previously to result from a loss of all or part of Salmonella pathogenicity island 7, a 134-kb island which has the viaB operon for Vi exopolysaccharide, due to recombination between genes for tRNAPhe. All these strains are untypeable by the phages used for Vi typing and are not agglutinated by Vi antiserum (7, 47). Thus, deletions were very rare. Insertions were more common, including the insertions in 25 strains with a 40-kb insertion in I-CeuI-E and 19 other insertions of various sizes in I-CeuI-B, -E, and -G. However, small changes in fragment sizes (1 to 2 kb in small fragments and about 10 to 20 kb in larger fragments), which could be detected by sequencing, were not observed by PFGE. Determination of phage type by using Vi phage and of flagellar antigens. The phage types of most of the wild-type strains are shown in Table 1. The data show that many different phage types are associated with specific genome types (for example, the 59 genome type 3 strains have many different phage types). This indicates that the phage type and the genome type are largely independent of each other. In a few cases, the frequency of occurrence of a specific genome type in a phage type is greater than that expected by chance. For example, of 19 phage type E1 strains, 7 are genome type 2, although genome type 2 is found in only 20 of 136 strains; this may represent isolation of similar strains from a clonal population. Flagellar antigens were determined for almost all of the strains; these antigens were usually the d antigen, although a few strains had the alternative j antigen. DISCUSSION Serovars of Salmonella which are pathogens for a wide range of hosts (generalists, such as S. enterica serovar Typhimurium) have very conserved genomes, while serovars which have very limited host ranges (specialists), such as S. enterica serovar Typhi, S. enterica serovar Paratyphi C, S. enterica serovar Gallinarum, and S. enterica serovar Pullorum, show a high frequency of rearrangements among wild-type strains due to homologous recombination between rrn operons (40). Helm et al. (19) showed in laboratory experiments that the frequency of recombination between rrn operons is not distinguishable in strains of S. enterica serovar Typhi and S. enterica serovar Typhimurium. This result suggests that differences in selective value rather than differences in rearrangement frequency are likely to be responsible for the higher frequency of rearrangements found in S. enterica serovar Typhi than in S. enterica serovar Typhimurium. Thus, greater survival of a recombinant might be due to the different lifestyles of generalist and host specialist species. The following four classes of chromosome rearrangements might be formed due to recombination between rrn operons: deletions, duplications, translocations, and inversions (Fig. 3) (40) (56). Deletions of entire I-CeuI fragments would be
VOL. 187, 2005
GENOME STRUCTURE IN S. ENTERICA SEROVAR TYPHI
2645
FIG. 4. Order and orientation of I-CeuI fragments in 136 independent wild-type strains of S. enterica serovar Typhi. The sizes (in kilobases) of the fragments based on the sizes in CT18 (50) are indicated at the top, shown approximately to scale. The order of I-CeuI fragments B to G was determined by PFGE (Fig. 2) and was confirmed by PCR (Fig. 5). The I-CeuI-A fragment (2,422 kb) is inferred to join the left end to the right end of a fragment to form a circle. The orientation of I-CeuI fragments B, D, E, F, and G was inferred from the polarity of the rrn genes and was confirmed by PCR. The order and sizes of fragments for E. coli K-12 and S. enterica serovar Typhimurium LT2 (STM LT2) and the orientation of rrn operons are indicated at the bottom. The chromosomes of the different genome types are shown in the A⫹C⫹ orientation (with both the A and C fragments uninverted); the open square in fragment A indicates pro (proline utilization), and the open triangle indicates his (histidine requirement). Since both I-CeuI-C and I-CeuI-A are flanked by inverted rrn operons, these fragments could be inverted. The number of strains of each genome type that fall into each of the four sets of orientation of A and C fragments was determined from the PCR data (see Fig. 5). The dot in the I-CeuI-C fragment indicates the location of oriC; T indicates the terminus. The sizes of the fragments (in kilobases) were calculated from previously published sequences of S. enterica serovar Typhimurium LT2 (45) (GenBank accession no. NC_003197) and E. coli K-12 (5) (GenBank accession no. NC_000913); these fragments are shown at the bottom.
readily detectable but were not observed, which is not surprising since all the fragments have essential genes. Deletions of a 9-kb segment between two rrn operons were detected in Bacillus subtilis (27), indicating that this short segment has no essential genes. Duplications would be detected by doubled
intensity of the duplicated I-CeuI fragments and in partial digestion data, but these were not seen in these strains. Roth et al. (56) showed that 3% of the cells in cultures of S. enterica serovar Typhimurium LT2 had duplications of the segment between the closest rrn operons (the I-CeuI-F fragment) and
2646
KOTHAPALLI ET AL.
J. BACTERIOL.
FIG. 5. PCR analysis of the rrn skeleton of S. enterica serovar Typhi genome type 3 strains. The primer pairs are indicated above the gel showing the PCR products. Lane M contained the marker (HindIII-digested lambda). The inferred rrn skeleton is shown on the left. (Upper set) Template DNA of strain 26.047 (genome type 3, A⫹C⫹). (Lower set) Template DNA of strain 425Ty (genome type 3, A⫺C⫹).
found that these duplications are unstable since they revert to the haploid state; our failure to find strains with duplications confirms that duplications of this type are unstable. Inversions and translocations occur frequently, for not one of the 136 strains tested had genome type 1 A⫹C⫹ (Fig. 4) (the genome order normally found in Salmonella and E. coli); even the four genome type 1 strains were A⫺C⫹. Thus, all strains tested had at least one translocation or inversion com-
pared with the standard type. Two separate mechanisms could explain translocation. First, there could be deletion of a segment due to recombination between two rrn operons in the same replichore, thus forming a circular fragment, followed by reinsertion of the circle into another rrn operon (Fig. 3D and E). Hill and colleagues observed circles of a size appropriate for the interval from rrnB to rrnE (about 42 kb) (24); this corresponds to the 42-kb I-CeuI-F fragment. Second, translo-
TABLE 2. Changes in lengths of I-CeuI fragments in 136 strains of S. enterica serovar Typhi, measured by PFGE I-CeuI fragment
Fragment size (kb)a
A B C D E F G
2,422 706 517 134 149 42 839
a
Deletion size (kb) (no. of strains)b
Insertion size (kb) (no. of strains)b
20 (4), 80 (7), 90 (2) 30 (1), 40 (25) 70 (1), 134 (10), 170 (1)
15 (1), 20 (2), 50 (1), 80 (2)
No. of fragments altered
NDc 13 0 0 26 0 18
Normal fragment sizes were determined from the sequence of strain CT18 (50); these sizes are very similar to those of strain Ty2 (12). The sizes are slightly corrected from those determined previously by PFGE (38). b Deletions and insertions represent decreases or increases in the size of a fragment relative to the normal size; the number of strains with each size class is indicated in parentheses. The specific strains are shown in Table 1. c ND, not determined, because an accurate size of the I-CeuI-A fragments could not be determined by PFGE.
VOL. 187, 2005
GENOME STRUCTURE IN S. ENTERICA SEROVAR TYPHI
cations could be formed by duplications (e.g., to form DEFEF) followed by two independent deletions of fragments E and F (to form DFE). Genome types 1 to 6 are postulated to result from translocations of the small fragments I-CeuI-D, -E, and -F to form all six combinations. Genome types 13 to 16 involve translocation of I-CeuI-F into rrnD to the left of I-CeuI-C. Inversions are also commonly detected; e.g., genome types 7 to 12 are due to an inversion resulting from recombination between rrnD and rrnE. In addition, these strains have the translocations found in genome types 1 to 6; i.e., genome types 1 and 7 and genome types 2 and 8 have the same translocation, etc. Several hypotheses were devised by Roth et al. (56) to explain the highly conserved genomes usually found in enteric bacteria. At first glance, it seems that rearrangements in S. enterica serovar Typhi have resulted in total reshuffling of the genome. However, in spite of the many genome rearrangements that we have detected in S. enterica serovar Typhi (Fig. 4), there is still considerable conservation (although not as much as in S. enterica serovar Typhimurium); our data support the gene balance and gene dosage hypotheses for genome conservation in S. enterica serovar Typhi. Genome balance. Lengths of the replichores between the oriC and Ter sites on a circular bacterial chromosome must be maintained for balanced bidirectional replication (23). Hill and Gray (22) showed that moving oriC relative to Ter reduced the growth rate of E. coli K-12. S. enterica serovar Typhi CT18 fragment sizes determined from the sequence (50) were used as the standards to calculate the genome balance for all wildtype S. enterica serovar Typhi strains (the sizes calculated from the sequence of strain Ty2 [12] are very similar). The positions of the origin of replication (oriC) and the termination of replication were determined from the E. coli K-12 oriC sequence (46) (GenBank accession no. K01789) and the position of the dif (deletion-induced filamentation) sequence of E. coli (31) (GenBank accession no. S62735), respectively; locations on the chromosome of S. enterica serovar Typhi CT18 were detected with BlastN (2). Rearrangements only in fragments D to F, which result in genome types 1 to 6 (Fig. 4), do not change the genome balance because they are all in the same replichore. When these genome types are A⫹C⫹, the size of replichore 1, from oriC in I-CeuI-C through fragments E, F, D, G, and A to Ter, is 2,362 kb; the size of replichore 2, from fragment C through fragments B and A to Ter, is 2,447 kb. Genome balance was calculated by dividing the size of each replichore by the total genome size; the off-balance value, which was one-half the difference between the replichore sizes, was 42.5 kb (Fig. 6 and Table 3). The genome balance was calculated for genome types 1 to 6 for the four different orientations in fragments A and C, which changed the lengths of the replichores, and the numbers of strains of each type were summarized from Fig. 4, as shown in Table 3. Of the 100 strains with genome types 1 to 6, 28 were A⫹C⫹ (42.5 kb off-balance), 72 were A⫺C⫹ (25.5 kb offbalance), and none were A⫹C⫺ or A⫺C⫺ (over 400 kb offbalance). Thus, the fragment orientations which gave the best chromosome balance were the most frequently detected orientations; among independent wild-type strains, strains with inversions of I-CeuI-C, which would have been highly unbalanced, were not detected. It has been noticed previously by
2647
Eisen et al. (14) that chromosomal inversions around the origin and termination of replication are usually symmetrical, thus retaining chromosome balance, even when comparisons are done between groups as widely separated as E. coli and Vibrio cholerae. The remaining 36 strains were also grouped into sets, such as genome types 7 to 12, etc., and were analyzed to determine chromosome balance and frequency; these strains are shown in Table S2 in the supplemental material. Most of the genome types occur in small numbers; many of them are off-balance by 100 to 300 kb. The most striking are genome types 25 and 27 (731.5 kb off-balance), represented by two strains, and one strain of genome type 26 (582.5 kb off-balance). It was surprising to find such a high degree of genomic rearrangement in S. enterica serovar Typhi (and in S. enterica serovar Paratyphi C [21], which causes paratyphoid fever) since the genomes of most enteric bacteria are highly conserved. We propose that the insertion of a large block of foreign DNA into these two organisms (134 kb in Salmonella pathogenicity island 7 containing the viaB genes inserted into I-CeuI-G) (38) may have resulted in chromosome imbalance which triggered a series of rearrangements (32). Gene dosage. Due to bidirectional replication, there are extra copies of genes close to oriC, resulting in increased gene expression (57), and since dosage differences may cause the strengths of promoters to be evolutionarily optimized for their specific positions, cells in which genes are a different distance from oriC are at a selective disadvantage (56). Although rearrangements were seen in all 136 strains, some classes of rearrangements were rare or never detected. Three small fragments, I-CeuI-D, -E and -F, are frequently translocated from the normal order DEF into all possible orders in genome types 1 to 6 and also into new sites on both sides of I-CeuI-C in genome types 12 to 24. However, there is not a single case of translocation of any of these three fragments into rrnH between I-CeuI-A and I-CeuI-G, although this translocation within the same replichore would not change the chromosome balance (Fig. 4). We postulate that translocations involving rrn operons occur at random at these positions, as well as at other locations, but that the cells are at a selective disadvantage because, according to the gene dosage hypothesis, such translocations move the genes far from oriC such that the copy number and thus the rate of gene expression are less adaptive; thus, rearrangements of these types occur, but strains with these genome types do not survive in nature. Gene position might also be conserved because promoters are tuned to the degree of local supercoiling of the DNA, so that rearrangements would be disadvantageous (55). This may partially explain the conservation that we detected. The direction of transcription and replication is normally the same in highly expressed genes, such as those for rRNA, and this may be an additional basis for conservation of the genome (55, 56). It must be emphasized that in the many inversions and translocations observed in strains of S. enterica serovar Typhi, including those between replichores, I-CeuI fragments retain the same orientation with respect to replication, because homologous recombination between rRNA operons, all of which have transcription oriented from oriC to Ter, enforces this. Genomic rearrangements other than those involving rrn operons are rare in S. enterica serovar Typhi. This was revealed
2648
KOTHAPALLI ET AL.
J. BACTERIOL.
FIG. 6. Analysis of genome balance. The chromosome structure of genome type 3 A⫹C⫹ illustrates the genome balance. Since the chromosome is bidirectionally replicated from oriC, two replichores are shown. The dot in fragment C represents oriC. (A) Linear form. The cross-hatched line represents calculation of the total fragment sizes to determine the length of replichore 1 and replichore 2. (B) Circular form. Rep1, replichore 1; Rep2, replichore 2.
addition, unstable duplications with IS200 endpoints were detected in S. enterica serovar Typhimurium (18). However, there is no evidence that rearrangements involving IS200 or other duplicated regions occurred among the 136 strains in this study, for such events should cause simultaneous changes in the lengths of two I-CeuI fragments if their ends flank an rrn operon. The genomes of most strains of E. coli and Shigella appear to be stable, like the genomes of most strains of Salmonella, but studies with I-CeuI digestion indicated that rearrangements occur frequently in Shigella dysenteriae and Shigella flexneri strains; originally, this was postulated to be due to rrn-mediated rearrangements (60). However, the complete genome se-
by the fact that the lengths of the I-CeuI fragments in wild-type strains seldom varied, except due to rare insertions or deletions in individual fragments (Table 3), which indicates that the vast majority of genome rearrangements which occur are due to recombination between rrn operons. Recombination might occur between the 25 IS200 elements of S. enterica serovar Typhi CT18 or between the six IS200 elements in S. enterica serovar Typhimurium LT2, for IS200 is 700 bp long and should be a good target for homologous recombination. Recombination between IS200 elements can occur; PCR methods detected an inversion in S. enterica serovar Typhi between two IS200 elements (1) (these were in the same I-CeuI fragment and thus were undetectable by PFGE methods in the present study); in
TABLE 3. Genome balance analysis of S. enterica serovar Typhi strains belonging to genome types 1 to 6a A⫹C⫹b
A⫺C⫹
A⫹C⫺
A⫺C⫺
Parameter
Genome balancec Amt off-balance (kb)d No. of isolatese a
Replichore 1
Replichore 2
Replichore 1
Replichore 2
Replichore 1
Replichore 2
Replichore 1
Replichore 2
0.491 ⫺42.5 28
0.508 42.5
0.505 25.5 72
0.495 ⫺25.5
0.589 432.5 0
0.410 ⫺432.5
0.604 500.5 0
0.395 ⫺500.5
Genome types 1 to 6 have the fragment order I-CeuI-ABC(DEF)G; DEF can be in any order. The orientation of fragments A and C as shown in Fig. 4. Genome balance was determined from fragment sizes (Fig. 6). d See the Discussion for the method used to calculate the amount off-balance. e The number of strains in each type of orientation, as shown in Fig. 4. b c
GENOME STRUCTURE IN S. ENTERICA SEROVAR TYPHI
VOL. 187, 2005
quences show that although both S. flexneri 2a strain 301 (28) and strain 2457T (61) have large symmetrical chromosomal inversions spanning the replication origin and terminus, most of these rearrangements are due to recombination between insertion sequences, as was also seen in two strains of Yersinia pestis (11). This is quite unlike S. enterica serovar Typhi, in which recombination occurred between rrn operons. The 136 strains of S. enterica serovar Typhi belong to many different phage types (Table 1). Phage types are very stable and thus have been used a great deal in bacterial typing (49). Genome types are relatively stable but show occasional changes. For example, during 10 years, as we have worked with isolates of S. enterica serovar Typhi strain Ty2 (genome type 9 A⫺C⫹) (38), we have detected only rare changes in genome type (less than 1 per 100 single-colony isolates). Hughes (26) summarized the frequency of genomic rearrangements for members of many different genera and within the same species and genus. He noted that rearrangements occurred most frequently in clinical isolates of pathogens of humans and animals. He concluded that orderly and efficient replication of the genome and global regulation of gene expression are both critically important; our data support these conclusions, for we show the importance of genome balance (for genome replication) and gene dosage (for gene expression). ACKNOWLEDGMENTS The work reported here was supported by grant RO1AI34829 from the National Institute of Allergy and Infectious Diseases and by a discovery grant from the Natural Sciences and Engineering Research Council to K.E.S. and by a discovery grant from the Natural Sciences and Engineering Research Council to S.L.L. We thank Barney Truong and Martin Papez for assistance with the experiments. REFERENCES 1. Alokam, S., S.-L. Liu, K. Said, and K. E. Sanderson. 2002. Inversions over the terminus region in Salmonella and Escherichia coli: IS200s as the sites of homologous recombination inverting the chromosome of Salmonella enterica serovar Typhi. J. Bacteriol. 184:6190–6197. 2. Altschul, S. F., W. Gish, W. Miller, E. W. Myers, and D. J. Lipman. 1990. Basic local alignment search tool. J. Mol. Biol. 215:403–410. 3. Anderson, E. S., and R. E. Williams. 1956. Bacteriophage typing of enteric pathogens and staphylococci and its use in epidemiology. J. Clin. Pathol. 9:94–127. 4. Anderson, R. P., and J. R. Roth. 1979. Gene duplication in bacteria: alteration of gene dosage by sister chromosome exchanges. Cold Spring Harbor Symp. Quant. Biol. 43:1083–1087. 5. Blattner, F. R., G. Plunkett 3rd, C. A. Bloch, N. T. Perna, V. Burland, M. Riley, J. Collado-Vides, J. D. Glasner, C. K. Rode, G. F. Mayhew, J. Gregor, N. W. Davis, H. A. Kirkpatrick, M. A. Goeden, D. J. Rose, B. Mau, and Y. Shao. 1997. The complete genome sequence of Escherichia coli K-12. Science 277:1453–1474. 6. Boyd, E. F., F. S. Wang, P. Beltran, S. A. Plock, K. Nelson, and R. K. Selander. 1993. Salmonella reference collection B (SARB): strains of 37 serovars of subspecies I. J. Gen. Microbiol. 139:1125–1132. 7. Bueno, S. M., C. A. Santiviago, A. A. Murillo, J. A. Fuentes, A. N. Trombert, P. Youderian, and G. C. Mora. 2004. Excision of the large pathogenicity island of Salmonella enterica serovar Typhi. J. Bacteriol. 186:3202–3213. 8. Chee, C. S., N. Noordin, and L. Ibrahim. 1992. Epidemiology and control of typhoid in Malaysia, p. 3–10. In T. Pang, C. L. Koh, and C. L. Puthucheary (ed.), Typhoid fever: strategy for the 90’s. World Scientific Publishing, Singapore. 9. Craigie, J., and A. Felix. 1947. Typing of typhoid bacilli with Vi bacteriophage: suggestions for its standardization. Lancet i:823–827. 10. Crump, J. A., S. P. Luby, and E. D. Mintz. 2004. The global burden of typhoid fever. Bull. W. H. O. 82:346–353. 11. Deng, W., V. Burland, G. Plunkett 3rd, A. Boutin, G. F. Mayhew, P. Liss, N. T. Perna, D. J. Rose, B. Mau, S. Zhou, D. C. Schwartz, J. D. Fetherston, L. E. Lindler, R. R. Brubaker, G. V. Plano, S. C. Straley, K. A. McDonough, M. L. Nilles, J. S. Matson, F. R. Blattner, and R. D. Perry. 2002. Genome sequence of Yersinia pestis KIM. J. Bacteriol. 184:4601–4611.
2649
12. Deng, W., S. R. Liou, G. Plunkett 3rd, G. F. Mayhew, D. J. Rose, V. Burland, V. Kodoyianni, D. C. Schwartz, and F. R. Blattner. 2003. Comparative genomics of Salmonella enterica serovar Typhi strains Ty2 and CT18. J. Bacteriol. 185:2330–2337. 13. Edelman, R., and M. M. Levine. 1986. Summary of an international workshop on typhoid fever. Rev. Infect. Dis. 8:329–349. 14. Eisen, J. A., J. F. Heidelberg, O. White, and S. L. Salszberg. 2000. Evidence for symmetric chromosomal inversions around the replication origin in bacteria. Genome Biol. 1:0011.1–0011.9. 15. Faundez, G., L. Aron, and F. C. Cabello. 1990. Chromosomal DNA, irontransport systems, outer membrane proteins, and enterotoxin (heat labile) production in Salmonella typhi strains. J. Clin. Microbiol. 28:894–897. 16. Fonstein, M., and R. Haselkorn. 1995. Physical mapping of bacterial genomes. J. Bacteriol. 177:3361–3369. 17. Franco, A., C. Gonzalez, O. S. Levine, R. Lagos, R. H. Hall, S. L. Hoffman, M. A. Moechtar, E. Gotuzzo, M. M. Levine, and D. M. Hone. 1992. Further consideration of the clonal nature of Salmonella typhi: evaluation of molecular and clinical characteristics of strains from Indonesia and Peru. J. Clin. Microbiol. 30:2187–2190. 18. Haack, K. R., and J. R. Roth. 1995. Recombination between chromosomal IS200 elements supports frequent duplication formation in Salmonella typhimurium. Genetics 141:1245–1252. 19. Helm, R. A., A. G. Lee, H. D. Christman, and S. Maloy. 2003. Genomic rearrangements at rrn operons in Salmonella. Genetics 165:951–959. 20. Helm, R. A., and S. Maloy. 2001. Rapid approach to determine rrn arrangement in Salmonella serovars. Appl. Environ. Microbiol. 67:3295–3298. 21. Hessel, A. 1995. Genomic map of Salmonella paratyphi C. M.Sc. thesis. University of Calgary, Calgary, Canada. 22. Hill, C. W., and J. A. Gray. 1988. Effects of chromosomal inversion on cell fitness in Escherichia coli K-12. Genetics 119:771–778. 23. Hill, C. W., and B. W. Harnish. 1981. Inversions between ribosomal RNA genes of Escherichia coli. Proc. Natl. Acad. Sci. USA 78:7069–7072. 24. Hill, C. W., S. Harvey, and J. A. Gray. 1990. Recombination between rRNA genes in Escherichia coli and Salmonella typhimurium, p. 335–340. In K. Drlica and M. Riley (ed.), The bacterial chromosome. ASM Press, Washington, D.C. 25. Hone, D. M., A. M. Harris, S. Chatfield, G. Dougan, and M. M. Levine. 1991. Construction of genetically defined double aro mutants of Salmonella typhi. Vaccine 9:810–816. 26. Hughes, D. 2000. Evaluating genome dynamics: the constraints on rearrangements within bacterial genomes. Genome Biol. 1:reviews0006.1–0006.8. 27. Itaya, M. 1993. Stability and asymmetric replication of the Bacillus subtilis 168 chromosome structure. J. Bacteriol. 175:741–749. 28. Jin, Q., Z. Yuan, J. Xu, Y. Wang, Y. Shen, W. Lu, J. Wang, H. Liu, J. Yang, F. Yang, X. Zhang, J. Zhang, G. Yang, H. Wu, D. Qu, J. Dong, L. Sun, Y. Xue, A. Zhao, Y. Gao, J. Zhu, B. Kan, K. Ding, S. Chen, H. Cheng, Z. Yao, B. He, R. Chen, D. Ma, B. Qiang, Y. Wen, Y. Hou, and J. Yu. 2002. Genome sequence of Shigella flexneri 2a: insights into pathogenicity through comparison with genomes of Escherichia coli K12 and O157. Nucleic Acids Res. 30:4432–4441. 29. Khakhria, R., D. Woodward, W. M. Johnson, and C. Poppe. 1997. Salmonella isolated from humans, animals and other sources in Canada, 1983–92. Epidemiol. Infect. 119:15–23. 30. Kidgell, C., U. Reichard, J. Wain, B. Linz, M. Torpdahl, G. Dougan, and M. Achtman. 2002. Salmonella typhi, the causative agent of typhoid fever, is approximately 50,000 years old. Infect. Genet. Evol. 2:39–45. 31. Kuempel, P. L., J. M. Henson, L. Dircks, M. Tecklenburg, and D. F. Lim. 1991. dif, a recA-independent recombination site in the terminus region of the chromosome of Escherichia coli. New Biol. 3:799–811. 32. Liu, G.-R., A. Rahn, W.-Q. Liu, K. E. Sanderson, R. N. Johnston, and S.-L. Liu. 2002. The evolving genome of Salmonella enterica serovar Pullorum. J. Bacteriol. 184:2626–2633. 33. Liu, S.-L., A. Hessel, H. Y. Cheng, and K. E. Sanderson. 1994. The XbaIBlnI-CeuI genomic cleavage map of Salmonella paratyphi B. J. Bacteriol. 176:1014–1024. 34. Liu, S.-L., A. Hessel, and K. E. Sanderson. 1993. Genomic mapping with I-Ceu I, an intron-encoded endonuclease specific for genes for ribosomal RNA, in Salmonella spp., Escherichia coli, and other bacteria. Proc. Natl. Acad. Sci. USA 90:6874–6878. 35. Liu, S.-L., A. Hessel, and K. E. Sanderson. 1993. The XbaI-BlnI-CeuI genomic cleavage map of Salmonella typhimurium LT2 determined by double digestion, end labeling, and pulsed-field gel electrophoresis. J. Bacteriol. 175:4104–4120. 36. Liu, S.-L., A. Hessel, and K. E. Sanderson. 1993. The XbaI-BlnI-CeuI genomic cleavage map of Salmonella enteritidis shows an inversion relative to Salmonella typhimurium LT2. Mol. Microbiol. 10:655–664. 37. Liu, S.-L., and K. E. Sanderson. 1992. A physical map of the Salmonella typhimurium LT2 genome made by using XbaI analysis. J. Bacteriol. 174: 1662–1672. 38. Liu, S.-L., and K. E. Sanderson. 1995. Genomic cleavage map of Salmonella typhi Ty2. J. Bacteriol. 177:5099–5107. 39. Liu, S.-L., and K. E. Sanderson. 1995. I-CeuI reveals conservation of the
2650
40. 41. 42.
43. 44. 45.
46.
47.
48. 49.
50.
KOTHAPALLI ET AL.
genome of independent strains of Salmonella typhimurium. J. Bacteriol. 177:3355–3357. Liu, S.-L., and K. E. Sanderson. 1996. Highly plastic chromosomal organization in Salmonella typhi. Proc. Natl. Acad. Sci. USA 93:10303–10308. Liu, S.-L., and K. E. Sanderson. 1998. Homologous recombination between rrn operons rearranges the chromosome in host-specialized species of Salmonella. FEMS Microbiol. Lett. 164:275–281. Maher, K. O., J. G. Morris, Jr., E. Gotuzzo, C. Ferreccio, L. R. Ward, L. Benavente, R. E. Black, B. Rowe, and M. M. Levine. 1986. Molecular techniques in the study of Salmonella typhi in epidemiologic studies in endemic areas: comparison with Vi phage typing. Am. J. Trop. Med. Hyg. 35:831–835. Marshall, P., T. B. Davis, and C. Lemieux. 1994. The I-CeuI endonuclease: purification and potential role in the evolution of Chlamydomonas group I introns. Eur. J. Biochem. 220:855–859. Marshall, P., and C. Lemieux. 1991. Cleavage pattern of the homing endonuclease encoded by the fifth intron in the chloroplast large subunit rRNAencoding gene of Chlamydomonas eugametos. Gene 104:241–245. McClelland, M., K. E. Sanderson, J. Spieth, S. W. Clifton, P. Latreille, L. Courtney, S. Porwollik, J. Ali, M. Dante, F. Du, S. Hou, D. Layman, S. Leonard, C. Nguyen, K. Scott, A. Holmes, N. Grewal, E. Mulvaney, E. Ryan, H. Sun, L. Florea, W. Miller, T. Stoneking, M. Nhan, R. Waterston, and R. K. Wilson. 2001. Complete genome sequence of Salmonella enterica serovar Typhimurium LT2. Nature 413:852–856. Messer, W., M. Meijer, H. E. Bergmans, F. G. Hansen, K. von Meyenburg, E. Beck, and H. Schaller. 1979. Origin of replication, oriC, of the Escherichia coli K12 chromosome: nucleotide sequence. Cold Spring Harbor Symp. Quant. Biol. 43:139–145. Nair, S., S. Alokam, S. Kothapalli, S. Porwollik, E. Proctor, C. Choy, M. McClelland, S. L. Liu, and K. E. Sanderson. 2004. Salmonella enterica serovar Typhi strains from which SPI7, a 134-kilobase island with genes for Vi exopolysaccharide and other functions, has been deleted. J. Bacteriol. 186:3214–3223. Ochman, H., and A. C. Wilson. 1987. Evolutionary history of enteric bacteria, p. 1649–1654. In F. C. Neidhardt (ed.), Escherichia coli and Salmonella: cellular and molecular biology. ASM Press, Washington, D.C. Orskov, F., and I. Orskov. 1983. From the National Institutes of Health. Summary of a workshop on the clone concept in the epidemiology, taxonomy, and evolution of the Enterobacteriaceae and other bacteria. J. Infect. Dis. 148:346–357. Parkhill, J., G. Dougan, K. D. James, N. R. Thomson, D. Pickard, J. Wain, C. Churcher, K. L. Mungall, S. D. Bentley, M. T. Holden, M. Sebaihia, S. Baker, D. Basham, K. Brooks, T. Chillingworth, P. Connerton, A. Cronin, P. Davis, R. M. Davies, L. Dowd, N. White, J. Farrar, T. Feltwell, N. Hamlin, A. Haque, T. T. Hien, S. Holroyd, K. Jagels, A. Krogh, T. S. Larsen, S. Leather, S. Moule, P. O’Gaora, C. Parry, M. Quail, K. Rutherford, M. Simmonds, J. Skelton, K. Stevens, S. Whitehead, and B. G. Barrell. 2001.
J. BACTERIOL.
51. 52. 53. 54.
55. 56.
57. 58.
59.
60.
61.
62.
Complete genome sequence of a multiple drug resistant Salmonella enterica serovar Typhi CT18. Nature 413:848–852. Parry, C. M., T. T. Hien, G. Dougan, N. J. White, and J. J. Farrar. 2002. Typhoid fever. N. Engl. J. Med. 347:1770–1782. Popoff, M. Y., J. Bockemuhl, and L. L. Gheesling. 2003. Supplement 2001 (no. 45) to the Kauffmann-White scheme. Res. Microbiol. 154:173–174. Popoff, M. Y., and L. LeMinor. 1997. Antigenic formulas of the Salmonella serovars. W.H.O. Collaborating Center for Reference and Research on Salmonella, Institut Pasteur, Paris, France. Reeves, M. W., G. M. Evins, A. A. Heiba, B. D. Plikaytis, and J. J. Farmer 3rd. 1989. Clonal nature of Salmonella typhi and its genetic relatedness to other salmonellae as shown by multilocus enzyme electrophoresis, and proposal of Salmonella bongori comb. nov. J. Clin. Microbiol. 27:313–320. Roth, J. R. 2005. Where’s the beef? Looking for information in bacterial chromosomes, p. 3–18. In N. P. Higgins (ed.), The bacterial chromosome. ASM Press, Washington, D.C. Roth, J. R., N. Benson, T. Galitski, K. Haack, J. G. Lawrence, and L. Miesel. 1996. Rearrangements of the bacterial chromosome: formation and applications, p. 2256–2276. In F. C. Neidhardt, R. Curtiss III, J. L. Ingraham, E. C. C. Lin, K. B. Low, B. Magasanik, W. S. Reznikoff, M. Riley, M. Schaechter, and H. E. Umbarger (ed.), Escherichia coli and Salmonella: cellular and molecular biology. ASM Press, Washington, D.C. Schmid, M. B., and J. R. Roth. 1987. Gene location affects expression level in Salmonella typhimurium. J. Bacteriol. 169:2872–2875. Selander, R. K., P. Beltran, N. H. Smith, R. Helmuth, F. A. Rubin, D. J. Kopecko, K. Ferris, B. D. Tall, A. Cravioto, and J. M. Musser. 1990. Evolutionary genetic relationships of clones of Salmonella serovars that cause human typhoid and other enteric fevers. Infect. Immun. 58:2262–2275. Shimkets, L. J. 1997. Structure and sizes of genomes of the archaea and bacteria, p. 5–11. In F. J. de Bruijn, J. R. Lupski, and G. M. Weinstock (ed.), Bacterial genomes: physical structure and analysis. International Thomson Publishing, Florence, Ky. Shu, S., E. Setianingrum, L. Zhao, Z. Li, H. Xu, Y. Kawamura, and T. Ezaki. 2000. I-CeuI fragment analysis of the Shigella species: evidence for largescale chromosome rearrangement in S. dysenteriae and S. flexneri. FEMS Lett. 182:93–98. Wei, J., M. B. Goldberg, V. Burland, M. M. Venkatesan, W. Deng, G. Fournier, G. F. Mayhew, G. Plunkett 3rd, D. J. Rose, A. Darling, B. Mau, N. T. Perna, S. M. Payne, L. J. Runyen-Janecky, S. Zhou, D. C. Schwartz, and F. R. Blattner. 2003. Complete genome sequence and comparative genomics of Shigella flexneri serotype 2a strain 2457T. Infect Immun. 71: 2775–2786. Weinstock, G. M., and J. R. Lupski. 1997. Chromosomal rearrangements, p. 112–118. In F. J. de Bruijn, J. R. Lupski, and G. M. Weinstock (ed.), Bacterial genomes: physical structure and analysis. International Thomson Publishing, Florence, Ky.