1 2
Novel giant siphovirus from Bacillus anthracis features unusual
3
genome characteristics and relatedness to Spounavirinae
4 5
Holly H. Ganz1#, Christina Law1, Martina Schmuki2, Martin J. Loessner2, Richard Calendar3, Wayne M.
6
Getz1,4, Jonas Korlach5, and Jochen Klumpp2*
7 1
8
University of California, Berkeley, Department of Environmental Science, Policy & Management, Berkeley, CA, USA
9 2
10 11 12
3
Institute of Food, Nutrition and Health, ETH Zurich, Switzerland
University of California, Berkeley, Department of Molecular and Cellular Biology, Berkeley, CA, USA 4
School of Mathematical Sciences, University of KwaZulu-Natal, Durban, South Africa 5
Pacific Biosciences, Menlo Park, CA, USA
13 14 15
Running title: Giant siphovirus specific for Bacillus ACT group
16 17
Key words: Siphovirus, bacteriophage, Bacillus anthracis, genome, PacBio, SMRT sequencing
18 19
* Corresponding author:
20
Jochen Klumpp: Institute of Food, Nutrition and Health, ETH Zurich, Schmelzbergstrasse 7, 8092
21
Zurich, Switzerland. Phone: +41-44-6325378; Fax: +41-44-6321266; email:
[email protected]
22 23
# Present address:
24
Holly Ganz: University of California, Davis, School of Veterinary Medicine, Davis, CA 95616 USA
25
1
26
Abstract
27
We present vB_BanS-Etosha, a novel temperate phage isolated from Bacillus
28
anthracis. Etosha is a giant siphovirus, featuring a long, flexible and non-contractile
29
tail of 440 nm and an isometric head of 82 nm. We induced Etosha phage from two
30
different isolates of a B. anthracis genotype responsible for many anthrax outbreaks
31
occurring in wildlife in Etosha National Park, Namibia. The phage genome is the
32
largest sequenced Bacillus siphovirus, containing 168,876 bp and 269 ORFs.
2
33
We present a novel virus species isolated from Bacillus anthracis, the agent
34
responsible for anthrax infections in wildlife, livestock and humans [1]. Along with
35
Bacillus cereus and Bacillus thuringiensis, B. anthracis is a member of the Bacillus
36
ACT group [2]. Genomic studies have identified a number of putative prophages in
37
the Bacillus ACT group (e.g. [3, 4]). Lysogeny occurs commonly in B. anthracis [5, 6]
38
and may play an essential role in its life cycle [7].
39
We obtained isolates of the siphovirus from two zebra carcass sites in Etosha
40
National Park (ENP), Namibia, a 22,915 km2 wildlife reserve where anthrax infections
41
occur regularly (reviewed in [8]). We isolated bacteriophage from a B. anthracis
42
colony obtained from a swab from 2006 (GPS coordinates: -18.99736, 15.81584) and
43
from a soil sample collected in 2010 (GPS coordinates: -19.1731, 15.92603). Isolates
44
of B. anthracis from the two carcass sites belong to genotype 6 in the A3a group,
45
which has dominated outbreaks in ENP for more than 30 years [8]. We induced
46
phage from bacterial isolates by enrichment culture and mitomycin C induction [9,
47
10]. Both phages were morphologically similar and unusually large. It is intriguing that
48
such a large siphovirus is rarely isolated and yet appeared twice in spatially and
49
temporally separate samples.
50
Phage preparations were purified and concentrated using standard techniques
51
[10, 11]. Plaque assays were performed with a spore preparation of an avirulent B.
52
anthracis strain (6602 R1, [12]) by LB agar soft-agar overlay method [13]. Serial
53
dilutions of plaque-extracts were soft-agar plated with strain 6602 R1 [12]. Phage
54
lysates were PEG precipitated [14] and purified by cesium chloride density gradient
55
centrifugation [10].
56
We used the spot-on-the-lawn method [15] for host-range testing (Table 1).
57
The phage infects the Bacillus ACT group and is particularly adapted to B. anthracis. 3
58
Etosha lysed 25% (6/24) of B. cereus strains, 40% (2/5) of B. thuringiensis strains
59
and 86% (6/7) of B. anthracis strains tested. The phage did not lyse any other
60
bacteria tested (Table 1). Thus, Etosha is a narrow host-range, species-specific virus,
61
with potential suitability for biocontrol approaches of B. anthracis.
62
TEM images have been acquired from a preparation of pure phage particles
63
negatively stained with 2% uranyl acetate on carbon-coated copper grids (Quantifoil,
64
Jena, Germany) and observed using a Philips CM12 microscope at 120 kV
65
acceleration voltage with a Gatan Orius digital camera. Etosha exhibits typical
66
siphovirus morphology (order Caudovirales), and has a long, flexible and non-
67
contractile tail of 440 nm (not including baseplate structure) and an isometric head of
68
82 nm in diameter (Figure 1 A, B). Individual tail striations and a baseplate structure
69
with appendages are visible (Figure 1C). The head features visible capsomers
70
(Figure 1D) similar to those observed in other large bacteriophages [16]. To our
71
knowledge, Etosha is the largest sequenced siphovirus infecting Bacillus. Two larger
72
Bacillus siphoviruses are known but not characterized, B. mycoides phage N5 and B.
73
thuringiensis phage II, which are speculated to be identical (H.-W. Ackermann,
74
personal communication).
75
We sequenced DNA from the isolates using a single-molecule approach
76
(Pacific Biosciences RS) with 10 kb and 800 bp insert libraries (C2 chemistry) and
77
used one SMRT Cell for each library. We used the standard error-correction workflow
78
and SMRT portal software 1.3.1 for assembly of 36166 post-filter reads (2582 bp
79
average read length), resulting in one large contig with an average coverage of 550-
80
fold (Figure 2). Both phage isolates were identical. A repeat structure of 284 bp at the
81
genome end was identified during assembly and confirmed in restriction profiles
82
(Figure 3). Methylome analysis revealed no base modifications in the genome. The
83
genome sequence is 168,876 bp in length. Open reading frames were predicted by 4
84
RAST [17] and edited manually. Etosha features 272 open reading frames, 17 tRNA
85
and 2 pseudo-tRNA genes (Figure 2). The GC content is 34%, similar to published
86
genome sequences of B. anthracis. The genome sequence was deposited at
87
GenBank under accession number KC481682.
88
Like many siphoviruses, the genome is structured in functional modules. The
89
early gene cluster (genes for DNA replication, modification and repair, host takeover
90
and nucleotide metabolism) spans roughly 70% of the genome, indicating active
91
participation of virus-encoded genes in the metabolic processes associated with
92
replication in the host cell. It is notable that the Etosha genome encodes three
93
tyrosine integrase/recombinase enzymes of the Cre/XERD type (gp94, gp227,
94
gp255; [18, 19]) which exhibit no nucleotide homology to each other. Etosha features
95
a temperate lifestyle and these three enzymes may serve as means to integrate into
96
different attB sites and ensure a large host range for lysogeny. Further work will
97
elucidate the specificity and activity of the three recombinases. We also note the
98
presence of two Ig-domain containing proteins, gp233 and gp213 [19, 20], which may
99
play accessory roles during infection [21].
100
Interestingly, Etosha features distributed homologies in its structural proteins
101
to SPO1-related phages A511, A9, LP65 and SPO1 [16]. This finding is very unusual
102
because SPO1-related phages belong to an unrelated family of bacteriophages
103
(Myoviridae); and Etosha is a temperate phage, while SPO1-related phages are
104
strictly virulent. Etosha also displays individual capsomers thought to be a hallmark of
105
the SPO1-related phages (Figure 1) [16]. Virion proteins of Etosha were separated
106
on a 10-20% SDS gradient PAGE. Resulting bands were extracted and protein
107
content identified by mass spectrometry [22]. Six bands were allocated to gene
108
products. The tape measure protein is present in two protein bands of 280 and 100
109
kDa in size, presumably because of instability of the large protein. gp206 features an 5
110
estimated mass of 38 kDa and gp199 and 207 were identified in bands of 26.5 and
111
19 kDa, respectively. Etosha features an unusually long tail of 440 nm, which
112
corresponds with the large size of the tape measure protein (3123 aa) [23, 24] but the
113
protein is disproportionately large in comparison to other sequenced bacteriophages.
114
The large unknown gene 221 likely encodes for a tail fiber component, with the C-
115
terminus featuring significant homologies to Cellobiosidase, S-layer associated
116
endoglucanase or glycoside hydrolase domains.
117
In conclusion, we present vB_BanS-Etosha, a novel temperate phage
118
obtained from B. anthracis that is specific to the Bacillus ACT group. To our
119
knowledge Etosha is the largest sequenced siphovirus infecting Bacillus organisms.
120
Its giant head holds a genome of 168 kb. In addition to potential effects on activity
121
and environmental survival of its host, Etosha may have utility in detection and
122
control of B. anthracis.
123
6
124
Acknowledgments
125
We thank Margaret Smith for help with recombinase identification, Matthew Boitano
126
and Tyson A. Clark for genome assembly and initial analysis, and Martina Kusters,
127
Wendy C. Turner and Wilferd Versfeld for field support. Field sampling was
128
authorized by the Namibian Ministry of Environment and Tourism (MET) under permit
129
number 1448/2009 to HHG. We are grateful to the scientific staff at the Etosha
130
Ecological Institute (EEI) for facility resources and logistical support. The isolates
131
studied here were obtained from EEI carcass numbers EB060318-01WV and
132
EB100228-01MK. This research was funded by NIH Grant GM083863 to WMG. CL
133
was supported by the University of California, Berkeley, College of Natural
134
Resources-Biology Scholar’s Program, a Rosberg-Geist Fellowship from the Center
135
for African Studies and an Undergraduate Merit Scholarship from the Institute of
136
International Studies.
137
7
138
References
139 140 141 142 143 144 145 146 147 148 149 150 151 152 153 154 155 156 157 158 159 160 161 162 163 164 165 166 167 168 169 170 171 172 173 174 175 176 177 178 179
1. 2.
3.
4. 5. 6.
7.
8.
9. 10. 11.
12.
13. 14.
15.
Carter KC. 1988. The Koch-Pasteur dispute on establishing the cause of anthrax. Bull Hist Med. 62: 42-57. Okstad OA, Kolsto A-B. 2011. Genomics of Bacillus Species, p. 29-55. In Wiedmann M and Zhang W (ed), Genomics of Foodborne Pathogens. 2011, Springer, New York. Stromsten NJ, Benson SD, Burnett RM, Bamford DH, Bamford JK. 2003. The Bacillus thuringiensis linear double-stranded DNA phage Bam35, which is highly similar to the Bacillus cereus linear plasmid pBClin15, has a prophage state. J Bacteriol. 185: 6985-9. Rasko DA, Altherr MR, Han CS, Ravel J. 2005. Genomics of the Bacillus cereus group of organisms. FEMS Microbiol Rev. 29: 303-29. Buck CA, Anacker RL, Newman FS, Eisenstark A. 1963. Phage Isolated from Lysogenic Bacillus anthracis. J Bacteriol. 85: 1423-30. Saile E, Koehler TM. 2006. Bacillus anthracis multiplication, persistence, and genetic exchange in the rhizosphere of grass plants. Appl Environ Microbiol. 72: 3168-74. Schuch R, Fischetti VA. 2009. The secret life of the anthrax agent Bacillus anthracis: bacteriophage-mediated ecological adaptations. PLoS One. 4: e6532. Beyer W, Bellan S, Eberle G, Ganz HH, Getz WM, Haumacher R, Hilss KA, Kilian W, Lazak J, Turner WC, Turnbull PC. 2012. Distribution and molecular evolution of Bacillus anthracis genotypes in Namibia. PLoS Negl Trop Dis. 6: e1534. Van Twest R, Kropinski AM. 2009. Bacteriophage enrichment from water and soil. Methods Mol Biol. 501: 15-21. Sambrook J and Russell DW. 2001. Molecular Cloning - A Laboratory Manual. 3rd ed., Vol. 1-3. Cold Spring Harbor Laboratory Press, New York. Klumpp J, Dorscht J, Lurz R, Bielmann R, Wieland M, Zimmer M, Calendar R, Loessner MJ. 2008. The Terminally Redundant, Nonpermuted Genome of Listeria Bacteriophage A511: a Model for the SPO1-Like Myoviruses of Gram-Positive Bacteria. J Bacteriol. 190: 5753-5765. Green BD, Battisti L, Koehler TM, Thorne CB, Ivins BE. 1985. Demonstration of a capsule plasmid in Bacillus anthracis. Infect Immun. 49: 291-7. Adams MH. 1959. Methods of study of bacterial viruses, p. 443-457. In Bacteriophages. Interscience publishers, Inc., New York. Yamamoto KR, Alberts BM, Benzinger R, Lawhorne L, Treiber G. 1970. Rapid bacteriophage sedimentation in the presence of polyethylene glycol and its application to large-scale virus purification. Virology. 40: 734-44. Loessner MJ. 1991. Improved procedure for bacteriophage typing of Listeria strains and evaluation of new phages. Appl Environ Microbiol. 57: 882-4. 8
180 181 182 183 184 185 186 187 188 189 190 191 192 193 194 195 196 197 198 199 200 201 202 203 204
16. 17.
18. 19.
20.
21. 22.
23. 24.
Klumpp J, Lavigne R, Loessner MJ, Ackermann HW. 2010. The SPO1related bacteriophages. Arch Virol. 155: 1547-1561. Aziz RK, Bartels D, Best AA, DeJongh M, Disz T, Edwards RA, Formsma K, Gerdes S, Glass EM, Kubal M, Meyer F, Olsen GJ, Olson R, Osterman AL, Overbeek RA, McNeil LK, Paarmann D, Paczian T, Parrello B, Pusch GD, Reich C, Stevens R, Vassieva O, Vonstein V, Wilke A, Zagnitko O. 2008. The RAST Server: rapid annotations using subsystems technology. BMC Genomics. 9: 75. Smith MC Thorpe HM. 2002. Diversity in the serine recombinases. Mol Microbiol. 44: 299-307. Kilcher S, Loessner MJ, Klumpp J. 2010. Brochothrix thermosphacta bacteriophages feature heterogeneous and highly mosaic genomes and utilize unique prophage insertion sites. J Bacteriol. 192: 5441-5453. Kelly G, Prasannan S, Daniell S, Fleming K, Frankel G, Dougan G, Connerton I, Matthews S. 1999. Structure of the cell-adhesion fragment of intimin from enteropathogenic Escherichia coli. Nat Struct Biol. 6: 313-8. Fraser JS, Yu Z, Maxwell KL, Davidson AR. 2006. Ig-like domains on bacteriophages: a tale of promiscuity and deceit. J Mol Biol. 359: 496-507. Marti R, Zurfluh K, Hagens S, Pianezzi J, Klumpp J, Loessner MJ. 2013. Long tail fibers of the novel broad host range T-even bacteriophage S16 specifically recognize Salmonella OmpC. Mol Microbiol. 87: 818-834. Katsura I. 1987. Determination of bacteriophage lambda tail length by a protein ruler. Nature. 327: 73-5. Katsura I. 1990. Mechanism of length determination in bacteriophage lambda tails. Adv Biophys. 26: 1-18.
205 206
9
207
Figure legends
208
Figure 1: Electron microscopy of Etosha phage. A. Preparation overview. B. Close-
209
up of single phage particle. C. Details of the phage tail distal end. D. Details of the
210
phage head structure. Individual capsomers are visible, an observation previously
211
made for SPO1-related phages ([16]). Scale bars represent 100 nm.
212 213
Figure 2: Genome map of phage Etosha. Open reading frames are drawn to scale
214
and transcription direction is indicated by arrows. Selected proteins with putative
215
function are labeled. Genetic modules (i.e. structural genes, early genes) are
216
indicated by coloring.
217 218
Figure 3: Restriction profile of Etosha phage. Clear band separation up to 24 kb in
219
size could be achieved and the restriction profiles match with the sequenced genome
220
size. The terminal redundancy location and size was determined from the fragment
221
sizes as previously described [10, 11]. Enzymes used: 1: Alw44I (NEB); 2: Eco91I
222
(Fermentas); 3: NheI (NEB); 4: PacI (NEB); 5: SwaI (NEB); 6: Van91I (Fermentas); 7:
223
XcmI (NEB). M1: Lambda 19 Mix Size standard (Fermentas); M2: 1kb size standard
224
(Fermentas). Numbers to the left indicate band size in kb.
10
Table 1: Host range of Etosha on different Bacillus and non-Bacillus strains. The presence of plaques (successful infection of this strain) is indicated by + and an absence is indicated by -. Strain name 6602 R1 Sterne Weybridge UM44 Ames-non reverting
Organism Bacillus anthracis Bacillus anthracis Bacillus anthracis
Ames
Bacillus anthracis
Vollum 1b
Bacillus anthracis
PAK-1
Bacillus anthracis
LA 925 ATCC 14579 ATCC 11778 ATCC 10702 ATCC 10876 DSM 2302 BO 366 BO 372 BO 493 DSM 4218 ATCC 33019 ATCC 14737 DSM1274 ATCC 27522 NCTC 11143 NCIMB 8705 ATCC 6464 B346 DSM360 HER1399 WSBC 10530 WSBC 10556 WSBC 10566 WSBC 10583 DSM4421 WSBC 10204 HER1211 Kurstaki ATCC 10792 DSM168 DSM675 ATCC 23059 DSM395 DSM90 WSBC 10550 WSLC 3009 ATCC BAA-679 PSK Twort 414 100655 602 DT7155 CGSC 4401 DSM 20560 NZ9000 ATCC 19433
Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus cereus Bacillus thuringiensis Bacillus thuringiensis Bacillus thuringiensis Bacillus thuringiensis Bacillus thuringiensis Bacillus subtilis Bacillus subtilis Bacillus subtilis Bacillus sphaericus Bacillus megaterium Bacillus weihenstephanensis Listeria ivanovii Listeria monocytogenes Staphylococcus aureus Staphylococcus aureus Staphylococcus epidermidis Staphylococcus epidermidis Staphylococcus epidermidis Salmonella typhimurium Escherichia coli Streptococcus salivarius Lactococcus lactis Enterococcus faecalis
Bacillus anthracis
Notes pXO1-pXO2 negative pXO2 negative pXO2 negative pXO2 negative
Source {Green, 1985 #228} Institut Pasteur #7702 {Green, 1985 #228} U.S. Dept. of Agriculture, Ames, Iowa U.S. Dept. of Agriculture, Ames, Iowa Laboratory Strain Pakistan isolate, M. HughJones collection CHUV ATCC ATCC ATCC ATCC DSM This study This study This study DSM ATCC ATCC DSM ATCC NCTC NCIMB ATCC Mouse isolate DSM HER WSBC WSBC WSBC WSBC DSM WSBC HER R. Calendar, UC Berkeley ATCC DSM DSM ATCC DSM DSM WSBC WSLC ATCC Laboratory Stock Laboratory Stock Laboratory Stock Laboratory Stock Laboratory Stock Laboratory Stock CGSC DSM Laboratory Stock ATCC
Infection + + + + + + + + + + + + + + -
CHUV: Strain collection of the Centre Hospitalier universitaire Vaudois, Switzerland; HER: Félix d'Hérelle Reference Center for bacterial viruses, Laval, Canada; ATCC: American Type Culture Collection; NCTC: National Collection of Type Cultures; DSM: Deutsche Sammlung von Mikroorganismen; NCIMB: National Collection of Industrial Bacteria; WSBC: Weihenstephan Bacillus Collection; WSLC= Weihenstephan Listeria Collection. CGSC: Coli Genetic Stock Center, Yale, USA