Gene 594 (2016) 97–107
Contents lists available at ScienceDirect
Gene journal homepage: www.elsevier.com/locate/gene
Research paper
Complete genome sequencing and comparative analyses of broad-spectrum antimicrobial-producing Micromonospora sp. HK10 Madhumita Talukdar a,1, Dhrubajyoti Das a,1, Chiranjeeta Bora a, Tarun Chandra Bora a, Hari Prasanna Deka Boruah a, Anil Kumar Singh a,b,⁎ a b
Department of Biotechnology, CSIR-North East Institute of Science and Technology, Jorhat 785006, India Academy of Scientific and Innovative Research, Rafi Marg, New Delhi 110001, India
a r t i c l e
i n f o
Article history: Received 29 December 2015 Received in revised form 29 August 2016 Accepted 2 September 2016 Available online 05 September 2016 Keywords: Micromonospora Anti-microbial producing Bacterial genomics Secondary metabolite
a b s t r a c t Micromonospora genus produces N700 bioactive compounds of medical relevance. In spite of its ability to produce high number of bioactive compounds, no genome sequence is available with comprehensive secondary metabolite gene clusters analysis for anti-microbial producing Micromonospora strains. Thus, here we contribute the full genome sequence of Micromonospora sp. HK10 strain, which has high antibacterial activity against several important human pathogens like, Mycobacterium abscessus, Mycobacterium smegmatis, Bacillus subtillis, Staphylococcus aureus, Proteus vulgaris, Pseudomonas aeruginosa, Salmonella and Escherichia coli. We have generated whole genome sequence data of Micromonospora sp. HK10 strain using Illumina NexSeq 500 sequencing platform (2 × 150 bp paired end library) and assembled it de novo. The sequencing of HK10 genome enables identification of various genetic clusters associated with known- and probably unknown- antimicrobial compounds, which can pave the way for new antimicrobial scaffolds. © 2016 Elsevier B.V. All rights reserved.
1. Introduction Micromonospora is a genus of Gram-positive bacteria that taxonomically belong to the family of Micromonosporaceae with 63 species (Genilloud and Genus, 2012; Hirsch and Valdés, 2009; Genilloud, 2015). The members of Micromonospora genus are widely distributed in diverse geographical habitats e.g., soil, fresh water, marine sediment, rocks, nitrogen fixing nodules from leguminous and actinorhizal plants (Das et al., 2008; Charan et al., 2004; Trujillo et al., 2010; Valdés et al., 2005; Carro et al., 2013; Trujillo et al., 2006; Trujillo et al., 2007) etc. Micromonospora belongs to actinomycetes group, which are prolific producers of N 700 bioactive compounds including gentamicin and calicheamicin (Charan et al., 2004; Bérdy, 2005; Igarashi et al., 2011a; Igarashi et al., 2007; Igarashi et al., 2011b; Furumai et al., 2000; Furumai et al., 2002). Bioactive compounds produced by Micromonospora have been studied considerably for their critical roles in human health and medicines (Bérdy, 2005; Igarashi et al., 2011a; Abbreviation: COG, clusters of orthologous groups; KEGG, Kyoto encyclopedia of genes and genomes; HGT, horizontal gene transfer; GI, genetic island; IS, insertion sequence; CRISPRs, clustered regularly interspaced short palindromic repeats; SRP, signal recognition particle; Sec, sec secretion system; PKS, polyketide synthase; NRPS, nonribosomal peptide synthetase. ⁎ Corresponding author at: CSIR-North East Institute of Science & Technology, Jorhat 785006, Assam, India. E-mail address:
[email protected] (A.K. Singh). 1 Co-first authors.
http://dx.doi.org/10.1016/j.gene.2016.09.005 0378-1119/© 2016 Elsevier B.V. All rights reserved.
Igarashi et al., 2007; Igarashi et al., 2011b; Furumai et al., 2000; Furumai et al., 2002; Shimotohno et al., 1993; Tomita and Tamaoki, 1980; Lam et al., 1996). Advancement of new generation sequencing and improved accuracy of gene annotation, have opened up a new avenue for the discovery of novel secondary metabolite encoding gene clusters and their activities. Although Micromonospora genus belong to the order Actinomycetales along with Streptomyces genus, which are well known for production of secondary metabolites, a limited numbers of studies have been focused on the secondary metabolites and the biosynthetic gene clusters of this genus. Despite the production of various useful antibiotics by Micromonospora species, the full genomes of only limited number of strains have been available to date. The genomes of Micromonospora aurantiaca ATCC 27029T (accession no. NC_014391.1) and Micromonospora sp. ATCC 39149 (accession no. GG657738) isolated from soil and Micromonospora sp. L5 and Micromonospora lupini Lupac 08, isolated from nitrogen fixing root nodules of Casuarina equisetifolia (Valdés et al., 2005; Trujillo et al., 2014) and Lupinus angustifolius (Trujillo et al., 2010; Trujillo et al., 2014) respectively, are available in public database. Intrigued by its potential bioactivity, we contribute here a genome sequence of Micromonospora sp. strain HK10, a soil isolate from Kaziranga National Park, Assam, India (Talukdar et al., 2016). HK10 strain shows high antibacterial activity against both Gram-negative and Gram-positive bacteria (Talukdar et al., 2012). Since secondary metabolites play crucial roles in several ecological functions for example, chemical communication, nutrient attainment, defense mechanisms etc (Wietz et al., 2013), we analyzed genetic information
98
M. Talukdar et al. / Gene 594 (2016) 97–107
related to its unknown active secondary metabolites and respective genetic determinants. HK10 genome was compared within other Micromonospora (Micromonospora sp. L5 and M. aurantiaca ATCC20029T) and also with other actinobacteria such as, Salinispora arenicola CNS-205, Streptomyces coelicolor A3 (2), Nocardia brasiliensis ATCC 700358 and Rhodococcus pyridinivorans SB3094, which are known to produce bioactive compounds. The comparative genome analysis with other strains revealed some significant genetic differences. Furthermore, we compared various critical systems for their function and distributions of genes like two component regulatory systems, secretion systems and transporters mechanisms, which help the microbes to survive and adapt with their surrounding environments. 2. Material and methods 2.1. Strains, media and antagonism test The antibacterial assay was performed against various test pathogens: Salmonella sp. (IDH377), enterotoxigenic Escherichia coli E9 (serotype O146) (Pazhani et al., 2011), Mycobacterium smegmatis ATCC700084 and Mycobacterium abscessus ATCC19977T. The antibacterial activity of Micromonospora sp. HK10 was tested using agar diffusion assay method (Gramer et al., 1976; Forbes et al., 1990) by observing presence of inhibition zone (in mm) around the well. For extraction of crude compound, the pure isolate was cultured into 500 ml Erlenmeyer flasks containing 100 ml of CSPY liquid medium (K2HPO4 0.5 g, casein 3 g, maize starch 10 g, peptone 1 g, malt extract 10 g, yeast extract 1 g, distilled water 1 l. pH 7.5) for different time intervals such as (7, 10, 12 and 14 days) at 28 ± 2 °C at 200 rpm. The culture broth was solvent extracted with equal volume of chloroform (1:1) (v/v) in separating funnels by shaking vigorously for 15 min and the solvent part was dried to yield the crude extract. The dried crude extract was dissolved in 20% dimethyl sulphoxide (DMSO) and antimicrobial bioassay was carried out against the test pathogens. 70 μL of fresh inoculum from each culture was spread on the plates. Wells were punctured on freshly spread bacterial culture on Muller Hinton Agar (HiMedia, India) using sterile cork borer and the bored wells were filled with the 100 μg/ml extract of Micromonospora sp. HK10. All the plates were incubated at 37 °C for 48 h. Growth inhibition of test pathogens was measured with standard scale, the mean values of inhibition zones were taken and compared with 50 μg/ml neomycin sulfate (HiMedia, India). 2.2. HK10 whole genome sequencing, assembly and annotation The Micromonospora sp. HK10 genome was sequenced with the Illumina NextSeq 500 system using a paired-end 2 × 150-bp library. The raw data was trimmed using Trimmomatic version 0.30 (Bolger et al., 2014) for high-quality read length (cutoff quality score 20). A total of 8,896,194 high-quality, vector-filtered reads were used for assembly with CLC Genomics Workbench version 6 (CLC bio, Denmark) and the draft genome thus obtained was annotated for protein coding genes with the help of Glimmer (Delcher et al., 2007) and Prodigal v2.60 (Hyatt et al., 2010). This whole-genome shotgun project has been deposited at GenBank. 2.3. Complete genomic analysis of Micromonospora sp. HK10 The genome sequence was submitted to the Integrated Microbial Genomes (IMG) server (http://img.jgi.doe.gov) of the Joint Genome Institute (JGI) for more detail analysis and genome comparison (Markowitz et al., 2014a; Markowitz et al., 2014b). A one-sample ttest was carried out by using OriginPro 8 software to evaluate the statistically significant differences of gene abundance in each COG category between Micromonospora sp. HK10 and other five genomes of IMG database: M. aurantiaca ATCC 27029T (accession no. NC_014391.1), Micromonospora sp. L5 (accession no. NC_014815.1), Micromonospora
sp. ATCC 39149 (accession no. GG657738.1), M. lupini Lupac 08 (accession no. HF570108), and Micromonospora sp. CNB394 (Biosample no. SAMN02441007). Genomic alignment of the Micromonospora sp. HK10 was performed with Micromonospora sp. L5 and M. aurantiaca ATCC20029T and with four secondary metabolite producing actinobacteria viz., Salinispora arenicola CNS-205, Streptomyces coelicolor A3 (2), Nocardia brasiliensis ATCC 700358 and Rhodococcus pyridinivorans (SB3094) strains using MAUVE software (Darling et al., 2010) and average nucleotide sequence identity (ANI) score was validated by NCBI-Blastn. Complete genome sequence was analyzed using IMG server (http://img.jgi.doe.gov) and antiSMASH (Medema et al., 2011; Blin et al., 2013; Weber et al., 2015) was used to detect the secondary metabolite gene clusters. A Venn diagram of unique and shared genes present in the respective genomes was calculated using DrawVenn (http://bioinformatics.psb.ugent.be/webtools/Venn/). HK10 genome was examined for horizontal gene transfer (HGT) which plays critical roles in shaping the genome assisted by transposons and other origins. We performed two analyses: (i) GIs prediction by IslandViewer (using Dinucleotide-mobility analysis (DIMOB) and IslandPath tools) (Dhillon et al., 2015) and (ii) detection of insertion sequences from HK10 genome using IS Finder database (https://www-is.biotoul.fr/). The DIMOB analyses is based on dinucleotide bias and mobility genes to more accurately predicted regions, termed DIMOB islands (DIMOBIs) that may have been acquired by HGT. IslandPath combine these parameters using additional information of tRNAs and G + C percentage. All the predicted GI contents were compared by BLAST to identify the kind of genes acquired by HGT in HK10 strain from other bacteria. The draft genome of Micromonospora sp. HK10 was examined by CRT, PILERCR (Bland et al., 2007; Edgar, 2007), and CRISPR Finder tool (Grissa et al., 2007) to detect the presence of CRISPR elements, which confer immunity against incoming DNA, including bacteriophages. Transport system analyses of HK10 genome was performed by comparing each predicted protein against transporter classification database (TCDB) (http://www.tcdb.org/). 2.4. Phylogenetic analysis of Micromonospora sp. HK10 strain Total of 67 almost complete 16S rRNA sequences were acquired from public databases NCBI Genbank and aligned by ClustalW using 1.6 DNA matrix (Thompson et al., 1994). The evolutionary history was inferred by using the Maximum Likelihood method (Kimura, 1980; Felsenstein, 1985) based on 1000 bootstrap replicates. Streptomyces coelicolor A3 (2)T was used as an outgroup. Evolutionary analyses were conducted in MEGA7 software (Kumar et al., 2016). 2.5. Nucleotide sequence accession number HK10 Whole Genome Shotgun project has been deposited at DDBJ/ EMBL/GenBank under the accession number NZ_JTGL00000000.1. 3. Results and discussion 3.1. Genome characteristics and comparative analysis of HK10 strain with other Micromonospora strains The final de novo genome assembly of Micromonospora sp. HK10 contains 294 contigs with a total size of 6,911,179 bp (50% of the entire assembly of contigs i.e., “N50” equal or larger than 61,860 bp and the longest contig assembled measures 352,445 bp) with average G + C content of 73.39% (Table 1). The genome contains a total of 6282 predicted protein coding genes/CDSs, having 3 rRNAs and 58 tRNAs (Table 1). From total predicted protein coding genes, 4644 genes (73.93%) were allocated with a biological function while 1552 genes (24.71%) were categorized as of unknown function. Further, out of these 6282 CDS, 3598 genes (57.27%) were observed under the clusters of orthologous groups (COGs) and assigned to 25 different categories
M. Talukdar et al. / Gene 594 (2016) 97–107 Table 1 General statistics of Micromonospora sp. HK10 genome. Feature
Genome
DNA, total number of bases DNA coding number of bases DNA G + C content (%) Genes total number Protein coding genes Protein coding genes with function prediction Protein coding genes without function prediction RNA genes rRNA genes 5S rRNA 16S rRNA 23S rRNA tRNA genes Other RNA genes
6911179 6125905 73.39 6282 6196 4644 1552 86 3 1 1 1 58 25
99
(Fig. 1, Table S1 & S2). A one-sample t-test was used to evaluate if there were statistically significant differences in the gene abundances of each COG categories between the genomes of HK10 and the other five Micromonospora strains present in IMG database: M. aurantiaca ATCC 27029T, Micromonospora sp. L5, Micromonospora sp. ATCC 39149, M. lupini Lupac 08, and Micromonospora sp. CNB394 (Fig. 1, Table S1). The results showed that the abundances of the proteins responsible for carbohydrate transport and metabolism (B, 9.11%), cell motility (D, 0.37%), posttranslational modification, protein turnover, chaperones (S, 4%), signal transduction mechanisms (W, 5.6%) are significantly (P b 0.005) higher than the average levels, which are 8.71%, 0.265%, 3.79% and 5.38%, respectively. The abundances of the proteins responsible for chromatin structure and dynamics (F, 0.02%), coenzyme transport and metabolism (G, 5.99%), defense mechanisms (H, 0.02%), cytoskeleton (I, 2.92%), inorganic ion transport and metabolism (N, 4.49%), lipid transport and
Replication, recombination and repair RNA processing and modification Secondary metabolites biosynthesis, transport and catabolism Signal transduction mechanisms Transcription Translation, ribosomal structure and biogenesis Amino acid transport and metabolism Carbohydrate transport and metabolism Cell cycle control, cell division, chromosome partitioning Cell motility Cell wall/membrane/envelope biogenesis Chromatin structure and dynamics
(A)
(B)
Coenzyme transport and metabolism Cytoskeleton Defense mechanisms Energy production and conversion Extracellular structures Function unknown General function prediction only Inorganic ion transport and metabolism Intracellular trafficking, secretion, and vesicular transport Lipid transport and metabolism Mobilome: prophages, transposons Nucleotide transport and metabolism Posttranslational modification, protein turnover, chaperones Fig. 1. (A) Comparison of COG functional categories between Micromonospora sp. HK10 and other five Micromonospora genomes- Micromonospora sp. HK10, Micromonospora sp. L5, M. aurantiaca ATCC 27029T, Micromonospora sp. ATCC 39149, M. lupini Lupac 08 and Micromonospora sp. CNB394 (from the center to the outer side). (B) Comparison of COG functional categories between Micromonospora sp. HK10 and four actinobacteria genomes- Micromonospora sp. HK10, S. coelicolor A3 (2), S. arenicola CNS-205, R. pyridinivorans SB3094, N. brasiliensis ATCC 700358) from IMG database, (from the center to outer side).
100
M. Talukdar et al. / Gene 594 (2016) 97–107
metabolism (P, 5.03%), mobilome: prophages, transposons (Q, 0.37%), nucleotide transport and metabolism (R, 2.36%), RNA processing and modification (T. 0.02%) and secondary metabolite biosynthesis, transport and catabolism (V, 3.88%) are significantly (P b 0.005) lower than the average levels, 0.03%, 6.26%, 0.033%, 2.95%, 4.88%, 5.43%, 0.51%, 2.44%, 0.023%, and 4.26%, respectively (Fig. 1, Table S1). The genomic characteristics of HK10 strain and five other Micromonospora genomes available in the public databases including M. aurantiaca ATCC 27029T, Micromonospora sp. L5, Micromonospora sp. ATCC 39149, M. lupini Lupac 08, and Micromonospora sp. CNB394 were compared (Table S3). The important differences observed between the six strains were GC percentage and number of tRNA genes. HK10 genome has particularly highest GC content (73.39%) and second highest number of tRNAs (58) compared to other five strains (Table S3). Several reports in bacteria indicate that increase in GC content correlates with a higher temperature optimum and a broader tolerance range for a species (Nishio et al., 2003; Musto et al., 2006). Thus high GC content in HK10 might give more thermo stability for adaptation to the immediate environment and high numbers of tRNA genes can be seen as a potential for high protein synthesis and good growth rate (Rocha, 2004) in Micromonospora strains but it is matter of future investigation. A complete genome sequence makes the fundamental hereditary content of an organism finite and is the most suitable state for any genome sequence with less error bars. Therefore, we have further compared HK10 genome sequence with two fully sequenced genomes available in the database i.e., M. aurantiaca ATCC 27029T and Micromonospora sp. L5. Our analysis disclosed that the highest number (6360) of annotated genes present in M. aurantiaca ATCC 27029T followed by Micromonospora sp. L5 and HK10 strain (6326 and 6282 respectively) (Table S3). There were highest 60 predicted gene clusters having 861 genes (13.6% of total genome) in Micromonospora sp. L5, followed by M. aurantiaca ATCC 27029T with 893 genes (14.04% of total genome) and HK10 strain detected with lowest 49 gene clusters with 721 genes (11.48% of the total genome). Functional annotation using the Kyoto encyclopedia of genes and genomes (KEGG) exposed the total protein coding genes connected to KEGG pathways: lowest 1213 (19.31%) in Micromonospora sp. HK10, followed by 1247 (19.61%) of M. aurantiaca ATCC 27029T and highest 1277 (20.19%) of Micromonospora sp. L5. Similarly analysis with KEGG Orthology suggested, lowest number (2118) of genes (33.72%) found in Micromonospora sp. HK10, followed by 2237 (35.36%) and 2216 (34.84%) number of genes in Micromonospora sp. L5 and M. aurantiaca ATCC 27029T, respectively. Pair-wise genome synteny studies of these strains also revealed a high degree of positional conservation between the three genomes (Fig. S1). Although the three genomes have a considerable amount of genetic similarity, they have encountered with many inversion and translocations. HK10 genome possesses large number of non-conserved regions and unique genomic regions as compared to M. aurantiaca ATCC 27029T and Micromonospora sp. L5. The genomic comparison by average nucleotide sequence identity confirmed that HK10 strain has the highest similarity to Micromonospora sp. L5 (with 95% identity and 48% coverage), followed by M. aurantiaca ATCC 27029T (with 95% identity and 47.9% coverage). To evaluate genetic similarity of strains, the comparative Venn diagrams of the total protein coding genes with predicted function was analyzed. Interestingly, HK10 strain contains the highest number (4644, 73.93%) of protein coding genes with predicted function in comparison to Micromonospora sp. L5 (4358, 68.1%) and M. aurantiaca ATCC 27029T (4259, 66.97%) (Fig. S2). Our analysis also found that HK10 genome shared highest common genes (992 genes, 11.1%) with M. aurantiaca ATCC 27029T and lowest common genes (925, 10.2%) with Micromonospora sp. L5 (Fig. S2). Unique genes comparison of HK10 strain has shown highest 3643 genes (78.4%) with Micromonospora sp. L5 followed by 3252 genes (76.3%) with M. aurantiaca ATCC 27029T. Similarly, HK10 strain contains highest 3710
number of genes (79.8%), compared to Micromonospora sp. L5 3337 numbers of genes (76.5%) (Fig. S2). These genetic similarities revealed the closeness of HK10 strain with both strains. 3.2. Genomic comparison of Micromonospora sp. HK10 with other actinobacteria Since Micromonosporaceae family is well known for production of a number of medically relevant bioactive compounds, it is interesting to understand the genetic composition of other secondary metabolite producing actinobacteria, viz. Salinispora arenicola CNS-205 (Accession no.: NC_009953), Streptomyces coelicolor A3 (2) (Accession no.: AL645882), Nocardia brasiliensis ATCC700358 (Accession no.: NC_018681) and Rhodococcus pyridinivorans SB3094 (Accession no.: NC_023150). Onesample t-test was carried out between Micromonospora sp. HK10 and above four actinobacteria (Fig. 1, Table S2) suggests that the HK10 strain has significantly higher abundance of genes of carbohydrate transport and metabolism (B, 9.11%), cell motility (D, 0.37%), chaperons (S, 4%), and signal transduction mechanisms (W, 5.6%) compared to other actinobacteria strains (Fig. 1, Table S2). On contrary, abundance of genes categorized under cell cycle control, cell division, chromosome partitioning (C, 0.81%), coenzyme transport and metabolism (G, 5.99%), defense mechanisms (H, 0.02%), cytoskeleton (I, 2.92%), energy production and conversion (J, 5.89%), inorganic ion transport and metabolism (N, 4.49%), lipid transport and metabolism (P, 5.03%), mobiolome: prophages, transposons (Q, 0.37%), nucleotide transport and metabolism (R, 2.36%), RNA processing and modification (T, 0.02%), secondary metabolite biosynthesis, transport and catabolism (V, 3.88%), transcription (X, 10.31%) were significantly lower (Table S2). These results revealed a common feature in the secondary metabolite producing bacteria, which is the low abundance of genes in secondary metabolites biosynthesis and high abundance of genes in carbohydrate metabolism. Further we carried out a pair wise comparative genome analysis of HK10 strain between S. arenicola, S. coelicolor, N. brasiliensis and R. pyridinivorans strains using progressive Mauve aligner (Darling et al., 2010) (Fig. S3). Although the five genomes share a notable amount of genetic characteristics, they have encountered various inversions and translocations and Micromonospora sp. HK10 contains the highest number of non-conserved regions. The genomic comparison by average nucleotide sequence identity confirmed that HK10 strain has the highest similarity to S. arenicola. The order of similarity was S. arenicola (identity score of 92%, coverage of 33%) N S. coelicolor (identity score of 85%, coverage of 10%) N N. brasiliensis (similarity score of 85%, coverage of 5%) N R. pyridinivorans (similarity score of 86%, coverage of 4%). These results are consistent with their phylogenic relation of HK10 strain and S. arenicola, as both genera belong to Micromonosporaceae family. Total protein coding genes with- and without- predicted function is shown as: (4644/1552), (6485/2009), (5364/2846), (3703/1204) and (3316/1790) in Micromonospora sp. HK10, N. brasiliensis, S. coelicolor, R. pyridinivorans, S. arenicola, respectively. Here, we have compared ‘total protein coding genes with predicted function’ to validate the similarity between HK10 and the other actinobacteria genomes. Our analysis revealed the similarity in following order, R. pyridinivorans (1953 genes, 42.13% similarity) N N. brasiliensis (1405 genes, 30.31% similarity) N S. arenicola (856 genes, 18.46% similarity) N S. coelicolor (819 genes, 17.66% similarity) (Fig. S4). Surprisingly, this analysis brought the highest sharing (1953 protein coding genes) between HK10 and R. pyridinivorans strains, but above results (Fig. S3) was found just opposite. These variations might be arising because former results appeared after total genome analysis (Fig. S3) however, latter one utilizing only part of the genome for analysis. It is also noted that actinobacteria genomes have high number of genes categorized under ‘without predicted function’ category, shown as S. coelicolor, 2846 N N. brasiliensis, 2009 N S. arenicola, 1790 N R. pyridinivorans, 1204, which is also accountable for these differences.
M. Talukdar et al. / Gene 594 (2016) 97–107
3.3. Genomic Island and horizontal gene transfer
101
1, GI-11 and GI-16) had the similar DNA composition with the Micromonospora genus in general while the rest of the GIs showed similarity with various phyla of bacteria such as, Actinobacteria: Actinoplanes (GI-2, GI-13, GI-15 and GI-17), Salinispora (GI-3 and GI18), Streptomyces (GI-4), Verrucosispora (GI-5, GI-7, GI-12), Frankia (GI-8), Streptosporangium (GI-14); Chlorobi: Chlorobium tepidum (GI6); Planctomyces: Planctomyces limnophilus (GI-9); and Proteobacteria: Thiocystis violascens (GI-10) (Table S4). IMG server analysis revealed that HK10 strain has acquired lesser genes (48 genes) by recent horizontal transfer than Micromonospora sp. L5 (61 genes) and M. aurantiaca ATCC 27029T (67 genes) strains (Fig. 2 and Table S5). Overall horizontal gene transfer from phylum Proteobacteria is predominantly observed in all Micromonospora genome sequences. Interestingly HK10 genome has some unique gene transfer from Armatimonadetes phylum, which was absent in other available genome sequences (Table S5). Moreover, HK10 genome contains 23 mobile genetic elements from 11 different IS families i.e., IS 110, IS 1380, IS 21, IS 256, IS 3, IS 4, IS 5, IS 630, IS As1, IS L3 and IS NCY (Fig. S6 and Table S6). IS 110 family transposases are predominantly present in HK10 genome. Clustered regularly interspaced short palindromic repeats (CRISPRs) are structures, which conceal complex biological mechanisms to
A total of 18 genomic islands (GIs) were predicted by Island Viewer method (Dhillon et al., 2015) and their precise location is shown in Fig. S5 and Table S4. These 18 GIs encode a total of 113 (1.7%) genes in 1,34,243 bp from various sources viz. eukaryote, virus and other bacteria. All the identified genes from these GIs are summarized in Table S4, including genes encoding for numerous modifying genes i.e., hydrolases, acetyl transferases, dehydrogenase, methyl transferase, phosphotransferase, glycosyl transferase, helicase, glycosyl hydrolase. Thus, it shows that HK10 strain evolved by receiving GIs, which make them genetically richer and equipped with secondary metabolite modifying enzymes and eventually making them potential producer of several bioactive compounds. As per IslandPath-DIMOB GI-1 (4,549,755 bp to 4,571,443 bp) was found to be the prominent island representing 16 different genes while GI-3 (7,115,968 bp to 7,130,234 bp) was having 8 genes, which also includes transposes IS 116/IS 110/IS902 family genes (Table S4). Presence of three putative transposase (from GI-3, Ga0063173_03998, Ga0063173_6177 and Ga0063173_03269) and one phase integrase family gene (from GI-4, Ga0063173_04552) or their fragments were scattered in the genome that could support potential active HGT in the strain (Table S4). Interestingly only three GIs (GI-
Micromonospora sp. CNB394
65
M. lupini Lupac 08
66
174
Micromonospora sp. ATCC 39149
Micromonospora sp. L5
43
M. aurantiaca ATCC 27029
67
Micromonospora sp. HK10
48 0
50
100
150
200 Domain
Phylum
Order
Fig. 2. Comparative analysis of horizontally transferred genes of Micromonospora HK10 with other five Micromonospora genomes from IMG database.
102
M. Talukdar et al. / Gene 594 (2016) 97–107
account for their transfer, evolution and behavior. They have probably played an important role in the evolution of archaea and bacteria by providing a defense mechanism against foreign DNA (Horvath and Barrangou, 1994). Using CRT and PILERCR tools (Bland et al., 2007; Edgar, 2007) a total of 10 and with CRISPR database (Grissa et al., 2007) a total of 14 CRISPR sequences in HK10 genome were identified (Table S7). Notably no CRISPR sequences were found in Micromonospora sp. L5, M. aurantiaca ATCC 27029T and M. lupini Lupac 08 strains. The presence of number of CRISPR sequences might be responsible for resistance of foreign DNA sequences in HK10 strain compared to other available genomes of Micromonospora where no CRISPR sequences have been observed. Number of possible roles for CRISPR sequences have been suggested such as, chromosomal rearrangement, modulation of expression of neighboring genes, target for DNA binding proteins, replicon partitioning, and DNA repair (Makarova et al., 2002). Numerous in silico studies have been reported homology between spacer sequences and extra chromosomal elements (like viruses and plasmids), which led to the hypothesis that CRISPR may provide adaptive immunity against foreign genetic elements (Horvath and Barrangou, 1994; Makarova et al., 2006). These studies lead us to speculate that presence of CRISPR in HK10 strain might help in adaptation under variable environmental conditions. 3.4. Antimicrobial activity of Micromonospora sp. HK10 and complete secondary metabolism encoding genes HK10 strain has already been reported to possess a broad range of antibacterial activity against various Gram-positive and Gram-negative bacteria (Talukdar et al., 2012). To reaffirm this broad spectrum phenotype, we have evaluated antibacterial activity of HK10 strain against different clinical pathogens such as, Mycobacterium smegmatis ATCC 700084, Mycobacterium abscessus ATCC 19977T, Salmonella sp. IDH377, and enterotoxigenic Escherichia coli E9. The zone of inhibition was determined as 27 mm, 19 mm, 19 mm and 18 mm for M. smegmatis, M. abscessus, Salmonella sp. and E. coli respectively, compared to 19 mm, 17 mm, 13 mm and 15 mm against M. smegmatis, M. abscessus, Salmonella sp. IDH377 and E. coli E9, respectively using as positive control neomycin sulfate (Fig. 3). Although the concentration of HK10 extract used was double (100 μg/ml) than the neomycin sulfate (50 μg/ml), reasonably higher inhibition zones of HK10 extracts against all test pathogens clearly indicates the effectiveness of secondary metabolite produced by HK10. Secondary metabolite production of HK10 strain was optimized using different media at various time interval (data not shown) and 10 days old culture of HK10 strain grown on CSPY media emerged as the best for antibacterial activity. The antimicrobial activity of HK10 was absent when other media such as Mueller Hinton broth (Himedia), Luria Bertani broth (Himedia), Nutrient broth (Himedia) and Starch Casein broth (Himedia) was used. To investigate the secondary metabolite synthesis potential of HK10 strain, secondary metabolite encoding gene clusters was predicted using IMG server. HK10 genome possesses 49 secondary metabolite
C
C
NS
HK10
(i)
gene clusters, which are encoded by 721 genes with 11.48% of genome coverage (Fig. 4, Table S8). Total 49 gene clusters include 13 polyketide synthase (PKS) genes, 2 non-ribosomal peptide synthetase (NRPS)-PKS genes and 34 encoding other secondary metabolites including siderophores (Fig. 4, Table S8). Interestingly, the length of ten gene clusters exceeds N 25 Kb. The presence of large gene clusters in HK10 genome indicates that these clustering confer an ecological advantage and each gene of the cluster contributes to an enzymatic step in their respective cellular metabolic process. These clusters also imply transcriptional co-regulation of genes, which can be controlled at different levels of expression. HK10 genome is exceptionally rich in pks gene cluster (total 13) among sequenced genomes, providing evidence that HK10 strain harbors an extensive secondary metabolite metabolism (Fig. 4, Table S8). To understand the overall secondary metabolite synthesis potential of HK10 strain, these clusters were compared with Micromonospora sp. L5 and M. aurantiaca ATCC 27029T (Table S9). Compared to M. aurantiaca ATCC 27029T, HK10 genome had 8 unique gene clusters, 30 conserved gene clusters (similarity N60%) and 11 clusters were partially conserved (similarity N 30%) (Table S9). Similarly 9 gene clusters (Cluster 7: Putative, Cluster: 19: Putative, cluster 22: Lantipeptide, Cluster 29: Putative, Cluster 44: Lantipeptide, Cluster 45: Putative, Cluster 47: Lantipeptide; NRPS, T1 PKS, Cluster 48: Putative, Cluster 49: Putative) were found to be unique when HK10 genome was compared with Micromonospora sp. L5, along with 29 conserved gene clusters and 11 partially conserved clusters (Table S9, Fig. 5, Table S10). Overall seven gene clusters have been found unique in HK10 genome (Fig. 5). Our HGT analysis of HK10 genome has confirmed that cluster 7 (present in GI-17), cluster 22 (present in GI-8) and cluster 44 (present in GI-3) have been horizontally transferred from Actinoplanes sp., Frankia sp. and Salinispora arenicola respectively (Table S4 & S10). To explore the biosynthetic cluster closeness of HK10 strain with other secondary metabolite producing strains, predicted gene clusters have been compared with other actinobacteria. Our results revealed the similarity between HK10 strain and other actinobacteria in the following order Salinispora arenicola (16 conserved, 13 partially conserved, 20 absent) N Streptomyces coelicolor (3 conserved, 15 partially conserved, 31 absent) N Nocardia brasiliensis (4 partially conserved and 45 absent) N Rhodococcus pyridinivorans (4 partially conserved and 45 absent) (Table S9). Interestingly seven gene clusters (cluster 7, cluster 19, cluster 22, cluster 44, cluster 45, cluster 47 and cluster 48) of HK10 strain, did not show any similarity with other actinobacteria used in this study. These clusters might be interesting to explore in future for their secondary metabolite products. These secondary metabolite encoding gene clusters were again revalidated by antiSMASH cluster predicting tool (Medema et al., 2011; Blin et al., 2013; Weber et al., 2015). After analysis of HK10 genome, total twenty-two secondary metabolite encoding gene clusters have been discovered. Interestingly, antiSMASH analysis of HK10 genome emerged with one more lantipeptide gene cluster (Cluster 1) that was absent in our earlier IMG prediction (Table S8). There could be two possible reasons that
NS
HK10
(ii)
C
C
NS
HK10
(iii)
NS
HK10
(iv)
Fig. 3. Antimicrobial activity of Micromonospora sp. HK10-chloroform cell extracts (HK10) tested against (i) M. smegmatis, (ii) M. abscessus, (iii) Salmonella sp. IDH 377 (iv) Enterotoxigenic E. coli E9 strains, 50 μg/ml neomycin sulfate (NS) used as positive control and only DMSO used as a control (C).
M. Talukdar et al. / Gene 594 (2016) 97–107
6282 721
5561
Micromonospora sp. HK10
6360
893
5467
M. aurantiaca ATCC 27029
6326
861
5465
Micromonospora sp. L5
4751
Micromonospora sp. ATCC 39149
5692
7104
941
5836 726
8000
6000
4000
59
60
55
72
Micromonospora sp. CNB394
5110
Genes in Cluster
49
M. lupini Lupac 08
6110
994
103
2000
0
Genes not in Cluster
(A)
53
40
20
0 PKS
80
60 NRPS
NRPS-PKS
Other
(B)
Fig. 4. (A) Comparative analysis of total genes distributed under categories of: -present in secondary metabolite clusters/-not present in secondary metabolite clusters in selected Micromonospora spp. (B) Comparative analysis of total predicted secondary metabolite gene clusters types (PKS, NRPS, PKS-NRPS, OTHER) present in selected Micromonospora sps.
cause the differences: (1) in IMG the results of biosynthetic cluster prediction is affiliated with antiSMASH v2 but in our prediction with online versions of antiSMASH, we have used the online version of antiSMASH, i.e., version 3, which now annotates more classes of clusters including lantipeptides, (2) the number of results also depends on the parameters of running the antiSMASH, for example, IMG allow antiSMASH to predict putative clusters based on an implementation of Clusterfinder and this is not available by default on antiSMASH. Our analysis revealed five annotated biosynthetic gene clusters (Fig. S7): lantipeptide (cluster 1), PKS-hybrid (cluster 4), terpene (cluster 8), PKS III (cluster 13) and lantipeptide-siderophore (cluster 18), showing high level of similarity with SapB biosynthetic gene cluster of S. coelicolor (100%), Chlorothricin biosynthetic gene cluster of Streptomyces antibioticus (44%), Sioxanthin biosynthetic gene cluster of Salinispora tropica (80%), Alkyl-O-Dihydrogeranyl-Methoxyhydroquinones biosynthetic gene cluster of M. aurantiaca ATCC 27029T (71%), and Desferrioxamine B biosynthetic gene cluster of Streptomyces griseus (80%) (Table S11). These five clusters further aligned with other secondary metabolite producing strains of the database and all the results have been shown in Fig. S7. Lantipeptide encoding cluster 1, PKS-hybrid encoding cluster 4 and lantipeptide-siderophore encoding cluster 18 have been showing highest gene similarity with Streptomyces davawensis (19%), Streptomyces antibioticus (58%) and Streptomyces fradiae (30%) strains respectively, whereas terpene encoded cluster 8 and PKS III encoding cluster 13 are more similar to Micromonospora sp. ATCC 39149 (29%) and M. aurantiaca ATCC 27029T (82%) strains (Fig. S7). Overall our study provides critical genetic insights into the HK10 strain, which has antibacterial activity against a wide range of bacteria, including Mycobacteria and can produce several bioactive secondary metabolites. The antibacterial activity of HK10 strain and its link with predicted secondary metabolite gene clusters is our undergoing research work. 3.5. Secretion systems of Micromonospora sp. HK10 Secretion systems are essential for translocation of various proteins across the cell membrane which plays crucial roles like host-pathogen interactions and colonization (Driessen and Nouwen, 2008). Proteins in Gram-positive bacteria predominantly translocated by conserved Sec translocase system or twin-arginine translocation (Tat) system.
Sometimes the type VII or ESX secretion system has been also reported in some Gram positive bacteria. Similar to them HK10 genome encodes all the Sec components (Table S12), viz. the essential proteins SecY (Ga0063173_02940) and SecE (Ga0063173_02979), along with SecD (Ga0063173_02596) and SecF (Ga0063173_02597). Both co- and post-translational routes of targeting secretory proteins to the Sec translocon are present in HK10 strain. HK10 genome also encodes SecA protein (Ga0063173_02270, Ga0063173_06183) like Streptomyces lividans, which has been shown in vitro to be required for the secretion of the Sec-dependent α-amylase (Blanco et al., 1998). HK10 genome also contains ffh gene (Ga0063173_03995), encodes for signal recognition particle subunit SRP54 and ftsY (Ga0063173_03984), encoding the SRP receptor. Mutations in ftsY gene affect sporulation and antibiotic production in S. coelicolor (Shen et al., 2008). HK10 genome showed presence of various Tat pathways homologs. Similar to other actinobacteria, TatA (Ga0063173_05927), and TatC (Ga0063173_05926) are found adjacent to each other while TatB (Ga0063173_05707) was located distantly. Unlike M. lupini lupac 08 genome, which has no TatB homologue (Trujillo et al., 2014), HK10 Tat secretion pathway is very similar with Micromonospora sp. L5 and M. aurantiaca ATCC 27029T. The Tat pathway is responsible for export of number of pre-folded proteins across bacterial membranes (Lee et al., 2006). Tat proteins are also involved in different cellular activities including cell envelope biogenesis, metal acquisition, anaerobic metabolism and detoxification and virulence (Lee et al., 2006). Hence, presence of Tat pathway in Micromonospora strains might useful in dealing with the changing environment around them. HK10 genome has total 6 genes (Ga0063173_00846, Ga0063173_00894, Ga0063173_00895, Ga0063173_03797, Ga0063173_03798, Ga0063173_03804), which were categorized under RD1 or ESX-1/Snm protein secretion system family (Table S13 & S14). Like different Streptomyces, the function of these genes in Micromonospora genus is currently unknown, but a significant physiological role is suggested by analogy to similar systems in other organisms. T4SS have been mainly studied in Gram-negative bacteria but in Gram-positive bacteria the information is very limited so far. However, Type IV (Conjugal DNA-Protein transfer or VirB) secretory pathway (IVSP) family gene (Ga0063173_02432) was observed in HK10 genome (Table S14) and similar type of gene (virB4) was also reported in M.
104
M. Talukdar et al. / Gene 594 (2016) 97–107
(A)
(C)
(B)
(D)
(E)
(F)
(G) Fig. 5. Seven unique biosynthetic gene clusters present in Micromonospora sp. HK10: (A) Cluster-7 (B) Cluster-19 (C) Cluster-22 (D) Cluster-44 (E) Cluster-45 (F) Cluster-47 (G) Cluster48.
lupini Lupac 08, an endophytic actinobacterium (Trujillo et al., 2014), but was interestingly absent in most of the sequenced Micromonospora species (data not shown), including Micromonospora sp. L5 and M. aurantiaca ATCC 27029T strains. The precise role of IVSP family genes in Micromonospora genus is matter of future research.
3.6. Transporters of HK10 strain Various active transporter systems are involved in transport of nutrient elements, detoxification and defense mechanisms. HK10 genome contains total 74 families of transporters covered by total 587 genes
M. Talukdar et al. / Gene 594 (2016) 97–107
(9.3% of total CDS), which are involved in the transport systems for carbohydrates, lipid, amino acids, nucleotides, aromatic compounds, metals and inorganic ions (Tables S13 &S14). HK10 genome has emerged with the lowest 587 (9.3%) number of transporter genes with 49 predicted metabolite clusters whereas M. aurantiaca ATCC 27029T and Micromonospora sp. L5 contain 630 (9.9%) genes with 59 predicted secondary metabolite gene clusters and 658 (10.4%) with 60 clusters, respectively. The soil actinomycetes, S. coelicolor has also total 55 predicted secondary metabolite gene clusters with total 834 transporter genes (10.1% genome coverage) (Bentley et al., 2002). These transporters of HK10 strain are potentially involved in metabolite export and are consistent with such extensive secondary metabolism. HK10 genome contains proton-dependent oligopeptide transporter (POT) (Ga0063173_06054), responsible for peptide transport (Table S14), which regulates the expression of secondary metabolite genes (Lorenzana et al., 2004). Major facilitator superfamily (MFS) transporters show specificity for many vital components like nutrients, peptides, nucleotides, drugs, Krebs cycle metabolites, osmolites, siderophores (efflux & uptake) organic and inorganic anions, etc (Law et al., 2008). MFS transporter PenT regulates penicillin production by enhancing the translocation of penicillin precursors across fungal cellular membrane (Yang et al., 2012). RND (Resistance-Nodulation-Division) family transporters are widespread especially among Gramnegative bacteria, and catalyze the active efflux of many antibiotics and chemotherapeutic agents and might help to survive against different toxin compounds (Nikaido and Takatsuka, 1794). Interestingly HK10 genome contains highest number of MFS (64) and RND (10) transporters when compared with Micromonospora sp. L5 (61 MFS & 9 RND) and M. aurantiaca ATCC 27029T (58 MFS and 8 RND). Moreover, membrane fusion protein (MFP) transporter (Li and Nikaido, 2009) (Ga0063173_03875) presents only in HK10, which was not found in both Micromonospora sp. L5 and M. aurantiaca ATCC 27029T strains (Table S14). HK10 genome contains some other transporters, viz. Major intrinsic protein (MIP) family (TC:1.A.8), Lactate permease (LctP) family (TC:2.A.14) and Zinc (Zn2 +)-Iron (Fe2 +) permease (ZIP) family (TC:2.A.5), which was absent in both Micromonospora sp. L5 and M. aurantiaca ATCC 27029T strains (Table S12). In contrast, several transporters were absent in HK10 strain e.g., Urea transporter (UT) family (TC:1.A.28), Betaine/Carnitine/Choline transporter (BCCT) family (TC:2.A.15), Ca2+:Cation Antiporter (CaCA) family (TC:2.A.19), Arsenical resistance-3 (ACR3) family (TC:2.A.59), Monovalent cation (K+ or Na+):Proton antiporter-3 (CPA3) family (TC:2.A.63), Auxin efflux carrier (AEC) family (TC:2.A.69), L-Lysine exporter (LysE) family (TC:2.A.75), Gluconate:H+ Symporter (GntP) family (TC:2.A.8), the Tricarboxylate Transporter (TTT) Family (TC:2.A.80), putative cobalt transporter (CbtAB) family (TC:9.B.69) (Table S13). 3.7. Salt tolerant characteristic of HK10 strain Genomic background of Salinispora suggests its evolution from a terrestrial environment (Penn and Jensen, 2012) which raises the question whether Micromonospora has some common genes related to marine environment. There are Micromonospora species now which have been reported from marine environment (Das et al., 2008; Phongsopitanun et al., 2015; Supong et al., 2013; Maldonado et al., 2009; Zhao et al., 2004; Mincer et al., 2002) and produce various antimicrobials of medical importance (Charan et al., 2004; Thi et al., 2016; Carlson et al., 2013; Kyeremeh et al., 2014). We have confirmed the salt tolerance ability of HK10 strain by growing them on higher concentration (0.5–3.0%) of sodium chloride and best antibacterial activity was observed at 2.5% sodium chloride (data not shown). This characteristic suggests that certain genes may be adapted from marine microbes. Putative potassium uptake protein (Trk) (Ga0063173_05704) which might be useful in marine adaptation (Schlosser et al., 1993; Nakamura et al., 1994) and NDH protein (Ga0063173_00883)
105
constitutes a part of proton pumping NADH dehydrogenase and creates a proton-motive force for generation of ATP (Schlosser et al., 1993). HK10 genome also contains gene encoding mechanosensitive channels MscL (Ga0063173_00971) and MscS (Ga0063173_02549, Ga0063173_04307, Ga0063173_06285) (Table S15 & S16), which enable growth at low osmotic pressure (Bucarey et al., 2012). These genes are also present in Micromonospora sp. L5 and M. aurantiaca ATCC 27029T strains. Presence of these genes in HK10 strain could suggest an adaptation from marine bacteria and might be helpful in salt tolerance in adverse environmental condition.
3.8. Signal transduction and two-component system The full list of genes involved in signal transduction and two-component systems (TCS) in HK10 strain are given in Table S15 & S16. Various studies have shown that biosynthesis of antibiotic and many other secondary metabolites are strongly down regulated by inorganic phosphate (Sola-Landa et al., 2003; Ishige et al., 2003; Lefevre et al., 1997; Braibant and Content, 2001; Glover et al., 2007). In HK10 strain, the inorganic phosphate depriving condition might be controlled by senX3regX3 two-component system. Although the two-component system senX3-regX3 (Ga0063173_04526 and Ga0063173_04527) was found in HK10 strain, the two-component system phoR-phoP was absent. The alkaline phosphatase, phoA gene (Ga0063173_03810) and ABC phosphate transporter operon pstSCAB (Ga0063173_04482, Ga0063173_04481, Ga0063173_04480, Ga0063173_04479) was also present in HK10 genome. We have also observed a few more genes of phosphate transporter for example, phosphate starvation-inducible protein, PhoH (Ga0063173_04785) and phosphate transport system protein, phoU (Ga0063173_04525). Instead of alkaline phosphatase, phoA gene, alkaline phosphatase, phoD gene (Micau_4701 and Micau_4898) was present in M. aurantica. K+ intracellular cation is required for various physiological processes like turgor homeostasis (Epstein, 1986), pH regulation (Booth, 1985), gene expression (Prince and Villarejo, 1990) etc. For regulation of K+ ion and adaptation under various stress conditions, HK10 genome has KdpD-KdpE (Ga0063173_03480, Ga0063173_03479) two-component ATPase genes kdpABC system and K+-transporting (Ga0063173_03483, Ga0063173_03482, Ga0063173_03481), whereas KdpF gene was absent in HK10 genome like E. coli (Prince and Villarejo, 1990). HK10 genome has temperature regulator mechanism DesK-DesR two-component system, which works when the ambient temperature drops below 30 °C (Aguilar et al., 2001). Similar DesK-DesR system is also found in both Micromonospora sp. L5 and M. aurantiaca ATCC 27029T. Beside this, tricarboxylic transporter TctA-TctB-TctC operon (ML5_4359, ML5_4358, ML5_4357) is only present in Micromonospora sp. L5 and not found in both HK10 strain and M. aurantiaca ATCC 27029T. We have also observed gene related to fumarate metabolism fumarate reductase, frdA (Ga0063173_02826, Ga0063173_03575) present in HK10 genome, which was absent in both Micromonospora sp. L5 and M. aurantiaca ATCC 27029T strains.
3.9. Phylogenetic analysis of Micromonospora The phylogenetic position based on 16S rRNA gene sequence analysis of strain HK10 indicated that it was placed in a monophyletic clade with closest relatives, M. auratinigra TT1-11T (99% of identity with 97% coverage), M. peucetia DSM 43363T (98% of identity with 97% coverage) and M. chaiyaphumensis MC5-1T (99% of identity with 94% coverage). On habitat basis HK10 strain was clearly positioned within the genus Micromonospora and form a sub group together with other soil isolates (M. auratinigra TT1-11T, M. peucetia DSM 43363T and M. chaiyaphumensis MC5-1T) (Fig. S8).
106
M. Talukdar et al. / Gene 594 (2016) 97–107
4. Conclusions Despite the ability to produce several useful antibiotics, very few genome sequences of Micromonospora are available in the database. We contribute with a 6,911,179 bp genome of a soil isolate Micromonospora sp. HK10, with broad spectrum antibacterial activity against Gram-negative and Gram-positive bacteria including enterotoxigenic E. coli, Salmonella sp., M. abscessus and M. smegmatis. HK10 genome analysis emerged with total 49 secondary metabolite gene clusters (encoding mainly PKS, NRPS, siderophore, terpene etc) including comparatively higher (13) PKS and 7 unique gene clusters, which can be exploited for conceiving the new bioactive molecules. Further, this study also provides a new genetic insight into horizontal gene transfer (18-genetic islands with 3 new secondary metabolite genetic cluster), secretion system, CRISPRs, two-component system, salt tolerant behavior and transport mechanism in Micromonospora. This study will certainly enhance our molecular understanding about secondary metabolite engineering and environmental adaptations of Micromonospora and other related microbes. Acknowledgement This work was funded by the Council of Scientific and Industrial Research (CSIR), India (CSC131, BSC109, and HCP0001). M.T., D.D., and C.B. are supported by a research fellowship from the CSIR, India. We thank the Director, CSIR-North East Institute of Science and Technology (CSIR-NEIST), Jorhat, India, for his consistent encouragement and support. We are grateful to Dr. T. Ramamurthy, National Institute of Cholera and Enteric Diseases, Kolkata, India, for his kind gift of Salmonella sp. (ID377) and Escherichia coli E9 (serotype 0146) strains used in this study. Authors are also very thankful to Prof. Paul Jensen for his kind permission to use the Micromonospora sp. CNB394 genome sequence in this study. Appendix A. Supplementary data Supplementary data to this article can be found online at http://dx. doi.org/10.1016/j.gene.2016.09.005. References Aguilar, P.S., Hernandez-Arriaga, A.M., Cybulski, L.E., Erazo, A.C., de Mendoza, D., 2001. Molecular basis of thermosensing: a two component signal transduction thermometer in Bacillus subtilis. EMBO J. 20, 1681–1691. Bentley, S.D., Chater, K.F., Cerdeño-Tárraga, A.-M., Challis, G.L., Thomson, N.R., James, K.D., et al., 2002. Complete genome sequence of the model actinomycete Streptomyces coelicolor A3(2). Nature 417, 141–147. Bérdy, J.J., 2005. Bioactive microbial metabolites. Antibiot. 58, 1–26. Blanco, J., Driessen, A.J., Coque, J.J., Martín, J.F., 1998. Biochemical characterization of the SecA protein of Streptomyces lividans – interaction with nucleotides, binding to membrane vesicles and in vitro translocation of proAmy protein. Eur. J. Biochem. 257, 472–478. Bland, C., Ramsey, T.L., Sabree, F., Lowe, M., Brown, K., Kyrpides, N.C., Hugenholtz, P., 2007. CRISPR recognition tool (CRT): a tool for automatic detection of clustered regularly interspaced palindromic repeats. BMC Bioinformatics. 8, 209. Blin, K., Medema, M.H., Kazempour, D., Fischbach, M.A., Breitling, R., Takano, E., Weber, T., 2013. antiSMASH 2.0—a versatile platform for genome mining of secondary metabolite producers. Nucleic Acids Res. 41, 204–212. Bolger, A.M., Lohse, M., Usadel, B., 2014. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120. Booth, I.R., 1985. Regulation of cytoplasmic pH in bacteria. Microbiol. Rev. 49, 359–378. Braibant, M., Content, J., 2001. The cell surface associated phosphatase activity of Mycobacterium bovis BCG is not regulated by environmental inorganic phosphate. FEMS Microbiol. Lett. 195, 121–126. Bucarey, S.A., Penn, K., Paul, L., Fenical, W., Jensen, P.R., 2012. Genetic complementation of the obligate marine actinobacterium Salinispora tropica with the large mechanosensitive channel gene mscL rescues cells from osmotic downshock. Appl. Environ. Microbiol. 78, 4175–4182. Carlson, S., Marler, L., Nam, S.J., Santarsiero, B.D., Pezzuto, J.M., Murphy, B.T., 2013. Potential chemopreventive activity of a new macrolide antibiotic from a marine-derived Micromonospora sp. Mar Drugs. 11, 1152–1161. Carro, L., Pujic, P., Trujillo, M.E., Normand, P., 2013. Micromonospora is a normal inhabitant of actinorhizal nodules. J. Biosci. 38, 685–693.
Charan, R.D., Schlingmann, G., Janso, J., Bernan, V., Feng, X., Carter, G.T., 2004. Diazepinomycin, a new antimicrobial alkaloid from marine Micromonospora sp. J. Nat. Prod. 67, 1431–1433. Darling, A.E., Mau, B., Perna, N.T., 2010. ProgressiveMauve: multiple genome alignment with gene gain, loss and rearrangement. PLoS One 5, e11147. Das, S.S., Ward, L.R., Burke, C., 2008. Prospect of using marine actinobacteria as probiotic in aquaculture. Appl. Microbiol. Biotechnol. 81, 419–429. Delcher, A.L., Bratke, K.A., Powers, E.C., Salzberg, S.L., 2007. Identifying bacterial genes and endosymbiont DNA with glimmer. Bioinformatics 23, 673–679. Dhillon, B.K., Laird, M.R., Shay, J.A., Winsor, G.L., Lo, R., Nizam, F., Pereira, S.K., Waglechner, N., McArthur, A.G., Langille, M.G., Brinkman, F.S., 2015. IslandViewer 3: more flexible, interactive genomic island discovery, visualization and analysis. Nucleic Acids Res. 43, 104–108. Driessen, A.J., Nouwen, N., 2008. Protein translocation across the bacterial cytoplasmic membrane. Annu. Rev. Biochem. 77, 643–667. Edgar, R.C., 2007. PILER-CR: fast and accurate identification of CRISPR repeats. BMC Bioinformatics. 8, 18. Epstein, W., 1986. Osmoregulation of K+ transport in Escherichia coli. FEMS Microbiol. Rev. 39, 73–78. Felsenstein, J., 1985. Confidence limits on phylogenies: an approach using the bootstrap. Evolution 39, 783–791. Forbes, B.A., Sahm, D.F., Weissfeld, A.S., Trevino, E.A., 1990. Methods for testing antimicrobial effectiveness. In: Bailey Scotts Diagnostic Microbiology, Baron, E.J., Peterson, L.R., Finegold, S.M. (Eds.), Bailey Scotts Diagnostic Microbiology. Mosby Co, St Louis, Missouri MI, pp. 171–194. Furumai, T., Takagi, K., Igarashi, Y., Saito, N., Oki, T., 2000. Arisostatins A and B, new members of tetrocarcin class of antibiotics from Micromonospora sp. TP-A0316. I. Taxonomy, fermentation, isolation and biological properties. J Antibiot. 53, 227–232. Furumai, T., Igarashi, Y., Higuchi, H., Saito, N., Oki, T., 2002. Kosinostatin, a quinocycline antibiotic with antitumor activity from Micromonospora sp. TP-A0468. J Antibiot. 55, 128–133. Genilloud, O., 2015. Micromonospora. Bergey's Manual of Systematics of Archaea and Bacteria, pp. 1–28. Genilloud, O., Genus, I., 2012. In: Goodfellow, M., Kämpfer, P., Busse, H.-J., Trujillo, M.E., Suzuki, K.I., Ludwig, W., Whitman, W.B. (Eds.), Micromonospora; in Bergey's Manual of Systematic Bacteriology: The Actinobacteria, 2nd edition GA: Bergey's Manual Trust, Springer, Athens, pp. 1039–1057. Glover, R.T., Kriakov, J., Garforth, S.J., Baughn, A.D., Jacobs Jr., W.R., 2007. The two-component regulatory system senX3-regX3 regulates phosphate-dependent gene expression in Mycobacterium smegmatis. J. Bacteriol. 189, 5495–5503. Gramer, A., Collins, C.H., Lyne, P.M., 1976. Antibiotic sensitivity and assay tests. Microbiological Methods. London: Butterworth and Co, p. 235. Grissa, I., Vergnaud, G., Pourcel, C., 2007. CRISPRFinder: a web tool to identify clustered regularly interspaced short palindromic repeats. Nucleic Acids Res. 35, W52–W57. Hirsch, A.M., Valdés, M., 2009. Micromonospora: an important microbe for biomedecine and potentially for biocontrol and biofuel. Soil Biol. Biochem. 42, 536–542. Horvath, P., Barrangou, R., 1994. CRISPR/Cas, the immune system of bacteria and archaea. Science 327, 167–170. Hyatt, D., Chen, G.L., LoCascio, P.F., Land, M.L., Larimer, F.W., Hauser, L.J., 2010. Prodigal: prokaryotic gene recognition and translation initiation site identification. BMC Bioinformatics. 11, 119. Igarashi, Y., Trujillo, M.E., Martínez-Molina, E., Yanase, S., Miyanaga, S., Obata, T., Sakurai, H., Saiki, I., Fujita, T., Furumai, T., 2007. Antitumor anthraquinones from an endophytic actinomycete Micromonospora lupini sp. nov. Bioorg. Med. Chem. Lett. 17, 3702–3705. Igarashi, Y., Ogura, H., Furihata, K., Oku, N., Indananda, C., Thamchaipenet, A., 2011a. Maklamicin, an antibacterial polyketide from an endophytic Micromonospora sp. J. Nat. Prod. 74, 670–674. Igarashi, Y., Yanase, S., Sugimoto, K., Enomoto, M., Miyanaga, S., Trujillo, M.E., Saiki, I., Kuwahara, S., 2011b. Lupinacidin C, an inhibitor of tumor cell invasion from Micromonospora lupini. J. Nat. Prod. 74, 862–865. Ishige, T., Krause, M., Bott, M., Wendisch, V.F., Sahm, H., 2003. The phosphate starvation stimulon of Corynebacterium glutamicum determined by DNA microarray analyses. J. Bacteriol. 185, 4519–4529. Kimura, M., 1980. A simple method for estimating evolutionary rate of base substitutions through comparative studies of nucleotide sequences. J. Mol. Evol. 16, 111–120. Kumar, S., Stecher, G., Tamura, K., 2016. MEGA7: Molecular Evolutionary Genetics Analysis version 7.0 for bigger datasets. Mol. Biol. Evol. pii: msw054. Kyeremeh, K., Acquah, K.S., Sazak, A., Houssen, W., Tabudravu, J., Deng, H., Jaspars, M., 2014. Butremycin, the 3-hydroxyl derivative of ikarugamycin and a protonated aromatic tautomer of 5′-methylthioinosine from a Ghanaian Micromonospora sp. K310. Mar Drugs. 12, 999–1012. Lam, K.S., Hesler, G.A., Gustavson, D.R., Berry, R.L., Tomita, K., MacBeth, J.L., Ross, J., Miller, D., Forenza, S., 1996. Pyrrolosporin A, a new antitumor antibiotic from Micromonospora sp. C39217-R109-7. I. Taxonomy of producing organism, fermentation and biological activity. J Antibiot. 49, 860–864. Law, C.J., Maloney, P.C., Wang, D.N., 2008. Ins and outs of major facilitator superfamily antiporters. Annu. Rev. Microbiol. 62, 289–305. Lee, P.A., Tullman-Ercek, D., Georgiou, G., 2006. The bacterial twin-arginine translocation pathway. Annu. Rev. Microbiol. 60, 373–395. Lefevre, P., Braibant, M., de Wit, L., Kalai, M., Roeper, D., Grotzinger, J., Delville, J.P., Peirs, P., Ooms, J., Huygen, K., Content, J., 1997. Three different putative phosphate transport receptors are encoded by the Mycobacterium tuberculosis genome and are present at the surface of Mycobacterium bovis BCG. J. Bacteriol. 179, 2900–2906. Li, X.-Z., Nikaido, H., 2009. Efflux-mediated drug resistance in bacteria: an update. Drugs 69, 1555–1623.
M. Talukdar et al. / Gene 594 (2016) 97–107 Lorenzana, L.M., Perez-Redondo, R., Santamarta, I., Martin, J.F., Liras, P., 2004. Two oligopeptide-permease-encoding genes in the clavulanic acid cluster of Streptomyces clavuligerus are essential for production of the beta-lactamase inhibitor. J. Bacteriol. 186, 3431–3438. Makarova, K.S., Aravind, L., Grishin, N.V., Rogozin, I.B., Koonin, E.V., 2002. A DNA repair system specific for thermophilic archaea and bacteria predicted by genomic context analysis. Nucleic Acids Res. 30, 482. Makarova, K.S., Grishin, N.V., Shabalina, S.A., Wolf, Y.I., Koonin, E.V., 2006. A putative RNAinterference-based immune system in prokaryotes: computational analysis of the predicted enzymatic machinery, functional analogies with eukaryotic RNAi, and hypothetical mechanisms of action. Biol. Direct 1, 7. Maldonado, L.A., Fragoso-Yáñez, D., Pérez-García, A., Rosellón-Druker, J., Quintana, E.T., 2009. Actinobacterial diversity from marine sediments collected in Mexico. Antonie Van Leeuwenhoek 95, 111–120. Markowitz, V.M., Chen, I.M., Chu, K., et al., 2014a. IMG/M 4 version of the integrated metagenome comparative analysis system. Nucleic Acids Res. 42, D568–D573. Markowitz, V.M., Chen, I.-M.A., Palaniappan, K., et al., 2014b. IMG 4 version of the integrated microbial genomes comparative analysis system. Nucleic Acids Res. 42 [Database issue] D560–7. Medema, M.H., Blin, K., Cimermancic, P., de Jager, V., Zakrzewski, P., Fischbach, M.A., Weber, T., Takano, E., Breitling, R., 2011. antiSMASH: rapid identification, annotation and analysis of secondary metabolite biosynthesis gene clusters in bacterial and fungal genome sequences. Nucleic Acids Res. 39, 339–346. Mincer, T.J., Jensen, P.R., Kauffman, C.A., Fenical, W., 2002. Widespread and persistent populations of a major actinomycete taxon in ocean sediments. Appl. Environ. Microbiol. 68, 5005–5011. Musto, H., et al., 2006. Genomic GC level, optimal growth temperature, and genome size in prokaryotes. Biochem. Biophys. Res. Commun. 347, 1–3. Nakamura, T., Matsuba, Y., Yamamuro, N., Booth, I.R., Unemoto, T., 1994. Cloning and sequencing of a K + transport gene (trkA) from the marine bacterium Vibrio alginolyticus. Biochim. Biophys. Acta 1219, 701–705. Nikaido, H., Takatsuka, Y., 1794. Mechanisms of RND multidrug efflux pumps. Biochim. Biophys. Acta 2009, 769–781. Nishio, Y., et al., 2003. Comparative complete genome sequence analysis of the amino acid replacements responsible for the thermostability of Corynebacterium efficiens. Genome Res. 13, 1572–1579. Pazhani, G.P., Chakraborty, S., Fujihara, K., Yamasaki, S., Ghosh, A., Nair, G.B., Ramamurthy, T., 2011. QRDR mutations, efflux system & antimicrobial resistance genes in enterotoxigenic Escherichia coli isolated from an outbreak of diarrhoea in Ahmedabad, India. Indian J. Med. Res. 134, 214–223. Penn, K., Jensen, P.R., 2012. Comparative genomics reveals evidence of marine adaptation in Salinispora species. BMC Genomics 13, 86. Phongsopitanun, W., Kudo, T., Mori, M., Shiomi, K., Pittayakhajonwut, P., Suwanborirux, K., Tanasupawat, S., 2015. Micromonospora fluostatini sp. nov., isolated from marine sediment. Int. J. Syst. Evol. Microbiol. 65, 4417–4423. Prince, W.S., Villarejo, M.R., 1990. Osmotic control of proU transcription is mediated through direct action of potassium glutamate on the transcription complex. J. Biol. Chem. 265, 17673–17679. Rocha, E.P., 2004. Codon usage bias from tRNA's point of view: redundancy, specialization, and efficient decoding for translation optimization. Genome Res. 14, 2279–2286. Schlosser, A., Hamann, A., Bossemeyer, D., Schneiderm, E., Bakker, E.P., 1993. NAD+ binding to the Escherichia coli K(+)-uptake protein TrkA and sequence similarity between TrkA and domains of a family of dehydrogenases suggest a role for NAD+ in bacterial transport. Mol. Microbiol. 9, 533–543. Shen, X.L., Dong, H.J., Hou, X.P., Guan, W.J., Li, Y.Q., 2008. FtsY affects sporulation and antibiotic production by whiH in Streptomyces coelicolor. Curr. Microbiol. 56, 61–65.
107
Shimotohno, K.W., Endo, T., Furihata, K., 1993. Antibiotic AC6H, a new component of tetrocarcin group antibiotics. J Antibiot. 46, 682–684. Sola-Landa, A., Moura, R.S., Martin, J.F., 2003. The two-component PhoR-PhoP system controls both primary metabolism and secondary metabolite biosynthesis in Streptomyces lividans. Proc. Natl. Acad. Sci. U. S. A. 100, 6133–6138. Supong, K., Suriyachadkun, C., Pittayakhajonwut, P., Suwanborirux, K., Thawai, C., 2013. Micromonospora spongicola sp. nov., an actinomycete isolated from a marine sponge in the Gulf of Thailand. J Antibiot (Tokyo) 66, 505–509. Talukdar, M., Duarah, A., Talukdar, S., Gohain, M.B., Debnath, R., Yadav, A., Jha, D.K., Bora, T.C., 2012. Bioprospecting Micromonospora from Kaziranga National Park of India and their anti-infective potential. World J. Microbiol. Biotechnol. 28, 2703–2712. Talukdar, M., Das, D., Borah, C., Deka Boruah, H.P., Bora, T.C., Singh, A.K., 2016. Draft genome sequence of Micromonospora sp. HK10 strain, isolated from Kaziranga national park, India. Genome Announcement. 4, e00559-15. Thi, Q.V., Tran, V.H., Maia, H.D., Le, C.V., Hong Mle, T., Murphy, B.T., Chau, V.M., Pham, V.C., 2016. Antimicrobial metabolites from a marine-derived actinomycete in Vietnam's East Sea. Nat. Prod. Commun. 11, 49–51. Thompson, J.D., Higgins, D.G., Gibson, T.J., 1994. CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22, 4673–4680. Tomita, F., Tamaoki, T.J., 1980. Tetrocarcins, novel antitumor antibiotics. I. Producing organism, fermentation and antimicrobial activity. Antibiot. 33, 940–945. Trujillo, M., Kroppenstedt, R., Schumann, P., Carro, L., Martínez-Molina, E., 2006. Micromonospora coriariae sp. nov., isolated from root nodules of Coriaria myrtifolia. Int. J. Syst. Evol. Microbiol. 56, 2381–2385. Trujillo, M.E., Kroppenstedt, R.M., Fernandez-Molinero, C., Schumann, P., MartinezMolina, E., 2007. Micromonospora lupini sp. nov. and Micromonospora saelicesensis sp. nov., isolated from root nodules of Lupinus angustifolius. Int. J. Syst. Evol. Microbiol. 57, 2799–2804. Trujillo, M.E., Alonso-Vega, P., Rodríguez, R., Carro, L., Cerda, E., Alonso, P., MartínezMolina, E., 2010. The genus Micromonospora is widespread in legume root nodules: the example of Lupinus angustifolius. ISME J. 4, 1265–1281. Trujillo, M.E., Bacigalupe, R., Pujic, P., Igarashi, Y., Benito, P., Riesco, R., Médigue, C., Normand, P., 2014. Genome features of the endophytic actinobacterium Micromonospora lupini strain lupac 08: on the process of adaptation to an endophytic life style? PLoS One 9, e108522. Valdés, M., Pérez, N.O., Estrada-de Los Santos, P., Caballero-Mellado, J., Peña-Cabriales, J.J., Normand, P., 2005. Hirsch AM. Non-Frankia actinomycetes isolated from surface sterilized roots of Casuarina equisetifolia fix nitrogen. Appl. Environ. Microbiol. 71, 460–466. Weber, T., Blin, K., Duddela, S., Krug, D., Kim, H.U., Bruccoleri, R., Lee, S.Y., Fischbach, M.A., Müller, R., Wohlleben, W., Breitling, R., Takano, E., Medema, M.H., 2015. antiSMASH 3.0- a comprehensive resource for the genome mining of biosynthetic gene clusters. Nucleic Acids Res. 43, W237–W243 Web Server issue. Wietz, M., Duncan, K., Patin, N.V., Jensen, P.R., 2013. Antagonistic interactions mediated by marine bacteria: the role of small molecules. J. Chem. Ecol. 39, 879–891. Yang, J., Xu, X.X., Liu, G., 2012. Amplification of an MFS transporter encoding gene penT significantly stimulates penicillin production and enhances the sensitivity of Penicillium chrysogenum to phenylacetic acid. J. Genet. Genomics. 39, 593–602. Zhao, H., Kassama, Y., Young, M., Dell, D.B., Goodacre, R., 2004. Differentiation of Micromonospora isolates from a coastal sediment in Wales on the basis of Fourier transform infrared spectroscopy, 16S rRNA sequence analysis, and the amplified fragment length polymorphism technique. Appl. Environ. Microbiol. 70, 6619–6627.