DOI 10.1007/s10517-017-3910-z
814
Bulletin of Experimental Biology and Medicine, Vol. 163, No. 6, October, 2017
METHODS Method of Selection of Bacteria Antibiotic Resistance Genes Based on Clustering of Similar Nucleotide Sequences I. S. Balashov, V. A. Naumov, P. I. Borovikov, A. B. Gordeev, D. V. Dubodelov, L. A. Lyubasovskaya, Yu. V. Rodchenko, A. A. Bystritskii, N. V. Aleksandrova, D. Yu. Trofimov, and T. V. Priputnevich
Translated from Byulleten’ Eksperimental’noi Biologii i Meditsiny, Vol. 163, No. 6, pp. 784-787, June, 2017 Original article submitted December 9, 2016 A new method for selection of bacterium antibiotic resistance genes is proposed and tested for solving the problems related to selection of primers for PCR assay. The method implies clustering of similar nucleotide sequences and selection of group primers for all genes of each cluster. Clustering of resistance genes for six groups of antibiotics (aminoglycosides, β-lactams, fluoroquinolones, glycopeptides, macrolides and lincosamides, and fusidic acid) was performed. The method was tested for 81 strains of bacteria of different genera isolated from patients (K. pneumoniae, Staphylococcus spp., S. agalactiae, E. faecalis, E. coli, and G. vaginalis). The results obtained by us are comparable to those in the selection of individual genes; this allows reducing the number of primers necessary for maximum coverage of the known antibiotic resistance genes during PCR analysis. Key Words: antibiotic resistance; clusterization; next-generation sequencing technology; polymerase chain reaction (PCR) Resistance of microorganisms to antimicrobial drugs is a topical problem of modern clinical microbiology. The use of next-generation sequencing (NGS) technology provides maximum information about the genome of bacteria isolated from clinical material. One of the approaches to analyze of this information [6] is de novo assembly of reads with subsequent identification of resistance genes in databases [9]. This approach is not always convenient, because whole-genome sequencing of each isolate is required, which is not always possible for technical and economic reasons. V. I. Kulakov Research Center for Obstetrics, Gynecology, and Perinatology, Ministry of Health of the Russian Federation, Moscow, Russia. Address for correspondence:
[email protected]. A. B. Gordeev
In routine practice, PCR test systems that provide rapid and available diagnostic information are the best option. The primers for PCR should have high specificity and all target genes (or they should be universal for a set of similar nucleotide sequences) should be present to include the maximum number of antibiotic resistance (AR) genes. The search for optimal set of primers for the detection of AR genes in bacteria continues [2], since current commercial PCR kits in real-time for AR assay are limited in the spectrum of the tested genes or are inaccessible [3]. The goal of this work was the development and testing of a new method of selection of genes predicting AR of bacteria for further selection of primers and development of multiplex PCR test systems for simultaneous identification of a large number of ARrelated genes.
0007-4888/17/16360814 © 2017 Springer Science+Business Media New York
815
I. S. Balashov, V. A. Naumov, et al.
MATERIALS AND METHODS The study was conducted on 81 strains of resistant bacteria of different genus (K. pneumoniae, Staphylococcus spp., S. agalactiae, E. faecalis, E. coli, and G. vaginalis) that were selected by phenotypic characteristics among strains isolated from patients of the V. I. Kulakov Research Center for Obstetrics, Gynecology, and Perinatology. The proposed method consists in clustering of similar nucleotide sequences and selection of group primers for all the genes of each cluster. Pure bacterial cultures were obtained by inoculation of clinical material into nonselective and selective culture media: 5% blood agar, UriSelect medium (BioRad), Endo agar (State Research Center for Applied Microbiology and Biotechnology), mannitol-salt agar (HiMedia Laboratories) and enterococcal agar (BD). Species identification of the isolated cultures was conducted on a Vitek 2-Compact automatic bacteriological analyzer (BioMerieux) and by using MALDI-TOF technique on a Autoflex III mass spectrometer with MALDI-Biotyper 3.0 software (Bruker Daltonics). The antibiotic sensitivity tests for isolated cultures were performed by phenotypical methods: by discdiffusion method and by determining minimal inhibitory concentration of the drug on a Vitek 2 Compact 30 automatic bacteriological analyzer. The results were evaluated in accordance with the EUCAST Breakpoint table 5.0 recommendations. We performed NGS analysis for all strains of bacteria. Genomic DNA was isolated from fresh cultures containing at least 10 million cells by lysis with lysozyme and proteinase K followed by DNA extraction with phenol-chloroform mixture. DNA libraries were prepared with Ion Xpress Plus Fragment Library Kit and Ion Xpress Barcode adapters 1-96 kits (Thermo Fisher Scientific). The quality of the libraries was controlled on a Bioanalyzer 2100 with High Sensitivity DNA Kit (Agilent Technologies). Ion OneTouch Template Kit (Thermo Fisher Scientific) was used for emulsion PCR and sphere enrichment. Sequencing was performed on an Ion PGM Torrent platform with Ion Sequencing Kit and 316v2 chips (Thermo Fisher Scientific). All the stages starting from preparation of libraries were conducted in accordance with manufacturer’s protocols. The search among known AR associated genes was performed in the ResFinder database [8] and 699 of 821 unique AR-related genes were selected that were responsible for resistance to antibiotics of 6 classes (aminoglycosides, β-lactams, fluoroquinolones, glycopeptides, macrolides and lincosamides, and fusidic acid) and grouped in accordance with the targeted antibiotic. For each group of genes, hierar-
chical clustering of similar nucleotide sequences was performed on the basis of similarity matrix of nucleotide sequences. Contig assembly was carried out on the basis of the data obtained from whole-genome sequencing as well as the assessment of coverage together with alignment for the nucleotide sequences of the genes on the ResFinder. The total length of the contigs (more than 500 nucleotide pairs) adjusted to the reference length for the given type of microorganisms was taken further as a correction value n. For E. coli strains, no genes with nucleotide coverage proportion >0.5 were detected. The values >1 are due to normalization (the average value of the covered nucleotides to the genes more than the average in the genome). For each gene in the samples, nucleotide coverage was evaluated; the value