Reconstruction of the Epidemic History of the Beijing Genotype of Mycobacterium tuberculosis in Russia and Former Soviet. Countries Using Spoligotyping.
ISSN 08914168, Molecular Genetics, Microbiology and Virology, 2011, Vol. 26, No. 3, pp. 120–125. © Allerton Press, Inc., 2011. Original Russian Text © V.V. Sinkov, E.D. Savilov, O.B. Ogarkov, 2011, published in Molekulyarnaya Genetika, Mikrobiologiya i Virusologiya, 2011, No. 3, pp. 25–29.
EXPERIMENTAL WORKS
Reconstruction of the Epidemic History of the Beijing Genotype of Mycobacterium tuberculosis in Russia and Former Soviet Countries Using Spoligotyping V. V. Sinkova, b, E. D. Savilovb, c, and O. B. Ogarkovb, d a
bInstitute
Irkutsk Regional Diagnostic Center, Irkutsk, Russia of Epidemiology and Microbiology, Siberian Branch, Russian Academy of Medical Sciences, Irkutsk, Russia c Irkutsk Institute for Medical Advanced Studies, Irkutsk, Russia d Irkutsk TBPrevention Dispensary, Irkutsk, Russia Received January 14, 2011
All known genotypes (3925 spoligoprofiles of M. tuberculosis) were selected using the SITVIT database in nine countries, including countries of the former Soviet Union: Russia, Latvia, Estonia, Poland, Finland, Italy, Portugal, Japan, and Vietnam. The programs SpolTools and DESTUS were used to construct consensus networks to identify the epidemic genotypes of M. tuberculosis in the studied countries. In Russia, Latvia, and Estonia, the Beijing strain was determined as the main epidemic genotype. In Finland, Poland, Portugal, and Italy, all epidemic genotypes belonged to other families. The hypothesis on the explosive nature of the spread ing of the Beijing genotype of M. tuberculosis was put forward for Russia and other former Soviet countries in the 20th century. The basic idea of the hypothesis was that the pattern of dissemination of the Beijing geno type occurred at the CER (Chinese Eastern Railway), in the Gulag, and in the civilian society of Soviet Union. The Beijing genotype of M. tuberculosis affected Russian builders of the CER during the end the 19th and early 20th centuries. With the repression of CER builders in the Soviet Union, the Beijing genotype was spread among Gulag prisoners; after 1953, it had spread throughout the civilian society of the entire country. The distribution of epidemic genotypes of M. tuberculosis in the studied countries was interpreted as evidence of the suggested hypothesis. Keywords: spoligotyping, M. tuberculosis DOI: 10.3103/S0891416811030050
INTRODUCTION Today, tuberculosis remains one of the most chal lenging problems of practical healthcare worldwide. According to WHO data, more than two billion peo ple, i.e., onethird of the Earth’s population, are infected with Mycobacterium tuberculosis, and one in ten infected people experience the tuberculosis disease during their lifetime [29]. In 2009, 9.4 million cases of tuberculosis were registered in the world [29], with an average frequency of 139 disease cases per 100000 peo ple [9, 29]. In 2008, 1.8 million people died of tuber culosis, including 500000 HIVinfected ones, which is an average of 4500 deaths per day [29]. Moreover, the incidence rate has been shown to be increasing not only in developing countries, but also in Western Europe and United States. Russia, where a marked rise in all epide miological markers of tuberculosis has been observed in recent decades, is not an exclusion [12]. Indeed, in 2007, the incidence rate of tuberculosis among the adult population of Russia was 82.6 per 100000 people, with a mortality rate of 18.1 per 100000 people, which accounts for 85% of lethal cases of all infectious and parasitic diseases [7]. In comparison with 1990s (the
beginning of the increase in the incidence rate), the lethality of tuberculosis has increased by 2.6 times [7]. Tuberculosis is characterized by an irregular geo graphical distribution. Minimal incidence rates have been observed in North and South America (3%) and Europe (5%), while maximal incidence rates have been observed in southeast Asia (55%) and Africa (31%) [5]. Presently, the distribution of tuberculosis is characterized as pandemic [14], the most important etiological agent of which is genetically a similar group of strains called the “Beijing” genotype [28]. It should be noted that in Russia this subfamily prevails over all registered genotypes [2, 3, 8, 11, 19, 26], making Rus sia considerably different from other European states [17]. However, it is important to note that newly formed European states founded on the basis of former republics of the USSR (Latvia, Lithuania, and Estonia) demonstrate a situation similar to that in Russia [16, 20, 27]. The genetic diversity of M. tuberculosis has been formed as a result of population migrations. Evidently, the Beijing genotype of M. tuberculosis occurred in northern China more than 2000 years ago and pene
120
RECONSTRUCTION OF THE EPIDEMIC HISTORY OF THE BEIJING GENOTYPE
trated to other regions via population migrations from southeast Asia [6]. I. Mokrousov et al. suggest that the predominance of the Beijing genotype in Russia origi nates from the medieval epoch and is connected with the Mongol–Tatar invasion [22]. This was the time when contact between Slavs and Europeans with the Beijing genotype would have first taken place. How ever, this hypothesis fails to explain the phenomenon of the high similarity between M. tuberculosis geno types in Russia and the Baltic states, which are former Soviet Union republics [16, 20, 27]. It is also impossi ble to explain the differences in this genotype distribu tion in the former Soviet states in comparison with other European states, including former Soviet states, within the framework of the “medieval” model [6, 8, 16, 19, 20, 26, 27]. Therefore, it seems that the sce nario of the Beijing genotype’s distribution must be different. It may be hypothesized that the distribution of the Beijing genotype of M. tuberculosis reached its modern level in Russia and the Baltic states (Latvia, Lithuania and Estonia) in a relatively short period of the USSR, whereas in neighboring states the distribu tion of the Beijing genotype remained the same, pos sibly at medieval levels. In order to verify this hypoth esis, a model of evolutionary relationships among the strains of M. tuberculosis in the studied states was com posed, and epidemiologically important genotypes were identified. Use of this model allowed us to obtain statistically significant data about the differences between the most important genotypes of tuberculosis etiological factors circulating in Russia, the Baltic states, and other European states. The present study was aimed at the formulation of the Beijing genotype’s distribution scenario in the USSR region in the 20th century on the basis of molecular modeling results for the processes of occurrence of epidemiologically important M. tuberculosis genotypes in the studied states. MATERIALS AND METHODS Known genotypes of M. tuberculosis obtained by spoligotyping were used for the study. Spolygotyping profiles were obtained from the online database SITVIT [21]. In total, 3925 genotypes (906 unique variants), which were isolated from the patients from Russia, Latvia, Estonia, Poland, Finland, Italy, Portu gal, Japan, and Vietnam, were studied. The informa tion about each spoligotype was presented as a numer ical sequence of 43 numerals, where a deletion was marked as “0” and the absence of deletion was marked as “1.” All spoligotypes were saved as rsf files (rich spo ligotype format) for further analysis with the SpolTool program [25]. A consensus spoligoforest network was composed on the basis of SpolTool spoligotypes [24]. The size of each cluster in the consensus network cor responded to the number of isolates of identical spoli gotype. Signs between the clusters of the pictures show evolutionary interconnections between spoligotypes
121
Table. States, number of strains and spoligotypes used in the study State
Number of strains
Russia Finland Poland Latvia Estonia Portugal Italy Japan Vietnam Total
1077 374 306 140 119 358 539 239 783 3935
Number of spoligotypes 171 135 87 28 31 97 196 34 127 906
and the calculated direction of mutation event ances tors to descendants. To ensure the correctness of the choice of an epidemiologically important genotype, data verification algorithms aimed at controling for the absence of falsepositive interconnections with multiple data comparison were used (Bonferroni rule) with the DESTUS (Detecting Emerging Strains of Tuberculosis Using Spoligotypes) program [25]. The stability of spoligotrees was assessed using the Ben jamini–Hochberg, Storey, and Dunn–Sidak algo rithms, which calculate false discovery rate (FDR). Genotypes which demonstrated values of p < 0.01 in the sequential analysis with all three algorithms were considered epidemiologically significant. The rela tionships of epidemiologically significant genotypes to the known M. tuberculosis subfamilies were estimated using the spoligotype database SpolDB4 in accor dance with their ST numbers [15]. RESULTS AND DISCUSSION In order to perform a comparative assessment, countries which were both historically and geographi cally related to Russia (Finland, Poland, Latvia, and Estonia) and were wellrepresented in the SpolDB4 database [15] were chosen. The fact that Lithuania was omitted is due to its poor representation in the SpolDB4 database. The other group included “con trast” countries which were poorly connected with Russia both historically and geographically. European states (Portugal and Spain) where the Beijing genotype is insignificantly distributed, as well as Asian states (Japan and Vietnam) where it dominates, were also chosen [13, 18]. The list of countries studied, number of M. tuberculosis strains, and the number of spoligo types used in the study are shown in the table. States where a single genotype obviously dominates were not chosen for the study. The main parameters for a spoli gotree analysis were cluster sizes and the number of ancestors [23]. For example, large clusters with numerous descendants were thought to evidence the “ancientness” of a genotype. Such a spoligotype could
MOLECULAR GENETICS, MICROBIOLOGY AND VIROLOGY
Vol. 26
No. 3
2011
122
SINKOV et al. Rank Spoligotype
Subfamily
p
–log(p)
Spoligoforest
Russia 1 2 3 4 5 6 7 8
ST1 ST53 ST252 ST42 ST254 ST1134 ST262 ST35
Beijing T1 LAM9 LAM9 T5_RUS1 H4 H4 H4
9.9132e60 2.7714e38 1.3814e35 1.3814e35 8.7937e16 4.5748e15 3.3783e13 1.2132e06
135.86 84.48 80.27 39.96 34.67 33.02 28.72 13.62
ST252 (8) ST3134 (18)
ST35 (18)
ST254 (50)
ST1 (467)
ST253 (67)
ST42 (25) ST252 (52)
Estonia 1
ST1
Beijing
1.1651e03
ST1 (50)
7.75 Latvia
1
ST1
Beijing
2.1067e04
ST1 (75)
8.47
Japan
1
ST1
Beijing
2.3208e05
ST1 (169)
10.68
Vietnam 1
2
ST139
ST1
EA14_VN M
Beijing
1.6888e72
1.6637e25
165.26
ST139 (181)
57.06
ST1 (387)
Note: Here and in Fig. 2, the genotype rank shows the calculated epidemic significance of a genotype in comparison with that of other genotypes that circulate in a country; number in the SpolDB4 database [15]; subfamily name [15]; p value cal culated by DESTUS program [25]; logarithms calculated with the same program, which provide the basis for assessment of epidemic significance (rank) of a genotype; spoligoforest of epidemiologically significant genotypes for each state. Fig. 1 States where the Beijing genotype (ST1) was shown to be epidemiologically significant.
have been distributed in a population for a long period and should have had sufficient time to accumulate a good deal of mutations. However, from an epidemio logical point of view, genotypes which can be distrib uted in a population faster than the natural process of mutation accumulation are of much more interest. These genotypes are characterized by larger cluster sizes and a minimal number of descendants and must be considered epidemiologically significant [23]. As a result of modeling, a consensus spoligoforest network which, after performing the Bonferroni test, allowed
statistically significant isolation of the epidemic geno type [25] was obtained for each country [24]. Data on the full structure of the spoligotype consensus network for each country are not shown in the paper due to their bulkiness. The described modeling revealed epidemiologi cally significant genotypes with the confidence level p < 0.01. The revealed genotypes are shown in Figs. 1 and 2. Connections between the clusters in Figs. 1 and 2 show the presence of a final mutation step number between epidemic genotypes, which allows the Spol
MOLECULAR GENETICS, MICROBIOLOGY AND VIROLOGY
Vol. 26
No. 3
2011
RECONSTRUCTION OF THE EPIDEMIC HISTORY OF THE BEIJING GENOTYPE Rank Spoligotype
Subfamily
p
–log(p)
123
Spoligoforest
Finland 1
ST49
H3
3.3853e08
17.2
2
ST47
H1
4.6731e08
16.88
3
ST53
T1
1.2359e35
15.91
4
ST50
H3
1.9054e06
13.17
5
ST262
H4
1.2924e05
10.68
6
ST1134
LAM9
4.2925e05
10.06
ST262 (8)
ST42 (25)
ST50 (28) ST79 (23)
ST53 (49)
ST47 (29)
Poland 1
ST1557
H1
2.638e06
12.85
2
ST891
LAM9
7.3739e05
9.51
ST1557 ST891 (16) (12)
Portugal 1
ST20
LAM1
2.8228e06
12.78
2
ST42
LAM9
1.0952e06
9.12
ST20 (49)
ST42 (44)
Italy 1
ST33
LAM3
2.3044e06
12.98
2
ST53
T1
2.4319e06
12.93
3
ST1346
CAS1DELHI 1.7130e05
10.97
4
ST50
H3
5.8694e05
9.74
5
ST62
H1
1.0525e04
9.16
6
ST73
T2T3
1.1970e04
9.03
ST73 (11)
ST1346 (14) ST33 (16)
ST53 (54)
ST62 (14)
ST50 (30)
Fig. 2. States from the comparison group, where the Beijing genotype was not shown to be epidemiologically significant.
Tool program to calculate the virtual ancestor of epi demic genotypes; in contrast, the absence of these connections indicates significant genetic differences between these genotypes. Figure 1 shows countries (except Vietnam) where the Beijing genotype (ST1) is epidemiologically significant; i.e., it demonstrates the first rank of epidemic significance. Figure 2 shows the countries of the comparison group. The Beijing geno type was shown to be the only epidemiologically sig nificant genotype for the Baltic States (Estonia had 119 strains and Latvia had 140 strains) (Fig. 1). At the same time, the neighboring countries of Finland and Poland demonstrated a totally different spectrum of epidemic genotypes, in which strains of LAM, T, and Haarlem prevailed (Fig. 2). The Russian epidemic
genotype profile (Fig. 1) demonstrates both European (LAM and T) and Asian (Beijing) traits. The Haarlem 4 subfamily, which is more correctly called Ural, was also found among Russian epidemic genotypes [8, 19]. Apparently, this group of closely related genotypes [8] (Fig. 1 shows them all connected with each other) has different origin than other epidemic genotypes shown in Figs. 1 and 2. Therefore, the results of modeling of the formation processes of genotypes of M. tuberculosis strains from three states of the former USSR (Russia, Estonia, and Latvia), two neighboring states (Finland and Poland), and two remote European states (Portugal and Italy) may suggest that the Beijing genotypes identified in Russia, Estonia, and Latvia originate from a single
MOLECULAR GENETICS, MICROBIOLOGY AND VIROLOGY
Vol. 26
No. 3
2011
124
SINKOV et al.
source. Moreover, the absence of other epidemic gen otypes in the studied Baltic states allows us to hypoth esize relatively recent epidemics induced by the ST1 spoligotype. It goes without saying that it is not possi ble to exclude the probability that the Beijing genotype was presented earlier in the Russian Empire, as for example in the comparison states [16, 17, 20, 27]. However, the Beijing genotype (ST1) distribution pat tern in Russia and the Baltic states suggests a single moment and an explosive distribution of this subfam ily’s strains. The scenario for such an epidemiological process could be as follows: the Beijing genotype is endemic to China and neighboring countries [14], and its history dates back several centuries [6]. However, contacts between the Russian Empire and China were very lim ited during this period, and only in the 20th century did a large Russianspeaking community form in Chi nese territory due to the building of the China Eastern Railway and the development of neighboring lands [1]. Later, in the period of cooling of relations between the USSR and China, workers of the China Eastern Rail way were forced to return to the USSR, which coin cided with the period of mass repressions [1]. Tens of thousands of repatriates were jailed in correctional labor colonies, which led to the wide distribution of Beijing subfamily tuberculosis strains—first among Gulag prisoners, and later, after the amnesty in 1953, in the civilian society of the Soviet Union. Within the framework of the unified state (USSR) and, more importantly, the unified penitentiary system, the ST1 soligotype was distributed throughout the entire terri tory of the Soviet Union, including the Baltic states. At the same time, in the countries of the Eastern Bloc, the profile of epidemic strains of tuberculosis did not change significantly or changed in accordance with the principles observed in other European states in the 20th century. These observations were even more sur prising in light of the fact that both Finland and Poland were either totally or partially integrated into the Russian Empire: Finland from 1809 to 1917 [10] and the Polish Kingdom (with a population of 2.7 mil lion) from 1815 to 1915 [4]. Similarly, both Baltic states studied were integrated into the Russian Empire: Estonia from 1721 to 1918 and Latvia from 1796 to 1920 as a part of the Province of Courland [4]. Therefore, the middle of the 20th century is consid ered to be the most likely date for the massive distribu tion of the ST1 spoligotype in the Baltic states. Evi dently, the possibility of distribution of tuberculosis strains across the borders of states under the regular conditions of civilian society is much smaller than is traditionally supposed. It is suggested that even the rel atively short historical period of the unified peniten tiary system drastically affected the process of distri bution of tuberculosis strains of the Beijing subfamily. It should be noted as a conclusion that the methods of molecular epidemiology are doubtlessly important for tuberculosis studies. The identification of epidemi
ologically important genotypes and the analysis of their structure in different countries and continents allow us to retrace the historical paths of epidemic dis tribution and identify the epicenters of highly active genotypes. This, in turn, provides the basis for antiep idemic and therapeutic approaches aimed against tuberculosis infection. REFERENCES 1. Ablazhei, N.N., Emigration from Russia (SSSR) to China and Reimmigration in the First Half of the XX Century, Extended Abstract of Doctoral (Histor.) Disser tation, Moscow, 2008. 2. Balabanova, Ya.M., Nikolaevskii, V.V., and Raddi, M., Probl. Tub., 2006, no. 2, pp. 31–37. 3. Baranov, A.A., Mar’yandyshev, A.O., Markelov, Yu.M., et al., Ekol. Chel., 2007, vol. 7, pp. 34–38. 4. Bokhanov, A. and Gorinov, M., Istoriya Rossii s nachala XVIII do kontsa XIX veka (History of Russia from the Beginning of XVIII to the End of XIX Century), Sakharov, A.N., Ed., Moscow, 2001. 5. Global’naya bor’ba s tuberkulezom (doklad) (Global Tuberculosis Control (Report)), WHO, 2009. 6. Mokrousov, I.V., Genetic Diversity and Evolution of Mycobacterium tuberculosis, Extended Abstract of Doc toral (Biol.) Dissertation, St. Petersburg, 2009. 7. On the Number of Tuberculosis Cases. Letter no. 01/15990832 dated 31.12.2008. Federal Service on Surveillance of Consumer’s Right Protection and Human Wellbeing, 2008. URL:http://www.rospotrebnadzor.ru/ documents/letters/2274/. 8. Ogarkov, O.B., Medvedeva, T.V., and Zozio, G., Mol. Med., 2007, no. 2, pp. 33–38. 9. Onishchenko, G., Vestn. Ross. Akad. Med. Nauk, 2008, no. 3, pp. 19–22. 10. Osmo, Yu., Velikoe knyazhestvo Finlyandskoe 1809– 1917 (The Grand Duchy of Finland), Rumyantsev, A., Ed., Helsinki, 2009. 11. Savilov, E.D., Sin’kov, V.V., and Ogarkov, O.B., Epid. Infekts. Bol., 2010, no. 4, pp. 50–53. 12. Ftiziatriya. Natsional’noe rukovodstvo (Phthisiology: National Guidance), Perel’man, M.I., Ed., Moscow, 2007. 13. Anh, D.D., Borgdor, M.W., Van, I.N., et al., Emerg. Infect. D, 2000, vol. 6, no. 3, pp. 302–305. 14. Bifani, P.J., Mathema, В., Kurepina, N.E., and Kre iswirth, B.N., Trends Microbiol., 2002, vol. 10, no. 1, pp. 45–52. 15. Brudey, K., Driscoll, J.R., Rigouts, L., et al., BMC Microbiol., 2006, vol. 6, p. 23. 16. Devaux, I., Kremer, K., Heersma, H., and Soolingen, D.V., Emerg. Infect. Dis., 2009, vol. 15, no. 7, pp. 1052–1060. 17. Harvard Medical School O. S. I. The Global Impact of DrugResistant Tuberculosis, Heymann, D.L., Ed., Boston, 1999. 18. Iwamoto, T., Kekkaku, 2009, vol. 84, no. 12, pp. 755– 759.
MOLECULAR GENETICS, MICROBIOLOGY AND VIROLOGY
Vol. 26
No. 3
2011
RECONSTRUCTION OF THE EPIDEMIC HISTORY OF THE BEIJING GENOTYPE 19. Kovalev, S.Y., Kamaev, E.Y., Kravchenko, M.A., et al., Int. J. Tuberc. Lung Dis., 2005, vol. 9, no. 7, pp. 746–752. 20. Krüüner, A., Hoffner, S.E., Sillastu, H., et al., J. Clin. Microbiol., 2001, vol. 39, no. 9, pp. 3339–3345. 21. Liens, В., Sola, C., Brudey, K., and Rastogi, N., 6th Annu. Congr. Eur. Soc. Mycobacteriol, Turkey, Istambul, June 26–29, 2005. 22. Mokrousov, I., Ly, H.M., Otten, T., et al., Genome Res., 2005, vol. 15, pp. 1357–1364. 23. Reyes, J.F., Francis, A.R., and Tanaka, M.M., BMC Bioinformatics, 2008, vol. 9, p. 496. 24. Tanaka, M.M. and Francis, A.R., Proc. Natl. Acad. Sci. USA, 2006, no. 41, pp. 15266–15271.
125
25. Tang, C., Reyes, J.F., Luciani, F., et al., Bioinformatics, 2008, vol. 24, no. 20, pp. 2414–2415. 26. Toungoussova, O.S., Sandven, P., Mariyandyshev, A.O., et al., J. Clin. Microbiol., 2002, vol. 40, no. 6, pp. 1930– 1937. 27. Tracevska, Т., Jansone, I., Baumanis, V., et al., Int. J. Tuberc. Lung Dis., 2003, vol. 7, no. 11, pp. 1097– 1103. 28. Van Soolingen, D., Qian, L., de Haas, P.E., et al., J. Clin. Microbiol., 1995, vol. 33, pp. 3234–3238. 29. WHO Stop ТВ Partnership Update Tuberculosis Facts, 2009. URL:http://www.who.int/tb/publications/2009/ tbfactsheet_2009update_one_page.pdf.
MOLECULAR GENETICS, MICROBIOLOGY AND VIROLOGY
Vol. 26
No. 3
2011