The structural protein-coding genomic regions of dengue virus type 1 (DEN-l) ... al., 1987), Japanese encephalitis (JE) (McAda et al., 1987), Murray Valley ...
J. gen. Virol. (1989), 70, 1701-1712. Printedin Great Britain
1701
Key words: dengue 1 virus/nucleotide sequence/geneticvariation
Genetic Relatedness among Structural Protein Genes of Dengue 1 Virus Strains By MAY C. C H U , 1 . E D W A R D J. O ' R O U R K E 2 AND D E N N I S W. T R E N T 1 1Division of Vector-Borne Viral Diseases, Center for Infectious Diseases, Centers for Disease Control, Public Health Service, U.S. Department of Health and Human Services, P.O. Box 2087, Fort Collins, Colorado 80522 and 2Division o f Infectious Diseases, The Childrens' Hospital, Harvard Medical School, Boston, Massachusetts 02115, U.S.A. (Accepted 17 March 1989) SUMMARY
The structural protein-coding genomic regions of dengue virus type 1 (DEN-l) strains representing three distinct topotypes (Thailand, Philippines and Caribbean) were cloned and sequenced. In addition the envelope (E) nucleotide sequences of two recent Caribbean topotype DEN-1 isolates were obtained by direct RNA sequencing. The nucleotide sequence of the DEN-1 viruses in the structural gene region was found to be highly conserved with greater than 95~o nucleotide sequence homology and with less than 4~o change in the amino acid sequence. Although there was a less than 2 ~ change in the nucleotide sequence of DEN-1 E proteins, strains could be differentiated by the clusters of nucleotide changes, Furthermore, the deduced amino acid changes in the E protein were clustered primarily within the proposed immunologically reactive regions. Genomic nucleotide sequence comparisons did not define geographical or virulence markers but located unique clusters of nucleotide/amino acid changes for each of the three topotypes of DEN-1 viruses examined. INTRODUCTION The genomic organization and RNA sequence of many flaviviruses have now been determined. The publication of the genomic sequence of yellow fever (YF) 17D vaccine virus (Rice et al., 1985) was followed by that of either partial or entire sequences of Asibi YF (Hahn et al., 1987), Japanese encephalitis (JE) (McAda et al., 1987), Murray Valley encephalitis (MVE) (Dalgarno et al., 1986), West Nile virus (WN) (Wengler et al., 1987), Kunjin (KUN) (Coia et al., 1988), St Louis encephalitis (SLE) (Trent et al., 1987), dengue (DEN) viruses type 1 (Nauru Island) (Mason et al., 1987), DEN-2 (Jamaica) (Deubel et al., 1988), DEN-2 (PR-159 vaccine candidate, S1) (Hahn et al., 1988) and DEN-4 (Dominica) (Zhao et al., 1986). The flavivirus genome is composed of a single open reading frame approximately 10.5 kb in length, with the structural protein genes [capsid (C), premembrane (prM), membrane (M) and envelope (E) proteins] localized at the 5' end of the genome. The structural protein genes precede the nonstructural protein genes encoded in the 3' three-quarters of the genomic RNA (Rice et al., 1985). Mosquito-borne flaviviruses are divided into three major serological subgroups (Porterfield, 1980) : yellow fever, Japanese encephalitis (JE, MVE, WN, K U N and SLE) and dengue (DEN serotypes 1 to 4). Comparison of the nucleotide sequence encoding the structural genes of viruses between each of the subgroups has shown that YF shares a 4 5 ~ and 32~o similarity with JE (McAda et al., 1987) and DEN-2 (Deubel et al., 1986), respectively. Within the JE subcomplex, members share 60~ to 80~ of their nucleotide sequence (McAda et al., 1987). The nucleotide sequences of DEN-l, DEN-2 and DEN-4 viruses share 62~o to 69~ similarity over the entire genome (Deubel et al., 1988). Strains of the same virus diverge much less; YF Asibi and the 17D vaccine virus diverge by 0-63~ in their nucleotide sequences (Hahn et al., 1987). DEN-2 Jamaica and DEN-2 PR-159(S1) share more than 95 ~ nucleotide sequence similarity (Deubel et 0000-8832 © 1989 SGM
1702
M. C. CHU, E. J. O ' R O U R K E AND D. W. TRENT
al., 1988). The relatedness of these sequences supports the classification of these viruses into their own separate family, Flaviviridae (Westaway et al., 1985) and its serological subgroups. Specific genomic nucleotide sequence changes have yet to be directly associated with alterations in the biological activity of flaviviruses. There were 68 nucleotide sequence differences between YF 17D and Asibi, resulting in 32 amino acid changes (Hahn et al., 1987). Since the 17D YF virus E protein contained 12 amino acid changes, it was suggested that the differences in virulence between these two strains were due to the changes in E protein which affect virus binding to cell receptors (Hahn et al., 1987). Similarly, sequence differences between the Jamaica and PR-159(S1) strains of DEN-2 virus result in a 3 ~ amino acid divergence in the E glycoprotein with regions of high and low sequence homology (Deubel et al., 1988; Hahn et al., 1988). These observations suggest that regions which code for protein domains that define crossreactive group epitopes and virus-specific epitopes may be highly conserved. In tropical and subtropical regions dengue viruses are mosquito-transmitted flaviviruses that cause a spectrum of disease manifestations (Halstead, 1981). Most patients with classical dengue fever (DF) have fever, myalgia and general malaise. Infection with any of the dengue serotypes provides long term immunity to the infecting type but not to infection with a heterologous dengue serotype. Theories to explain serious dengue disease, designated dengue haemorrhagic fever/dengue shock syndrome (DHF/DSS) have suggested that the disease may be caused by specific DEN virus strains or immunological mechanisms (Halstead, 1988). It is as yet unclear whether DHF/DSS is solely the manifestation of increased virulence or whether host factors permit DEN virus infection to progress to a more serious disease (Halstead, 1981, 1988). The four DEN virus serotypes can be antigenically differentiated by serological tests (Burke, 1988) or by oligonucleotide fingerprint analysis (Vezza et al., 1980). Viruses within a DEN serotype can be separated by oligonucleotide fingerprint analysis into geographical varieties or topotypes (Repik et al., 1983; Trent et al., 1983). These topotypes can also be distinguished by signature analysis using specific monoclonal antibodies that differentiate E glycoprotein antigenic epitopes (Monath et al., 1986). Comparison of DEN-1 fingerprints from DF or DHF/DSS patients has not resolved whether the disease outcome is associated with virus strain variation. We have analysed the nucleotide and amino acid sequences of structural genes of DEN-1 strains that represent distinct genetic topotypes to define the nucleotide sequence similarities of strains having unique fingerprints. This information has permitted us to understand sequence variation and evolution of DEN strains over geographical and temporal distances. METHODS Virus strains. Four DEN-1 virus strains were isolated from humans with classical DF: AHF82-80 (Thailand 1980), 836-1 (Philippines 1984, strain 162. AP2), 924-1 (Mexico 1983, strain 1378) and CV 1636/77 (Jamaica 1977). One DEN-1 virus strain was isolated from a human case of DHF grade IV, 495-1 (Aruba 1985, strain- 1566). Virus propagation, R N A preparation and fingerprinting. Viruses were propagated in C6/36 Aeries albopictus cells infected at an m.o.i, of 0.1 and the supernatant fluids were harvested 9 days after infection. Virus titrations and plaque reduction neutralization were done in BHK-21 clone 15 cells (Morens et al., 1985). Indirect fluorescent antibody staining of infected C6/36 cells was done with type-specific monoclonal antibodies (Henchal et al., 1982). RNA purification, extraction and RNase Tl-resistant oligonucleotide fingerprinting have been described (Trent et al., 1983). DEN-1 cloning and sequencing. Complementary DNA to DEN-1 RNA was synthesized using reverse transcriptase in the presence of specific oligonucleotide primers (Kinney et al., 1986). Two specific oligonucleotide primers were used to initiate first strand cDNA synthesis: MC1, complementary to the 5' end of the DEN-2 non-structural NSl protein (5' GCCACATTTCAGTTCTT 3') (Deubel et al., 1986) and primer E1 (5' A T G C A C A G T T T G C G C A G G A C 3') based on the DEN-I AHF82-80 sequence starting from nucleotide 1003 near the 5' end of E protein. Appropriate restriction fragments of cDNA were subcloned into the filamentous coliphage Mt3 mpl8/mpl9 (Yanisch-Perron et at., 1985) for sequencing by the dideoxynucleotide chain termination method (Sanger et al., 1977). Complementary DNA inserts were identified by hybridization and sequenced by generating a series of overlapping deletion clones (Dale et al., 1985). Primer-directed R N A sequencing. DEN-1 virus RNA was sequenced by the dideoxynucleotide method adapted by Dr B. J. B. Johnson (personal communication) from the methods of Biggin et al. (1983) and Sanger et al. (1977). For direct RNA sequencing of the E glycoprotein gene, six nucleotide primers were synthesized complementary to the E protein sequence of AHF82-80. The primers were identified as M2 (nucleotides 1144 to 1173), M3
Variation among dengue 1 viruses
1703
(nucleotides 1327 to 1358), M4 (nucleotides 1474 to 1503), M5 (nucleotides 1681 to 1710), M6 (nucleotides 1886 to 1914) and M7 (nucleotides 2107 to 2136). Two micrograms of virion RNA was hybridized to 100 ng of synthetic oligonucleotide primer for 2 min at 80 °C in 12.5 gl hybridization buffer (0.05 M-Tris-HC1pH 8.1 and 0.1 M-KC1). The sequencing reaction contained 5 gl of hybridized RNA-primer mixture, 2 ktl of 10 x reverse transcriptase buffer (0.05 M-Tris-HC1 pH 8-1, 0-6 M-KCI, 0.1 M-MgC12, 0.1 M-dithiothreitol), 30 units of RNasin (Promega Biotec), 20 gCi [~-35S]dCTP (New England Nuclear), 20 units reverse transcriptase (RAV-2, 20 units/gl, Amersham) and stocks of appropriate ddNTP/dNTP mixes as recommended (Biggin et al., 1983). DNA fragments were separated by electrophoresis on 6 ~ acrylamide-urea gels in Tris-borate-EDTA buffer at 1800 V, 24 mA for 3 to 4 h. The gels were transferred onto Ephortec film (Transfilm, Haakbuchler Instruments), soaked in 10~ acetic acid, dried and autoradiographed on Kodak XAR-5 film. RESULTS
R N A fingerprint analysis The genetic similarity of each of the DEN-1 R N A s to each other was examined by T1resistant oligonucleotide fingerprint analysis. Thailand, Philippines and Caribbean DEN-1 isolates each presented a distinct R N A fingerprint (Fig. 1). Strains CV1636/77, 495-t and 924-1 belong to the C a r i b b e a n topotype described by R e p i k et al. (1983); however, the two recent Caribbean isolates, 495-1 (1985) and 924-1 (1983), contained several unique oligonucteotides. DEN-1 virus AHF82-80, Thailand 1980, shares many oligonucleotides with the Thailand strains isolated during the 1980s yet is quite distinct from the other Thai isolates (M. C. Chu, unpublished observations). The fingerprint of strain AHF82-80 is not similar to those of the other four DEN-1 strains examined in this study. The R N A fingerprint of strain 836-1 is representative of one of the two distinct DEN-1 topotypes cocirculating in the Philippines in 1984 (M. C. Chu, unpublished observations). (a)
(b)
° ~ ,
b
boo 0
lpo
(c)
) O
)
c~ o O
Fig. 1. RNase Tl-resistant oligonucleotide fingerprints of DEN-1 strains: (a) Caribbean CV1636/77, (b) Thailand AHF82-80 and (c) Philippines 836-1.
1704
M. C. C H U , E. J. O ' R O U R K E A N D D. W. T R E N T *
*
,
,
,
,
,
,
,
, &C
10
T
CV1636/77 88AUAACCAAB~ACACAAGGA~CA8CUAUCACCA8AUCUAC~CC~GAAGCU86CUUAAC8UUA~ACUAACA6UUUUUUAUUABA8k~A8CAGAUCUCU8 AUGAACAACCAACGGAAA AHFB2-80
......................................................................................................................
836-1
.........................................
o ..................................................................
30 60 90 120 CV1636/77 AA•AC•••UC•AcC•UCUUUCAAUAU•CU•AAAC8C•C•A8AAACC•C•U•UCAACU••UUCACA•UU••C•AA•A•AUUCUCAAAA•8AUU•CUUUCA••CCAA••ACCCAU•AAAUU8 AHF82-80
......... AAC ......................................................
836-[
..........................................................
C ....... A .............................................
U .............................................................
150 i80 210 240 CV1636177 8U6AU~CUuU~AU~CkUUUC~A~AUUUCUAGCCAUACCCCCAAcA~CA~AA~UUU~CUA~AU~A~CUCA~UCAAGAAGAAU8~R6C~AUCAAA~U~UUAC~UUU~AAAAA AHF82-80 . . . . . . . . . . . . 836-[
..A . . . . . . . .
6. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . U. . . . . . . .
C. . . . . . . . . . . . . . . . . . . .
270
AAA. . . . . . 8. . . . . . . . . . . . . . . . . . . . . . . . . . . . C.,A . . . . . . . . . . . . . 8..................................................
8. . . .
U. . . . . . . . . . . . . . . . . . . . . . . . . . .
300
~prM
330
360
CV1636177 GAGAUCUCAA~CAU~UU8AACAUAAUGAACA66A88AAAA6AUCC8U~ACCAU8CUCCUCAU~Cg6~U~CCCACA8CUCU~8CGUUCC8UUUGA~CACACGA~8~8GA6A~CCGACAcu6 AHFB2-80
..........
A. . . . . . . . . . . . . . . . . . . . . . . . . .
836-[
..A .......
A. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
B. . . . . . . . . . . . . . . . U. . . . . . .
390
A. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ~ ........................
420
C. . . . . . . . . . . . . . . . . . . . . . . . . C. . . . . . . . . . . .
C. . . . . . .
450
UU..
C. . . . . . . . . . . . . . . . . . . . .
480
CV1636/77 AuA~UUA~UAA~CA6~AkA6A6~AAA~UCACUCUU~UUUAA8ACCUCU~CA~U6UCAAUAU8U~CACUCUCAUU8C~AU8~AUUU~A~A~UUAU8U~A8EACACAAU~CCU~CAA~ AHF82-80 . . . . . . . . . . . . . . . . . . . . . . . 8................................................................................................ 836-I ........ C. . . . . . . . . . . . . . . . . A . . . . . U. . . . . . . . . . . . C. . . . . . . . . . . . . C. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 510 540 570 600 ~M CV1636/77 u8CCCCC86AUCACU8A8A~CAACCA6AU6ACGUU6ACU6CUGGU~CAAU8CC~CA8ACACAU886U8ACCUAU~G8~C~U~UUCUCAAACC~8C8AACACC8AC6A8ACAAAC8UUCC AHF82-80 . . . . . . . . . . . 8 . . . . . . 8C.8 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 836-[ ............ 8 . . . . . BC.B . . . . . . . . . . . . . . . . U. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 630 660 690 720 CV[636177
•U•8cACUG•CCCCAC•C8U•••ACUUG•UCUA•AAACAA•AACC•AAACAU8•AU•UCCUCU•AA88C8CcU••AAACAAAUACAAAAA8U••A•ACUU•••CUUU•A•ACACCCA•8•
AHF82-80
.......... A .............................................................................................................
836-[
.......... A .......................................................................................
C ...... C ..............
750 780 810 840~E CV1636/77 UUCAC8~U~UA~CCUUUUUUUAGCAC~U8CC~U~8AACAUCCAUCACUCA~AA~UCAUUUUCAUUUU~CU~U~CU~UAACACCAUCAAU~CCAU~C~AU~C8U~8~AAUA AHF82-80 ........................................................................................................................ 836-t
.....................
C. . . . . . . . . .
U. . . . . . . . . . . . . . . . .
C. . . . . . . . . . .
U. . . . .
U. . . . . . . . . . . . . . . . . . . .
U. . . . .
C. . . . . . . . . . .
G. . . . . . . . . . . .
870 900 930 960 CV1636/77 ••CAACA•A•ACUUc•UU•AA•6ACU•UCA••A•CAAC6U•6•U86AC•U••UAUU••A•CAU••AA6CU•C•UCACCACCAU••CAAAAAAUAAACCAACAUU••ACAUU•AACUCUU• AHFB2-80
........................................................................................................................
836-I
.................
8 .................
990
U ........... U ...... C ............. U ........ U ............ 8.C ......... C .................
1020
1050
1080
CV1636177 AA8AC88A8r~CACGA8CCC~6CC8~CCU8C8CAAACg~U~CAUU~AA8CUAAAAUAUCAAACACCACCACCGAUUCAA8AUBUCCAACACAA~8A8AE8CUACAUU~UG~AAGAACAA AHF82-80
.............................................................................
8 ....................
A..C,.SC ............. 8
836-[
.............. A ..............................................................
G ....................
A..C..8C ............. 8
lit0 [140 1170 1200 CV1636/77 8AC~CGA~CUUUGU8U~UC~AC~AAC8UU8~U~GACA6A8~CU~8~CAAU~UU~U~CUAUUC8~AAAA~UA~CUUCUU8ACGU8U~CUAA8UUCAA~U8U~U~ACAAAACU~AA AHF82-BO . . , A . . . . . . . . . . . . . C. . . . . . . . . . . C. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . AA.A . . . . . . . . . . . . . . U. . . . . . . . . . . . . . . . . . . . . 836-I
,,,A ............. C ........... C .................................................. [230
1260
AA,A ..............
1290
U ..................... 1320
CV[636/77 ••AAA8AUA•UUCAAUAU•AAAACUUAAAAUAUUCA•UCAUA8UCACC•U•CACACU•8••ACCA•CAUCAA•UU8•AAAU•A8ACCACA8AACAUG•AAC•AUU•CA•CCAUAACACCU AHF82-80 . . . . . . . . . . . C. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . A. . . . . A. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 836-I ........... C. . . . . . . . . . . . . . . . . . . . . . . . . . 6.................... A,,,,~.,.C ................................................... t350 13B0 1410 1440 CV1636/77 CAA~CUCCCAC~UC8~AAAUACA~CU~ACA8ACUA~A~CUCUAACAUU~AUU~UUCACCUA8AACA~G~CUA8ACUUUAAUA8A~U~U8UU8UU8ACAAU~AA~AAAAAAUCAU~ AHF82-80 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 836-1 ....................................................................................... A. . . . . . . . . . . . . . . . . 8.A . . . . . . . . . . . . 1470 1500 1530 1~60 CV1636/77 C~C~UCCACAAACAA~8UUUCUA8AC~UACCACU8CCUL~8~AC~UC8G~88CUUCAACAUCCCAA8A~ACUU88AAUA8ACAA~ACUUGCU8~UCACAUUUAA~ACA8CUCAU~CAAA8 AHFB2-80
.......................................................................................................................
A
836-I
.......................................................................................................................
A
1590
1620
1650
1680
CV1636/77 AA6CA88AAGUA~UC6UACUA88AUCACAA6AAG~A8CAAU6CACACU8CGCU~A~U6888C6ACA~AAAUCCA8AC6UCU88AAC6ACAACAAUUUUU8C~868CACCU8AAAU6CA6A AHF82-80
..........................................................................
836-i
................................................... [710
1740
A .............................................
~ ....... A .............. A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 1770
IB00
CV[636/77 CUAAAAAU88AUAMCU8ACUUUAAAA888AU8UCAUAU8UAAU8U8CACA888UCAUU8AAA8UUA8A8AA88AA8UG8CU8A8ACCCAGCAU88AACU8UUCUA8U8CA88UUAAAUAC ~HFB2-BO
..............................
8.A
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
836-1 [830 1860 1890 [920 CV1636/77 8AA88AACA8AUGCACCAU6CAA8AUCCCCUUCLE8UCCCAAGAU8A8AAA88A8UAAcCCA8AAU888A8AUU8AUAACABCCAACCCCAUA8UCAUU8ACAAA8AAAAACCA8UCAAC AHF82-80 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 836-I
................................................................................................. 1950
CV1636/77
1980
2010
C ...................... 2040
AUU8AA8C88A8CCACCUUUU88U8A8A8CUAUAUU8U88UA88AUCA8GU6AAAAA8CUUU8AAACUAA8CUGGUUCAA8AA8G8AA8CA8UAUA888AAAAU8UUU8AAGCAACU8CC
AHFB2-80
.............................................
836-i
................................
8 ..........................................................................
C ............ 8 .............................................
C ............................
Variation among dengue I viruses 2070
2100
2130
1705 2160
CV163~177 ~6~A~CAC6~A~6~6GCCAUCC~G~A~ACACC6CAu6~6ACU~66Uu~U~A66~6~66U~U~AC8uC~6U6~6A~AACU6A~hCACCA6~U~U~U6~6A~g6C~U~h~ AHF82-BO . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
6-
B36-1
G..
................................... 2190
U. . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2220
C. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 2250 22B0
CV1636/77 UU~UUCA~6G~U~UCUU~GACCA~AA~AUAG6AAUA~66AUUCUGCu6ACAU~G~UA6~AU~AAAUUC~6~AGC~C6~C~CUUU~A~UGAC6UGUAUC~CA6~U6~R~UC~CA ~HF82-80 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 836-1 ........... U. . . . . . . . . . . . . . . . . G. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . C. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 2310 I" NSI 2340 2370 CV1~]6177 CUfiUACCUAG6~GUCkU66UUCAflGCfiG~CUCGGGAUGUGUAAUCn~CUGGAA~66CAAAfiA~CUGAAAUGU66C AHF62-80 . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
836-1
.........................................................................
Fig. 2. N u c l e o t i d e s e q u e n c e s c o d i n g f o r the s t r u c t u r a l g e n e r e g i o n s o f D E N - 1 s t r a i n s C a r i b b e a n C V 1 6 3 6 / 7 7 , T h a i l a n d A H F 8 2 - 8 0 a n d P h i l i p p i n e s 836-1.
Nucleotide sequence of DEN-1 structural protein genes The nucleotide sequences encoding the structural proteins of Caribbean CV1636/77, Thailand AHF82-80 and Philippines 836-1 are shown in Fig. 2. The 5' ends of the genomes have 101,97 and 91 nucleotides, respectively, before the first AUG codon. Nucleotides coding for the DEN-1 C proteins are contained in the first 342 base pairs of the coding region, followed by 273 and 225 nucleotides that encode prM and M proteins, respectively. The C proteins are composed of 117 amino acids that have an approximate Mr of 12800. The prM-M coding regions are located between putative signalase cleavage sites that appear to be conserved among all three DEN-1 strains and among deduced flavivirus protein sequences (Rice et al., 1985; Deubel et al., 1986; Mason et al., 1987). Cleavage of prM-M to produce mature M proteins results in the formation of an M protein with an Mr of 8000. The E glycoprotein is encoded by nucleotides 841 to 2325, i.e. the last 1485 nucleotides of the structural gene region. The N terminus of the deduced DEN-1 495 amino acid E glycoprotein begins at a signalase cleavage site which is identical to those suggested for other DEN viruses (Hahn et al., 1988). Two likely glycosylation sites at amino acids 347 and 433 are conserved among the DEN-1 virus amino acid sequences. The C-terminal segments of prM, M and E contain hydrophobic regions that are thought to be membrane anchoring regions for each of the respective structural proteins. Comparison of deduced structural proteins among D E N serotypes
Caribbean CV1636/77, Thailand AHF82-80 and Philippines 836-1 DEN-1 viruses have very similar nucleotide sequences throughout the structural gene region with less than 5 ~ (within a range of 68 to 108 nucleotides out of 2325) sequence changes between the strains. These nucleotide differences translate to 2~o overall amino acid changes between the reference Caribbean CV1636/77 strain and Thailand AHF82-80 and Philippines 836-1 DEN-1 viruses (Fig. 2 and 3). The deduced amino acid sequences throughout the structural protein regions of the three DEN-1 virus strains were compared with the published sequences of similar proteins of DEN-1 Nauru Island (Mason et al., 1987), DEN-2 Jamaica (Deubel et aL, 1988), DEN-2 PR-159(S1) (Hahn et al., 1988) and DEN-4 Dominica (Zhao et al., 1986) (Fig. 3). The relatedness of the deduced amino acid sequences for C, prM, M and E are summarized in Table 1. Portions of the nucleotides in the 5' non-coding region preceding the first AUG codon of the open reading frame appear to be conserved among all the DEN virus sequences (Fig. 3) (Deubel et al., 1988 ; Zhao et al., 1986; Mason et al., 1987; Hahn et al., 1988). A consensus sequence of six nucleotides, CUACGU, between the DEN-1 viruses and DEN-2 Jamaica is located near the 5' cap, the region that Brinton & Dispoto (1988) reported to be highly conserved in all flaviviruses. In comparison with other DEN-1 strains, the structural protein sequence of Thailand AHF82-80 has the most changes in the C protein. Five of the nine amino acid differences in the AHF82-80 C protein sequence are similar to those of the DEN-2 Jamaica and Puerto Rico 159-S1 virus C proteins (Fig. 3).
1706
M.
C.
CHU,
E. J.
O'ROURKE
AND
D.
W.
TRENT
CV16~6/77 AHF 82-BD 836-1 NAURUISL DEN-2 ~AH DEN-2 $1 OEN-4 DON
• .O. . . . VV. . . .
CVI63&/77
SVIMLLMLLP TAL~HLTTR BBEPTLIUSK QERGKSLLFK TSABVNN~TL IAMDLBELCEDTMTYKCPRI TENUP])I)UI)C WCNATI)TWVT YBTCSOTSEH
NNQRKKTGR PSFNHLKRAR NRVSTGS~A KRFSK6LLSG ~PMKLVNAF IAFLRFLAIP PTADILARNS 8FKKNGAIKV LR6FKKEISD MLNIMNRRKR ......... N ........................................ V. . . . . . . . . . . . . . . K..8 . . . . . . . . N. . . . . R. . . . N . . . . . . . . R. .......................... • o,,,,,,,,
,,,,,,,,,,
V. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ,,,,,V,,,,
,,
....
,,
. . . . .
,,,,o,,
,,
8 ...................
......
,,
,,,,,
....
U
........
,,
,,,,,,,,,N
,,,,
....
,,
. . . . . . . ARS TP. . . . . . . . . . . . . .
Q . , I . . . . L.N,g. R,.L,,F . . . . . . . . . .
T. . . . . . . .
K . , . TI..SK.,N . . . . .
R.,.BR . . . . L,.,R,
..D . . . . ARN TP. . . . . . . . . . . . . .
O.,T . . . . L . M . Q . R . , L , . F . . . . . . . . . .
T. . . . . . . .
K . . . TI,,SK..N . . . . .
R.,.BR . . . . L.,.R.
~,'r)'r'M . . . . . P.BLV . . . . T . , F . , K . . L R M . L , . . I , . , V . S . . . . . . . . . . . .
Q L . . . K . . . I . I . . . . . . 8R . . . . L.D...
AHF B2-80
.... I ................................................................
836-I
...................................
NAURU ISL DEN-2 JAM
.....,.+. ....... . .............. ,.. ....... ..,,,.,...... ............... , ..TE...., ...... E....... , , G . I I , . I . . , H . . . . . . . N. . . . . . . 8R .,K . . . . V,A .ED. . . . . . . M.I . . . . . . . . . I . . . . . LL RQ.E.E.I . . . . . S.8 . . . . . . . .
DEN-2 81
..B.II,,I..VH
DEN-4 DOR
, T I T . . . C . I . . V . . . S . S . , O . . . L , , , A , H,..RP.,,A ,EBII,K . . . . . . . . . .
CV1636/77 AHF 82-80 836-1 NAURUISL OEN-2 JAR OEN-2 81 DEN-40DM
RRBKRSVALS PHVBLBLETR TETWfSSEBA WKQIQKVETW ALRHPGFTVI ALFLAHAIGT 81TQKBIIFI LLMLVTPSMA RCV616NRD FVEDLSGATW
CV1636/77 AHF 82-80 836-1 NAURU ISL DEN-2 ~A~ DEN-2 SI DEN-4 DOff
VDVVLEH6SC VTTMAKNKPT LDIELLKTEV TNPAVLRKLC IEAKISNITT DSRCPTQGEA TLVEEQDANF VCRRTFVDRG WBNGCBLF6K GSLLTCAKFK
CV1636/77
CVTKLEBKIV OYENLKYSVI VTVHTGUQHQVBNETTEHGT IATITPQAPT SEIOLTDYGA LTLUCSPRTG LDFNRVVLLT MKKKBWLVHKQWFLULPLPW
AHFS2-SO
.......
M .,AE . . . . . . . . . . . . . . . . . . . . . . . . . .
...................................
N. . . . . . . .
R .,K . . . . . . A ,KD.T . . . . .
200
A.AE. . . . . . . . . . . . . . . . . . . . . . . . . .
M. . . . . . . . . . .
I . . . . . FL KQ.E,E.I . . . . .
M. . . .
+M
. ......
AT. . . .
S.S . . . . . . . .
TT. . . .
V. . . . . LL V N T E , E . I , , _ , , L , S , . , M . . . . T.S,,R E
......... D ........................................................................................... . , , . , , ° , , D
, , , o . , ~
o o , o , o , , , A
. . . . . .
. . . . . . . . . . . . . .
, . . . . .
, . . . . . . . . . . . . . .
, ....
,,E . . . . . . V . . . . II. . . . . . . . . . . . . . . . . ,.E . . . . . . V . . . . M. . . . . . . . . . . . . . . . . ooE. . . . . . l . . S . f . . . . . A. . . . . . . . . . .
*,,,,,,,,,
,,,,,,,
. . . . . . . . . . . . . .
,,,,,,,,,,
,,,,,,~
. . . . .
, . . . . . .
,°°o,o
, . . . . . . . . . .
. . . . . . . . . . . . . . . . . . .
,
300
. . . . . . . . . .
o . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
, . . . . . . . . . . . .
, . . . . . .
o , , . , , ,
, . . . . . .
. . . . . .
HV.RI,o, I IM ,AI.,YT,.. THF,RAL. . . . . TA.A.o.T , . , I , , 8 . . . . . . . f A , - l , . , I . . . . . . . IM .AI..YT,., IitF.RVL. . . . . TAIA,,.T , . , I , , S . . . . . . . HA.R,,S, I..R.R,ALL ,8,M.YM,,Q TBI,RTVF,V ,M.,.A.,Y8 . . . . . V. . . . . . . .
V,,BS. V,,BS. V,,BA.
. . . . . . .
,,,,
. . . . . . . .
,,,,,
. . . . . . . . . . . . . . . . . .
, . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . .
D. . . . . . . . . . . . . . . . . . . . .
..I .............
D. . . . .
, . . . . . . . . . .
F.,I,..A KQ.,L..Y .....
..I .............
D. . . . .
F . . I , . . A KQ.,T,,,Y . . . . .
..L . . . . . G. . . . . .
OG. . . . .
F,.T,.TA KEV,L..TY . . . .
,
LT . . . .
,T . . . . .
, . . . . . . . . . . . . . . . . . . .
I ......
T. . . . .
, . . . . . . . . . . . . . . . . . . .
I ......
, . . . . . . . . . . . . .
.o,,.,o,,o
,,,,,,,T
E. . . . . . . .
. . . . . . . . . . . . . . . . . . . .
, . . . .
P S , f , , , , K R . L.KHSM. . . . . . . . . . . . . . .
LT. . . . . . . . . . . . .
, ....
P ..N . . . . KR, ..KHSM. . . . . . . . . . . . . . .
,,
. . . .
,,,,
[ ......
81V,.,M,T @IV..,M.T
S. . . . I .AT . . . . . . . P Y,K . . . . OO. I..,DV . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
400
6VV. . . . . S
. . . . . . . . . . . . . . . . . . . . . . . . . .
, .....
,,.
$36-1 NAURU 18L
........................................
OEN-2 JAN
.KKNN,,.V. LP...E,TIV I.P.B.EE.A ...D.6K.,K EIK . . . . SSI T.AE.',G,.T V.M. . . . . . . . . . .
EM...Q .ED.A . . . . R . . . . . . . . . .
OEN-2 St
.KKNH. . . . . .
EN...Q ,,D.A . . . . R . . . . . . . . . .
DEN-+ ~ON
°88.1T.NL. R I . . , E . T , V . . . . N..T,A ...I),SN,.V T.N,,.RS.S V,VK.P,.,E . . . . . E.BLE .Y,.EI~I.NK . . . . T . . . . R . . . . N. . . . .
CVI63b/77 AHF 82-80 836-1 NAURUISL
TSBASTSSET WNRUDLLVTF KTAHAKKQEV VVLBSQEBAilHTALTGAIEI OTSBTTTIFA 6HLKCRLKND KLTLKGNSYV MCTGSFKLEK EVAETQRGTV
DEN-2 JAN+
LP..D.QBSN .IQKET. . . .
,NP. . . . . D. . . . . . . . . . . . . . . . . . . . . .
M.SBNLL.T . . . . . . . . . .
.NP. . . . . D. . . . . . . . . . . . . . . . . . . . . .
M,BGNLL.T . . . . . . . R.. ,.Q . . . . . . S . . . . K . . V V . . I . . . . . . . I
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
,,,,,,.,
T,,
. . . . . . . .
, . . . . . . . . . . . . . . . . . . . . . . .
T. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
.,, o ....
. . . . . . . . . . .
........
o ....
, . . . . .
.......
, . . . . .
,
. . . . . . . . . . . . . . . . . . . . . . . . .
, . . . . . . . . . , ......
,.,o.
, . . . . . . . . . . . . . . . . . . . .
..,.,
. . . . . . .
N.,.,
,E,,
EN. . . . .
P . , . E , I . V [.P.S.EE,A ...O.BK,.K EVK. . . . SSI T.AE..G..T V,NE. . . . . . . . . .
. . . . . . . . . . . . . . .
......
. . . . . . . .
E. . . . . . . . . . . . . . . . . .
V . . . . . . . . . . . . . . .
,,,,
....
, . . . . . . . . . . . . . . . . .
LP..D.QBSN ,IQKET. . . .
.A..D,.EVH ..YKERM,.. . V P . . . R . D . T . . E . . . . . . .
CVt636/77 AHF 82-80
LVgVKYEGTD APCKIPFSSU @EKBVTQHBRLITANPIVI@ KEKPVN[EAEPPFBESYIVV888EKALKLS WFKKfiSSIBK NFEATARBAR RNAILSDTAW ,
5..A . . . . V DSGC6NHM. . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
KV.,F . . Q . . . V . , T , . 8 . K . S I . . . M . . . . . . . I
700
A ........
836-i NAURU ISL
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
DEN-2 JAR
VIR.Q°.,YB S. . . . . . ElM ,LEKRHVL. . . . .
DEN-2 Sl DEN-4 DON
,VEP6Q..,D . . . . . . . . . O . . , T . N . . . K . . . . . . . . . . S.Y.,,K . . . . . . E , . . V.K . . . . . A8 . . . . V. IEIR .VNKEKVV.. I.SST.LAEN TNSVT,..L.R.L.D . . . . I ,V,NS.,T.H ..R . . . . . . . . . .
CVI63&I77
DLGSIBGVFT SUGKLIHQiF fiTAY6ILFSfi VSNTNKI616 ILLTNLfiLN8 RSTSLSRTCI AV6MVTLYL6 VMVQADSBCVINNKSKLEKC 68
,T,
,o ......
600
,,.,.
DEN-2 8!
. . . . . . . . . . . . . . . . . . . . . . . . . . . . .
500
, .......
OER-4 DON
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
100
N ..........
, . . . . . . . . . . . .
A ........
T . . . . . . . . . . . . . . . . . . . . . .
V. . . . TE .DS. . . . . . . . . . .
V I R , Q . . . D . S . , . T . , E I N .LEKRHVL...T.V . . . . TE .US. . . . . . . . . . .
D.,.II
A,,,
.....
.VEPGQ...N . . . . . . . . .
O ,.,T,M.,,K ..........
U,,.II
~N S 1
AHF82-80
, . . . . . . . . . . . . . . . . . . . . . . . .
8~6-t
,,,,,,,
NAURU ISL
o°,,,,,.
DEN-2 JAN
. . . . L. . . . . .
I . . A L , , V , .AI.,AA . . . . . . . . . .
L.. V I I . . I . M . . . . . . . .
VSLV L..V . . . . . .
DEN-2 SI
. . . . L. . . . . .
I . . A L , . V . ,AI,,AA . . . . . . . . . .
L.. V I I . . I . M . . . . . . . .
VSLV L . , I . . . . . . . . . . . . . . . .
DEN-4 DUN
. . . . V . . L . . . L . , A V . , V o .SV,TTN,B . . . .
V . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
, . . . . . . . . . . . . . .
. . . . . . . . .
, . . . . . . . .
V . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
, . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . .
, . . . . . . . .
V . . . . . . . .
,o . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
MIR.L.. F,VL, I , L . . N . . M A . . . . . . .
800
, . . . . . . . . . . . . . .
R,,,,
,,
A. . . . . . . . . . . . . . . . . . . . VS,.N . . . . . . .
61..F.. FI . . . . . N.
F i g . 3. C o m p a r i s o n o f d e d u c e d a m i n o a c i d s e q u e n c e s f o r s t r u c t u r a l p r o t e i n s o f D E N - l , DEN-4 virus strains.
DEN-2 and
Variation among dengue 1 viruses
1707
Table 1. Comparison of the structural protein sequences o f D E N virus strains using Caribbean CV1636/77 as the reference strain Type DEN-I DEN-2 DEN-4
Strain Thailand AHF82-80 Philippines 836-1 Nauru Islandt Jamaica~ PR-159(S1)§ Dominica¶
C 9 (8)* 3 (3) 5 (4) 32 (28) 33 (29) 36 (32)
prM 3 (3) 3 (3) 2 (2) 23 (25) 21 (23) 30 (33)
M 1 (1) 1 (1) 1 (1) 23 (31) 23 (31) 31 (41)
E 5 (1) 9 (2) 10 (2) 161 (33) 155 (31) 191 (38)
* Number of amino acid changes. Numbers in parentheses represent percentage of amino acid changes in reference to Caribbean CV1636/77. t Mason et al. (1987). :~Deubel et at. (1986). § Hahn et al. (1987). I1Zhao et al. (1986).
Table 2. Nucleotide substitutions in DEN-1 envelope glycoprotein sequences : comparisons with Caribbean CV1636/77 reference strain
Transitions A-~G G-~A C-~U U-oC Transversions* Total
Philippines 836-1
Thailand AHF82-80
9 10 6 13 8 46
5 9 1 5 3 23
Caribbean * 495-1 924-1 3 0 10 10 4 27
2 0 8 14 5 29
* Transversions: purine to pyrimidine and vice versa.
Divergence in the sequence coding for prM proteins of different DEN-1 strains represented a 3 ~o change. These amino acid changes at positions 56 to 60 are clustered and indicate a possible secondary structure change in Thailand AHF82-80 and Philippines 836-1 compared to Caribbean CV1363/77. The only protein sequence change in the M protein occurred at amino acid position 5 for the DEN-1 strains. The protein sequence similarity was greatest between 836-1 and Nauru Island (Mason et al., 1987) which shared 99.7~ similarity. Comparison o f the E glycoprotein genomic sequences o f DEN-1 viruses To evaluate genetic stability of the E protein nucleic acid sequence variations within a single topotype, the genomes of two recently isolated DEN-1 Caribbean strains (Aruba 495-1, 1985; Mexico 924-1, 1983) were sequenced throughout the E glycoprotein region. Comparison of the E protein nucleotide sequences for the five DEN-1 viruses is presented in Fig. 4 and Table 2. There were 23 to 46 nucleotide changes in the 1485 nucleotides coding the E protein region. This represented a change of 1.55 ~o to 3 ~ with five to nine amino acid changes, giving an at least 97~o sequence similarity over the entire E glycoprotein region. These results show less divergence than those observed between D E N - 2 Jamaica and PR-159(S 1) E sequences (Hahn et al., 1988). The similarity within the E sequence is maintained when comparing the sequence of the DEN-1 Nauru Island (Mason et al., 1987) with our results. The recent Caribbean strains, 495-1 and 924-1, differed from the earlier Caribbean CV1636/77 virus by 27 and 29 nucleotide transitions, respectively, and by one additional transversion in 924-1. Shared nucleotide differences in the Caribbean DEN-1 strains were primarily located at nucleotides 321 to 339. Three nucleotide differences occurred between 417 and 429, three changes between 482 and 547, a region of change between 626 and 720 and four
1708
M.C. E
I0
CHU,
20
E. J.
30
O'ROURKE
40
50
AND 60
D.
W.
70
TRENT
80
90
i00
110
120
CV1636/77
A U•C•AU•••U•••AAUA66CAACA•A8A•UUC•UU•AA6•AC•6UCA••A6CAAC•U6••U••A•••••UAUU••A•CAU8•AA•CU•C•UCA•CACCAU••CAAAAAAUAAAcCAACA
495-I
..............................................................................................................
C ......... C .........
924-I
..............................................................................................................
AHF82-80
........................................................................................................................
836-1
..... 8.............................
.....................
~, .......
8.................
~ .......
~ .......
~,~ . . . . . . .
U. . . . . . . . . . .
~7~. . . . . . .
~
U. . . . . .
.......
C. . . . . . . . . . . . .
~ ....... ~
U. . . . . . . .
.......
U. . . . . . . . . . . .
~ .......
8.C . . . . . . . . .
~ ....... ~ .......
CV1636/77
U U~ACAUu6AA~UCUU8AA~A~A~UCA~6AACCCLl~CC~UCCU8C~CAAACU~U6CAUU~AA~CUAAAAUA8CAA8CACCACCACC~AUUCAA~AU~UCCAACACAA~AGA~8CU
495-i
.............................................
U ..........................................................................
924-I
.............................................
U ..................................................
AHF82-BO
...............................................................................................
836-1
C. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
5~
C ....................... 8 . . . . . . . . . . . . . . . . . . . . A..C
A. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
8....................
..................................................................................................................................... 250 260 270 280 290 300 310
320
330
~40
A..C
350
360
CVI63&177
ACAUU~U~A~AA~AA6~C~C~AACUUU~U~U8~E~C~AAC~UU~U8G~A~AGGCU~G~GCAAU~GUUGUG~8CUAUU~G~AAAA~GUA~UUCUU~AC~U6U~CUAAGUu~A~8
495-I
, , .C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
D .............. G . . U ........ A . . . C . U . . . . . . . . . . . . . . . . . . . . .
924-I
...C
U ..............
AHF82-BO
. .BC ............. 8 . . . A ............. C ........... C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
AA.A .............. U...
836-i
. .BC ............. 8 - . A ............. C ........... C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
AA.A .............. U...
.............................................................
..................... ~ CV1636177
....... ~
....... ~+~ ....... ~
....... ~
....... ~
....... ~
8..U ........
....... ~
....... ~
L..C
.......................
....... ~X~ ....... ~
....... ~
U8U8U~ACAAAACU8~AA88AAA8AUA8~UCAAUAU8AAAACUUAAAAUAUUCA8UCAUA~UCACCGUACACA0U8888ACCA8CAUCAA~UU~AAAU8A~ACCACA~AACAU~8AAcA
495-i
........................................................
8 ........ U..C . . . . . . . . . . . . . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
924-I
........................................................
8 ........ U..C . . . . . . . . . . . . . . . . . C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
AHF82-80
.............................
C ...............................................
836-1
.............................
C. . . . . . . . . . . . . . . . . . . . . . . . . .
8. . . . . . . . . . . . . . . . . . . .
A ..... A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
A. . . . . . . .
...................................................................................................................................... 490 500 510 520 530 540 550 560
C. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 570
580
590
600
CV163,~/77
A UU~CAACC~UAACACCUCA~CUCCCAC~UC~GAA~UAC~8CU~ACA8ACUAC~8A,~CUCUAACAUU~8AUU8U~A~CUA~AACA~CUA~ACUU8A~UA8A~U~U~UU8U8~ACA
495-i
..........................
924-I
........................................................................................................................
AHF82-80
........................................................................................................................
836-I
,C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
..................... ~
....... ~
U .......................................
.......
~]~....... ; ~
....... ~
....... ~
C .....................................................
....... ~,~ ....... ~
....... ~
A ..............
....... ~
....... fi~ ....... ~
CV1636177
AU~A~AA~AUC~U~CuC8UcCAC~ACA~U88UUU~U~8ACUU~C~CU~CCUU~A~UC8~8~CUUCAACAuCCCA~8A~cUU~8AAUA~AC~A~ACUU~CU~UCACAUUU
495-1
....................
924-I
.................... U .........................................
AHFB2-80
........................................................................................................................
836-1
...6.A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
..................... ~ CV1636177
U. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
C. . . . . . . .
C ........ U .............. C
....... ~i~ ....... ~ 6 ....... ~ X ~ - - : - - - - ~ ....... ~&~ ....... ~
....... &~6 ....... ~
495-I
...................................... ...............................
AHF82-@O
................. A ...........................................................
83,~-I
................. A ...................................................
.....................
495-i
....... ~
U. . . . . . . . . . . . . .
....... ~
....... ~
A A~ACA8CU~8CAA~8A~8CA8~AA~UA~uC8UACUA88AUCACA~AA88A8CAAU8CACACU~C~CU~ACU~8~8C8ACA~AAAUC~A~AC8UCU~8AAC8ACAACA~UUUU~CA
924-I
CV1636/77
C
U. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . U ................................
~
....... ~
....... ~
8 .......................
C ..............................
....... ~
....... ~
.......
U ..................................................
C ......
U .........................................................
~ .......
A .............. A . . . . . . . . . . . . . . . . . . . . . . . . . . .
U ....... A . . . . . . . . . . . . . . A . . . . . . . . . . . . . . . . . . . . . . . . . . .
~ .......
~ .......
~ .......
~ ....... ~ .......
~,~
8~ACA~C~AAAU8CA~AC~AAAAAU~AUAAACU8AC~UUAAAA~8~AU~UCAUAU~UAAU~U~CA£A8~C~UUCAA~UUA~A~AA~AA~U~CU8A~ACCCA~CAU~AACU~UU ........................................................................................................................
924-I
........................................................................................................................
AHF82-80
................................................
836-i
........................................................................................................................
B.A . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
...................................................................................................................................... 970
980
990
1000
1010
t020
1030
1040
1050
1060
1070
1080
CV1636/77
CUA~U8CA88UUAAAUACGAAGGAACA~AU8CACCAU6CAA6AUCCCCUUCUCGUCCCAAGAU6AGAAA8GAGUAACCCAGAAuGG8AGAuL~8AUAACAGCCAACCCCAUA8UCAUU8AC
495-1
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
C. . . .
924-1
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
C. . . .
AHF82-80
........................................................................................................................
836-i
...................................................................................................................
C ....
......................................................................................................................................
1090
1100
11i0
1120
1130
I140
1150
1160
1170
1180
1190
1200
CV1636/77
AAA8AAAAACCA88CAACAUu8AAGC88A6CCACCUUUU88[j6A8AGCUAUAU88UG8UAG8AUCA68U8AAAAA8cU8U8AAACUAA8CU86UUCAAGAA888AAGCA8UAuA888AAA
495-1
...............................................................
G. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
924-1
.........................
AHFB2-80
...............................................................
A .....................................
G ........................................................ G ........................................................
B36-I .................................................. C. . . . . . . . . . . . G. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ..................................................................................................................................... 1210 1220 1230 1240 1250 1260 1270 1280 1290 1300 1310 CV1636/77
1320
AU8UuU8AA6CAACUGCCc6U66AGCACGAA88AUG8CCAuCCUB68A8AC~CCGCAU688ACUUC68UUC~AUAG8AG8G8UGUUCAC8UCU8U~88AAAACUGAUACACCAGAUUUUU
495-i
..... C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
924-I
..... C . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
AHF82-BO
........................................................................................................................
836-I
.....................................................
U .............................
C ....................................
......................................................................................................................................
Variation among dengue 1 viruses 1330
1340
1350
1360
1370
1380
1390
1709
1400
1410
CV1636/77 495-1 924-1 AHF02-80
•••ACU•C•t•A•••AA•UUU•t••••••••••••t•UC•U•GA•CAU•AAAA•••••AAUA•••A•U••GCU•ACAU••CUA•SAUUAAAUUCAA••A••AC•tIC•CUUU•AA••AC•U•UAt•C
836-1
1420
1430
1440
. . . . . . . . . . . . . . . 6. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . ...............
G. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
...............
S. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
...............
G. . . . . . . . . . . . .
U. . . . . . . . . . . . . . . . .
G. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
C .................................
...................................................................................................................................... 1450 1460 1470 1480 N S l I 4 9 0
CV1636/77
495-1 924-1
6CABUU66C~UGGUCAC~CUGUACCUAGGA6UCAUG6UUCA~GCGGACUCSGGA ......................................................
......................................................
AHF82-80 836-1
G. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
.................
.....................................................................................................................................
F i g . 4. C o m p a r i s o n o f t h e n u c l e o t i d e s e q u e n c e s f o r t h e e n v e l o p e g l y c o p r o t e i n s o f f i v e D E N - 1 strains.
virus
Table 3. Deduced amino acid changes in the DEN-1 envelope glycoprotein : comparisons with Caribbean CV1636/77 reference strain Amino acid position 37 88 114 160 198 203 297 359 382 446
Philippines 836-1 N ~ A~T L~I I ~ V --* K ~ I~T S ~ I ~
D
T M E
A V
Thailand AHF82-80 -* A--,T L-~I M -~ V S -~ A I -~ V
Caribbean ~ x _ _ ~ 495-1 924-1 -
-
I-~T S-~A [--*V
I~T S-~A I--,V
* N o a m i n o acid c h a n g e is d e n o t e d by -.
changes between 1076 and 1336. There were 22 nucleotide differences common to DEN-1 viruses 495-1 and 924-1, with amino acid changes occurring at identical genome positions. Nucleotide differences are located in clusters for the Philippines and Thailand DEN-1 topotypes (Fig. 4). There were 23 nucleotide differences between Thailand AHF82-80 and Caribbean CV 1636/77 virus strains consisting of 20 transitions and three transversions, leading to five amino acid substitutions. Nucleotide differences do not always reflect protein structural changes; the positions at which differences were observed for Thailand AHF82-80 as compared to Caribbean CV1636/77 were clustered at E protein nucleotides 217 to 288, 339 to 390, 444, 447, 738, 798 to 891, 1206 and at 1336. The Philippines strain 836-1 differed from Caribbean CV 1636/77 by 46 nucleotide changes, with a total of nine amino acid changes in the protein sequence (Table 3). The first 20 nucleotide differences occurred within the E protein gene between nucleotides 6 and 288, followed by six changes between nucleotides 339 and 444, five changes between positions 482 and 611, and finally a series of changes that are clustered around nucleotides 780, 1000 and 1336. Three amino acid differences located at amino acid positions 359, 382 and 446 appear to be unique for Caribbean CV1636/77. DISCUSSION
An unanswered question in fingerprint analyses has been whether geographical variants of D E N can be associated with genomic markers for virus virulence. To analyse diversity and variation among DEN-1 virus strains with different geographical origins and from cases with different clinical manifestations, we have sequenced and compared the genomic regions that encode the structural proteins of different DEN-1 virus strains. RNase T1 fingerprinting
1710
M. (2. C H U , E. J. O ' R O U R K E A N D D. W. T R E N T
randomly resolves 10 ~ to 15 ~ of the genomic RNA into distinct oligonucleotides for analysis (Aaronson et al., 1982), whereas by genomic sequence analyses any region of the RNA can be examined. The total genomic data analysed in this study comprise less than 25 ~o of the genome for any one isolate (2325 of 10269 nucleotides). Within the nucleotide sequences of the DEN-1 structural protein region there is a remarkable similarity among strains separated by geography and time. With the present data we cannot directly correlate genomic variations with specific oligonucleotide changes. However, DEN-I strains obtained from different geographical regions can be differentiated generally by their pattern of nucleotide substitutions at specific loci in the E glycoprotein. Studies by Monath et al. (1986) have shown that anti-E glycoprotein monoclonal antibodies differentiate DEN-2 topotypes. We observed that DEN-1 strains of the Caribbean topotype have E glycoprotein nucleotide sequences which are more similar to each other than to those of strains from Thailand or the Philippines. Caribbean strains (CV1636/77, 495-1,924-1) which were isolated over a span of 8 years show high conservation in the nucleotide and amino acid sequences of the E glycoprotein. Sequence analysis of the E glycoproteins of Australian and Papua New Guinea isolates of MVE virus showed not more than 1.7~ nucleotide sequence divergence in pairwise comparisons of the Australian isolates, 6.8 ~ divergence in the two Papua New Guinea isolates and 9 ~ to 10~ between the prototype Australian prototype and Papua New Guinea strains (Lobigs et al., 1988). These isolates were from widely separated geographical regions over a 25 year period. Oligonucleotide mapping suggests that annual rates of nucleotide substitution in regional isolates of enteroviruses and polioviruses may vary from 0.43~ to 2 ~ (Smith & Inglis, 1987). Although the annual rate of mutation and selection in the DEN-1 genome is unknown, oligonucleotide fingerprint results suggest that DEN-1 viruses in the Caribbean have remained genetically stable (Repik et al., 1983). Based on these observations we could predict a 3 ~ change in the DEN-1 genome sequence between a 1977 and a 1985 isolate (0.43~ x 8 years). The observed nucleotide differences between strains CV1636/77 and 924-1 or 495-1 were 1.9~ and 1.8 ~o respectively, suggesting that the number of nucleotide changes over time was infrequent. These observations may explain why the nucleotide sequences encoding the E protein of the Caribbean DEN-1 isolates from classical DF and fatal DHF/DSS cases are nearly identical although they were isolated over an 8 year period. The JE virus E glycoprotein has two major immunoreactive domains (McAda et al., 1987); Domain I includes the amino-terminus up to amino acid residue 122 and Domain II is located between amino acid positions 293 and 401. Nowak & Wengler (1987) identified R1, R2 and R3 immunoreactive regions for the WNV E glycoprotein. R1 encompasses the same region as the JE Domain I and amino acid residues 184 to 332 and 367 to 446 make up the R2 and R3 regions, respectively. We have examined the amino acid sequence differences among DEN-1 virus E proteins in these regions to determine the genetic stability of the virus. Amino acid differences were observed for the Philippines strain 836-1 at amino acid positions 37, 88 and 114 and the Thai strain at position 88 within Domain I/R1. Two amino acid changes in DEN-1 Philippines strain 836-1 occur at the intra-loop regions between R1/R2 at position 160 and between the R2/R3 loop at position 359. A unique amino acid difference in Thailand strain AHF82-80 was located at residue 297, for Philippines strain 836-1 at residues 196 and 203; all of these are within R2. The final group of DEN-1 amino acid changes are located in Domain II/R3 at residues 382 and 446. The E glycoprotein amino acid changes of our sequenced DEN-1 strains appear to be located in the immunoreactive regions as reported for other flaviviruses. Our results demonstrate that within the DEN-1 serotype the nucleotide sequences of the structural regions are very similar for viruses obtained from different geographical regions. Comparative analyses have not identified genomic markers to permit differentiation of geographical or virulence characteristics. However strains could be differentiated by nucleotide changes and subsequent deduced amino acid changes by examining the clustered pattern unique to each topotype. Assessment of the significance of these genetic changes will require molecular characterization of isolates obtained from different geographical regions and analysis of strains selected for virulence to locate genomic changes associated with alterations in biological characteristics.
Variation among dengue 1 viruses
1711
The authors thank Drs D. S. Burke, D. J. Gubler and C. G. Hayes for providing the DEN-1 strains, Dr B. J. B. Johnson for providing the procedure for sequencing viral R N A , Dr G. J. C h a n g for preparation of the oligonucleotide primers, Dr R. M. Kinney for technical guidance, S. L. Sviat and K. R. Tsuchiya for R N A fingerprinting and Ravithat Putvatana for technical assistance. M.C.C. was a recipient of a post-doctoral fellowship award from the National Research Council. REFERENCES AARONSON, R. P., YOUNG, J. F. & PALESE, P. (1982). Oligonucleotide mapping: evaluation of its sensitivity by computer simulation. Nucleic Acids Research 10, 237-246. BIGGIN, M. D., GIaSON, Y. J. & nONG, C. V. (1983). Buffer gradient gels and 35S label as an aid to rapid D N A sequence determination. Proceedings of the National Academy of Sciences, U.S.A. 80, 3963-3965. BRINTON, M. A. & DISPOTO, J. H. (1988). Sequence and secondary structure analysis of the Y-terminal region of flavivirus genome R N A . Virology 102, 290-299. BURKE, D. S., NISALAK, A., JOHNSON, D. E. & SCOTT, R. M. (1988). A prospective study of dengue infections in Bangkok. American Journal of Tropical Medicine and Hygiene 38, 172-180. COIA, G., PARKER, M. D., SPEIGHT, G., BYRNE, M. E. & WESTAWAY,E. G. (1988). Nucleotide and complete amino acid sequences of Kunjin virus: definitive gene order and characteristics of the virus-specified proteins. Journal of General Virology 69, 1-21. DALE, R. M. K., McCLURE, B. A. & HOUCHINS, I. P. (i985). A rapid single-stranded cloning strategy for producing a sequential series of overlapping clones for use in D N A sequencing: application to sequencing the corn mitochondrial 18 r D N A . Plasmid 13, 31-40. DALGARNO, L., TRENT, D. W., STRAUSS, J. H. & RICE, C. M. (1986). Partial nucleotide sequence of Murray Valley encephalitis virus genome: comparison of the encoded polypeptides with yellow fever virus structural and nonstructural proteins. Journal of Molecular Biology 187, 309-323. DEUBEL, V., KINNEY, R. M. & TRENT, D. W. (1986). Nucleotide sequence and deduced amino acid sequence of the structural proteins of dengue 2 virus, Jamaica genotype. Virology 155, 365-377. DEUBEL, V., KINNEY, R. M. & TRENT, D. W. (1988). Nucleotide sequence and deduced amino acid sequence of the non-structural proteins of dengue type 2 virus, Jamaica genotype: comparative analysis of the full-length genome. Virology 165, 234-244. HAHN, C. S., DALRYMPLE,J. M., STRAUSS,J. H. & RICE, C. M. (1987). Comparison of the virulent Asibi strain of yellow fever virus with the 17D vaccine strain derived from it. Proceedings of the National Academy of Sciences, U.S.A. 84, 2019-2023. HAHN, Y. S., GALLER, R., HUNKAPILLER,T., DALRYMPLE, J. M., STRAUSS,J. H. & STRAUSS, E. G. (1988). Nucleotide sequence of dengue 2 R N A and comparison of the encoded proteins with those of other flaviviruses. Virology 162, 167-180. rIALSTEAD, S. B. (1981). The pathogenesis of dengue: molecular epidemiology in infectious disease. American Journal of Epidemiology 114, 632~548. rIALSTEAD, S. B. (1988). Pathogenesis of dengue: challenges to molecular biology. Science 239, 476-481. HENCHAL, E. A., GENTRY, M. K., McCOWN, J. M. & BRANDT, W. E. (1982). Dengue virus specific and flavivirus group determinants identified with monoclonal antibodies by indirect immunofluorescence. American Journal of Tropical Medicine and Hygiene 31, 830-836. KINNEY, R. M., JOHNSON, B. J. a., BROWN,V. L. & TRENT, D. W. (1986). Nucleotide sequence of the 26S m R N A of the virulent Trinidad donkey strain of Venezuelan equine encephalitis and deduced sequence of encoded structural proteins. Virology 152, 400-413. LOBIGS, M., MARSHALL,I. D., WEIR, R. C. & DALGARNO,L. (1988). Murray Valley encephalitis virus field strains from Australia and Papua New Guinea: studies on the sequence of the major envelope protein gene and virulence for mice. Virology 165, 245-255. McADA, P. C., MASON, P. W., SCHMALJOHN,C. S., DALRYMPLE,I. M., MASON,T. L. & FOURNIER, M. J. (1987). Partial nucleotide sequence of Japanese encephalitis virus genome. Virology 158, 348-360. MASON, P. W., McADA, P. C., MASON,T. L. & FOURNIER, M. J. (1987). Sequence of the dengue-1 virus genome in the region encoding three structural proteins and the major nonstructural protein NS1. Virology 161, 262-267. MONATH,T. P.~ WANDS, J. R., HILL, L. J., BROWN,N. V., MARCINIAK,R. A., WONG, M. A., GENTRY, M. K., BURKE, D. S., GRANT, J. A. & TRENT, D. W. (1986). Geographic classification of dengue-2 virus strains by antigen signature analysis. Virology 154, 313-324. MORENS, D. M., HALSTEAD,S. B., REPIK, v. U., PUTVATANA,R. & gAYBOURNE,N. (1985). Simplified plaque reduction neutralization assay for dengue viruses by semimicro methods in BHK-21 cells: comparison of the B H K suspension test with standard plaque reduction neutralization. Journal of Clinical Microbiology 22, 250-254. NOWAK, T. & WENGLER, 6. (1987). Analysis of disulfides present in the m e m b r a n e proteins of the West Nile flavivirus. Virology 156, 127-137. PORTERFIELD, J. S. (1980). Antigenic characteristics and classification of Togaviridae. In The Togaviruses, pp. 1346. Edited by R. W. Schlesinger. New York: Academic Press. REPIK, P. M., DALRYMPLE, J. M., BRANDT, W. E., McCOWN, J. M. & RUSSELL, P. K. (1983). R N A fingerprinting as a method for distinguishing dengue 1 virus strains. American Journal of Tropical Medicine and Hygiene 32, 577-589. RICE, C. M., LENCHES,E. M., EDDY, S. R., SHIN, S. J., SHEETS,R. L. & STRAUSS,J. A. (1985). Nucleotide sequence of yellow fever virus: implications for flavivirus gene expression and evolution. Science 229, 726-733.
1712
M. C. C H U , E. J. O ' R O U R K E AND D. W. T R E N T
SANGER, F., NICKLEN,S. & COULSON,A. R. (1977). D N A sequencing with chain-terminating inhibitors. Proceedings of the National Academy of Sciences, U.S.A. 74, 5463-5467. SMITH, D. B. & INGLIS,S. C. (1987). The mutation rate and variability of eukaryotic viruses: an analytical review. Journal of General Virology 68, 2729-2740. TRENT, D. W., GRANT,J. A., ROSEN,L. & MONATH,T. P. (1983). Genetic variation among dengue 2 viruses of different geographic origin. Virology 128, 271-284. TRENT, D. W., KINNEY,R. M., JOHNSON,B. J. B., YORNDAM,A. V., DEUBEL, V., RICE, C. M. & HAHN, C. S. (1987). Partial nucleotide sequence of St. Louis encephalitis virus R N A : structural proteins, NS1, ns2a and ns2b. Virology 156, 293-304. VEZZA, A. C., ROSEN, L., REPIK, P., DALRYMPLE, J. & BISHOP, O. H. L. (1980). Characterization of the viral R N A species of prototype dengue viruses. American Journal of Tropical Medicine and Hygiene 29, 643-652. WENGLER, G., NOWAK, T. & WAHN, K. (1987). Analysis of the influence of proteolytic cleavage on the structural organization of the surface of the West Nile flavivirus leads to the isolation of a protease-resistant E protein oligomer from the viral surface. Virology 160, 210-219. WESTWAY, E. G., BRINTON,M. A., GAIDAMOVICH,S. YA., HORZINEK,M. C., IGARASI-II,A., K.~,.RI.~.INEN,L., LVOV, D. K., PORTERFIELD, J. S., RUSSELL, P. K. & TRENT, O. W. (1985). Flaviviridae. Intervirology 24, 183-192. YANISCH-PERRON, C., VIEIRA, J. & MESSING, J. (1985). Improved M13 phage cloning vectors and host strains: nucleotide sequence of the M13mpl8 and p U C I 9 vectors. Gene 33, 103-119. ZHAO, B., MACKOW,E., BUCKLER-WHITE,A., MARKOFF,L., CHANOCK,R. M., LAI, C.-J. & MAKINO,Y. (1986). Cloning full length dengue type 4 viral D N A sequences: analysis of genes coding for structural proteins. Virology156, 77-88.
(Received 12 December 1988)