(GCG package version; Devereux et al., 1984). The percentage identity and similarity between amino acid sequences were computed by using. GAP with the ...
Journal of General Virology (1992), 73, 2189-2194. Printed in Great Britain
2189
The nucleotide sequence of RNA-2 of raspberry ringspot nepovirus V. C. Blok, 1 J. WardeH, 1 C. A. Jolly, 1 A. M a n o u k i a n , ~ D. J. Robinson, t M . L. Edwards 2 and M . A. M a y o 1. 1Scottish Crop Research Institute, Invergowrie, Dundee D D 2 5 D A and 2N E R C Institute o f Virology and Environmental Microbiology, Mansfield Road, Oxford OX1 3SR, U.K.
The nucleotide sequence of raspberry ringspot nepovirus (RRV) RNA-2 consists o f 3928 nucleotides and a poly(A) tract at the 3' end. RNA-2 contains one open reading frame which encodes a polypeptide of Mr 123508 (123K). E d m a n degradation located the N terminus o f the coat protein 514 residues from the Cterminal end o f the 123K protein, which suggests that the coat protein is released from the polyprotein by cleavage o f a C - A bond. T h e R R V coat protein has
some sequence similarities with the coat proteins o f other nepoviruses, but is no more like any one nepovirus than another. In contrast, the portion o f the 123K protein to the N-terminal side o f the coat protein is similar in sequence to the corresponding parts o f the polyproteins of t o m a t o black ring and grapevine c h r o m e mosaic nepoviruses, though not to those o f other nepoviruses.
Introduction
(Meyer et al., 1986), grapevine chrome mosaic ( G C M V ; Brault et al., 1989), grapevine fanleaf ( G F L V ; Serghini et al., 1990) and (in part) arabis mosaic (ArMV; Bertioli et al., 1991), and for the RNA-2 molecule of the subgroup 2 virus tomato ringspot (TomRV; Rott et al., 1991). As a first step towards identifying the gene products responsible for the characteristics assigned to RNA-2, and to examine the relationships among RRV and other nepoviruses, we have determined the nucleotide sequence of RRV RNA-2.
Raspberry ringspot virus (RRV), a nepovirus, can cause economically significant disease losses in raspberry crops (Murant, 1978). It has a bipartite genome (Harrison et al., 1972) composed of an 8 kb (RNA-1) and a 4 kb (RNA-2) R N A molecule (Murant et al., 1981), each of which is polyadenylated (Mayo et al., 1979). RNA-2 carries determinants for the serological specificity of the coat protein, for nematode transmissibility and for a yellowing symptom in infected Petunia hybrida; RNA-1 carries determinants for other types of symptom (Harrison et al., 1974). The Mr of the RRV coat protein has been estimated to be 54000 (Mayo et al., 1971) or 57000 (Acosta & Mayo, 1990b) by gel electrophoresis in different gel systems, but in vitro translation of a mixture of RRV R N A species yields mainly larger polypeptides and none that comigrates with protein from virus particles (Jones et al., 1985). As with other nepoviruses such as tomato black ring virus (TBRV; Meyer et al., 1986; Demangeat et al., 199t), it is likely that RRV coat protein is released from a polyprotein by the action of a virus-encoded protease. Nepoviruses can be divided into several subgroups on the basis of serological relationships (Murant, 1981), or, more simply, into two subgroups according to whether the RNA-2 molecule is smaller (subgroup 1) or larger (subgroup 2) than 5.4 kb (Francki et al., 1985); RRV belongs to subgroup 1. Sequences are known for the RNA-2 molecules of the subgroup 1 viruses TBRV 0001-0883 © 1992 SGM
Methods Virus propagation and purification. An inoculum of the S strain of RRV (Harrison, 1958)was obtained from a single local lesion (Acosta & Mayo, 1990a),and the virus was propagated in Nicotiana clevelandii. Virus particleswerepurifiedfrom infectedplants as describedby Mayo et al. (1982). cDNA synthesis and cloning. RNA was extracted from virus particles by treatment with phenol and SDS (Mayoet aL, 1982),and cDNA was prepared from it by reverse transcription, essentiallyas described by Gubler & Hoffman(1983). Primers were oligo(dT),5"GGACATCCTGACCTTTCCAG (primer A; complementaryto nucleotides2701 to 2720), 5" AGACATCCGTGGCAGGTT (primer B; complementary to nucleotides 1245 to 1262) or 5' TTAATATTTGTCACCAGAAAGA (primer C; complementaryto nucleotides94 to 111). The cDNA was ligated into SmaI-cut pUCI9 and cloned in Escherichia coli JM101 or DH5~ as described (Mayo et al., 1989). The RNA species from which each oligo(dT)-primedclonedeDNA had been transcribed was determinedby probing Northern blots of purifiedRRV RNA with labelled cloned cDNA (Sambrook et al., 1989).
2190
V. C. B l o k and others
Nucleotide sequencing. Sequences were determined by dideoxynucleotide chain termination (Sanger et al., 1977) essentially as described by Mayo et al. (1989). The T-terminal sequencesof RNA-1 and RNA-2 were likely to be very similar, and therefore they were determined only from eDNA clones that also possessed sequence unique to either RNA species. The sequence of the Y-terminal 13 nucleotides was obtained by primer extension from primer C as describedby Mayoet al. (1991).Sequenceswere assembledby usingthe programs DBUTIL and DBAUTO (Staden, 1982), and analysed by using ANALYSEQ (Staden, 1984), COMPARE and DOTPLOT (GCG packageversion; Devereuxet al., 1984).The percentageidentity and similaritybetween amino acid sequenceswere computed by using GAP with the default settings (GCG package version; Devereuxet al., 1984). Determination of the N-terminal amino acid of RR V coat protein.
B component particles of RRV were purified by centrifugation to equilibrium in CsCI in 10 raM-phosphatebufferpH 7 (density 1.5 g/ml) for 16 h at 4 °C, and protein was extracted from them by heating in 1% SDS. Coat protein was separated from contaminatingprotein by SDSPAGE (Acosta & Mayo, 1990b), transferred to a Immobilon P membrane (Millipore) and subjected to Edman degradation using an Applied Biosystems470A protein sequencer (Hewick et al., 1981).
Results Sequence o f R R V R N A - 2
The complete nucleotide sequence of RRV RNA-2 is shown in Fig. 1. Except for the 13 5'-terminal nucleotides and the 27 T-terminal nucleotides, all the sequence was determined by sequencing D N A in both orientations. The molecule contains 3928 nucleotides [excluding the poly(A) sequence of unknown length at the 3' end]. More than 90 % of the sequence was obtained from more than one cloned eDNA. Four variations were found from the sequence shown in Fig. 1 : G replacing U at position 18 (non-coding), C replacing U at position 1427 (no coding effect), C replacing U at position 2552 (no coding effect) and A replacing G at position 3300 (an Ala codon changed to a Thr codon). The sequence contains one large open reading frame (ORF) which encodes a polypeptide of Mr 123508 (123K). The only other ORF of more than 300 nucleotides in length is in the negative sense, commencing at nucleotide 2392 and ending at nucleotide 2021 (numbering as in Fig. 1); there is no analogous O R F in RNA-2 of other nepoviruses. The sequence deduced for the 5' non-coding region of RRV RNA-2, like those of other nepovirus RNAs (Rott et al., 1991), contains relatively few G + C residues and abundant U residues. CU dinucleotides are especially abundant near the 5' end, and UC, CU and U U dinucleotides account for 91 of the 205 dinucleotides. The 3' non-coding region is 397 nucleotides long with a composition of 23% A, 22% C, 23% G and 32% U and substantially the same sequence as RNA-1 (Fig. 1).
Oligonucleotide sequences have been shown to be shared among the Y non-coding sequences of nepovirus RNAs (Serghini et al., 1990) and some sequences have been found in the non-coding regions of both TBRV R N A and cowpea mosaic virus R N A (Meyer et al., 1986); none of these sequences were found by A N A L Y S E Q in RRV RNA-2. Proteins encoded by R R V R N A - 2
The unambiguous sequence of the N-terminal amino acids of isolated coat protein obtained from three analyses was A Y E V D P L H L L Y Y E S V . Its position in the 123K polypeptide is shown in Fig. 1. Therefore the coat protein arises by proteolysis of the cysteinyl-alanine bond between residues 593 and 594. Assuming this cleavage is the primary processing event, the polyprotein sequence on the 5' side of the coat protein would yield a 66K protein or, if cleaved further, proteins with a combined size of 66K. There are three other CA dipeptides in the 123K protein, but only that at residues 407 and 408 has an adjacent sequence (MCAFEV) similar to the sequence around the site of cleavage of the coat protein (GCAYEV), suggesting recognition by the RRV protease of a sequence to the C-terminal side of the cleaved peptide bond; cleavage between residues 407 and 408 would yield products of about 46K (N-terminal part) and 20K (C-terminal part). However, cleavage of the polyprotein of tobacco etch potyvirus by a virusencoded protease is determined by sequence to the Nterminal side of the cleavage site, and does not involve sequence on the C-terminal side (Dougherty & Carrington, 1988). No cleavage products were detected during in vitro translation of a mixture of RRV R N A species (Jones et al., 1985). The Mr of 56859 calculated for the RRV coat protein corresponds well with the estimate of 57000 obtained from gel electrophoresis by Acosta & Mayo (1990b). During electrophoresis, RRV coat protein can adopt a form that migrates with an apparent Mr of 44000 (Acosta & Mayo, 1990b). This change is reversible but can be prevented or reversed by alkylating -SH groups in the protein (Acosta & Mayo, 1990b). However, there is no obvious link between this effect and the cysteine content of RRV coat protein because TBRV coat protein, which does not behave anomalously in electrophoresis, contains nine cysteine residues (Meyer et al., 1986) whereas RRV coat protein contains seven. Comparisons with other nepoviruses
Comparison of the coat protein sequence of RRV with those of TBRV, GCMV, GFLV, T o m R V and ArMV showed 22% to 24% identity and 46% to 49% similarity.
Raspberry ringspot nepovirus RNA-2 10
30
50
70
U G U C U C A A A U C U C A A G C U C ALrtIAUAC G~C~CUCUC U C U C U C U C U C U G G U U A C G n 3 ~
C
k G AUU C C AAAUIJAGUC~
130 150 170 C U ~ C A A U C G C U C A A A A A C AAGIrLrJG AAtn3rJCUC U C U ~ CrJCC C AC A U A C C
~
250
270
90
190 L
~
290
AUCIIC A A U C U C U C L V O U ~
AC~OA~
210 C G C GIJUCUGG AC A C G A U G U C C C A A [ . W 3 U U ~
AG ~
M
E
910
S
2191
110
Q
F
W
~
230
G
930
C CC G ~ ~ F
P
E
X
V
H
H
L
L
~
L
H
A
P
A
W
E
L
C
L
A
S
A~A A G
350
U~A~I~CAAACCUI~CCAGCA~CUUCAGGUCGCCCUUAUUGGCGACAUUA~[JG1~C~CUCCUCCCCCCU~CCGA~CC~UCGAU~CA~CCA~AUCA~
I
Q
T
F
Q
B
L
Q
V
A
L
I
G
D
I
K
K
C
A
L
S
S
P
L
F
P
E
L
370 390 410 430 UGGCCUCC~13CGACC~rt1CCCAGGLrdCGGAGG~GLr~AC1~CCUC~GUCAUC~AACAAC~G~ACC.CAGA~CUGA~C~C~G~C~C~G~ACACG A S F E L P R F G G %; T P G V M E Q L R D A E S E L A E 49O
510
590
S
K
L
D
A
H
S
450 A
550
K
Q 470
Q
R
L
L
;~
570
E
590
CGGUAC,CC~CAGACAA~CAUCCCC~ACCUUC~GGGA~UG~AUGUACUA~GAuC~CACC~r~CAL~ACCCAC~GAAG1kACU~C~A~A~A~CCC~C~
V
A
N
R
Q
N
I
P
Y
610
L
G
D
C
M
Y
630
y
O
A
[=
G
I
S
650
Q
E
K
L
b
Q
A
670
A
730 750 CUI/LK/GAUGCUCUI]GUGGAGGACCAUI GGAUCC F D A L V E E H W M
iii0
1130
1230
L
1050 N
K
1150
1250
V
1270
1370
G
G
F
1470
1590
1690
17 I0
1630
1730
C
Q
I
950 AGA~AU~G Q S M G
D
1070 A ~ G A ~ C A U ~ C T K S A L F H K
Q
E
K
Y
T
G
W
N
N
A
R
N
S
L
V
R
L
P
D
D
I
C
G
V
E
V
N
L
F
R
D
P
E
I
E
K
1430 M
C
A
F
1550 K
I
A
C
1650
1750
S
1310
1530
~UGAAC~AULr~AACGUGGGUA~ACC~AUGUUGAGL~CCUCCCU~GA2~L~CUGAGCu2~KAC,CA~UGC~GA~~AG~A~CAU~G~GA~ C F N V G T P N L ~ F ~ ~ L m F S E L K Q T ^ F Z
T
890
1410
1510
I~i0
D
1290
1390
1490
L
1190
UGGCLK1CC~AGAACAAUAGAUC~LF`3AI%AUCUACCAC~GCUGAGU~CCL~CUAGACL~GGCDGAUCUACAUGAUL~CCA~CCGUGUA~GAUCGC~C~CGUGAuC A S Q N N R S L N L P L L S L P F S R L L A D L H D F Q N R V
1570
D
A
~GGA~CI20GCAGL~AAGU~UCAUAUAG~AGUGGGUA~AC~UGUC-CGAGUUAUUA~3Lr~CC~CU~GUACAUGG~C-CL~UG~CAGAUAU~CCA~GCUA~U~GA~CU D L A V K S H I A V G T p V R V I I S L V D G A C S D M P T A T
1450
T
1170
AUGCUGUGAGGAcUACUGCAGGUC~AUUAUACAUAGAAUGGAUAAACCUGCCACC~AUGUCUGA~UUCCC~CACGCAAC.CU~CGAAGL~GGAAU~AuA~CG A V R T T A G p L Y I E W I N L P R M S E N S A R K L A E V
1350
AIp
710
AGCCCc~AUCGCACUCU~U~GCUCAAACUUGGGCA-KAUULrtJAGGGGUACUCAGGAUA~AC~CGGACL~AAC~ACCAU~ACAUG~UA~G~UAuACA~C~CCUG P L S H S L A Q T W A N F R G T Q D K A D L V K V T M D M N I
1330
E
930 AGGAUUAU~AUA~C~GA~CCAUAC Q D Y A D S Q D S
970 990 1010 1030 A[KJ%ICUC.~GAAAGAGLKJCUAUAC C AAGG~,CUC AGGAAAGA/kAAU AC C C G A ~ ~U ~ U C ~ A G A C U A ~ C ~ U G A U C ~ C ~ G ~ C F W K E F Y T K D S G K K I P D S H K S R L A N D P
1210
L
770 790 810 CUALKIUACCUC CUG A U A U C C A C AtFtF,ZICC A ( 3 G I ~ C AAC A G U A C G C C A C U C A A C C ~ C C ~ J A ~ U ~ U ~ F I Y L L I S T F S M F Q Q Y G T Q P L L L E
850 870 890 910 GU~UGAUUCCUC~CAUCCAUGADGACUGACCA~AUC~F~GAC~CCAACCC~AUCC~AC~CCCGACAAC~GA~CCUCCCC L I P A C M M T D H 11 L E P T G D R Q A 0 K E A R
1090
F
690
AACAUGAACCCAUCAGGCCA~JGUGGCCUAAGGAUGAA~GGUL~JAC~GGACC,CUAGAC~L~GGACCCUAC~JAGAGGAL~3AUGGU~UA~C~GA~AGACA~ H E R I R P L w p K 19 E W F R 13 A R Q G F Y L E D Y G N I P
Q
1670 L
~
[~
S
W
1770
S
1790
AACGAC~CL~I~GCCADGGU~G~GGUCGLr~GUGU~GCUUCCCAGG~t3Atr~GUC~CAGACCUGGGAAAAcGAC~UL~AAUCCUCCUCUCAAAGAAUAUGCACCGC~GGUUL113ACCACCCG~CCCUC
R
A
C
H
G
G
G
R
1810
C
V
A
S
Q
G
I
1830
V
Q
T
W
E
K
E
1850
V
N
P
F
L
K
1870
E
Y
A
P
L
V
L
1930
1950
]970
1990
P
2010
2070
2090
2110
2130
R
I
T
A
T
S
C
F
S
P
I
V
L
G
A
C
F
A
Y
R
R
I
P
A
T
K
T
D
G
D
V
P
M
P
K
I
K
M
2270 G G G U A U C ACC GCAUCCCLrOG[GACAGGD~3
L
2410 2430 2450 2470 2490 2510 [/GAUCUAUGUGCUC AG A G G U A A U G A A A U C A C C G CUGU73GC CGAb73GGUC 2MI~UA~JAUGUIYUCL~CADGUC G A U U G G A C ACAGGAAAGUAUGLrOGGCC A C A C ~ A U C C C U A C ~ A U I Y V L R ~ N E I T A V A D W S I e V S F H V V W T Q E S M L A T b I P T F
V
W
2590
A
V
L
2570
D
S
G
2550
G
Q
2290 2310 2330 2350 2970 2390 UGC C C A A U A C U G U C U G G G C AAC G C G U G A C ACC U C A G A G G U C G A A U G C G A U A U C G A U C U C GC GC,C A G U G U G U G G A C A U A C U L " J C U ~ C C G U ~ G G A A G A C A C C U U U G G A U A C A U G G A L ~ C P N T V W A T R D T S E V E W D I D L A A V C G H T F F A L E D T F G ¥ M D F
2530
A
P
2150
AGGAUACUC~3v0GCUGGUACUUUALF~GG~UCGGAUCGACGUUCGAGCAAAGGC~GC~-AUUL~GAUAGUCCUGUAUGGCGACAAUGGGUUAGAGAUGGUi~GUCUCA~KACCU~UC~ D T L A G T L L A R I D V R A K A A I F D ~ A V W R Q W V R D G C L K
2170 2190 2210 2230 2250 U GC GCAU~JACUGCAGCUACAUC GUGL~JUCUC CGGUAU~3GUA( UGGGUGC AUGtrOL"3GACG C A U A C C G A A G A A U C C C G G C A G C U A C A ~ C
V
2030
GGGGA~CAACAUCCACACAACCGCCAUCA~GGGLF0CLrt23GCGGUGCGAAAAt3~UGCC[JGGUi1Gi~GCCUAuGAGGUAGAUCCACUGCALrtT~AUUGUAUUAUGAAUCC~GAUGU~C~ G S T S T Q P P S K G S L R C E N V P G C A Y E V D P L H L L Y Y E S V .......................
2058
P
1910
1890
AACCA1~GGAAt~IZ~CA~AGAUCAA~AAACUGGGGAGG~GGLrt~AA~CL~A~GCAA~GUCCCC~CUCUAUGCGAUUCAAA~CACCA~CUGAL1L~UA~GGAGCAGGCC~CUGU~AUG P K R N F I D Q Q T G E V V Q S F M Q K S R S M R F K S P E D L W S R
G
I
T
A
S
2610
L
V
T
2630
GGCC~CCGA.AGCCGAC~GA~AUAUCUCLr~CUUAAGGAAGUAUGGGGGCCAt)ACCGUL~ACGCUG43AUGGUAcCGAGGCCAAAGAAAG~UAU~U~CC~ACA~A~CC P P K P T D I S L L K E V w G P %" R F T L D G T E A K E S F A I
A
I
L
H
G
T
Y
M
2770 2790 2~ 10 2830 2~50 2~?0 UGGU~JGGUGUGUCCUGGAAUGCUACUGCGGAUC UCGCCGACAUCGUCACUCGGAAACALrdGGAL~JGUAAAGUCCAAUGAGAL~"JGAC~GGACUI]At)A~GUCCAUAUGGUGAG~ V G V S W N A T A D L A D I V T ~ K H W I V K S N E ~ F E V D L ¥ C P Y G E
N
P
G
L
T
3110 AUACCAG~AG~G~G I e G ~ I
£
0
2650
2670
2690
2710
M
P
G
2730
2890
2910
2930
2950
2970
C~ACCUI3~ACGGGUCAGGCCAA~GGAAAGCCAU~CAUCAUAGUGCA~AAACU~C~C~GAAUCGb~CC~CC~ACUCAG~GGGACG~AUGAUACACAUACAC~C~GA T F T G Q A N G K P F I I V H K L G G 1 V G P K D S V S T W G
3010 3030 3050 3070 3090 C U G G U G U A U A U A A G A A C C C A A C GLroAc A C A G U G G U G A U C G C U C U G UGGGGAGUGCCUGGU~3CCGC AUC A A U A A U A U ~ G C C G A U G A U ~ U C U ~ U G V y K N e T 5 H S O D ~ S V O S A W ~ R I ~ ~ ~ A ~ n ~ L V
T
2950
AUGGGCAG~AGA[K~GUCCGCACAU~CCCGAGUCG~373~CC~CACACUUCCGAUCAUG~ACi~GGAA2sC~GUCAGGAUGi1CCALrdCAAGA~GULKJCCUCUA~CCU~ACC~CAUA~A G Q Q I v R T F P R V V A A H F I~ $ W T G K V R M S I Q E V S $ I F L
T
2990 F
M
~ C F ~
I
a
I
H
3130 3150 1170 9190 3210 3230 AUAUUA[7OGC G G C U G C U G G G A A G U A U G A U G U C A C A A A U U A U G U C A A U C C U A C DAG C[3UGCUG [ K ~ C A G U U A C U G G A b U G C A U G G U G G U A U U A U A C GUCUCCA[3AUI3AC G~JGGUGCCC A A I I A A A ~ I< Y D V T N Y V N P T S L L F S V T S L H G G I I R L H I T W C P N 3250
3270
9290
3310
9330
3350
A~ACCACUCL"JGGUGAGUCAAAGGGUACUL~UGAAGUAUAUGCAAUA~CUGUACCAUACGC.C~ACUGAGA2tUi~UC~It~GGCGACCAAGCAA~ACGUGGGAUCAUUGACCAGGAUGG~CA T T L G E S K G T L K Y M Q Y L Y H T A T E N F F G D Q A T R G I I D Q
G
F
T
CAAU~)GAUALF~GCAUGUGGUGAULKJCUL~UGG~GCUACGAGAGU~1GGULF~G~CCGGGCAAGbUGA~kAGA~UUGGCAULrt~ACAGUUCGAAUGCCAAGU~CAbUG~UGAAAUUAGGGUiF~CUi I D I ~ C ~ D ~ ~ ~ A T r~ ~ ~ L P ~ Z V ~ r~ L G I ~ S ~ N A K S I A ~ I R V
S
I F
3370
3490
3390
3510
3410
3530
3430
3550
3450
3570
D
3470
3590
U~GAAGU~CUAUCCAUGAALrt1UC~A~GGUAGCACCAU~AAAGUGACGUGAACUGCCC~ADGCUCL"~CGGCUCCGAC~CL~A~CGAGCCUACCAAAAGCUGUCCUCU~C~A~G E V L S M N F Y G S T I K V T *
3610 3630 3650 3670 3690 3710 U A G A G A A U C C froG A G U C U A G A C ~ A C U C A U G G G U C C C A C.CG G U A C U G IF0A A U G G U ~ U,J~3C U13C U G G U G G U C A C G U A C C A G A G C A G U A U G A C G U C C U ~ C C U A UL1%ICG A C A G U G A A A [3A G G G C 3730
3750
9770
3790
3810
3830
U~L~0UGCAGAGAGGGUUCGUGUAAAGGUGUCCUACGA~L~3GULr~C~UA~ACAUAADAAC~CCACAAI)UI3AUCI~GUAtruir~AAUGCUi~J~AGCGAUAGAuGAGU~AC~UG~
3850
3870
9890
3910
CCCUCUA2~GAC~CG~1GCCLF~UAGCAA~CACACA2~A~GCADLr~G~U~G.1j~CU1IAAC~CLK3C~CUAG]3GUCG.~CU~CCG~C
Fig. 1. Nucleotide sequence of RRV RNA-2; the RNA sequence shown is followed by a poly(A) sequence. The amino acid sequence encoded by the single large ORF is shown below the nucleotide sequence; an asterisk indicates a termination codon. The amino acid sequence obtained by Edman degradation of RRV coat protein is underlined.
Very' similar values were obtained for all pairwise comparisons except those between viruses in the same serological subgroup: TBRV and GCMV are 5 7 ~ identical and GFLV and ArMV are 69~. identical. The
comparisons also showed three small clusters of highly conserved and similarly located amino acid sequence in all six coat proteins. These were (F/L)-D-A-(Y/F)-X(R/K), (V/L/I)-C-G-(H/Q) and F-Y-G-X-(S/T) (starting
/
2192
V. C. B l o k and others
I
(a)
s
I
(b) /
Coat (57K) 1000
1000
/
Coat
(57K)
J
/
S
•
500
.....
/
f
46K 500
-___- ...............................
50K
I
I
I
I
500
1000
500
1000
Fig. 2. D O T P L O T presentation of comparisons of polyproteins encoded by R R V R N A - 2 (horizontal axes) and T B R V R N A - 2 (a) or G F L V R N A - 2 (b). The horizontal and vertical lines indicate the positions of known cleavage sites in each sequence. The identities of the proteins in the T B R V and G F L V R N A - 2 sequences are shown to the right of each panel. The window length was 30 and the stringency was 17.
from residues 738, 782 and 1167, respectively, in the RRV 123K protein; X represents any amino acid). The extent of the similarity between the coat protein of RRV and that of either TBRV or GFLV is also shown in a DOTPLOT comparison of the sequence of the RRV 123K protein with either the TBRV 150K or the GFLV 131K polyprotein (Fig. 2); the similarities between coat proteins are greatest and most extensive towards the N termini. Although the coat proteins of TBRV and GFLV are almost equally similar to RRV coat protein, this is not the case for the part of the polyproteins on the Nterminal side of the cleavage site of the coat protein. The TBRV 150K protein (Fig. 2a) resembles the RRV 123K protein in this region, as does the 150K protein of GCMV (data not shown), but the GFLV 131K (Fig. 2b) and TomRV proteins (data not shown) do not.
Discussion The sequence of RRV RNA-2 presented here extends the list of sequences known for nepovirus RNA molecules and permits further comparisons to be made. Examination by DOTPLOT and GAP suggests that the coat proteins of nepoviruses, including RRV, from different serogroups (Murant, 1981) are about equally similar in sequence, and that the amino acid sequence similarities detected by Serghini et al. (1990) between TBRV and GFLV coat proteins apply substantially to all nepovirus coat proteins, including that of RRV. In
contrast, the 5'-terminal non-coding sequence of RRV RNA-2 (Fig. 1) does not conform to the consensus sequence of U-(U/G)GAAAA(U/A)(U/A)(U/A) proposed for nepovirus RNAs by Fuchs et al. (1989). The results of amino acid sequencing of RRV coat protein suggest that it is cleaved from the 123K polyprotein by proteolysis of a C-A bond. Cleavage sites for the release of other nepovirus coat proteins from polyprotein precursors are R-A (GCMV; Brault et al., 1989), R-G (GFLV, ArMV; Serghini et al., 1990; Bertioli et al., 1991) and K-A (TBRV; Demangeat et al., 1991). A cleavage site of C-A is unusual in that it has not been reported for any other plant virus or for picornaviruses (Palmenberg, 1987). However, proteases of some alphaviruses are known to cleave this bond (Strauss & Strauss, 1990). Rott et al. (1991) noted that the regions of the RNA-2 polyprotein sequences of TomRV and GFLV which are most similar are those immediately to the N-terminal side of the coat proteins, and suggested that these may be related to a possible transport function. An alignment of these parts of the RRV 123K polyprotein and the TBRV 150K polyprotein is shown in Fig. 3. There are patches of identical or nearly identical sequence in the two polyproteins, just as there are in the sequences of the GFLV and TomRV polyproteins (Rott et al., 1991), but none of the sequences common to either pair of viruses is common to all four. Thus comparisons in this region of the polyprotein distinguish two groups of viruses: RRV
Raspberry ringspot nepovirus R N A - 2
(a) (b)
SENSARKLAEVGWNNADICGVDLAVKSH]AVG'FPVBVIISLVDGACSDMPTATMCAFEVNLASQN , ,i,t tl ....... : : [ [ '' ,, I i( ,, ~: LYNELNKLAESOWKEAKSVCLNLHIRSYVPV~LYAFCVIMWGHSSDAETASLCGAGVYLGDQE
(a)
NRSLNLPLLSLPFSRLLADLHDFQNRVKIACQFRDPEGFNVG~MLSFSSLEFSELKQTAFERNS
(b)
AAVLELPLVCSYLGNSLEDFDAYKRSLVLSq~FFGKSGLFAGQNVFGITAVEFTEYMPTSYGGIT
:III
(a) (b)
LLRDSWl,,,
''i i
i'
i'
,'
i~
SEI EKHACHGGGRCVASQGIVQTWEKEVNPPLKEYAPLVLPPVPQPKRNFI Jl i , , , J l l l l , J J J J JJ~ AGFNVVDALERDKEEPIK -MPNLDLEPVPRTQp-
HERDSWQAMLRNHQGKDKGRFI
I
I
i
I
I
i
DQQTGEV llJ IVRT~GEG
(a)
VQSFMOKSRSMRFKS -PSDLWSRPSVDGG . . . . . . . . . . . . . . . STSTQPPSKGSLRCENVPGC
(b)
KQPLLNKSRSMRI QSFVSFRGSNIPVGRRIDNTAEAINYELGRASTSSVSSRLDESKCNLK
Fig. 3. Alignmentsbetween sequencesto the N-terminal side of the coat protein in RNA-2-encodedpo]yproteins. The sequences were aligned by GAP. The sequencesare from residues351 (RRV, a) or 593 (TBRV, b), and are contiguousuntil the C-A (RRV) or K-A (TBRV) cleavage sites for the release of coat protein from the polyprotein. Matching amino acids are shown by a vertical line.
and TBRV (29~ identical by GAP), and GFLV and TomRV (39~ identical by GAP); comparisons between viruses in different groups gave estimates of about 1 9 ~ identity and no similarities were detectable by using the program COMPARE. This contrasts with the results of pairwise comparisons among the coat proteins of the four viruses, which suggest no marked grouping among them. A feature which is shared by the viruses in either pair is the genus of the nematode vector: RRV and TBRV are
transmitted by species of Longidorus whereas TomRV a n d G F L V a r e t r a n s m i t t e d by s p e c i e s o f Xiphinema. T h e s p e c i f i c i t y o f n e m a t o d e t r a n s m i s s i o n is d e t e r m i n e d by R N A - 2 ( H a r r i s o n et al., 1974) a n d c o r r e l a t e s w i t h t h e serological specificity of the coat protein (Harrison, 1964). It m a y b e t h a t a s e c o n d R N A - 2 - e n c o d e d p r o t e i n influences nematode transmission and that the sequence s i m i l a r i t i e s w e h a v e d e t e c t e d a r e in s o m e w a y r e l a t e d to t h i s f u n c t i o n . W h e t h e r o r n o t t h i s s p e c u l a t i o n p r o v e s to be c o r r e c t , it is i n t e r e s t i n g t h a t c o m p a r i s o n s b e t w e e n sequences of nepovirus RNA-encoded non-structural proteins have suggested relationships between viruses that correlate with their biological properties rather than w i t h t h e p h y s i c a l p r o p e r t i e s u s e d to d e f i n e s u b g r o u p s o f nepoviruses. We thank Mr A. Willis, MRC Immunochemistry Unit, Oxford, for the protein sequencing. This work was supported in part by flexibility funding from the Scottish Office Agriculture and Fisheries Department.
References ACOSTA,O. & MAYO,M. A. (1990a). Accumulation of different types of raspberry ringspot nepovirus particle in infected Nicotiana protoplasts. Journal of General Virology 71, 713-717.
2193
ACOSTA, O. & MAYO, M. A. (1990b). Unusual electrophoretic properties of the coat protein of raspberry ringspot nepovirus. Intervirology 31, 31-37. BERTIOLI, D. J., HARRIS, R. D., EDWARDS, M. L., COOPER, J. I. & HAWES, W. S. (1991). Transgenic plants and insect cells expressing the coat protein of arabis mosaic virus produce empty virus-like particles. Journal of General Virology 72, 1801-1809. BRAULT, V., HIBRAND, L., CANDRESSE, T., LE GALL, O. & DUNEZ, J. (1989). N ucleotide sequence and genetic organization of Hungarian grapevine chrome mosaic nepovirus RNA 2. Nucleic Acids Research 17, 7809 7819. DEMANGEAT,G., HEMMER,O., FRITSCH,C., LE GALL,O. & CANDRESSE, T. (1991). In vitro processing of the RNA-2-encoded polyprotein of two nepoviruses: tomato black ring virus and grapevine chrome mosaic virus. Journal of General Virology 72, 247-252. DEVEREOX, J., HAEBERLI, P. & SMITHIES, O. (1984). A comprehensive set of sequence analysis programs for the VAX. Nucleic Acids Research 12, 387-395. DOUGHERTY, W. G. & CARRINGTON, J. C. (1988). Expression and function of potyviral gene products. Annual Review of Phytopathology 26, 123-143. FRANCKI, R. I. B., MILNE, R. G. & HATYA, T. (1985). Nepovirus group. In Atlas of Plant Viruses, Vol. II, pp. 23-38. Raton: CRC Press. FUCHS, M., PINCK, M., SERGHINI, M. A., RAVELONANDRO, M., WALTER, B. & PINCK, L. (1989). The nucleotide sequence of satellite RNA in grapevine fanleaf virus, strain FI3. Journal of General Virology 70, 955-962. GUBLER, U. & HOFFMAN, B. J. (1983). A simple and very efficient method for generating eDNA libraries. Gene 25, 263-269. HARRISON, B. D. (1958). Further studies on raspberry ringspot and tomato black ring, soil-borne viruses that affect raspberry. Annals of Applied Biology 46, 571-584. HARRISON, B. D. (1964). Specific nematode vectors for serologically distinctive forms of raspberry ringspot and tomato black ring viruses. Virology 22, 544-550. HARRISON,B. D., MURANT,A. F. & MAYO, M. A. (1972). Evidence for two functional RNA species in raspberry ringspot virus. Journal oJ General Virology 16, 339-348. HARRISON, B. D., MURANT, A. F., MAYO, M. A. & ROBERTS, I. M. (1974). Distribution of determinants for symptom production, host range and nematode transmissibility between the two RNA components of raspberry ringspot virus. Journalof General Virology 22, 233-247. HEWlCK, R. M., HUNKERPILLER,M. W., HOOD, L. E. & DREYER,W. J. (1981). A gas-liquid solid phase peptide and protein sequenator. Journal of Biological Chemistry 256, 7990-7997. JONES, A. T., MAYO, M. A. & HENDERSON,S. M. (1985). Biological and biochemical properties of an isolate of cherry rasp leaf virus from red raspberry. Annals of Applied Biology 106, 101-110. MAYO,M. A., MURANT,A. F. &HARRISON,B. n. (1971). New evidence on the structure of nepoviruses. Journalof General Virology 12, 175178. MAYO, M. A., BARKER,H. & HARRISON,B. n. (1979). Polyadenylate in the RNA of five nepoviruses. Journal of General Virology 43, 603610. MAYO, M. A., BARKER,H. & HARRISON,B. D. (1982). Specificity and properties of the genome-linked proteins of nepoviruses. Journal of General Virology 59, 149-162. MAYO, M. A., ROBINSON,D. J., JOLLY, C. A. & HYMAN, L. (1989). Nucleotide sequence of potato leafroll luteovirus RNA. Journal of General Virology 70, 1037-1051. MAYO, M. A., JOLLY,C. A., MURANT,A. F. & RASCHKE,J. H. (1991). Nucleotide sequence of raspberry bushy dwarf virus RNA-3. Journal of General Virology 72, 469-472. MEYER, M., HEMMER,O., MAYO, M. A. & FRITSCH, C. (1986). The nucleotide sequence of tomato black ring virus RNA-2. Journal of General Virology 67, 1257-1271. MURANT, A. F. (1978). Raspberry ringspot virus. CMI/AAB Descriptions of Plant Viruses, no. 198. MURANT, A. F. (1981). Nepoviruses. In Handbook of Plant Virus Infections and Comparative Diagnosis, pp. 197-238. Edited by E. Kurstak. Amsterdam: Elsevier/North-Holland.
2194
V. C. Blok and others
MURANT,A. F., TAYLOR,M., DUNCAN,G. D. & RASCrIKE,J. E. (1981). Improved estimates of molecular weight of plant virus RNA by agarose gel electrophoresis and electron microscopy after denaturation with glyoxal. Journal of General Virology 53, 321-332. PALMENBERG, A. (1987). Genome organization, translation and processing in picornaviruses. In The MolecularBiology of the Positive Strand RNA Viruses, pp. 1-15. Edited by D. J. Rowlands, M. A. Mayo & B. W. J. Mahy. London: Academic Press. Roar, M. E., TREMAINE,J. H. & ROCrtON, D. M. (1991). Nucleotide sequence of tomato ringspot virus RNA-2. Journal of General Virology 72, 1505-1514. SAMBROOK, J., FR1~CH, E. F. & MANIATIS, T. (1989). Molecular Cloning: A Laboratory Manual, 2nd edn. New York: Cold Spring Harbor Laboratory. SANGER,F., NICKLEN,S. & COULSON,A. R. (1977). DNA sequencing
with chain-terminating inhibitors. Proceedings of the National Academy of Sciences, U.S.A. 74, 5463-5467. SERGHINI,M. A., Fucns, M., PINCK, M., REINBOLT,J., WALTER,B. & PINCK, L. (1990). RNA2 of grapevine fanleaf virus: sequence analysis and coat protein cistron location. Journalof General Virology 71, 1433-1441. STADEN,R. (1982). Automation of the computer handling of gel reading data produced by the shotgun method of DNA sequencing. Nucleic Acids Research 10, 4731~4751. STADEN, R. (1984). Graphic methods to determine the function of nucleic acid sequences. Nucleic Acids Research 12, 521-538. STRAUSS,J. H. & STRAUSS,E. (1990). Alphavirus proteinases. Seminars in Virology 1, 347-356.
(Received 29 January 1992; Accepted 11 May 1992)