Document not found! Please try again

Small-T-Antigen Gene - Journal of Virology - American Society for ...

2 downloads 0 Views 1MB Size Report
gressive multifocal leukoencephalopathy, a rare demyelinating disease, for ..... a case of progressive multifocal leukoencephalopathy. J. Infect. Dis. 127:467-470 ...
Vol. 45, No. 1

JOURNAL OF VIROLOGY, Jan. 1983, P. 73-79

0022-538X/83/010073-07$02.00/0 Copyright © 1982, American Society for Microbiology

Genomic Structure of Human Polyoma Virus JC: Nucleotide Sequence of the Region Containing Replication Origin and Small-T-Antigen Gene TATSUO MIYAMURA,' HIROYUKI JIKUYA,2 EIICHI SOEDA,2 AND KUNITO YOSHIIKEl* Department of Enteroviruses, National Institute of Health, Tokyo 141,1 and Department of Molecular Genetics, National Institute of Genetics, Misima 41J2 Japan Received 5 August 1982/Accepted 23 September 1982

The nucleotide sequence of the region of human polyoma virus JC DNA between 0.5 and 0.7 map units from a unique EcoRI cleavage site was determined and compared with those of the corresponding regions of another human polyoma virus, BK, and simian virus 40 DNAs. Within this region consisting of 945 base pairs, we located the origin of DNA replication near 0.7 map units, the entire coding region for small T antigen, and the splice junctions for large-T-antigen mRNA. The deduced amino acid sequences for small T antigen and the part of large T antigen markedly resembled those of polyoma virus BK and simian virus 40. The results strongly suggest that polyoma virus JC has the same organization of early genome as polyoma virus BK and simian virus 40 on the physical map, with the EcoRI site as a reference point. DNA with the genomic organization. From the similarity to simian virus 40 (SV40) and human polyoma virus BK (BKV) DNAs, we could locate the replication origin and the entire coding region for the small T antigen. Also found in this region were the signal sequences conserved near the splicing sites of eucaryotic mRNA.

Human polyoma virus JC (JCV) has been isolated from brain tissues of patients with progressive multifocal leukoencephalopathy, a rare demyelinating disease, for which JCV is probably responsible as an etiological agent (9, 19). Serological surveys have revealed that JCV infection is common among human populations (2, 18). Like other polyoma viruses, JCV has tumorigenic and transforming capacities (7, 20, 33, 35). Therefore, studies of JCV are both etiologically and biologically important. Biological and biochemical studies on JCV, however, have been greatly hampered because JCV can grow efficiently only in primary human fetal glia cell cultures rich in spongioblasts (17) and because viral DNA from the infected cells is usually heterogeneous in size (12, 16). Although JCV has been successfully adapted to growth in more readily available cells, such as human amnion cells (31) and human embryonic kidney (HEK) cells (14), the viral DNA thus obtained is heterogeneous. A JCV mutant growing well in HEK cells has been found to contain two classes of defective DNAs, which probably complement each other for viral growth (38). Molecular cloning of JCV DNA in a bacterial vector (10) from DNA obtained by a low multiplicity of infection of primary human fetal glia cells (12) has now made detailed analyses of the JCV genome possible. In the present study, we determined the nucleotide sequence of the region near 0.7 map units from EcoRI site in an attempt to correlate the physical map of JCV

MATERIALS AND METHODS JCV DNA. A restriction endonuclease cleavage map of JCV (Mad-1) DNA (10, 12, 22) is shown in Fig. 1. A recombinant plasmid, pJC(1-4), which has a JCV DNA insert at the BamHI site of plasmid pBR322 (10), was used. DNA was extracted and purified as described previously (38). Enzymes. Restriction endonucleases BamHI, HaeIII, HindIll, HpaI, PstI, and PvuII were generously provided by Takara Shuzo Co., Ltd., Kyoto, Japan, and were used as recommended by the manufacturer. Bacterial alkaline phosphatase and T4 polynucleotide kinase were purchased from Boehringei Mannheim Biochemicals, Indianapolis, Ind. DNA sequencing. Nucleotide sequences were determined by the method of Maxam and Gilbert (13) with a strategy illustrated in Fig. 1. pJC(1-4) DNA digested with HindIII, BamHI, HpaI, and PstI was subjected to preparative electrophoresis in 5.5% polyacrylamide gels. After elution, HindIII-B and the shorter HpaIcleaved HindIII-A were further cleaved each into two subfragments with HaeIII and PvuII, respectively. Double-stranded fragments isolated from gels were labeled at their 5' ends with [y-32PIATP (specific activity, 2,000 Ci/mmol, Amersham International) and T4 polynucleotide kinase. The labeled DNA was denatured by heat and separated into each strand on 73

MIYAMURA ET AL.

74

J. VIROL.

PvujU

PstI

PvuII .......

.-

lPst I

.I...

HindIll-A

^_-

I

HindIll-C

*w -~--*-

Hindi -0

I

FIG. 1. Restriction endonuclease cleavage map of JCV (Mad-1) DNA and schematic representation of the strategy for sequencing. The cleavage sites are from published data (10, 12, 22). The straight arrows indicate the direction and approximate extent of sequence determination from the indicated restriction enzyme sites. 5% polyacrylamide-0.1% bisacrylamide gels. Fragments labeled at one end were submitted to the four sets of partial degradation reaction of A > C, G, C, and C + T. The degradation products were run on 0.5mm-thick 12% polyacrylamide-0.62% bisacrylamide-7 M urea gels. Autoradiography was done at -70°C with an intensifying screen (Kodak X-Omatic).

RESULTS

Like SV40 and BKV DNAs, JCV DNA has a unique EcoRI site that can be used as a reference point (Fig. 1). Cross-hybridization under less stringent conditions between restriction endonuclease cleavage fragments of these three polyoma viruses has shown that their physical maps could be colinearly aligned by using the EcoRI site as the 0 map position (11). To locate the replication origin precisely on the physical map of JCV DNA, we determined the nucleotide sequence of the area presumably containing the replication origin and coding for 5' end of the early mRNAs and compared it with the corresponding areas of BKV and SV40 DNAs, whose entire sequences and relation to the genetic maps have been established (5, 21, 24, 37). Figure 2 shows the nucleotide sequence of 945 base pairs covering the area from 0.5 to 0.7 map units. The origin of JCV DNA replication was located near the HindIII-A/C junction (Fig. 1). Since the origin of SV40 DNA replication has been confined within a narrow region around the BglI

cleavage site (8, 25), the similar nucleotide sequence to that region was searched for in the JCV DNA sequence. The replication origin of JCV DNA was assigned to the region of nucleotides 36 to 121 (Fig. 2), which shows extensive homology with the replication origin of both SV40 (30) and BKV DNA (3). The structures of replication origins are compared among the three polyoma viruses in Fig. 3. The nucleotide sequence of the replication origin can be divided into three parts with different structural features, AT stretch (nucleotides from 36 to 50; boxed in Fig. 2), two pairs of the inverted repeat sequences (nucleotides 54 to 102), and a palindromic sequence with mirror image form (nucleotides 105 to 121; boxed by dotted line in Fig. 2 and 3). Goldberg-Hogness box structure (24) was detected at the right end of the AT stretch (nucleotides 43 to 50 in Fig. 2 and 3). The region between AT stretch and the palindrome was rich in GC, with intrastrand symmetry which could form two pairs of double hairpin loops, as indicated by the arrows in Fig. 3.

The coding region for small T antigen was found to initiate from ATG at nucleotides 182 to 184 and terminate at TAA at nucleotides 698 to 700 (Fig. 2). This ATG was the first initiation codon in the longest open reading frame downstream from the replication origin. The topology of the initiation codon is similar to that of BKV and SV40 (4, 34). Consequently, JC small-T-

75

JCV DNA REPLICATION ORIGIN

VOL. 45, 1983

80

5' ATGAGCTCAT GCTTGGCTGG CAGCCATCCC TTCC TTTTT TTTATATATA CAGGAGGCCG AGGCCGCCTC CGCCTCCAAG TACTCGAGTA CGAACCGACC GTCGGTAGGG AAGGGAAAAA AAATATATAT GTCCTCCGGC TCCGGCGGAG GCGGAGGTTC HindIII

TTTTTAGGAG G3CAGGGAAA TTCCCTTGTT TTTCCCTTTT TTGCAGTAAT CTTACTCAGA AGTAGTAAGG GCG$g.GAGGC GAATGAGTCT TCATCATTCC CGCACCTCCG AAAAATCCTC CGGTCCCTTT AAGGGAACAA AAAGGGAAAA AACGTCATTA ..........................* *

*

@ -

240

PEACAAA

TTTTTGCTGC AAAAAGCTAA GTGCTGAATA GGGAGGAATC CATGGAGCTT ATGGATTTAT TAGGCCTTGA AAAAACGACG TTTTTCGATT TTACCTGTTT CACGACTTAT CCCTCCTTAG GTACCTCGAA TACCTAAATA ATCCGGAACT 320

TAGGTCTGCA TGGGGGAACA TTCCTGTCAT GAGAAAAGCT TATCTGAAAA AATGCAAAGA ACTCCACCCT GATAAAGGTG ATCCAGACGT ACCCCCTTGT AAGGACAGTA CTCTTTTCGA ATAGACTTTT TTACGTTTCT TGAGGTGGGA CTATTTCCAC HindIII

400

GGGACGAAGA CAAGATGAAG AGAATGAATT TTTTATATAA AAAAATGGAA CAAGGTGTAA AAGTTGCTCA TCAGCCTGAT CCCTGCTTCT GTTCTACTTC TCTTACTTAA AAAATATATT TTTTTACCTT GTTCCACATT TTCAACGAGT AGTCGGACTA 480

1

TTTGGTACAT GGAATAGTTC AGAGGTTGGT TGTGATTTTC CTCCTAATTC TGATACCCTT TATTGCAAGG AATGGCCTAA AAACCATGTA CCTTATCAAG TCTCCAACCA ACACTAAAAG GAGGATTAAG ACTATGGGAA ATAACGTTCC TTACCGGATT

560

CTGTGCCACT AATCCTTCAG TGCATTGCCC CTGTTTAATG TGCATGCTAA AATTAAGGCA TAGAAACAGA AAATTTTTAA GACACGGTGA TTAGGAAGTC ACGTAACGGG GACAAATTAC ACGTACGATT TTAATTCCGT ATCTTTGTCT TTTAAAAATT 640

GAAGCAGCCC ACTTGTGTGG ATAGATTGCT ATTGCTTTGA TTGCTTCAGA CAATGGTTTG GGTGTGACTT AACCCAAGAA CTTCGTCGGG TGAACACACC TATCTAACGA TAACGAAACT AACGAAGTCT GTTACCAAAC CCACACTGAA TTGGGTTCTT

AAAGCTTE2

720

GGTAAACCAC TATTTTTCTT GCTCTTCATT GCTGGGAGAA AGTTCTTGGA GACACCCCCT ACAGGGATCT CGAGAAGTAA CGACCCTCTT TCAAGAACCT CTGTGGGGGA TGTCCCTAGA TTTCGAAATT CCATTTGGTG ATAAAAAGAA HindIII 1

800

TTGCAGGGCA TTTTGTTTTT TACAATAAAC TTAATTGTTT TTTTTTAGGT GCCAACCTAT GGAACAGATG AATGGGAATC AACGTCCCGT AAAACAAAAA ATGTTATTTG AATTAACAAA AAAAAATCCA CGGTTGGATA CCTTGTCTAC TTACCCTTAG

880

CTGGTGGAAT ACATTTAATG AGAAGTGGGA TGAAGACCTG TTTTGCCATG AAGAAATGTT TGCCAGTGAT GATGAAAACA GACCACCTTA TGTAAATTAC TCTTCACCCT ACTTCTGGAC AAAACGGTAC TTCTTTACAA ACGGTCACTA CTACTTTTGT CAGGATCCCA ACACTCTACC CCACCTAAAA AGAAAAAAAA GGTAGAAGAC CCTAAAGACT TTCCT GTCCTAGGGT TGTGAGATGG GGTGGATTTT TCTTTTTTTT CCATCTTCTG GGATTTCTGA AAGGA 5' BamHI

FIG. 2. Nucleotide sequence of JCV DNA: the region containing replication origin and small-T-antigen gene. Numbering is given from near the HindIII-A/C junction (nucleotide 1) to near the BamHI site (nucleotide 945). We identified the orientation of the strands from the similarity to BKV and SV40 DNAs. The upper strand, read from left to right, is believed to have the same polarity as early mRNA. The AT stretch within the replication origin is indicated in the solid line box (nucleotides 36 to 50). The dotted line box indicates the palindrome (nucleotides 105 to 121). ATG at nucleotides 182 to 184 is considered to be the initiation codon for T antigen mRNA, and TAA at nucleotides 698 to 700 is considered to be the termination codon. The arrows refer to the splice junctions for large-T-antigen mRNA. The arrow between nucleotides 424 and 425 is for the donor site, and the other, between nucleotides 768 and 769, is for the acceptor site (see Fig. 6).

76

J. VIROL.

MIYAMURA ET AL.

JCv

51

CT

--

TTTTTTT

BKV

5'

CT

--

TTTTTTT T A TAA

SV40

5'

CT

AA

TTTTTTTTT

-

T T

A TA-

A

A TA

-CAG

C4GCCTC

CAAG

A TA TAA- GAGGCCGAGGCCGCCTC T GCCTC CACC TA TGCA

--- -

80

GAGGCCGAGGCCGCCTC

GAGGCCGAGGCCGCCTC

G

GCCTC TGAG

BglI

:.......................

AGIAAGTAGT| A 7AGGGCGT: GGAGGCTTTTT CTITTCT CITC AAGTAGT Al A|GGGTGT' GGAGGCTTTTT CTI TACT m

CT ATTC C AG AAGTAGT G

............ _130 S

AG GAGGfC

-

AG GGAAA

GAGG;ZC T AG CAAAA A-!GGAGGCTTTTT TG GAGGCC T AG* d

.... .

CT

*@ *@ @ .... ....................s@

184

T TTTTGCA GTAATTTTTTGCTGCAAAAA GC T AA A ATG

3'

--ATTTTT GC A AA A ATG

3'

---GC TTTTGCA AAAAGC-TTT GC A AA G ATGL

3'

TTCCCTTGTTTTT-CCCT

-

CTATTTGGGGAAATCCCTATTC TTTTGCA

---

FIG. 3. Comparison of the sequences around the replication origins of JCV, BKV, and SV40 DNAs. Boxes indicate the sequence homology among the three DNAs in this region. Nucleotides have been aligned to maximize homology. Arrows indicate the areas with intrastrand symmetry which could form hairpin loops. The 17-nucleotide palindromic structures are boxed by dotted lines. The BglI restriction site (GCCN5GGC) is for SV40 DNA. Numbering for JCV DNA is from Fig. 2. In BKV DNA, the first C and the last G correspond to nucleotides 140 and 5151, respectively, from Seif et al. (24). In SV40 DNA, the first C and the last G are at nucleotides 5193 and 5079, respectively, from Reddy et al. (21).

antigen gene can code for 172 amino acids, like those of BKV and SV40 encoding 172 and 174 amino acids, respectively. Furthermore, the deduced amino acid sequences of small T antigens showed extensive homology among the three polyoma viruses (Fig. 4). From the homology with the signal sequences proposed for splicing of large-T-antigen mRNA in SV40, BKV, and mouse polyoma virus (23), we identified the sequence 5'TCAGAG/ GTTGGT3' at nucleotides 419 to 430 as a donor site and the sequence 5'TTTTTTTAG/ GTGCCA3' at nucleotides 760 to 774 as an acceptor site of large-T-antigen mRNA of JCV (Fig. 5). In the nucleotide sequence downstream from the acceptor site of JCV, one reading frame was left open and the other two were closed. The amino acid sequence predicted from the open reading frame downstream from the acceptor site was similar to those from BKV and SV40 T antigen genes in the corresponding regions (Fig. 6). DISCUSSION The complete nucleotide sequences and the genomic organization for BKV, SV40, and mouse polyoma virus have been established, and several recognition sequences for regulatory functions have been mapped on the nucleotide sequences (5, 21, 24, 28, 37). From the homology determined by DNA-DNA hybridization, the cleavage map of JCV DNA has been aligned, with reference to EcoRI site, to those of BKV

and SV40 DNAs (11), but little is known about the genetic map and genomic organization of JCV. In the present study, we attempted to locate the origin of DNA replication and the proximal part of the early region of JCV genome by comparing JCV sequences to those of BKV and SV40 to correlate the physical map to the genomic organization. Apparently, the similarity of the sequences for replication origin and of the amino acid sequences for putative small T antigen between JCV and other primate polyoma viruses proves that JCV has early genome organization similar to that of BKV and SV40 on the physical map of DNA with the conserved single EcoRI site in each genome as a reference point. Since the structural features of the region around the HindIII-A/C junction of JCV DNA (at nucleotides 36 to 121) were strikingly similar to those of the replication origins of BKV and SV40 (3, 30), this area was regarded as the replication origin of JCV DNA (Fig. 3). The area is composed of a long AT stretch, two inverted repeat sequences, and a palindrome. At the right end of the AT stretch, the three primate polyoma viruses had sequences similar to the Goldberg-Hogness box sequence of 5'TATAAATA3' (24). By analogy to SV40 (1), sequence 5'TATATATA3' of JCV (nucleotides 43 to 50) is probably related to the early promoter region. The AT-rich sequence was followed by the GCrich region which could form double hairpin loops (Fig. 3). Although it is unclear whether this structure is actually formed in vivo, it is

JCV DNA REPLICATION ORIGIN

VOL. 45, 1983

JCV Met Asp Lys Val Leu Asn Arg Glu Glu Ser Met Glu Leu Met Asp BKV Met Asp Lys Val Leu Asn Arg Glu Glu Ser Met Glu Leu Met Asp SV40 Met Asp Lys Val Leu Asn Arg Glu Glu Ser Leu Gln Leu Met Asp

77

Leu Leu Gly Leu Asp Arg Ser Ala Trp Gly Leu Leu Gly Leu Glu Arg Ala Ala Trp Gly Leu Leu Gly Leu Glu Arg Ser Ala Trp Gly

Asn Ile Pro Val Met Arg Lys Ala Tyr Leu Lys Lys Cys Lys Glu Leu His Pro Asp Lys Gly Gly Asp Glu Asp Asn Leu Pro Leu Met Arg Lys Ala Tyr Leu Arg Lys Cys Lys Glu Phe His Pro Asp Lys Gly Gly Asp Glu Asp Asn Ile Pro Leu Met Arg Lys Ala Tyr Leu Lys [ Cys Lys Glu Phe His Pro Asp Lys Gly Gly Asp Glu Glu Lys Met Lys Arg Met Asn Phe Leu Tyr Lys Lys Met Glu Gln Gly Val Lys Val Ala His Gln Pro Asp Phe Gly Lys Met Lys Arg Met Asn Thr Leu Tyr Lys Lys Met Glu Gln Asp Val Lys Val Ala His Gln Pro Asp Phe Gly Lys Met Lys Lys Met Asn Thr Leu Tyr Lys Lys Met Glu Asp Gly Val Lys Tyr Ala His Gln Pro Asp Phe Gly

Thr Asp Phe Pro Pro Asn Ser Asp Thr Leu Tyr Cys Lys| Glu Trp Asn Ser Ser Glu Val Gly Cys Thr - Trp Ser Ser Ser Glu Val Cys Ala Asp - Phe Pro Leu Cys Pro Asp Thr Leu Tyr Cys Lys Glu Gly Phe Trp Asp Ala Thr Glu Val Phe Ala Ser Ser Leu Asn Pro Gly Val AsAla MetjTyr Cys LysjGln -

-

-

-

-

Trp Pro Asn Cys Ala Thr Asn Pro S Val His s Pro Cys Leu Met Cys Met Leu Lys Leu Arg His Arg Asn Trp Pro Ile Cys Ser Lys Lys Pro Ser Val His Cys Pro Cys Met Leu Cys Gln Leu Arg Leu Arg His Leu Asn Trp Pro Glu C Ala Lys Lys Met S Ala Asn C Ile&s Leu Leu C Leu L Arg Met Lys His Glu Asn Arg Lys Phe Leu Arg Ser Ser Pro Leu Val Trp Ile Asp Cys Tyr Cys Phe Asp Cys Phe Arg Gln Trp Phe Gly Arg Lys Phe Leu Arg Lys Glu Pro Leu Val Trp Ile Asp Cys Tyr Cys Ile Asp Cys Phe Thr Gln Trp Phe Gly Arg Lys Leu Tyr A Lys Asp Pro Leu Val Trp Val Asp Cys Tyr Cys Phe Asp Cys Phe Arg Met|Trp Phe Gly Cys Asp Leu Thr Gln Glu Ala Leu His Cys Trp Glu Lys Val Leu GlAsp Thr Pro Tyr Arg Asp Leu Lys Leu Leu Asp Leu Thr Glu Glu Thr ILeul Gin Trp ITrpI Val Gin Ile Ile Gly Glu Thr Pro Phe Arg Asp Leu Lys Leu Gln Thr Thr Tyr Arg Asp Leu Lys Leu Leu Asp Leu Cys Glu Gly ThrL Leu Leu Trp Cys Asp Ile Ile

FIG. 4. Amino acid sequences of JCV, BKV, and SV40 small T antigens. The sequence was deduced from the nucleotide sequence. The sequences for BKV and SV40 are from published data (4, 34, 36). Gaps were left where necessary to maximize homology. The boxes indicate the homologous amino acids among JCV, BKV, and SV40. The arrow between glutamine and valine residues divides the amino terminal region (common to large T antigens) from the carboxyl terminal region of these proteins.

possible that the higher structures of DNA near the replication origin affect the expression of the viral genome. In the first hairpin loop indicated by an arrow in Fig. 3, 22 out of 23 base pairs were identical to each other among JCV, BKV, and SV40. The composition of the second hairpin structure, on the other hand, was less homologous, but exactly the same sequence of 5'AAGTAGTAAGGG3' was found. Another highly homologous sequence among the three polyoma virus DNAs was found in 17-base pair palindromes. Of 17 nucleotides, 15 were identical. Although the general features of replication origin were shared by mouse polyoma virus (6, 29), the strong homology of nucleotide sequences was detected only among the three primate polyoma viruses. There was a stretch of 60 nucleotides between the palindrome and the initiation codon for T antigens (Fig. 3). Although the homology in this noncoding region among the three primate polyoma viruses was weaker than that in the replication origin, they had repeats of 5'TnGC3' and 5'GCAn3', which could form hairpin loops (indicated by arrows in Fig. 3), in this region. The structure of the corresponding region in mouse polyoma virus DNA (27) is distinct from those of the primate polyoma viruses.

It seems reasonable to consider the ATG codon at nucleotides 182 to 184 to be an initiation signal for translation of T antigen messengers for the following reasons: (i) this was the first initiation codon in the only long open reading frame in this region; (ii) its topology was almost the same as that of its BKV and SV40 counterparts; (iii) the deduced amino acid sequence for JCV small T antigen resembled those of BKV and SV40 (Fig. 4). Thus, the small-T-antigen gene of JCV can code for 172 amino acids (Fig. 4). In the Nterminal regions, where the small T antigen shares its amino acids with the large T antigen,

JCV

5' TCAGA

TTGGT.....................TTTTTTTA

BKV

5' TCAGA

TTTGT.....................TTTTTATAG GTGCCA 3'

SV40

5' ACTGAG TATTT....................GTATTTTAG TTCCA 3'

Py

5' TTCCA

TGCCA 3'

TAAGA......................TTCTTACAG GCTCT 3'

FIG. 5. Splicing signal sequences for large-T-antigen mRNA of polyoma viruses. The intervening sequences of JCV, BKV, SV40, and mouse polyoma (Py) virus are 344, 344, 346, and 386 nucleotides long, respectively. The sequences for BKV, SV40, and mouse polyoma virus are from published data (5, 21, 24, 28).

78

MIYAMURA ET AL.

J. VIROL.

JCV Val Pro BKV Val Pro SV40 Il1e I--I Pro I' -u

Thr Phe Asn Gliu Lys Ser Phe Asn Glu Lys'

I'

Asp Asp

Giu

Asp Len Asp Leu Asn Leu

A1lA Php Ac,n rlij _ U U -l U"AJ -I J'

I

Cy: Cys

'Ala

'la er Gilu His Glu Asp Met Ser Cys Ser Glu Glu Met Pro Ser

His

sp

Asp Asp

Asp

Asn

r

GiuuG1u Ala

Thr

Asp Glu Ala

Ser Gln His Ser Thr Pro Pro Lys Lys Lys Lys Lys Val Glu Asp Pro Lys Asp Ser Gln His Ser Thr Pro Pro Lys Lys Lys Arg Lys Val Glu Asp Pro Lys Asp Ser Gln His Ser Thr Pro Pro Lys Lys Lys Arg Lys Val Glu Asp Pro Lys Asp

Gly

Ala Asp Ala Asp

Pro

......

Pro ...... Pro ......

FIG. 6. Amino acid sequences encoded by the areas just downstream from the splice junction for large-Tantigen mRNA in JCV, BKV, and SV40. Homologous amino acids are shown by boxes. Aromatic amino acids (e.g., tyrosine, tryptophan, and phenylalanine) are shown by shading. Sequences for BKV and SV40 are from published data (5, 21, 24).

JCV and BKV shared 72 of 81 amino acids (89%). The homology between JCV and SV40 was 83% (67 of 81), and that between BKV and SV40 was also 83%. Amino acids shared by the three viruses in the proximal half of the small T antigen were as many as 63 (78%). The rest of the small-T-antigen coding region seemed to be diverged somewhat more from BKV and SV40 than the former region shared by the large T antigen. In this region, 63 of 91 amino acids (69%) were common to JCV and BKV, 49 (54%) to JCV and SV40, and 54 (59%) to BKV and SV40. The three viruses shared only 43 of 91 amino acids (47%) but had a common stretch of five amino acids, Arg-Asp-Leu-Lys-Leu, followed by stop codon TAA. These data are consistent with the earlier reports showing the relatedness of T antigens of the three primate polyoma viruses by immunological cross-reactivity (32, 35) or 35S methionine-labeled tryptic peptide patterns (26). The signal sequences for splicing of mRNA are conserved among eucaryotic species, and the consensus sequences have been proposed (15). The latter half of the small-T-antigen gene is spliced out in large-T-antigen mRNAs of papovaviruses. The splicing signals for JCV largeT-antigen mRNA agree well with those of polyoma viruses (23) (Fig. 5) and the consensus sequences (15). The amino acid sequence encoded downstream from the splicing site resembled those of BKV and SV40 (Fig. 6). Like BKV, SV40, and mouse polyoma virus (24), JCV had a stretch of aromatic amino acids sequence in the region just downstream from the splicing signal. It has not been determined yet whether JCV has a virus-coded middle T antigen like that of mouse polyoma virus. Downstream from the splice junction (acceptor site), mouse polyoma virus DNA has two long open reading frames, which are used for encoding large and middle T antigens, respectively (28). In the corresponding region of JCV DNA, the two frames not used for encoding large T antigen were short (Fig. 2), and

the amino acid sequences deduced from these two frames (data not shown) had no homology with that of mouse polyoma virus middle T antigen (28). The topology of coding frames of JCV in this area resembled those of SV40 and BKV. We conclude, therefore, that JCV cannot code for the protein like mouse polyoma virus middle T antigen. The present study showed that, in the region containing the replication origin and small-Tantigen gene, the structure of JCV DNA resembles those of the other polyoma viruses. The homology in the nucleotide or amino acid sequence was higher between the primate polyoma viruses than between JCV and mouse polyoma virus (28) and was the highest between JCV and BKV. Despite the similarity between JCV and BKV in this region, the two viruses have different pathogenicity in humans and different host ranges for cultured cells. The sequences responsible for the biological difference between the two viruses remain to be studied. A host range mutant of JCV, JC-HEK, can grow well in HEK cell cultures (14), whereas wild-type JCV (Mad-1) can grow efficiently only in primary human fetal glia cell cultures (17). Interestingly, mutant JC-HEK has now the same host range as BKV. It is possible that comparison of the nucleotide sequence between JCV and JC-HEK, and between JCV and BKV, will reveal the structure that determines the host range of these viruses. A study of the structure of JC-HEK DNA is now in progress. ACKNOWLEDGMENTS This work was supported in part by grants-in-aid for cancer research from the Ministry of Education, Science, and Culture, Japan. We thank Peter M. Howley, National Cancer Institute, Bethesda, Md., for his generous gift of cloned JCV DNA. LITERATURE CITED 1. Benoist, C., and P. Chambon. 1981. In vivo sequence requirements of the SV40 early promoter region. Nature (London) 290:304-310.

VOL. 45, 1983 2. Brown, P., T. Tsai, and D. C. Gajdusek. 1975. Seroepidemiology of human papovaviruses. Am. J. Epidemiol. 102:331-340. 3. Dhar, R., C.-J. Lai, and G. Khoury. 1978. Nucleotide sequence of the DNA replication origin for human papovavirus BKV: sequence and structural homology with SV40. Cell 13:345-358. 4. Dhar, R., I. Self, and G. Khoury. 1979. Nucleotide sequence of the BK virus DNA segment encoding small t antigen. Proc. Natl. Acad. Sci. U.S.A. 76:565-569. 5. Fiers, W., R. Contreras, G. Haegeman, R. Rogiers, A. Van de Voorde, H. Van Heuverswyn, J. Van Herreweghe, G. Volckaert, and M. Ysebaert. 1978. Complete nucleotide sequence of SV40 DNA. Nature (London) 273:113-120. 6. Friedmann, T., P. LaPorta, and A. Esty. 1978. Nucleotide sequence studies of polyoma DNA. The HpaII 3/5 junction to the HpaII 4/HaeIII 18 junction, encoding the origin of DNA replication and the 5' end of the early region. J. Biol. Chem. 253:6561-6567. 7. Frisque, R. J., D. B. Riflin, and D. L. Walker. 1980. Transformation of primary hamster brain cells with JC virus and its DNA. J. Virol. 35:265-269. 8. Hay, R. T., and M. L. DePamphilis. 1982. Initiation of SV40 DNA replication in vivo: location and structure of 5' ends of DNA synthesized in the ori region. Cell 28:767779. 9. Howley, P. M. 1980. Molecular biology of SV40 and the human polyomaviruses BK and JC, p. 489-550. In G. Klein (ed.), Viral oncology. Raven Press, New York. 10. Howley, P. M., F. Rentier-Delrue, C. A. Heilman, M.-F. Law, K. Chowdhury, M. A. Israel, and K. K. Takemoto. 1980. Cloned human polyomavirus JC DNA can transform human amnion cells. J. Virol. 36:878-882. 11. Law, M.-F., J. D. Martin, K. K. Takemoto, and P. M. Howley. 1979. The colinear alignment of genomes of papovaviruses, JC, BK, and SV40. Virology 96:576-587. 12. Martin, J. D., R. J. Frisque, B. L. Padgett, and D. L. Walker. 1979. Restriction endonuclease cleavage map of the DNA of JC virus. J. Virol. 29:846-855. 13. Maxam, A. M., and W. Gilbert. 1980. Sequencing endlabeled DNA with base-specific chemical cleavages. Methods Enzymol. 65:499-560. 14. Miyamura, T., K. Yoshilke, and K. K. Takemoto. 1980. Characterization of JC papovavirus adapted to growth in human embryonic kidney cells. J. Virol. 35:498-504. 15. Mount, S. M. 1982. A catalogue of splice junction sequences. Nucleic Acids Res. 10:459-472. 16. Osborn, J. E., S. M. Robertson, B. L. Padgett, G. M. ZuRhein, D. L. Walker, and B. Weisblum. 1974. Comparison of JC and BK human papovaviruses with simian virus 40: restriction endonuclease digestion and gel electrophoresis of resultant fragments. J. Virol. 13:614-622. 17. Padgett, B. L., C. M. Rogers, and D. L. Walker. 1977. JC virus, a human polyomavirus associated with progressive multifocal leukoencephalopathy: additional biological characteristics and antigenic relationships. Infect. Immun. 15:656-662. 18. Padgett, B. L., and D. L. Walker. 1973. Prevalence of antibodies in human sera against JC virus, an isolate from a case of progressive multifocal leukoencephalopathy. J. Infect. Dis. 127:467-470. 19. Padgett, B. L., and D. L. Walker. 1976. New human papovaviruses. Prog. Med. Virol. 22:1-35. 20. Padgett, B. L., D. L. Walker, G. M. ZuRhein, and J. N. Varakis. 1977. Differential neurooncogenicity of strains of JC virus, a human polyoma virus, in newborn Syrian

JCV DNA REPLICATION ORIGIN

79

hamsters. Cancer Res. 37:718-720. 21. Reddy, V. B., B. Thlimmappaya, R. Dhar, K. N. Subramnanan, B. S. Zain, J. Pan, P. K. Ghosh, M. L. Celma, and S. M. Weissman. 1978. The genome of simian virus 40. Science 200:494-502. 22. Rentier-Delrue, F., A. Lubnlecki, and P. M. Howley. 1981. Analysis of JC virus DNA purified directly from human progressive multifocal leukoencephalopathy brains. J. Virol. 38:761-769. 23. Seif, I., G. Khoury, and R. Dhar. 1979. BKV splice sequences based on analysis of preferred donor and acceptor sites. Nucleic Acids Res. 6:3387-3398. 24. Self, I., G. Khoury, and R. Dhar. 1979. The genome of human papovavirus BKV. Cell 18:963-977. 25. Shortle, D. R., R. F. Margolskee, and D. Nathans. 1979. Mutational analysis of the simian virus 40 replicon: pseudorevertants of mutants with a defective replication origin. Proc. Nati. Acad. Sci. U.S.A. 76:6128-6131. 26. Shnmons, D. T., and M. A. Martin. 1978. Common methionine-tryptic peptides near the amino-terminal end of primate papovavirus tumor antigens. Proc. Natl. Acad. Sci. U.S.A. 75:1131-1135. 27. Soeda, E., J. R. Arrand, N. Smolar, and B. E. Griffin. 1979. Sequence from early region of polyoma virus DNA containing viral replication origin and encoding small, middle and (part of) large T antigens. Cell 17:357-370. 28. Soeda, E., J. R. Arrand, N. Smolar, J. E. Walsh, and B. E. Griffin. 1980. Coding potential and regulatory signals of the polyoma virus genome. Nature (London) 283:445-453. 29. Soeda, E., G. Kimura, and K. Miura. 1978. Similarity of nucleotide sequences around the origin of DNA replication in mouse polyoma virus and simian virus 40. Proc. Natl. Acad. Sci. U.S.A. 75:162-166. 30. Subrama , K. N., R. Dhar, and S. M. Weissman. 1977. Nucleotide sequence of a fragment of SV40 DNA that contains the origin of DNA replication and specifies the 5' ends of "early" and "late" viral RNA. III. Construction of the total sequence of EcoRII-G fragment of SV40 DNA. J. Biol. Chem. 252:355-367. 31. Takemoto, K. K., P. M. Howley, and T. Miyamura. 1979. JC human papovavirus replication in human amnion cells. J. Virol. 30:384-389. 32. Takemoto, K. K., and M. A. Martin. 1976. Transformation of hamster kidney cells by BK papovavirus DNA. J. Virol. 17:247-253. 33. Varakis, J., G. M. ZuRhein, B. L. Padgett, and D. L. Walker. 1978. Induction of peripheral neuroblastomas in Syrian hamsters after injection as neonates with JC virus, a human polyoma virus. Cancer Res. 38:1718-1722. 34. Volckaert, G., A. Van de Voorde, and W. Fkers. 1978. Nucleotide sequence of the simian virus 40 small-t gene. Proc. Natl. Acad. Sci. U.S.A. 75:2160-2164. 35. Walker, D. L., B. L. Padgett, G. M. ZuRhein, A. E. Albert, and R. F. Marsh. 1973. Human papovavirus (JC): induction of brain tumors in hamsters. Science 181:674676. 36. Yang, R. C. A., and R. Wu. 1979. BK virus DNA sequence coding for the amino-terminus of the T-antigen. Virology 92:340-352. 37. Yang, R. C. A., and R. Wu. 1979. BK virus DNA: complete nucleotide sequence of a human papovavirus. Science 206:456-462. 38. Yoshlke, K., T. Mlyamura, H. W. Chan, and K. K. Takemoto. 1982. Two defective DNAs of human polyomavirus JC adapted to growth in human embryonic kidney cells. J. Virol. 42:395-401.