Molecular Cloning and Characterization of a ... - Journal of Virology

1 downloads 0 Views 1MB Size Report
Molecular Cloning and Characterization of a Complete Chinese ... Imperfect long ORFs encode presumed gag and pol gene ...... cloning: a laboratory manual.
JOURNAL OF VIROLOGY, Sept. 1991, p. 4713-4719

Vol. 65, No. 9

0022-538X/91/094713-07$02.00/0 Copyright X) 1991, American Society for Microbiology

Molecular Cloning and Characterization of a Complete Chinese Hamster Provirus Related to Intracisternal A Particle Genomes ANDREW J. DORNER,* FRAN(OISE BONNEVILLE, RONALD KRIZ, KERRY KELLEHER, KEVIN BEAN, AND RANDAL J. KAUFMAN Genetics Institute, 87 Cambridge Park Drive, Cambridge, Massachusetts 02140 Received 19 March 1991/Accepted 4 June 1991

We report here the nucleotide sequence of a full-length Chinese hamster genomic proviral element, CHIAP34. CHIAP34 is 6,403 bp long with long terminal repeats of 311 bp at each end. The genetic organization of CHIAP34 was determined by comparison with intracisternal A particle (IAP) genetic elements from the mouse and Syrian hamster. Extensive homology at the nucleotide and deduced amino acid sequence levels was observed between CHIAP34 and the mouse and Syrian hamster IAP elements. CHIAP34 may represent a defective Chinese hamster IAP genetic element. The gag gene consists of 837 codons, of which 558 codons are in a single long open reading frame followed by several frameshifts. The pol gene begins with a -1 frameshift and consists of a long open reading frame of 753 codons followed by a short open reading frame of 103 codons. The putative env region contains multiple termination codons in all reading frames. CHIAP34 is representative of the predominant retroviral elements in the Chinese hamster ovary cell genome present at around 80 copies per haploid genome.

A full-length provirus, CHIAP34, was molecularly cloned, and its complete nucleotide sequence was determined. CHIAP34 is 6,403 bp long with LTRs of 311 bp at each end. Imperfect long ORFs encode presumed gag and pol gene products with extensive similarity to those encoded by the mouse and Syrian hamster IAP genomes. CHIAP34 represents the first full-length endogenous proviral element isolated from the Chinese hamster genome and is being used to determine the genomic organization and cellular expression of the endogenous retroviruses of CHO cells. The extensive use of the CHO cell for mammalian somatic cell genetic studies (14) necessitates an understanding of the organization and fluidity of the endogenous retroviruses within these cells. With the primary role that recombinant protein production in CHO cells has assumed in biotechnology, information on the retrovirus content of these cells has become of increasing importance.

The genomes of most vertebrates are littered with remnants of past retroviral infections. In species of rodents, the

diversity of integrated proviral elements is quite large. The endogenous retroviruses include type B, C, and A viral elements (18). Intracisternal type A particles (IAPs) are defective retroviruses which are encoded by a large family of structurally diverse proviral elements present in several hundred to a thousand copies per haploid rodent genome (6). IAPs assemble on and bud into the endoplasmic reticulum and are not released from the cell (6). Full-length IAP proviral genomes from both the mouse (MIA14) and the Syrian hamster (H18) have been cloned and sequenced (10, 12). Their genetic organization is similar to that of infectious retroviruses such as Moloney murine leukemia virus. The first long open reading frame (ORF) encodes the gag gene product. The pol ORF encoding reverse transcriptase is entered from the gag gene ORF by a -1 frameshift. In MIA14, the putative gag region is interrupted by one termination codon, while in H18, it is interrupted by several termination codons. Frameshifts also disrupt the pol coding regions of both MIA14 and H18. The env region is closed in all three reading frames by multiple termination codons. The IAP proviral elements are flanked at both ends by long terminal repeats (LTRs) which contain the appropriate regulatory elements present in functional LTRs. IAPs are present in a variety of rodent cells including the mouse and Syrian hamster (9). While IAPs have not been observed in Chinese hamster cells, intracytoplasmic particles associated with kinetochores have been reported (5). Chinese hamster cells also can spontaneously produce an infectious endogenous type C retrovirus (8). Since Chinese hamster ovary (CHO) cells are now extensively used for production of human pharmaceuticals, we have begun studies of the structure and expression of retroviral elements present in the CHO cell genome. *

HindlIl

EcoRI

HindlIl

Hindlil

BamHl

CHIAP 61

-pol

env

n LTR

LTR

HI Hind/il HindIll Hind/l/

EcoRI

Hindlil Hind/il

Hind/il BamHl BamHI Hindl/l

L

11

CHIAP 34

SHIAP Probe 0.0

1.0

2.0

3.0

4.0

5.0

6.0

7.0

Kbp

FIG. 1. Partial restriction maps of two Chinese hamster genomic retrovirus elements. CHIAP61 and CHIAP34 proviral elements are aligned based on shared restriction sites and cross hybridization of restriction fragments. The genetic organization of these elements is based on the nucleotide sequence of CHIAP34 as indicated. The position of the Syrian hamster IAP restriction fragment used as a probe is indicated as SHIAP probe.

Corresponding author. 4713

4714

J. VIROL.

DORNER ET AL.

1

TGTAGGGAGCCGGTTCCTTCATTATCCATTACAATAATGGTGCCTGAAGACACCAAGATG

60

61

TAAATTACTTCACAGGCGCTGGGTAATCTCCATTCCTTTGATCTCTGCCTATCCCGTGGC

120

121

TCATTTGGCCTGAGGAGCTGAAGCCATTCATAGGGTAACACTTCCCAGGCGGCTGGCCAG

180

181

CCTTTATAAGGGATGGTTTTCTTGGTTCAGTGTCTCCGCTCTGGTAAGCTTATGCATTAA

240

241

AGCTTGTCTGCAGAAGGATCCGAGTGTCCTGCGTGTATTTCTTGCTGGCGAGAAAATACA

300

301

GCGCGCGGGACATCTGGTGCGGAAACCCGGGAACTTGTTAACATCTCCGGCATCACGGAG

360

361

ACCCCTCTAACAGGGCGGATTCAGAACTGCAGGGTGGTAAGTTCGGAGAGGTATGCTTCA M L H

421 481

1561

CAGAATGACAGAGGCCGCTAGCCGGATATTCGGTGATGCAGAACAAGCCATGCCTCTGAT R M T E A A S R I F G D A E Q A M P L I

1620

1621

TGAACAATTAGTCTTTGAACAAGCAACTCAAGAATGCCGAGCAGCCATAGCTCCCCGGAA E Q L V F E Q A T Q E C R A A I A P R K

1680

1681

AAGTAAAGGTTTACAGGACTGGTTAAAGATCTGCAGAGAACTCGGAGGGCCACTTACTAA S K G L Q D W L K I C R E L G G P L T N

1740

1741

TGCAGGCTTGGCCGCAGCCATCTTACAAACCCAAAGGCGCCGAAATACGTCTGCCTGCTT A G L A A A I L Q T Q R R R N T S A C F

1800

.p12

PBS

420

1801

TAACTGTGGAAAAACACGGCACCTTAAAAAGGACTGTAGAGTCCCTGAAAGGATTAGAGA N C G K T R H L K K D C R V P E R I R E

1860

TTCTGACACCTTCTTTTTTGACTTTGCTTTTAATTTGCCACCACTATGGGATGGAAAAGC S D T F F F D F A F N L P P L W D G K A

480

1861

AGTGGAGTTGTGCAGGCGCTGTGGAAAAGGTTATCATAGGGCCAGCGAATGCAAATCTGT V E L C R R C G K G Y H R A S E C K S V

1920

CTTAATTCTAGTCTTTATTCTAATTTTGTTCACATTGGTCTTATATTATGCCCACCGTGG L I L V F I L I L F T L V L Y Y A H R G

540

1921

GCGGGACATAAAAGGTAGGCTTTTACCCCCTAGGGAGGAACCTAAAGCATCCCAACCAAA R D I K G R L L P P R E E P K A S Q P K

1980

*.gag-_

541

S

600

W

C

S

S

S

R

P

S

L

P

Q

V

A

S

S

I

M

G

S

S

1981

I

P

T

K

AAACGGGCCTCGGGGCCCATGGTCCCAGGGCCCTCAGAAATATGGGAACCAGTTCTGGAA N G P R G P W S Q G P Q K Y G N Q F W K

2040

601

AAAACAGAGAGACCTAATTAAAAACTGTCTAGAGATTGAGGCTTGCTGTCCCATGGTAGC K Q R D L I K N C L E I E A C C P M V A

660

661

AGAGAGTCAAAAAATGCTTAAAGAGGTACAGGATAATATATCAGAAACCGAACGAGATGA E S Q K M L K E V Q D N I S E T E R D E

720

721

GAGAATAGGGGCTCAAAAAAGGAAGGACATGTCCAAGGAAAAAGGCCCTCCCCAGGATAT R I G A Q K R K D M S K E K G P P Q D I

780

781

AAAAAAAGGGGGAGAGAAAATAGGGAATAACCGGTCACACCCCGGTAAATTTAAGAGAAA K K G G E K I G N N R S H P G K F K R N

840

841

TAAAGACTCAAAACCTAGCCTCTGCCCTACAACGAAACTAGAGGCCTTGGAGCTAAGCAG X D S E P S L C P T T K L E A L E L S S

900

901

CTCAGACTCTGAGATTTTAGACTCTAGCAAGGAAGCAGAGCTAGAGGAGGAGCTGCCAAA S D S E I L D S S K E A E L E E E L P K

960

961

AATAAAAGCAAATATGAGGCCACCGCCTGTTAATCCAGCGGGTGTGCTTCCATCAGCACC I K A N M R P P P V N P A G V L P S A P

1020

2281

TCCAGCCCTCGGGGGTGTTCTCCATAAAGCAGGGAGATAAGATCGCACAATTAGTGCTAT Q P S G V F S I K Q G D K I A Q L V L L S S P R G C S P

2340

102 1

CCCATTATTTGGTATTGACTCTTTTTTACCATTAGAGGAGCGAAGGAAATTGCAGATGGC P L F G I D S F L P L E E R R K L Q M A

1080

2341

TGCCCTCCCCTGGGGATAGAGAGAATTGCACCTCCCGAAAAAGAGCCTTGGGCTCTACCG P S P G D R E N C T S R K R A L G S T G

2400

1081

TTTCCCAGTCTTTGAAAATGAGGGGGCAAGAGTACATGCTCCCGTAGACTATAATCAGAT F P V F E N E G A R V H A P V D Y N Q I

1140

1141

TAAAGAATTGGCTGAATCAGTCCGGAAGTATGGGGTCAATGCCAATTTTACAACAATACA K E L A E S V R K Y G V N A N F T T I Q

1200

1201

AGTAGAAAGACTAGCAAACTATGCTATGACACCCACTGATTGGGAGACAACAGTAAAAGC

1260

1261

AGTGCTCCCCAATATGGGACAATATATGGAGTGGAAGGCTCTTTTTTATGATGCAGCCCA V L P N M G Q Y M E W K A L F Y D A A Q

1320

1321

GGCACAGGCAAAGGCCAATGTCACAGCAGAAAATGAAAATCAGAGACAATGGACCTTTGA A Q A K A N V T A E N E N Q R Q W T F E

1380

1381

AATGCTGACAGGACAGGGGCCACATGCCCTCAATCAAACTAATTACATTTGGGGCGTATA M L T G Q G P H A L N Q T N Y I W G V Y

1440

1441

TGCCCAGATATCAGCTGCCGCCATTAAAGCATGGAAAGCATTGACAAAAAGGGATGAATC A Q I S A A A I K A W K A L T K R D E S

1500

1501

AGGTGGACATCTTACAAAGATAGTCCAGGGGCCCCAGGAGCCATTCTCAGACTTTGTGGC G G H L T K I V Q G P Q E P F S D F V A

1560

K

R

A

S

G

P

M

V

P

G

P

S

E

I

W

E

P

V

L

E

2041

AAGCAACTCAGAGAAGGAAGGGACTCCCGAGGACACTCCGGAGTGGACCTGTGTGCCGCC S N S E K E G T P E D T P E W T C V P P K Q L R E G R D S R G H S G V D L C A A

2100

2101

TCCGACTTCTTATTAATGCCCCAAATGAATGTTCAGCCGGTCCCAATCCAGTCTCCGGGG P T S Y * S D F L L M P Q M N V Q P V P I QS P G

2160

2161

p4

CCTTTACCCCCCGCTACCATTGGGCTTATTTTGGGCCGAGGTTCCTTGACCTTACAAGGA L P P A T I G L I L G R G S L T L Q G

2220

CTCATTGTGTATCCTGGAATCGTAGATCCATATCATAAGGAAGAATTCCAGGTCCTCTGC

2280

P

2221

G L

I

V

Y

P

G

I

V

D

P

Y

H

K

E

R I P G P L L E F Q V L C

2460

Lp27

V

E

R

L

A

N

Y

A

M

T

P

T

D

W

E

T

T

V

K

A

2461

TTAATGGAAAAGAATTTGAGGGGATTACAGATACAGGGGCTGACAAAAGCATCATTTCAT N G K E F E G I T D T G A D K S I I S L

2521

2580 H

2581

2520

W

W

P

K

S

W

P

T

V

T

S

S

H

S

L

Q

G

L

G

GATATCAATCCTCTCCAGCTGTTAGTGCTGCCGCCTTGGTCTGGCGGAGCACCGAGGGCA Y Q S S P A V S A A A L V W R S T E G R

2641

2640 2700

Q G R F

T

P

Y

V

L

P

L

P

V

N

L

W

G

R

D

V

2701

TGTTACAAGCCATGGGCATGACCCTGACCAATGAGTATTCCCCTCAGGCATCAGCTATAA L Q A M G M T L T N E Y S P Q A S A I M

27 60

2761

TGACAAAAATGGGCTATGTACCAGGAAGGGGCCTGGGCAGAAGGGAGCAAGGTAGAATAG T K M G Y V P G R G L G R R E 0 G R I E

2820

FIG. 2. Nucleotide sequence of CHIAP34, a full-length Chinese hamster genomic IAP proviral element. The DNA sequence of the coding strand is given. The deduced amino acid sequences of the gag and pol ORFs are placed under the nucleotide sequence. Stop codons are indicated by asterisks. Protein domains of the gag and pol gene products determined by amino acid and nucleotide sequence homology with the reported sequences of murine (10) and Syrian hamster (12) IAP elements are indicated. Probable regulatory elements in the LTRs are underlined and described in the text. Short inverted repeats at the ends of LTRs are underlined with half-arrows. RT, reverse transcriptase; PBS, primer-binding site.

MATERIALS AND METHODS Genomic DNA cloning. CHO DUKX Bli cells deficient for dihydrofolate reductase were used in this study (20). A CHO cell genomic library in lambda EMBL3 was prepared by conventional techniques (15). CHO cell genomic DNA was subjected to partial Sau3A restriction enzyme digestion, ligated into the BamHI site of lambda EMBL3, and packaged utilizing the EMBL3 vector kit (Stratagene). Restriction enzymes and ligase were supplied by New England BioLabs. Recombinant plaques were identified by hybridization at low stringency (2 x SSC [1 x SSC is 0.15 M NaCl plus 0.015 M sodium citrate] at 50°C) with a 2.5-kbp HindIll DNA fragment derived from the Syrian hamster IAP clone H18, which encompasses most of the pol gene (12). Two clones, CHIAP34 and CHIAP61, were identified for further analysis. CHIAP34 was subjected to nucleotide sequence analysis. DNA sequence analysis. The DNA sequence was deter-

mined for both strands of the 5- and 1. 1-kbp BamHI fraga Bal 31 nuclease deletion series procedure (13) and dideoxy sequencing procedures (16). The DNA sequence of nucleotides 1 to 443 was determined by using synthetic oligonucleotide primers (19) and Sequenase version 2.0 (United States Biochemicals). The nucleotide sequence between the BamHI sites at 5310 and 5380 was also determined by using synthetic oligonucleotide primers. Nucleotide sequences and translations were analyzed by using the GCG package of genetic analysis programs (4). Southern blot DNA analysis. High-molecular-weight DNA was prepared, and 10 ,ug was digested, subjected to agarose gel electrophoresis, and blotted onto nitrocellulose (15). A 5-kbp BamHI fragment derived from CHIAP34 was used as a probe under low-stringency hybridization conditions. Syrian hamster kidney (BHK-21) cells and Armenian hamments of the CHIAP34 provirus by using

2821

I

E

Q K

G

Q

N

S

K G L G F I Y L KR p/'RT T *G F

G

I

A

TGAGGCTTCACGACCCATACCATGGAATACAGAGGACCCGGTATGGGTCTCTCAATGGCC E

S

A

R

P

I

W

P

N

T

E

D

V

P

W

V

Q

S

W

2941

ATTATCCTCTGAAAAGCTGGAAGTAGTCACAAGACTTATACAGGAGCAAGAACAGTTGGG

3001

GCATTTAGAAAGTTCTACTTCTCCCTGGAACAGCCCAATTTTCATTATAAAAAAGAAATC

3061

TGGAAAATGGAGGTTGCTCCATGACCTGCGGGCTATTAATAACCAAATGCGCCCTCTGGG

S

L H

L

G

3121

V

T

L

L

V

V

S

W

P

H

D

L

T

R S

N R

A

Q

I

L

P

I

I

F

N

E

Q I

I

Q

E X

N Q M

R

K

G

L

Q

R

G

L

P

L

L

D

I

K

D

C

F

S

F

F

T

V

P

S

L

H

N

P

Q

G

M

A

N

S

P

P

V

R

K Q F

S

T

C

H

R

K

I

E

S

A

L

I

P

L

M

E

P

T

I

P F D

C Q

M

I

M

L Q Q A

V

S

K

Q N W K

L

P

R

D

R

Q

R

K

R

Y

Q

W

R

L

Y

V

Q

K

A

I

H

Y

M

D

D

I

4621

4681

I

F

P

M

L

V

A

E

3601

TCATCTGAGAACCTTAAATGACTTCCAGAAACTCTTGGGGGATATAAATTGGCTGAGACC

3661

CTTTTTAAAGATCCCCTCTGCAGATTTAAAACCCCTTTTTGACATCCTGGAAGGTGAGCC

L H F

3721

L L I

R K

V

T I

I

L P

T N

S

P

D A

S

A T

K

E

I

Q K

F

L

D

K

K

I L P

V Q V P L

L

Q

G F

K D D

S I I

I

D

E N L

T I W

E

G R L G

L K R E

S

S

P

R

S

F

T

P

A

A

C

Q

A

L Q K

V

D

L

A Q

L

H

R

I

D

E

T

L

P

F

S

L

V

F

K

T

A K

L

T

P

A

I

L

WQ

H

G

P

L

L

GATTCACCCAAATGCTTCCCCCGCTAAGATCATTGATTGGTATCCGGATGCTGTCGTGCA

3961

GCTCGCACTTCGGGGAATAAAAGCAGCTGTCGCTCATTTTGGCAGGGACCCTCATCTCTT

4021

GGTTGTACCTTACACCACTGCGCAAATTCAAACTCTAACAGCTACTTCTAATGACTGGGC

4081

GGTGTTAGTTACCTCCTTTTCAGGGAAAATTGATAATCATTTCCCAAAACATCCAATCCT

I

L V V

4141

H

A V

L

P L P V

N R Y T

A

G T

S

S I

T F

P

A

K

A

A Q

S

G

K

A I K

I V

I A

QT I

D

D H

L N

W

F T H

Y

G A F

P R T P

D

D

S

K

A P

N H

V

H D P

V

L W

I

QF

T

Q N Q A

I

V

F

P

Q M T A K

H

P

I

were

D

S

Y

S

N

V

L

L

E

T

A

G

I

I

R

P

S

4440

S R V

I

F

Q K

I

Q

I

T

L

S

N

R

R

F

P

V

F

4500

V

V

R

H

A

S

G

L

P

G

P

M

S

S

G N

D

4560

A

L

R

A

T

L

K

M

A

A

A

L

S

T

I

Q

'-Endo

4620

Q A A Q E

H

Q R

F

H

V

T

A

E

T

L

R

Q

R

F

A

L

4680

K

T

I

V

T Q

C

K N

C

C

E

F

L

P

A

4740

P

V

A

C

I

W

A

S

Q *

5221

Y Y

L

Q Y

V

H

V

S

V

D

T

C

S

G

A

T

P

L

T

G E

K

A

A

H

V

I

QH

C

4920

L

G

A

W

G

K

P

I

H

L K

D

T

N

G

A

P

4980

Y

K

F

Q

H

F

C

R QH E

I

T

H

L

T

G

5040

L

S

P P

N

R

T

R

0 G

C

H

I

G

V

G E

M

5100

C P S H R T L K C A H R I A H L S

I

L

K Q K

E

G

M

G

A

S

L

P

S

V

P

R

5160

V

*

I

GCAATATCCATGGCACTCTTTACCCTTAATTTTCTAAACACCGACGCTCAGGGCCAkTACAAl I

S

M

A

L

F

T

L

N

F

L

N

T

D

A

Q

G

H

A

K

R

H

T

S

E

P

E

R

S

K

E

H

V

K

W

K

5220

T

5280

GCGGCCAAGCGTCATACCTCAGAACCTGAAAGGTCTAAGGAGATGGTAAAATGGAAAGAT A

5281

G K

TCTTATTTAATCAAACAAAAGGAGGGAATGGGAGCATCCTTACCCTCGGTGCCAAGAGTT S

5161

F

ACCTTATAACCCTCAAGGACAAGGCATTGTGGAATGTGCCCATCGCATCGCACACTTAAG P

5101

S

TACCTCTCAAAAGTTCCAGCACTTCTGCAGACAGATGGAAATTACCCATCTAACTGGCCT

A

D

GTCTTAACTGGTCTTTGGAGAGGCCCGGATCCTATTCTCATAAGATCCAGGGGGGCGATA L T G L W R G P D P I L I R S R G A I

5340

V

4020

Cl A I 5341

TGTGTTTTTCCACAGGATTGAAGAAAATCCTCTGTGGATCCCAGAAAGACTCACCCGAAG C V F P Q D *

5400

4080

5401

AGCCTCTTCGGACCTTCAAAAGTCGGAACTCCACCCTCTCCCCGGGAGTTGAGAGCCCGCiT

5460

4140

5461

ATTGTTATCCAGATTAACTCCTGTCCTTGGCAGAGAGATTGTATCACTGCTTAAG(

GGGTGG

5520

GGGGTGGGATTGTTTGATGCAGCCGTTTGCGGATTGCTGTTACCTCTGGCTGATGITATGu A 5581 GAAAGAGGTGGGGGTGGAATTGTTTGATGCAGCCGTTTGCGGATTGATGTTACTTITTGGCC

5580

5641

TGGCATGTAAGCTCCAGGGCTCAAACCAAGAGATCGCCCAAGCGCTTATATTTTG OGCTAiLA

5700

5701

;CCGGCCAGACAGCTCTTGCACACCCGGAGCCTAGG ;CTCA'.7

5760

5761

TGCACAGGGTAGAGTGTCTGGTTTGAGCAGCCCCAATGAGGGATGCTGAGCAAAGGCATCC

5820

5821

GCACAGAGTTGCCTAATATACAGGCTTCCCTGGGAGGTACGTTGACCTGCATAAGGGTTAA

5880

5881

CCTGCCCCGiAGACTCCCTTTCCCAGAAAAACGGCAGAGGACAGGTCGAGAGTACTTCGGG G

5940

5941

CCAAGCTAACAGCCTGATGGCGACTCTCGTACACAGTCTTAATGTTTGATTTGGGAAGGTT

6000

6001

TCAACCTCTGCCTCTATCCCTCAACATATGGGTGACCTATTTGCTTGTAAAAATATAAAGG

6060

6061

rTAATAAAAAAAGGGGGATATGTAGGGAGCCGGTTCCTGCATT rATCC.:j

6120

6121

TTACAATAATGGTGCCTGAAGACACCAAGATGTAAATTACTTCACAGGCGCTGGGTAATC C

6180

6181

TCCATTCCTTTGATCTCTG

;CCAT T

6240

L A L

5521 4200

P

FIG. 2-Continued. ster lung (AHL-1) cells Culture Collection.

H

L

Q

ACAATTTACACAAAATCAGGCTATAGTGTTTCCACAGATGACAGCAAAGCATCCAATCCC

G

T

5041

3960

S

G

I

AGAGGCTTGGGGTGCCTGGGGTAAACCTCATATCCTAAAAACAGATAATGGGCCGGCTTA E

E

3901

V

V

CATAATTTGTGCCACGCCTTTGACAGGGGAAAAGGCCGCGCATGTGATTCAACACTGTTT

3720

3900

A

QE A R Q

I

4981

W

I

Y

GCAGGAGGCTAGACAAATCGTTACTCAATGTAAAAACTGCTGTGAATTTTTACCTGCACC

P

TGTCTTTAAAACGGCTAAGTTACCGACCGCCATCTTATGGCAGCATGGGCCGTTGCTTTG

N

G A

4860

3660

3840

L

V

ATTTCATCAGCGCTTTCATGTGACGGCTGAAACCTTACGCCGTCAATTTGCTTTGACAAA

H

4921

C

P

G

ACATGTTGCCTCCTTTGGAAAGCTCCAATATGTTCATGTCTCAGTGGACACCTGCTCAGG

3600

3780

G

T

4801

4861

P

P

K

4800

3540

D

F

S

TCATGTAGGAATAAATCCGCACGGCATTAGGCCGCTGCAGATGTGGCAAATGGATGTTAC H V G I N P H G I R P LQ M W Q M D V T

3480

F

AAAGGCCTTGCAGGATGCACAATTGCATCGCATAGATGAGACATTGCCATTCAGCCTATG K A

3841

S

I

TCATATTTCTTCCCCCAGAAGCTTTACTCCGGCTGCTTGTCAAGCCCTACAAAAAGTGGA H

3781

G

E

A

G

4741

3420

L

E

L

D

AGACCGGGCCACAAAGCTGATGGCTGCCGCCTTGTCCACCCAGATACAAGCTGCACAAGA F

3360

L

V

E

T

CACCCATGTTCGAGCACATTCGGGGCTCCCAGGACCTATGTCCTCTGGGAATGATTTGGC D

3300

L

L

Y

V

TGCAGGTATCTTTCAAAAAATACAAATTACCCTATCAAACAGGAGGTTCCCTGTTTTTGT T

4561

V

V

T

TGTGGTTAATGCAGTCAACCTCCTTGAAACTGCTGGTATAATACGACCCTCCAGCAGAGT A

3240

F

G

V V N

4501

CCTGGGCTCAGTAATTACCCCAACAAAAATAATCCCACAAAAAATAGAAATACGCAAGGA

L

4380

N

V

3180

I

I

3541

G

GGTGCTGGAAGTCCTTGAGGCCTTTCCGGGGCCACTTAATATTGTATCAGATTCCTCTTA

4441

AAAACAATGGGGACTGGAGATAGCATCAGAAAAGGTCCAGGTCTCAGATACCGGTCTC7 arT Q W

4321

3120

G

L

P

3481

K

4320

3060

TATCTGCCATAGGAAGATAGAGGTCCTGCAACAAGCCTTCCCCATGCTGGTAGCTGAGTT I

AAATAAGGTAGTTTCTAAACAATTCAATGAAACCTCACCCCAGATCGTTGAATGCCAAGT I V E C Q V N K V V S K QF N E T S P

4381

GGAACCTGTTAGGAAGCAGTTTACATCCATGATTATGATTCACTATATGGATGATATTC Tl E

3421

R

S

E

GCTGCCACAGGGCATGGCCAATAGTCCAACTATATGCCAATTGTATGTCCAAAAGGCATT L

3361

W

L

TGCCTTTACTGTTCCCTCACTCAATCACATGGAGCCAGACAAGAGGTACCAATGGAGGGT A

3301

S

E

K

TATAGATATTAAGGACTGTTTTTTCTCCATCCCCCTCTTTCCTCGGGACCGGCAAAGGTT I

3241

K

E

4261

3000

TCCTGTCCAGAGAGGTCTCCCTTTGCTTTCTGCGCTACCCCAAAATTGGAAGCTTATTAT P

3181

S

GAATGGGACGGTGGTTTATACTGATGGTTCCAAAACCGGTGTAGGGGCTTATGTTA' rAGG

2940

P

4260

4201

R

*

2881

2880

AGCCCATTGAACAAAAAGGGAATCAAAGTAGAAAAGGACTGGGTTTTATTTAGGGGCCAT P

4715

CHINESE HAMSTER GENOMIC PROVIRAL ELEMENT

VOL. 65, 1991

obtained from American Type

RESULTS AND DISCUSSION Molecular cloning of CHIAP sequences. A CHO cell genomic DNA library was screened at low stringency with a 2.5-kbp HindIII fragment derived from the SHIAP clone H18 (12). This fragment contains most of the pol gene. An initial screen of 2 x 105 plaques yielded 75 positively hybridizing plaques. A second screen of 25 plaques yielded 9 positive plaques, which were analyzed by restriction digestion and hybridization with the SHIAP pol probe. From these, three clones were identified which exhibited BamHI fragments which hybridized strongly with the pol probe. Two provirus clones designated CHIAP34 and CHIAP61 were selected for further analysis. Restriction maps of these two clones are shown in Fig. 1. Based on Southern blot analysis (data not shown), CHIAP34 and CHIAP61 contain a common EcoRI site within the gag gene and a BamHI site near the end of the gag gene. In addition, several HindIII sites were detected in common between the clones. Nucleotide sequence of CHIAP34. The complete nucleotide sequence of the CHIAP34 provirus was determined. The

ri

5640

;CCAGCCTTTACAAGGGATGGTTTTCTI rGGTT 'C 6300 6360 6301 AGTGTCTCTGCTCTGGTAAGCTTATGCATTAAAGCTTGTCTGCAGAAGGATCTGAGTGTC

6241

6361

CTGCGTGTATTTCTTGCTGGCGAGAAAATACAACCCGCGGGACA

6404

FIG. 2-Continued.

nucleotide and deduced amino acid sequences of CHIAP34 are shown in Fig. 2. CHIAP34 was 6,403 bp long, with LTRs of 311 bp present at both ends. The LTRs were 2.3% divergent from each other, suggesting an integration time of 1.76 million years based on 0.8 million years/1% divergence (2). Sequence analysis of flanking sequences revealed the presence of a 6-bp repeat of cellular DNA (GATGAT) at the ends of the provirus. On each LTR, common structural and

DORNE'R H"l AL.

4716

JJ. VIROL. i~t CHIlAP

CHIlAP

L _____gag 2

OI

po1

env

A'

LR

6.000

Io

4.O

po1

gag

4,000

env

LTR

B

6L000

c

C:

(L)

(L)

6.000

U/) 000

FIG. 3. Nucleotide sequence comparison of CHIAP34 and at murine IAP element, MIA14 (A), or a Syrian hamster lAIP element, H18 (B). Dot-mattrix analysis wats generiated by the GCG progratm package (4). Each dot represents at minimum homology of 14 nucleotides over at 21-nucleotide interv-al. TIhe genomie organizations are indicated.

regulatory elements were present (3). iThe 5' LT'R contained CAAT box (CCAYT'l) located at base 143 that was followed by at consensus TIATFA box sequence (TTTATFAA) at base 182. A probable polyadenylation signal (ATT'AAA) was located at base 235 aind was followed by the presumed polyadenylation site (CA) 9 bp downstream. Downstreatm of the 5' LTR, the primer-binding site (PBS) was identified by its complementarity to 17 of the terminal 18 batses of matmmalian phenylalanyl-tRNA. Phenylalanyl-tRNA has also been identified as the putative primer tRNA for Syrian hamster and murine lAP elements (11). Adjacent upstream to the 3' LT'R was a typical polypurine tract of' 16 bp. Nucleotide sequence homology with mouse and Syrian hamster IAP elements. Complete nucleotide sequences Of MOuise (MIA14) and Syrian hamster (Hlg) IAP-related proviral genomes have been reported (10, 12). A dot-matrix nucleotide sequence comparison between CHIAP34 and MIA14 i shown in Fig. 3A and one between CHIAP34 and H18 Is shown in Fig. 3B. Substantial sequence similarity existed among the Chinese hatmster aind mouse and Syrian hamster elements att distinct regions of' the genome. Little similatrity existed within the 5' portion of' the genome including the 5'-terminal region of' gaig. However, beginning with se-

a

ORF~

fllliI

~~~~~11OF2111IIII 111 2000

quences which encode protein sequence with homology to p27 and extending through the pol gene, extensive similarity was apparent among all three proviral sequences. Sequence similrtywsreduced in the putative ent' gene and 3'terminal sequences. It is evident fr-om this analysis that the Syrian and Chinese hamster proviruses are more closely related to each other than either is to the mouse sequence. Comparison of a partial Chinese hamster provirus sequence consisting of the 5' LTR and partial gaig gene sequences (17) with that of CHIAP34 revealed 94%, nucleotide sequence homology. However, the absence of significant long ORFs in this clone precluded deduced amino acid comparison with CHIAP34. Several eDNA clones of retrovirus-related sequences expressed in CHO cells have been isolated which consist of partial and deleted sequences without intact gaig and pol genes (1). Their relationship to CHIAP34 is unknown. Grenetic organization of CHIAP34. Translation of the CHIAP nucleotide sequence revealed the presence of several long ORFs (Fig. 4). To determine the genetic organization of CHIAP34. we compared the deduced amino acid sequences of these ORFs with the predicted gaig and pol gene products of MIA!4 and H18. By this analysis, OREs 1

ORF4

Ill I 4000

~ IVflIIll I111 6000 bp

FIGi. 4. ORFs of ('HIAP34. Stop codons in eatch reading framiiie of' the coding-strand sequience arie indicakted by vertical lines. ORFs gag and paol pr-otein donmains are numnbered. ORFs I to 3 comiposc the gag gene. O)RFs 4 and 5 compose the pol gene.

encodling

VOL. 65, 1991

CHINESE HAMSTER GENOMIC PROVIRAL ELEMENT

CHIAP 24 LILVFILILFTLVLYYAHRGWCSSSRPSLPQVASSIMGSSKQRDLIKNCL MIAP

:I 1:1:1 .1::.:.

2 FGLEFFLVLEALLFLFT.

4717

CHIAP 3 TGFYLGAIEASRPIPWNTEDPVWVSQWPLSSEKLEVVTRLIQEQEQLGHL .11 11:.11.:11.111111.:1 11 MIAP 8 SGFSLAAIGAARPIPWKTGDPVWVPQWHLSSEKLEAVIQLVEEQLKLGHI

74 EIEACCPMVAESQKMLKEVQDNISETERDERIGAQKR ....... KDMSKE

53 ESSTSPWNSPIFIIKKKSGKWRLLHDLRAINNQMRPLGPVQRGLPLLSAL

19

58 DPSTSPWNTPIFVIKKKSGKWRLLHDLRPINEQMNLFGPVQRGLPVLSAL

.

:II 11.111l:111111111 11111:11:11. :1111 111:1111

CYQVVKAGRILDEIQDKLSEVKRGERVGTKRKYGTQNKYTGLSKG

117 KGPPQDIKKGGEKIGNNRSHPGKFKRNKD ...... SKPSLCPTTK.LEAL

103 PQNWKLI I IDIKDCFFSIPLFPRDRQRFAFTVPSLNHMEPDKRYQWRVLP

64 LEPEEKLRLGRNTWREIRRKRGKREKKKDQLAEVSRKRSLCSSLDGLGEP

108 PRGWNLII IDIKDCFFSIPLCPRDRPRFAFTIPSINSDEPDNRYQWKVLP

160 ELSSSDSEILDSSKEAELEEELP ....

.1111:.:

11.1.::111:

...... KIKAN. MRPPPVNPAGV

1:11:1.1.I........................

153 QGMANSPTICQLYVQKALEPVRKQFTSMIMIHYMDDILICHRKIEVLQQA

114 ALSSSEADEEFSSEETDWEEEAAHYEKKGYQPGKVLANQLRKPKAAGEGQ

158 QGMSNSPTMCQLYVQEALLPVREQFPSLILLLYMDDILLCHKELTMLQKA

199 LP ............... SAPPLF .GIDSFLPLEERRKL . 111:1 11.11: :: ~~~1:1.:. 164 FADWPQGSRLQGPPYAESPPCVVRQPCAERQCAKRQCADSFIPREEQRKI

203 FPMLVAELKQWGLEIASEKVQVSDTGLFLGSVITPTKI IPQKIEIRKDHL :1:1: .1.1111:11.1111:1111 11111:.1.11:111:1 11:111 208 YPFLLKTLSQWGLQIATEKVQISDTGQFLGSVVSPDKIVPQKVEIRRDHL

221 QMAFPVFEN . EGARVHAPVDYNQIKELAESVRKYGVNANFTTIQVERLAN

253 RTLNDFQKLLGDINWLRPFLKIPSADLKPLFDILEGEPHISSPRSFTPAA

214 QQAFPVFEGAEGGRVHAPVEYLQIKELAESVRKYGTNANFTLVQLDRLAG

258 HTLNNFQKLLGDINWLRPFLKIPSAELRPLFWYLEGDPHISSPRTLTLAA

'_p27 270 YAMTPTDWETTVKAVLPNMGQYMEWKALFYDAAQAQAKANVTAENENQRQ

303 CQALQKVEKALQDAQLHRIDETLPFSLCVFKTAKLPTAILWQHGPLLWIH

264 MALTPADWQTVVKAALPMMGKYMEWRALWHETAQAQARANAAALTPEQRD

308 NQALQKVEKALQNAQLQAIEDSQPFSLCVFKTAQLPTAVLWQNGPLLWIH

320 WTFEMLTGQGPHALNQTNYIWGVYAQISAAAIKAWKALTKRDESGGHLTK

353 PNASPAKIIDWYPDAVVQLALRGIKAAVAHFGRDPHLLVVPYTTAQIQTL 11.111111111111:. 1111:1:111:. 1111.1.11:1111.11:111 358 PNVSPAKIIDWYPDAIAQLALKGLKAAITHFGRSPYLLIVPYTAAQVQTL

314 WTFDLLTGQGAYSADQTNYHWGAYAQISSTAIRAWKGLSRAGETTGQLTK

370 IVQGPQEPFSDFVARMTEAASRIFGDAEQAMPLIEQLVFEQATQECRAAI

403 TATSNDWAVLVTSFSGKIDNHFPKHPILQFTQNQAIVFPQMTAKHP IPNG

364 VVQGPQESFSDFVARMTEAAERIFGESEQAAPLIEQLIYEQATKECRAAI

408 AATSNDWAVLVTSFSGKIDNHYPKHPILQFAQNQSVVFPQITVRNPLKNG

420 APRKSKGLQDWLKICRELGGPLTNAGLAAAILQTQ .... RRRNTSACFNC .1.: ..1111I I111.1111111I::1111111111l11111111 1.1 414 APRKNKGLQDWLRVCRELGGPLTNAGLAAAILQSQNRSMSRNDQRTCFNC

453 TVVYTDGSKTGVGAYVIGNKVVSKQFNETSPQIVECQVVLEVLEAFPGPL

466 GKTRHLKKDCRVPERIRE .VELCRRCGKGYHRASECKSVRDIKGRLLPPR

503 NIVSDSSYVVNAVNLLETAGI IRPSSRVAGIFQKIQITLSNRRFPVFVTH

-p12

11. 1:11111.1::::.1:1111:1111:1.

464 GKPGHFKKDCRAPDKQGGTLTLCSKCGKGYHRADQCRSVRDIKGRVLPPP 515 EEPKASQPKNGPRGPWSQGPQKYGNQFWKSNSEKERDSRGHSGVDLCAAS 111 111 .:..:.11.1 111.111. :...1~~~~~~~ 514 DSQSAYVPKNGSSGPRSQGLKDMGTGLSGPRKQSERRPRRKTHKVDLRAAS

565 DFLLMPQMNVQPVPIQSPGPLPPATIGLILGRGSL . TLQGLIVYPGIVDP 564 DFLLMPQMSIQPVPVEFIPSLPLGTMGLILGRGSASTLQGLVVHPELWIV

prt

614 YHKEEFQVLCSSPSGVFSIKQGDKIAQLVLLPSPGDRENCTSRK . RALGS 614 NIPQKYQVLCSSPKGVFSISKGDRIPQLLLLLPDNTREKSAGPEIKKMGS

653 TGNDSAYLAIPLDERPTMKLLVNGKEFEGITDTGADKSI ISLHWWPKSWP .111111..1:11.:1 :111111 1111111111111.11 654 SGNDSAYLVVSLNDRPKLRLKINGKEFEGILDTGADKSIISTHWWPKAWP

713 TVTSSHSLQGLGYQSSPAVSAAALVWRSTEGRQGRFTPYVLPLPVNLWGR 714 TTESSHSLQGLGYQSCPTISSVALTWESSEGQQGKFIPYVLPLPVNLWGR

763 DVLQAMGMTLTNE ...... YSPQASAIMTKMGYVPGRGLGRREQGRIEPI 11:.1..11.111.1:1.1111 1::1 :1:.1.11 764 DIMQHLGLILSNENAPSGGYSAKAKNIMAKMGYKEGKGLGHQEQGRIEPI 807 EQKGNQSRKGLGFI* 1. 1 IIII ... 814 SPNGNQDRQGLGFP*

.11111111111111111111:11111111.111.:1111I:1.::1:.11I

458 IVVYTDGSKTGIGAYVANGKWSKQYNENSPRMVECLVVLEVLKTFLEPL

508 NIVSDSCYVVNAVNLLKVAGVIKPSSRVANIFQQIQLVLLSRS . PVYITH 553 VRAHSGLP. . GPMSSGNDLADRATKLMAAALSTQIQAAQE .FHQRFHVTA 11111 11111111 :1 1111111:11 :..11..::1111I1 557 VRAHSGLPTSAPWLSGNDLADKAT*SGGCSLSSPVEAAQEIFITTFHVTA L Endo 600 ETLRRQFALTKQEARQIVTQCKNCCEFLPAPHVGINPHGIRPLQMWQMDV 606 EHYRSRNSLTRKEARDIVTQCQSCCEFLPVPHVGINPRGIRPLQVWQMDV

650 THVASFGKLQYVHVSVDTCSGIICATPLTGEKAAHVIQHCLEAWGAWGKP 111.1111111:111:111111I: 1.1111111.1111111111:11111 656 THVSSFGKLQYLHVSIDTCSGIMFASPLTGEKASHVIQHCLEAWSAWGKP 700 HILKTDNGPAYTSQKFQHFCRQMEITHLTGLPYNPQGQGIVECAHRTLKS ::11111111111111:11111:: 11111111111111111 111111. 706 RLLKTDNGPAYTSQKFQQFCRQMDVTHLTGLPYNPQGQGIVERAHRTLKT

750 YLIKQKEGMGASLPSVPRVAISMALFTLNFLNTDAQGHTAAKRHTSEPER 756 YLIKQKRELEEILPQ. HQESLSMALFTLNFLNIDVHGHTAAERQCSEPDR 800 SKEMVKWKDVLTGLWRGPDP ILIRSRGAICVFPQD*

805 PNEMVKWKNVLHNKWYGPDPILIRSRGAVCVFHRMKTTHFGYQKDSPEKS 855 RLTKGIPDVPRW*

FIG. 5. Comparison of deduced amino acid sequences of the gag gene products of CHIAP34 and MIA14. ORFs 1 to 3 were translated and combined together based on homology with the reported gag gene of MIA14 to generate a putative gag gene product for CHIAP34. Sequence comparison with the deduced amino acid sequence of the MIA14 gag gene product is shown. Lines between amino acids indicate a conserved amino acid residue, while dots indicate conservative changes. Periods within the amino acid sequence indicate gaps inserted for maximal alignment. The processed protein domains of p27, p12, and prt (protease) are indicated. The single-letter amino acid code is used.

FIG. 6. Comparison of deduced amino acid sequence of the pol gene products of CHIAP34 and M1A14. ORFs 4 and 5 were translated and combined together based on homology with the reported pol gene of MIA14 to generate a putative pol gene product for CHIAP34. Sequence comparison with the deduced amino acid sequence of MIA14 is shown. Lines between amino acids indicate a conserved amino acid residue, while dots indicate conservative changes. Periods within the amino acid sequence indicate gaps inserted for maximal alignment. The inferred junction between the reverse transcriptase and endonuclease domains is indicated by Endo. The single-letter amino acid code is used.

through 3 were determined to encode the gag gene. Beginning with a methionine codon at base 412, ORF 1 encoded 567 amino acids. As a result of frameshifts around bases 2058 and 2285, ORFs 2 and 3 also encoded gag-related amino acids. When these ORFs were translated together and compared with the deduced amino acid sequence of the MIA14 gag gene product, significant similarity (61% identity) was detected beginning at residue 224 within the putative p27 coding region (Fig. 5). This similarity extended to the end of

the gag gene product. The amino-terminal 223 residues of CHIAP34 gag showed no similarity with the corresponding region of MIA14. This region of gag is highly divergent between different IAP genomes and shows no similarity with the gag genes of other retroviruses (6). ORF 4 is entered by a -1 frameshift from the gag ORF 3 and encodes 752 amino acid residues of the pol gene before interruption by a stop codon. ORF 5 continues the pol amino acid sequence for another 91 residues before interruption by

4718

J. VIROL.

DORNER ET AL. BomHI 2I I m

v

c

HindM x

I

m c

23.1i- _9.4 - a

6 5-3 0*a

4.33

-

*

056-

*

M

1

2 3 4 5 6

FIG. 7. DNA blot analysis of genomic organization of IAPrelated sequences in hamster cell lines. High-molecular-weight DNA from Chinese hamster ovary (CHO) cells, Syrian hamster kidney (BHK) cells, and Armenian hamster lung (AHL) cells was digested with BamHI (lanes 1 to 3) or HindIlI (lanes 4 to 6). DNA was blotted onto nitrocellulose and hybridized with the 5-kbp BamHI fragment from CHIAP34 which contains both gag and pol gene sequences. Fragments contained in CHIAP34 are indicated by asterisks (*). Numbers to the left indicate the sizes in kilobase pairs of HindlIl-digested lambda DNA. a stop codon at base 5358. Comparison of the translation product of these two ORFs with the pol gene product of MIA14 revealed extensive amino acid sequence similarity (78% identity) throughout both the reverse transcriptase and endonuclease domains (Fig. 6). On the basis of this amino acid similarity, the pol gene of CHIAP34 encodes a protein of 832 residues. This is close in size to the 867-residue pol gene product of MIA14. The extensive homology at the nucleotide and deduced amino acid sequence levels with Syrian hamster and murine IAP genomes suggests that the CHIAP34 provirus represents an endogenous IAP-like genome of the Chinese hamster. The region corresponding to the env gene contained multiple stop codons in all three reading frames. Nucleotide sequence similarity with the env gene of H18 was only 67%. The greater degeneracy of the env gene compared with the gag and pol Vegions suggests that this provirus last moved in the genome without an intact env gene. The relative conservation of the gag and pol ORFs compared with the env gene suggests that these intact ORFs but not an intact env gene are necessary for transposition of the proviral elements. Organization of CHIAP elements in the CHO genome. The organization of retrovirus sequences in the CHO cells was compared with that of cells from several different species of hamster. High-molecular-weight genomic DNA was isolated from Syrian hamster (BHK) cells, Armenian hamster (AHL) cells, and CHO cells and digested with BamHT or HindIII. Southern blot analysis was performed with the CHIAP34 BamHI fragment as a probe at low stringency. This fragment encompasses the gag gene and most of the pol gene of CHIAP34. Comparison of the BamHT digestion pattern revealed a lack of common fragments between Chinese hamster and Syrian hamster sequences and no detectable hybridization with Armenian hamster sequences (Fig. 7, lanes 1 to 3). Similar comparison of the HindIII digestion pattern showed a fragment of approximately 2.5 kbp which appeared to comigrate in the Chinese hamster and Syrian hamster DNAs (Fig. 7, lanes 4 to 6). Additional Southern

blot analysis demonstrated that the 5-kbp BamHI fragment (indicated by an asterisk) and the 2.6- and 0.6-kbp HindIII fragments (indicated by asterisks) comigrated with fragments from CHIAP34 (data not shown), indicating that CHIAP34 is representative of a large number of similar members of a family of diverse but related sequences. Dot-blot analysis of copy number with the CHIAP34 BamHI fragment as a probe indicated the presence of about 80 copies of hybridizable sequences per haploid genome (data not shown). Comparison by Southern blot analysis of CHO cell DNA and Chinese hamster liver DNA revealed no differences in pattern or intensity of the retrovirus bands with the CHIAP34 BamHI fragment as a probe (data not shown). The observation that the pattern and intensity of IAP elements in the Chinese hamster liver DNA and CHO cell DNA are the same indicated that no significant rearrangements or amplifications of these proviral sequences have occurred since CHO cells were derived from the Chinese hamster. Since the hybridization pattern of provirus sequences in the genomes of Chinese hamster cells and Syrian hamster cells did not exhibit extensive common components, the proviruses may have entered the Chinese hamster genome after the divergence of the Chinese hamster (Cricetus griseus) from the Syrian hamster (Mesocricetus auratus) 7.5 million years ago (2). The absence of hybridization with Armenian hamster cell DNA suggests that the proviral sequences are more distantly related or are absent. While IAPs are common in most rodent cells, they have not been reported in CHO cells (6). Syrian hamster cells but not Chinese hamster cells can be induced to produce IAPs and IAP-related RNA by treatments which block DNA methylation such as 5-azacytidine (7). Treatment of CHO cells with 5-azacytidine or iododeoxyuridine did not induce expression of retrovirus-specific RNAs detectable by Northern (RNA) blot analysis with the CHIAP34 fragment as a probe, while similar treatments of BHK cells did induce expression of retrovirus-specific RNAs (data not shown). The inability to induce IAP expression in CHO cells suggests that the retroviral elements in the Chinese hamster genome are under transcriptional regulation which is different from that of the elements in other rodent cells. ACKNOWLEDGMENTS We thank M. Ono for providing fragments of SHIAP H18 for use as probes, K. Kuranda for assistance in copy number quantitation, D. Merberg for assistance with computer sequence analysis, J. Coffin for critical review of the manuscript, and E. Fritsch for support. We acknowledge R. Helmiss for computer-generated artwork. REFERENCES 1. Anderson, K. P., Y. S. Lie, M. Low, S. R. Williams, E. H. Fennie, T. P. Nguyen, and F. M. Wurm. 1990. Presence and transcription of intracisternal A-particle-related sequences in CHO cells. J. Virol. 64:2021-2032. 2. Benveniste, R. E., R. Callahan, C. J. Sherr, V. Chapman, and G. J. Todaro. 1977. Two distinct endogenous type C viruses isolated from the Asian rodent Mus cervicolor: conservation of virogene sequences in related rodent species. J. Virol. 21:849862. 3. Coffin, J. 1984. Structure of the retroviral genome, p. 261-370. In R. Weiss, N. Teich, H. Varmus, and J. Coffin (ed.), RNA tumor viruses, 2nd ed., vol. 1. Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. 4. Devereaux, J., P. Haeberli, and 0. Smithies. 1984. A comprehensive set of sequencing analysis programs for the VAX.

VOL. 65, 1991

CHINESE HAMSTER GENOMIC PROVIRAL ELEMENT

Nucleic Acids Res. 12:387-395. 5. Heine, U. I., I. Margulies, A. E. Demsey, and R. G. Suskind. 1979. Quantitative electron microscopy of intracytoplasmic type A particles at kinetochores of metaphase chromosomes isolated from Chinese hamster and murine cell lines. J. Gen. Virol. 45:631-640. 6. Kuff, E. L., and K. K. Lueders. 1988. The intracisternal A-particle gene family: structure and functional aspects. Adv. Cancer Res. 51:183-276. 7. Lesser, J., J. Lasneret, M. Canivet, R. Emanoil-Revier, and J. Peries. 1986. Simultaneous activation by 5-azacytidine of intracisternal R particles and intracisternal A particle related sequences in Syrian hamster cells. Virology 155:249-256. 8. Lieber, M. M., R. E. Benveniste, D. M. Livingston, and G. J. Todaro. 1973. Mammalian cells in culture frequently release type C viruses. Science 182:56-59. 9. Lueders, K. K., and E. L. Kuff. 1983. Comparison of the sequence organization of related retrovirus-like multigene families in three evolutionarily distant rodent families. Nucleic Acids Res. 11:4391-4408. 10. Mietz, J. A., Z. Grossman, K. K. Lueders, and E. L. Kuff. 1987. Nucleotide sequence of a complete mouse intracisternal A-particle genome: relationship to known aspects of particle assembly and function. J. Virol. 61:3020-3029. 11. Ono, M., and H. Ohishi. 1983. Long terminal repeat sequences of intracisternal A particle genes in the Syrian hamster genome: identification of tRNA-phe as the putative primer tRNA. Nucleic Acids Res. 11:7169-7179. 12. Ono, M., H. Toh, T. Miyata, and T. Awaya. 1985. Nucleotide sequence of the Syrian hamster intracisternal A-particle gene:

13.

14.

15.

16. 17.

18.

19. 20.

4719

close evolutionary relationship of type A particle gene to types B and D oncovirus genes. J. Virol. 55:387-394. Poncz, M., D. Solowiejczyk, M. Ballantine, E. Schwartz, and S. Surrey. 1982. "Non-random" DNA sequence analysis in bacteriophage M13 by the dideoxy chain-termination method. Proc. Natl. Acad. Sci. USA 79:4298-4302. Puck, T. T. 1985. Development of the Chinese hamster ovary (CHO) cell for use in somatic cell genetics, p. 374A. In M. M. Gottesman (ed.), Molecular and cellular genetics. John Wiley & Sons, Inc., New York. Sambrook, J., E. F. Fritsch, and T. Maniatis. 1989. Molecular cloning: a laboratory manual. Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. Sanger, F., S. Nicklen, and A. R. Coulson. 1977. DNA sequencing with chain-terminating inhibitors. Proc. Natl. Acad. Sci. USA 74:5463-5467. Servaney, M., J. J. Kupiec, F. Galibert, J. Peries, and R. Emanoil-Ravier. 1988. Nucleotide sequence of the Chinese hamster intracisternal A-particle genomic region corresponding to 5' LTR-gag. Nucleic Acids Res. 16:7725. Stoye, J., and J. Coffin. 1985. Endogenous retroviruses, p. 357-404. In R. Weiss, N. Teich, H. Varmus, and J. Coffin (ed.), RNA tumor viruses, 2nd ed., vol. 2. Cold Spring Harbor Laboratory, Cold Spring Harbor, N.Y. Strauss, E. C., J. A. Kobori, G. Siu, and L. E. Hood. 1986. Specific-primer-directed DNA sequencing. Anal. Biochem. 154: 353-360. Urlaub, G., and L. A. Chasin. 1980. Isolation of Chinese hamster cell mutants deficient in dihydrofolate reductase activity. Proc. Natl. Acad. Sci. USA 77:4216-4220.