bp upstream of the cap site. The repetitive sequences ... - BioMedSearch

2 downloads 0 Views 415KB Size Report
CCATAACCTC ATCCCCCCAG CCCCATGTCC TTECCTCTCC CTCT CCCTCG ... CAGGAGT CCG AGACCAGCCT CCCCAACATO CCAAAACCCC ATCTCTACTA .... CTGACTTTCC TGATACCCCC CATCT CAGCC ATA GAGGCAG OT ...
Nucleic Acids Research

Volume 17 Number 17 1989

Nucleotide sequence analysis of the human fur gene Ans M.W.Van den Ouwelandt, Jan J.M.Van Wim J.M.Van de Vent'2

Groningen', Anton J.M.Roebroekl, Carla Onnekinkt and

'Molecular Oncology Section, Department of Biochemistry, University of Nijmegen, St Adelbertusplein 1, 6525 EK Nijmegen, The Netherlands and 2Center for Human Genetics, University of Leuven, Belgium Submitted August 2, 1989 EMBL accession no. X15723

In previous studies, the nucleotide sequence of a 3'-portion of the human fur gene was determined (1,2). Here, the nucleotide sequence of the region immnediately upstream of the described sequence (1) is presented. The sequence data of both DNA strands were obtained from overlapping clones. Based upon preliminary cDNA sequence and nuclease Si analysis, the cap site of the fur gene could be localized at position '4597. A putative "TATA" box is located 32 bp upstream of the cap site. The repetitive sequences beloning to the Alu famnily (3) are underlined. GAATTCAAGG GCGTAGTATG ATCTGATGGC TACTGTGAGA ACATAGGTGA ACAATTCTTG CTCTTGTAGA GGCAGAAAGC AGAGCGGTTG OTAATCTGTG AAACAGGGAT CATAACAGCA

ACCTGTGAAG AGCTGCTGCA CTCCAGCCTG GGCACCAGAA TCCAAGAAAA AGTCAAATGT GAATCACAGC TGAGCCACCC AAGTGTGTGT AGGCTTTCTA TTCTTATGTC CCAGGTTCAA CCTGCCTCAA GGGTTGTTGT GAGCAATATA TGAAGCAATC

CAATACTGTT TCAGACAAAA ACAAAAGAAA CCAGAAAAAT CAGCATGTAC ACATCACAGAT GGCTGAGTGA ATCTGTGTCT GATTTAGCCA CAGTTCAGCT ATCCTGGGTC TTTATTTTCC TTAGTGACTG CGAGTCCAGG CCTCAGTTTC

ATTATAAGGC ATTGAGCACA GAGCCTGAGA GTATGAGCCC GTAAACTCAC TGTGAGTTTC AGCTGTTAAC TCAAAGCAGC TTTTTGTTGC CAAGTCGCTG GCTAAAGATG CCCAAGGGCT ATGGTCTAGA GAGAAGCACC TGAACCCACC TAAGCAGTTG GGTTGGGGGA

TOCCCGCTGG CTGCTCCGCC AGCTCTTGTC CTGCAGAGGC CCCCACTTCG CAGGTCAGAT GTGTGGTTCA CCTCCTGTCC AGCCTTOCCCA GGCCTGCGCA GGAGCCTACC ACCTGCTCAC TGGTTTGTAA GTCATTCAGG GGCTGGOGCG TTATAGAGTC ACTCAAGGCG GTGACTCACA CCTATAATCC CAGCACTCTG GGAGGCCGAG GCAGGTGGTT CACTTGAAGT CAGGAGTTTG AGACCAGCCT GGCCAACGTA GCAAATGAAA ACCCGTCTCT ACTAAAAATA CAAAAAATTA GCTGGOCGTG GTAGCACATG CCTCTAATCC CAGCTACTCG GGGAGGCTGA GGCAGAGGCT GCAGTGAGCT GAGACCGCAC CACTGCACTC CAGCCTGGGT GACAAGAGrG ACACTGTCTC AAAAAAAAAA AAAGTCACTG GACTCCCCCA AGCCAGTCCC GTCACTATGG TWCOTCCTGC GTCGGGCTGT GGTGGGTTCC CACACGAACC CCCATATCTC TACTCCCCAT GGTAGTCAGA GGAGCCTTTA AAAAGGGTAA CACGCCTGTA ATCCCAGCAC CTCTGGAGGG GGAGGCAGGA GGATTGCTTG AGGCCAGTCT GGGCAACAGA GTGAGAGCCC ATCTTTTAAA TTAAATTTAA TGGGCGCGGT GGCTCACGCC TGTAATCCCA GCACTTTAGGG AGGCTGAGGC AGGTGGACTO ACTGAGGTCA GGAGTTCGAG ACCAGTCTGG CCAACATGGT AATACAAAAA AATTAGCCGG GTGTGGTGGT GTGTGCCTGT AATCCCAGCT ACTCGGGAGG CTGAGGCAGG GGAATTGCTT GAGCCAGGGA GGTGGAGGTT CACTOCACTC CAGCCTGGAC GACAGAGCAA GACTCCATCT CTAAATAAAT AAATAAATAA TTTTTATTAA AGGGTAAATC AGTCAAATTT CCAAAAGAAT TGCTCTGATA ATTAAATTCA ACATCTGGAC CCAGCTTGGC AAGTGCTCTG TGATCCAGCC CGTTACCTAC CTCTCATCTC ATCTCTCGGC CATGCTCCAA GCATAAGCTC ATGCCCCCAG GCCCATGTCC TTGCCTCTCC CTCTGCCTGG GACACTCTTC CTCCAGATCT TTACACTTTT GCCTCTTTTT TCAGCCTCAG AGAGGTCTTC CCTGACCACC CCAGGGATAG CAGCCCTCCC GCTTCCCCAG GCACTOCCCAC ATTACCTTCT TTATCATTTT OTTCCAGAGC TCTTAGGATG CTGGCTTATT CACCTGTGAA CTGCCTGCTT CCCTCGCTAT AACTGAAGCA CTGGAAGAGG TCCTGCACCC AGCAGGCACT CAATGAGTAT CTGAGGAATG CACAATCAAC CTCCACAAAA GCAGTTTTTC TTTTCACATA TCACAGCCCC TCTCCTTGCT CACCCGCCTC 0070800000 TCCTGAGAAC CTCCGTAGAG ATCTGCACCA TCAGAGAGGG CCCCTCACTG CCGCTCGCAG TGCTCCCTGC TGCCACGTGC AGCCCCTTGA GCATGAGCCT 0770A08080 GTTGOCACAC GCTCTCTGCT CCCAGGTGTG CCGTGCCCCA TTGAAGGACA GCACTCACAC TCCTCCAGGT ATTAACTGAC TTTTTCCCCA GCAATGTTTG TTTTAGGGAT ATAATTTCCA 0807087070 TTCCTTTCCT GGCAGCCAGC 8087770708 TTCTATTTTT CCCAAGTCCC TGATAATACC GTTCCAGACA TGTTGATTTG GACTCTCTGG GGCCTOCCGT AGGCCAGAGG ACAGGCTCTC 0000008080 TCAGGGGCGA CTGAGCCAAC TCTTCACGCC 8770077700 TTCCCTGCCT GATCGCTTTT7 0770770770 7707777707 TTTTTCTTCC TGTTTTCTTT TTAACTACTA CA0T0AC0CC TTATTTATTC 8GC8880888 TGGACTACTC ATAATGCCCT 7088008078 8880777888 0880808707 TCAAAACAGC TTTGCTGAGG 7878807080 8080787888 CGGCATOGAT TTAAAGTGTA 7870080808 0008708880 0870800808 8708808788 7088087070 CATCATTCCC 8888077700 7000GC0007 CCCTGCCCCC TTCCCCAAGC

88780GT077 7880007808 8000070808 0800700008 0077008888 7008070708 08GA7GT087 0800070087 GA77077700 CCTGATCACT 8880000008 7707707007 7780770008 00800088CT 07700700CT CGAACCCTGG 0808078800 8008700707 AAGGGACATT TGGCAATACC 7008087077 T770077070 87087GG080 0000887007 0707000078 0788788700 8888878008 0000088080 0080080007

GCTCTAAGGG

TGGCCATGGA 8887080088 0708787888

G087088700 GGTAGGCAAA 7808807800

ATGACTOGGT CCCCTTTACA

CCTTGGGCCC 0707007008 0700070880 0800088007

GGCCAGCCTC 0070887008 0880707780 7770788800 7807000008 0870888007 8777777777 8788000800 0700787877 7007787070 0788888088 0700700700 0008080000 0088080000 0800000000 0000080000 0000007080

0808088778 0800807780 0700007880 8000808070 7077088007 8000800787 8000080008 8870808800 0880077770 0800700707 8808808088

7000800088 0870770878 8700088007

GGAGGCCGAG 0700070787 0807788007 0070788700 7707078088 0088000080 0000800808 GAACACAAGG

GCTGGCTGGC

0088807007 0080070087 0888080000 TGTGGGTGCT 0070000880 GGATTTAGGA C7TATGTGAT 0000008077 8000870800 0780000000 8007080080 8700087707 7770080808 0808080800 0007088070 0007000808 8800000880 0800870007 0000080070 0700707700 7080800078 7700770000 0800008807 7000008870 0780070870 GGGATTTGAA 0000777007 ACTT8AAAGG 8887078070 7000877800 0000880070 0000088007 0070000808 0088078007 7008770800 0888870008 TT88788707 A8AAAAAGCTA 8888787788 0000070080 7007000700 CAGTCCCCCA 8080700000 8000000000 0000000800 0008088000 0000080000 0000700700 0000000000 8000000008 0788808800

QIRL Press

0800780770 8888888888 0080070008 0000800077 8007008007 0870778008 7780008088 0070880787 AAATGTGACC 0077780770 0708008000 0007007007 8007000070 8070708080

0080007088 8888000878 0807080088 0800800700 0000088008 7008077008 0700880000 0080870077 0777087700 0007000708 7070700877

GACCCTGGTG TGTCCCAGGT TCAATGAATG TCTGCCAGGC 8070877880 7080008070

08TT087777 TACCCACCTC 7077880080

0T77007707 0707000888 0800070888 8800800700 8007000807 7080008000 0080000000 0000000000 0000000088 0000000000

7087007770 0080777088 0700708770 8870777807 GACAAACTGA

0770877080 0777000807 7000808070 0807800087 7708888070 GGCCAACATG 0088880000 0808800080 AGGTTGCAGT 0787080087 7080708770 ACCTCCATGC AACCCTACTA GAGACTGCAC 0088800877 0070008700 8008880008

CATCAGGAGG

CACGGTGGCT ATAAAATGGC CTCTACTAAA GAGATTCO

CAGAGCCCAT GAAGTGTCTG CTCCCCAAGT CTCAAACATC ACCTCCTCCC ATCTGAACTT CTCTCATAAA

AGTGAATGAA ATGCCCAGTG GTCTCCATCA GCCAAAAAGC GGTGTCTGGC TTCTGTAAAT TAAAGTTTGT GGAGGAGCCT ATGGCATCAG TCAACGGAAA 8008008007 TACAGCCCAT 0808078070 8877770888

CAACTTGATA TGTTTTGATA AAGCACTGAT CTGTTTTCCA

0877870800 8808008700 8080708707 7770700CT0 8070078007 0707080800 GT70777007 0008080877 TTTGTCTCCC 0008070880 8700700887 0780807008 8808800008 0000000700 7000708000 8707078078 8888780888 8888888888 0800707087 CAGGCCATTG 0877008000 8087770708 8700808080 8088008700 0808007007 CTCTGGCTTC CTATGGACAA 7877800780 8788770700 CAGCACTGGC 0008088007 8807008080 TCACCCCAAT

GTGCGACCAG ATATGTCACC ACCACATCAC TTTTAGCTCC TCCCCCCACC CACAGACACC CTGTTTGCCT 0CCCGCCCTG 0880008780 GAGCCTGACT 0770080070 0807080700 0000088008 0080000008

7877008777 0808088080 0070080080 8000008887

CTCACTCATC 7008080CTC 0800000007 TG8AGAGTGT 8080700077 7088877077 0800000707 7877007877 0708070807 0080880000 8878778707 7080080770 ATGGGGCATG 0008080007 7770777070 0800008070 0087787070 0078077070 0080080008 0700080700 0708000800 0000008000 0000008007 0000000000

GGCATACTGG GACGTTCCCA

8078008007 7080888000 7077080087 0800807000 80800800CCT 0087080887 0000708800 8880888088 8088888888

CCCAATCCTG 7007087700 7000707080 0700807700 8800800880 8008808800 CCCCACTTTG 0087708078

7080087GA7

CTACATTTCA ATTTGCCGGG AAATATAAAA GAAACCCTGT GCAGTGAGCC TTTGAGAATC

0007000700

TTTCTGG00T 7007000708 8807008077 0087777878 0077880887 7007080087 0800070700 8007088008 0077008008 0807888708 0708787870 0070000007 0700000880 0000000000

00700000CCC 0000000008 0070000000

7008770807 CAATGGAATT 8087770808 0880087888 7070808700 8880808808 7778008008 0080008007 CAGTCATCTG 0800777077 0707080700 8707700000 0800007888 7870870877 8880007708 8000800778 0007000007 7070800707 7078087880 0800707078 0007000080 0077700077 0000080080 8000000700 0000080000 0000000000

7888007778 0707707008 0880807000 0808000078

8778777780 0007007870 8080080000 0070080707 CCAAGTCCT0 0788070070 8780070000 8080080770 0887707707 AAAATACAAT 0708088000 8077788808 8000780007 8000007000 0780000708 0080880700 7008078770 7707000880 7088077708 0080008008 7708008807 0007080888 7880070707 0000070807 0808088880 AGCTCTCAAT 0070808000 8000808808 0800088078 0080888007 0070077070 0080777000 8800808788 7078880708 7808887008 0800880700 7788000007 0800070888 7007700080 7700007070 7770008700 0070000080 0880000000 0000807000 0000000000 0800800800 8007008000 0800000800

0807000707 8080877007 0888887008 0800707088 TGTTGTGTAG ACATTCTGCT 8800808088 0800807007 7008070800 TCCTCCCAGA 0000700700 0888870080 TGTAAAACTG GTAAGGTGGG 8700808008 TTGCAATGAG 8788800080 8807080080 0708707088 8000788887 0708080807 ATTTTGCCTT 8700088800 0800877088 0070088808 0008070000 0070000000 0780070000 0080000707 0007077700 7070007000 0078000807 0700000078 7707700700 8077700807 TCTTGCAGTG 7788077070 TAAGGTGTCC 0800888888 7700700707 8788887008 0808070878 0070007000 7087808888 0807080700 0870000808 0007807070 CGACCTTGGA 7707800000 0800080008 0800070000 0888778777 8880077070 0000708077 0700007007 0007000880 0007700707 7080700707 8808078000 0880800800 7700700000 8008070000 8800000008 0000000070 7700000008 0070000000 7700780070 0000780080 0800000800 0000000000 0870000000 0000008008 8000000000 0070000000 0000000000 0070000700 7000000000 00000k0000

12C

24C 36C A8C 60C 72C

88C. 96C 108C 1200

132C 1448C 156C 1680 1800 1920 2000 2160 2280 2800 2520 2640 2760 2880 3000 3120

3280 3360 3880 3600 3720 3880 3960 8080 8200 8320 8880

8560 8680 8800

8920 5080 5160 5280 5800 5520 5680~ 5760 5880 6000 6120

6280 6360 64880 6600

6720 6880 6960 7080 7200 7320 7880 7560 7680 7800

7101

Nucleic Acids Research GCGCGCCGGC CTCCCCGCCT GAAAAGTTTC GCTCTGCCCC AACTGGCCCC TGGGGCCAGG

CGGTCGCCTG GGGCCGTGGA GGCTTTCCGG CTTTTCCGCG ACCGGGCTGC ACACCTGTGC

CGGACCAAGG GAGAGTGTTG CTCTTCTCAA

CGCCGGCTCG GGCCAGGACT GGCGGCCTGC G0ATTCCTAT GCAGGGCAGC CCAA0GAAAC GTGGTGTCTC TGAGGCGCAA

GTTTTTACCA AGGACCCCGC GGTCCTGGGG rGGCCACAGC CTGGAACAGA AAACCCATTC ATTCATTGTT

AT0GCTGGG0 TTGTTGGATA CTTTTAAC0 C CAAACA0CT0 GTGGGGGCAG 000G0T0GTT CTCT0TTAAA TGAGGGACTT TGGGGCTGTG AAAATACTCC GCTCCCCAGG TATGGGATAC GAGAGGTCCC TACGCAAGGC CTGACTTTGC TTTCTTCCTA ACATCTTATT

GCGT0TGTAT ATCATGGTGA

GTACT0TCGC CCACCACACT ATGACAGAAG GGGCTAGATT GACCCTCTCT CAGGCACCTG GAACCTTGGT TCAACCTGGG GGCATGTTCT

CGCACCCCC0 GGACTATTAC CACCCTCCCC CGGGACGTGT GGATGGAGGA ATTCCCATGG TCTCATAAGT ATGGCATCGA CTTCTCCTGA

AGGCCCCG0 T CTGAGGGAG7 CACATCTGCC

T0TAGGTGTG CTTTCCGACT

CCTCAC0ACC TTCTCT0G0 T CTT0707TGG G0CCT9AAC G0TGAGGTCC TT7AAACATT 0CT70GCCTCA

CCGCGCCGTG ACGCTGCCGC 0CGGCGCGGG CCCGCCAGGC TCCCCAGGGT AAGTAGCGAG AGGAGCGGGC TCCGCGGCCC GGGTGCTCGG AGCTGCCGG GG CGG CCGAGGGCGG AGGGT 0CCAGCCCTC TGACCTCAGG AAGCGCCTGC CCCACTCCGC GGCCGAGGAG GGCCTCGTTC CGGCTGCGGO ATCCGCGCGC GCCGGCGGCC TTGCCAGACA GGTCTGACCG CAGTCACTTG TCTGACTTGG GAAGTGCGAG GAGAACCCGG CCTCGTTTTA CTATTTCCTA TATTGGGCTT TGAGGCCCAG GGAGAGGAAG TGACTTGTTA TGCTAGAGGG AGAGACTGAT TGGGT0TCCG TTTGAGTGGC CCAGAGTGGC TCTCTGTGTC GTCCCATGTG AAACAAGACC ATGAGAAATA TTCAGTTCTT ACTCTTTCG GGGATCCTCC GA80 0AGGCT 0GCCCTCTTT CTTTGG0C00 C00CTCCATC TACAGT0TGC TGGAATCTGG GCATGACATC T00ACCCAAG AG8CA7CTGT G0GG0TGG0 0 0CGCGTCCT TCCATCCCCT CTGCCCTGGA AGTCCTGTGG AGTCTGCCC8

GTTCCGAGGC AGGAATTGTG TGTGGAGTAC GCCTAC0T0 A GCT0GGGTGC

CCCCA0CA8C ACTGCT8GGA GCCGTCTGTT TGCTGAAGGA

TGGTG7GCGT 0 0AGCAAGCA TGATAGCGGC CATCTCAGCC GGAGCTGTGG GGCTTTGCGA CCCAT0TAAA ACATACCTAG CCTTCCAGAT TTCGCTTTA7 ACATTTATTC ATTTGTTGTT ACA8ACTG80 AATGCAGT00 TGGCTCATGT TT7TATTTTT TGAGCCACCA TGCCTGGCCC TCTTCTGGGG AGGACTAATT CCCCTTTTCC CCTCTTGTTC GGAGCCGAGG CCCTGTGACC CCTGCTA0CA GCTGATGCTC 0CAGGTAGGT GTTCCCCCAC GGGTG0CCAT GAGCAAAGCA ACCGTGGCGA GCCTCCCATG CACTTCTGGC ATCGAGGA9 T CTCCTGCTCT CAGGAGCCCC ACCAGGAGCC CACAGACCCC GGCTGTCTTC GAGGCCTCCT AGTCCAGACA GCCA0TGGC0 GATGGGGTGG 0TGTCTCCAC GAAGAACCAC CC07ACTTGG TG0TGGCCAA ATCCTTCTTA CTCTGCCTCC CTTCTCCTTT GCTC08TCAG AGTGGAGATG GGGCCCTGTT CACCCCATTT GCCTACAAC0 CCCGCATTGG 0TGGATCCTT TGATCACGTG CACTGT0A0 T CCTCACGTGG CTT0TCTTT7 AAAGGCAGAG

GG0CCCT0AC AGCTG9ACCC CCCAACCACA TCCACATCTA

97ATCTGTCC A9CCCCTGCG AA808ACA0A AGAAGAGTAG A8CGATCCTC CCGCCTTGGC

GACCGCCGCC GGCGCGGGGC GCCGGGGAGG AGCAGACAGG GG0GCCCCT0 CGCCCGGGGA GGCGCAGGTA TCTGGTCAGA GTGATATGGC CTACGGAGCA AGATCATCCA AGTG8CCCTC AGGAC8TTCT CAGCAA0CCT CTTGTCCCAC

GAGCCCAGGC CCGGGGCCCG GTCGGGGCAC GCGCTCCGCA CT700GGCC00

CCCCGCCGCC GGGAAGGCAA TGCCCCCGGT GCCCACCTGA GATGGGAGA

CTCGOCGAG GAGCCGGOCC CCCGGGTCCC CCCGTCTCGG GGCGCCGGGT GAGGCGGGCG CTCTCTGCT CCGCGGCCTO 0000800000 TCTACCCCCT GGCCCCTGGG GCACGTGTGG ACACAGCATA CTGTGGCTTT TTAAAAAGTT GGGAGCGATA AATTCACGGG 0T8TCAGAGC CAGACCCCCA GCCCAA0AAC ACACCCA80A GACTA0AGGG TGA0TCA0A8 CGGCTCTCCG GCGC0TCTGG GCAGCCCTGA GCGCGCCCGG AAATTCCGCA

GGGCGCCCGA CGAGGGGAGC CCGGGCGCCCG CCCCCCGCCC GCTCCAGCCT TGGGCCTCC CTGTGCCCCC GCTCAGACCC

CCCCGCCGCC

792C 804C 816C 8280 8400

8520 8640 8760 8880 CGAGATTAGC 9000 TGAAAGCCGC 9120 A8CCTCCCAG 9240 0T0TCTGGGA 9360 AGA8CACCCA 9480 TCTCGGATCG ATGCTGTGCA GA00CGTCTG TG000GTCTC ATTTTCT80 A A8CCATCCCC ACCCCTCACC 9600 ACTGTTCAGT TGCGCCT7AA GCCTGCTTTC TTTCCGAGTG CTGCAAGGTA ATTAAATA8 T TGGTTAATCA 9720 GCTA0ACCCA 0AT0AACCCA GCCTCGATTT CGCGCCTCCA 0CAG0ACTCA CA0CACG8T0 GTTGGTGGTG 9840 CTCCCACATC 9960 0 0 A AGA0AT0A0 GCCCCGTTCC TATAT0GCTC TGCTCCCA0 TGTGGCAGCA AGAGAGAG8 AGGGCTGGGG CGCCTAGCTG CCCAGCCATT GATCAGGCCA GCTTTCTGGA ACCAACCACC CAAAGCACCA 1 0080 CAGG0ATGTG CAGGAGCTGT ATT8TAGAT8 TCTCCTGCCA TCAGCTGTCA G0TTATTTTG 0TCCAAACTG 10200 CCCCAGCCT0 CA0CAGTCAG 80AGGCCGGT T7TGTCTAAG GGCA0CTTCT GGAGAAGCAA GGAGTTTTCT GTGGTGAGCC TGCCTCTCTA 10320 CTTCCTTTTG GTATCCGGCA GAGTGGCCTT GGCTATGGAA 0T0CTAAGG8 CATCCA0GGC CCTGCTGCTG CTCAGGGAGC TTGGCACTGA 10440 CCTCCCCTAG CTTCCCCCCC TTCCTCCAGG GCCTT0GCTT CTGGCTAGTC AGGGGCACTA TGTGGGGACA GTTATGTGGC CCCGGGCAGA 10560 GGAAAAACAA CACTCGGACT ATTGATTCAG TCTCACCTTG GCGCTCCTG0 CTCAGCAGCC AGTGTTTGCC CTGGCTGCCT G0GCTC8GAT 10680 AGGGGGCAGC TGGG7GCCTC CAGGCCCAGG CAGGGCAGGG CTTGGCGGCC CCAGCGTTCC CCAGGAAACT GTAAAAACGA GTCTGGGGCC 10800 ATA8AGGGAG GTGAAGCCTC TGT7ATCTCC ACCCCCGCAC CCCCTCAGGA CATGGCCCCT AAAAGCACCA TCCCATTTAG GGTCACCCAT 10920 11000 ATAACTCCCT ATGGCTGAAT CTGCTTTGCT GAACTAGGAA ACCCTAGTGA GTTGTATAGA TGCCATTCAT TTAAGAACCT TGAAAAGCCA 0TGATTTGGA GAATACAGTG ACACAAAGAA GAAAATAACA ATTACTT8TA ATTCTCACCA TCTTGAGAGA ACCACCGTTA ACATTTTTTA 11160 TTTGTATAAA TATACATGTA TAAATAGTTT TT8AAAAATA AAATGGGATT ATGTGTACAT ACTGCTTTAT AATGCATCTT CTTTACCTTG 11280 11400 AGGGAGTATC TCTGATTTTG AGTGACTCAG GGCTTGATAG TCTTCATCCT GCTTCTTCTG TTACACTTTT TTTTTTTTTT TTTTTTTGAG CGC0ATCTT0 GCTCACT0CA ACCTCCACCT CCTGGGCTCA AGCAATTCTT GTGCCTCAGC CTCCCAAGTT GCTGGCACTA CAAGCACCCG 11520 A8TAGAGAC9 AGGTTGCACC ATGTTGGCCA GGCTGGCCTC GAACTCCTGA CCTCAG0TGA TCCACCCGCC TCAGCCTCCC AAAGTGCTGG 11640 TTCTGTTACA TTTCTTATAT TCTTGCATGG AGAAAGGATT GGGGGGGATT GTGAGGGCAT CCATCTGTCC ATCTATGTGG CTGGCCTTGG 11760 TTTTTACCCT TCTCCCCCAG ATCTCATTTC AGGCCCCAAT TGAAACTTGG GGTGGGCAGG GCAGCTTTAG TGCGTAGCAG GGGAGGAACT 11880 CTCTCAGGGT CGGCACTCTT CACCCTCCCC GAGCCCT0CC CGTCTCG0CC CCATGCCCCC ACCAGTCAGC CCCGGGCCAC AGGCAGT8AG 12000 GAGCT0AGGC CCTGGTTGCT ATGGGTGGTA GCAGCAACAG 12120 AGGCCAAGGA GACGGGCGCT CCAGGGTCCC AGCCACCTGT CCCCCCCATG AGGGCCAGAA GGTCTTCACC AACACGTGGG CTGTGCGCAT CCCTGGAGGC CCAGCGGTGG CCAACAGTGT GGCACGGAAG CATGGGTTCC 12240 AGGACACTGC CAGGGGGTGG GACCAGAGAA GACAGGGATT CTGGGAGCAG GAGCTGTTGG CCTTGTTTGC TCAGGGGCAT CTGGGTAGCC 12360 T00TGGGTCC CGGACAGGGA GCAGATGCCC 12480 CAGGTGGTTC AGGCAAGCAG CATATCCCAG TGAGAGTGAG TCCTCATGGT CCCCGCATTT 12600 AAGCCGTTGT CCACCCCCGT CCCCCGCCTC CCCGGGGACT GACAGATGGA AA8CCCAGCT CA0TCTCCCT GCTCTATTGC AGATCTTCG8 GACGAAGCGG TCCCTGTCGC CTCACCGCCC GCGCACA0CC GGCT0CAGAG GGAGCCTCAA GTGAGTGTGG CCCCAGCCCC CTCCT00T0 C 12720 TCTCGCCTCC TGCTCCACCC ACACCATCTC TCCCTCACTC CCCCACAGGT ACAGTGGCTG GAACAGCAGG TGGCAAAGCG ACGGACTAAA 12840 AAGTTTCCTC A9CAGT7GTA CCTGGTACGT GGCCTTCTTC 0CT8CTGGGA CCTCCTCCCC AGATGCACCA TCCACCCACT ATGAGCTCTT 12960 CT0ATTCGTT TCCTTTCCTC CTGCT0GGCA 0TCTCTCCTT GGCCCATCCT AATAAGCAAG CCTGGGGGTG GCCCTCCCAG AGTCCTCTAC 13080 GCCTTTCAGG A9CAGGGATG GTACAG0AGG AGGTTTTACG GGGGCAAAGG GATTCTTCAA GATGCTCCTA GCCTCACAAA ACCAATCATG 13200 AGTCTGGTGT CACTCAGCGG GACCTGAATG TGAAG70GGC CTGGGCGCAG GGCTACACAG GGCACGGCAT TGTGGTCTCC ATTCTGGACG 13320 CAGGCAATTA T7TGAGG9AG TCGGGGAGGG AG9CCGTGAT CCCT0CTGAG GTGT0TCTAG AGGCTGTCTT GTTCTATCAG TGAGGTCAGC 13440 ACA0ATGAAT GACAACAGGT AAGAAGTGGC 13560 GGATCCTGGG GCCAGTTTTG ATGTCAATGA CCAGGACCCT GACCCCCAGC CTCGGTACAC CTTCCACTAA 0GAACAG0GC TGA0CCT7GG TGAGATGT7 C CTTGCCTAAA AGGCTGAGGC TTTTCCCACA GTCCTGGCCC ACCTGGGGCT 13680 GCCACAGGCC CAGTGGA0T0 T7GGA0AT7G GG0CTG0GTA CTCA0GGGAT GATG0GT0TC GGATGTGCG0 ATCCCTCGT9 CTCCTCAGGC 13800 GTTCTACTCA TGCTAC0T0 C TT7GCCCT0G CAGGCACGGC ACAC0GT0TG C0G0G0 AAGT G0CTGCG0TG GCCAACAACG GT0TCTGTGG 13920 AGGTGAGT9 T GG0CCTG00C CACCCTGTCT TCAG0AGGGC CCTTCA0TGG AATTTTTCCC GCCCACTTCC CATCTGGCCA ATGCCTGCCA 14040 0CTGTTCCAC 0GAGG0TTCC CAAA0GGTCA AAGACTGAAA GAGCTGGACC CCTGT0GAGT 0AG7GTTTCC TAGCAGGAGT CTCCTGAAAT 14160 CA0ATCCTT7 ATCACCA90C TCTTTT770A ACCAGA0ACT CTTTCTA0TC CT0AATGT0T 0GTT0GGG0 C A0GA0GTGTT CTCTTTTTCC 14280 GCAAACAGGT AGTTG970GT TCAGG7TTTC TCAGCATCA7 CCTCCACCCT TCCCTTACTC ATCCCCTG0 G GTGCA00GCC 0GGTGTGCTC 14400 AT8CA0CATC CCTCTTCGT0 CCCCCCCTTC AC0GCCAGGG GTGCGCATGC TGGATGGCGA GGT0ACAGAT GCA0TGGAG8 CACGCTCGCT 14520 G0GTTAGCCA 1 464C CA0TGCCAGC TG70GCCCC0 A8GAT0ACG9 CAA0ACA0TG 0AT0GGCCAG CCCGCCTCGC CGA0GA0GCC TTCTTCC7TG GGCAG0TTC 0TGCTGTCTT CCGTACCTAT CTT0T0TTTT TT7GTTT7TT TTATTTATTC TCAT0TTTAA CTTTACACAA AGCAT0TTTA 1 4760 A8TGCATATA CCACTT00AC TTGCCTAATT AAAAAACACA TTTTTTTTA7 AAAT8AG8TC TTGCT7T0TT ACCCA00CTG GTCTCAAACT 14880 1 4977 CTCTTGGTCT 0ATTTTTTGT 0TGCCTAGAT A8ACATATTT 0 0ATACATAC TT7ATCCACC AG8 ATC

GGGGAGGAGC GGCCGGCGGC

GGTCCCTGGG CTGCCTTCTG ACCCCGCTTT CCCATCATCC CTGGTGAGGG CGGGTGCTGA TTT7ATTTTT AAAAAGGATT 0CAAATAGTG GGCAGGGACT CC0TGGCCTG CTCT0T0GCT GGTGCTGGAG AAAAGGCTTT CCAGGTTTGA GACAAACTGG

REFERENCES 1. Roebroek, A.J.M. et al. (1986) EMBO J. 5, 2197-2202. 2. Roebroek, A.J.M. et al. (1986) Molec. Biol. Rep. 11, 117-125. 3. Schmid, C.W. and Jelinek, W.R. (1982) Science 216, 1065-1070.

7102

GC CGCCGCG CGGCCGGGCC CTGCCCGCC GTGGCTGGTA CCCCGCCCGC

GCTGTGTGGC TCATCCACCG CAAGCCTGAC TG7TCCACCA GGCTGGTAAC AACAGCAGTC CA0CTCTCTC G00TGG0GA0