or have become accessible in sequence libraries, since the last compilation (271), plus 8 ...... in both libraries but there can be a delay before a sequence submitted to one library arrives in the other one. ...... DmU*~A1P3O. CU. AA. GAC C.
.::j 1990 Oxford University Press
Nucleic Acids Research, Vol. 18, Supplement 2237
Compilation of small ribosomal subunit RNA
sequences
Jean-Marc Neefs, Yves Van de Peer, Lydia Hendriks and Rupert De Wachter* Departement Biochemie, Universiteit Antwerpen, UIA, Universiteitsplein 1, B-2610 Antwerp, Belgium
INTRODUCTION Table 1 lists 275 small ribosomal subunit RNA (further abbreviated as srRNA) sequences (references 1-270) that have been published, or submitted to the EMBL or GenBank nucleotide sequence libraries, to our knowledge. The previous compilation (271) listed 106 srRNA sequences. There is a tendency towards publication of partial, rather than complete sequences. This is a consequence of the availability of new techniques for sequencing or DNA amplification, that allow to collect results faster, but do not allow to determine the complete sequence up to the termini. One such method uses reverse transcription of the RNA with primers complementary to universally conserved sequence areas (272), in which case the sequence adjacent to the 3'-terminus cannot be determined. In another approach (27), the rDNA is amplified by the polymerase chain reaction (273) rather than by cloning. Since the primers for the PCR bind to conserved sequences close to the termini but within the boundaries of the srRNA gene, neither of the terminal sequences is comprised in the analysis. Both these methods, however, can yield a continuous sequence spanning more than 90% of the molecule, provided a sufficient number of primers is used. Some authors (e.g. reference 274) use a limited set of primers, in which case a discontinuous set of partial sequences is obtained, which can nevertheless be aligned with complete sequences from other species and used for phylogenetic studies. The set of 270 different srRNA sequences listed in Table 1 comprises complete sequences and continuous partial sequences, but no discontinuous partial sequences. In order to limit the space needed for the alignment, we have restricted it to 60 sequences. This comprises all the complete sequences that were published, or have become accessible in sequence libraries, since the last compilation (271), plus 8 previously listed sequences added as references in order to allow a comparison of the present alignment and secondary structure scheme with the previous one.
SEQUENCE ALIGNMENT The 60 sequences are listed in 5 groups, consisting of eukaryotic (cytoplasmic) -, archaebacterial -, eubacterial -, plastidial -and mitochondrial srRNAs. Each group comprises a sequence already aligned in the previous compilation, viz. Homo sapiens for the eukaryotic cytoplasmic srRNAs, Halobacterium cutirubrum for the archaebacterial srRNAs, Escherichia coli for the eubacterial srRNAs, Zea mays for the plastid srRNAs. Because of the extreme variability in length and secondary structure of mitochondrial srRNA sequences, an animal -, a plant -, a fungal -and a flagellate sequence were added as references in this group. The species chosen are H. sapiens, Glycine max, *
To whom
correspondence
should be addressed
Saccharomyces cerevisiae and Leishmania tarentolae. The sequences are identified on each alignment page by a number corresponding with that in Table 1 and with the literature reference, and by the initials of the species name. Alignment positions are numbered at the top and bottom of each page. In addition, E. coli srRNA nucleotide positions counting from its 5'-terminus are indicated above the eubacterial sequences. Regardless of the analytical method used, all sequences are listed using the ribonucleotide symbols U, C, A, and G. Posttranscriptional modifications are known for few sequences and not indicated in the alignment, but if data are available this is mentioned by a footnote in the last column of Table 1. The symbol X is used for unidentified and the symbols Y (pyrimidine nucleotide) and R (purine nucleotide) for incompletely identified nucleotides. These symbols point to uncertainty in the identification of a nucleotide in the case of analyses performed on cloned or amplified DNA, which examine the structure of a single gene. In the case of sequencing by reverse transcription of srRNA, however, they can point either to analytical uncertainty or to sequence heterogeneity, since the srRNA used as template may be a mixture of different sequences. Gaps introduced in order to optimize the alignment are filled with hyphens. On the contrary, dots are used for an interruption of the sequence due to partial sequencing. Hence, a sequence interrupted by 10 dots means that partial sequences are known to the left and to the right of the interruption, but that the length of the separating stretch is unknown. An interruption by 10 X's means that exactly 10 nucleotides are present, but their identity is unknown. A sequence starting or ending with dots means that sequencing has not reached the terminus. Lower case characters at termini are used to indicate length heterogeneity of the RNA molecules. It should be noted that the alignment of the sequences for optimal similarity is straightforward in areas of relatively conserved primary and secondary structure, but is much more arbitrary in the structurally more variable areas. Alignment rests mostly on the periodic occurrence of conserved sequence motifs and on the observation of compensating substitutions in complementary areas of the secondary structure (see below). In -,
the variable areas, conserved sequence motifs are rarer and at the same time the sequences differ in length. Whether compensating substitutions are seen or not then depends on how the gaps are placed, which makes their observation less
meaningful. Sequence segments presumedly involved in base pairing according to the secondary structure model described below are indicated by shading superimposed on the alignment. Shaded
2238 Nucleic Acids Research, Vol. 18, Supplement
V7 41
40
39 42
V6
P41-3
P35-1
V8
18 P17-1
V3
48
V9
17
vi 121
9
11
V2
10
Fig. 1. Secondary structure model for prokaryotic srRNAs. The 5'-terminus is symbolized by a filled circle and the 3'-terminus by an arrowhead. Helices are numbered in the order of occurrence from 5'- to 3'-terminus. Helices bearing a single number are common to the prokaryotic and eukaryotic (Fig. 2) models. A composite number preceded by P points to a prokaryote-specific helix. Relatively conserved areas are drawn in bold lines, areas of sequence-and length variability in thin lines. Eight variable areas, numbered VI to V9, are distinguished, V4 being absent in prokaryotic srRNAs. Helices drawn in broken lines are present in a small number of known structures only. Archaebacterial sequences follow the prokaryotic pattern except for helix 35, which is unbranched as in eukaryotes.
Nucleic Acids Research, Vol. 18, Supplement 2239 areas corresponding to complementary strands of helices 1 to 48 of the secondary structure model are numbered 1 and 1' to 48 and 48'. Helix numbers are listed twice, the uppermost row applying to the eukaryotic sequences, and the lowermost row to the bacterial -, plastidial -, and mitochondrial sequences. Internal loops and bulges in a helix are indicated by interruption of the shading. Bases suspected to belong to pairs other than the WatsonCrick pairs G * C, A * U, or the wobble pair G * U, because they are intercalated between Watson-Crick and/or wobble pairs, are put in parentheses.
SECONDARY STRUCTURE MODEL The secondary structure models adopted for indication of the double stranded areas in the alignment are shown in Fig. 1 and 2. The prokaryotic model of Fig. 1 applies to archaebacterial, eubacterial, plastidial, and mitochondrial srRNAs. The model of Fig. 2 applies to eukaryotic srRNAs. Helices common to both models, further called universal helices, are numbered 1 to 48, in the order of occurrence of the 5'-proximal strand when the sequence is scanned from 5' to 3'-end. Helix numbers change
V7 40
41
39
35 E21-9 II
E43-1
V5 27
44
E21-7
V8
V4
E21-1 48
V3
V9
E21-4 14
vi 121
11
9
V2 E10-2
E10-1
Fig. 2. Secondary structure model for eukaryotic srRNAs. Symbols are as in Fig. 1. Helices bearing a composite number preceded by E are eukaryote-specific. Variable area V6 is missing in eukaryotic srRNAs.
2240 Nucleic Acids Research, Vol. 18, Supplement at branching points and at pseudoknot loops, not at internal- and bulge loops. Helices specific to the prokaryotic model are numbered Pa-b, where a is the number of the preceding universal helix and b is a serial number. Helices specific to the eukaryotic model are similarly numbered Ea-b. Structurally conserved areas are drawn in bold lines, whereas structurally variable areas, labeled VI to V9, are drawn in thin lines. Helices that are present in a limited number of species are drawn in broken lines. Such is the case for e.g. helices P41-1 to P41-3, specific for plant mitochondrial srRNAs, or helix E10-3, hitherto found only in Euglena gracilis cytoplasmic srRNA. Conversely, helices can be absent in certain srRNAs. As an example, the diplomonad Giardia kamblia misses helices E21-1 to E21-4. Even the 'universal' helices do not occur in all srRNAs, the exceptions being found among the mitochondrial srRNAs. Those from animal mitochondria retain only 36 of the universal helices, with most of the missing ones belonging to variable areas. In flagellate mitochondrial srRNAs the structure retains only 25 universal helices, with the entire area consisting of helices 31 to 45 missing. The Fig. 1 model should therefore be regarded, not so much as a general prokaryotic model, but rather as a model for eubacterial and plastidial srRNA secondary structure, from which mitochondrial srRNA structures can be derived by addition (in the case of plant mitochondria) or subtraction of sets of helices. As for the archaebacterial srRNAs, these differ from the eubacterial ones only by lacking the branching point leading to helices P35-1 and P35-2. The models of Fig. 1 and 2 concur with those proposed by Gutell et al. (275) as far as the conserved areas are concerned, and for a fraction of the variable areas. However, we propose a structure for a long insertion in area V7 found in plant mitochondrial srRNAs, and for area V4 in eukaryotic srRNAs, both left undefined in (275). The latter structure is different from that adopted in the previous compilation (271) and comprises a pseudoknot. In addition, for areas V2 and V3 of the eukaryotic model, we propose a base pairing scheme slightly different from that of Gutell et al. (275). Although most of the structures that we propose in the variable areas are supported by observation of compensating substitutions, (unpublished) they should nevertheless be regarded as tentative in view of the difficulties connected with alignment in these areas explained above.
AVAILABILITY OF THE DATA Of the 270 different srRNA sequences listed in Table 1, 197 were in our computer file at the time of writing, in aligned form and with delimitation of secondary structure elements. The remaining ones, which are not listed in the papers reporting them but have been submitted to GenBank, have only recently become available in the file server of this library. They will be aligned as soon as possible. The sequences will be available on floppy disks, readable on microcomputers operating under MS-DOS, in the following three formats: 1) In the form of an alignment with indication of secondary structure elements. Aligned sequences will be listed on 42 pages each containing 100 alignment positions of all the sequences. This format is most useful for those wishing to produce a hardcopy of the complete alignment, rather than the sample of 60 sequences listed here. From this file, the complete alignment can be printed using a wide carriage printer, condensed print and reduced line spacing. Note, however, that this takes about 252 pages of 15 inch wide printer paper.
2) The sequences, listed one by one, without indication of secondary structure elements, but interspersed with the gaps required for alignment, i.e. with homologous nucleotides in the same position in each sequence. 3) The sequences, listed one by one, written continuously without gaps or secondary structure-describing symbols. The number of formatted floppy disks of different types that should be sent in order to obtain each of these files is listed in Table 2. In addition, all these files can be obtained on a TK50 tape, suitable for a MicroVax computer operating under VMS 5.0 or
higher. ACKNOWLEDGEMENTS Our work was supported in part by the Fund for Medical Scientific Research and in part by the Incentive Program for Fundamental Research in the Life Sciences of the Belgian Office for Science Policy Programming (grant BIO/03). J.N. and Y.V.d.P. are holders of a scholarship from the Institute for Scientific Research in Agriculture and Industry. We thank A. Wilmotte, A. Goris, and R. De Baere for their assistance in the correction of the computer files.
REFERENCES 1. McCallum,F.S. and Maden,B.E.H. (1985) Biochem. J., 232, 725-733. 2. Torczynski,R.M., Fuke,M. and Bollon,A.P. (1985) DNA , 4, 283-291. 3. Gonzales,I.L. and Schmickel,R.D. (1986) Am. J. Hum. Genet. 38, 419-427. 4. Raynal,F., Michot,B. and Bachellerie,J.P. (1984) FFBSleu., 167, 263-268. 5. Torczynski,R., Bollon,A.P. and Fuke,M. (1983) Nucleic Acids Res., 11, 4879-4890. 6. Chan,Y.L., Gutell,R., Noller,H.F. and Wool,I.G. (1984) J. Biol. Chem., 259, 224-230. 7. Rairkar,A., Rubino,H.M. and Lockard,R.E. (1988) Nucleic Acids Res., 16, 3113. 8. Salim,M. and Maden,B.E.H. (1981) Nature, 291, 205-208. 9. same reference as 1. 10. Nelles,L., Fang,B.L., Volckaert,G., Vandenberghe,A. and De Wachter,R. (1984) Nucleic Acids Res., 12, 8749-8768. 11. Hendriks,L., Van Broeckhoven,C., Vandenberghe,A., Van de Peer,Y. and De Wachter,R. (1988) Eur. J. Biochem., 177, 15-20. 12. Hendriks,L., De Baere,R., Van Broeckhoven,C. and De Wachter,R. (1988) FEBS Let., 232, 115-120. 13. Tautz,D., Hancock,J.M., Webb,D.A., Tautz,C. and Dover,G.A. (1988) Mol. BioL Evol., 5, 366-376. 14. Ellis,R.E., Sulston,J.E. and Coulson,A.R. (1986) Nucleic Acids Res., 14, 2345-2364. 15. Takaiwa,F., Oono,K. and Sugiura,M. (1984) Nucleic Acids Res., 12, 5441-5448. 16. Kiss,T., SzkukAlek,A. and Solymosy,F. (1989) Nucleic Acids Res., 17, 2127. 17. Messing,J., Carlson,J., Hagen,G., Rubenstein,I. and Oleson,A. (1984) DNA, 3, 31-40. 18. Eckenrode,V.K., Arnold,J. and Meagher,R.B. (1985) J. Mol. Evol., 21, 259-269. 19. Unfried,I., Stocker,U. and Gruendler,P. (1989) Nucleic Acids Res., 17, 7513. 20. Nairn,C.J. and Ferl,R.J. (1988) J. Mol Evol., 27, 133-141. 21. Gunderson,J.H., Elwood,H., Ingold,A., Kindle,K. and Sogin,M.L. (1987) Proc. Natl. Acad. Sci. USA, 84, 5823-5827. 22. Rausch,H., Larsen,N. and Schmitt,R. (1989) J. Mol. Evol., 29, 255-265. 23. Huss,V.A.R. and Sogin,M.L. (1989) Nucleic Acids Res., 17, 1255. 24. Sargent,M.,Zahn,R., Walters,B., Gupta,R. and Kaine,B. (1988) Nucleic Acids Res., 16, 4156. 25. same reference as 21. 26. Bhattacharya,D. and Druehl,L.D. (1988) J. Phycol., 24, 539-543. 27. Medlin,L., Elwood,H.J., Stickel,S. and Sogin,M.L. (1988) Gene, 71, 491-499. 28. Sogin,M.L., Miotto,K. andMiller,L. (1986)NucleicAcidsRes., 14,9540. 29. Manlin,A.S., Sliyabin,K.G. and Rubtsov,P.M. (1986) Gene, 44,143-145.
Nucleic Acids Research, Vol. 18, Supplement 2241 30. Edman,J.C., Kovacs,J.A., Masur,H., Santi,D.V., Elwood,H.J. and Sogin,M.L. (1988) Nature, 334, 519-522. 31. same reference as 21. 32. Mc Carroli,R., Olsen,G.J., Stahl,Y.D., Woese,C.R. and Sogin,M.L. (1983) Biochemistry, 22, 5858-5868. 33. Johansen,T., Johansen,S. and Haugli,F.B. (1988) Curr. Genet., 14, 265-273. 34. Sogin,M.L., Swanton,M.T., Gunderson,J.H. and Elwood,H.J. (1986) J. ProtozooL, 33, 26-29. 35. Elwood,H.J., Olsen,G.J. and Sogin,M.L. (1985) Mol. Biol. Evol., 2, 399-410. 36. same reference as 35. 37. Spangler,E.A. and Blackburn,E.H. (1985) J. Biol. Cem., 260, 6334-6340. 38a. Sogin,M.L., Ingold,A., Karlok,M., Nielsen,H. and Engberg,J. (1986) EMBO J., 5, 3625-3630. 38b-46. same reference as 38a. 47. Sogin,M.L. and Elwood,H.J. (1986) J. Mol. Evol., 23, 53-60. 48. Herzog,M. and Maroteaux,L. (1986) Proc. Natl. Acad. Sci. USA, 83, 8644-8648. 49. Gunderson,J.H., McCutchan,T.F. and Sogin,M.L. (1986) J. Protozool., 33, 525-529. 50. Gunderson,J.H., Sogin,M.L., Woliett,G., Hollingdale,M., de la Cmz,V.F., Waters,A.P. and McCutchan,T.F. (1987) Science, 238, 933-937. 51. McCutchan,T.F., de la Cruz,V.F., Lal,A.A., Gunderson,J.H., Elwood,H.J. and Sogin,M.L. (1988) Mol. Biochem. Parasitol., 28, 63-68. 52. same reference as 51. 53. Waters,A.P., Unnasch,T.R., Wirth,D.F. and McCutchan,T.F. (1989) Nucleic Acids Res., 17, 1763. 54. Waters,A.P. and McCutchan,T.F. (1989) Nucleic Acids Res., 17, 2135. 55. Gunderson,J.H. and Sogin,M.L. (1986) Gene, 44, 63-70. 56. Clark,C.G. and Cross,G.A.M. (1988) MoL. Biol. Evol., 5, 512-518. 57. Sogin,M.L., Elwood,H.J. and Gunderson,J.H. (1986) Proc. Natl. Acad. Sci. USA, 83, 1383-1387. 58. Schnare,M.N., Collings,J.C. and Gray,M.W. (1986) Curr. Genet., 10, 405-410. 59. Looker,D., Miller,L.A., Elwood,H.J., Stickel,S. and Sogin,M.L. (1988) Nucleic Acids Res., 16, 7198. 60. same reference as 57. 61. Vossbrinck,C.R., Maddox,J.V., Friedman,S., DeBrunner-Vossbrinck,B.A. and Woese,C.R. (1987) Nature, 326, 411-414. 62. Sogin,M.L., Gunderson,J.H., Elwood,H.J., Alonso,R.A. and Peattie,D.A. (1989) Science, 243, 75-77. 63. Hui,I. and Dennis,P.P. (1985) J. Biol. Chem., 260, 899-906. 64. Mankin,A.S., Kagramanova,V.K., Teterina,N.L., Rubtsov,P.M., Belova,E.N., Kopylov,A.M., Baratova,L.A. and Bogdanov,A.A. (1985) Gene, 37, 181-189. 65. Gupta,R., Lanter,J.M. and Woese,C.R. (1983) Science, 221, 656-659. 66. Leffers,H. and Garrett,R.A. (1984) EMBO J., 3, 1613-1619. 67. Yang,D., Kaine,B.P. and Woese,C.R. (1985) System. Appl. Microbiol., 6, 251-256. 68. Eggen,R., Hannsen,H., Geerling,A. and deVos,W.M. (1989) Nucleic Acids Res., 17, 9469. 69. Jarsch,M. and Bock,A. (1985) System. Appl. Microbiol., 6, 54-59. 70. Lechner,K., Wich,G. and B6ck,A. (1985) System. AppL MicrobioL, 6, 157-163. 71. Oestergaard,L., Larsen,N., Leffers,H., Kjems,J. and Garrett,R.A. (1987) System. Appl. Microbiol., 9, 199-209. 72. Achenbach-Richter,L., Gupta,R., Zillig,W. and Woese,C.R. (1988) System Appl. Microbiol., 10, 231-240. 73. Ree,H.K., Cao,K., Thurlow,D.L. and Zimmermann,R.A. (1989) Can. J. Microbiol., 35, 124-133. 74. Kjens,J., Garrett,R.A. and Ansorge,W. (1987) System. Appl. Microbiol., 9, 22-28. 75. Kaine,B.P., Schurke,C.M. and Stetter,K.O. (1989) System. Appl. Microbiol., 12, 8-14. 76. Olsen,G.J., Pace,N.R., Nuell,M., Kaine,B.P., Gupta,R. and Woese,C.R. (1985) J. Mol. Evol., 22, 301-307. 77. Leinfelder,W., Jarsch,M. and Bock,A. (1985) System. Appl. Microbiol, 6, 164-170. 78. Achenbach-Richter,L., Stetter,K.O. and Woese,C.R. (1987) Nature, 327, 348-349. 79. Yang,D., Oyaizu,Y., Oyaizu,H., Olsen,G.J. and Woese,C.R. (1985) Proc. Natl. Acad. Sci. USA, 82, 4443-4447. 80. Dorsch,M., Moreno,E. and Stackebrandt,E. (1989) Nucleic Acids Res., 17, 1765. 81. Weisburg,W.G., Dobson,M.E., Samuel,J.E., Dasch,G.A., Mallavia,L.P.,
Baca,O., Mandelco,L., Sechrest,J.E., Weiss,E. and Woese,C.R. (1989) J. Bacteriol., 171, 4202-4206. 82. Stackebrandt,E., Fischer,A., Roggentn,T., Wehmeyer,U., Bomar,D. and Smida,J. (1988) Arch. Microbiol., 149, 547-556. 83-85. same reference as 81. 86. Weisburg,W.G., Woese,C.R., Dobson,M.E. and Weiss,E. (1985) Science, 230, 556-558. 87. Dewhirst,F.E., Paster,B.J. and Bright,P.L. (1989) Int. J. Syst. Baceriol., 39, 258-266. 88-94. same reference as 87. 95. Rossau,R., Heyndrickx,L. and Van Heuverswyn,H. (1988) Nucleic Acids Res., 16, 6227. 96. Toschka,H.Y., Hopfl,P., Ludwig,W., Schleifer,K.H., Ulbrich,N. and Erdmann,V.A. (1988) Nucleic Acids Res., 16, 2348. 97. same reference as 87. 98. same reference as 79. 99. same reference as 87. 100. Unterman,B.M., Baumann,P. and McLean,D.L. (1989) J. Bacteriol., 171, 2970-2974. 101. same reference as 100. 102. Woese,C.R. (1989) unpublished. 103. same reference as 81. 104. same reference as 102. 105. Brosius,J., Dull,T.J., Sleeter,D.D. and Noller,H.F. (1981) J. Mol. BioL, 148, 107-127. 106. Carbon,P., Ebe1,J.P. and Ehresmann,C. (1981) Nucleic Acids Res., 9, 2325-2333. 107. Martens,B., Spiegl,H. and Stackebrandt,E. (1987) System Appl. Microbiol., 9, 224-230. 108. same reference as 81. 109-110. same reference as 102. 111. Oyaizu,H. and Woese,C.R. (1985) System. Appl. Microbiol., 6, 257-263 112. same reference as 102. 113. same reference as 111. 114. Lau,P.P., DeBrunner-Vossbrinck,B., Dunn,B., Miotto,K., MacDonell,M.T., Rollins,D.M., Pillidge,C.J., Hespell,R.B., Colwell,R.R., Sogin,M.L. and Fox,G.E. (1987) System. Appl. Microbiol., 9, 231-238. 115. Weisburg,W.G., Tully,J.G., Rose,D.L., Petzel,J.P., Oyaizu,H., Yang,D., Mandelco,L., Sechrest,J., Lawrence,T.G., Van Etten,J., Maniloff,J. and Woese,C.R. (1989) J. Bacteriol., 171, 6455-6467. 116-123. same reference as 115. 124. Green,C.J., Stewart,G.C., Hollis,M.A., Vold,B.S. and Bott,K.F. (1985) Gene, 37, 261-266. 125-126. same reference as 115. 127. Zhao,H., Yang,D., Woese,C.R., Bryant,M.P. (1989) Int. J. Syst. Bacteiol., submitted. 128-129. same reference as 115. 130. Yang, D and Woese,C.R. (1989) System. AppL Microbiol., 12, 145- 149. 131-134. same reference as 130. 135. same reference as 115. 136. Collins,M.D., Ash,C., Farrow,J.A.E., Wallbanks,S. and Williams,A.M. (1989) J. AppL BacterioL, 67, 453-460. 137-139. samne reference as 136. 140-144. same reference as 130. 145-149. same reference as 115. 150. Iwami,M., Muto,A., Yamao,F. and Osawa,S. (1984) MoL Gen. Genet., 196, 317-322. 151-152. same reference as 115. 153. Woese,C.R. and Gutell,R.R. (1989) Proc. Natl. Acad. Sci. USA, 86, 3119-3122. 154. same reference as 115. 155. Taschke,C., Ruland,K. and Herrmann,R. (1987) Nucleic Acids Res., 15, 3918. 156-170. same reference as 115. 171. Frydenberg,J. and Christiansen,C. (1985) DNA, 4, 127-137. 172. same reference as 115. 173. Lim,P.O. and Sears,B.B. (1989) J. Bacteiol., 171, 5901-5906. 174-185. same reference as 115. 186. same reference as 136. 187. Collins,M.D., Smida,J. and Stackebrandt,E. (1989) Int. J. Syst. Bacteriol., 39, 7-9. 188. same reference as 127. 189. Suzuki,Y., Nagata,A., Ono,Y. and Yamada,T. (1988) J. BacterioL., 170, 2886-2889. 190. Edwards,U., Rogall,T., Blocker,H., Emde,M. and Bbttger,E.C. (1989) Nucleic Acids Res., 17, 7843-7853.
2242 Nucleic Acids Research, Vol. 18, Supplement 191. Collins,M.D., Dorsch,M. and Stackebrandt,E. (1989) Int. J. Syst. Bacteriol., 39, 1-6. 192-194. same reference as 191. 195. Charfreitag,O. and Stackebrandt,E. (1989) J. Gen. Microbiol., 135, 2065-2070. 196-200. same reference as 195. 201. Pernodet,J.L., Boccard,F., Alegre,M.T., Gagnat,J. and Guerineau,M. (1989) Gene, 79, 33-46. 202. Baylis,H.A. and Bibb,M.J. (1987) Nucleic Acids Res., 15, 7176. 203. Suzuki,Y. and Yamada,T. (1988) Nucleic Acids Res., 16, 370. 204. same reference as 191. 205. Collins,M.D., Smida,J., Dorsch,M. and Stackebrandt,E. (1988) Int. J. Sv'st.
Bacteriol., 38, 385-391. 206. Woese,C.R., DeBrunner-Vossbrinck,B.A., Oyaizu,H., Stackebrandt,E. and
Ludwig,W. (1985) Science, 229, 762-765. 207. Murzina,N.V., Vorozheykina,D.P. and Matvienko,N.I. (1988) Nucleic Acids
Res., 16, 8172. 208. Weisburg,W.G., Giovannoni,S.J. and Woese,C.R. (1989) System. Appl.
Microbiol., 11, 128-134. 209. Weisburg,W.G., Oyaizu,Y., Oyaizu,H. and Woese,C.R. (1985) J.
Bacteriol., 164, 230-236. 210. same reference as 209. 211. Weisburg,W.G., Hatch,T.P. and Woese,C.R. (1986) J. Bacteriol., 167, 570-574. 212. Oyaizu,H., DeBrunner-Vossbrinck,B., Mandelco,L., Studier,J.A. and
Woese,C.R. (1987) System. Appl. Microbiol., 9, 47-53. 213-214. same reference as 212. 215. Witt,D., Bergstein-Ben Dan,T. and Stackebrandt,E. (1989) Arch. Microbiol.,
152, 206-208. 216. Tomioka,N. and Sugiura,M. (1983) Mol. Gen. Genet., 191, 46-50. 217. Achenbach-Richter,L., Gupta,R., Stetter,K.O. and Woese,C.R. (1987) System. Appl. Microbiol., 9, 34-39. 218-219. same reference as 102. 220-221. same reference as 127. 222. Schwarz,Z. and Kossel,H. (1980) Nature, 283, 739-742. 223. Hiratsuka,J., Shimada,H., Whittier,R., IshibashiT., Sakamoto,M.,
224. 225. 226.
227.
Mori,M., Kondo,C., Honji,Y., Sun,C.R., Meng,B.Y., Li,Y.Q., Kanno,A., Nishizawa,Y., Hirai,A., Shinozaki,K. and Sugiura,M. (1989) Mol. Gen. Genet., 217, 185-194. Tohdoh,N. and Sugiura,M. (1982) Gene, 17, 213-218. Von Allmen,J.M. and Stutz,E. (1988) Nucleic Acids Res., 16, 1200. Ohyama,K., Fukuzawa,H., Kohchi,T., Shirai,H., Sano,T., Sano,S., Umesono,K., Shiki,Y., Takeuchi,M., Chang,Z., Aota,S.I., Inokuchi,H. and Ozeki,H. (1986) Plant. Mol. Biol. Rep., 4, 148-175. Dron,M., Rahire,M. and Rochaix,J.D. (1982) Nucleic Acids Res., 10,
7609-7620. 228. Durocher,V., Gauthier,A., Bellemare,G. and Lemieux,C. (1989) Curr.
Genet., 15, 277-282. reference as 228. Yamada,T. (1988) Nucleic Acids Res., 16, 9865. Huss,V.A.R. and Giovannoni,S.J. (1989) Nucleic Acids Res., 17, 9487. Witt,D. and Stackebrandt,E. (1988) Arch. Microbiol., 150, 244-248. Graf,L., Roux,E., Stutz,E. and Kossel,H. (1982) Nucleic Acids Res., 10, 6369-6381. 234. Eperon,I.C., Anderson,S. and Nierlich,D.P. (1980) Nature, 286, 460-467. 235. Hixson,J.E. and Brown,W.M. (1986) Mol. Biol. Evol., 3, 1-18. 236-238. same reference as 235. 239. Van Etten,R.A., Walberg,M.W. and Clayton,D.A. (1980) Cell, 22, 157-170. 240. Kobayashi,M., Seki,T., Yaginuma,K. and Koike,K. (1981) Gene, 16, 297-307. 241. Gadaleta,G., Pepe,G., De Candia,G., Quagliariello,C., Sbisa,E. and Saccone,C. (1989) J. Mol. Evol., 28, 497-516. 242. Anderson,S., de Bruijn,M.H.L., Coulson,A.R., Eperon,I.C., Sanger,F. and Young,I.G. (1982) J. Mol. Biol., 156, 683-717. 243. Nagae,Y., Fujii,H., Yoneyama,Y., Goto,Y. and Okazaki,T. (1988) Nucleic Acids Res., 16, 10363. 244. Roe,B.A., Ma,D.P., Wilson,R.K. and Wong,J.F.H. (1985) J. Biol. Chem., 260, 9759-9774. 245. Clary,D.O. and Wolstenholme,D.R. (1985) Nucleic Acids Res., 13, 4029-4045. 246. Clary,D.O. and Wolstenholme,D.R. (1987) J. Mol. Evol., 25, 116- 125. 247. Jacobs,H.T., Elliott,D.J., Math,V.B. and Farquharson,A. (1988) J. Mol. 229. 230. 231. 232. 233.
same
248.
Cantatore,P., Roberti,M., Rainaldi,G., Gadaleta,M.N. and (1989) J. Biol. Chem., 264, 10965-10975.
Biol., 202, 185-217. Saccone,C.
249. Chao,S., Sederoff,R. and Levings,C.S.III. (1984) Nucleic Acids Res., 12, 6629-6644. 250. Dale,R.M.K., Mc Clure,B.A. and Houchins,J.P. (1985) Plasmid, 13, 31-40. 251. Gwynn,B., Dewey,R.E., Sederoff,R.R., Timothy,D.H. and Levings,C.S.III. (1987) Theor. Appl. Genet., 74, 781-788. 252. Spencer,D.F., Schnare,M.N. and Gray,M.W. (1984) Proc. Natl. Acad. Sci. USA, 81, 493-497. 253. Grabau,E.A. (1985) Plant Mol. Biol., 5, 119-124. 254. Brennicke,A., Moller,S. and Blanz,P.A. (1985) Mol. Gen. Genet., 198, 404-410. 255. Boer,P.H. and Gray,M.W. (1988) Cell, 55, 399-411. 256. Sor,F. and Fukuhara,H. (1980) C. R. Acad. Sc. Paris Serie D, 29, 933 -936. 257. Li,M., Tzagoloff,A., Undebrink-Lyon,K. and Martin,N.C. (1982) J. Biol. Chem., 257, 5921-5928. 258. Huttenhofer,A., Sakai,H. and Weiss-Brummer,B. (1988) Nucleic Acids Res., 16, 8665-8674. 259. same reference as 258. 260. Dyson,N.J., Brown,T.A., Wafing,R.B. and Davies,R.W. (1989) Gene, 75, 109-118. 261. Trinkl,H., Lang,B.F. and Wolf,K. (1989) Nucleic Acids Res., 17, 6730. 262. Cummings,D.J., Domenico,J.M., Nelson,J. and Sogin,M.L. (1989) J. Mol. Evol., 28, 232-241. 263. Labriola,J., WeissI., Zapatero,J. and Suyama,Y. (1987) Curr. Genet., 11, 529-536. 264. Schnare,M.N., Heinonen,T.Y.K., Young,P.G. and Gray,M.W. (1986) J. Biol. Chem., 261, 5187-5193. 265. Seilhamer,J.J., Olsen,G.J. and Cummings,D.J. (1984) J. Biol. Chem., 259, 5167-5172. 266. same reference as 265. 267. Eperon,I.C., Janssen,J.W.G., Hoeijmakers,J.H.J. and Borst,P. (1983) Nucleic Acids Res., 11, 105 - 125. 268. Sloof,P., Van den Burg,J., Voogd,A., Benne,R., Agostinelli,M., Borst,P., Gutell,R. and Noller,H. (1985) Nucleic Acids Res., 13, 4171-4190. 269. de la Cruz,V.F., Lake,J.A., Simpson,A.M. and Simpson,L. (1985) Proc. Natl. Acad. Sci. USA, 82, 1401-1405. 270. Lake,J.A., de la Cruz,V.F., Ferreira,P.C.G., Morel,C. and Simpson,L. (1988) Proc. Natl. Acad. Sci. USA, 85, 4779-4783. 271. Dams,E., Hendriks,L., Van de Peer,Y., Neefs,J., Smits,G., Vandenbempt, I. and De Wachter,R. (1988) Nucleic Acids Res., 16, r87-rl73. 272. Lane,D.J., Pace,B., Olsen,G.J., Stahl,D.A., Sogin,M.L. and Pace,N.R. (1985) Proc. Natl. Acad. Sci. USA, 82, 6955-6959. 273. Saiki,R.K., Gelfand,D.H., Stoffel,S., Scharf,S,J., Higuchi,R., Horn,G.T., Mullis,K.B. and Erlich, H.A. (1988) Science, 239, 487-490. 274. Field,K.G., Olsen,G.J., Lane,D.J., Giovannoni,S.J., Ghiselin,M.T., Raff,E.C., Pace,N.R. and Raff,R.A. (1988) Science, 239, 748-753. 275. Gutell,R.R., Weiser,B., Woese,C.R. and Noller,H.F. (1985) Progr. Nucl. Acids Res. Mol. Biol., 32, 155-216. 276. Brosius,J., Palmer,M.L., Kennedy,P.J. and Noller,H.F. (1978) Proc. Natl. Acad. Sci. USA, 75, 4801-4805. 277. Kaestner,A. (1969) In Kaestner,A. (ed.) Lehrbuch der speziellen Zoologie. Gustav Fischer Verlag, Stuttgart, Vol. 1, pp 96-789. 278. Engler,A. (1954,1964) Syllabus der Pflanzenfamilien. GebrUder Bomtraeger, Berlin, Vol. 1 & 2. 279. Fritsch,F.E. (1972) The Structure and Reproduction of the Algae. University Press, Cambridge, Vol. 1 & 2. 280. Ainsworth,G.C., Sparrow,F.K. and Sussman,A.S. (1973) The Fungi: an Advanced Treatrise. Academic Press, New York, Vol. 4A. 281. Wetzel, A. (1969) in Kaestner,A. (ed.) Lehrbuch der speziellen Zoologie. Gustav Fischer Verlag, Stuttgart, Vol. 1, pp 21-95. 282. Corliss,J.O. (1984) BioSystems, 17, 87-126. 283. Woese,C.R. (1987) Microbiol. Rev., 61, 221-271. 284. Stackebrandt,E., Murray,R.G.E. and Truper,H.G. (1988) Int. J. Syst. Bacteriol., 38, 321-325. 285. Wayne,L.G., Brenner,D.J., Colwell,R.R., Grimont,P.A.D., Kandler, O., Krichevsky,M.I., Moore,L.H., Moore,W.E.C., Murray,R.G.E., Stackebrandt,E., Starr,M.P. and TrUper,H.G. (1987) Int. J. Syst. Bacteriol., 37, 463-464.
'14
2-
to C'11-.-mflO'.4la0r-0tfl
t-I
NifO
o00 M 00U.-rN 0.- 0 0 -0 00
XzXXzXXX
^0
0O.-O
co to C( f-coN
lU) .-Mm
.-aO- 4
00I0oO.-
.-O
00.-
O,-OOO.-
.
4,
3
-------o000 -ow0000
Q~
0OC'4M 04flN0 N1-
XXXXXX
XXX
Nucleic Acids Research, Vol. 18, Supplement 2243
XX
1@01h
0%Ov4vm
r.m0
r
-
-
--
-w--
--9
--
HH
__O4)
H)-. *-.-..
.0
00
00
-
EXZZXXX)4
000.-.-D MO N 4N N MNW -4' 4 CVC4NN NN(N NN NN .--
--
U t)--0 * HU.0 0 0 0
00
.----.- 0 0,- 0
Xzz
0004
MOM mm
N%o 0 '4t- 4OO IIrgr%Ow0
wk O
N
0 -.-
X>X ~XXXXE
--------N----__________________M_
1-1--1-I---
O1 00
.
00
0
0 muI0 -.OOWOOIJ *0 0U *00000.o.OOO -VV H 0000 .HH--.-. *0000000fa 04, HO-"4-0 -'00000000000000 -----0_, H 4....4 ~ 00 -.--.-.-.-..-.I0 . -H_ . 00 __.___ ......... '4.H'40 00>. 004)0000 V VI000 OH 00.00.000000000000 VI A 00 00 0 4 0. .0.0.0.0.0.0.0.0.0.0.0.0.0 1.414 00-r- r-4 4j j Li so $4 LI u w ut u Li LI Li LI $4 YAQ to QO.OH 154 -4X 0 01. R VI fa4 O ) 4 t0>.>. >. >4 t>0 H 0'-.44000000000-0 OOHH 0'4tJo 0 1 >.0 v to o 0 0la(a 000e 0 ` V 4,'aH 0 1. 1-. '-' H-* .H 4H '4- `4 -' - `4'-H-- 'H u 4 * A ` .4.4 -*. V4 0 4 1 HH -004 v00v 0>.) 0000 Vv vv . 4 Ao 00 0 000 .4'44, , 4,4,4, 4v > iXOOv > 0HaZ 1=VoQUVVf 000000A4.4 .04 OOOH1.a wa Vu"OO0W W4 00 v fiv 00000000 v fi O 04 4,41.40 000 4O 4v 4 h v00 °0 0 u 00 0 0 -H -H >1 0 0 >.4 Q0 Q4 O>.001.. 0
HHH
-
I~~~~~~~~~~ I
2200U> 9U
f
a:-.40Iu U I0
0a -
I
0
0
0
>J Qti
c!
X O -} _ . . . . . . . . . . . H . . . . . 0_000 -
p pe
-
-
-
-
0>.~~
-9
V
ZX
^ffiffiffi0ffiattv...... :00> 1-. 1.4.0.0.0.00 .000 14 fWo00iii0iii0ffii OUO 0 00.000 .0.0.0.0.0.0.0.0.01-.1-.ZS 4,
4,4,4,4,4,4,4,4,0
00000
$-
H0 O 04 1.4H OO 90 0 '4 0VIO 000 0 0HHHHHHHHHHHHHU H lZhZZ )04h4 4,i 4,4 ,4, Z 00'0 HHHH4040(6 '0' oO p000 0000000000UUUUUU 000000000rt 00 00 0000 4
k
0
I I
"I -i
HHvr
9
I H
U;
aF
3 8 sr CL
m.
0fi
4
H
U)
+
('M D
.-
-,
E'
.4 ) f )4
H1 -i -I
0 0I e 000
4) 0
HI, 1.4
ZEI
0
I
l
0
0)
cn
.4
V
.4.40
>*>H
0o
H
ah_ ' eo 000 ('1-I X). .e044 ID
$4 iw
,¢
@NrPk_X O .-1fl _
f 4'0 04 o-0. 4,. O ('444.4
OH
-H
LI A.
4
Io co%l_ H
+
- E
-
.
.40 HZ W L s s 1 .a 00 x 4to0 1 0 ONO !0g 0 V.400i Q t0 0 a @ t *4. 4.4H 001 440LA 014 X $ O ,0.1.. 00 0Z W4 . 40 Z 00.41'. 0 '41N0a004 0$4U .4.0 Z0000
..0
.0
H vel 0
Ui)
4
4
00 4 .00 1 0'44 0H @ ' H0 I U 0U u V.4 H -I.I
V
_. N
00I XI00 000000I
1.4
0
H
0I~Pk
,
O ..
0
0.4-40" 0 00 4) t 0 to
r.
e00 00.
0
$4 0' 0Paa_a0 NID-E' 0 -0 104.4
14.4
r.
to
tn to 4i
01-
u}
ul
rl
tn $
L 0 U) 00 4,1 0OD -..-.1.0.C-00O--I
I
ul 0
0
0
H
U oZ 0r m000 ZOO .> 0. Z
O=e
e
-.-.40
('1
ocoo + -v 0
4, 0'4 )co
0.-
0
lflZO
co o 00uo oo tn fn m
H
4.
44 VI-
VI
40
HN
0
n
-0
GI 41.4
--------ri
-'4
-0
>.z
4.41
z
0 4 00 0
*'4. 1t
U
.-
o>
u Q0.4> V 4'4 00 4r
-
u
x v) H O 4J >
'
0' 44
0 0 0000000 a0 0 0 a
0
0
8
iI
0 H
*,4
!
Ez
° es, ,,>o 4,f 4, 4, 4, 4, 4, ,4 -H 0 I o , Li z 0, uf 4,
l. 0 0 >. 0 0 -000 0 4, 4.4 >0 0. 1.4 0 0 0
a 03 fe v)
-4
M-IH4
$4 >4 0
0 .44
0
0
>4 4,4, 4, 4,
f ---.-.-.-~----
0.0
.
1. 0
NeS
41
>4
o '4
04.4 0
A;
e I~ ~ ~ ~ ~
2244 Nucleic Acids Research, Vol. 18, Supplement 11
0
0
-
4)
-
en N (I co N ai Ln V D _ O O
IV
NDeN
kD
-
.0
r- Ntn
r,
1
Un U1 0
co ON
en as tL %O 0-00 .-
0 %
Ai lh
bE-4 E-4 E-4E4E4E
E-
X41 li
-
-
I.
N
lw
w
IV IV IV
lr
lr
ML A
~
~
~
I
L
O
0
Xn
I
1 0
x
0
o 4 o 0 44044
6
I8
00~~0 tv0
$4 4 4 2OJ 2OJ 0J
4
4
-,
-1
-1
H -m -m
-1
-1
-m -m
-m -m -H
-1
-1
-m -m -H
4
-m -m
-m -H
0 0
0
0~~
4 C 4 - -
-
-
0
4 4 A 4 4 4 A
.
-- -- -- - -- - - --
-
00 t-OHHX www X x x F ss§vUUUU
I0%M i -00-4XX
O O O W O O O W O O HH O SW O W O W) O HO W O O W O W O O $4 O O H4 O O $4HH O W O HH O W O W O O HH iWO SWO O O HO WO O WO HWO 4O HH
.
AJ
n:::x
$J
i i
o
1.H -.
-H .
ON
(a N N.-
0A~
H
,1
_%
v X
uyO
_1
a-
-
o
o:N
v2 Q0
i ~~ . U~~~~~~~ -H
4.
4i 4J
4J
4J 4J 4. 4.
$4J
NL
N v N
-M
-H
P P
tl
r- oL
N r_
en r1
O 00 _ 0r N 0%- -_ 0
UI
4
N
sr o
Om 0n
rn LM
n ~ _ o
0
M
A0
en
MMMr-0%fn Utu v u
N LA MrN MM 0 r- MM M .-_L
tacuut)tS v C 4 z
-0 II
.
tn L IO
00
UCJ
0r U pi-0% .
000
r
-
-H
t)
ON Pk < z P)F EEX8X
s
0 U - IN
'
5>S>SH 000000U 11 0 0>4.1041000 a.4-.4-1.4-.4-.40 0 0 000U0 000000000000000000000000000000 J J 4 JU 4-.-.4.4..4.I4-.4-.-.0 0 000 0 0 0 0.. 0.0.0..0.0.0.0.-'.S~ 0 0 Z0 -
-
l
4SUUUU4
r.
00~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
'rn
0 14-
00.1 A4t.4j onrAl(44.' 8Ak VVV
ouuuu 0 uuuuuuuu ~~~ ~~~~~~~~~~8888ooooo0.0.%
1I
00-
1I
00-
MW
0D
1.N
0D
1.N
0D
('lcalU) O00-
14141414k
zzP4
1I
00-
1I
I
2246 Nucleic Acids Research, Vol. 18, Supplement 4) ;I
4.)4 N (O m korL 7)-tAN
oLor-0)... N
0-
CX
-ZZ-z
10 0
00W.0 0 1.17 e
0'n
0
4)
004.) 4
0.-0000
xxx
wx
----
0
0c0IV - 0 0 OLAco co coN e 0 IV0 N N
L
-
0 0
xx
0.-0.-00
.L4~A LDc4) N N w0 0
A~~
OO Nc p44) .- 0 0.-
xx
0
U)T
4
0
-
4)44 -00 >>L)0>14-
.0.00.E-O4)z.
0
0 M
4
)4)4
r
)4 )4
)4
)
F) c
HH
c
It (
4.-44)
-
C
0 0u0 00000 it)44I) x~HWwzx
0 0*-H 4)000.0 .0 4 0C z zOaOZ> > >OOo).)
ILOE-
-H-
-
0a
.0.0
-
000.-UU-.--
-
ONO
>
.-;-0-0000 x xZXX>X
~OOO
0a)~ SW)-tt -4(aIV4)4)-a--4)40) ----------. *4)4---)4)----4)-q-q 0 H0 04)40004-)-)--I.HO4 . O00 4111.01100 0 w 0 c -4 -4
A).4
0-L LLA -- N
-----.--.----
0
-
)
L -r-r o o r-( r-4) N o or- r LwinN-
.-00,>x>xxxX XP)Zxz
>
-----
00. 00
0 00 00
4.1) H4 fO4)
.H-
NO4 NL NIO .--
~~~~~0
0
t)
co LA- Lnco LAa) w lN 0-ON; t %L OlOr)L
~~~~~~~-
0.0
*
LDolO 10)01010o%O
N
H1-H 0 >
00 00-H-H-H-HI0
ma4X
0. 0 0.4.)
0 11-
4)4)4)4
)4
0.0.
-14
).0
4444
1.41144)4)).)4
4
4)
88::
-H
88
44444
1
4004)
4)
s
t
tn in in
444) 4.) 4) 4)-H tn-H VHV4 4.14414 4) 4) -H -H -H
-H00 4) -HE 4) t4) -Hr4)-Hla
0 0 4) 0 4.4) 4) 4)
0 0 0 4)
4)4)4)4)~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ '3'
4)
U
01.0~~~~~~~~~~~~~~~~~~~~~~~~0 11111111 4)>.>.>.>.>.>.0
.0
4)~~~~4
0
0'00 0 0'>.
.4 . 4-I4)0 0.0.0.0-04)0
.1111 -1V tot 0-4 114)S4)14)4)4)4) 111111 .0k4. xi E-.
4a
------.41------0
VV-----------0
-
0 .H 440 0 0 0 4)0 0 114) 4) 0
0
W
4))4)
0
)
>>.>0.>.>>.444444N 000N
.0000004I.. 0.0 0.0 0.00001
>,,--I
0
0
4C
)0000 ) .
-H-H -H0 0 -HIo000).
4uInl4)
1.1 a aa. 2 00000000C0I00 ICA ALSW
$w A
A
L
0
000
0.
0 4.)
am
0-
-H .0
00
4L4
.0 0'
1-4e $40$ OOlA0n
7asN
OLA
4)0
4444-
4.1.
CO
04)c
UNO-r-
4.)
4.)
0000
4.)4.4444
)(nl H Cn
.00: V0 -H
0-HU
-
0J4)4
4)
4
4)
C-fl
4)
co r- 4)>r- o ~Cn t- LA-ID4 N . 1LAO -M r-ILAO r0NN0. r N N 000r N H-H .- -H .. H -H -H 4) 4)) 4)) 0 toto4to H Le11011011111$ ID 01-
0 >
4).
>0 -HH
S4
4V J4).. cn -HEnE
4)4
4) 4
00u0>
.4)4)4)4 111111)
04
LA LA LA LA
O P14 -i
SW
101 4.-L
HN 4.
-H
SW 4)
NNUt
LI
ELALAE-H
4)
.-
-H4 XI X~ -H
4) 4.)4.4.1 4.14.1 mLALALALA LAU
ALALAc
.H r-
*
04)114114)*H414)44 I
4)4)000H I
4.104).-Int
0110 -H'H41. O
1
00000014
u U.o
i nL ,q
0
I.: 4)
.-.
I
4)1
Jr S
-Ir.f (
,
~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~H4.)
4.4
4
H
222
4))).4i.
~HH *H4)00 -H-H H -H -U) .) 4I4) .o 4) U o0 . . > >>j .0
o4H 4)4)' 04.H4)0t 0u. -4001144-H-1mm'4Min0000.111 .H 0to).-4)H
H 4.) 4 4114444)00.0 44 4
$4 4) n
V~ 4)
044)0
)-
:
n4 C U nI 4 4>
('U Ij-H44 dA H- 0~
N
u LA N
UU U
M
0 W U
a :: 5;00-N' 4 $ .q 0A4 ON N N N
UUUU
la
0gIH
4)4)
j
U
.1
r
H4 )u VIN o- : c: o
NNNNN
U U
UU
iE
1
N
n N
V
0 r
*H-H4-'4o OV S
(a0
>>>> o )4 )4 a4 o00$ 4I04 NSWL W 4NNNN4%4Z )Z J-
U UU U) UU
,I.
I
UU I
Nucleic Acids Research, Vol. 18,
Supplement
2247
Table 2. Number of formatted disks to be sent in order to obtain a copy of the database. Disk type Size
Capacity
Format 1
5.25" 5.25"
DSDD (360 Kbytes) DSHD (1.2 Mbytes) DSDD (720 Kbytes) DSHD (1.44 Mbytes)
15 5 8 4
3.5" 3.5"
Database format (see text) Format 3 Format 2 4 2 2 1
2 1 1 1
Footnotes to Table 1 The same number, and the a) This number corresponds with the literature reference and is preceded by an asterisk if the sequence is printed in the alignment. case character, are attributed lower a different followed numbers, Identical each by on the follow or page. alignment sequence initials of the species name, precede and Tetrahymena borealis). canadensis same the that have Tetrahymena same the sequence (e.g. species, to srRNAs from related species, or from different strains of In such cases only one sequence is listed in our computer file. b) This column contains the following data, if specified by the authors: - Strain name for laboratory animals, (cultivated) variety for plants, culture collection and strain number in the case of microorganisms. - Tissue from which the DNA used for cloning or amplification was extracted in the case of differentiated organisms. - Ribosomal RNA operon to which belongs the cloned srRNA gene in the case of bacteria. to 20), 279 c) The taxonomic position is described according to the following references: 277 for the metazoa (No. 1 to 14), 278 for the higher plants (No.to15Wetzel (281) is according described 34-62) of the (No. The taxonomic 28 to protista position 33). for the algae (No. 21 to 27), 280 for the fungi (No. for the species numbered 34-47 and 49-56, but according to Corliss (282) for Prorocentrum micans (No. 48) and for Euglenozoa, Microsporida and Polymastigotes to Stackebrandt et al. (284) for the Proteobacteria, (57-62). The archaebacteria are classified according to Woese (283). The classification of the eubacteria is according no information yet on the taxonomic position We have taxa. the for to Woese remaining and (283) the for al. Firmicuta, et according (285) according to Wayne of species 218-221. Taxon designations corresponding to an established taxonomic level are followed by the abbreviation Ph. (phylum), Sph. (subphylum), Cl. (class), 0. (order). by comparison with structures from d) The srRNA termini are located experimentally (e.g. by Si nuclease mapping) by some authors, but more often deduced denotes uncertainty on the location the length mark following A is listed. variant of the the question longest of related species. In case length heterogeneity length of the sequenced area. No length the length and gives has been determined partially the that means sequence of the termini. A number enclosed in brackets is mentioned for sequences not yet accessible in sequence libraries at the time of writing. The accession number for a sequence is the same in both libraries but there can e) Accession number in the EMBL and Genbank nucleotide sequence libraries.one. other in the arrives to one submitted a library be a delay before sequence f) PCR: the DNA was amplified by the polymerase chain reaction. RT: the srRNA was sequenced by the dideoxynucleotide method using reverse transcriptase. method. In the remaining cases DNA was amplified by cloning and sequencing was performed in most cases by the dideoxynucleotide sequence. this in the be found reporting can modification paper nucleoside on data g) Complete h) Partial data on modified nucleosides are mentioned in this paper or other papers cited therein. and has been corrected in (105). i) Complete data on nucleoside modification can be found in (276), but the sequence listed there misses I innucleotide are interspersed with genes coding for the rDNA j) Chlamydomonas reinhardtii mitochondrial srRNA consists of a set of 4 discontinuous fragments, which with alignment positions 239 to 312, 911 correspond the fragments between The and interruptions RNA subunit proteins. ribosomal fragments, tRNAs, large 2811. to 2643 to 930, and k) In the mature RNA, the nucleotides in alignment positions 431 to 618 are deleted by processing.
Nucleic Acids Research, Vol. 18,
2248
Supplement
40
30
20
10
.
.
.
.
.
.
.
.
.
.
.
.
I
-
-
-
50
60
.
--c
Hs
11 Ec 12 Tm 13 Dm 16 Le 19 At
-
-
-
20 Zp 22 Vc 23 Cv 24 Ne 27 Sc 30 Pc 33 Pp 51 Pf 52 Pf 53 P1 56 Ng 59 Ld 62 Gl
63 68 72 73 75
Hc Ms Tc Ta Po
95 96 100 101 102 105 153 173 188 189 201 203 207 208
Ng Pa Ap Ap
222 223 225 228 229 230 231
Zm Os
234 241 243 247 248 253 255 256 258 260 261 262 269 270
Hs Rn Rc
Cv Ec
Mg
Oh Me Mb Sa Si Tt Dr
-
-
Gmf
Cm Ce Ce Cv
-6
Sp
P1 Gm Cr Sc Sc An
Sp
Pa G U G A U G U U A U U G U A A U U U A Lt Ls
I lu
A U
U U A A U U U A A U U U A U A A U A U C A U U A A U U G A U A A A G U G G U G
~~~~I IIIIIIIII 4u bu bu
Nucleic Acids Research, Vol. 18, Supplement 2249 70
I
.
.
.
.
80
I
.
.
.
.
.
.
.
-
.
I
.
90 .
.
.
.
I
.
.
.
.
100
I
.
.
.
.
.
.
.
.
.
.
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
120
110 .
.
-
I
.
.
.
I
.
.
.
.
.
I. . .. U- G A U- G A U - G A U - G A U - G AU U - G A U - G A U - G A U- G A U- G A U - G A U - G A U- G A U - G A U - G A U - G A U-GALiA U - G A U- G A j C - G A U
--U A C --U A C --U A C --U C C --A U U - - U A C - - U A C - - U A C - - U A C - - A A C - - U A C --A A C --U A C - - U A C --A A C - - A A C - - A A C - - U A C - - G A U --C A U
-1
Hs
Oc Ec Tm Dm Le At
Zp Vc Cv Ne Sc Pc
Pp Pf Pf P1 Ng Ld Gl
1 7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62
2 ,.........
- - - - - - - -
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
* -_
-
__
-
__
-
-
-
-
-~~~~~~ ~ ~ ~
- - - - - - - - - - - - - - - - - - - - - - -
-
-
A A A A A
U U U U U U C U C U
10 10
1*~
GA
U
........H
-
Ng 95 Pa 96 Ap 100 Ap 101 Cv 102 Ec 105 Mg 153 Oh 173 Me 188 Mb 189 Sa201 SI 203 Tt207 Dr 208
-_
__-
-_
_-
-_
_-
_---
-U
-CUCAU UCUCAU UCUCAU ACC U A U ACCUAU UUUCAU GCCUGC
-~~~~_____________--GC -~~~~~___________--AC .--------------AUCAUACUC--AAAAUA---A .--------------------AAAUU---A ------------- --- - - - ------- - -AAAU U U AUAA .---------------AGUAAA---AAAUUUAUAA ___A _U _- _---GGUAUAGAGUAAUAUCCCCAUUUCCCUAUACUAUGAAAAAUUU--U ------- ---- - ---- - ----------- -AUAAAAA UAU AU AU AAU- UUAAAUAAGGCGUGGCGCAAGCUCGCCUUUACUCACUUUAACCAAAGUU _-
_-
_-
_-
_-
_-
_-
_-
_-
_-
I 70
80
_-
_-
9 90
_-
_-
_-
100
2
G GA .GGAG& G GGA G A U UU G A U GA . 4 U G G A( . i U A G A GA.tY
.________________-------AAU .-----------------------AA------------------------CAA
_-
75
- - - - - - -
-
-
63 68 72 73
- - - - - -
-_
-_-
Ms Tc Ta PO
- 1.
-1-
-
:..
U - G A U-GA.tf. U - G A U . C - G A il:. U - G A U.:
U GA U G AACAUAA G AACUGA A U - G A A A AU UG A A U - G A A AAU U G A A GJU - G A .~~~~~~~~~~~~~~~~~~~~UGU XACUGAA U - G A - - - - - - - - - - - - - - A A A U U G A A U - G A - U U C U G A U - G A - - - - - - - - - - - - - - - - - - A U A C G - A A U - GA - - - - - - - U A U C A U G G A tU - G A GUUUGGAA G A JU -GAUA U -G AI UCACC G G A -_ - - - - - - - - A U UCAC G G AG&%-1JU- GAUA - - - - - - - - - - - - - - - - U U G --UUGGAG GA U-GU UUAUGGAGA U - GAUA
- - - - - - - - - - - - -- - - - - - - - - - - - - - - - - - - - - - - - -
___
-
110
U U U U U U U U U U U U
C G A ieZm 222 CGAUC Os 223 CGA Gm 225 - G A .[OCm 228 : Ce 229 - G A C G AU Ce 230 - G A Cv 231
-
G G G G G G
G G - C A I- C A -
Hs 234 Rn 241 Rc 243 Sp 247 Pl 248
G G
GGU. G G G G
G A Gm 253 G Cr 255 Sc 256 A A Sc 258 A !tG~An 260 A Sp 261 G Pa 262 A - - Lt 269 A - - Ls 270
120
2250 Nucleic Acids Research, Vol. 18, Supplement 140
130
. . . I. . -
*
~
VC
U CA A A
....
U
....C....
~
~
~
..::..:......
-..... U C A A A G - ....*C U C A A A G - 0.U*: U C A A A G - UC....... U C A A A G - U$JCX.U C A A A G - U:.vC U C A A A G - C~Q.) CU C A A A G CU C A A A G - U 8:.U
- - -
AU A U A U A U A U A U A U
U U U U U U U
l.::.
.%
........
ICC A hOC A U:.UcccAG A U..A AU GU(A A A A A A 0CC A U:4 AUG U G U A A VA)QA A AG C£C A . C A U G U G U A AG .( ...A A h,
U CA A A G - - A U U A A
-...... C1 120TpXQ.¢g4Um U 20 Zp 531Hs
180
I
**
A U.C A U G U C U A A Q V AC - - U CA A AAGG -- -- AA UU UU AA AA G.t GCC A U.C A U G U C U A A :UAC --U CA A AAG G --A U U A A ~..C.. A.. . .A.. . A .V.:.. ... .....(t2. -....---...-.. V$.. ..-
FGUC
-
170
160
150
A A G ¢CCt A U t A U G U G U C(A$ GA A. A A A .GCCA.. A U..G.C A U G U C U ...-..-.A A 6 £. $ A ".. C A U G U C U A A AVA AA 24 CU CA A A 0 CC A U GC A U G U C U A A AU A A A 7 Ne k ,, AAC..A...gi; C - - A U U A A G C C A G A U G U G U A A ..G: .....A.. 27 Sc .....,, V......C .CVQ W. * C i 30 Pc CUGAGUA.U ................C..... - - A U U A A G CC A U 4 C A U G U C U A A A U.AA. CAUUg 33Pp ..GC.A....i...-.U .GC - U* C*.tgCU. A A A G A U U A A 0C CA.. Gi A U .CAA U G G ...A..A .......) - .U.VC U C A A A 51 AVQCU 52pf U.U...C.... ......... .C - hG.. C.U C A C A G - - A U U A A 0.C.C A .. ...C A A G U G A A A G ...UVA AU - U.UC U A A A A U C A A A G - - C C U A A 0...CCA U 4:C A A A U G U A A G.AU@)A A U - u:&WA - Kg7U. U C A A G G - - A C U U A GCCCA "... A U G C C U C A Q.AAt 62 Gi t..U A C J.. CG.t.. UC - Y.B..t CCCU C A A G G - - A C G A A .C.AU... - - A U G C C A AG .
.
.~UCI~C .. ... .
U..CA..U.A.....C.
- -
- - - -
..
Pf:X. G..CAS U..CAV ..
...
.........
..
A6 GS CAUGCCUCAr ......
2 -1' 63 Hc 68 Ms
3
4-
A A A "...g :R.. GG -- -- AA CU UU AAU AA |;...................... A U:.ta..u...t ¢t.............................
A ...G..g.. A. ..C....A m.. G
:.
..
63 2c 0 ........ 75 Po ...C....U
MsI-
G
68
....
40
30
I. 95 Ng 96 Pa
A
AP
A
102 Cv
CGGCUA
Ec
A
CA
173 Oh
A
1 88
Me
A
189 Mb
A
201
Sa
I
.
.
203 Si
A
208
A
Dr
*
.
.
.A
...4
50
70
60 .
.
.
.
.
.
.
G G C C U U C A A A A A C.. ..G.C
U
C U
A A
.
.
.
.
.
.
.
.
4 A G C - U U U A C
GA
.£A
A
TI0C
A
A
A
C
GUCC~~A
ACG
A A
U
AUGAA
........... C....... G.. G.
C 4. V...O...C U A A
G
A
207 Tt
.
- - - - C U A A GU.C.8g...G A.G A.G C G G C AC..A. A U...GC.£ A A GCCGCAAC...C A A G..)...,C..G A CG AG C.G >..C. Q.. 4C..:XB....S..£ - - - - C U A A C...A...:C A V..O...C.
A
W
.
.....
A
mg
153
. . .
A
130 Ap AUG
...... .....
*............:.: :. . . . ....................
C
A A G
A
C G G
A
C G
A
G
..UC
UGC..............L
,
2 -1' -
A
A
AG
CG0AC
GC...CU.. ...........'
3
4
6
5 .e.z...@@@@@@*. *. ^........
222
Z7m
223 Os
GCGCA:XXUCBc
G
225 Gm
G .G
228 Cm 229 Ce 230 Ce 231
G
. ..C....... .C..C.. ...
Cv
U U A A U U A A C U U A U U A A U U A A ---- U U A A - - - - U U A A
-
-
-
-
...A
A
..A.U
A
U
..C
.C
G
A
A
A
A
C
A
A
A
A
C G G:
AC4:: :: U:A
A A
'.:
C b...u SX.X...A...s. A - - -.............-.U U.... U:U A fi.GU A A AC - --- U U A - C A C A U.; GC A A G C
234 Hs 241
- - - - - - - - -
Rn
243 Rc 247
Sp
248
P1
C
AC.AS...).V.4)G
CCU
253 Gm
A
255 Cr 256
- A U GC-........... -... A W...W.X AGUG ~~~~~U -A A G C A A-:::-. A U U AU CAA.... ---UGC-C A G:...(.ii .X..A... ---- U UA A C.A.CA UGC.£A A GUCG A A C G U
ACUC CC
C C 0.AGCG.s...V
A
Sc
G
A
2585Sc
G
OVUCA
260 An
U
261
Sp
G
262
Pa
U
269 Lt
U
270 Ls
U
- - G U U U A
V.AC
A V...C. G A G X X X X X X X X X X X X
............ .............. --------. ........ UCCAAGUA CAA C AA UU4C A'CGC -- ------ --- AU UUU GG AAAAC "AC A A CC G... U CA G ACUA(A) GAG AA" CAA AXU A V U:1 AUAC~VUU+ -G C GU AA- AA VC .........A..A A GAU - - - - U U U, , ., A,. ,., .,A,., 0 A UCU. ..,,,...,.,,,..,.,.,.,,.,.,,,....,.,....,.,,..,,.,,,,,,,,, A- A - - -U - A- A-UC- -............... A ............. UC -A
130
140
150
160
170
180
~.
Nucleic Acids Research, Vol. 18, Supplement 2251
190
.
.
.
I
.
.
.
.
.
I.
200 .
.
.
I
.
.
.
.
220
210
I *** I . -
.
.
.
I
.
.
.
.
I
.
.
.
.I
-
-
-
..1j.. ,^ ,.,0
_
0..W
_
U - C-(C)A-*--U--A
230 . -
.
.
- - - - - - - - - - - - - - - - - - - --
.I
.
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - ---
.
-
-
.
.I
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - ---
240 .
.
- - - - - - - - - - - - - - - - - - - --
.
.I
.
.
.
.I
G G Hs G G Oc G C Ec A A Tm A C Dm A A Le A A At A U Zp U G Vc U G Cv - - - C U G Ne - - - - A A Sc - - - C A A Pc AG A G C A A Pp - - - U A U Pf - G U A A G Pf - - - U A U P1 - - U U C G Ng - C A C U G Ld - -CCC-Gl
1 7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62
G Hc A Ms - Tc A Ta A Po
63 68 72 73 75
G C A C G C A C UA C A U - U A C - - A C - - C A - - C G - - C U - - A C - - - C
- - - - - - - - - - - - - - - - - --
6
-_
-_ -_ -_ -_
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
-
_-
U G C G C
U C U U G
A A U A G
80
I
.
.
.
.
I
. .
-CI u UG -CLuPUG -CLuPUG -CLuPUG -UL uJCG C -CL uPUG -G(c"AA -G(c"AA -CI uPUG uJCG uJCG -U' -ULuuJCG PU-ULu CG
Ng Pa Ap Ap Cv Ec Mg Oh Me Mb Sa SI Tt Dr
95 96 100 101 102 105 153 173 188 189 201 203 207 208
Zm Os Gm Cm Ce -GCUU Ce - - C U U Cv
222 223 225 228 229 230 231
A U C C Hs - - - - Rn A G - - Rc C A - - Sp C A -- - PI G U U G Gm A A - - Cr A U A A Sc A U A A Sc U U A - An U U A G Sp U U A.A Pa
234 241 243 247 248 253 255 256 258 260 261
-
-UI
6
- - - G G - - - G G - - - G G - G C A A - G C A A
..
...
.......,.;
U. ,.,u UUUU...... U -. U A AUUfi.U.G8G --__
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
- - -
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
..i............G.. G G A G &M..GuG G C A G A A G G A A A A G A G G C ............. X X X X*X X X X X XX.X X X X X X X X X X X X X X X X X X X X X X X
. -_-__-__ _-_-___ _ _ ___-_ - - - - - - - - - - - - - - - C - - - - - - - - - - - - - - - C - - - - - - - - - - - - - - - C
.C~~~~~~~~~~~~~ -P1-....................
X X X - X X X -
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
UU: -A' -:z - ii
--
:
:
...w.. w.....
C G G
. ---------------...---...---...---A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A.A C A A U A C A4*.OU.U ..,......M.." _ Q..A.Aik UG C A GJ.AA G U U A U U C U U U C .I U U C U G U A A.C U G G 262 -- - A .U__________________- - - - - - - - - - - - - - - - - - - - - - - - - - - - U U U U Ls 270 ......... .... u^.....
..
....
...
......
190
200
210
220
230
240
2252 Nucleic Acids Research, Vol. 18, Supplement 250
I
.
1 Hs 70c 11 Ec 12 Tm
13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62
.
.
.
260
I
.
.
.
.
.
.
.
.
270 .
.
.
.
.
I
.
.
.
.
I
280 .
.
.
.
.
.
290
I
.
.
.
.
I
.
.
.
300
I
.
.
.
.
.
.
.
.
.
.
.
CCG-C C G -
-
CCU-C_C C_C
Dm Le At Zp
-
Vc
Cv Ne Sc Pc Pp Pf Pf P1 Ng Ld GI
C C
-
-
-
-
-
_
_
_
_
_
_
-
_
_
_
_
_
___
_
_
_
_
_
G-----_____ GU
-
-
_
_
_
_
_
_
_
_
_
_- ______GU.U CA--------__________
__-_
C
_
_
_
_
_
_
_
_
U A
6' 63 68 72 73 75
95 96 100 101 102 105 153 173 188 189 201
Hc Ms Tc Ta Po C G -_
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
-_
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
-_
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
-_
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
-- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- -- --C------
Ng Pa Ap Ap Cv Ec Mg Oh Me Mb Sa
----
Cu
-_
_-
-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-_-
-
--
- --
-
------
A-A A ~
~
~
~
-A
_________________________________--____--__________
203 Si 207 Tt 208 Dr
A-
6'
222 223 225 228 229 230 231
234 241 243 247 248 253 255 256 258 260 261 262 269 270
Zm
Os Gm
U
-
-_
_-
_-
_-
_-
_-
_-
_-
_-
U
--
-_
_-
_-
_-
_-
_-
_-
_-
_-
__- _- _-
U- - -
-_
_-
_-
_-
_-
_-
_-
_-
_-
__- _- _-
-_
_-
_-
_-
_-
_-
_-
_-
_-
__- _- _-
Cm Ce Ce AGG .-----Cv G-.-.----_______
Hs Rn Rc
Sp P1 Gm Cr Sc Sc An
CC -_
-_
---_-
__- _- _-
>
.>........
------------------------uA,i:h-
-_
_-
_-
-_
_-
_-
_-
-_
_-
_-
-_
_-
-_
_-
_-
-_
_-
-_
_-
_-
-_
_-
-_
_-
-_
_- _-
CA ...C
.U
...... ...
...
Sp Pa G G At4. A A A U UU __________.. Lt _Ls -_
250
UUAUU U U C U C C U C U A WuV A.U AA G G
U
UAU
C
UGG G U
-_
_-
_-
--------------------------------------
-_
_-
_-
--------------------------------------
260
270
280
290
300
~U
Nucleic Acids Research, Vol. 18, Supplement 2253 310 .
.
.
.
.
I
320 .
.
*I
.
- - - - G4fAiC AG U GA AA - - - - GAX&C AG U GA AA ---1,)I~ AA ~ A AA t" A A A ---- U U(A)A A A A A A
.
.
CU CU C C C C C C
.
G G G G G
.
I
330 .
.
.
I
.
.
.
C GA AUG G
.
AU CA U AU A U A U
C GAAUGG C C G A A UGG C G A A A G G C A A A A G G
A.g.C1,Jf>GUGAAACUGCGAAUGGC*J..CAU
- ~~~~ G U G A A A C U GOC G A A UG G CtCA U - ~frW~(G)AC(~)G U GA A A C U G C GA AUG G CU CA U
-. g.U
ACUGUGAAACUGCGAAUGGCUCAU
- UWJAUACUGUGAAACUGCGAAUGG G AU At.*fiA U G U G A A A C U G C G A A U G G A U _ Al,).ACUU U A U C G A A A C U G C G G A C G G ( A U - - U UU AC AG U G A A A C U G C GA AUG G CU C A U - - - - i*,CU(i.)U. .)GA A U C U G C GA A C G G C G
-1JAL>AUG.1*JA- GAAACUGCGAACGGC*JCAU
--A.U:&U(&)UU -GAAACUGCGAACGGCU AU W.*JUiAUA*x> *x*-GAAACUGCGAACGGG CAU A UoAoA U G AU UG U GG AA G G A U - - A* G C A G G- GA.UC U G C G C A U G GCg.>.A U - - ------A ~ C G G C G GA C G G
6'-
340
I. . . . I
.
.
350
..I
.
U AAAU UAAAU U A A A U U A A A U U A U A U U A A A U U A A A U UAAAU U AAAU U A A A U U A A A U U A U A U U A U A U C A U A C U A A A A U A A A A U A A A A U A U A A U A C A U G A C A A
100
110
.
G A A C G G
G GAC G A A AC G G GAOC G G GAC G G GAOC G GA AC G GA AC G A A AC G
.
.
A G U
QG
Om 13
LAu G.,AAC CVc 22
Cv .:A U %IJA C U Ne UiA U ..iGU A C CSc
23 24 27 AAA YVC U 30 UQt1A~ ACPc A...w....C(A{.Q)AW.GCi'p 33
-Pf 52 CA :7A AA.Ng 56 AA U C
C OiC('U
A A A A A
U-:Id
59 GI 62
- - - - - - -
- - - - - - ----
.
I
130 .
.
I
-
-
-
U A A U
A A A A A A
Hc Ms Tc Ta Po
63 68 72 73 75
140 .
.
.
.
.
.
.
I
.
.
.
A A C
U A A U U U U U U U
.
-
A'
A U AC G A U A CA AC G AC G
Ii.A
-- -
A
-- -
-
A-U A--
A UX
A G U A A C A
GA AC G G.LGAG U A A CA G A AC G GGUG A G U A A C A G GAOCG GG AG U AACOG G C AC G G GUG A G U A A C A
6'
~U U AA
~,G UA UC Le 16 4,GG A A CAt 19 GA CU Zp 20
120 .
G L*AG G GGAG G G. AG G AG G AAG G AG G AGAG G GA G
G A A CG G
I
8
G GA A AG CUAG U A A C GA A C U G AG U A A C G GAC G G ACAG U A A C GA A CA G CAG U A A C G GAC G G CAGAG U A A C .. . . . . . I I. I I. . I * .I
360
......... ..
C U C G Hs 1 uG U .CC UCG0Oc 7 A1U A C CEc 11 ~~~~ ACCT 12
7
90 .
I
.....
-
-
-
- -
Ng 95 Pa 96 Ap 100
Ap 101 Cv 102 Ec 105 Mg 153 Oh 173 Me 188 Mb 189 Sa 201 SI 203 Tt 207 Dr 208
8
7 .................
G G G G G G G
A A C G G GUA G U A A C G G G....G A G U G A C G G GU A G U GACG G C G U G A C G G GX$1G C G U G A C G G A G U G A C G G GG A G U
A A A A A A A
-U------- C C A G GA G U U C C A U A A A C C G *, U A - A CACA C C C AG - A U G A - - A A G C G A A C C A (. - -- - - - - - - - - - - - A U G A C A A A C C A W.U.G A - - A :::..~G:.:::A.:A..::~:.A A G U0 CG A A C GGGU CGUA A.A£GAACGGX......................... - - -.- - - - - - - - A A G U U G C - - - U A G W.U . A C C C ..; ..C..G AUA.U.G..G.G U A AA AG.U.A A A G1, A G GA C4A GG U A A G U A A GQ A GU A UUUUUU IJ U B A U J gAX4 A A G . G U A C A G GIG A G U A U U
-- - - - - - - - - - - - - - -
..
GUG~~~~~~ACGUGUGA
1,j
A A A A A A A
CG CG C 9 C d C C d
A-A -A --
C A
A
I
-
-
-- -
- C A C Cot A A U C CC
- - -
U AGAA U A A A A U GCG. - A l,IfA)C U U U U:A U U UA A A Q.U A A G U A U G:S
- -
A C G>UAAU-U 59LdEc-AAACU ... G A A A A~ - - - - - - - - - - - - - - - - - - - - - - - - - 24 C C G .js. U A A U - U C.a 27 UCA CU UU AA CC U . 622Gl..AA ACACC - - - - - - - - - - - - - - - - - - - - - - - - - U C A C UAG Sc - ---------------------------13~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~nA A C C G 1.. CAAC ~ ~ ~ ~ ~ U~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~.................... u}U. wUAAU - U :.u. l6Le-U A C-U A C U ..A. AA........A....A AU.. .-........GA. A C-U A C U C G~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ A4JA ACCG........................A GA... 2OZp G ACCG.................A.......A........C:.. C-U A C A C#~~~~~~~~~~~~~~~~~~~~~~~~~~ A A.....U.... - - - - - - - - - - C--U U A Pf - 30l9At--U - -_ U U U A A {.^.A.( C A A AA AA E:Ci f G...G A 22V c---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~.......................................... P1 A C~~~~~~~~~~~~~~~~~U A CC U~~~~~~~~~~~~~~~ 53~~~~~~~~~~~~~~~~~~ A A U U 23Cv--U CU A A A G.;|;..as88 ACC.......CG......L1A.:.....:.....U A U AA.-..........C A A-G:.C,i.^. U.8.I..Sl .....G A A C U U A C U C:4~~~~~~~~~~~~~~~~~~~~~~~~~~~A................GA.. CCCG.........................A A U A 5624Ne--U Pf - - - - - - - - - - - - - - - - - - - - - - - - - UX .AA Q..t CU4. G A A A - - .......... A.
Mg
Oh Me Mb Sa
-
-
-
Sl
Tt Dr
-
-
-
-
-
-
-
-
G(GA
A A G CU.
- _ _ _ _ _ _ _ _ _ _ _ tG4 gAC A(A) -_ _ _(A)A _ _ _ _ A AA AA C
G A A A - -
G A A A - C~ G A -A
9
222 223 225 228 229 230 231
Zm Os Gm Cm Ce Ce Cv
t'-GA.A
C A A
'G
G A A A
".GAA
A
G G 4
-------------
91 G A A A G A A
C: -A-: A' CG::
C A A
U
A
G A A A
`
V
G A G G~~9G)A
234 Hs 241 Rn 243 Rc 247 Sp 248 P1 253 Gm 255 Cr 256 Sc 258 Sc 260 An 261 Sp 262 Pa 269 Lt 270 Ls
-
CCK J. GA A A-GCGA
15()
95 96 100 101 102 105 153 173 188 189 201 203 207 208
..
.........
-
G
A
A
A
V--
-G &--
- - - - - - - - - - u~~~..u...........w....XW^AAU- : --
- - - - - - - - - - - - - - -- - - - - - - - - - - - -
-A
-
-l
lo 38(
A
C A A A A AAU ~CU
....CU A A A
UA
-
-
...^ U
............t...-..
- - - - - - U.u.c....1CU A A A U -A U ,..()UUAU------ ... - - - --AUAU -
-- - - - - - - - - - - - - --
- - - - -~~~~~~~~~~~~... -u. -" ........
3j90
400
410
420
~.U ~.
Nucleic Acids Research, Vol. 18, Supplement 2255
430
.
.
.
.
.
I
.
.
.
.
I
.
.
440
.
.
I
460
450 .
.
.
.
I
I
.
.
.
.
.
.
.
.
.
I
470 .
.
.
.
I
.
.
.
.
I
480 .
.
.
-C G A -
.
.
.
-
I -
Hs
7 Ec 11 Tm 12 Dm 13 --Le 16 - - At 19 Zp 20 - -22 Vc -Cv 23 - - Ne 24 - - Sc 27 30 Pc Pp 33 Pf 51 Pf 52 - - P1 53 - - A Ng 56 - - A Ld 59 --Gl 62 Oc
- --
-
- - C A C A:.A. - - - - - U A AA} - - - - U - - - - -
C A C A
-
-
u
C. Q
-
-
-
- _
-
-
-
-
-
-
iX
FAAA ..
c
-
C
A A A CQ~ --C A-u U1I.A
_ - U GA A A &
A - - -- U UU1.Q U3(C)ft - A AC . U
A A
-
A U
- - -
iUaC U G .i. G C _____.
A A G G ..C%.G G C G C
-..
.. ........
.
-
A AC- - - C CAA... AU.*.. ....>
~
I
-
- A - A
C:bI.. A A G~A
.
-
-
-
-
-
-
......... ..............
_9,0
C U A A A U A AU ACO C IG C U A A :-. G C U A A U(UC C C C
A U A
---
-
10 -
ElO-1
G G A ---
A C
A U A - - - G
A U A - - A U A - --
,U CU C.A U U
U AC G G A A U . U C A U U A C A AG C*CU G G A A C -
Hc Ms - - - Tc -- - Ta --Po - - -
U G G A A U -
- - -
-
-
-
63 68 72 73 75
.. ... ..... ..
170
180
..................
.*I**
.AAUCACc u
C GG A - - - - - - - - - - C C UGAA U G .-. G A A A - - - - - - - - - AUAG C A A - - - - - - - - - A U A - - - C G ..*.:. A C G - - - - - - - - - A U A - - - A C g G C A A - - - - - - - - - - A A AU U A A A U A GC A U AGA A U A - - - A. ACAA*A4+.t C A UjG CU U U U A U A C G U - A.. A U(A)C - - - - - - - C C A C G G . A U A --- G A W()C A U A --- C U C G C U U--------A U(A)C C G G A U A - - - C U QG C C..:U C G C - - - - - - - A U C C C C C A U G - - - U . AC * C U U G - - - A U(A). .U G A U G - -- U A U U U C G U G - - - -
C U A :...G C U A t K S C U A *.. C U A . . .G. *.SC U A QA¢ CU A A U
git(C:)i.U
C U A U C U A U CUA U.' C U A :(C)U A .G C U A
190
A U A A U A A U A
- - --
9,
..
G C U A A
1j()
U(k)CCC
)C U A A U(A)CC(.
..
A A
...
..
...........
:G CU A AU(
U ...FA U ...i4 A U A U A UA U
A A A A A
.G ....-...
_
A
A
G U A
- -
-
-
G
G U A
- -
-
-
G
-
-
)CA
U A
.A
U A
-
.-
- -
- - - - -- -
- -
- - ---
-
- - - - -
- - - - - - -
Ng 95 Pa 96 Ap 100 Ap 101 Cv 102 Ec 105 Mg 153 Oh 173 Me 188 Mb 189 Sa 201 S1 203 Tt 207 Dr 208
10
..........; ..:...,
C U A A
-
- - -- -
- - -
%
*.-
CU
A
G G G
............................. A exG. A~~~~~~~~~~~~~~.
GU A
A A A A A A A
G G G G G G G
G G G G G G A
-
-
-
- - -
- - - - -
-
-
.
- - - -
- -
- - - - - - - - - - - - - - - -- - - - - - -- -- - -- - -- - - - - - - - -- - - - --- -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A - - - - - - - - - - - - - - - - - - - - - - - -
.
-
_ . -_-_-_ _ _ _ -_-_-__ _ _ -_-_ .
- - - GC..U A A A A
-
Zm Os Gm Cm Ce Ce Cv
222 223 225 228 229 230 231
Hs 234 Rn 241 Rc 243 Sp 247 Pl 248 Gm 253 Cr 255
U A __--U::iAUA.U:.:.AA*A AAA U A A-A_ U A -~~~~~~~ A UAAA A -c - _n26 --UUAIJUA AA AA AU _58 A A A A G A A A---...................An 260
G U A A U A A A----
-
41ju
GA.AU..A.GUUA--~~~~~~~~~~--Lt 269 ~...U... .-.s2.0............. ::.Q."A A --.fi..
440
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
4!)U
460
470
480
Ls 270
2256 Nucleic Acids Research, Vol. 18, Supplement 490
I
.
1 Hs
7Oc 11 Ec 12 Tm 13 Dn 16 Le 19 At 20 Zp 22 Vc 23 Cv 24 Ne 27 Sc 30 Pc 33 Pp 51 Pf 52 Pf 53 P1 56 Ng 59 Ld 62 Gl
.
.
.
.
510
500
I
.
.
.
.
I
.
.
.
.
.
.
.
.
I
520 .
.
.
I
.
.
.
.
.
.
.
.
.
.
.
.
I
540
.
.
.
.
I
.
.
.
.
I
- U G C -A U GAU G U G U G.3 G A G,XU - U G - - A U ACC G C G C - - U U U C U G . A)A -C GC --u u GAA G A C O--UUW u UUA A - - - - - - - G G A - uGO - -AU ~AU - - - - - - - G G A ~A-C GO - -A U %A~ A - CG C - - A U C C A . G A - - - - - C U G AQGA -C GU - -A U UA A - - - - - C U Gi - - - - - C U Gi GAAQG A- C GU - -A U'LiA" .U~~~~~~~~~~~ A U U- - - - - - C G G O'..-A; A CC G U - _- _A - - - - U A U . _G_U _A_A G A ... i;:
__________--------
CX
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -_-_
_-_
_-_
_-
_
_-
_
_-
_
_-
-- -- ---
- - - - - - -
--
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
-_
-- -- -- --
- - -
-
- -
-
- - -
-
-
---
-
W-
*..
.u u It
-- -- -- -- - --
U ~'UA~
A U UAAuA. C U G
A GC-
A
VU A
A
... .....
... .... ..
U
G U G U
-
-A UUA -AUUUA
4) ~~-GAGGUAA Gu --A
C0A:$
-
E1-2
E1O-1 ' 63 68 72 73 75
530 .
Hc Ms Tc Ta Po
95 Ng 96 Pa 100 Ap 101 Ap 102 Cv 105 Ec 153 Mg 173 188 189 201
Oh Me Mb Sa
203 S1 207 Tt 208 Dr
222 223 225 228 229 230 231
Zm Os Gm Cm Ce Ce Cv
234 241 243 247 248 253 255 256 258 260 261 262 269 270
Hs Rn Rc
Sp
P1 Gm Cr Sc Sc An
Sp Pa Lt Ls
490
500
510
520
530
540
Nucleic Acids Research, Vol. 18, Supplement 2257 560
550
~
~
~
~
570
..
~
590
580
...
..
..
..
.
..
600
..AA....G .. ..
...~ .#. _________________________C__________________---------------........-..... 19...... ...
..
..
..
..
....
..
..
..
..E11
... C....C... ..A..... ..L e..1 ...
..
.....
. .. p. .2
.. .. .. U..
C G Q&C C
CU
UC---G A-------------------CCAu C
AAAG-~~
AUAG
,R
U C
-----
-
-----
G C
AAAG-~GC
A
GG
CA-
~UGACC
---G
Of5 T1 GU A G A G G A C 12 5 UCC G~~~~~~A~~~Ait1
R
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
P153
AG4
~~A U
-
-
-
-
-
-
-
-
-
d rxt- U C U G A U
-
A
U
L
A U
t
E10-2' -
-
A ~UGA(
ACU
E10-2
-
Pc
GA CCAA
-G AG-A -A-A
-
C u u
-
G A UC A
-AA-A
--------
-------
u CGuA-
--G
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
- -
Hc Ms Tc Ta Po
Ng 95
-_-_-_-_____----_________---------------------------------------------------------------------- - - - -- -- -- -- -- -- - -- -- -- - - - - -- -- -- - -- - - - --
-_____-----_ -__ ____ _______- -____ __ __ __ -_ -_ -_ ____ -__ __
Pa 96 Ap 100 Ap 101
Ec -_ Mg O ---------O-- -- -- -- -- -- -- -- -- -- --- - ---- ----Me - - - - - - - - - - - - - -- - - - -- - - - - - - -Mb - - Sa
- -- - - - - - - - - -- -- -- - -- - -- -- -- -- - -- - -- - -- - -- - - - - - - - - - - - -- - - - - - - - - -
-_
-
-
~
-
-
-
-
-
-
---- - --
-~
-
-
-
-
-
-- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - - -- - -
-
-
570
580
590
600
189 20 1
222 223 225 228 229 230 231
Hs 234 Rn 241 Rc 243 Sp 247
Pl
248
Sp
256 258 260 261 262 269 270
Pa Lt Ls
560
717 I188
Tt 207 Dr 208
Gm Cr Sc Sc An
-
105 1 53
SI 203
Zm Os Gm Cm Ce Ce Cv
-
550
63 68 72 73 75
253 255
2258 Nucleic Acids Research, Vol. 18, Supplement .
1 Hs 70c 11 Ec 12 Tm 13 Dm 16 Le 19 At 20 Zp 22 Vc 23 Cv 24 Ne 27 Sc 30 Pc 33 Pp 51 Pf 52 Pf 53 P1 56 Ng 59 Ld 62 GI
.
.
610
I
.
.
.
.
I
.
.
.
.
I
620 .
I
630 .
.
.
.
I
.
.
.
I
.
~.
640
.
.
.
I
.
.
.
.
.
I
650 .
.
.
.
I
.
V GAA~*JA-C-G G C C G A ~~JAC U ~ G C C G AI iU A
A C A U G C A G A A C G --GA G A CG A U G A A % ~~~~~-AX.---------CUCU(CG--GAA A -----------------CUC A C G --A A: C A~-----------------C -------
-------------C U. C G C G --GA -A C UA C AAC A _-A---cA~AA---------...4AA A U A A G AA A ~~VAVU GOA -CGA A UA A-AA A--G -------------C G -t~~~~~~~C C A G C GA A----U CA GAAA
A
-------A-----G
A|
- 10' -
E1-2
.
I
660 .
.
.
.I
.
.
.I
.
i---CCUCGA----G '----CAUCGU----G. ----CUCUGU----G ,---- U U CU U'---CACGGCG---A U t--UUGC U GU-----CU G C G U -C4 '----AAUAG---: --C UGC G U-C--UUUGCUU ,----CUUGUU----e *--C UUGUU-A----UGA------*----GA-U---
A UGA------AA---CUUCACG--AA..
A
.
----CUUCGU----Q '----UUCGC----A i---CAUCGU----Q --- C U U G U----
___U_____G-C---CG A ~G A&*~&A-----A C G C U G A A
.
11
..............
63 Hc 68Ms 72 Tc 73 Ta 75 Po
................ . ... ....
_ g-.
C. G G
-
............ S.. -..G . A- - V..t..A..#..Z.fi0C G U- AA AA AG..G. A A A
G
G.........
U U
.GtA.- C
t: i
200
....1 95 96 100 101 102 105 153 173 188 189 201 203 207 208
Pa Ap Ap Cv Ec Mg Oh Me Mb Sa
______
S1
253 255 256 258
-
-
-
-
-
-
.. ........
C U - C
...........................
210
1.... 1.
-
,
A...X
_A_-_" ....
-
.AA......--.A.....A. ...........
Hs Rn Rc
-
-
--U C A U G UCU C(> G- - - G 4 .. C A C.... -- C - - C G A A - - - - A..G...S C A UCt - U - :-:- C G A -- - - - C G A A - -
10'
234 241 243 247 248
-
-
-G&G~~~~~~~
Tt Dr
Zm Os. Gm Cm Ce Ce
-
A-.-.-...A.C.C - A CAGAGGG A AG --G CGA A-AAAGU C -CU C UUU U C G GA -----GG -AAC C - ~~~~A ~& - GA C C U U U U G G A A ~ ~ ~ ~ A~~CC-A A A - GA C C U U C G G G --AAA-GG U U C G-C C-A A A AG-...A..G.-. A C C U U C G G G -.......... A%P.A..AA.t.M..UU-AAA...AA.X------AACU-¢---e . C VWV.1i.) X i - A A A A(SA VA-G C A A - G C A U l.4.. *' C' AX''t't'AGCGAA'X''''z'^ G - A A AG G U-U U C G -
Ng
222 223 225 228 229 230 231
CCG---CCG----
C U ----UU C U -
__
UUUA--
11
.A
.t:- A .......... ------g.~~....... _---G
AAA--
-___--G -A AA GGA-P--AAA - -A--PG.CA
-A---_---G .C A - A A A GGGA-...... GA--______A,St.....S.. ....tA ..... -_---G AAA--_---G AAA-A AA - - -G__ GAAU ACU - -A---G ii:.UGA-AA A AU-..-......... -_-__- --G AAA- -
-A---G.U
Cv
-_ -_ -_
Sp P1
Gm Cr Sc Sc 260 An 261 Sp 262 Pa 269 Lt 270 Ls
__-
___-
-----------------AC CA---C A .____________----GC CUA--AC C U A .U----------------
-
-
-
-
-
-~~~~~~~~~~~~~~~~C ------------------- - -U:.AUA-AUA AAA GA U --:A -.-. - A CU UAA---~~......UCU - - - -- A U -i -j -:J A A-I -A ------U - -U --GUU-UG A A-U-----CCC---
U'UA ..U..A.U.A.A.. U A UA.U-C.U A--_ ____u....8.a........... A--_ AA AA GAAGG--...
-_-_- -A-
____-__-_-----------.......i.... ----UUG.t..GA.B AAA. -UA A U U U A-IA:A US::UG U-
..
--
U GAAA-GUCA- GA UAU A GAU---
-- -- -- -
_ _
..u....
A.. :W..:-:
_.
6 blU
620
630
640
650
660
Nucleic Acids Research, Vol. 18, Supplement 2259 670
680
690
700
710 .
A.......AA.....CC..U..A... A.....C.A.U.U....A. ~~~~UC..............C...........
........ Z~~~A C~~~
I
720 .
.
.
.
I
........
....................
-
C~A U U C
...AC.- C... -GU U
AA1U
4"4~~~AU U C AAA*
AU C.A ......U.A..UC..A...U ~ J~ C AA U 4Uf1IB .....r
--C
.......
UQU~~....A..)U UUUA
- ---
~~1B~iJ
A-
(A)
-tCA
C
CC
t* AU....
~*
G c c
-CC -CC
()U
14Ai4
C
12
I~~~~......... U:.~
A U A U GAACCG A A C....
'40
X
-CA G U -- Q(U U .C.G..U.A.....U A~AU A ~k G GAG C-..... UUUC CU QAG AGU WCU ~C.U(U)U G- C GGAJG-~- UGGG A U....CC.C..
-CG U CU C
95 96 100 101 102 105 153 173 188 189 201 203 207 208
Ld 59 Gl 62
8'
-CU 14 G C~
Ng Pa Ap Ap
P1
C
C
- -CU CGUGC U-U
63 68 72 73 75
Pf Pf
c
*1....
Hc Ms Tc Ta Po
Pp
--A C CUAUC --A C C UA1
11X
Ng
11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56
Zp
-C CUAC A
1
7
Vc Cv Ne Sc Pc
UA
u C AAUCci(A)4~IJUu
~
-C(~~~~ 4~~~~(A)(~~~~~ (AC (aT~~~C(G~~~~UC Af&t~c (u u ~~(A)ACAA~~~~~....... it~
Hs
Oc Ec Tm Dm Le At
- -
G
Cv Ec
Mg
-- -
CA U G U()CGU UCO - G CCt~~~~~- A AGG ~~A UG GA ~).G UCC-G c4++~~~'+ ~G A...U CG1 - ~ AAC G Ut C U +Uc
Oh Me Mb Sa Sl Tt Dr
-
-
12
AU AAG~AG U ~~~~A G C ACUUAAGA
Zm 222 Os 223 Gm 225 Cm 228 Ce 229 Ce 230 Cv 231
-~~~~G :U~. Tc 72 ......Ta73 _- --- G G. Po 75
._ -
..¢..... Hc.
___
- -
- - - U AAG
G". .G G U G G A A G.CC - U GA A G C - GA 4 G A A G A A %G~ - C.C G A A G A A W G 4 ArA AG A U G A G.A).S.. - AAGG A U G ;A-.-A--"~CC .... .........
-
-.4g:Tt
Nucleic Acids Research, Vol. 18, Supplement 2263
A-..-
-
G C C A - A - A U
-
A A A U G G - - U A AC - - - U A A
-_
-_
-_
-_
-_
-_
-_ -_ -_ -_ -_
-_ -_ -_ -_ -_
-_ -_ -_ -_ -_
-_ -_ -_ -_ -_
-_ -_ -_ -_ -_
-_ -_ -_ -_ -_
-
-
-
-
-
-
-
-
-
-
-
..
.....
..
......
-
-
920
930
Rc 243
Sp 247 P1 248
...C.U .. Gm
------ - - ---- - - -
253
255 ....ScrSc 256
U-1IQ...Cr 255
-U
S~ ~.3. . . .u. .t. . .Sc 2
- - - - - - - - - - -
G-- --
-
-_
-_
G_
-
-_
-_
-_
-_
-_
-_
-_
-_
_
- - -
A.UA.
940
An26 An 260
- .. 61......... _ - - - - - - - - - - -Pa ........ 262 A.Of--
-_
-_
UW
27
Lt 269 -.U-.UA--
-Ls 270 -A .UU ..........
v-iu
Hs 234 Rn 241
950
960
2264 Nucleic Acids Research, Vol. 18, Supplement
1 Hs ~~4$~A A *4LA AU 7 Oc A 11 Ec AL Tm 12 'C4&t IGU A Le
1020
1010
1000
990
980
970
--------G A A U G A G U C C A C U
---------4.
4G A A U G AG U C C A C U --------------------------------G A A U G AG U A C A C U C ~ --------------------------------G A A U G AG U A C A C Ul C '~ -
13
-G A
A U G AG U A C A
CA
*4
--------------------------------G A A U G AG U A C A A 4UA 4L-
23 Cv
.U UUUA
--------------------------------AA UG G GA GAWC-
33 Pp
A
52 Pf 4*4AAU-G A A U G A U A G G A A A A UG G G G ~VC
59 Ld
A
U
17'
1
u** UA .......v C G # -U~ A4-U3 C4
63 Hc 68 Ms 72 Tc
73 Ta
43040 GU AAA
95 Ng
"G U AAA g X-L*~ G U AAAGC
101 Ap 102 Cv
153 Mg
1730Oh 188 Me 189 Mb
2035S1 Tt 207 Dr 208
4A
4AAAQU~-U&A
A Au1 #AAAG*4CU* ~1AAAC~-J~
-----...UG----
-----
UAAAC----------CUC ...GGUAAA -
GJAAC---U
17'
-
------------
PI17-1
P7-1 l
A
AA
18
-----------------------------------------UUU A UGAAQUVGUAAAC U--------------------------------------------UL~*i -----------------------------------------UI1 G UAAA (U-----------------------------------------UU*U 4AAA 1CUAAACUCUU*
225 Gm 228 Cm 229 Ce 230 Ce 231 Cv
-
-
-
-
-
-
$
-
-
234Hs ~ ~ ~ 241 Rn.~~~......... c ..R ...4... ....
2347 Hsp
-
2481R G U A A A 253 Gm U 255CSp --V& Sc U -UA G G 256258 P1 S ~~I .~A c ~J U
--.
-
-
-----
-
C C
C-~ A C G G A
-. A A
G A A A
-k k
A A A
Ui~A~......A.U.A.A.A.. An U CACACUAACAGCGGAAA4..A Gi 260~~~~~~~~~~~ ~..... ..
261 Sp 262 Pa 269 Lt 270 Ls
~
:.--C U U A k ~ ~ 1 ------------------------------------A UC G AA U AA -. U)U. A U A
~A U-C G.
-
I
I
970
~~~~~~I ~ 980~ ~
990
1 000
1 010
1020
.
Nucleic Acids Research, Vol. 18, Supplement 2265 1030
I
.
.
.
.
.
.
I
.
.
.
.
.
I
.
.
.
.
I
.
.
.
.
I
.
.
.
I
.
.
I
.
.
.
.
I
.
.
.
.
I
.
.
.
.
I
.
.
.
-
Oc
------------------------ ----
7 11
12 13 16 19 20 .~~~~~~Vc22 . Cv 23 ._______-.- -Ne 24 -~~~~~~Sc27 Pc 30 _--.____ . Pp 33 - - - - Pf 51 - - - - . Pf 52 - - - - - - - - - - - Pi 53 .______---Ng 56 - -. Ld 59 ------------ Gl 62
-
AA U
-
-
AA U
Hs
-
Ec Tm Dm Le .----. _______--- -At .___ -Zp
-
-
I
.
-
A U
-
-
-
iA
U AU AA U AA ...A A C - A A C - -
-
-
-
-
-
-
-
-
_-----
-
-
-
-
-
.AA U - -
-
u - -
-
-
-
-
-
-
A A A - -
-
-
-
-
-
-
C- -
A A
.
- . -_______---
-
AA U
j
.
.
1080
1070
1060
1050
1040 .
.
-
-
-
-
-
-
-
-
-
-----
18-
CUA---G U GCU
-
- - - -
C U-----
.....
4.At$iAG G-
A..
.
-
~
.
.
.
.
I
-
-
-
460
450 .
Hc 63 Ms 68 . .__________-.------------------------ Tc 72 ------------------------------------Ta 73 - - - - - - - Po 75 .
-------------------------------------
......s....S A:
.
.
.
.
I
.
.
.
.
I
.
.
.
.
C U C U C U
C U U U
AA U - - - AAU ---A A U---A A U---U A A U---U AAU ---A G U G G A A----G G A A-U C A A - - - -
Ng Pa Ap Ap Cv Ec Mg
-
-
-
-
-
-
-
-
-
-
-
-
-
Oh
-
-
Me Mb
-
Sa Si
-
-
A---_____ U C G------
Tt Dr
-
95 96 100 101 102 105 153 173 188 189 201 203 207 208
18 G A A G A A G A A G A A G A A G A A G A A G A A GAAGAA GAAGAA G A A G A A
A-- -.-_ __ A - - - - . ____ G .-.----___
Zm 222 Os 223 Gm 225 Cm 228 Ce 229 Ce 230 Cv 231
U. .--___ U. .-___ G----______ G .-.---____ -
-
--_______
.--
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
_----------- Hs 234 -------- Rn 241 _ - - - - - - - - - - - - - - - - - Rc 243
247 .~~~~~~~~~~~~~~~~~~~~~~~~~~~~Sp
Sp 247 Pl 248
A
-.3.A...R
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -.~~~~~~~~~~~~~~~~~~~~~~~~~~~~P1 248 - - -- - - - - - - - - -- - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- - -- - - -- - - -
C UG A... G.C.G;o. - - - -
-! -m.Q -..(#.
iP.A U
A A A
A
U
U
A
A A
-:-:-C
-An 253 Cr 255
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
.~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L C C U Ls 270 AU .U U A 269
1030
10u40
1050
1 060
1070
1080
2266 Nucleic Acids Research, Vol. 18, Supplement
1 7 11 12 13 16 19 20 22
.
.
.
.
I
1110
1100
1090 .
.
.
.
.
.
.
.
.
I
.
.
.
.
.
.
.
.
1130
1120 .
.
.
.
.
.
.
.
I
.
.
.
.
I
.
. . .
1140
. . . .
. . . .
Hs
Oc Ec Tm Dm Le At Zp Vc 23 Cv 24 Ne 27 Sc 30 Pc 33 Pp 51 Pf 52 Pf 53 P1 56 Ng 59 Ld 62 Gl
63 Hc 68 Ms 72 Tc 73 Ta 75 Po
95 96 100 101 102 105 153 173 188 189 201 203 207 208
Ng Pa Ap Ap Cv Ec Mg Oh Me Mb Sa
S1 Tt Dr 18
222 223 225 228 229 230 231
18'
Zm Os Gm Cm Ce Ce Cv
234 Hs 241 Rn 243 Rc
- - - - - - - - - - - - - - - - - - - - -- - -- -- - -- - - - - - - - - - - - - - - - - - - -
- - - - - - - - - - - - - - - - - - - - - -
- - - - - - - - - - - - - - -
_ - - - - - - -
247 Sp
255 Cr
- - - - - - - - - - - - - - - - - - - - - -
256 Sc
A A A
261 Sp 261 SPa 262
- - - -
2
XU A A U GA C C A U A U A .." iA A U G A u U
A__________ _________
Pa
-
- - - - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
-
-
-
-
-
-
270 Ls
1 090
1100
1110
1120
1130
1140
-A
Nucleic Acids Research, Vol. 18, Supplement 2267
*1..
-
1150 .
I
1160 .
.
.
.
I
.
.
.
1170
.
.
-_
_-
_-
-_
_-
_-
-_
_-
_-
-_
_-
_-
-_
_-
_-
-_
_-
_-
-_
_-
_-
-_
_-
_-
-_
-
-_
.
_-
_-
_-
__-
-_
_-
-_
_-
_-
-_
_-
_-
-_
_-
_-
-_
_-
_-
-_
_-
_-
-_
_-
_-
.
.
.
.
.
.
I
1180 .
.
.
.
I
.
.
.
G G G G G G G G GG U
U U A U U U
----- -- -A G A U ----- -- -C C A U ------C C C A C A
G
1190 .
.
.
.
.
.
.
.
.
I
A A A A A A A A
- - -
470
G
U A A - - - - C A A - - - - U A A ----U A A - - - - U A A
.
490 .
.
.
.
.
.
.
C C
G A CG U U A U U G A C G U U UU G A C U G U A A A1,)IG CU G AC GG U UALU A U A~GAG U G A A C i A G -
A A A G U G A . CUU~A A A A UUGACG G U G A 18
- - - - - - - - - - - - - - - - - - - - - - - - - - -
-
-
- - -
-
-
- - -
C C C A A
A A A A A
A A A A A
G CG
.
.
Zp
Pp Pf Pf Pl
Ng Ld
Gl 62
m Q
1
G G C A A G A G G C A A G A
Hc Ms Tc Ta Po
:: G G C A A G G GG G C A A G A G G C A AG U
510
Ng Pa Ap Ap
95 96 100 101 Cv 102 Ec 105 Mg 153 Oh 173 Me 188 Mb 189 Sa 201 SI 203 Tt 207 Dr 208
A
A
---UAA - - -U AA - U A A -U AA -U
Zm 222 Os 223 Gm 225 Cm 228 Ce 229 Ce 230 Cv 231
AA
- U A A - U A A
A U C A
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - U - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Hs 234 Rn 241 Rc 243 Sp 247 P1 248 Gm 253 Cr 255 Sc 256 Sc 258 An 260 Sp 261 Pa 262 Lt 269 Ls 270
A
GA AA
-. -.
----------
-...u....X....
.A________________________________________,... U A U A A AA AA~A& Ci¢ A A: U'. A~~~~~~~~~~~~~~~ U
A
U
A
U
A
...........
tA..-s. . . A.U..U..................t
-
A U A A U-U A A U AUeW.A A U U - - - - - -
-
-
-
UU ).QM.X.....U AA U A A - - - -
-
- G A A U A A U-GA U A A U G AC AAU uUU C u A-~-. - &- - -GA -~~~~A ----A A U U U G -A UCUUAAUUUU AU U A -
-
-
-
-
-
-
-------
AUAAU-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~........ AUAAU--A-------------------------------
1150
1160
63 68 72 73 75
I.
C G G U GAA - C G G U AU_;A - -C G G U AU.AA.A. - U U C C G G U AUIU A..AA - C A U UGACGG.U AUCU...... - GA G A G A G A
A
1
7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59
19
U GA C G G U
U U U U
.
Vc Cv Ne Sc Pc
18' -
.
-AAU AA -- - U AA - - - - A A - - - - A A - - - - A A - - - - A A - - - U A A - - - U A A - - - - A A - - - - A A - - - - A A - - - - A A - - - - U A - - - - U A
AC----ACUGGACGUU
A. la U
.
Hs Oc Ec Tm :Dm Le At
500
I *
A U GGA U G A C ------A C j $_ G A C G U U ------A A i:>*.*i.....i.*. C G U G A C G U U
1
.
19
A A . A A AKA A A - A A -
480
.
GAUC C ;. A U C C A A UU C U A%. A C A A C C A AK A U C C A U C C A U C C K A U C A A U C A t A U C A A U C A A A C A A A C A, A A C A IC. A A C Ali A G C C A A ACAA - -C A
18' -
1200 .
1170
1180
1190
1 200
.
2268 Nucleic Acids Research, Vol. 18, Supplement 1210
.
.
1 Hs 7 Oc 11 Ec 12 Tm X3 VA G C 13 Dm AG C 16 Le C g C AAGO G C 19 At 20 Zp ..g.. ..........A.G. 22 Vc 1sC AGOC 23 Cv t A G C 24 Ne ACAGOC 27 Sc .B.u....4X....£ A G C 30 Pc ................... 19- AGO 33 Pp 51 Pf A GO 52 Pf .................. AGO 53 Pl s0 GO 56 Ng AAG 59 Ld 62 Gl ...................
I .~C
1230
1220 .
.
.
.
.
.
.
.
.
.
.
.
.
AGO 0 GOC~* AU U 1C AGCCGCAUU"...... A G C C G CGVA;A A U U e A CGC G C A U U Q A G C C G CGA A U U C. A CGC G C A U U. A G C C G C G . A U U " A CGC G C A U U CC A CGC G C A U U C; A G C C G C GiA A U U C A CGC G C A U U C.:t A G C C G C A U U cc AG 0 GOC BU. A G C C G CG1 A G C C GOGC UA CG CC CC GG CC G,S. AAGOC0GOC
A A A A
0 C C GGOC~ A 000 CG
A Uu uU ~ CC
.......
....
.
.
.
.
.
.
.
.
.
.
.
1260
1250 ...
1240 .
.
.
I.
A A G U U G
G G BBfl#M A*.^UAU g ~A A G U U G ...Cx.*.J ...C.A.A.U .A .X.::t.UA.UA.^>.U1J U A A G U U G CUCVCAU S.: :UA. A
A
U
U
A
U
....
A AG U U G A A G U U G e.U ..
..CU.B. .CC ...AA..Ui. AGC~UAUAUU .U A A G U U G C.£X..^...* B.xa 8;C.A A. :"C.U.UA .I .U AA AA GG UU UU GG CU~OCAAU
AGCGUAUA4JUU
A A G U U G K.B.u P0a.t .. Bfi CC CUB AAU. A4.C.UAUAU.U A AG U U G .....AU .4. UAVAV4JA CIJ-CCA AU AAP~~.C~~AUAUAU LI A .....x AA AA G G UU UU GG g.
U U t>. U U CC U U CC ..... UCC ¢... U W.AU UUM
~.
~
.
..C.... --AA
LJ~OUAAU C
GCAUACGU*JAA
AGA
U U G A A A U U G A A U U G
CU.CCx.. .AA.U A....G .U .. .U.... A~~~CAUAUAUUA CU ~~~CCAAU
~~A
.. U - gt .A. A G A .C U. XA.X'U .U gA "X AA UU GA CC UU GG C ~GXC G ASt;SX, ''CG .AC
GC A A~~~~~~ .>. A...AU. A U U CC C C G C i.'.
~ 0A G 0
.
,... -
19'
...........
.....
..............,
.............
3'
.............
...... ........
20 ...........................
...
63 68 72 73 75
.......................................................
Hc Ms Tc Ta Po
OCAUC A1gXC G A GAU0G
.4 &QCIJCt G A;U
OC G 0 0c GG 0 li:G: A A 0 Ai C C
O G 0 0 G 0 G:M 530X A A Ui...AgC
..........
.
............ ........... A
A
.G
U
.
.
XCU~~UA
UC~~~CC00 U A LIU~4~U U A U~~CC~C0 U A
IJAA.CCt G U U AU U A
....A.....U.U.A 540 560........... 550 I GAU.U.A * * * * ..I.*. **.*.I..*.*.*I.*.*..* OC G AGCG.. .G. A...U...................... 5 50 ....40 ..
"Ill
....
..... .........
C*IGGG~CCU
A
570 .
95 Ng 96 Pa 100 Ap 101 Ap 102 Cv 105 Ec 153 Mg 173 Oh 188 Me 189 Mb 201 Sa 203 SI 207 Tt
AGC AGC AGC AGC AGC AGC AGC AGC AGC AGC AGC AGC AGC AGC
208 Dr
AGCCGC AGCCGC AGCCGC AGCCGC AGCCGC AGCCGC AGUCGC AGCCGC AGCCGC AGCCGC AGCCGC AGCCGC AGCCGC AGCCGC
x G
Zm Os Gm Cm Ce Ce Cv
GU
19'
UAUAUAUAUAUAUAUAUAUAUAU AUAUA-
3'
20
.,... ;..................
CCA CA A CCA
GAAUGA GAAUGA GAAUGA GAAUGA GAAUGA GAAUGA GAAUGA
UAUA-AUAUAUAUA-
...
F U-- -- - - -- " .......a AG ¢
.. U~~~~~~~.. ....
2
2 2
Hs
cRn nRc
A
G C
A
G C
CA C CA C
A G C
G
CAA
.GG...U
AC
--
A A- --
--
A
C
AA ..AU.. ACGU-AAGAGAC~~~~~~~~~
Pls
A G
OC
C
G
C
A.A...U... A
:.I4LIU
,
....
CC..CU
A -
:C4
A G- --
GA
AOGG-A. 2
_--
GA
C G C U
....
.............
A
C G C G C
I
.
G G A A U U A G G A U U C A G G A A U C A
G C AGCCGC G a, a A A G A-.v ::A .- A G AU A AG G U G C AGCCGC ~~UA AG AC~~~A UC .. A... C:..VAVO :-" G C GGCCGC C......................................A V A AA GU AACA C ..A...G.....A.....A..G.G............C.G G G C AGCCGC Q UA G AA 1 :U A A G A C A ....A.....A..........A... C ..OU CA G C AGCCGC GG UA G U0 A t:iA C. A G...AGGA..G.AAO.1 IG CC A G C AGCCGC G.L[G.A G C AGCCGC .
.
GGAA U U A G G A A U U A 'GGA A U U A
A A G A
W
222 223 225 228 229 230 231
.
GGAAUUA GGAAUUA AGAAUUA GGAAUUA GGAAUCA GGAAUUA GGA U U U A GGAAUUA
G A A A A A A A
19
.
(A)U(C .gU U U AOG GLIG ..iAA-
A
AOCG ;:A u- .:.......G .......
..............
- U AI C-l4tgU UA -UDU U AA- U -C -U -U -U A-AL(.U}A .gA...... CG.0-. U U U A 4.UK .... .........
A UA A U-
-C
U
A U
A UA A UA
I~~~~~~~~~~~.
..
..
..
..
...
......
A U
..
0~~~~~
A
A
A
U
A
A
A
A
C
U
U
A
A
A
U
- - - -C
A
U
U
U
1250
A.G..GO 4
C U
U A U
A
-
- - -.A....-....
1260
Nucleic Acids Research, Vol. 18, Supplement 2269 1270
I
.
.
.
.
I
1280 .
.
.
.
1290
I
.
.
.
A A A A A A A A A A A A A
U U U U C C C U U U U U U
C C C C U U C C U U U U U
A A A C A C
U U U C C C
U A U U U G - C C -
.
.
.
.
.
I
.
.
.
1300 .
I
.
I.
1310
. . .
I
.I.
.
I
1320 -
I
-
... .. .. ..
AA A A G AA A A G .A A A G A G ~ A...A *..::: A C G A AA A G A A G .A* AA A A G *A A A G A. A A G
C C C C UCC CCC -
-A AAG C AA A C G C -
AAAC -
G C AA;^ A C G C
-
I..4 C-
G G G -A A A C G 4A
20
U-U-U-U-U-U-U-U-U-U-U-U-U-U-U-U-A A-
__ - -Hs 1 __ _Oc 7 Ec 11 .--Tm 12 .--Dn 13 .--Le 16 .--At 19 .--- - - - -Zp 20 - - - - -Vc 22 - -- - -Cv 23 - - - - -Ne 24 - - - - -Sc 27 _ - - - -Pc 30 _ _ _ _ -Pp 33 _ _ _ _ _Pf 51 _ _ _ - -Pf 52 - - - - -P1 53 - - - - -Ng 56 -- - - Ld 59 .Gl 62 .---
-
21 *~~~~~~~~~~. A A
- - -
A
G
G
A G
.
A.AG
A A
G U A
-
G
A
-
U A A
-
. .G
A G
U
A
G U A
590 .
I
.
.
.
.
I
600 .
.
.
.
I
.
.
.
.
I
610 .
.
.
.
I
.
.
.
.
-
-
A
A A GA
G
C
C
A
-
UAA
U
-
A
GA
~
-
CG
A -
-A A G A
A A GAGCU
CG U A
-
G U A
-
A A A
G U A G
G U
C U C A A C U C A A
- C U C A A -
--A
Hc Ms Tc Ta Po
-
A
63 68 72 73 75
Ng 95
Pa 96 Ap 100
C U C A A
C
-
G GCC ---A GA G ---AA A
A A A A
620
I -
-
UA UA C A U A
CU C A A
580
A A
CU CU C U C U
-
>Z
Ap
101
C A A E Cv 102 C A A E Ec 105 UIA A X Mg 153 C A A Oh 173 U A A Me 188 U A A . Mb 189 U A A . Sa 201 U A A Si 203 C U C A A r Tt 207 C U C A A (. Dr 208
C U C U C U C U C U C U CU C U
P21-1
21
AAG C1~~~~....G... ~ 1 U U.AA....AA.U U...-U ~Z A A G C QU CU G U.A....U.... U U U..AA..CC.C.UC.AA C~AIG-CuC AG ~- uC CO 22 U U U t AAUC ---A A G -CUCAA G U A ~GG~lUU U AA...G25 Cm 228 A ---AAG~~~~~...UGUA.W U%AA G U A G A A A U C C U A A C AG .A UGUU A4AAG U&ACUG 229 ---A A G CG UG U A G UGUUUAUUAAG U UAU%UUA A A G A UAGG- C U U A A Ce 230 ---A A G C CUUG U A G-U -(AAAG UCUCU UCA A A G A U AGG- C U U A A CCv 231
~~A~
OnC
---A A G G G~~. CA...C..G..U..A...... CG -AU-C G-----------(A T~ A A AG UCQC-CA- - - A A A A C-~ H 2534 C A aCG1~~~G. CA UGA G1 G - - -I U U A A U G G.CRn 255 --------- -Rc 243 GC U AGA U-A--AAGGAU AU-UA--UA---UAU---------AAGGAUCCGUAGA4........ ........UAU.-..c... UUA A---S --- -UAAAU ---U A A247 ------- -CAAA-------AAAAGUAUG........ U A G A - --GA--- AAAA(-A)A--SPl248 AG%G GtAAA AC ~~~~~ Pa 262 ---U~~G -UUACG)GUGA UU...%UA -.....A UA -UUAUUG.L26 -A U Q ) UAUA.......-A.........U(. s ...
1270
I 1280s
290
l
1310
I 1320
_- _ __-G-C
2270 Nucleic Acids Research, Vol. 18, Supplement 1330
.
.
1 7 11 12
13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62
Hs Oc Ec Tm Dm Le At Zp Vc Cv Ne Sc Pc Pp Pf Pf P1 Ng Ld Gl
.
-_
1340
.
.
.
_
_
.
.
_
I
.
_ _-
_
.
.
.
.
1350
.
_ _ ___--
--
.
.
.
.
.
.
.
.
.
.
u
-
-_
- - -
-
- - - - - - - - - - - - - - - - - - - - - - - - - - - -
-
_
-
_
-
-
.
§
C A E G L GU U
ua N .______-C -__-
-
-
-
_
-
_
_
_
_-
_
-
_
-
~ ~. 1360
.
.
.4
.
.
.
.
.
'b. L ..i
A
.
.
I
.
X. A
1370 .
.
i .....
1380 .
.
.
.
.
.
.
.
.
u
- U - AAA 3A(A$I C G A VU - C - AA#A- U A ~~~ 14 --.
1A3
A ~~~~~ ..... ..
->
E21-1
63 68 72 73 75
Hc Ms Tc Ta Po
630
640
650
I .I 95 96 100 101 102 105 153 173 188 189 201 203 207 208
I
Ng Pa Ap Ap Cv Ec Mg Oh Me Mb Sa
S1 Tt Dr
P21-l' 222 223 225 228 229 230 231
Zm Os Gm Cm Ce Ce Cv
-
;-
;(-
234 Hs. 241 Rn - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 243 Rc - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 247 Sp 248 P1 253 Gm .:X.J'kxG:t A A b. c13(cx U C4.A A - A C.CC9A.A *CA 255 Cr - - - - - - - - - - - - - - - - - - - - - - - - 256 Sc. -. A. .. ... ..
260 An 262 Pa 269 Lt 270 Ls
-
- - - - - - - - - - - - - - -
------
---
--
.
-
1 330
-
----
-.---.U- --C
AC-
--.--
1380__
I"'.'
S-0 A.-1 ?.AA C "A A1340
1350
1
360
1370
Nucleic Acids Research, Vol. 18, Supplement 2271 1400
1390
1430
1420
1410
1440
................GC A-Oc.
C G G IVWTm 12 -------C GU G.---..WA --UAU~1 ---~~~~~~~ -CUA-Le 16c~~~ ~~~-.*~~~~~A~~~)cG~~~~~--CUU~~~~~UGAG U -At------- 19 --- -
~~~~~
-
~~~~~~~ --
----------C UUUA-Z---------p 20L 1
------- -
PSc
30
L C --------------AC UU U UG G---- UAU UCWJ ~ CGC Pf U UA t~~QU ~UE&~~A A &~~C A A A U A U A U U A U A I& tt*G-CuUGUU-A A$ A Pf C .kAu u0uuuC UCUU ;Pl -UG-MUNNAAE. C --A GA GG AC -- ------------- - - Ng UA G---------------------UC GU CC C GU ---------- ---- Ld uCC ---Gi-
51 52 53 56 59
~~~~~
CUUG
UC~~~~~~-- ..A-----AG
--------
-
--
-
626
E21-1'-
E21-1
. Hc ------------------------------------------------------Ms . Tc . Ta . Po
------
-
-
-
-
-
63 68 72 73 75
. Ng 95 . Pa 96 . AP lO . Ap 1Ol . Cv 102 . Ec 1OS . mg 153 . Oh 173 -----------------------------------------------------------Me 188 . Mb 189 . Sa 201 . Si 203 . Tt 207 . Dr 208 -
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
.
-
-
-
-
-
-
-
Zm 222 .0s 223 . Gm 225 . Cm 228 . Ce 229
-
-
-
. .
Ce 230 Cv 231
. . .
-
-
.
. . . . . . . . .
-
-
-
-
-
-
-
-
-
-
Hs 234 Rn 241 Rc 243 Sp 247 P1 248 Gm 253 Cr 255 Sc 256 Sc 258 An 260 Sp 261 Pa 262 Lt 269
.Ls2~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 370 I
1 390
1400
1410
1420
IlI 1 430
I 1440
2272 Nucleic Acids Research, Vol. 18, Supplement 1450
1 7 11 12
13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62
Hs Oc Ec Tm Dm Le At Zp Vc Cv Ne Sc Pc Pp
Pf Pf PI Ng Ld Gl
~A-G C C
1480
1470
1460
I I ~
U G
::AAm
~
1490
~ ~ UUG CC U C--------~AGC C CA -
A4AC C U Ul U A (#4 4& $A A G C 4~A ~ ~ G ~ .~ ~ ~ ~ ~ ~ ~ ~ ~~~~$U ~~~~~~~~ ~~~~ ---~~~~~4~
G
UG
Ju C u u
CU A ~UGA43A U AG ~ G ~A C
~~ C ~~~~*A A A
UU CC CCCC----------
~G4
G$ UC AA CAA A U Q~~~~~AC ~U C ~U$ AAU *A AA C C
U AU U~*
~ U ~ ~ AG A~ U
A
A
AJ~ ---
A
G
UAC
Zm Os Gm
234 241 243 247 248 253 255 256 258 260 261 262 269 270
Cm Ce Ce Cv
Hs Rn Rc
Sp Pl Gm Cr Sc Sc An
-
Sp Pa Lt Ls
1450
1460
--
E21-2
Hc Ms Tc Ta 75 Po
222 223 225 228 229 230 231
--
1470
--
GC AU------------U ---:
63 68 72 73
Ng Pa Ap Ap Cv Ec Mg Oh Me Mb Sa SI Tt Dr
--
C
E21-1'
95 96 100 101 102 105 153 173 188 189 201 203 207 208
A U UA
U C-------------
C
CU
CAUGU
1500
I~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ . . .
I...I..
1480
1490
_-----1510
I
.
.
.
.
I
.
Nucleic Acids Research, Vol. 18, Supplement 2273
1520
.
.
.
.
I
.
.
.
.
I*.
.
1530
I*.
I
.
1540 .
.
.
I**.
.
I
1550 .
.
.
.
I
.
.
.
.
I
1560 .
.
.
.
I
.
.
.
-
-
U. Ui G __-__-__-__C
_
_
_
_-
-_-
-
-- Tm 12
-GGA.
-
U ---------------------------
_________________-----
--------_ _ -__ ----A A G A A
--
- --
---
- - - - - - - - - - - - - - - - - - - - - - - - - -
.
(f
--
-
-
-
.C UU
-U--
__
.--
G GGG
- - - - - - - - - - - - - - - - - - - - - - - - - - -_
_-
_-
_-
-_-
- _-
E21-2
_-
_-
_-
_-
CU
G G C C CU _-
_-
_-
1 7
-- Ec 11
C UC A
-
uccuU cUA U AA U U U::UjAU:U:JC:C C:C AJ i. u
1
I
____- - Hs --~Oc
ti~~
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - CW I
.
- - --
_-
E21-2-
Le At Zp Vc Cv Ne Sc Pc
Pp
Pf Pf P1 Ng Ld ---- GI
16 19 20 22 23 24 27 30 33 51 52 53 56 59 62
E21-3 ___
_
Hc 63 68
.Hc --___-----____________________________-----------Ms __________________________________-_------------Tc . -
-
_
. .
- - - - - --
72 _ _ _ _ _ _---- Ta 73 Po 775
- - -
. _ _- - - - Ng 95 .------ Pa 96 .------ Ap 100 -
. _- --- - Ap . - - - - - - Cv .___--- Ec . - - - - - - Mg . ~ ~ - - - - Oh Me - - - - - - - Mb .- -- -- - Sa . - - - - - - Si . - - - - - - Tt . - - - - - - Dr
. . . . . . .
-
-
-
-
-
-
101 102 105 153 173 188 189 201 203 207 208
-
-
-
-
-
-
Zm 222 Os 223 Gm 225 Cm 228 Ce 229 Ce 230 Cv 231
Hs 234 Rn 241 _-- - - -- - - - - -- Rc 243 .-c243 - - - - - _ . Sp 247 248 .~~~~~~~~~~~~~~~~~~~~~~~~~P - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -------Gm 2 53 - - - - - - - - - - - -- - - - -- - -- - - - - - - - - - - - - - - - - - - - -- - - - - - - - Cr 255
- - - ---- -- ------
----------------------- - ----------- - -- - - - ----------------------- ------- ---------- - -
SS 256 .Gm
. Sc An .5-Sp . ---- ---- -- -- - _ Pa . - - - - - - - - - - - - - - - - - - - - - - - - - Lt
- - - - - - - - - - - - - - - -- - - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
- --
I ]!)-I u
I 1520
1530
I b4u
1550
1 560
258 260 261 262 269 Ls 270
2274 Nucleic Acids Research, Vol. 18, Supplement 1570 .
.
.
.
I
.
.
.
.
I
1580 .
.
1 Hs
7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62
.
I.
.
-_
-_
-_
-_
-_
-_
-_
-_
-_
-_
-
-
.
.
U ---U ---U - - - U ---U ---U ---U ---U - - - U
U U U U U U U U U
A G C U A G C U
.
.
- - -
-
Oc Ec Tm Dn Le At Zp Vc Cv Ne Sc Pc Pp Pf Pf P1 Ng Ld A C A A G Gl
1590
I
.
.
C C A A A A C
A G A A A A A
.
I
C C U U AC U U U U C U
.
.
.
-
-
I
1600 .
.
.
.
I
.
.
.
.
I
Hc Ms Tc Ta Po
95 96 100 101 102 105 153 173 188 189 201 203 207 208
Ng Pa Ap Ap Cv Ec
222 223 225 228 229 230 231
234 241 243 247 248 253 255 256 258 260 261 262 269 270
.
.
.
I
.
.
.
.I
1620 .
.
.
.I
.
.
.
.I
-
---UUCGCU
---U U A A G U -
-
--U U UU
- -
-U U U U AU U ----U U A.. - --
-_
-
..
---
..5, ,,,...,,^
-
-41-..AA ,Ak.,.-r AC-~:- - - __.. ..
..
jU G G U U C -_
_-
.* _-
_-
A. _-
_-
_-
A C G C A U G U C A _-
Mg
A
,~~~~~~~~~~~~~~~~~~~~~~.......... +...
_-
E21-3 63 68 72 73 75
1610 .
C U
---UUCACU
-
.
E21-3'
-
Oh Me Mb Sa
Si
Tt Dr
Zm
Os Gm Cm Ce Ce Cv
Hs Rn Rc
Sp P1 Gm Cr Sc Sc An
Sp Pa Lt Ls
1570
1580
1 590
1600
1610
1620
Nucleic Acids Research, Vol. 18, Supplement 2275 1640
1630
I
.
.
.
.
.
I C-
.
.
-
-
.
I
.
.
1650
I-
-
.
.
.
-
-
-
-
-
-
-
-
-
-
-
-
-
I -
-
-
-
1660
I
i
-
-
-
-
-
-
-
-
.
.
.
.
.
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
G-GG
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
G--
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
--
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
_
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
U-------------
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
a> -
. i-
-
-
-
-
-
-
-
-
... -
-
-
-
-
-
-
-
-
-
-
-
-
i-
-
4CA C C. _ UC G 5-----U C C G U.--
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
E21-3-' -
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
1640
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Hc ms Tc Ta Po
63 68 72 73
Ng Pa Ap Ap Cv Ec Mg Oh Me Mb Sa Si Tt Dr
95 96 100 101 102 105 153 173 188 189 201 203 207 208
Zm Os Gm Cn Ce Ce Cv
222 223 225 228 229 230 231
75
Hs .234 Rn .241
Rc .243 Sp .247 248 .~~~~~~~~~~~~~~~~~~~P1 .Q-uGm 253 Cr .255 Sc .256 Sc .258 An .260 Sp .261 Pa .262 Lt .269 Ls .270 -
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
I
-
-
-
-
-
1630
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
--- ---
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
--
-
-
-
-
-----------------
-
-
-
-
-
-
-
-
-
-
-
-
-
-
- - - - - - - - - - - - - - - - - - - - -
-
-
-
-
-
_ _
-
-
-
-
-
_
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Pc 30
Pp 33 Pf 51 Pf 52 P1 53 Ng 56 - - - -- ---- -- -- - --- Ld 59 ________--------- GI 62
_ _ _ _ _ _ _ _ _
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
v-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
--
-
-
-
-
-
-
-
-
-
-
-
-
-
--
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
I
-
-
-
-
-
-
-
.
-
-
-
.
E21-4
-
-
.
-
-
-
-
.
_________--------
- - - - - - - - - - - -
I
Hs 1 Oc 7 --- Ec 11 - - - --- Tm 12 -__-----__ - - - Dm 13 -- Le 16 At 19 - - - - - - - - - - - - - - - - - Zp 20 _ - - - - - - - - Vc 22 _ _ _ _ - - - - - - Cv 23 _ _ _ _ - - - - - - - Ne 24 - - - - - - Sc 27 -
-_
-
.
------
-
-
.
-
- - - - - -
.
--- -
- --
1680 .
-
- - -
I
I
-
-
-
1670
I
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
I
-
-
-
-
1650
-
-
-
-
-
-
-
I
1660
1670
-
-
-
-
1680
2276 Nucleic Acids Research, Vol. 18, Supplement 1690 .
1 7 11 12 13 16 19
20 22 23 24 27 30 33 51
52 53 56 59 62
.
.
.
Hs Oc Ec Tm Dm Le At
________
Zp
________
.
1700 .
.
.
.
.
.
.
.
1710 .
.
.
.
.
.
.
1720 .
.
.
.
.
I
.
.
.
.
1730 .
.
.
.
.
.
.
.
1740 .
.
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
________
________
__
________
_
.
.
.
.
.
A G A G A- A - A
C G C G CG C G C U
- - - - - -
A G
- - - - - - - - - -
- - - - - - - - - - - - - - - - - - -
________
.
________
Vc Cv Ne Sc Pc Pp Pf Pf P1 Ng - - A C U U - Ld GI
-_
________
G C G C A U U A CC A
-
__.______
- - - - - - - - - - - - - - - - - - - - - - -
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
________
________
________
________
-.tau
.-.u. :-. . .t X -U
________
- - - - - - - - - -
________
- - - - - - - - - - - - - - - - - -- - - - - - - - - - -- - - - - - - - - -
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
_-
GA U U U
-AG
C
E21-4'
63 68 72 73
Hc Ms Tc Ta 75 Po
95 96 100 101 102 105 153 173 188 189 201 203 207 208
Ng Pa Ap Ap Cv Ec Mg Oh
222 223 225 228 229 230 231
Zm Os Gn Cm Ce Ce Cv
234 241 243 247 248 253 255 256 258 260 261
-
Me Mb
Sa
S1
Tt Dr
-
- - -
- -- - -
- - - - -
Hs Rn Rc
Sp P1 Gm Cr Sc Sc An
Sp 262 Pa 269 Lt 270 Ls
-
I 1690
1700
1710
1 720
1730
1 740
Nucleic Acids Research, Vol. 18, Supplement 2277 1750
1760
1770 .
.'.
.
.
.
.
1780 .
.
.
.
.
.
.
.
I
1790 .
.
.
.
.
.
....:...', AAAAA
-
U
A UU A A AA UUJt )44l4 A UtA~ UUA A CC&UA LA A
-
C
-
- -
_____
C C C C C U
U U U U C U
CGUU
I
.
G C C G C C -
- - - - -
- - - - - -
U - - U G A A C U - C-- C U G C - - C - - C - - -
- -
- -
- - -
- - - -
------
G C U C G - - - G A C G G G C A C . --U U U AC A G U U CAU C U A A AA g A G A - - - A C U - - - - - G G - - - - - - - -
-
AAU()A
AL*AA
.A A U A A A C(.).... G A A U AA A A A GA G .C(GIU A G GA A A G A -
--C -C -G C C G C -C - - -
GA '.C. A GU C
E21-5
.
A G C A G C - G C - G C
-
G U AG A A .ACUUG .U4A AA GUA A A GGGU UUU UA UUU CItU U AA AG GU AA AA UUAG AG U ( U GAU U A A G AGU U AA U -U ACUf)G A GA A A U U Atf ~ G U A A
.
.
... UAC....... tHJ A U
UU UtAM U U UWa WU ,AA A A A A A UU G U r1Jl$A A A A A A t$i AX BA GU UUU W IfUA CUU AU4aA A A A A A A A E C AA Aj AA W AU ACA A CUA AA AA UBUA K A U UAIX;S U:AX
1800
.
E21-!5'
Hs Oc Ec Tm Dm Le At
Ld Gl
1 7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62
Hc Ms Tc Ta Po
63 68 72 73 75
Ng Pa Ap Ap
Dr
95 96 100 101 102 105 153 173 188 189 201 203 207 208
Zm Os Gm Cn Ce Ce Cv
222 223 225 228 229 230 231
Zp
Vc Cv Ne Sc Pc
Pp Pf Pf
P1 Ng
E21-6
Cv Ec
Mg
Oh Me Mb Sa
SiTt
Hs 234 Rn 241 Rc 243
. . . .
Sp 247
248 .~~~~~~~~~~~~~~~~~~P1 Gm 253 .(-i. .
Cr 255 Sc 256 Sc 2258 An 260 Sp 261 Pa 262 Lt 269 Ls 270
. .
. . . .
I 1 750
1760
1770
1780
1 790
1800
-.CAQ- . ~UGA ~. ~. ^C~GA'
2278 Nucleic Acids Research, Vol. 18, Supplement 1820
1810
1 Hs 7 Oc 11 Ec
1840
1830
1850
1860
A .. . ................. A . A.. GCA A. A* .. WG tA ^.G G AJA ........ 4-AC - - 4.0(W } .....C A........ -
....... .....................
AA...G........-A. ---A.....U.A.C... .................
Q --............ -- -
......
-G----------------A BA ; ......................... .B......
-
-
-G $mWtG 12 Tm 13 Dm A ....G)" -" --------A'¢ ................................. - G-A WAG4 G A I)A) 16 Le G--L-(A)QCA AIAAC(A UI ... . .G.fi.W A CA G....."UA..:....uG..tAQ..Sg^A..AA A ... 19 At U().A AAC(CGAJGG-C UI: -L ACAU AA A - AA G - -- - L :::.O Ii U(A U.: ----G(A)Al,LAUAU~~~~~U(A) .AAAUA~~~~~~(A)U A A..A::A A.:..AU AU GA G CA U:G 20 Zp UAUSt§S: AG )~~~~~.....A A A U A-g-t*5 GUAC t ....A......... C -A. -U.G-.-A.C CACG ......A...A....... U C 22 Vc --LK..... 23 Cv E ---2AAUACAVVA...AC 24 Ne 27 Sc .....V A(-A 0 -A G C A)l A t) U A G .~ ~ A A A ~~ C A U ~~ 0 G U G A --A CU. 30 Pc 33 Pp 51 Pf AE21G-8 A2O' A EC1A 52 Pf 53 P1 56 Ng 59 Ld 62 GI
-
..
GAi. AU. :.
...
...................A.........
ALIAG.A-AC A-U::AA ----
63 68 72 73 75
Hc Ms Tc Ta Po
95 Ng 96 Pa 100 Ap 101 Ap
102 105 153 173 188 189 201 203 207 208
Cv Ec Mg
Oh Me Mb
Sa
S1 Tt Dr
222 223 225 228 229 230 231
Zm Os Gm Cm Ce Ce Cv
234 241 243 247 248 253 255 256 258 260 261 262 269 270
Hs
Rn Rc
Sp P1 Gm Cr Sc Sc An Sp Pa Lt Ls
1810
1820
1830
1840
1850
1 860
;iC-U - U G C4A
Nucleic Acids Research, Vol. 18, Supplement 2279
1870 ...............
1880
1890
gL*(t4 G
1910
U U
G UUuuU U U CC GG
-
IJ.GG
1900
-
C C------ Hs
1 7 ..U A .------- Ec 11 U A------- Tm 12 U A------- Dm 13 U A------- Le 16 4 U A------- At 19 ----- Z 4 U A . Zp 20 U A------- Vc 22 4 U A -.----- Cv 23 U A------- Ne 24 4UA------- Sc 27 ..U A ------- Pc 30 UAA------- Pp 33 Pf 51 -__-UI-----Pf 52 --___-- P1 53 UGU A .---- -- Ng 56 C UCUAUUGGA Ld 59 ---------- Gl 62
.CC------- Oc
G-----------UU U C G G-----------UU U C A G G-C C u u C G Gt G G--------------C u U C G G4 G-C C U U C G ---
~~A~~~~~4G-~UUUCUUCA: GGU-U C-U G U AUC GUUG-------------U C U G U A A G.U--------U U U CU C U A G Gf) -U ----U. IGUCu - U.U U U U C U U
KGi
i
U
-----
U
----
UU U C A U U A U U U U U U U U G A U A U U C
.*
UU
U U U
~ ~~ --C U---UA
Cu
C ~GAG CUU-C---------UUGU4 CGGCUU-------------UGCG..UU -CC---------GCG4¢
C
-
E21-7'
E21-8
1920
E21-8' -_
_-
_-
_-
_-
_-
-_
_-
_-
_-
_-
_-
_-
_-
_-
-_
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
-_
-_
-_
-_ -_
-_
-_
-_
-_
_-
-_
_-
_-
-_
Hc 63 .----Ms 68 Tc 72 .-----------------------
Ta 73
_--- -Po 75 .__
-_
-
-
-
Ng Pa Ap Ap Cv Ec Mg Oh Me Mb Sa Si Tt Dr
9S 96 100 101 102 105 153 173 188 189 201 203 207 208
Zm Os Gm Cm Ce Ce Cv
222 223 225 228 229 230 231
Hs 234 Rn 241 Rc243 Sp 247 P1248 Gm 253 Cr 255 Sc 256 Sc 258 An 260 Sp 261 Pa 262 Lt269 Ls 270
1870
1880
1890
1900
1910
1920
2280
Nucleic Acids Research, Vol. 18,
Supplement
1930
1 Hs. 7Oc. 1 1 Ec. 12 Tm. 13 Dm. 16 Le. 19 At. 20 Zp. 22 Vc. 23Cv. 24 Ne. 27 Sc. 30 Pc. 33 Pp. 51 Pf. 52 Pf. 53 P1. 56 Ng. 59 Ld GA U UA .--62 Gl
1940
1950
1960
1970
1980
-
-
-
-
-
-
-
-
-
-
-
-
-
..................................
AMA
C G A C
C
WU(
C C A
A A C C U C G
O."O" C
IG
-------------------------------------------------------
E21-9 63 68 72 73 75
Hc Ms Tc Ta Po
95 96 100 101 102 105 153 173 1 88 189 201 203 207 208
Ng Pa Ap Ap
-
-
-
Cv Ec
-
-
-
-
Mg
Oh Me Mb Sa
Si
Tt Dr
222 223 225 228 229 230 231
Zni Os Gm Cm Ce Ce Cv
234 241 243 247 248 253 255 256 258 260 261 262 269 270
Hs Rn Rc
-
-
-
-
-
-
-
-
Sp Pl
Gm Cr Sc Sc An
Sp Pa Lt Ls
I
1930i
1940
1950
1960b
1970
1980d
Nucleic Acids Research,
1990
2000
2010
2020
Vol.
18, Supplement
2030
2281
2040
Hs
-
-
-
-
-
-
-
-
-
-
-
-
-
Oc
7
Ec
11
Tm
12
Dm
13
Le
16
At
19
Zp
20
Vc
22
Cv
23
Ne
24
Sc
27
Pc
30
62....~~~~~ ~ ~~ AcG~
G C C
C
.-
i~
E21-9'
E21-9
-
-
Hc Ms Tc Ta Po
63 68 72 73 75
.~~~~~Ng95 .~~~~Ap100 .~~~~~Ap101 102 -
-
-
-
.~~~~~Pa96 -
-
-
-
-
-
-
-
-
.~~~~Cv .~~~~~Ec105 -
-
-
-
-
-
-
-
-
-
-
.~~~~Mg153 173
-
-
-
-
-
-
-
-
-
-
-
-
-
-
.~~~~Oh 188 -~~~~Me .~~~~Mb189 .~~~~~Sa201 .~~~~Si203 .~~~~Tt207 .~~~~~Dr208 -
-
-
-
-
-
-
-
-
-
-
-
-
. Zm 222 . Os 223 . Gmn225 .Cm O 228 . Ce 229 . Ce 230 . Cv 231 -
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Hs 234 Rn 241 Rc 243 Sp 247 P1 248 Gm 253 Cr 255 Sc 256 Sc 258 An 260 Sp 261 Pa 262 Lt 269 Ls 270
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
I
I
-
-
-
I
I 2000
2010u
I
2020
I
I 2030u
2040
__ _ -
2282 Nucleic Acids Research, Vol. 18, Supplement 2050
.
.
1 7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62
.
.
.
I
.
.
.
.
.
.
I
.
Hc - - Ms - - Tc - - Ta - - Po - -.-
- - - - - - - - - - - - - - - -
- - -
- - -
-
-
-P.UGA:~ .
G C G A GA-
I
2090
. . . .
2100
. . . .
- - - - - - - - - - - - - - - - - - - - - -- - - - - - - - -
. . . . C G G G C G G G U GGG
-'..;;.:.-:-'-.: ---:.:;;.;':';:::: -:- .:-: G G G G U - - - - - - - - - - - - C G G G G U - - - - - - - - - - - - C G G G G U - - - - - - - - - - - - C G G G - - C G G G G U - - - - - - - - - - --GGG GU-C---G U - - - - - - - - - - - - C G G G G U - - - - - - - - - - - - U G G G G U - - - - - - - - - - - - U G G G G A G GU-CGGG~~~~~~~~~~~~.......... G C U - - - - - - - - - - - U G G . - - - - - - - - - - U U G G i X; G U U - - - - - - - - - - - U G G 99
G U
G UG - -
::
-
.-
- - - - - - - - - - - - - - - - -
................... ........ ..........
C G G C G G G
.......
22
UU G U C G U U G U U G U AG
*
660 I . .
.
670
. .
. . . .
UCGG U AG UAG. U AG U AG U UG U AG UAG U AG U AG U AG UAG U AG UUG
Ng Pa Ap Ap Cv Ec Mg Oh Me Mb
Sa S1 Tt Dr
... ................
222 223 225 228 229 230 231
M.
. . . .
. . . .
O)4
.
95 96 100 101 102 105 153 173 188 189 201 203 207 208
I
. . . .
. . . .
- - - - - - - - - - - - A U GAA *J.A G Hs -- - - - - - - - - - - - - - - - A U G A A G Oc Ec Tm _ --~~~--_J ACA U G A ... AAAG A *J -A Dm - - - - - - - - - - - - - - - - - - - - - - A U G A 4 G Le U GA AA C _ AUGA At - - - - - - - - - - - - - - - - - - - - - - A U G A I AC Zp - - - - - A U G A iJ.A G Vc Cv UU A G - - - Ne ----------AA UUU GGG AA V.4$AA0 A Sc ---------A Pc -G U A AA-AGC Pp . A C G A 11..... Pf ---------A C G A Pf ---------A C G A AA4$ P1 Ng -A C A AG G Ld C G C C U U U U -: A - - - - - - - - - - - - - - - - - - - - - - - - - - - - - A G Gl E21-9'
63 68 72 73 75
2080
2070
2060
.
Zm Os Gm Cm Ce Ce Cv
...,.,
-
--U
G GA4A
....
G G UAuGA
--U --U
GUU
A
G
22
>........................ ......... G
-
...........
tAC............................ A G G
..........
.,W.%-
.B ......-..G..-..-
GU A
-_ _ --A; A GA AGOA
G
G
..........................
.................................. ...............
234 241 243 247 248 253 255 256 258 260 261 262 269 270
Hs Rn Rc
- - - - - -
- - - - - - - - - - - - - - - - - - - - - - - - - -
P1 Gm Cr Sc Sc An
- - - - - - - - - - - - -
------
- -
Pa Lt Ls
- -
A
A A AG A:.
U A G A.AA A G A: - - - - - - - - - - - U A G - - - - - - - - - - - U A G A.. .A__ A:
- - - - - - - - - - - - - - - - - - -
Sp
..............
-di
- - - - - - -
Sp
........................
U
-A__
2050
2060
2070
2080
2090
2100
2110
CA
2120
VUCtU(A~.....U.. at................ At.:G
CA QUA
..A... U.. U
CU
2140
Supplement 2283
2150
2160
Hs
..A..
oc 7 GA.... ACA....A.A.C.U.U.A.-.A Ec 11 ACAC........ G.UGA A.....U......U Tm 12 )AU..A.CA.CA.CC.CACG.G..( A.C.GA.AC.A.U.C... GU
GA
AA
U
V
CV ....
c
t
&
a~
C
C
A
G
141.... A...C.* CAC.. G WU:U>$( C .C ...QA. V.. tA...A A CA..G..G.AA.:V.U......UU.A..CA...A..CA
A
.C(
)
CA... ...U....C.....V AU... CA U U CGU
~.
2. . 2130
Nucleic Acids Research, Vol. 18,
ISV
A
.AU(V)C.....
CAV........
U
i A A .G A AA U
U U CA U U C (AJV G U CC U(U)G
C
U
GAG
U
G
GA
A
A
S U...V...AU
Vt)
U+GS
CU
A
G
..
.A
C-
GA CGA.C..A..-.
U(
UU
U
Dm 13 Le 16 C U` X A 19 UA At 20 Zp Vc 22 ACAGG U .AC Cv 23 A G U G A AA:A An Ne 24 "Sc 27 Pc 30 Pp 33 Pf 51 ..Pf 52 :, P1 53 I ~ .:~L.FlU Fl'z ~ .~U G AI,A GU A A C* U U:4a A - G A C C...U.C.A UC U .GU..A. C A c U A A -C Ng 56 A'A C G U GA A A A WV C U -U A~C Q&~(A)CAAG A~A A C U A C -AL 59 CA C&U AC C0 (a-C C~-ACGGG U G A A ACA&GYA AUC (C AGACCCCGGGiC 62
~~ ~ ~ ~ ~ ~ ~ ~ ~~
U
CA C 14(MV U::U U C ~A(9)C'~U
22
C-
23
~t ~~.... ... ~~&U(A)CUUtA&.... (AU..... ...4.. U....
A--:A
A.-G 680 .
.
G
VU
AA
.
.
-A
GA
-
UUVC
U
CO
6
A
.
:
A
23'
24
.......... .............................................................
63
68 W.u.MG .A (.i ......C H GG UU GG AA AA AA UYCU GA'"S GA UU AA ..A.MCee... . .C...C .. A.C A CCCGGU C A G U - GG Ms . CSGUAAAA 72 A... U..G.U.A AU... ..A ACGACCG.G.U.G 73 AGGGC GAAAUCAA AUCUGGAGGACA CC A GUG Ta Po 75 700
690
I
A
I. . . .
710
720
.. .. ............ ...........................
.................
.U---G GG AA AA ----U .t---A GG AA AA ..M---A G A A . ---G .U * ---G G A A ...---G GG AA AAI *.t---G :,-.GGAA . ---G G A AI . ---G G A A C.---G G A A *.---G G A AI . ---G G A A -
-
-
AG
A
G U G A A A
G U G A A A G U G A A A G U G A A A
C4.
C
U
G U GA A A G U G A A A
A
C-
G U G G A A U G U G A A A
G U G A A A G U G A A
A
GA
A
G U G
U..;
Z-4 G G A & 'U
J.
C
'r,
G G A A t-J.':: "..c.
G G A
..............
,
......
G U G A A A WA G G G G G
U U U U U
G G G G G
A A A A A
A A A A A
G G G G G
U.. U.::Ic: G-'-'Aj'-'.'G U .............
:::t................. ..................
U U
......
.:.'A+ t-JA A
.:..
IJ %......... U
k
'U W.:
A A C
G A A C A C C G G U GA A CA CC G U
- G
G A A C G C C G A U G A A C A C C A A U
- G - G
G A A C A C C A A C G A A C A C C A A C G A A C A C C A A C G A A C A C C A G C G A A C A C C A G C G A A C A C C A A U G A A C A C C A A A
G G - G - G - G - G - G
Zm 222 Os 223 Gm 225 Cm 228 Ce 229 Ce 230 Cv 231
A¢
:A.G
A
GA A C A C C AG U G A A C A C C A G U
G
U.(A)C¢
A
A...A C..
AS
- A
G A A C A C C G G U
-
-
U U U U U
G U A A A A A U U A U 'A'%' G U G A A A 'A':: U C A U :U U A 0 -A C G G U A U A A U U U G G U A A A A A A G C A G U A G U G A A A U..................... G G A G U C A A A U G U A A'G A: X...U G A A A U A G U A 'IJ: "G A U A U A A G,:...' '.:::G A U C A A A .0U
A............................................ U ............%
A A A A A
G
...............
G G G G
G u u U u G G A::
A A A A G A
.................
IJ
A U C A
..A.:A (
2120
-
-
-
-
IJ
U -1:
G
C A C A A C C U A A G A A C - - A C A G A A
......... v............
u C ...U:...
WA
:'-.G G
U A A U
G U A G A C
U A U A A A U C
A U U
A
...
U
G C A A A G C A A A 1-.::
2130
2140
C A A G G
C A A U A
.......................
2150
-
U Hs 234 A Rn 241
- -
- - -
G U U G A
G U U A C
A A A A U
A C U G G U A A
.............
A U A U A U A U
-
A A C G C C A A
................
...
A A U A G A C G C C C A A U A U C C A C A G A -
C R-M W A G A A A ::t-)::X.X -A G A C kM :.: V. -J.'.-.-A G A C .A..G A G A C U .................. ...............X .................. .-A U G X
A A
........................
..............
...........
U
................ ....................
2110
G
G A A C A C C A G A - G
CCAS ACC
-
...
...................
C U G U A G C
....
- G - G
...
.....
A G A A G
G A A C A C C A G U A. G A A U A C C G G U
A
A U U G
0"'."
A A A A Ij.:. A
.......................
V.
- G
95
Pa 96 Ap 100 Ap 101 Cv 102 Ec 105 Mg 153 Oh 173 Me 188 Mb 189 Sa 201 Sl 203 Tt 207 Dr 208
...
..........
A A A A C A A A A A A A
....
U A G
G A A U A C C C G U G A A U A C C G G U
a...
AU
G C A G A
...-.G
...
G U A A
A
Ng
U
23'
U'u
G G A A
U A A
A
- G
ACA:S
G U A G A A Ut G C A G A U A
G
G A A C A C C A G U G G A.
G U A G A G G U A G A G
- G C A G A
.....
G G A A
:Aj-
C
G
UA GA
23
..............
"G
G U A G
G U A G A
CC^ G
G A A U A C C
A
G U G A A A U.t..-. G U A G A ..VAVA
G U A A A A G U G A A A
.-.-AGAA
G G A A
.
A G A G
G U A G
.............
A - A A - G U A U U A U C - G G - C AG - U G - U G
2160
Rc 243 Sp 247
P1 248 Gm 253 Cr Sc Sc An
Sp Pa Lt Ls
255 256 258 260 261 262 269 270
~.
2284 Nucleic Acids Research, Vol. 18, Supplement 2170
*
.
.
.
.
.
.
.
2180
1Hs MtG-----------------7Oc ----llEc ... ---12Tm QC---------------13Dm 4------------------------------16Le l9At 20Zp ---------------22Vc .---------------- --------------23Cv 24 Ne .------------------ 27Sc 30OPc 33Pp :C ----51Pf -&-C -----_--___---__ -- - - - -_----52Pf 53P1 -C ----____________ ------__________ 56Ng 59Ld __________ .--------------62 Gl ----------
-----------
24 ........
63 68 72 73 75
Hc Ms Tc Ta Po
...n
_
a-"..T.
2210
2200
.......
2220
.
.
.
.
.
W_____
u> E;-____.
| F_____
fi_____
m§E _ _ _ _ _ !..9._____ B.N_____
%n_____
B_____. >>t .x;
-
_
_
_
_
--GA A A-G------t--G U --GA AG------------------ G CA G----------------
A U U cC-
.........2 249
22'
-G --
-
-G--
-G--
-G --
-
-G --
-
R _ _ _ _ _ *m. :X::
24S .....
222 Zm 223 Os 225 Gm 228 Cm 229 Ce 230 Ce 231 Cv
234Hs 241 Rn 243 Rc
B-----,
-
-
i:*s
e - - - - t - - - - H -
-
-
-
-
Ct - - - - -
AC-------
t-----------A
--------- -
247 Sp UA----------248 P1 -A-C -----------C-------253Gm 255Cr g----------256Sc .t----------258Sc .U----------260 An &-----------
261
Sp
740 .
-------GAAG.---------GAAA---.------GAAG.---------GAAG----------GAAG----------GAAG---.------GAAG.---------GAAA----------GAAG----------GAAG----------GAAG---.------GAAG---.------GAAG----
*...>......
-IIV
.
--
--------GAAG.---
M_____.
.
-GAAA.-----CA---.U
730
95Ng 96 Pa 100 Ap 101 Ap 102 Cv 105 Ec 153 Mg 173 Oh 188 Me 189 Mb 201 Sa 203 SI 207 Tt 208 Dr
.
--GA A A-A----AUU -----------GAAA---------CAU U--GA A A--- -------GA A A---------------AUU --GAAA.--------------.--AUU--GA A A----------------U UU--GA A A--------u u-------GA A A------------ - ---U U--GA A A.--------.------U U --G A A A----------------AUu U --G A A A------------------G A.---AUuUU --G U A A.Q A u --G A A A ---------------U U --GA A A.---.-.-.-.-.-.-.---- U-U --GA A A---------------U U
---GAAA ---GAAG -- -GAAG -- -GAAA ---GAAG
- -
.......
2190
I1
I
-_____A -__-
-QC~~~~~~>
G---~ -_____-
G----------------__-_-_-_-_->---_-
--
G-______ --C
----___ _
_-
G
-_____GC .G ---~ G----____--
C--G------..G--_____
--
--GA-
--
A-_____ .-
249
GAAA---GAAA---GAAA---GAAG---GAAG---GAAA---GAAA - - - -
22'
A -------
-~---A--__ A -___2 ----
----;..G--__ A ---|.. --_ GC A -__ ---~
----GAAA.GU-------G.>.-----------GA A A------------------------GA A A----C C CU- --------
- - - - -
A C---
------AA U A A-______________ C -----GA AG-- - - - - - - - - - - - -A----------G A A A-- - - - - --A-------
----GA A A.---------.-.-.-.--A--------GAAA.----------$A.--AG A A G.--------.-.-.-.-. -A .---A -. -A -------A A G U.-A
262 Pa ,----------269Lt ACAUUU UUAU A 270Ls ACAUU AUUAU j U
AG-Q-O------- A----------GA VAAUUAUAUL%U1 . ,AUUBCcAuCA-cU ~ U U A U A U C A U A U CU U ......
I 2170
2180
....,,,,,..*.*. *.....
2190
2200
2210
2220
.k:A
2230 .
.
.
.
.
2240 .
.
.
.
I
.
.
.
-A
Nucleic Acids Research, Vol. 18, Supplement 2285
2250
*
.
.
..
2260
2270
I
. . . . . . . . . . . . . . . . . . . . . . .I. ... . .......... .........I. ......
1..I. .
.
2280 2280
. .. ¢g A ,UCAGAU A C C Hs - AA A G4 U U C GA A . ........G..U.C. G U.UC.GA.A.. A A A..... A ,UCAGAU AA -AACC Oc
A --A A G U U C G A A G U U C G A A -AAJ~A A .. .r.--A ..G U U C G A A Ai - - A A G UUAGAmS8m.G G
U C G A A .
A
U C GA A
- -AA
GGC U CGAA
A A
G
;^--A - - A A B. U
A A - A GAGA3
A --A
22'
.C
GGG C U C G AA A
. u A B UA.W G C U E --A - - AAGI) A GG AA GUU A -CAAA - A AAS..W:E G.A.GAG A--A A G AAGGG AA GU A ~A A A~GG A G C A A A A... GA U
.
A A A A A A
C
CC GG AA CG A CU GA G A
A X AAA ...l A U G A AA ~:~ U G A A C G A A.
GA ~ U C GA A C U A G A A
21'
A
U
A A
,UCAGAU A C r Ec ,UCAGAU A C C Tm ,UCAGAU A C Dm ,UCAGAU A C CLe ,UCAGAU A C C At ,UCAGAU A C C Zp U U A GA U A C C Vc ,UUAGAU A C Z Cv ,UUAGAU AC Ne U UAGA U SC AC ,UCAGAU ACC Pc ,UCAGAU A C Pp ,UCAGAU A C Pf ,UCAGAU A C Pf ,UCAGAU A C CPi ,UUAGAU AC Ng U UAGAG A C C Ld ,UCAGAC A C CGI
7 11 12 13
16 19 20 22 23 24 27 30 33 51 52 53 56 59 62
25
..........s....S.S
A A
A
A A
..A
A A
A
A AC
..
Q.C
A--A A--A A--A A--A A--A
~ G A GCGA CC. G..A -
A A
.....
:
GC A GA-
G
...
750
760
I*
I
*I
G G G G
. .
. .
. I
I
I
*A
GA ....A GA
`GA
C
gC~A AG
.
...
A.
G
A
..
A..A
U:U AL
A
A U
U G C
CA
U
U-...G........
UW G(C)kU.: f$. U. ." AUUU---------GAACGC AU AAGC
U
G
CA C
U
A A A
A C
C A
G A -. .............-.. G A G UU. .aS..^ .A. .A.i.UX GAG G A - C A.; Uf. Ui. G
G U
. . U ...U
AA
A .
. . . . .A.i9 ...
C
G A
A C
U
-
G G
C"
G
A U G A. C.A.U..G G A
-A
AUU AUU AUU AUU AUU AUU AUU AUU AUU AUU AUU AUU AUU AUU
*
AA
AGAUACC AGAUACC
A G A U A C
A A A A A A A A A A A
G G G G G G G G G G G
A A A A A A A A A A A
U U U U U U U U U U U
A A A A A A A A A A A
C C C C C C C C C C C
63
Ms 68
Tc 72
Ta 73 Po 75
Ng
95
Pa
96
Ap 100
Ap
101
Oh
173
Cv 102 Ec 105 Mg 153
188 189 Sa 201 Si 203 Tt 207 Dr 208
Me
Mb
25
AUUAGAGACC U U AG
Zm222
AUU AUU AUU AUU AUU AU U
Hs 234 Rn 241 Rc 243
A A A A A A
A U U A G A U U A G A U U A G A
G A C C Os 223 U A C Gmn 225 U A C C m228 U ACC Ce 229 U U A G A U A C Ce 230 U U A G A U A C Cv 231
..
;; ;;;
C
I1
790
A--AA A--AA A--AA A--AA A--AA A--AA A--AA
C..U....... AA.C.-A- U AUC. A
I... I. . . . I....
A -AA -AA
'-
GA
u
.
I....
.§
G A GA G A
CA
C
...
.GG AA---AA A AA-AA
21
.A CA C GACA
C C C C
780 .
U U U U
A A A A
XA -AA GA--AA G A -AA GA--AA G A -AA G A -AA -AA GA GA--AA
22'-
gCC
.,
-
A AA A-
,.. 2............. ...........2......1.'..
Q C
A A U U A G A A U U A G A A U U A G A
770
. .
GAGAGA GAGAG A GG AG AA -G A-
....
AUUAGAUACM Hc A U U A G
A
W
.~U
G U U U
_
.x
2230
2240
A.... U............. -
.......
AA- UA C GA- U AC GA--CAGC G A- AAUC^:.. GA- AAAC G A- A AQ G.A- A A A
G G G G G G G
A A A A A A A
U U U U U U G
A A A A A A A
C C C C C C C
Sp 247
Pl 248 Gm 253 AUU Cr 255 AUU G A U A U Sc 256 AUU G A U A C Sc 258 AUU G A U A C An 260 AUU G A UA C .Sp 261 AUU G A U A C Pa 262 AU- -GGAA-- Lt269 AU- -GGAA-- Ls270
UA- AAA GA AG~ UA--AAA G A - - A G . G A-- A GG G A--AG... - C A U GACACCAU
I 2250
A A A A A A A C C A A A
2260
I Zz /u
2280
2330
2320
2310
2300
2290
.AU U............A.G....C.....C..A..A..A..GA.UG.CC..A.CC....CG
Hs
....... ............. .... .. ................ ......... ...
.........._ .-J ...- -t
"' -t
t'
(
I 2
2290
2300
2310
2320
2330
2340
Nucleic Acids Research, Vol. 18, Supplement 2287 2360
2350
I
.
-
.
.
.
.
.
I
.
.
.
.
.
.
2370
I
.
.
.
I
.
.
2390
2380
I
.
.
.
I
.
.
.
.
I
.
.
.
.
I
.
.
.
.
2400
I
.
.
.
.
.
I
.
.
.
.
I
CCc
Hs 1 Oc 7 ------------------------------------------CCUCAAA Ec 11 ------------------------------------------CCUCCGA Tm 12 ._______-__-----------CUUUUA Din 13 .---------------------GCUUUUA Le 16 ----------------------GCUUAUA At 19 ______----------------GCUCUAA Zp 20 C U U U U A Vc .---------------------CUUUUGA 22 GU U U U U A .____-----------------UCUUCGA Cv 23 .---------------------UUUUUG- Ne 24 Sc 27 Pc 30 -_______--------------CAUCUC Pp 33 .----------------------UUUCG Pf 51 .------------------UUAG Pf 52 P1 53 ---------------------GCAA . _----------------------AUCCC Ng 56 Ld 59 %>¢¢giCAG:~~~~~~~~~~~~~~~~~~~~~~~J __- ---------------------CGCGCGUCC Gi 62
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
u
-
-
-
-
-
-
-
-
u C
-
-
-
c c
-
-
-
-_
-
-
-
-
-
-
-
-
-
-_
-
-_
-
-
-
-
-
-
-
-_
-_
-
-_
27 -
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
--------------------------ACG- Kc -____________-------------GCG- Ms --------------------------UCG- Tc -------------------------GUUG- Ta --------------------------UGG- Po
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
--CUUGA --CUUGA --CAAGA - - CU U GA --AUUUA --CUUGA --CGAAU --UAAA- - A U CG A -
-
_
_
_
_
_
_
-
-
-
-
-
-
-
-
-
-
-_
-
-
-
-
-
-
-
-___--------------------ACGU
-
-
-
-
-
-
-
-
-
-
-
-
-_
----------------------AUAUU--.__________-----------AUAUU--_A _G .~~~~~------------GUUUA-
-
-
-
-_
-
-
-
-
-
-
-
-
-_
-_
-_
-
-_
-
-_
-_
-_
-_
-_
-_
-
-_
-
-_
-
-_
-
-_
_
_
_
__
-
-
-
-
-
-
__
__
-
__-
__-
__-
__-
_ _-
_ _-
_ _-
_ _-
__
__
_ _-
__
__
_ _-
_ _ -
_ _-
__-
__-
_
_
----------------------ACAAGGGCG
-_
.__
I
I
235,0
2360
I 2370
I
I 2380
75
Oh Me
96 100 101 102 105 153 173 188 189 201 203 207 208
-A- UC GA Zin222 -ACUC GA Os 223 GA Gi 225 - CG A A U- Cn228 -CGAA U- Ce 229 -ACUC A A Ce 230 AGGUU A A Cv 231
-
-
Ec
Mg
-- U C U Tt --GAUG- Dr
-
-
Pa Ap Ap Cv
- - A U U CC Sa --GAUAU Si -
63 68 72 73
Ng 95
U V C C Mb
-C
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
2390
2400
Hs 234 Rn 241 Rc 243 Sp 247
P1 248 Gm 253 Cr 255 Sc 256 Sc 258 An 260 Sp 261 Pa 262 Lt 269 Ls 270
2288 Nucleic Acids Research, Vol. 18, Supplement 2460
2450
2440
2430
2420
2410
lHs----~~~~~
--_______________ 70c 11lEc U----------------------~~~~~~~~ 12 Tm U-----------------------~~~~~~~ 16 Le G-----------------------------l9At G-------------------------------------------_____________________________--_- - -$. 20 51Hs---------------f Zp G-----------------------------22 Vc U 23Cv U _ 24 Ne________________ ________________ 27Sc 30 Pc U _ --______________ 33 Pp-
______
52Pf--_
_________________ 56Ng g.+.tC.; 59Ld GUAUCUUUUCUAUU.---------¢.:C-UC 62G1 --------------________________ 27'63 Hc 68Ms 72 Tc 73 Ta 75 Po
95 Ng 96 Pa 100 Ap 101 Ap 102 Cv A----------------- -- -- -- -- -- - --___-__-__--__-__-__-__-__--__-__-__-__-__-__--__-_-l05 Ec Iv L.%. 153 MgA-173 Oh --188Me C-189 Mb U U 201Sa AC203S1 ACA 207Tt --208 Dr - - -
222 Zm 2230s 225 Gm 228 Cm 229 Ce 230 Ce 231 Cv
234 241 243 247 248 253 255 256 258 260 261
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
__- _-
_-
_-
_-
_-
_-
-_
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
__-
_-
_-
_-
_-
_-
_-
_-
_-
-_
_-
_-
_-
_-
_-
_-
_-
_-
-_
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
__-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
-_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
-_
I.
_-
-_
-
-
C C ---CC---C C --------------
A A A U C
----
-
-
Hs Rn Rc
Sp P1 Gm Cr Sc Sc An
Sp 262 Pa 269 Lt 270 Ls
-_ -_ U A A G C
2
2410
2420
2430
2440
2450
2460
~I
.
2470 .
.
.
.
I
2480 .
.
.
.
.
.
-
-
-
-
.
2490 .
.
.
.
.
I
.
.
.
.
I
.
J-U C U UC
C C
u C UC
C
2500
.
.
C
G
I
.
.
.
2510
I
.
.
A
.
.
I
.
.
.
2520 .
.
.
.
I
.
.
.
.
I Hs
G A A A A-- A A A A OC: --A A
-
Oc Ec Tm Dm Le
-
-
-- AA -A ~A AA - -A A
gUA UA -A ttA
-
GAA
Pc 30
::ij
UGU A GAAAUCAC- GA G UUA * GAAGAAAACG--AA G AAAC--A JUU A A AA
UA
Pp 33
-N
Pf 51 Pf 52 Pl 53 Ng 56 Ld 59 GI 62
-AA
A GAAA
-UAA GAAAG U-UUG C UCA A............... A-:GGAAA uG-GGA AG
IUCAC 27'
-
28
28'
GUA- -q3 GGA GUA- - GI..-G A A G GAA- - ..- G A A G GUA- -. ...- G A A G GCAGGA A G QC-C
-
850
860
~~G
U
AG
20'
G GAAG G U
Hc 63 -C----C2.
-CA
G
Ms
G U U G U U
Ta 73 -----C
G U U
Po 75 -CG
880
1I.
....
U A A C t~VG U G A A~
A-
C~U A A C G CAAU AA*C
Ng 95 Pa 96 Ap 100
A
U A AC G U U A U A A C G C U U A A C ~VG U U A ~ A UA ACAA UU AAA*C U A A C A U U A ~U A $ ~UUA AC A A U AA U A A C GCA U U A A~
|,V
G U AC A
~~~G
G G A
Ap 101
Cv 102 Ec 105
~U
~G C AG A A
G G A G U A
Mg 153
-
A GAU AACt U
U A A C
28'm
28
*&
269
20'
GUAQVA UAACI;CGUUA GUA- -C UAACGCGUUA GUA--CUAACGCGUUA GUA- -C X-UAAC CaGUUA GUA- -.V.. UAAC.G..*GUUA GUA..-UAAC " GUGA GUA- -G.C -U A A C G U G A
I- .- .- -
A --A U C -A A C---A --A C C -U A C---A U U UACACC-- --
"C C A -
-Hs 234 ~ -- Rn 241
A A
AA--
U A A C
G U GAA A A- G A A A )[U.1 GA --.> -G A A A ..I G A A
-
-
-
-
-A.X. ....' it...>
-wlU -i. -t
(A-------)A (AA
.0 00 i 01..0
_0'
UA
x
:B
A
U A C C :..
.. A A A A
.. A A IU U A A A
...
A
G A
CCRc
- -
- -
G>U
-
G U
C A
G A A G G A A A
G G G U
CSc 256 CSc 258
C
-_
_-
_-
_-
_-
-_
_-
_-
_-
_-
2490
2500
--C --U ---- C
U A A A
A
An Pa 262 Lt 269 Ls 270
260 G..C Sp 261
- - - - - - -- - -- - -
2510
243
Sp 247 G A A P1 248 Gm 253 _ __-CfCr 255
__________
G A A A
__________________________
2480
C CA
A A ______
A-U
__________________________-
2470
Cm 228 Ce 229 Ce 230 Cv 231
-
- - - - -
-
Gm 225
-
A A
-
Zm 222 Os 223
-
........
.....
A-
201 203 Tt 207 Dr 208
-
27'
173 188 189
Si
-
A A
-
-
-
G C A-
__C
Oh Me Mb Sa
-
G C A-
GA A-
68
Tc 72 -C~
870
G C AGA A-
A
-26'-
870 *1..
*
12 13 16
Cv 23 Ne 24 Sc 27
--AA ~AA
GAAA GAA AA
1 7 11
At 19 Zp 20 Vc 22
GAAA.G --AA
.
eU A
I
.
.
A--A A A- -AA
G A A A U.... GG A AA A AU - G A A A
.....
U U A -
UUA
.
G A A A G A A AC.(
U UAC U A U JUA
.__
-
.
Nucleic Acids Research, Vol. 18, Supplement 2289
2520
~.
2290 Nucleic Acids Research, Vol. 18, Supplement .
1
.
Hs
.
.
.
2540
2530 .
.
.
.
G-CA-A
G-CA-A G-CA-A G-CA-A G-CA-A G-CA-A
GG
IGG
Om
iGG iGG
G-CA
GG
G-CA-A
G-CA -A G-CA -A G- C A - A G-CA-A G-CA-A G-CA-A G-CA-A G- C A - A G-CA-A G-CA-A G-CA-A
GG
GG
GG
iGG iGG iGC GC
GC
iGG iGG iGG
63Hc
29 GGGGG-
...
.
68 Ms
vmUG
....J...
C.U-x:G
75 Po
I
.
.
105 Ec 153 Mg 173 Oh 188 Me 189 Mb 201 Sa 203 207 Tt 208 Dr
S1
222 Zm
225 Gm
.. m
228 Cm 229
Ce
231
Ce
Hs Rn Rc
V(g)G
A
,
A
256
Sc P
....
261
Sp
270
Ls
-
-
-
-
-
C
A
C
U
-
A
A
C
U
-
A
A
..
...
A
C -C A....
C - A-:" C -CA
U
c -CUA,,. ....
C
U
[.U..
G
C
30
2'
29'
-
A
C
GA.-x
A
C
U
A
C
.
...
C-CA.
C-CA.
A
A A
A
U
UU
X
G
m
A
U
A U
U.. ..
920
910 I
.
.
.
.
A-A A-A A-A A-A A-A A-A A-A A-A A-A A-A A-A
.
.
.
.
.
:AAA
C
C
-A -U -
A
-
-
A
A
C
-
A
:Ct
A
.U - - G..U.
- - - - - - - - U-
-
U A C A
A..
-
-
C
A.
-
- C A
-
-
C A
-
-
CA'
-
-CA0,'. A.
31
--c
A
c
A::
A.e. At:: A:
--c
A
A
A
C
U
.....
>
.
G
-
C
A
-
A
A
A
A--.-.A
C
.
-
-G
-
-
G
-
C A
-
A
- A
G£A
A
A
...
A.
ACC
.A... A A
U U A A A A ACUC A A A A -A AC C- C C AAt^AC A A
U
A
C U
-
A
AXG %A
-
C C
.AA A C U A C
. .U.C
A
GAC
-
U A U C A U C
C A A
C C
A
CA
-C G--
C
2540
2550
2560
U
-
-
GA
-
-
A A
- -C
u CAAU '.AAA --C
2530
C
A
AA AA-:A: U--G~~~~~~~~~~~~~~~~~~~~~~~~~ A G A A .......~~~~~~~~~~~~~~~~~~~:. .......A ..............~G C A
-W-
-
A
:
....
-
A.
(G-
. .A
-
c --c --c --c
A A~A
A
A
-
30
A AA
A
.
- -C
2'
A A
A
.
A.:.:. -A-:-.
u u u u u u u
AA0A
- A
-UC A.:
930 .
.
.. U - A C - A A....A U A A A A C U C A A ....
.-......
G) J.:AAAAiGs AGUA
A
31
.....................
.
Aq C U'A ~ A GA f -
-CAG. -CA^..
C
AA AA ,AA ,AA AA 'AAI AA IAAA AAA AA ,AAA AAA
A
A.
C -C
A U CA
GA A
A -A A -A
CA
A.
C
A U AA UU
A.A.A
A A
A-A
-C
-
C -C A....
............
A
G
GA A A
C
A U
A A
A A A
..... ..........
-
U
-
m
A A
C A":
-
C -C A.
A U
A A
...
...
-
Sc
-
C C C C C C C C C C C C C C
C
A U A UU A U
AmG
A A
q
s
-
247P1 CpU-.-A-..G~-A-253 2 258
-
G G
A
A
C
-
.
A
G ~(A)A... $...
Gm
A A A A A A
S.1.. A.... A A A... A A. ..... . A.A A A A .C...
-
.
A U A A A&.A 4.- A U
U.
-
.
A AG4 A U A A. A U A ANA A U A A U A t A U A A U
29'
..........................
243
A
.
.
A A
..
G
..
241
-
. . .
A A A A A A
29
G
. .. . . .. ...
234
.
2580
2570
2560
.
AA GGJ
G
.
A A A A A
.
.
G G G G G G G G G G G G G G
20'
223 Os
.
G-G-G-G-G-G-G-G-G-G-G-G-G-G--
95 Ng 96 Pa 100 Ap 101 Ap 102 Cv
C C C C C
900
890
.
-A
G-CA-A
iGG iGG
20'
2550 . . .
.
.
GG
Le At Zp Vc Cv Ne Sc Pc Pp Pf Pf P1 Ng Ld Gl
72 Tc 73 Ta
.
GG
7 Oc 11 Ec 12 Tm
13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62
.
A..
A
2570
2580
.i ; ~. 31 .
.
Nucleic Acids Research, Vol. 18,
2590
.
.
.
.
2600
.
.
G G G G G
G G G G G G G G G G G G G G G
.
.
.
.
.
2610 .
G G
.
.
I
.
.
.
.
I
.
.
.
.
I
2620 .
G
U U A A tYU U 4 A C U C A A C A UA U A A U U:%G A C U C A A C A U.. U A A -............. U C A
G G G G G G
U U U U U U U U U U U
G G G G G
G G
U U U U U U
G G G G
U U U U U U U U U U U
U U U U U C
.
I
.
.
A A 1tUG A C U C A A C A A A U. U. A C U C A A C A A A U.tG A C U C A A C A A A U ..:4. A C U C A A C A A A II.%.G A C U C A A C A A A tflIIIGACUCAA.C....A.C.A y*G. A A UIJ . A C U C A A C A A A $.f....... A C U C A A C A A A t U A C U C A A C A A A 1 A C U C A A C A A A I. A CU C A A C A A A UU.AG A C U C A A C A A A *.... A C U C A A C A A A U.. A C U C A A C A A A U.'.U A C U C A A C A A A . A C U C A A C A A A A C U C A A C G
.
A A A A A
A A A A A
C C C C A A A A A A A A A A A A A A C C
A A A A A
G A A G G G A A A G A G A A G G C
..
2630
.I
.
A A A A A A A A A A A A A
A A A A A A A A A A A A A A
-
2640
C C UU CC A~ A CU U Am CC UU C AA C U UU A C U U A C C U U A . C U U A C U U A ... C U U A C U U A . C U C A ¢. C U C A C U C A f C U C A C U C A C U C A U U U A ¢. C U C A ¢
Hs Oc ~~ G(~~CCG G A Ec C( G G AA Tm Dm Le g A G A At ~~~~C Zp C A G A Vc Cv Ne Sc Pc ~A~G ~)CCG GA Pp ~~~~ G A Pf ~~~~ ~~A G A Pf Pl A
32-
_33
~~A
**A ~~ GW~~C
'
33'
.9. rA
O(A)A ~~~~~ ~ G G A~~~~~~ C(A)A C~~ G A
G
G
A~~
~~
U~
A J U U A A UIJ G~~~~~~~~~~~ G~A U
C~
GU
U
U
A
A
IU
A
G
U
C A A C G
U
C
A
ACG
A C
34
-
A Hc 63 ~~~~~~~~~~~~G
GA ACCUCAAMA6 C A Tc 7
AA CG
U
1 7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 G A Ng 53 56 Ld 59 GG A Gl 62
~~CG G A
gX
:............... ...
C ( A ) A ~ % ~ U G
U C U
A
G
A
A
U C
A
C
UUA
C
C
A
%
C
G
~~~~~~~~~..GG A A U C U U AG A
940
950 .
........... .....
,. .
.
Supplement 2291
.
.
.
960 .
.
.
.
.
970 .
.
.
.
.
.
.
.
980 .
.
.
.
.
.
.
.
.
.
.
I
Ta
Po
7
75
990 .
.
.
.
.............................
..
..i
.
......
,UUAA IUUAA
iUUAA
C(A)A- ~C1~,U G G
.G% UG UG C(A)A -H~UG C(A) * ~-~~)G
G G G G C(A)A~C~U G G A)A~C ~ ~G G C(A)A-~CGQCG G C(A)A-~C~LJG G
IUUAA IUUAA UUAA UUAA UUAA UUAA UUAA UUAA UUAA UUAA
C(A)A- GC%~UG G
31
.C fA:
A-R..
-
AUGCAACG AAGCAACG AUGCAACG AUGCAACG AUGCAACG AUGCAACG ACGGUACA AAGGUACC ACGCAACG AUGCAACG ACGCAACG
iUUAA
X A~t~UGG
ACGCAACGG AAGCAACG AAGCAACG
32
33
U U U U
G G G
G
G G G G
C ( A ) A
A A A A
A A A A
95 Pa 96 UUGA Ap 100 Ap 101 UUUGA ,UUGA Cv 102 ,'UUGA Ec 105
Mg 153 UUUGA UUGA Oh 173 UUGA Me 188
UUGA UUGA UUGA UUGA UUGA
-34 A A
A
C C U U A
GGA A C CU U A
A AUG C A A C G . AGA AAUG C A A C G GA AUG C A A C G G A At.-.AUG C A A C G GGAGA
U A A U A A
iUUGA Ng UUGA
UA UA UA UA UA UA UA CA UA UA UA UA UA UA
33, . AUG C A AG G GAUG C A A AGGCQ A C A A AG AUG
U A A
G G
GAACCU GAACCU AAACCU GAACCU GAACCU GAACCU AAACCU AAACCU GAACCU GAACCU GAACCU GAACCU GAACCU GAACCU
A C C U U A A C C U U Ai A C C
U U A
A CC U U A A C C U U A
Mb 189 Sa 201 SI 203 Tt 207 Dr 208
_ UUGA UUGA UUGA UUGA UUGA UUGA UUGA
Zm 222 Os 223 Gm 225 Cm 228 Ce 229 Ce 230 Cv 231
G.. U ..
,..
U A A U A G
G G A.
G
GG4 u*.2A C(A GG A %XA t . ... .. - C Cc Ccj.. ..W ..... G. AGA .C.
,.A....
A U A
-C
-..U C C
C*.,
.'¢
..1. .,Q
*.
U(A,) -..-..G -.C.- .
v*@ *v
GGA. G A` GGi
U
A
GA
U A A A
U
U.MUAA.(U.Y
GGA^. GGA.
U
G A
U C C C
r..
C
..
.
C
A C C UC A U A C C
U U
A
CAJA U A C C C G A
U U A A
A Ay X4U GG G A
AAiA_
.A
*
U A C G
U G
A U C C G
U A A
U C C G
A U GA C C C A
-_
2600
A U U
C A A A A U C U U A
GAUAAUCCACUAAUCC)AAACUUUA
.A
U U A A
U U A A
C.A
A U A A U C C A
U A
GAA A
2590
A U A A A C C C A U A A A C C C
U G A G8,..,a,',CR A U A A C C C A G-S A .C C A C C U C A U G A A-1.:A U A A U C C A rW AA C A C C U C A U U A A U(,t) A U A C A A C G A A A C C U U A
AG...
_________U
.-...W....... ..K)C
_-
2610
_-
_-
A A C C U U A
C..A.A A _-
2620
A A C C U U A A A U C U
U A
A A C C U U A
U
U G C Hs 234 Rn 241
*C(W.ZQ...Q..W U C G C G C u
UUG
U
~~A(A)~~~~IJU
U G U
P
(~~A~~GC>CC~U
U G A
Gm 253
U U U U G
gJU
UUG
CAC&~~~~*I~U
U G A
2630
248
Cr 255 SC
Sc258 An
260
U U A A
Sp 261
U U G A
Pa
-------------------
__- _- _-
Rc 243
Sp 247
2640
262
Lt 269 Ls 270
2292 Nucleic Acids Research, Vol. 18, Supplement 2650 .
.
.
.
I
.
.
.
.
2670
2660
I
.
.
.
I
.
.
.
.
.I
.
.
.
I.
.
.
2690
2680
I.
.
.
.
.
I.
.
.
.
I
.
.
.
.
.
I
.
.
.
.
2700
I
.
.
.
.
.I
UUGAG----------AGC UUGAG----------AGC UUGAG----------AGC
AUUGAG----------AGU
UUGAG----------AGC CUGAA----------AGA UUAAU----------AGC UUAAU----------AGC UUAAU----------AGC UUAAU----------AGC UUGAG----------UGU 'CGGGC-----------GC
56Ng
59 Ld 62 Gl
35,
35 GrI AA UII - - - GAA---------GAA---------C U GAC---------C U GAC----------
63 Hc 68 Ms 72 Tc 73 Ta 75 Po
-
1000
.I
95 Ng COAU~4 AG$G -A G 96 Pa lOO Ap COAU~1 lOl Ap COA044A
I.
I
. . .. .
COA~ COA* ~4 0 --GAA COAU COA~~U V. --AAA W. 188 Me OA~3 --CUA 189 Mb COAU.~4 G --ACG --AAA 201 Sa COA*.~C
4..
203 Si 0 A 207 Tt COA~1A 208 Dr COA~*C~
I'_
35
222 Zm COA~C 223 Os COA. 225 Gmn OA 228mOAnUt A
- - -
--AAA --A A --A A.
44 AA
- - -A AU
- - - - - - - - - - - - - - - - - - - - - - - - - - -
U U
- - - -
~
260 An
-A
261 Sp
i
-U
-
~
U
A
-
- - - -
. . M- -
t----GUG-
G A
-
-A AA
-
GC------------
UGG-xBCO-------
A;-- - -GGUG - Q--------
GA
A A
-AA-AA
A
- - - - - - - - - - - - - - - -
_-
_-
_-
-_
_-
_-
-
- - - - - - - - - - - - - - - - U
WO
G U A A A A A A A U G C U A U C U A C U A C C A U C A C U
-
-
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
2660
-_
-_
2 2670U
_-
U G'4 A C -~~~A
U U A
G
CU--
-
- - - - - - - - - - - - - - - -
U'. -AA
- - - - - - - - - - - -
-
UUUUG AGA_.. -. -. ............................. ---GUU-.......-
- - - - - - - - -
U
- - - - - - - - - - - - -
P35-2 _- --G
3
-AAGA
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
2650
-
-~~~~ --t-; A GU U u>x.C-u
- - - - -
. A U A
-
U U
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
A.A
258 Sc
. I.
.--GUG-..U--------
G A
GAf~UCAGA
---UUULLCCCU.
- - A U U - - U U A U - - A A C A C - - A A A U 253 Gm A I"k::X..Am. 0.C A A C A A A 255 Or 255 Cr U A - - - - - - - - - - - - - _ ....-U A 256 Sc
-
-
-P35-1'-
AUCCtiCt
- - - A AU
-
----G U G ----G U G _
A
- - -A
-
----GUG-QCU--------GU G-u - -- -- -- -- -- -- ----GUG-CtU------- - - - - - - - - - - - -
-P35-1
-
2290Ce COAU&4& 2300Ce COAU~4C1 2310Cv COALGfAU
U U A
. .
GA--GA GA--GA GA--AA GA--GA GA--GA GA--GA GA--AA GA--AA GA--GA GA--GA GA--GA GA--GA GA--AA GA--GA
--A A --A A C --A A --A A --AAC --A A
1020Cv lOS Ec 153 Mg 1730Oh
-
1020
1010
I.......
262 Pa 269 Lt 270 Ls
I
.
AUUGAG----------AGC
20 Zp 22 Vc 23 Cv 24 Ne 27 Sc 30Pc 33 Pp 51 Pf 52 Pf 53 P1
Hs Rn Rc Sp P1
.
UUGAU ------AGC UUGAU----------AGC UUAAG----------AGC UUGAG----------AGC UUGAU----------AGC OCUGAG----------AGC OCUGAG----------AGC
1 Hs 7Oc 11 Ec 12 Tm 13 Dm 16 Le 19 At
234 241 243 247 248
.
UAUUUAA-AGUAGO AAAGGUAA U U
- - - - - - - -
-___-
-_
_-
_-
-_
_-
_-
-_
_-
_-
-_
_-
_-
-_
_-
-_
_-
-_
_-
G GC GC G GG GC GG C
2680
2690
-_
_-
-_
_-
2700
-
:ags.~ .
Nucleic Acids Research, Vol. 18, Supplement 2293 2720
2710
.
.
.
.
.
.
.
.
.
I
.
.
.
.
I
2730 .
.
.
I
.
.
.
.
.
I
2750
2740 .
.
.
.
I
.
.
.
.
I
.
.
.
.
I
I
.
..
......... ..........
-
-
-
-_
_-
_-
_-
-
-
-
-
-
-
-
-
_-
-
-
-
-
-_
-_
1
Pp Pf Pf P1 Ng Ld Gl
7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62
Hc Ms Tc Ta Po
63 68 72 73 75
Pc ........................I... _-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
%
......
__-_- _- _- _- _- _- _-
-_ __--_ -
-_
Hs
Oc Ec Tm Dm Le At Zp Vc Cv Ne Sc
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
__-_- _- _- _- _- _- _-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
__-_- _- _- _- _- _- _-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
__-_- _- _- _- _- _- _-
.... *.,R~ ~ -.. '. '. ~ ~ R --.:.' ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~.. -:.R; ...-R R __.........
__~~~~~~~~. ......
....... __.. __......
*...*....*.*...*.
.....
-
-
-
-
-
-
-
- -
-
-
-
-
-
- -
-
-
-
-
-
-
-
-
-
-
-
- -
-
-
-
-
-
-
-
-
-
-
-
-
- - -
-
Ng 95 Pa 96 Ap 100 Ap 101 Cv 102 Ec 105
Oh Me Mb Sa Si Tt Dr
153 173 188 189 201 203 207 208
Zm Os Gm Cm Ce Ce Cv
222 223 225 228 229 230 231
Mg
Hs 234 241 .~~~~~~~~~~~~~~~~~~~~~~~~~~~Rn 243 .~~~~~~~~~~~~~~~~~~~~~~~~~~Rc 247 .~~~~~~~~~~~~~~~~~~~~~~~~~~Sp 1 248 .~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 253 .~~~~~~~~~~~~~~~~~~~~~~~~~~Gn 255 .~~~~~~~~~~~~~~~~~~~~~~~~~~Cr .~~~~~~~~~~~~~~~~~~~~~~~~~~Sc 2S6 258 .~~~~~~~~~~~~~~~~~~~~~~~~~~Sc 260 .~~~~~~~~~~~~~~~~~~~~~~~~~~An 261 .~~~~~~~~~~~~~~~~~~~~~~~~~~~Sp
G A A A G C U C C
U CC UCG C UC UA A AU
UUA A GU GU UA G GC GC A AG CUCU AA G AU
A U AU
A GC U GG
Pa 262
269 .~~~~~~~~~~~~~~~~~~~~~~~~~~Lt 270 .~~~~~~~~~~~~~~~~~~~~~~~~~~Ls
I
2710
2720
2730
2740
2750
2760
.
2294 Nucleic Acids Research, Vol. 18, Supplement
1 Hs 7 Oc 11 Ec 12 Tm 13 Dm 16 Le 19 At
20 22 23 24 27 30 33 51 52 53 56 59 62
Zp Vc
Cv Ne
Sc Pc Pp Pf Pf
P1
Ng Ld
Gl
2770 .
.
.
.
2780
.
.
.
.
.
.
95 96 100 101 102 105 153 173 188 189 201 203 207
Hc Ms Tc Ta Po
Ng
Pa Ap Ap Cv Ec Mg Oh Me Mb Sa Sl Tt 208 Dr
.
2790
.
.
.
.
.
.
.
.
2800 .
.
.
.
.
.
.
2810
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
..
................................ ...
..
:.:.: .......
.......... ......
'41 -.-VM
:.-,
-
-
-
-
........... '"l,
GAGGG--GG - - G G--GA-AGG--GA-GAGGA-GAG-
G-__
-
-
-
-
-
-
-
-
-
-
-
-
-
...............
-
A A--G G---
-
.................
..............
.............
35' -
-
-
-
_
- - - - - - - - - - - - - - - - - - - - - - -
A- - - A-- -A----
-A-- --
A-
1030
1040 . ..
-
- -
U
-
-
-
U
C G C G
U
C G
-
-
-
-
-
-
-
-
-
- -
AG-G A A - -
G
C C
G GG A
A
A A
G G
A
-
-
- -
-
Zm Os Gm Cm Ce Ce Cv
-
A
:i
-
-
U
G
A
A A
U
A..
~ ....A
A G G A
U
U
G A A
U U G U :G -C1IG*JmUS U U G U G. - - - U U G U 1 - - - - G A AG-G - C G G. A A A - -
-
-
-
-
-
-
-
-
-
-
-
-
-
-
U C G U C G
35, A A
GG
A
- - - - - - - A A - - - - - - -A A U
A
A -
A
G' - -
G
C G G G
- -
KAG
AG G
G .
P35-2' 222 223 225 228 229 230 231
2820 .
.
GG - - GG - - GG - - GG - - GG - - GG - - -
....................
63 68 72 73 75
.
J
.OG
U
A A
A A U
-
-
- -
A A
U
U............................, - - U --
- - - - - A .A
234 Hs - 241 Rn - 243 Rc - 247 Sp
G - -
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
248P 1
247 Sp - - - -- -- -- -- -- -- -- -- -- -- -- -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 253 -l - - - - - - - - - - - - - - - - - - - - - - - - - - - - 248 Pl---c - - - -...-- -.. -.B.g 256 Sc - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -.............. 255 CrGm
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
.
.
.
.
.
-
-
- - - - - - -
-
262
269 Lt 270 Ls
-
- - - - - - - - - - - - - - - - - - - - - - -- - - -- - - - - - - -
2770
2780
2790
-
-
-
-
-
-
-
- - - - - - - - -
2800
- - - - - - - - - - - - - - - -
2810
2820
2830 .
.
.
.
G~.
2840 .
.
Nucleic Acids Research, Vol. 18, Supplement 2295
.
.
.
I
2850
2860
2870
2880 .................
CG..
..........
A G -C G A ............U UG GAG-CGAUU*J~~~~~~~~G-MUGGUU
CT~~WU A ~~UU GG- U ~~-~
~~~GAG C G A U U
A
A
G
UU
U U O C 4 UU UUA4~~~~~~~~~~~~~UU~~........ .....U.'.iG G -UCG A U.-'.-'U U
A~
C G GAU-UG A H A LU GA U UA - U G ..A..U.G..&AA.A-UGA..U. C.....
CUU
...........
UAGLCGLA ........ CA
C A
.
1060 ..
38
...G - U GG A G -U
....
G(
-CAA
-CA~V
R -.
R
()
ACA -CA
--&G()
-ACA C A G(C Y. - C A GC . - C A U.¢*
............B¢G
GRU A 1080
..
19
Zp 20
Ld GI
22 23 24 27 30 33 51 52 53 56 59 62
Hc Ms Tc Ta
63 68 72 73
Vc Cv Ne Sc Pc
Pp
Pf Pf
P1
Ng
.
.
.
.
.
.
.
.
U
uC.G
U :A U
UG::UA.
ul-,a.::A'
uCCG..G.. Po 75
1
.
1.
Zm Os Gm % U U A AG U AAUU ¢ U U A A G U .C U G Cm A U C `.. U U A A G U'. U G Ce Ce GG U U A A G U C Cv GU U A A G U C.C"
222 223 225 228 229 230 231
-U G A -U G A
U G A U G A
-U G A -U G A
- 38' A A A A A A G A G A G A GA
Ng Pa Ap Ap
Oh Me Mb Sa Sl Tt Dr
Gs A
U G A
G- U G- U G -U U - U U-U G- U U- U
AG AG A G A G AG
Mg
G'A
-38
A A A A A
95 96 100 101 102 105 153 173 188 189 201 203 207 208
A
-U G A A U G U
U U U U U
U U AA G U U UAAG U U U AAG U U U A AG U U U AAG U U U AAG U U U AAG U U U AA G U U UAAGU U UAA G U UUAAG U UUAAG U UUAAGU U U A AG U
U G A U G
U U U U U
1090
1
-
U G A
37
ACAG()
7 11 12 13 16
39'
AU
U G A
... .........
39
t- ¢ - .A .
U(A
1070
-CAGU
-
-38'-
C..
*.I... I..
I.
U.U U
Hs Oc Ec Tm Dm Le At
................ ................
* -s a *- iB
1050
U U
U G CU
UCGU G-GA~C 4UCU
C A
I ..
-
O G -U GA
UG
37
GA GA G A GA GA
-
AUA
AUUCUG U U G- UGAUCU U% AU U U
~U
CCA
36
U
A AAUU A A U U QAG A.......... U A U U A A A U U C. ... A A A U U 1:..G... A A A UU U A A U UCCG U A A U U~...U.....U G A U U CCG G G A UU G G G A U U G G A A U U G U A A U U A U A U U C A A A U U A A A U U CCGA A A U U A A A U U :A.. A G A U U U.CU C A U U A .
Cv Ec
39'-
39
G'G~ UU U A A G U
t--C-U
M A-C¢ GAUGA ;C UA A --A A -- - Hs234 ...U.CAGAAA-C AA-AA C U A -AA - Rn 241 - G A A AQ%UUgA -C -(A)UtG - UG AA(CGC U UG ---C A GU ----Rc 243
A C AG X()
C A U CC AA G Cf1 _(_) _C_ G U s )A A C A G(t)U U.
U A . ji .U.Wl U U- A U - -- -- -- - P1 248 ij..C -GUAAG G .A.. AGA. U U.. U C U) U Gm C.. 253 WK.~~*tU U A GUUAA-U e~dtC G - U G.A U QG U CUU UAA AAG 260 U A CQU*A ()GUUAAUUC~~)S A AGAcU A A GI UU iC A- )C - UCG 6 U CG --UC AGA A AGA G~U(U)U -A AAG UG -O C SAnR Cr 256 A UG - C A A A ':G C U(A .Lt269~~~~ A A A URn.VC UU C A A A S 258 ... ..
- -
2830
2840
---U C A
G.. U.GA..........C
.......
...
....
-
-
.~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a.--------
-
- -U
- U
G -U
AA- C--
GU
-
- - - - - - - - - -
I 2850
2860
ls248
2870
-
2880
as 267
.
~A
2296 Nucleic Acids Research, Vol. 18, Supplement
1 7 11 12 13 16 19 20 22 23 24 27
30 33 51 52 53 56 59 62
Hs Oc Ec Tm Dmn Le At Zp Vc Cv Ne Sc Pc Pp Pf Pf P1 Ng Ld Gl
.
.
.
.
.
.
.
A A C G A t. G A G A A C G A...AC. t GAG A A C G AA G A G A A C G AAPF.' G A G A A C GA G A G A A A C G A GAG A A A C G A G AG A A C G AA G AG A A C G A..A..C G A G A A C G A. A G A G - - A A C G A-'-. G A G - - A A C G A.AC GAG U - - A A C G GAG A U - - A A C B.k S.C. GA G
U U U U U U U U U U U U
-
-
U U U C C
-
-
A A A A A
A A A A A
C C U U C
-
-
A A A A A
A A A A A
...
C A... G A G C GRACG A G G A G C A AG C C GA G A G
2920 .
.
.
.
.
.
.
.
2930
.
.
.
.
.
.
.
.
2940 .
.
.
.
.
.
.
-
-
.
.
1110
95 Ng C--AAC 96 Pa U--AAC C--AAC C--AAC U--AAC C--AAC C- -AAC C--AAC C--AAC C--AAC C--AAC S1 C--AAC Tt C--AAC Dr C--AAC
G G G G G G G G G G G G G G
.1 ...
*.
C
A
..
--.-.. ..
..t..t....
A C
-
C .. AGG.
A.
A--....
1120
C C C C C C C C C C C C C C
A A A A A A A A A A A A A A
-
-
-
-
A A A - A - A - A - A - A - A - A -A - A - A - A
- - - - - - - - - -
- - - - - - - - - - - - - - -
- - -
- - - - - - - - - -
UU U
-
C C U C
- - - - -
U C U A
-
-
-
G - - - - - G C U - - - G C - - - - -
1130 .
.
.
.
.
.
.
.
.
U U C G
C C C C C C C C C C C C C C
- - - - - - - - - - U C-----------
.4
U
C-
A. A A U
-
-
-
-
-
-
-
-
-
-
- - - - - -
C -C-
cC
rgG U A
-
-
-
-
-
-
- - - - -
GU A A
-
-
-
-
-
-
-
-
-
-
-
-
-
- - - - - - - - -
.G C C C U U C G G G G--> G C C C U U C G G G G--U U C40
37,
..........
41
440
A
~
. . . . I . . .. Ap Ap Cv Ec Mg Oh Me Mb Sa
.
C AA G A GG C G A G __----A- C G AAC GAG _-_---A- C G " G A G - _---A- G A G _-_---A- C G A
11 100
100 101 102 105 153 173 188 189 201 203 207 208
.
__----A- _---A- _---A- __----A- __----A- -__---A--- - -A- - _---A- -
-
37,
Hc Ms Tc Ta Po
.
.
_---A- _---A- __----A- -
-
.-.... *....-.........
63 68 72 73 75
2910
2900
2890
.
41
*
222 223 225 228 229 230 231
Zm Os Gm Cm Ce Ce Cv
C--AAC GA£ C--AAC mA~C C--AAC C--AAC C--AAC C--AAC &4~ C--AAC
234 241 243 247 248 253 255 256 258 260
Hs Rn Rc
Gm Cr Sc Sc An
261
Sp
262 Pa
-- -- -- -- -G C A A G - - U - A C C C .....AA U........ -------A C A - U U A A U CCG A U A A C AC.t U--------------U U A - - - - C C UA G G - G A A A A U-~~~~~~~ U --A A C .A"G. G A A - - A C C...C....LfiCGS - U . .. . U G A U. G C C - - - - - G.U--------: A_ U _A U A A A C ___A A A - U:W,:.:_A-:-i¢ -h:is :-iLU -:-:: -4 U:: ___A_ G_ _A _U A A A C AA. A A A - U U-- A A CG A A - A A 2e.U C A U U G C - A U U U g-WU~ U U A A G A A- - A G t --U --- U U G G G A A U AA G A A - - A A C G U - A A A
269 Lt 270 Ls
- - - - - - - - - - - -
Sp P1
-
-
-
-
--
-
-
G G G G
C C C C
A
- C A - .....G G C A - G C A
A A . A
-
-
-
-
-
A A A A A A A
U A U G-------
C C C C C C C
CAU U- -
- - - - -
A UA A
A A U U U Ai..
.
....................
- A A - A A G G - GG
-
I.
-
-
_
_
_
-
A
A
C
G
U
- - - - G A
C.
- - - - - - - - - -
C..U.g
A A G G A G
U
*
i
:
4
1
.l.U........ A.
-------
-.
U
AU
U _A U
- - - - - -
A _
_
__-
_
_
_-
U
-
- - - - --- - --
2890
2900
2910
2920
2930
2940
.
G.
.
Nucleic Acids Research, Vol. 18, Supplement 2297 2950
I
.
.
.
.
2960
I
.
.
.
.
.
.
-.
.
%A
U IJ
2970 .
.
-C %,
.
.
.
.
.
.
2990
2980 .
.
.
.
.
I
.
.
.
I
.
.
.
.
I
.
.
.
3000 .
.
.
.
.
.
.
.
- - - %.
J .)G........*.. G ............... (C.)1.. .. :..A..(. A ..X .. A.A.A. ..............
.
.......
U U A U U
-
-_
------------------------------------------
----------------------------------__------
----------------------------------__-----------------------------------------------
..G.)..A..
UC
G
-_
U0A h.A A A A A A
A UA
A
AAAAAAU
UU UU
A.AA AC .A A U AA
1-
-C... G G A A CA -C .. .....IU U U --.. .A.... U____ 1-~~C
G A U A G C A A A
-.%
u A IIu II u A
W--.--W
M..v
----------------
.... ..
Hs Oc Ec Tm Dm Le At
- - -- -
Vc Cv Ne
. ... U U ----
Pf Pf
Sc _- Pc pPp
-_
.%.4A.Af:^{|)#::|::Ui A G U U U
---- ----
-- Zp
------------------------------------------
UA. U AA-AUUQAAUAUA&GUU) OCaU
.
----------------
----------------
P1
Ng
Ld
GI
1 7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62
41
-
-
- -
.
Hlc 63 Ms Tc Ta Po
- - - - - - - - - ------------------------
Ng 95
-_ __
Ap 100 Ap 101
__
__--_
- - -- - - - - - - - -- - - - - - -
Pa 96
Cv . ---Ec . -------___________ _-.. Mg .---- - -Oh . Me - - - - - - --- - . -._ Mb - ---- - - - - -- - - - - - -- - - - -- Sa . Si - - - - - - - - - - - - - - - - - - - - - - - Tt - - - - - - - - - - - - - - - - - - - - - - -Dr
-
-
68 72 73 75
102 105 153 173 188 189 201 203 207 208
41 -_
_-
_-
_-
__-
-
-.K. .........
Zm 222 Os 223 Gm 225 Cm 228 Ce 229 Ce 230 Cv 231
-
-
-
-
-
- - .......
.......
>...
. .
P41-1
.~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 241 .~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
243 .~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 247 .~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ - - -_ S s 2356 .__ _P1248 GCC AA A GW.X..u...UU G G. AACCGA ACCG.... AA... CG A G.G AGA C AA ...U GACC> C G C UA C UA-A -m5 __
-_
- - - - - -- - - - - - - - -- -- - -
Rc 243
247 Sp - - - - - - - - - - - - -.Pa262~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~...... - - - - - - - - - - - - .Sc_________258_____241 Scm269 .Lt~~~~~~~~-: - - - - - - - - - - - - - - - - - - - - - - - - - - - - - G-,CGAGGAGCCGAG'. ........ PLs 248
-_-___-__-__-__-__-__-__-__-_-- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -_
_
......L... .......7 0.:....... _
_
_
_
_
_
_
_
...
2950
261 .~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
2960
2970
2980
2990
3000
Nucleic Acids Research,
2298
1
Hs
O
18, Supplement 3040
3030
3020
3010
7
Vol.
3050
-
c
11
Ec
-
12
Tm
-
13
Dm
-
16
Le
-
19
At
20 Zp
-
2 2 Vc
-
23 Cv
-
24
Ne
-
27
Sc
-
30
Pc
-
33
Pp
-
51
Pf
U
52
Pf
U
53 P
U
~
I
56
Ng
-
59
Ld
-
U
U
A
U
U
-
A
U
U
-
U A U C.--
62 G 1
63 Hc 68 Ms 72 Tc 73 Ta 75 Po
95 96 100 101 102 105 153 173 1 88 189 201 203 207 208
Ng Pa Ap Ap
222 223 225 228 229 230 231
Zm Os
Cv Ec
Mg
Oh Me Mb Sa SI Tt Dr
Gmn
Cm Ce Ce Cv P41-l'
234
Hs.--
241
Rn.--
243 247
Rc.--
Sp.--
248
PI
253
Gm
-A
C
A
G
G
U A G C U G U
Ox. ''O:
..........
C A Q
G C C G G C
255 Cr.--
256
Sc.--
258 Sc.--
260 An.-261
Sp.--
262 269
Pa.-Lt.--
270
Ls.--
3010
3020
3030
~P41-2
3060
3040
3050
3060
Nucleic Acids Research, Vol. 18, Supplement 2299 .
.
.
I
3090
3080
3070 .
.
.
.
.
.
I
.
.
.
.
.
.
.
.
.
.
.
.
I
3110
3100 .
.
.
.
I
.
.
.
.
I
.
.
.
.
I
.
.
.
I
.
3120 .
.
.
.
.
.
.
.
I
Hs 1 Oc 7 Ec 11 Tm 12 Dm 13 Le 16 At 19 Zp 20 Vc 22 Cv 23 Ne 24 Sc 27 Pc 30 Pp 33 Pf 51 Pf 52 P1 53 Ng 56 Ld 59 Gl 62
-
-
Hc Ms Tc Ta Po
63 68 72 73 75
95 96 100 101 Cv 102 Ec 105 Mg 153 Oh 173
Ng Pa Ap Ap -
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-_
Me 188 Mb 189 Sa 201 S1 203 Tt 207 Dr 208
- - - - - - - - - - - - - - - -
-
- - - - - - - - - - - - - - - - - - - - -
Zm 222 Os 223 Gm 225 Cm 228 Ce 229 Ce 230 Cv 231
P41-2 --- - ----------------------- - ---------- Hs 234 - - - - - - - - -- - - - - -- - - - -Rn 241 - - - - - -- -- - -- -- - - - - - - - - - - - - - - - - - - --
_
_~ ~ ~~7G C
Q.W)
C
XA
U C G
- - - - --
Rc 243
- - - - - -
Sp 247 PI 248
- -7 ---....... .. ... : Q W U U U..A U U .U. . A G U G U G C G %.J.Cr~~~~~~~~~~~~~~~~~~~~~~~ A )A:Q W C G 253 255
.__ _Sc
256
_Sc 258 .__ - - - - - - - - - - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - --
Cr 255
--
_Sp .__
-_-_-_-_-_-_-_-_-_-_
_-..______________________________
- - - - - - - - - - - - - - - --- - ---- - -- - ----
I
I 3070
3080
3090
3100
3110
3120
261
Lt 269 Ls 270
2300 Nucleic Acids Research, Vol. 18, Supplement 3140
3130
*
.
.
.
.
.
.
3150
3160
3170
3180
.1
.
.
1 Hs 70c 11 Ec 12 Tm 13 Dm 16 Le 19 At 20 Zp 22 Vc 23 Cv 24 Ne 27 Sc 30 Pc ---------______ 33 Pp ------_________ 51 Pf -.-.---________ 52Pf -P-----________ 53P1 --------------56Ng - .-.----_______ 59Ld - .-.-_______ 62G1 ---------------
63 68 72 73 75
Hc Ms Tc Ta Po
95Ng.
96 Pa. 100 Ap. 101 Ap.
102Cv. 105 Ec. 153Mg. 173Oh. 188Me 189 Mb
201Sa. 203S1 207 Tt. 208 Dr.
222 223 225 228 229 230 231
Zm
Os
Gm Cm Ce Ce Cv
P41-2'
P41-2-
234 241 243 247 248 253 255 256 258 260 261 262 269 270
Hs
-_
Rn Rc
-_ -_
Sp
P1 Gm Cr Sc Sc An Sp Pa Lt Ls
A:Q:A:A.:A
__-
__-
_- -_ -_ -_
-_
-C-G--U-G -C-CGAAC-
C
----------- -_ ------ -_ ------ -_ ------ -_ ------ -_ ------ -_ ------ -_
G
G
A
______3130
U
_______-
G
C
-_
G
C
G
A
A
C
_-
-_
_-
_- _- _- _- __- _- _- _- __- _- _- _- _-
-_ -_ -_
_- _- _- __- _- _- __- _- _- _-
_-
_-
_-
_-
......
G_C _G_G_C
-_ -_ -_ -_ -_ -_ _- -_ 3140
_______-
_______-
U_C_G_
_______-
3150
_______-
.....g
.......... Eg.."-...a.''.S..g
_______-
3160
U
_-
_-
_-
,_ A cu_.._ .._ ...._
3170
3180
18., Supplement
Nucleic Acids Research, Vol.
3210
3200
3190
3230
3220
2301
3240
--- ss 1 --O c-c 7 --Ec- 111 -Tm- 121 131 Le 16 . . At 19 . Zp 20 . -Vc 22 Cv 23 . Ne 24 . SC 27 . -----------------------------------------------------------Pc 30 . Pp 33 P 51 .RE P 52 .RE . P1 53 -----------------------------------------------------------Ng 56 Ld 59 . . GI 62 -
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
. . . . . . . .
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
------------------------------------------------------------
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
---
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
63 68 72 73 75
Ng 95 Pa 96 AplOO0 AplOl1 CvlO02 EclOS5 mg 153 Oh 173 Me188
. Mb 189 . Sa 201 . SI 203 . Tt 2 07 .0r 208 -
-
-
Hc Ms Tc Ta Po
P41-2'
Zm Os Gm Cm C. Ca Cv
222 223 225 228 229 230 231
P41-3 -
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
s
-
234
1 U-0 3 .Cr~~~~~~~~~~AGU UCUUGCA25 cr 25 A
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
I
I 3190
I
I 3200
-
-
-
-
-
5
5
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
I
Gm
-
.S -
I
-
-
-
-
-
-
-
-
Sc 25 S 2 n 26
pa262 2 a 26
-
-
-
I
-
-
-
-
-
I
3230
-
-
-
-
-
I
Lt 27
-
3240
9
2302 Nucleic Acids Research, Vol. 18, Supplement 3250 .
1 7 11 12 13 16 19 20 22 23 24 27 30 33 51
52 53 56 59 62
Hs Oc Ec Tm Dm Le At Zp Vc Cv Ne Sc Pc Pp Pf Pf P1 Ng Ld Gl
.
.
.
I
.
.
.
.
I
3260 .
.
.
.
I
-_
_-
_-
_-
_-
_-
-_
_-
_-
_-
_-
_-
.
.
I
.
3270 .
.
.
.
I
.
.
.
.
I
3280 .
.
.
.
I
.
.
.
I
.
3290 .
_-
.
.
.
I
.
.
.
.
_-
_-
__- _-
_-
__-
I
_-
3300 .
.
.
.I
.
.
.I
.
_-
_-
_-
_-
_-
_-
_-
_-
-_ __--
__--_ -_
-_
_-
_-
_-
_-
_-
-_
_-
_-
_-
_-
_-
-_
_-
_-
_-
_-
_-
_-
__- _- _-
_-
_-
_-
_-
_-
_-
_-
_-
__- _- _-
_-
_-
_-
_-
_-
_-
_-
_-
__- _- _-
_-
_-
_-
_-
_-
U G U U C.A
-_
__U
_- _-_-_-_-_-_ _-_-_-_-_ _-_-_-_ __-_-_ __-_-_ __-_-_ --_ -_ _--_ __-_ __-_ __-_ 41'
63 68 72 73 75
Hc Ms Tc Ta Po
95 96 100 101 102 105 153 173 188 189 201 203 207 208
Ng Pa Ap Ap Cv Ec
222 223 225 228 229 230 231
Mg Oh Me Mb
Sa S1 Tt Dr
Zm
Os Gm Cm Ce Ce Cv P41-3
234 241 243 247 248 253 255 256 258 260 261 262 269 270
Hs Rn Rc
-_
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
-_
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
-_
_-
_-
_-
_-
_-
-_
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
-_
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
_-
Sp P1 Gm C GA C G A C G U C Cr Sc Sc An
P41-3'
..Q.
.
_-
_-
_-
G G C G G A G A A A G j>
_-
_-
_-
-_
-_
_-
-_
_-
_-
NS¢A U U C A G G U G A G C C
-_
---------------------------------
-_
---------------------------------
-_
---------------------------------
-_
---------------------------------
Sp
---------------------------------
Pa Lt Ls
---------------------------------
-_ -_ -_ -_ -_ -_
---------------------------------
3250
3260
3270
3280
3290
_-
_-
_-
G__..... U.g..S. G G U G
_______-
---------------------------------
_-
_- _______- _-
_____-
_______-
_______-
3300
.
-
-
*:Ait. -
-
-
-
Nucleic Acids Research, Vol. 18, Supplement 2303 3310 .
.
.
.
I
3320 .
.
.
.
I
.
.
.
.
3330
I
.
.
.
3350
3340 .
.
.
.
.
.
.
.
.
I
3360
. .
I . . . .
I
. .
. . .
.
-- - - - - - - - - - - -- - - - - - - - - CCGA GC -CCGAGCz f A r
Hs Oc Ec Tm Dm Le At
, .I.__ \, MZ\
:-~ ~ ^. .L .' -
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
:
@i: -
5,;- M--. -::->.IU -
-
-
-
-
^
-' .
-1
F-v
-
- - - - - *;.:
-va
-
..;
U G .GM .X.W _.
G
ACU U C OF
---------------------------------------
,,.,it.
C A U G U U - G A - U U - U C
-------------------------------------__ -------------------------------------__ -------------------------------------__
---------------------------------------
...
C U C U G C GC C G.
1
1ii.:.:.iir,
i
U c U c
.i.i.I1r
II rii
Ld GI
Hc Ms Tc Ta Po
63 68 72 73 75
Zp
Q_
Vc Cv Ne Sc Pc
..,,, ,
X.ii.W X.gX .St$Si:i..:r::
1 7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62
Pp
.. ...UCUC U U t
C,
U U
Pf Pf
G. c~ f. u
U U U A C
P1
U
..Q..
C C C G C~A~ACU~ A UU ¢¢.
*
Ng
41 '
-~~~G .~~~G -~~~A .~~~G
-
-
-
Ng 95 Pa 96
Ap 100 Ap 101 Cv 102
.~~~G Ec 105 Mg 153 .~~AU Oh 173 .~~~G Me 188
Mb Sa Sl -~~~G Tt .~~~A Dr -
189 201 203 207 208
41' - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- - - -
-
-
-
-
. - - - - - - - -C A U A A .
-
-
-
-
-
-
-
-
- - - - --
4..:.,........... A
-
-
-
A Zm 222 A Os 223 U A Gm 225 - - Cm 228 - Ce 229 A ....... Ce 230 -.- Cv 231
- - - -- - - -
......
-
-
P41-3'
-__
__ -__
__
U G G U A C G U A
{
. ____ --- --- - - -- - ----------- -- -- - - -- - -------Hs 234 __ _----- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - Rn 241 - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -- - Rc 243 . - - - - - - - _ Sp 247
.~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P
A ...................................... 253 ._Cr255 ------------- - --------- -r - - - - -_ Sc 2 56 . ~ ~ ~ ~ ~ ~ ~ Sc 258 -- - - - -- - - - - - - - - - - - - - - - - - -_- -2.An___________________ An 260 .~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 261 - - -- - - - - - - - - - - - - - - - - - - - - - - - 261 G U A C G C C C C
A A A A C
C A A A C G A A A
~~~~~::~~~~~~~~.A~3tJ1
-
. . .--
3310
3320
I
1
3330
3340
- - - - - - - - - - - - - - - - -- - - --- - - -- --- -
3350
3360
SPaLt 269
Ls 270
260An-
2304 Nucleic Acids Research, Vol. 18, Supplement .
.
.
3370
I.
.
.
.
.
I.
.
I.
.
.
.
59 62
63 68 72 73 75
Pp G U U Pf - A Pf A U A P1 --A Ng Ld -
Gl
-
A A A A
41 '
G G iG ,G IG
* *
Ce
Ge
2 31
234
241 243 247
Gv
I
.
.
.
.
I
-
3410 .
.
.
.I
.
.
.I
3420 .
.
.
.I
.
.
.
.I
_---
-
G G G G G
C C A C U
C U U U C
A A A A
A - A - A - A - C A A -
- - - - - - - - - -
- - - - - - -
1170
1
G-AiA G- A
G A C A -GAC A - - U A U A - - G A U A - - G A U A .------ -- -- -G A U A - - - - - - - - - - - - G - U A - - - - - - - - - - - - G A U A -- - - - - - - - - - - G A C A i- - - - - - - - - - - - G U C A i- - - - - - - - - - - - G U C A ,- - - - - - - - - - - - G U C A ,- - - - - - - - - - - GGA- A - - - - - - - - - - - - %a G Am -A - - - - - - - - - - - - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
G-AI G-A G- A G- A G- A G- A G-A G- A G- A G-A G-A G- A
-
A A A AA A A A A A AA A A -
- - - - -
-
- - - - -
- - - - -
-
- - -
. .
- - - - - - - - -
42 _ A
UcA
----G U ----G U ----G A
A
A C AU A AUU C U A -
-A
..
G
UAAUAAUAAGAAA A - -
___-G A
-
----G
-A
A
GAAUAA-
UAA--
Hs.
Rn ---
.
G U U CA--GC G U U CAGCG G U U UA--U U C UA--. - - - - - - - - - - - - - G U C UA--UU UA-._____ - - - - - - - - - - - - - G U U UA-- - - - - - - - - - - - - G U U UA--- - - ---- - - -- - GU U CA - - - - - - - - - - - - - - G A C UA-- - - - - - - - - - - - - G A C UA-- - - - - - - - - - - - C U AC AA-- - - - - - - - - - - A U G AA-- - - - - - - - - - - - - G A U AA - - - - - - - - - - - - - - G U C UA-- - - - - - - - - - - - - A U C UA-- - - - - - - - - - - - - G U C UUA-- - - - - - - - - - G U A A A C 'UA-- - - - - - - - - - - - - A A U IU GC - - - - - - - - - - - - - G C G iA---
U A A A: C. AA& .. -
Rn
- - - - - - - -- - - - - - - - -- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
Sp.
253 Gm
255 Cr
u.
.
GA
256 Sc AX .h...L. U A - - UU 258~~~~UU U ScSc. A -.". - - fKA. V." 260 An - - - - - 261 Sp - - - - - - - -UU 262 Pa - - - - - C U A A G U -
Lt
-
- - -
A U
- - - - - - - - - - - - - - - - U --A.uUU U A - - - -
258
26 9
.
- - - - - - - - - - - - - - - - - - - - - - -
...
299 7m
230
.
A A A A - A
1
Sa SI Tt Dr
--
.
1160
*
Mb
-
3400 .
-
1150
I * *
.
Oh Me
229
I
42
Ng Pa Ap Ap Cv Ec Mg
228 C
.
A-A C A G A C A * A A A G AUUC G -A CGGG-
.
G U
.
G - A G -A Cg)~ G -ACu....
-
G U U
.
G - A CVAB.
Hc Ms Tc Ta Po
223 Os 225 Gm
.
G-A C A AG-A C A A- A C A A - A G - AC A - A CUA
G G G G
1140
95 96 100 101 102 105 153 173 188 189 201 203 207 208
I *** I
.
IHs C CC C 7Oc C CC C 11 Ec C. ' 12 Tm A C A A 13 Dm C A C i> 16 Le . G C 19 At : G C 20 Zp GGC C G C A 22 Vc GCG G 23 Cv 24 Ne .G G G A i 27 Sc 30 Pc GC G
33 51 52 53 56
3390
3380 .
-
iN"..
..........
^
- - - - - - - - - - - - - - - -
--G -
- - - - -
-
J
C U U U A U AU----G A U ---U A U A---
-
270OLs.--
3370
3380
3390
3400
3410
-i4ZU
AU
Nucleic Acids Research, Vol. 18, Supplement 2305 3450
3440
430
* IC... IA
.1I
I
.
.
.
.
.
I
.
.
.
.
.
.
I >
-C G A G -C G A G A U G A G 4G A A G G A A G
A A A C
A C A U A U U U%. U.
G G G G
U
.
U A. C A U A
~CAt
.Afi::..
A A G A . UG A A A U 1 A G G A C G UG
-
GA
*t*~G A CA C A AG A C(~tA
U U C
A
A A
~~GA ~G A
A A
C~
AIJ~
1190
1200.........
1180
1190
1200
AAGG AAAGG
A A A A A A
IAAGG
AAGG AAGG AAGG
1AAGG
,AAGG ,AAGG
A AC AC AC AU
'AAGG
jAAGG iAAGG AAGG
1220
I. ..
.
A-A-A-A-A-A-A--
A A A A A A A
AAAGG~ A G Gt
AAGGWJ AA G
:..'.:....S.. s..'s
A
-----------
A
AG
...
.
.
.
.
I Ng 95 Pa 96 Ap 100 Ap 101 Cv 102 Ec 105 Mg 153 Oh 173 Me 188 Mb 189 Sa 201 Si 203 Tt 207 Dr 208
-34'-
36'9
42' AAGGA AAGGA A AG GG A AG G
I
A ----U A -- -- C A -- -- C A -- -- C A -- -- U A -- -- C A -- -- U A -- -- U AA -- -- U A- - --U A -- -- U A -- -- C A -- -- C
A
|AAGG
....
Zm Os Gm Cm Ce Ce Cv
--U --U --U --C
--C
--U -
-C
A
U
A A A A
A..C
O C C A A QGu.:::.::::~:
-UA
------
A
-
_______________________ U _
_ _ _
_
_
_
_
_
_
_
_
_
_
_
_
. .
I
3430
3440
222 223 225 228 229 230 231
...
-U A
U AAC U C CCAA -:G: A UC A A ~ G~AC-UU A-A-AA--A CA. U-C A -----C A A U UAA~ .. AC)UA G G AC G U -C A G C u CAQR -----GC -A -UA () A CAAA -G~ AC CGU -CA C--CUCOC A-GA ~C A U*U A AG U ~CA1 ~~A A G G A UC At(U AGU(~~~~) ----CA A A~~6 AU ACA¢UCAG ------ GA IUA - UAA~~~ ~~A~~AcA ~AAU C -AAi U A A~ GC A~~~~ A -A--U AA ~~~~~ ~~~A U C - - ------A A G A A A: A U U A A--U O A ~1AG U C --A ~ $ -A - - U A?K~~~~~ A &W~~~~~W~~4AAU A AA -------t G U U A U --A-U .AU GQGA - -U &A.~~~ A -4G*~~~~- -~ CU CA A.A A &A........GA.U.A.
----AC
--
63 68 72 73 75
34,-
1210 .
.
Hc Ms Tc Ta Po
P1
Ng Ld
- - -
1180
GI
24 27 30 33 51 52 53 56 59 62
Ne Sc Pc Pp Pf Pf
G A A - U ---GAA-U - - - G A A - A - - - G A A - U - - - G A A - U
AAGGA A A A G G A A .C GA A AGG A AG GA CuU CAX..: A.-. . A.. A A G G A QQ&GG C C A
....
Cv 23
-~
36'
42'
Hs 1 Oc 7 Ec 11 Tm 12 Dm 13 Le 16 At 19 Zp 20 Vc 22
U U
~G A J**G A
At
3480
I.... I
U
U U U U U U U U U U
)..G
mJQG A A G U A A AG A..
A A QGG GA A A A A A G G A A
GA
A
A tfl¢ C(C)AGA A A G A U G g Ag UGAA G A A A GA ~C C~~G A ~G A A ~~ GA & ~G A
o.0
A(W)
>:.>:.........> -...:. ..:.>>.. ...>.>.>. ...... :.>>>>
...>
.i.A
C C C C C
C - CG A G A U
3470
3460 .
.
3450
3460
3470
3480
Hs 234 Rn 241 Rc 243 Sp 247
P1 248 Gm 253 Cr 255
Sc 256 Sc 258 An 260 Sp 261 Pa 262 Lt 269 Ls 270
.~
2306 Nucleic Acids Research, Vol. 18, Supplement .
.
3490
.
.
3500 .
.
.
.
.
1 Hs :..4 G C U
7Oc 11 Ec 12 Tm 13 Dm 16 Le 19 At 20 Zp 22 Vc 23Cv 24 Ne 27 Sc 30 Pc 33 Pp 51 Pf 52 Pf 53 Pl 56 Ng 59 Ld 62 GI
G 4G 4G . G 4 G 4 G 4 G
C C C C C C C G C G C G C G C G C G C G C G C G C G C G C
4 4 4 4
4 4
C C C
C C C C C U U
U U
G
....
63 Hc
A A A A A
A
72 Tc 4GCU A 73 Ta 4G C U VA C4. 75 Po A UCG C U
.... 95 96 100 101 102 105
Ng Pa Ap Ap Cv Ec Mg Oh Me Mb Sa Sl Tt
G G G G G 4.G G G G G G G G G
153 173 188 189 201 203 207 208 Dr
.. ...A
...
CZm
G
GC
229 Ce
G
G
22
..
..
A
4G C GACe(
Cv
4G C
234 241 243 247
Hs 4-C Rn . A Rc G Sp..- G
248
p1i
G AA
253 Gm 255 Cr
G C C G C UA
258 260 261 262 269 270
Sc An
~4G UAA 4
Sp . Pa Lt Ls
G G G G
U C C
C C C C C
A A A A A
.
.
.
.
*
.
3540
.
U
A~
G A A
AA C
A A A A A
A A A A G
U U U U C
-
- - -
- - - - - - - - - -
- - - -
1250 *
.
ACAGA---ACAAA---ACAAA---ACAAA---ACAGA---ACAAA---ACAAA---ACAAA---AAUAG---ACAAA---ACAAU---AC A A U
-
-
-
-
ACAAA---ACAAC----
..
..
A C A A U
A C A A C A A C A A U A C A A C A A U l. .¢.2A C A A C A A U A C A A C A A U A C A A ...,x .................A...............
.U44C444 .
230 Ce
Sc
.
..
.. ..
2 31
256
.
E43-1
A C A AU
AA AA AA GA
.
.
43
.. A~~... ~
.
3 _ _ _
AC AC AC AC AC AC AC AC AC AC AC AC AC AC
U U U G U
~
.
U U U
1240
U U U U
~
3530
3520 .
UUGUUA ____AC
32'
~
.
.
....I.. .I
34'
~
.
C C C C C
1230
I...
C U C U C U C U C U
C C C C C C C C C
.
43
A'A~A
4G C
3510 .
.
......
4GUG C UCA
68 Ms
.
32'
..........
. ..
.
ACAC ACAC ACAC ACAC ACAA ACAC ACAC ACAC ACAC ACAC ACAC ACAC ACAC ACAA ACAC A C A A AC AC ACAA ACAA ACAC
U C C U
G C C
34'
.
A A A A A A A
A A A -
- - - - - - - - - - - - -
- - - - -
U A
- - - - - - -
AUQ44C U AC - A ........... U AGO . ..... -A..C-. AC-AAUU......-......... B. -----. ...f...-......A C AA U UA .......................
-
-
A AAAUG. U CU
A 4 ~~J 4WflICA ~A UC 4>4:.|A4i
--AAU
-
-
-
-
-
A ACA UAG - - - A A AU U % - A AU COCA A ~~ ~E ~~cc~tj AA...uX - ... U A~~A -U -U A - C- ~$A .. ~ C. A ~A AX .. U AAUG U A-
A U
A
A A
U
C Ui
.A.(:4LI4.gC GAOUAA ().i~
...Ct
A
UA A A A
- - - -
-
-
- - - - -
3490
3500
3510
3520
3530
3540
~
Nucleic Acids Research, Vol. 18, Supplement 2307 3550
3570
3560
3580
3590
*
3600
*~**
~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~. . . . .
-- U GO C 1J A -U GO C U A ~ ~ ~ ~ ~ ~-~~~~~~~~~...... ~ ~ ~ ~ ~ ~ ~ ~ ~C~ ~ ~ ~ ~ ~ ~ ~ ~ ~ G-00 0 ~ ~ ~ ~ ~ ~u~ ~ ~ ~ ~-~ ~ ~ ~ ~ ~~~ ~~ ~~~ ~~ ~~~ ~~c~~~ ~~~~~ ~~~ ~~c~~~ ~~ ~~ ~~~~~~~. ~. . U A C -U G G-u -~~~~~~~~~~~~~~~~~~~~~~~~~~~AU~~~~~~~~~~~~~~~~~~~~~~~~ .......CC... A G G-CU U A~ ~ ~ ~ ~ ~ ~U~ ~ ~ ~ ~ ~ ~ A~ ~ ~ ~ ~ ~ ~ ~ ~. ---------------------------------------U CA CA-CC - CG A--UG .U CU A U A A ~~~~~~~~~~~~~~~~~~~~ U A C GAC-U.'A U -----------------------------------------UGC ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~ ~. . . . . . . . . -CC -~ ~ ~ ~U~ ~ ~ ~ ~A~ ~ ~ CC.. u G GA ...........
---------
U U
U~
~fl.t&~~GGGAA-
Hs
Cc Ec Tm Dm Le At
Zp
Vc U1~~.44.~ A- Cv *G A- Ne SC c -G~~~~~~~~~~~~~~~~~~~~~~~~ G G A- PC C~CG A- Pp 41---' G A- Pf U &~~JAC~GA- Pf
~ GO UU. U A U U U UGO--UAU
- -
t A U A CU V
G
*~(C~A~ UG A- P1
~~AAU - ~AG- Ng Ld j.0. Gl GO
44
G A G A G A G AG A-
Hc Ms Tc Ta Po
7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62
63 68 72 73 75
1260
Ic 1c 1c -c 1c
3 Ic
3 1u 1c
3 -
A
%c
3 Ic
3 1c
3 1c
3 1c
3 1u
3
-
-
-
-
-
-
-
-
-
-
-
-
Ng 95 Pa 96 Ap 100 Ap 101
Cv 102 Ec 105 Mg 153 Oh 173 Me 188 Mb 189 Sa 201 Si 203 Tt 207 Dr 208
44
GU C -& G G ~~~UC ~U C ~ ~ ~ ~ ~ ---- ~~~~ U A --- ~~ A ~ U A -& A G A
G C G A U -~-C~~G C C1CG C G C G AU GO G GOC G A U ~G C G C U U A G C G C U U
~~~UG C A ........C.
Zm 222
Os 223 Gm 225
Cmi
228
Ce 229 Ce 230
G C
Cv
231
Hs
234
Rn
241
Rc
243
Sp 247
a. A..... A A G C -
-
A A
A -
-A A--A -
G
AU
A
U
G-
-UAUAUUUAU
3560
3570U
3 580
3590U
248 253
Cr
255
Sc
256
Sc
258
An
260
Sp 261
I
I 3550
P1 Gm
3600U
Pa
262
Lt
269
Ls
270
2308 Nucleic Acids Research, Vol. 18, Supplement 3610 .
.
I
.
.
.
.
.
.
I
3620 .
.
1 Hs -CA70c -CA-UA-GO-GA-CA-CA-GA-GA- A A A: -GA-GAA A - GA -AA-AA-
11 Ec 12 Tm 13 Dm 16 Le 19 At 20 Zp 22 Vc 23 Cv 24 Ne 27 Sc 30 Pc 33 Pp 51 Pf 52 Pf 53 P1
.
I
.
.
.
3630
I
.
.
.
.
.
.
I
.
.
.
.
I
3640 .
.
.
.
I
.
.
.
I
.
AAC-
---
---C.
.
.
UAAC---C IAAAU-------...;
UGAUGGGGAU UGAUGGGGAU UGAUAGGGAU UGCUAGGGAU UGCUUGGGAU UGAUGGGGAU UGAUGGGGAU UGAUGGGGAU UGAUGGGGAU UGAUGGGGAU UGACGGGGAU UGAUAGGGAU UGCUGGGGAU UGACUGGGAC UGAUGGGGAU UGAUGGGGAU UGACAGGGAU CACUUGGGAC U
AA C A
A
U A UAAU--------------C..u~... A U -C-------U USUA A A .....
.QAAU-----UQ--%gCG-AAAK UAAU-OU------aUUG-AAU VAAU--------QVUVIAAC UAAU ----UU---- LJGGAAA UAAC.-------CCUUAGUC
-AAis
-U
----------
UA
U
----
-
-
-
-
-
-
-
-
-
-CU
AC----------
U
U
U U AC A : GAU CAA:
U.tAAUUGGGAU
U U U U C A A
AC CCCCGGAA AGOC-----------------------
AA-----------
GGCCGGGAC
433'
*... -GA - AOG -C GOUAA(4C--C U A A A -AA - ... A G G G A A -C UCLi--C U A A A A CC.W G-OCGCA AoCe-AA - G G GA A $#O~~~~~~ -A- As V C UCAU A A AA --- AA - ...GCGC GA SC CGA AtICCC-.AGCAtA ". -AA - S ax"G C A G-~~~~~~A -U U A A A ...........
;
1270
1280 I. . . .
95 Ng -GA96 Pa -GA 100 Ap -AA101 Ap -GA102 Cv -GA105 Ec -GA153 Mg -AA173 Oh -AA188 Me -GA189 Mb - GA 201 Sa - GA 203 SI -A A 207 Tt -AA208 Dr -GA-
I
. . ..
C.GCAGUUGGGAU U U~1300 A G U C C G G A U
I
*1..
. . ..
**
I.
CGU A G U C C G G A U
--------ACAAA --------AUAAA --------AUAAA .-------ACAAA --------ACAAA --------AUAAA U
G U COGGA U
CQ
AG U COGGAO A G U CC G GA U C4C AG U CCG GA U
454
UCCAGUUCGGAU
-GU-------AAA .--------AAAAA .--------AAAAA -U-------UAAAA ---------AAAAA ---------AAAAA A AA A ---------------
gUCAGUUCGGAU
CC.gCAGUUCGGAU .CUCAGUUCGGAU CgUCAGUUCGGAU CUCAGUUCGGAU CCCAGUUCGGAU
-U-------GAAAA
CtCCAGUUCAGAU
44' 222 Zm -GA223 Os -GA225 G(n 228 Cm 229 Ce -GA230 Ce 231 Cv -GA
-
CtUUAGUUCGGAU C..CCAGUUCGGAU C.*tGUAGUCAGGAC
1290
I....
G U U C G G A U
.
WtVA
-GA
.. .1
3660
.I
.
CA AC
44,
234 Hs 241 Rn 243 Rc 247 Sp 248 P1 253 Gn 255 Cr 256 Sc 258 Sc 260 An 261 Sp 262 Pa 269 Lt 270 Ls
.I
.
--AAC -CC---#UO---O AAC--------------C CAAC
56Ng
Hc Ms Tc Ta Po
.
-A A C
59 Ld -AA62 Gl - G A -
63 68 72 73 75
3650 .
43, -------AAAAA
GCUG-AGCUAACIYCC
UCAG U UCGGA U
....UG-AGCUAACSe::cc-------AAAAA
UCAGUUCGGAU U CA G U U CGG A U UCAGUUCGGAU UCAGUUCGGAU UCAGUUCGGAU UCAGUUCGGAU
---------AAAAA
-------A--GAAA
A#lU-GGCAAAUCV#A--------GAAA A-AGCUAACCUC.t---------AAAAA CCAAC -U-------------AAAAA
----CC---
------CUACGAUA -CGCAC A
~~~~~~~~~~~~~~~~~o-
-U
U
U A C G A A A
C A A A C G A A A A G U U G GA A
A.
~~~~~~~~~~~~~~~AAACC
-~~~~~
U~~~~~~~~%C AGU CG GAU CGGG UAC AG AIittt.C~~~~~ tA 'Wr A,1 -4I--A u*A A-4-4 AAG A A --
-A ~ ~ ~ G V~~ UG
A A
A A G A U ;~~~~~~~ C U A
4
A
-A
UU
4--$ -A-
A
A
C A
G G A
A
A U AL
U
A
AU
U
AG
U
A
U
C
A U G
U
U
G A
U
A
U
G A A ____________________________________________________________
.A_ __
1 3610u
3620
3630
3640
5
3650
3
360t
bC..~GU~-
Nucleic Acids Research, Vol. 18, Supplement 2309 3680
3670
C 00.i.. U C i . A C:....CU C U U 5.gAA AA A GG UU C G U A A C-- - - - - A A G G UU^4W ^5.W*: A Ai. A G.(AA..U A A A G U C G U A A C- -----A A G G M g S AcAG G A A G U CC GG UU AA AA CA G G b....... utXM. ....+..X CA A G GGG "XA k.A ... 5-..,8.A A G U C G U A A CC G U A A CA G G b _ . . eA QGA)A A A G U CC GG UU AA AA C-- -----A - - - - A A G G C- -----A A G U A AA G GGG - - - -- A A X ..A:*A . C A A G A U A G U G UA AC4-A GA A G U C A A G G AG A A A G G A A G U CC GG UU AA AA CU. CA A Q.A) GA^ A A G U C G U A A CA A G G A. A A)A.A..A A G U C G U A A CC G U A A CAA~~~~~AA -AA AA GG GG C G U A A C- C G U A A CA A G G A A A G U C G U A A C- -- A A G G - - - - -
C-
-
-
-
-
-
-
G~
47'
48
.
63 Hc +. A.A
AU.
A
A
A.fl..
A AG U C C GA 'U ......U. GA-A A AGU C 73 TAca GA AA G C A , ,G ..., .. . .A C ""A" AA ..
72 Tc 7
T
U
1470
1480
*
.
I
.
.
.
.
G U A A G U A A G U A A G U A A G U A A
1490 .
.
.
.
.
C - --A A G G ~ C - - - - - A A G G U C - - - - - A A G G
C
J
w
---A A G G U...
- -
C - -
G4
---A A G G UACG
1500 .
.
.
.
.
.
.
1510
.
*
95 96 100 101 102 105 153 173 188 189 201 203 207 208
Ng Pa Ap Ap Cv Ec Mg
AAGUC GUAAC-- AAGUC GUAAC-- AAG U C GUAAC-- AAGUC GUAAC-- -
Oh Me
AAGUC GUAAC--
xxxxx GUAAC--
-
-
A A G G
*.......
...........
...
AAGG A AG GXAA C AAGG
AAGUC GUAAC-AAGUC GUAAC--
xxxxx GUAAC-- - - - A A G X X X X X X X X
Mb
AAGUC AAGUC AAGUC AAGUC AAGUC
Sa Si Tt Dr
GUAAC-GUAAC-- - - - A GUAAC- - ---A GUAAC-- ---A GUAAC-- ---A
AA GG GG 5 UA CG A G G UA~~~ AG G ACJ
47, 222 223 225 228 229 230 231
Zm Os Gm Cm Ce Ce Cv
234 241 243 247
Hs ...iA C G -C A Rn C U A A C -A A Rc ....i...... - - - - -.-- - - - Sp
48 AAGU AAGU AAGU AAGU AAGU AAGU AAGU
A C A A U A - C A A C A .A A ( A U A G A A t. A IJU A A A C A i..A
-
248 P1 - - - - - - - - - 253 Gm -AC1 255 Cr -UAUf*4
256 Sc 258 260 261 262 269 270
CA
U C G C AU Sp.. C Pa Lt - - - Ls - - - - -
Sc An
- -
- - - - -
3970
V
3980
A A A A A A A A
G G G G G G G G
AAG
A A A A A
G G G C C
3990
U U U U U U U U U U U U G G
C G C G CG C G C G C G C G
U U U U U U U
A A A A AA A A A A A A A A
C C CC C C C -
-
-
-
A A G G AA GG A A G G -A A G G - A AGG - A A G G - A A GG
-
C G U A A C - - - - - A U G G C G U A A C- - - - - A A G G C G U A A C - - - - - A U G G C G U A A C - - - - - - - - - - - A C C G U A A C - - - - - - - - - - - A C C G U A A C - - - - - A A G G C G U A A C- - - - - - A G G ...f> U G A A - U -A C AGG .AtN UGAAAU ----- ACAG C G A A A U - - - - - A U G G gi 4 C G A A A U - - - - - A A G G 1 C G A A A U - - - - - A C G G VWC. U G C A G U - - - - - A A U U.> U G C A G U - - - - -A A U U -W
,.-.-..-. 4000
4010
4020
Nucleic Acids Research, Vol. 18, Supplement 2315
I
AU
.
.
.
.
I
.
G A A
.
.
.
I
I
I
.
I
.
.
.
.
GAUCA---U UA----_______ GAUCA---U UA----------GAUCA---U UA--_________ GA U CA ---U UA----------GAUCA---U UA.-.---_____ GA U CA ---U UG---GAUCA---U UG----------GAUCA---U UG----------GAUCA---U UG----------GAUCA---A GAUCA---U UG---------GAUCA---A GAUCA---U UA---------GAUCA---U GA UCA--- U GAUCA---U GAUCA---U U A GAUCA-UUU GAUCA--UU GA UCC--U C ,UA.-____
:GU G A A
*$U G A A :GU G A A SU G A A .GUGAA SU G A A ...GUGAA UGUGAA GSUGAA ..GUGAA EU GGAA .G.UGAA U GGA A GUGAA UGUGAA U G A A U GGA A
-
-
-
-
-
-
-
-
-
-
-
UA.----IUA.----eUA.--___
S UGAA
-
GGGAA UGG A A S.GUGAA G.. UGA US G A AA .....UGAA G A .8GA ..G.GA A
-
.
.
I
.
GAUCAGA UCA GAUCAGAUCAGA U C A -
. . .
.
I. *
I
.
.
.
.
I
.
.
.
.
I Hs
1 7 Ec 11 ________________-------Tm 12 ____Dm______----------- 13 ----------------------- Le 16 -----------------------At 19 --------------------Zp 20 Vc 22 Cv 23 24 -Ne _________-_-------Sc 27 _______-_------------- -Pc 30 ______________-------- -Pp 33 Pf 51 _____________---- --Pf 52 ___________-----P1 53 _________-_---------M----------------------g 56 --- Ld 59 ---------------Gl 62 -
_____O________------c -
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-----
-
------------
I.
GGA U C A
.
.
.
I
---CCUCCUUUCU-
SG G A A ctsscaGAUC A ---CCUCCUUA.
CGGA A G AkA
-
-
-
-
-
-
GA U CA ---CCUCCUUA.----
-
4P5t.0EG* GGGAA UU CC AA ---CCUCCUUUCU.-----CCUCCUUUCU.----C U CC U U U GAUCA ---C U C CU U
G.G.....G A U C A 48 48'
-
-
-
-
-
-
-
-
-
-
95 .Mg______Pa 96 100 ._____-----101 ._____-----
----------
----
-
---
---------
--Cv 102 EclOS
153 .Mg______--------------Oh 173 --------------
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
U-------U--------
#GCtLGGA U C A ---CC
-
-
GA U CA ---CCUCCUUA.----
GGA A
GGAA UGGAA
-
-
XXXXXXXXXXXXXXXGAUCA ---CCUCCU-----GG A A ........... G A U C A ---CCUCCUUA. AGAA G A U U A ---CC UCC U U UCU------VG G A A G~4AtGA U C A ---C U C CU U U-------XXGAAGXXXXXXXXXGAUCA ---CCUCCUUUCU.--CG GA A G4~4$40 GA U C A ---CCUCCUUUCU.--." CGGAA
--------Hc 63 -------- Ms 68 --------Tc 72 --------Ta 73 _Po 75 ._
----------
1540
..I I
SG G A A
UG G A A UM 6 G A A
.
--CCUCCU--CCUCCU- .-----------CCUCCU- .----------CC UCC --CCUCCU- .----------
1530
1520
CG G A A GG G A A
.
'48
~iG G A A
.... I
.
4080
4070
4060
4050
4040
4030 .
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
--------------Mb 189
Sa201 S 203 Tt 207 -Dr208 .___------
-------------- ------------------
Zm Os Gm Cm Ce Ce Cv
U CC UU G A U C A ---CC UCC UU.-----------.------GA UCA ,---CCUCCUUU .------,---CCUCCUUC ------------
~ &CJGAUCA ,---CCUCCUUC
.-------
#,SUSG A U C A ---CCUCCUUA-------GGS A A S UG G A A G&51~UGA U C A ---CCUCCUUG.-------
4GACGA---AC.----------------------------------- Hs 234
U..GGAA
G GA A
4GAAUA---AU.-----------------
Rn 241
Rc 243 Sp 247 P1 248 GD 253 255 4WAGUCA---AAUUUAUAUAA.------------- Cr Sc 256 .GGCUUA---UAAAUAUCUUAAAUAUUCUUACA-------Sc 258 5GCUUA---UAAAUAUCUUAAAUAUUCUUACA.------An 260 U*.AUGAA--UUAACCACUUAGCAAUAAAUAAAUGCAUAUAUAUACCA U U A A C C C C C C C C C------------- Sp 261 AACU A - - - A U U A A U U C A A ACU CA A A A U A U UU U A U G A A A A U C C U C A Pa 262 GS AC U U A --- U A UAU U Lt 269
*JGGAA
GGUUU----AUA.-----------------
-GAAAAU-------------------UGGAAAAU.------------------G..,GAUGG---AAUCC-----------------
...SCACA CACA SGGAA
CGGAA GSGAA
*:GGAA AU G G A A
UGSB6AA U-.GGAA
A :.'I G.G.A. *J..AUAA A. !.
Ls 270
UU
.-AUAA
I
222 223 225 228 229 230 231
I 4030
I
I
4040
I
4050
I
I
4060
I
I 4070
408
2316 Nucleic Acids Research, Vol. 18, Supplement 4090 .
I
.
.
.
.
I
4100 .
.
.
.
.
.
.
4110 .
.
.
.
.
.
.
.
.
4120 .
.
.
.
I
.
.
.
.
4130 .
.
.
.
.
.
.
4140 .
.
.
.
.
.
.
.
.
1 Hs 70c 11 Ec 12 Tm 13 Dm 16 Le 19 At 20 Zp 22 Vc 23 Cv 24 Ne 27 Sc 30 Pc 33 Pp 51 Pf 52 Pf 53 P1
56 Ng 59 Ld 62 Gl
63 Hc 68 Ms 72 Tc 73 Ta 75 Po
95 Ng 96 Pa 100 Ap 101 Ap 102 Cv 105 Ec 153Mg 173 Oh 188 Me 189 Mb 201 Sa 203 S1 207 Tt 208 Dr
222 223 225 228 229 230 231
-
-
Zm
Os
Gm
Cn Ce Ce Cv
234Hs. 241 Rn -.-.---- ---------------------.----243 Rc -.-.---------------------------.---247 Sp. 248P1. 253 Gm. -.----------------------------------_-_-___ CACUUGUUA 255Cr --------------------------------------------256 Sc --------------258Sc --------------260 An A A U U U U A U A U A C U U U U G U A U A U A A A A G A U G G A A C A A A U A U A U A U A ______4140 261 Sp. 262 Pa AAUCAAAUUUUGAUUGAUUUUUAUAUAUAUAUAGGAAUAU A U U C U A A A U A U 269 Lt ---------------------------------------------
270Ls
-_
_-
_-
-_
_-
_-
-_
_-
_-
-_
_-
_-
-_
_-
_-
-_
_-
_-
-_
_-
_-
--.------------------------------------------
I 4090
I
1
4100U
4110
4120
41.30
Nucleic Acids Research, Vol. 18, Supplement 2317 4160
4150
Hs Oc Ec Tm Dm Le At Zp Vc Cv Ne Sc Pc Pp Pf Pf P1 Ng Ld GI
1 7 11 12 13 16 19 20 22 23 24 27 30 33 51 52 53 56 59 62
--------Hc -____----Ms --------- Tc --------- Ta .____---- Po
63 68 72 73 75
--------------------
_-.-----.______ --------------------
.__--------------------------------------------------------------------
-
--
--------
-
.-----------------
_______-_------------------------------
-
-
-
-
-
-
-
-
-
-
-
-
-
-
._______-----------.___________-------.___________----------- - -------.___________-----------------------------------------------------------------
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
.__________--------- Ng 95 Pa 96 .__________--------- Ap100
--------------------
-------------------- -----------------------------------
AplOl
Cv 102 . EclO5 ________------------ Mg153 ---------- Oh173 .Oh-------
--------------------
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
Me188
-----Mb 189 . Sa201 . Si 203 . Tt207 . Dr 208 -
-
-
-
-
-
-
-
-
-
Zm 222 Os 223 Gn 225 Cn 228 Ce 229 Ce 230 Cv 231
Hs 234 Rn 241 Rc 243
Sp 247 P1 248 Gm 253 Cr 255 Sc 256 Sc 258 An 260 Sp 261 U AU UU AU UA UA UA CU AU AU A Pa 262
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
-
_
_
-
_
_
-
-
-
-
-
-_ -_
-
-
-
-_
-
I
_ _-
_-
_-
_-
_-
_
_
_-
__-
_-
_-
_
_
_-
_-
_ _-
_
__
_
_
I 4150
_-
__-
-
__-
-
_
_
Lt 269 Ls 270
_-_-
I 4160