Comprehensive classification of nucleotidyltransferase fold proteins: identification of novel families and their representatives in human SUPPLEMENTARY DATA
Krzysztof Kuchta1, Lukasz Knizewski1, Lucjan S. Wyrwicz2, Leszek Rychlewski3, Krzysztof Ginalski1,*
1
Laboratory of Bioinformatics and Bioengineering, Interdisciplinary Centre for Mathematical and
Computational Modelling, Warsaw University, Pawinskiego 5a, 02-106 Warsaw, Poland 2
Laboratory of Bioinformatics and Systems Biology, Maria Sklodowska-Curie Memorial Cancer
Center and Institute of Oncology, Roentgena 5, 02-781 Warsaw, Poland 3
BioInfoBank Institute, Limanowskiego 24a, 60-744 Poznan, Poland
*To whom correspondence should be addressed. Tel: +48 22 5540800; Fax: +48 22 5540801; Email:
[email protected]
TABLE OF CONTENTS REACTION CATALYZED BY NTASE FOLD SUPERFAMILY ENZYMES..........................3 KNOWN NTASE FOLD SUPERFAMILY MEMBERS ...............................................................3 Group I .......................................................................................................................................3 Group II...................................................................................................................................... 4 Groups III and IV....................................................................................................................... 4 Group V...................................................................................................................................... 5 Group VI .................................................................................................................................... 5 Group VII................................................................................................................................... 6 Group VIII.................................................................................................................................. 6 Group IX .................................................................................................................................... 6 Group X...................................................................................................................................... 7 Group XI .................................................................................................................................... 7 Group XII-XVI .......................................................................................................................... 7 SUPPLEMENTARY REFERENCES.............................................................................................. 9 SUPPLEMENTARY FIGURES..................................................................................................... 12 Supplementary Figure S1......................................................................................................... 13 Supplementary Figure S2......................................................................................................... 15 Supplementary Figure S3......................................................................................................... 16 Supplementary Figure S4......................................................................................................... 17 SUPPLEMENTARY TABLES....................................................................................................... 45
2
REACTION CATALYZED BY NTASE FOLD SUPERFAMILY ENZYMES In majority of cases the nucleotidyl transfer catalyzed by enzymes belonging to NTase fold superfamily may be depicted as follows: X + (d)NTP → X-(d)NMP + Y, where (d)NTP represents any of five (deoxy)nucleotides ((d)ATP, (d)GTP, (d)CTP, (d)TTP or UTP) and (d)NMP corresponds to (2’-deoxy)ribonucleoside 5’-monophosphate. Usually, during a nucleotidyl transfer reaction, the 3’-OH group of the substrate X attacks the 5’-α phosphate of an incoming (d)NTP (1). Substrates (X) and products (Y) differ for various NTase fold proteins as shown in Supplementary Table S1. Importantly, there are some known exceptions from the described above general reaction mechanism. For instance, N-terminal NTase domain in glutamine synthetase adenyltransferases, that are involved in regulation of glutamine synthetase (GS) activity, catalyzes a removal of covalently bound nucleoside monophosphate from GS and in consequence acts as a reverse nucleotidyltransferase (2). Another NTase fold proteins, adenyl cyclases, produce cyclic product 3’-5’-adenosine monophosphate (3’-5’-cAMP) from ATP but do not catalyze NMP transfer from NTP to the hydroxyl group (3).
KNOWN NTASE FOLD SUPERFAMILY MEMBERS Group I Group I consists of the following NTase fold families: PolyA polymerases (COG5186, KOG2245),
DNA
polymerase
sigma
(COG5260,
KOG1906,
PF04928),
tRNA
nucleotidyltransferases (COG1746), S-M checkpoint control proteins CID1 and related nucleotidyltransferases (KOG2277). Proteins with solved structure in this group include: yeast and bovine PAPs (PDB: 2o1p (4) and 1q78 (5), respectively), TUTases 2 and 4 from T. brucei (PDB: 2b4v (6) and 2ikf (7), respectively), archeal CCA-adding enzyme (PDB: 1r89 (8)) and OAS 1 (PDB: 1px5 (9)). These protein families are involved in many biological processes (mainly in nucleic acids metabolism) such as: DNA repair (mostly proteins from DNA polymerase sigma family), mRNA polyadenylation (polyA polymerases), tRNA maturation (CCA-adding enzymes), pre-mRNA editing (TUTases), 2’-5’-oligoadenylate synthesis (OASes) and chromatin remodeling (TRF4/5). Proteins from group I possess various domain contexts. Usually, they have a three-domain structure, which includes N-terminal NTase domain and C-terminal PAP/OAS1 substrate binding domains 3
followed by Poly(A) polymerase predicted RNA binding domain. Exceptionally, NTase domain may contain an inserted RNA binding domain, like in TUTase 2 from T. brucei. Some proteins from group I harbor also various additional N-terminal domains preceding NTase domain. These domains are involved in the tRNA splicing (2’-5’ RNA ligase domain (PF02834), cyclic phosphodiestrase domain (PF07823)), post-translational modifications (dual specificity phosphatase domain
(PF07827)),
DNA
repair,
apoptosis
or
intracellular
signaling
(10)
(endonuclease/exonuclease/phosphatase (PF03372)) and protein-protein interactions (Ankyrin repeats (PF00023)). Additional C-terminal domains include: adenylate and guanylate cyclase family (PF00211), which catalyzes formation of signaling messengers (cGMP and cAMP) and domains of unknown function: DUF659 (PF04937), DUF599 (PF04654), DUF504 (PF04457). Our in silico analyses indicate that DUF659 family is probably DNA transposase. 3D-Jury method suggested that this domain has ribonuclease H-like fold together with three conserved catalytic amino acids (two aspartates and one glutamate responsible for coordination of divalent metal ions) critical for nuclease function. DNA transposase domain (11) linked to NTase domain may participate in V(D)J recombination, in which variable (V), joining (J) and diversity (D) gene segments are joined to generate immunoglobulins and T-cell receptors in vertebrates.
Group II Group II embraces CCA-adding enzymes and polyA polymerases (COG01617, KOG2159, PF01743). Several CCA-adding proteins have solved structure including those from B. stearothermophilus, A. aeolicus and H. sapiens (PDB: 1miv (12), 1vfg (13) and 1ou5 (14), respectively). NTase domain in proteins from group II is always followed by polyA polymerase C-terminal domain. Additional domains, which probably have a regulatory role include cystathionine-betasynthase (CBS) extracellular domain (PF00571) (15), Adaptin N-terminal region and uncharacterized so far DUF1091 (PF06477). Using various fold recognition methods we predict that DUF1091 is a lipid binding domain, which adopts Ganglioside M2 (gm2) activator fold, and similarly to Adoptin N-terminal region and CBS domain may have a crucial role in controlling NTase activity.
Groups III and IV Groups III and IV include streptomycin (PF04439) and kanamycin adenyltransferases, both having representatives of known structure: aminoglycoside 6-adenyltransferase from B. subtilis (PDB: 2pbe) and kanamycin adenyltransferases from S. aureus (PDB: 1kan (16)). Proteins from these 4
groups consist of two domains: N-terminal NTase domain and C-terminal substrate binding, domain and are responsible for chemical modifications of antibiotics leading to streptomycin and kanamycin resistance in bacteria.
Group V NTase fold families in this group (COG1669, COG1708 and PF01909) have several structurally characterized representatives such as nucleotidyltransferase from T. thermophilus (PDB: 1wot), HI0073 from H. influenzae (PDB: 1no5 (17)), putative nucleotidyltransferases from S. solfataricus and A. fulgidus (PDB: 2rff and 1ylq, respectively). This group includes minimal NTase fold proteins, whose structure consists of core fold elements only. Unfortunately biological role of these enzymes is not known. Some proteins from this group posses additional N-terminal and C-terminal domains including helix-turn-helix domain (PF01381) responsible for DNA/RNA recognition, HEPN domain (PF05163) probably involved in binding nucleotides and uncharacterized domain of unknown function DUF86 (PF01934).
Group VI Rat DNA Polβ (PDB: 1bpd (18)), human DNA Polλ (PDB: 1xsl (19)), mouse TdT (PDB: 1jms (20)), mouse DNA polymerase mu (PDB: 2ihm (21)) and DNA polymerase X from african swine fever virus (PDB: 1jaj (22)) are structurally solved polymerases belonging to DNA polymerase IV family (COG1796, KOG2534). These NTases are involved in DNA repair, transcription, and V(D)J recombination at immunoglobulin chains. In contrast to group I proteins, in this group substrate binding domains (Pol DNA β N-terminal domain followed by Pol DNA β second domain) are localized on the N-terminal side of NTase domain. Additional N-terminal domains include SCP-like extracellular protein (PF00188), BRCA1 C-Terminus (BRCT) domain (PF00533), elongation factor Tu GTP binding domain (PF00009) or high-affinity nickel-transport protein (PF03824). BRCT domain usually occurs in proteins involved in cell cycle checkpoint functions responsive to DNA damage (23). Elongation factor Tu GTP binding domain binds GTP and aminoacyl-tRNA, while SCP-like extracellular and high-affinity nickel-transport domains probably regulate NTase activity. Some proteins from group VI may also posses additional C-terminal domains such as PHP domain (PF02811) and Barwin family (PF00967). PHP domain is phosphoestrase, probably responsible for the hydrolysis of the pyrophosphate released during nucleotide polymerization, which is necessary to drive the polymerization reaction forward (24). Barwin family domain, similarly to SCP-like protein and high-affinity nickel-transport domain may play a regulatory role of NTase activity. 5
Group VII This group is formed by viral PAPs that belong to the family of poxvirus poly(A) polymerase catalytic subunit (PF03296). Structural representative, viral poly(A) polymerase (PDB: 2ga9 (25)) is a crucial protein participating in polyadenylation of mRNA. It forms a heterodimer composed of a catalytic component (VP55) and a processivity factor (VP39). VP55 consists of N-terminal domain (containing three helix-hairpin-helix motifs probably responsible for RNA recognition), central NTase domain and C-terminal non-active NTase domain, which binds poly(A) (25).
Group VIII Group VIII is represented by the structure of E. coli glutamine synthetase adenylyltransferase (PDB: 1v4a (2)) and includes the following families: UTP:GlnB (protein PII) uridylyltransferases (COG2844), glutamine synthetase adenylyltransferases (COG1391), predicted signal-transduction proteins containing cAMP-binding and CBS domains (COG2905), glutamate-ammonia ligase adenylyltransferases (PF03710) and putative nucleotidyltransferases (DUF294 (PF03445)). Majority of the proteins from this group take part in regulation of GS activity, what is very important during nitrogen assimilation. Most of the proteins classified to group X are composed of duplicated NTase domain followed by C-terminal four-helical up-and-down bundle domain responsible for substrate recognition. First NTase domain catalyzes disjunction of AMP from glutamine synthetase, while second one processes GS adenylation. Remaining proteins that have only single NTase domain may have additional C-terminal four-helical up-and-down bundle substrate binding domain and extra regulatory/binding domains such as cyclic nucleotide-binding domain (PF00027), CBS domain pair (PF00571), Cache domain (PF08269), PAS fold (PF08447) and ACT domain (PF01842).
Group IX Group
IX
consists
of
the
following
protein
families:
guanosine
polyphosphate
pyrophosphohydrolases/synthetases (COG0317, KOG1157), uncharacterized protein conserved in bacteria (COG2357), and region found in RelA/SpoT proteins (PF04607). Proteins of known structure from this group include GTP pyrophosphokinase family protein from S. pneumoniae (PDB: 2be3) and RelA/SpoT homolog from S. equisimilis (PDB: 1vj7 (26)). Enzymes from the Rel/Spo family consist of two domains: the N-terminal HD domain (PF01966) which degrades (p)ppGpp and C-terminal NTase domain which is responsible for (p)ppGpp 6
synthesis (26). Some proteins from this group possess also additional N- and C-terminal regulatory domains such as Methyl-accepting chemotaxis protein (MCP) signaling domain (PF00015, Nterminal), ACT domain (PF01842, C-terminal) and TGS domain (PF02824, C-terminal).
Group X Group X includes single family of NTase fold that possesses only NTase domain with no specific biological role determined. Structure in this family is solved for putative nucleotidyltransferase TM1012 from T. maritima (PDB: 2ewr).
Group XI This group embraces uncharacterized proteins (COG2320 and PF04229) with conserved NTase domain including E. faecalis GrpB of known structure (PDB: 2nrk). Domain context for this group of proteins is quite rich and diverse. Some proteins possess additional N-terminal domains such as uridine kinase (PF00485) and acetyltransferase (GNAT) family (PF00583). Acetyltransferases are involved in many diverse processes such as transcription activation, gene silencing, DNA repair and cell-cycle progression. Acetyltransferase domain connected to NTase domain may thus have a crucial role in DNA repair. The suggestion is proved by the experimental data showing that histone H3 acetylation levels increase following UV irradiation in mammalian cells and Esa-1 dependent acetylation is required for the DNA double-strand break repair (27). Group XI proteins may have also additional C-terminal domains that include acetyltransferase (GNAT) family (PF00583), cytidine and deoxycytidylate deaminase zinc-binding region (PF00383), NUDIX domain (PF00293) or another NTase domain. NUDIX domain is a hydrolase and play a crucial role in DNA repair. It removes an oxidatively damaged form of guanine (7,8-dihydro-8-oxoguanine) from DNA, the spontaneously formed mutagenic substrate of DNA replication (28,29).
Group XII-XVI Another five groups of NTase fold proteins include following protein families: predicted nucleotidyltransferases (COG2413, group XII), predicted nucleotidyltransferases (COG4914, group XIII), predicted nucleotidyltransferases (COG3541, group XIV), adenylate cyclase (COG3072, PF01295, group XV) and uncharacterized protein conserved in archaea (COG1665, group XVI). Apart
from
adenylate
cyclases,
remaining
proteins
were
annotated
as
putative
nucleotidyltransferases with an unknown specific biological role. Adenylate cyclases participate in signaling pathways by producing intracellular secondary messenger - cyclic AMP. Proteins from 7
these groups have only single NTase domain with the exception of some group XIV members that possess also additional C-terminal substrate binding domain.
8
SUPPLEMENTARY REFERENCES 1.
Pelletier, H., Sawaya, M.R., Kumar, A., Wilson, S.H. and Kraut, J. (1994) Structures of ternary complexes of rat DNA polymerase beta, a DNA template-primer, and ddCTP. Science, 264, 1891-1903.
2.
Xu, Y., Zhang, R., Joachimiak, A., Carr, P.D., Huber, T., Vasudevan, S.G. and Ollis, D.L. (2004) Structure of the N-terminal domain of Escherichia coli glutamine synthetase adenylyltransferase. Structure, 12, 861-869.
3.
Aravind, L. and Koonin, E.V. (1999) DNA polymerase beta-like nucleotidyltransferase superfamily: identification of three new families, classification and evolutionary history. Nucleic Acids Res, 27, 1609-1618.
4.
Balbo, P.B., Toth, J. and Bohm, A. (2007) X-ray crystallographic and steady state fluorescence characterization of the protein dynamics of yeast polyadenylate polymerase. Journal of molecular biology, 366, 1401-1415.
5.
Martin, G., Moglich, A., Keller, W. and Doublie, S. (2004) Biochemical and structural insights into substrate binding and catalytic mechanism of mammalian poly(A) polymerase. Journal of molecular biology, 341, 911-925.
6.
Deng, J., Ernst, N.L., Turley, S., Stuart, K.D. and Hol, W.G. (2005) Structural basis for UTP specificity of RNA editing TUTases from Trypanosoma brucei. Embo J, 24, 4007-4017.
7.
Stagno, J., Aphasizheva, I., Rosengarth, A., Luecke, H. and Aphasizhev, R. (2007) UTPbound and Apo structures of a minimal RNA uridylyltransferase. Journal of molecular biology, 366, 882-899.
8.
Xiong, Y., Li, F., Wang, J., Weiner, A.M. and Steitz, T.A. (2003) Crystal structures of an archaeal class I CCA-adding enzyme and its nucleotide complexes. Mol Cell, 12, 1165-1172.
9.
Hartmann, R., Justesen, J., Sarkar, S.N., Sen, G.C. and Yee, V.C. (2003) Crystal structure of the 2'-specific and double-stranded RNA-activated interferon-induced antiviral protein 2'-5'oligoadenylate synthetase. Mol Cell, 12, 1173-1185.
10. Lahm, A. and Suck, D. (1991) DNase I-induced DNA conformation. 2 A structure of a DNase I-octamer complex. J Mol Biol, 222, 645-667. 11. Hickman, A.B., Perez, Z.N., Zhou, L., Musingarimi, P., Ghirlando, R., Hinshaw, J.E., Craig, N.L. and Dyda, F. (2005) Molecular architecture of a eukaryotic DNA transposase. Nat Struct Mol Biol, 12, 715-721. 12.
Li, F., Xiong, Y., Wang, J., Cho, H.D., Tomita, K., Weiner, A.M. and Steitz, T.A. (2002) Crystal structures of the Bacillus stearothermophilus CCA-adding enzyme and its complexes with ATP or CTP. Cell, 111, 815-824. 9
13.
Tomita, K., Fukai, S., Ishitani, R., Ueda, T., Takeuchi, N., Vassylyev, D.G. and Nureki, O. (2004) Structural basis for template-independent RNA polymerization. Nature, 430, 700-704.
14.
Augustin, M.A., Reichert, A.S., Betat, H., Huber, R., Morl, M. and Steegborn, C. (2003) Crystal structure of the human CCA-adding enzyme: insights into template-independent polymerization. Journal of molecular biology, 328, 985-994.
15.
Scott, J.W., Hawley, S.A., Green, K.A., Anis, M., Stewart, G., Scullion, G.A., Norman, D.G. and Hardie, D.G. (2004) CBS domains form energy-sensing modules whose binding of adenosine ligands is disrupted by disease mutations. J Clin Invest, 113, 274-284.
16. Sakon, J., Liao, H.H., Kanikula, A.M., Benning, M.M., Rayment, I. and Holden, H.M. (1993) Molecular structure of kanamycin nucleotidyltransferase determined to 3.0-A resolution. Biochemistry, 32, 11977-11984. 17.
Lehmann, C., Pullalarevu, S., Krajewski, W., Willis, M.A., Galkin, A., Howard, A. and Herzberg, O. (2005) Structure of HI0073 from Haemophilus influenzae, the nucleotidebinding domain of a two-protein nucleotidyl transferase. Proteins, 60, 807-811.
18. Sawaya, M.R., Pelletier, H., Kumar, A., Wilson, S.H. and Kraut, J. (1994) Crystal structure of rat DNA polymerase beta: evidence for a common polymerase mechanism. Science, 264, 1930-1935. 19.
Garcia-Diaz, M., Bebenek, K., Krahn, J.M., Kunkel, T.A. and Pedersen, L.C. (2005) A closed conformation for the Pol lambda catalytic cycle. Nat Struct Mol Biol, 12, 97-98.
20.
Delarue, M., Boule, J.B., Lescar, J., Expert-Bezancon, N., Jourdan, N., Sukumar, N., Rougeon, F. and Papanicolaou, C. (2002) Crystal structures of a template-independent DNA polymerase: murine terminal deoxynucleotidyltransferase. Embo J, 21, 427-439.
21.
Moon, A.F., Garcia-Diaz, M., Bebenek, K., Davis, B.J., Zhong, X., Ramsden, D.A., Kunkel, T.A. and Pedersen, L.C. (2007) Structural insight into the substrate specificity of DNA Polymerase mu. Nat Struct Mol Biol, 14, 45-53.
22. Maciejewski, M.W., Shin, R., Pan, B., Marintchev, A., Denninger, A., Mullen, M.A., Chen, K., Gryk, M.R. and Mullen, G.P. (2001) Solution structure of a viral DNA repair polymerase. Nat Struct Biol, 8, 936-941. 23. Bork, P., Hofmann, K., Bucher, P., Neuwald, A.F., Altschul, S.F. and Koonin, E.V. (1997) A superfamily of conserved domains in DNA damage-responsive cell cycle checkpoint proteins. FASEB J, 11, 68-76. 24. Aravind, L. and Koonin, E.V. (1998) Phosphoesterase domains associated with DNA polymerases of diverse origins. Nucleic Acids Res, 26, 3746-3752.
10
25. Moure, C.M., Bowman, B.R., Gershon, P.D. and Quiocho, F.A. (2006) Crystal structures of the vaccinia virus polyadenylate polymerase heterodimer: insights into ATP selectivity and processivity. Mol Cell, 22, 339-349. 26.
Hogg, T., Mechold, U., Malke, H., Cashel, M. and Hilgenfeld, R. (2004) Conformational antagonism between opposing active sites in a bifunctional RelA/SpoT homolog modulates (p)ppGpp metabolism during the stringent response [corrected]. Cell, 117, 57-68.
27.
Carrozza, M.J., Utley, R.T., Workman, J.L. and Cote, J. (2003) The diverse functions of histone acetyltransferase complexes. Trends Genet, 19, 321-329.
28.
Koonin, E.V. (1993) A highly conserved sequence motif defining the family of MutT-related proteins from eubacteria, eukaryotes and viruses. Nucleic Acids Res, 21, 4847.
29.
Bessman, M.J., Frick, D.N. and O'Handley, S.F. (1996) The MutT proteins or "Nudix" hydrolases, a family of versatile, widely distributed, "housecleaning" enzymes. J Biol Chem, 271, 25059-25062.
11
SUPPLEMENTARY FIGURES Supplementary Figure S1. Domain context for NTase fold superfamily proteins. For each group all possible combinations of the domains are shown for selected representative sequences. Sequences are labeled according to PDB code or NCBI gene identification (gi) number. Domain context was derived with SMART, Meta-BASIC and 3D-Jury methods. Supplementary Figure S2. Schematic representation of choline incorporation into cell membrane in H. influenzae. Diagram of H. influenzae operon, which encodes proteins involved in this process is shown in left upper corner. LicD is newly identified NTase fold protein responsible for attachment of phosphocholine to teichoic and lipoteichoic acids in the cell membrane. Supplementary Figure S3. Multiple sequence alignment of human NTase fold proteins. Only conserved regions of the fold core are shown. Sequences are labeled according to ENSEMBLE protein identification number and the number of group they belong to. Alignment presentation scheme is the same as in Figure 2. Supplementary Figure S4. Multiple sequence alignments for NTase fold families. Only conserved regions of the fold core are shown for representative proteins of the families.
12
Supplementary Figure S1
13
14
Supplementary Figure S2
15
Supplementary Figure S3
16
Supplementary Figure S4 Group I COG1746 1r89_A gi|40889617 gi|73619832 gi|18976398 gi|14520321 gi|57641676 gi|20094802 gi|20092365 gi|73669513 gi|91773113 gi|147920408 gi|52550127 gi|116753987 gi|126178273 gi|88604043 gi|154151248 gi|124485196 gi|76800814 gi|15789453 gi|153895332 gi|55378979 gi|110667400 gi|148642113 gi|84488916 gi|15678612 gi|157402334 gi|45358512 gi|150398994 gi|150401678 gi|15669299 gi|16082415 gi|13540948 gi|48477607 gi|126008310 gi|18313990 gi|145591941 gi|119872084 gi|126460342 gi|126354026 gi|70607053 gi|15921188 gi|17380358 gi|146304547 gi|126466123 gi|124026917 gi|156937642 gi|119719285 gi|154241577 gi|118195553
24 24 24 24 23 24 24 29 29 22 24 29 1 24 24 26 23 24 25 25 26 35 20 1 22 45 30 32 40 27 21 21 21 21 25 24 25 24 25 21 21 20 21 28 34 26 32 22 21
GREAEEELRRRLDE----L-G GREAEEELRRRLDE----L-G VSTLMEEIKEKTEETIEEL-N VKTLMSEIEEKARETIEEL-N VKLIMDELRGIAQEVIEES-G VKELIGELEGIAREKAQEL-G VEGFARRILSEVRDRLKER-D LTAVQEELAAEVKAAAEKL-C LFAIQEELASQVKIAAEKL-G LTKVADELMYKIDCLTFKV-G LKNTAREIIGRIDSEAARL-G VKAVTERIINMINQKALEM-G --------------------VRTMGQQLIEAVERSG----MYALADRLIADIRGSG----IAGVARRLLDAINASG----VFEMGQKLIEAVREIA----LQAAVAKLTARAEAAITDL-P LAAAASRLTERAEAAIAEL-P LRTVAVELADRTREAIADL-P LQRVADAVMADAEAAIADL-P MQRVMSQLRNRITTEVESF-P ILKTSDKLISYLNEVCKDE-G --------MNIISNYAKSK-D VMELSDSLVECLNGLAEEQ-G LKVFSDKIIAKIKDISKY--LKIFSDKIISKIKDISKY--LTLFSNAVLHKLKSISEY--LNGFSKKLINELFGQLKKR-N LQLKANEIIDKIWEIVREN-S LKIISDDIIRKINSICRSR-G LKSISEYLTTRASEICRHK-N LKCIENGIIKKLNDIISSR-N LKNIVSAIASDINAICKTE-G IREVTQKVKRLVSQIIEEG-G VKEISQSVKELVSQIVREE-G LKKVTEKVKALIADAINKY-G VAEVASNVKGLVARWVEER-G INDTASEVMHILEGMLHKL-G LEKVAEEVLSRLKG------LRERAQIILDRLKG------IEKVLEIIRERLNK------IQSAVDLVISRLKD------LRKAFNFIKKHVENVLKEN-N ARKAVEEATRLLDAVLKEL-G RDELFVKTKSVLEDVLKKR-G LDLFAERLMAVVLEEANTRAG KEGIAKTALELVKKEIEKF-P KDRTAARALALVESEIAGR-P
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 2) 2) 2) 2) 2) 3) 3) 3) 2) 2) 2) 1) 1) 1) 1) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 5) 3) 2) 2) 2) 2) 2) 2) 2) 2) 2) 0) 0) 0) 0) 2) 3) 2) 2) 2) 2)
VEYVFVGSYARN VEYVFVGSYARN AKPYFVGSLAKD AKPYFVGSLAKN IEVKFVGSLAKD VKPYFVGSLAKD AEVELIGSVARD IFVKMVGSAARG VLVKMVGSAARG VKTQLVGSAARG VHAIHVGSTARD AHALSVGSTARN CRGILVGSAARN AKAMMVGSVARD ADGMMVGSVARK ATGMVVGSIARN VPAMMTGSVARG ADTRLVGSTARG ADVVQVGSTARG ADVVQVGSTARD AEVVQVGSTARG ADVVQVGSTARG ASASLVGSVAKK ITCKLVGSVAKK AEAVLVGSVAKG VDIIQVGSTARD LDIIQVGSTARD LDIIQVGSTARN IDIIQVGSTARN LEVLLVGSSARN AEAVIVGSYAKG ADPVLVGSYAKG AEPVSVGSYSKG ATPLEVGSVSKG ALVDVYGSGARG SEVEVYGSSARG AEIGVYGSSARS AEVQVLGSSARG AEVTLQGSIAHG FDAQIQGSFRKG YNAEIEGSFRKG LDFEVEGSFRKG LDAEVHGSFRKG AKVTLQGSFAHD YEVRVEGSYAKD GEVTLQGSARKG VSVELHGSYRHD IGVEFGGSFAKG TGAEFGGSYPKG
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 5) 5) 5) 5) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6)
LEIDVFLLF LEIDVFLLF HDVDLFIAF HDLDLFIAF HDIDMFLAF HDVDLFLAF SDVDVFCVF HDIDVFISF HDIDVFISF HDLDIFITF RDIDIFLMF TDIDIFIMF RDIDIFIAV RDLDIFMLF RDLDVFMLF RDLDVFLLF KDIDIFMLF RDIDLFVRF RDIDLFVRF RDIDLFVRF RDVDVFVCF RDIDIFVRF SDIDIFISF ADIDIFMAF ADIDIFIHF HDIDIFLRF HDIDIFLRF HDIDIFLRF YDIDIFVRF YDIDIFVLF GDLDIFIAF GDLDLFIAF SDLDIFIVF SDIDLFITF RDIDIFVVL RDIDVFVVM RDIDIFIVL RDIDIFIVL RDIDVFLIF TDIDIFVFY TDIDIFVFF TDVDVFVFY TDVDLFVFY NDLDVFVLF LDIDLFILL LELDVFVLF ADIDLFLLF ADVDIFVRF SDIDIFVRF
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 6) 6) 6) 6) 8) 8) 8) 8) 8) 8) 8) 7) 8) 8) 8)
LRERGL--EIGKAVL LRERGL--EIGKAVL LREKGL--ELGKVLG LREKGL--EAGKVLG LKSKGL--EIAESIG VRERGL--ELGKEIG IVEVTL--EVGREAI LERRGM--EIAREVA LETLGM--AIAREVA LESYGL--SVGRQVA LKLEGL--QLARSVS LKDNGL--ALAKSIS ----AL--EIAREIA LQEQGL--SLAHRIA LEREGL--ALAWSIA LETEGL--ALARSIA LQEKGL--AAAYGVV LETYGL--EVGHAVL LETYGT--TVGAAVL LEEYGL--AVGHAVL LEEYGL--AVGHDVL LEEYGL--KVGSSVL LKKIGL--YLGYKCS LKKYGL--EFGKYCI LKESGL--RLGYGCI LKESGI--RIGKEAI LKELGL--KLGKDVI LKEVGL--RVGKEVI LKNIVL--ECGKTVI LEEIGL--KIGTEAI INTEGL--HIGHAVI MEKLGL--SIGHDLL MESIGL--SLGHYIL IEKKGL--EIGHAVL PEDVVK---ILTSRF PEDVVR---LLTRRF IKDVIT---LLSRYF PEEVVE---SLSKFL LVKSGELVRSLAAAI LREKSL--KELIQLF LREKAL--KELIERF LERNAL--NDIINRI LAREAL--REILDRL LRDKGF--NILLEAA IHAGLI--DQIEEKL IKEVAF--PALLEAA MKEVAR--RVSEGSS FEKISK--KIGFDSL FEDVSK--EVGFAAL
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
19) 19) 22) 22) 22) 22) 27) 22) 22) 22) 19) 25) 19) 23) 23) 23) 23) 19) 19) 19) 19) 19) 23) 23) 23) 23) 23) 23) 23) 23) 19) 19) 19) 19) 23) 23) 23) 23) 23) 21) 21) 21) 21) 22) 23) 22) 24) 21) 21)
VEVDVVPCY VEVDVVPCY IRVDIVPCY VKVDIVPCY YQVDIVPCY VKVDLVPCY FEVDVVPCY FDVDLVPCF FDVDLVPCF FDVDLVPCY FDVDLVPCF YEADLVPCF FDVDLVPCY LDIDLVPCY FDVDLVPCY IDVDLVPCY FDVDLVPCY FDVDCVPCY YAVDLVPCY FDVDLVPCH YAVDLVPCY FEVDLVPCY FEVDFVPCY FEIDFVPCY YEVDFVPCY FNLDIVPCY FNLDIVPCY FSLDIVPCY YDIDIVPCY YEVDIVPCY VKIDVVPCY IKIDVVPCF VKIDIVPAY IKIDIVPAF YEVDIVPCY YEVDIVPCY YEVDVVPCY YEVDVVPCY FNIDVVPCI VEVDVVPAL VEIDVVPAL VEVDIVPAL VEVDVVPAL IRADLVPGY YWVEVVPAC VPVEIVPAF VEVDVVPAY TKINVVPFY TRINVIPCY
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
10) 10) 10) 10) 10) 10) 9) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 9) 9)
AVDRTPFHHKW AVDRTPFHHKW AVDRSILHTHW AVDRSILHTKW AVDRSILHTEW AVDRSILHTKW PVDRTPHHNRY AVDRTPFHNEF AVDRTPFHNDF AVDRTPFHNEF AVDRSPFHNQY AVDRTPFHTEY AVDRTPFHSRY AVDRTPFHTRY AVDRTPFHNRY AVDRTPFHTRY AVDRTPFHTRY AVDRTPFHTEY AVDRTPFHTTY AVDRTPFHDAY AVDRTPFHTRY AVDRTPHHNQY AVDRTILHTKY AVDRTILHTNY AVDRTILHTRY AVDRTPLHNEF AVDRTPLHNEF AVDRTPLHNEF AVDRTPLHNEF AVDRTPLHHKF AVDRTLLHTEY AVDRTLLHTRY TVDRTPLHTKY SVDRTPLHTIY AADRSPLHHKF AADRSPLHHKF AADRTPLHHKF AADRSPLHHQF AADRTPLHTLY AADRTPFHTKF AADRTPFHTKY AVDRTPFHTKY AVDRTPFHTRY AVDRTPFHTQY AVDRTPLHTRY AVDRTPFHTEW PVDRTQLHTAY AADRSPFHTKF AADRSRFHTAF
136 136 145 145 144 145 149 151 151 144 142 153 95 141 141 143 140 142 143 143 144 153 142 115 144 165 150 152 165 150 138 138 138 138 144 143 144 143 149 133 133 132 133 149 157 146 156 141 140
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
437] 437] 449] 453] 448] 456] 485] 454] 454] 446] 443] 454] 401] 457] 460] 460] 464] 448] 452] 462] 466] 465] 456] 428] 454] 459] 444] 446] 464] 449] 431] 437] 427] 427] 419] 418] 418] 418] 455] 416] 413] 412] 412] 474] 484] 436] 467] 443] 442]
17
gi|118431594 gi|41614948
32 20
LGRLYSLVARALEECEDLRLS ( AKKALDYIIPILE----KT-P (
3) YRVELVGSAAKG ( 2) YDVFVGGSYAKG (
6) WEVDVFLLL ( 4) RDIDIFVRF (
6) VRRLGE--SLLRSCL ( 22) MQADVVPAP ( 9) GVERTPFHTRY 8) ISI-----YIE-QTL ( 23) LKIEIVPIL ( 10) VTDISQFHVEW
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5)
4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4)
152 [ 465] 132 [ 392]
Group I COG5186 gi|73537013 gi|71660695 gi|7108951 gi|146180895 gi|145505515 gi|145551783 gi|145546083 gi|118376886 gi|146163985 gi|154417414 gi|147856999 gi|145338191 gi|125601581 gi|19073993 gi|19112154 gi|50557132 gi|126031338 gi|156843263 gi|50304345 gi|146416103 gi|50422465 gi|63101121 gi|149237693 gi|68482965 gi|126139251 gi|68482706 gi|71023817 gi|119494846 gi|154282671 gi|119186325 gi|156037648 gi|46122023 gi|145608864 gi|157069956 gi|116207486 gi|111069056 gi|134107007 gi|114577511 gi|120952595 1q78_A gi|58865618 gi|21064419 gi|66500359 gi|91079612 gi|156405096 gi|72012045 gi|17565368 gi|17536297 gi|17533995 gi|66806183
37 35 35 1152 53 43 65 166 64 38 135 36 39 47 43 42 44 44 44 43 43 43 45 43 43 43 57 46 46 46 46 44 44 45 43 46 47 56 56 57 58 65 61 66 53 51 49 51 56 90
SMENTDEVVRLVRQLLEKML VADRVREAIDLIELTAHRLV PSISPNPVLTLVESMALRIV ETKQKDEKLKQLRQIIRDWI SKKTKDKLIEELRQNIIRWL --RKAKRFKHIQEKIVKKWS NQQKSKDIQAYLGQIVKRWS KEKLKEQIIMKIQEIVNQWM TELSKQRYVNELQEIVTKWS DYQSRKIVVYIIKNIAIKFA EELKRKNVIEKLKEIVLTWV DEVKRRGVINQLRKIVVRWV ADKKREQVIRKLNKIVMDWA EGQTRERVLGKLNFMVREFV ESEKRVKVLDELQQITTEFV ESNKRKKVLLIMQQLAQEYV ETANRVQVLKILQELAQRFV ETAKRVKVLEIFQQLAQEFV ETGKRVAVLEIFQKLTQEFV ATKKRVEVLNTLQKITEEFV ATKKRVEVLHILQKLTEEFV ATLKRVEVLTTLQKLTEQFV ATRKRVKVLNTLQKLTETFV ATKKRVEVLNILQSMTEEFV ATKKRVEVLTQFQKMVQEFV ATKKRVEVLTLFQRLVQEFV ESKIREVILGKLDAMVKDFV ETERRKQVLQLIQRVTHEFV ETERRKQTLQLIQRVTVEFV ETEKRNQMLHLIQRVTVEFV DTDKRTAVLASLQNITEEFV ETAKREEVLASIQIICDAFV ETEKRFAVLRSLQEIADAFV ETDKRWAVLADLQRITDEFV ETEKRKEVLRQLEKITTVFV GNKRREEVLAHVQKVVEEFV ERKVRERLLSNIAQLVAKFV ELNHRLVVLGKLNNLVKEWI ELQRRILVLGKLNTLVKFWI ELQRRILILGKLNNLVKEWI ELQRRILILQKLNNLVKEWI ELNHRMEILAKLNTLVKQWV ELNHRMEILSKLNALVKQWI ELNHRMVILGKLYSLVKQWI ELQQRMIVLAKLNELVKEWI ELNHRLVVLGKLDQIVKDWI ETEQRMEVLRNLNRLVKEWV ETLLRIKVLKNLDGLVKDWV ETSQRVKVLQKLNGCVKQWI ESRKREEILGKLNQIVREWA
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
10) 11) 11) 14) 11) 18) 18) 19) 19) 18) 18) 18) 15) 17) 18) 18) 18) 18) 18) 18) 18) 18) 18) 18) 18) 18) 18) 18) 18) 18) 18) 19) 19) 19) 19) 18) 18) 18) 18) 18) 18) 18) 18) 18) 18) 18) 17) 17) 17) 18)
TQVYVFGSSGLD VRAYPFGSCGLG ARAYPFGSCGLS GELLFFGSYKLK GELAVFGSYRFG AVLLPYGSVLLG AVLLPYGSVLLG ARVLPFGSYFLG AKLLTFGSSILG IFIHENGSYLMD ATILTYGSYGLG ATILPYGSYGLG ATVLTYGSYTLG GKIFTFGSYRLG GKIFTYGSYRLG GKIFTFGSYSLG GKIFTYGSYRLG GKVFTFGSYRLG GKVFTFGSYRLG GKIFTYGSYRLG GKIFTFGSYRLG GKIFTFGSYKLG GKLFTFGSYRLG GKIFTFGSYRLG GKVFTFGSYRLG GKVFTFGSYRLG GKIFTFGSYRLG GKIFTYGSYRLG GKIFTYGSYRLG GKIFAYGSYRLG GRIFTYGSFRLG GRVFTYGSYRLG GKVFAYGSLRLG GRIFTYGSFRLG GRVFTYGSYRLG GKIFFFGSYALG GRIYTSGSYRLG GKIFTFGSYRLG GKIFTFGSYRLG GKIFTFGSYRLG GKIFTFGSYRLG GKIYTFGSYRLG GKIYTFGSYRLG GKIYTFGSYRLG GKIFTFGSFRLG GKIYTFGSYRLG GKLFTFGSYRLG GKLIPFGSYRLG GQLMAFGSYRLG AKIFTFGSYRLG
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
SDIDLIALC SDLDIVLFC SDLDVALIC SDIDTVCLV SDIDLIILA SDIDLICIA SDIDLICIA GDVDLVCIA GDIDMVCTC SDVDVILIA SDIDALCVG SDIDALCIG SDIDALCVG ADIDALCIV SDIDTLVVV SDIDTLIVV SDIDTLVVV SDIDTLVVV SDIDTLIVV SDIDTLVVV SDIDTLVVV SDIDALVVV SDIDALVVF SDIDTLVVV SDLDTLVVV SDIDTLVVV SDIDTLCVV SDIDTLVVG SDIDTLVVG SDIDTLVVG SDIDTLVVV SDIDTLIVA SDIDTLVVA SDIDTLVVV SDMDTLVVA SDIDTLIVV SDIDTICVC ADIDALCVA ADIDALCVA ADIDALCVA ADIDALCVA ADIDALCVA ADIDALCVV ADIDALCVA ADIDTLLVA ADIDTLCVA ADIDTLAVV ADIDSIVVA ADIDTLVLA SDIDTLCVG
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
NTS-LFFQ--------TFPRSAE TAE-VFFR--------EFPPLLQ TTD-IFFD--------EFPRLLY DRDQHFFG--------DLVNILQ DRELHFFQ--------QLPSILS DRREQFFN--------GLFYLLS DRREQFFN--------GLFYLLS DRFKHFNG--------QLYDMIS KRNPHFEE--------ELCQILL TYK-DFFE--------GFYEELI SMADDFFI--------VLRNMLE SIAEDFFI--------SLRDMLK TLQYHFFI--------VLRQILE SRS-DFFT--------HFYEELK SRD-NFFQ--------DLEPMLR TRE-DFFE--------VFERLLR TRE-DFFT--------VFDSLLR TRE-DFFT--------VFDELLR SRE-DFFT--------VFDALLR NRT-DFFT--------VFESLLR TRS-DFFE--------VFDKILR TRD-DFFT--------VFEKILR TRE-DFFT--------EFEKLLR SRN-DFFE--------VFYELLK SRE-DFFT--------VFAEIIR TRD-DFFS--------VFADIIR QRE-DFFS--------VFESMLK LID-DFFS--------DFPPVLE SFE-DFFS--------DFPPILE AIE-DFFL--------DFPPTLE SRE-EYFE--------MFPDLLV TRE-DYFK--------YFPDLLV TRE-DYFE--------HFPGLLV TVK-QYFD--------IFPNLLV TVE-QYFE--------IFPEVLV FIE-DFFK--------IFPSIFR YRE-HFFG--------EFQDMLR ERS-DFFQ--------SFFEKLK ERT-DFFS--------SFYEKLK DRS-DFFT--------SFYDKLK DRN-DFFT--------SFYDKLK ERT-DYFQ--------SFFEVLK YRS-DYFT--------SFFELLK SRN-DFFG--------SFYELLK DRA-DFFS--------SFYDLLK ERS-DFFK--------SFYDIVK DRS-DFFT--------SFKEMLN TRS-DFFN--------SFKDILA TRA-DFFS--------SFKQMLA MRS-DFFD--------DLSDILK
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
23) 22) 22) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 28) 28) 28) 28) 28) 28) 28) 28) 28) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25)
VHVDLLFAS TSVDVVFVS TAVDVVFVS ILFDLSFAR IQFDLVFAC FHIDINFAQ FHIDINFAQ VPIDISFAQ QDIDISFAQ ISVDISFAP ISVDLPYAQ ILVDLPYAQ ISVDFTYAQ IPIDLVFAR ISIDLIFAR ISIDLISAR ISIDLICAR ISIDLICAR ISIDLIFAK ISIDLICAK ISIDLICAR ISIDLLFAK ISIDLIFAK ISIDLIFAR VPLDLIFAC ISIDLIMAR VDIDFTFAR ISIDLIFAR ISIDLIYAR ISIDLIFAR ISIDLIFAR ISIDLIFSR ISIDLIFSR ISIDLIFCS ISIDLIFCS VSLDLLFAS VEVDLLFAR IEIDLVFAR IEIDILFAR IEIDILFAR IEIDILFAR IEIDLLFAR IEIDMLFAK IEIDMLFAR IEVDLLFAR IEMDILFAR VELDILFAR VDIDILFAR IEIDFLFAR IPIDLIYAK
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
26) 26) 26) 26) 19) 29) 29) 27) 27) 26) 26) 26) 38) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 27) 27) 29) 29) 29) 27) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 27)
ATVN-GIRTILE 179 [ 500] HSVN-GLRTVLE 177 [ 485] PSAN-GIRFTFE 177 [ 487] RSFN-GCRVAEW 1301 [1765] RSLN-GLKVSDY 192 [ 487] YALN-GRKNGIL 197 [ 533] YALN-GRKNGIL 221 [ 550] LSIN-GYRCNQA 321 [ 707] LSLS-GRRCNQA 219 [ 641] LSLN-GYLNCLL 190 [ 467] KSLS-GVRANEC 288 [ 604] KILS-GVRANKC 189 [ 507] RSLS-GVRVNEQ 201 [ 461] LSLN-GSRVTDE 198 [ 505] LSLN-GTRVTDQ 195 [ 566] RALN-GTRVTDD 194 [ 518] RALN-GTRVTDE 196 [ 546] RALN-GTRVTDE 196 [ 572] RALN-GTRVTDE 196 [ 598] RSLN-GTRVTDE 195 [ 554] RSLN-GTRVTDE 195 [ 571] RALN-GTRVTDE 195 [ 552] RALN-GTRVTDE 197 [ 587] RALN-GTRVTDE 195 [ 555] RSLN-GTRVTDE 195 [ 556] RSLN-GTRVTDE 195 [ 558] RSLG-GSRVTDG 209 [ 667] RSLN-GTRVTDE 201 [ 603] RCVN-GTRVTDE 201 [ 594] RCVN-GTRVTDE 201 [ 587] RSVN-GTRVTDD 202 [ 573] RSLN-GTRVTDE 201 [ 624] RSVN-GTRVTDE 203 [ 591] RSLN-GTRVTDE 204 [ 632] RSLN-GTRVTDD 202 [ 592] RSVN-GTRVVKE 202 [ 569] RSLNAGPRVTDM 200 [ 643] RSLN-GCRVTDE 208 [ 714] RSLN-GCRVTDE 208 [ 723] RSLN-GCRVTDE 209 [ 514] RSLN-GCRVTDE 210 [ 642] RSLN-GCRVTDE 217 [ 659] RSLN-GCRVTDE 213 [ 615] RSLN-GCRVTDE 218 [ 565] RSLN-GCRVTDE 205 [ 486] RSLN-GCRVTDE 203 [ 670] RSLN-GCRVAEQ 200 [ 655] RSLN-GCRVAEQ 202 [ 554] RSLN-GCRVAEN 207 [ 554] LSLN-GCRVTDQ 243 [ 809]
18
gi|84999896 gi|156082541 gi|68070217 gi|66357032 gi|58220769 gi|157343612 gi|42562141 gi|157338862 gi|79326212 gi|157342728 gi|147790301 gi|125538688 gi|125597568 gi|125549511 gi|145354311 gi|123495483 gi|123480837 gi|146081651 gi|72391414 gi|67481711 gi|71077475 gi|115378897
50 50 118 57 48 53 48 52 58 48 47 59 57 138 57 43 52 46 43 20 111 678
GKKRREDILESLNRLLQQFV GKRKRERVLESLNRVLQQFV ALKKREKVLGMINKLFHEFV GMKKREYVLASLNKLVREWI EAVSREEVLGKLDQIVKAWI EAVSREEVLGRLDQIVKIWV EAVRREEVLGILDQIVKTWI EAIRREEVLGRVDQIVKVWV DTMRREEVLGRIDQIVKHWV EARKREEIIEKLRVVVKSWV EAIKRAEVLDRLGQIVKDWV ESAKREEVLREIDQIVKEWV ESARREEVLGELDKIVKDWV ETAAREEVLRGLRGVVDRWV ECVRREEVLGEINALLQDWV QKTKRQEVLSKIKILINEFV AMKHRQAVFETVSKIVDQFI ESNRRRSVLERIVNVVRVWI EANRRRSVLEHINCIVRAWI QVEKKRKAISKMTEYIQQWG ETRKKEAILRNLELIISDWV AHRARIQAVKRLNEVC----
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
19) 19) 18) 18) 18) 18) 18) 18) 18) 18) 18) 18) 18) 18) 17) 18) 18) 19) 19) 18) 27) 4)
GKLLTFGSYRLG AKLLTFGSYRLG GNLYTFGSYRLG GGIFTFGSYRLG AKIFTFGSYRLG AKIFTFGSYRLG AKIFTFGSYRLG AKIFTFGSYRLG AVIFTFGSYRLG ALIVTFGSYRLG AVLFTFGSYRLG AVLFTFGSYRLG AVLFTFGSYRLG ALVLPFGSYRLG -NLYTFGSYRLG SQLVPYGSYRLG GKLFYSGSYKLG GRIFATGSYRYN GRIFATGSYRYN ATIYVYGSYRLN VRLYPFGSYYLG LELYPYGSFLMG
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5)
SDIDVLCLC SDIDCLCLC SDIDCIFLA SDIDTLCIA ADIDTLCVG ADIDTLCVG ADIDTLCVG ADIDTLCVG ADIDTLCVG SDIDTLCIG TDIDTLCIG ADIDALCIG ADIDTLCVG SDIDALVVG ADIDTLCLG SDIDCIVVA SDIDCVILA SDIDMVLIA SDIDIVLIA SDIDACIVS SDVDTVVIF SDVDAVVIG
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4)
( 2) AELQVYGSMYI ( ( 2) ADLHVFGSYST ( ( 2) ADLHVFGSFAT ( ( 2) TEVHVFGSSAT ( ( 2) AKVHVLGSFTT ( ( 2) AEVTPFGSWQT ( ( 2) AEVYPFGSQET ( ( 2) ADVQIFGSFKT ( ( 2) ADVQIFGSFST ( ( 2) AVVEIFGSFRT ( ( 2) SKVEIFGSFRT ( ( 2) CKVEVFGSFRT ( ( 0) --VEVFGSFRT ( ( 2) SSVLVYGSMYT ( ( 2) CSVVAQGSTGT ( ( 2) VQLRPFGSFAS ( ( 6) GRILCFGSYPA ( ( 2) VDIQAFGSFKT ( ( 2) CEVKTFGSFST ( ( 4) ATVRLFGSCAT ( ( 13) SQVCLFGSCAT ( ( 2) AGVCLFGSCVT ( ( 7) IKVEIFGSSST ( ( 2) YCMIIFGSCNN ( ( 2) AKVILFGSNST ( ( 1) LQVALFGSWST ( ( 2) MRLYTFGSTVV ( ( 2) MRLYTFGSTVV ( ( 2) MRLYMFGSTAV ( ( 15) FDLKCFGSLRN (
6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6)
SDVDVSLKS SDIDCVVTS SDIDCVVNS SDIDMVVIS SDIDLVVCS GDIDLVVAH GDLDLVVVS SDIDLVVFG SDIDLVVFG SDIDLVVLG SDIDLVVIG SDIDVVIFD SDIDLYDPL SDLDITLLD SDIDLIITN ADIDLVLLS ADMDLVYAS ADIDVVMID SDIDMVIVK SDIDIGITG SDIDIGITG SDIDIGITG SDVDIVMSF SDIDICIYN SDIDLSVVI SDMDFAAVS SDVDFVVLN SDVDFVVLN SDVDFVALS ADLDLVMTT
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
6) 6) 6) 6) 6) 7) 7) 2) 2) 2) 2) 5) 12) 4) 6) 16) 16) 6) 6) 8) 8) 5) 5) 4) 4) 49) 20) 20) 20) 9)
TRE-SFFS--------DFYNTLT TRE-SFFT--------DFYNALK TRE-IFFN--------EFYLKLQ SRE-SFFS--------FFLAKLQ TRTEYFFQ--------ALYDMLV TREEDFFG--------ELHKMLS TREGDFFG--------ELQRMLS TRDEDFFG--------ELHRMLA NREEDFFI--------ILHDILA NREEDFFI--------RLHNILI SREEDFFF--------ILHNILA KREEEFFV--------TLYGALS NREEDFFI--------VLHDILA DRDRDFFG--------ALAAALA SREEDFFGWDENDYEGSFYDVMR KRS-DFFN--------TFYEMMI QKE-EFFS--------IFYELLA TRE-HFFN--------TLAPRLK TRE-HFFN--------TLAPRLS TRD-DFYD--------GLYAELL RIA-DFFD--------KFPTIIG SRE-DFAG--------ALLQALT
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25) 25)
ISIDILFAN IDIDLLFAN VDIDLLLAT MEMDLLFAC VSIDLLYAN VSIDLLYAK VSIDLLYAQ VSIDLLYAR IPIDLLYAS VSIDLLYAS ISIDLLYAS LPIDLLYAS ISIDLLYAS VQVDLVYAG FEIDMAYTS IEIDLSFAS IEFDISFAA IDIDLSFGS IDIDLSFGS IEFDLNFSR VDFDLSAAV VHFDVSYAS
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
26) 26) 27) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 27) 26) 26) 23) 23) 26) 26) 26)
RSIN-GCRVASY RSIN-GCRVAAL RSLN-GIRVADL RSLN-GCRVNDM RSLN-GCRVTDK RSLN-GCRVTDQ RSLN-GCRVTDQ RSLN-GCRVTDQ RSLN-GCRVADQ RSLS-GCRVADQ RSLN-GCRVADQ RSLN-GCRVADQ RSLN-GCRVADQ RSLN-GVRVADE KSLN-GCRVADQ RSLN-GRRVNDM RALN-SLRVNNM LSCN-AVRVAHN LSCN-AVRVAHN RAIN-GVRNTDI LSLN-GYRTNIH RSLT-GWADTDA
203 [ 512] 203 [ 512] 271 [ 626] 209 [ 699] 201 [ 748] 206 [ 505] 201 [ 713] 205 [ 408] 211 [ 765] 201 [ 598] 200 [ 778] 212 [ 683] 210 [ 761] 291 [ 648] 217 [ 598] 195 [ 494] 204 [ 508] 196 [ 709] 193 [ 687] 172 [ 522] 272 [ 674] 812 [1080]
Group I COG5260 gi|19075773 gi|151945519 gi|81360384 gi|146419896 gi|50557292 gi|58260578 gi|71005312 gi|149258974 gi|74003095 gi|45554544 gi|156554609 gi|116235017 gi|125571535 gi|157864534 gi|123456846 gi|119482616 gi|111069215 gi|145533334 gi|146184040 gi|145509541 gi|145514940 gi|145541945 gi|66823977 gi|124806252 gi|123504651 gi|72390948 2ikf_A gi|134104550 gi|71405824 gi|83772505
37 196 193 185 266 160 116 198 137 288 63 159 159 421 50 277 419 81 177 628 610 494 163 427 58 133 45 45 25 71
E-LKYRK-LLLEKLQTHIREVV E-IEIRN-KTISTIREAVKQLW E-IKCRN-RTIDKLRRAVKELW E-ITTRN-NVIGRLKSTITKFW E-IKARQ-DLVERVRGAVNGLW E-FEVRL-FMIELITRTINKLW E-HETRC-MVIELISRAIKSQF E-EKMRM-EVVSRIESVIKELW E-AAMRR-EVVKRIETVVKDLW E-HAIRN-EVVKRIEAVVHSIW E-HLLRV-KVIKRIENVIYDLW E-QSSRT-AAVKAVSNVIKHIW E-QSSRT-AAVKAVSN-----E-VTMRR-YIEKDIGRLADRLW E-ELSRY-LTVRKFANFIENLF E-QIVRA-DLITRLQVAFQSRY E-HSARD-KLVQRISNALSSQR E-HRRRE-QAIMRVETFIKEFA E-HEIRL-KSMERLKKILLDAV EILQMRR-LIYDRIQFVINSLF QILKFRR-IIYDRLQFVINYLY QQFPIRQ-LIFNRIQFTIQLLF ---RDRT-IILKRLEEVIKRET E-IYMRR-SVLYNLHLFLKEIY E-KYVRL-RIYKNIQECITKIY A-HRKAR-LVMHDVAQTLYNVH-TRHVD-ATYRLVLDCVAAVD H-TRHVD-ATYRLVLDCVAAVD H-TRRVE-EAYRVVVECITAVD E-FRSKE-TLRIFLTNIAREAL
EKRR---------VTMVL SRNNLY-SL----ASHLK DRNYIY-EL----ARHLK QRSRLY-QL----STHLR ERACIY-QL----SSVIR KQRLLA-EL----GKAMR VQSALR-TM----AACLR ENLPLW-TL----EEALR ERPPLQ-LL----EQALR EKLPLR-TL----EFELV TNLPLH-TL----ERALI PQVGLY-AL----AKALS VILNIL-MD----EIAIQ AEEALT-SL----AKEIS GNHLLK-KL----SKDFW QIYAFA-AF----LKNLE KNAFTK-TL-RKASYRLQ --ELYK-KV----AQSLM --SLYK-KV---ADKIMN LNGPIQ-KI---IEFLQN LNLPIQ-KL---TEFLQK LNQKMD-AI---IEFLSK KRNDIT-KWCYQFSSILR DLINIR-KL-YKNMLHHP IDDTLE-EL-YFIGRYLK YSLALR-TV----AKSMR QADILA-KL----ARVIR QADILA-KL----ARVIR QAEFLG-KL----ARVVR EARCPQ-IL----YKAF-
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
26) 27) 27) 27) 27) 44) 27) 29) 29) 29) 29) 27) 35) 29) 27) 26) 27) 27) 27) 54) 63) 48) 26) 26) 26) 26) 26) 26) 26) 58)
IGVDLTFGIHIDVSFELHIDVSFEIHIDVSFEVHVDISFEINVDISLNLKVDISLNVKVDISFNVKVDISFNVKVDISFNIKVDISFNIAFDISFDLLKRYSFDIDVDISVGYHLDICIGLRVDLSFDLDVDISFEYQFDISFNINFDISFNFSVDISFFQ FSVDVSFFQ IQVDISFFG FHCDITLTLSVDISFNLPIDIGIVT KAVDVSFQVDFDITAYVDFDITAYVDFDITVNIQCGINLS-
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 6) 6) 1) 0) 0) 6) 0) 0) 0) 0) 0)
-NDKACRTAEL -RTNGIEAAKL -RTNGLEAAKL -RSNGIDAAIK -KDGGIKTAST -QANGVTAGKI -HTNGLTTASY -VQNGVRAADL -METGVRAAEL -MQSGVQSAEL -MNNGVKSAEL -MDGGPQAADF -MDGGPQAADF -AVDGKRNSEC -NINGLLNVPR -NDSGLIANNT -NLSGVQAQAT -QMDGLKQIDE -KEDGVKQLSE NWHLGQISTEQ NQHLGLISTDL IQHQGLTSTEL -KDSG--NTGV -QLSALTSTIQ NDLNYLAYMER -S-DGLKTTAF -RRNGVRNSAL -RRNGVRNSAL -RRNGVRNSAL -GCLVLYNTEL
143 307 304 296 377 289 228 307 246 397 172 269 274 532 161 397 547 190 287 780 780 630 277 538 177 283 170 170 150 228
[ 336] [ 584] [ 642] [ 588] [ 716] [ 779] [ 730] [ 680] [ 672] [1001] [ 509] [ 578] [ 583] [ 916] [ 431] [ 701] [ 749] [ 361] [ 463] [ 962] [ 962] [ 805] [ 466] [1172] [ 418] [ 488] [ 353] [ 353] [ 183] [ 417]
19
gi|116196370 gi|111063562 gi|71021859 gi|157349293 gi|41052754 gi|149755241 gi|149693612 gi|56605820 gi|149572607 gi|42565594 gi|125562559 gi|79571331 gi|116059739 gi|126302611 gi|47226027 gi|21361704 gi|21312970 gi|68355704 gi|157113025 gi|156539883 gi|71028114 gi|71032421
254 178 369 415 290 1024 970 173 196 60 61 24 119 176 149 203 206 199 156 671 155 292
E-IMEKE-TFRLMIEKVARAAV E-FEEKE-SFRSRLEGVFQQAF E-YRIKE-ATRRQLERLANRVS E-KAKQK-QLLTLLEKLVSKEW H-KAKQR-QLIESLTNSVSKEW E-DQARE-HIRQNLESFIRQEF E-QHNRE-QILIGLEKFIQKEY D-LKKKE-LCRAQLQREIQLLF D-LKKKE-LCRTELQREIQRIY D-YNTRK-ELVKNLNTMALDIY D-YEQRH-LMIDVFNKIAEEIY D-RDTRI-TVIDQLRDVLQSVE E-DAKRQ-TLMNKFKSMIGSRF E-RQLRS-LVVALMQEVFTEFF E-KKARD-LLVQLLQEVLVEFF N-TKLRY-LTCSLIEDMAAAYF N-IRLRH-LTCSLIEDIAAAYF N-ISLRF-LVCSLLGDIAGAYF G-KRLRF-LAVRQVESSLQGMF E-KYKET-ELLKAIEETVESIY D-LKMRSDRITEFLEKILREKV C-TNEKN-QLFDSVKRFLQYCL
( 17) VQLKCFGSLAS ( 6) ( 12) ISLVGFGSLAS ( 6) ( 2) AKLLAFGSMAN ( 6) ( 2) AQLFLYGSCAN ( 6) ( 2) AQLHLYGSCAN ( 6) ( 2) TKLSLFGSSKN ( 6) ( 3) ARLCLFGSSKN ( 6) ( 2) SRLFLVGSSLN ( 6) ( 2) SRLFLVGSSLN ( 6) ( 7) PVLEAYGSFVM ( 6) ( 6) PVVEAFGSFTM ( 6) ( 4) ATVQPFGSFVS ( 6) ( 2) VRVAPFGSYVS ( 6) ( 2) CVVHPFGSSIN ( 6) ( 2) EGMSDDGRSED ( 3) ( 2) CIVRPFGSSVN ( 6) ( 2) CVIRPFGSSVN ( 6) ( 2) CIIRPFGSTVN ( 6) ( 2) AVAYPFGSSVN ( 6) ( 2) AKAHTFGSRLS ( 6) ( 3) CSVSFFGSAIN ( 6) ( 3) TIVHLTGSTAY ( 15)
SDMDLGILS SDMDLAVVP SDMDLCCLM SDIDVCLAI SDVDVCLQI SDLDVCMTI SDLDICMTL SDGDLCLVV SDGDLCLVV SDLDVSINF SDLDLSVNF GDLDISVDL SDIDISLQI CDLDLFLDL SDIDLSTAT CDLDMFLDL CDLDMFLDL CDVDMILDL CDLDLILDL SDVDIFLDC SDLDVCVQI SDLDIVLLS
( 10) ( 13) ( 13) ( 6) ( 7) ( 10) ( 10) ( 14) ( 18) ( 11) ( 12) ( 13) ( 35) (113) ( 1) ( 30) ( 30) ( 29) ( 28) ( 12) ( 9) ( 13)
GSMIPR-LI----EKAFDHDIPR-LL----EQAVASELVE-IL----GQLIR KSEFLL-KL----ADILQ IAELLL-AL----AETLR CVRTIE-EL----ARVLK CKEIIE-SL----AKILK ARHILT-LV--HKHFCTR ARYILS-LV--QNHFSTR KLEILK-RF----AKKLR KISVIR-NL----AKVLY KQTLLG-HL----LRALR RAQLLR-KV----ASELR GAAMLE-LV----GSILR -AEVLD-LV----ATILK TQKILS-VL----GECLD TQKILS-VI----GECLD TQSILS-VV----GKCVD VQRQLE-SI----GDVLQ SQHYLM-SV----KKNFE -IRNLR-RI----SSVLT VYSELT-NM----QHVLK
(413) (163) ( 31) ( 26) ( 26) ( 27) ( 27) ( 26) ( 26) ( 30) ( 30) ( 27) ( 26) ( 28) ( 28) ( 28) ( 28) ( 28) ( 28) ( 28) ( 48) ( 33)
SDVDVFIDC SDVDFVVLN SDVDFVVLN SDIDVCLAI SDIDVCLAI SDGDLCLVV SDGDLCLVV SDADLCLVI SDVDMCLHV SDMDLCLMI SDVDMCLCY SDLDVCMTI SDLDICMVL SDLDICMTL SDLDICMTL CDVDMSLSF SDIDLVVFG SDIDLVVFG SDIDLVVIG SDIDLVVIG SDIDLVVLG SDIDLVVYY SDIDCVVNS SDIDCVVNS SDIDCVVTS SDIDMVVIS SDIDMVVLS SDIDAVILSDIDLYDPL GDIDLALFL
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
YSEIGSKEVSQAYIQTTKNQFC QADILA---------KLARVIR QADILA---------KLARVIR KSEFLL---------KLADILQ KSEMLL---------KLAEILE ARHILT---------LVHKHFC ARHILT---------LVHKHFC AVYVLS---------LVRKLLY AIYRLE---------QIMMCLR AVVVLN---------LILSTLQ SAKVLR---------KLDKAIR CVRTIE---------ELARVLK CIALIE---------SLARLLR CKEIIE---------NLAKILK CKEIIE---------NLAKILK SDRVMR---------AVAKALV --P-LQ---------LLEQALR --P-LW---------TLEEALR --P-LH---------TLERALI --P-LR---------TLERALL --P-LR---------TLEFELV --RLLH---------ELQNELV DRNYIY---------ELARHLK NRQYLY---------ELARHLK SRNNLY---------SLASHLK QRSRLY---------QLSTHLR NRSRLY---------QLSSFLK PGVCLK---------ALAIALA PQVGLY---------ALAKALS ILTIMT---------HIRECLK
VQCDINFSIQCDINFEIACDIGFEISCDICINLSCDICVNLEVDISLYLEGDISLYVEFDLNVNVEFDLNVNVECDLSVEVECDISVEISCDISIDVACDVCIELHGDVSLSLQGDITTNFQCDLTTNFQCDLTANFQCDLTANLEIDLTMNLKCDISFIPSIDISVNLLCDISVN-
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
-AHLGVQNTLL 769 -NPLGIHNTHM 441 -NRLALENTRL 491 -NVLAVVNTKL 525 -NLFAVANTKL 401 -NTLALHNTRL 1139 -NTLAQHNTRM 1086 -NTVGIRNTFL 293 -NVVGIRNTFL 320 -NKDGILNSQI 184 -NKDGVSRSMI 185 -NLDGLLKSRF 144 -ND-GVYKSAV 257 -NRLALHNSRF 395 -NRLAVRNTRF 252 -NRIALTSSEL 339 -NSIALKSSEL 342 -NKVAMKSSEL 334 -NMTGVYMSEL 290 -NGLSVEKSKL 789 -NDLAIVNSIL 291 -TLYPLLHTEL 426
[1123] [ 717] [1174] [ 720] [ 597] [1501] [1647] [ 484] [ 502] [ 682] [ 565] [ 511] [ 761] [ 874] [ 475] [ 582] [ 585] [ 580] [ 564] [ 977] [ 487] [1230]
Group I KOG1906 gi|156539885 2ikf_A gi|134104550 gi|147782453 gi|18406841 gi|19527122 gi|114599472 gi|66472546 gi|156538415 gi|32563609 gi|157748691 gi|76651813 gi|47226593 gi|119890207 gi|114556617 gi|17554126 gi|5902142 gi|109128463 gi|156554609 gi|66557991 gi|45554544 gi|24649854 gi|1050861 gi|50294195 gi|6324457 gi|146419896 gi|149244754 gi|116058666 gi|125527217 gi|123439840
640 45 45 415 451 173 173 179 340 566 478 1021 630 470 968 146 7 149 63 138 288 45 193 204 196 185 240 144 159 57
YKEQ--KLLESAEETVRSFY HTRHVDATYRLVLDCVAAVD HTRHVDATYRLVLDCVAAVD EKAKQKQLLTLLEKLVSKEW ELEKQRQLMAHLENLVAKEW DLKKKELCRAQLQREIQLLF DLKKKELCRTQLQREIQLLF DLEKKESCRAALQTDIQKIF TFCHKMYLWRYLFGFIKSRF MLQRKLHLRDMLYTAISPVF VLDMKLDARRMLHREFQRLF EDQAREHIRQNLENFIRQEF EMGVRELILKDLETFIRRQL EQHNREQILIGLEKFIQKEY RDQPGQHASQNLEEMGKKDF DLF--HRFALEMQVHLSACF EAAMRREVVKRIETVVKDLW EEKMRMEVVNRIESVIKELW EHLLRVKVIKRIENVIYDLW EHSLRIRVVKRIEQVIYDLW EHAIRNEVVKRIEAVVHSIW EFCLRAGAVRRIEDVVLSIW EIKCRNRTIDKLRRAVKELW EIETRNKTIAKIRRSVKRLW EIEIRNQTISTIREAVKQLW EITTRNNVIGRLKSTITKFW EIVIRNKVVNTLKTQIALFW EASSRTAAVERVRDVVKGIW EQSSRTAAVKAVS------DIAVRRYIVDIICTRIRQFF
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 4) 2) 2) 3) 2) 3) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 0) 7)
VKAYAFGSRTSMRLYTFGSTVVY MRLYTFGSTVVY AQLFLYGSCANAKLYLYGSCANSRLFLVGSSLNSRLFLVGSSLNAKVFLGGSSLNYGLYMVGSTLNSGLYVVGSSLNVMMQITGSTINTKLSLFGSSKNARLQLFGSSKNARLCLFGSSKNSQLCLFGSSKNVVLDIYGSTRNADVQIFGSFSTADVQIFGSFKTSKVEIFGSFRTSKVEVFGSFRTAVVEIFGSFRTASVDLFGSFRTADLHVFGSFATADLQVFGSYATADLHVFGSYSTTEVHVFGSSATTEAHVFGSSATARFEVHGSFAT-NVEVFGSFRTVIILPCGSCLN-
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6)
3) 20) 20) 6) 6) 14) 14) 10) 11) 10) 19) 10) 10) 10) 10) 10) 5) 5) 5) 5) 5) 5) 6) 6) 6) 6) 6) 6) 44) 6)
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
29) 26) 26) 26) 26) 28) 28) 26) 26) 28) 35) 27) 27) 27) 27) 26) 29) 29) 29) 29) 29) 29) 27) 27) 27) 27) 27) 27) 27) 26)
LKCDISFLVDFDITAYVDFDITAYISCDICINISCDICINVEFDLNVNVEFDLNVNVEFDLNFNLDVDLNYNITVDLNANMEVDINVNLEVDISLYLEGDISLYLEGDISLYLEGDISLYMEADISYKVKVDISFNVKVDISFNIKVDISFNIKVDISFNVKVDISFNIRFDVTFNLHIDVSFEIHIDVSFEIHIDVSFEIHIDVSFEIHVDISFEHQFDISFDIAFDISFDIHIDISID-
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 1) 0) 0) 0) 0) 0) 0) 0) 0)
-SGLGVEKSKL 757 -RRNGVRNSAL 170 -RRNGVRNSAL 170 -NVLAVVNTKL 525 -NVLAVVNTKL 561 -NTVGIRNTFL 293 -NIVGIRNTFL 293 -NTVGIRNTFL 293 -NVVGIRNTHL 455 -NSVAIRNTHL 682 -NIAGIYNSHL 612 -NTLALHNTRL 1136 -NTLALHNTRL 745 -NTLAQHNTRM 586 -NTLAQHNTRM 1083 -NDLALHNTQL 259 -METGVRAAEF 116 -VQNGVRAADL 258 -MNNGVKSAEL 172 -MNNGVKSAEL 247 -MQSGVQSAEL 397 -VASGVQAADL 156 -RTNGLEAAKL 304 -RSNGLEAAKL 315 -RTNGIEAAKL 307 -RSNGIDAAIK 296 -RKNGLDAARR 351 -VANGPASAEI 254 -MDGGPQAADF 298 -ELHGPLSVNP 172
[ 947] [ 353] [ 353] [ 720] [ 764] [ 484] [ 441] [ 489] [ 671] [1113] [ 802] [1498] [1066] [1146] [1651] [ 508] [ 542] [ 631] [ 509] [ 539] [1001] [ 407] [ 625] [ 626] [ 584] [ 588] [ 664] [ 555] [ 626] [ 346]
20
gi|123440101 gi|119188673 gi|154272285 gi|46127561 gi|157344863 gi|156096154 gi|82596357 gi|154339183 gi|157871013 gi|119479751 gi|19114069 gi|108706800 gi|125586178 gi|2642156 gi|66804901 gi|67480509
58 369 262 394 878 400 295 181 141 53 68 43 36 24 356 1
EKHLRYLVIKRFRVAINQLW EDVIRTDLITRFERLMQNRF EHSIRNDLVERLQRHFERRH EQRIRDNLVENLRKAMRRDG NMIRKPYINWAVKRVTRSLQ EKLLKQKALIKLEIVVKSLF EKLLKLKSVIKLEMIVKSIY ERQTKLRVIDDVRATIQQSG DRETKLRVIDDIRTTMQRAG SDDRRRQLVRKLEKLFNDQW GLERRYAFVQKLEQILKKEF SERRRRAVYDYVRRLITNCL SERRRAAVVGYARRLVGTAL DRDTRITVIDQLRDVLQSVE RVSKKEKSFNRLEMFLSNKF ---MRYEVMQRIEQVLNQNY
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 5) 5) 2) 2) 0) 0) 4) 4) 1) 1) 4) 2) 4)
AKVICHGSTATSQLHAFGSYASSQLCAFGSFASASVHPFGSFMSSRTNIFGSNATATMQPFGSFVTCKMEIFGSFVTMDIEIYGSLYTMDIQIYGSLCTIKVHVFGSSGNIKTSLFGSTQSCQVFTFGSVPLCEVFAYGSVPLATVQPFGSFVSSSIQLYGSFLTFRAQVYGSTDY-
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6)
GDLDFCVLG ADVDLVLLS ADMDLVLCS ADMDLVVCS SDVDLVICL SDLDVCFLG SDIDVCFMD SDVDCVLMR SDVDCVLML SDVDICITT SDIDLCIIT GDIDVTAFS GDVDLTVLG GDLDISVDL SDLDVNFKI GDLDICCSS
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
5) 13) 13) 13) 21) 4) 4) 37) 37) 4) 4) 8) 7) 13) 5) 6)
DEDLLT---------ELNDHLQ KIKDIY---------SLTAYIR RPREIH---------SFAAYLK AKSWLY---------KFQKFLV KETCLQ---------HAARYLA DLDALL---------IISYALV EIETLT---------IIGYVLI LSVALR---------TVADRMR LSTAVR---------IVAERMR L-EHVC---------LLAEVLA CAPTTC---------EVSAAFA WANLVR---------DALEHEE LIDDIY---------HILQSEE KQTLLG---------HLLRALR QVTHLE---------VVSKYLE SAIILE---------SFAECFK
NTKLRYLTCSLIEDMAAAYF NIRLRHLTCSLIEDIAAAYF NTQLRYLVCSFIEDIAAAYF NIKLRYLACSLVRDFARAYF NSRLRFLVCSLLRDLAATYF NISLRFLVCSLLGDIAGAYF GVRMRFLAALQVQQAISGMF GKRLRFLAVRQVESSLQGMF GVRLRFYTAYQIECCFHGLF DLKKKELCRTQLQREIQLLF DLKKKELCRTELQREIQRIY DLKKKDICRAELQREIQQIF DLEKKESCRAALQTDIQKIF MLQRKLHLRDMLYTAISPVF KYLEKMQLWRDLYISIKKGF VYRNKMMLWRYLYVYIKTAF KFKIKMRLWRFLLLWMAPMF TYTSKLHLWKSIFLFFRM-L VLDMKLDARRMLHREFQRLF EISKQKQLLATLSRLINKEW EKAKQKQLLTLLEKLVSKEW ELEKQRQLMAHLENLVAKEW HKAKQRQLIESLTNSVSKEW EENRGKSLLIRLQNLVSKIF EFKEKRAALDTLRLCLKRIS SESRRRRLVRKLEDLFNRQW SDDRRRQLVRKLERLFNEQW SEQRRIKFVKKLENLLNTQW TDERRRKLVLKLEDMFNKEW TDERRRRLVSKLEDMFNKEW VKANRDKLIKKLEKMFNDQW VEENRKKLVSKLEKIFNDEW NTAVRNKFVAKVQRILETEF GLERRYAFVQKLEQILKKEF EVSRRQQFVDKLRTILSTEI TRHVDATYRLVLDCVAAVD-
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 4) 2) 2) 2) 2) 4) 2) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 2)
CIVRPFGSSV-N CVIRPFGSSV-N CTIKLFGSSV-N STVKPFGSSV-N CTIKPFGSSV-N CIIRPFGSTV-N AQAHPFGSSV-N AVAYPFGSSV-N IAVLPFGSSV-N SRLFLVGSSL-N SRLFLVGSSL-N SRLYLVGSSL-N AKVFLGGSSL-N SGLYVVGSSL-N YSLYLVGSTI-S YGLFLVGSTM-N YRIWLVGSTI-T YGLYLVGSTM-S VMMQITGSTI-N SKLYLYGSCA-N AQLFLYGSCA-N AKLYLYGSCA-N AQLHLYGSCA-N VKLHLFGSSA-N AELVAFGSLE-S IKVHVFGSSG-N IKVHVFGSSG-N IKVHVFGSSG-N IRVHVFGSSG-N IRVYVFGSSG-N IKVHLFGSSG-N IRVNLFGSSG-N FKVSIFGSSG-N IKTSLFGSTQ-S LDLFVFGSTE-N MRLYTFGSTVVY
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6)
CDLDMFLDL CDLDMFLDL CDVDMFLDL CDVDMFLDF CDLDMILDL CDVDMILDL CDLDLILRF CDLDLILDL CDLDLSVIF SDGDLCLVV SDGDLCLVV SDADLCLVL SDADLCLVI SDMDLCLMI SDVDMCLVS SDVDMCLLV SDIDMCLLG SDIDICLLT SDVDMCLCY SDIDLCLSI SDIDVCLAI SDIDVCLAI SDVDVCLQI GDIDICMVI SDMDLCVLM SDVDICITT SDVDICITT SDVDICITT SDVDICITT SDVDICITT SDVDICITT SDVDICITT SDVDICIQT SDIDLCIIT SDVDVCIIT SDVDFVVLN
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
29) 29) 29) 29) 28) 28) 31) 27) 25) 13) 17) 9) 9) 9) 11) 9) 21) 9) 18) 5) 5) 5) 6) 6) 5) 4) 4) 4) 4) 4) 4) 4) 4) 5) 5) 20)
ATQKILS-VLGECLD ATQKILS-VIGECLD ATQKILS-VIGECLD ATQKILS-IIGDCLD VTQSVLS-VIGESLD VTQSILS-VVGKCVD QTQRHME-CFGDMLH QVQRQLE-SIGDVLQ QMKRLME-TVADTMN EARHILT-LVHKHFC EARYILS-LVQNHFS EARHILS-LLHKHFY DAVYVLS-LVRKLLY DAVVVLN-LILSTLQ EALLNLS-LVKEYFM EAIGHLE-QILKCLK EALIILN-LFQSVLK DSLHHLD-YLQHALL QSAKVLR-KLDKAIR SKVDIIL-KLAHILH NKSEFLL-KLADILQ NKSEMLL-KLAEILE NIAELLL-ALAETLR TSDVIIE-RLAEMLK -SDTIAL-QFYEELI --LEHVC-LLADVLA --LEQVC-LLAEVLA --LEHVC-LLADFLA --MEGVC-MIAELLA --MEGVC-MIAELLA --LENVC-MIAQLLQ --LEGVC-MIANLLA --LEEMH-MLAEALD --APTTC-EVSAAFA --LNSTC-QLAQLLY QA-DILA-KLARVIR
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
26) 29) 29) 29) 85) 27) 27) 27) 27) 26) 26) 28) 28) 27) 27) 26)
FKIDISINLKVDLSFDLKVDLSFDLKVDVSFEVRIDISFKVQVDVCTNANVDICINVKVDMSFEVKVDLSFELACDMNVNLSCDCNINIVVDISFNIVVDISFNISCDISIDYHFDMSCNVNIDLSFN-
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 2) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
-NENGVLNIKR 167 -NNSGLAANRT 489 -NSTGITANKT 382 -NLGGVNAIDT 517 -SHTGLQTTEL 1067 -QLSSRQTTDF 509 -QKSSKESTDF 404 -KGGCVSSNYL 321 -QSGCVSSNYL 281 -NTLALENTRM 162 -KTISTLNTRL 178 -QVGGLCTLCF 156 -QTGGICALCF 148 -NLDGLLKSRF 144 -SSKAYFNSLL 466 -QPVAQIHSEF 110
[ 439] [ 785] [ 673] [ 708] [1236] [ 808] [ 600] [ 519] [ 479] [1008] [ 578] [1316] [1104] [ 474] [1677] [ 287]
Group I KOG2277 gi|74753002 gi|148691104 gi|149634744 gi|118085635 gi|47225120 gi|68355704 gi|125983372 gi|157113025 gi|156544415 gi|114599470 gi|149572607 gi|133919900 gi|66472546 gi|32562829 gi|118795258 gi|66553051 gi|125981601 gi|91076532 gi|39584413 gi|115481348 gi|147782453 gi|18406841 gi|41052754 gi|66816699 gi|19115813 gi|67901522 gi|83772230 gi|119182218 gi|156058866 gi|154308271 gi|149210899 gi|46105240 gi|111057711 gi|19114069 gi|19112002 2ikf_A
203 216 203 197 173 199 184 156 179 173 196 157 179 489 122 304 439 211 478 65 415 451 290 769 61 53 53 164 59 291 445 300 288 68 64 46
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
28) 28) 28) 28) 28) 28) 28) 28) 28) 28) 28) 27) 26) 28) 28) 26) 26) 26) 35) 26) 26) 26) 26) 26) 30) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26)
FQCDLTTNFQCDLTANFQCDLTANFQCDLSVSFQCDLTANFQCDLTANLEVDLSMSLEIDLTMNVECDLAMTVEFDLNVNVEFDLNVNAEFDLNVNVEFDLNFNITVDLNANIEVDLNFNLEVDLNCNIEVDLNFNFEIDLNCNMEVDINVNLSCDICVNISCDICINISCDICINLSCDICVNLSCDICMNFQCDIGFNLACDMNVNLACDMNVNVACDMNVNLFCDMNVNLLCDMNVNLACDMNVNLACDMNVNLSCDMNVNLSCDCNINIHCDLNINVDFDITAY-
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
-NRIALTSSEL -NSIALKSSEL -NRIALKSSEL -NSIAIRCSEL -NRVAVKSTEL -NKVAMKSSEL -NLSGFYMSEL -NMTGVYMSEL -NMSAYYMSEL -NIVGIRNTFL -NVVGIRNTFL -NVVGIRNTFL -NTVGIRNTFL -NSVAIRNTHL -NCVGIRNTHL -NAVGIRNTHL -NCVGIMNTYL -NIVGIQNTRL -NIAGIYNSHL -NLLAVVNTKL -NVLAVVNTKL -NVLAVVNTKL -NLFAVANTKL -NRLAIYNTRL -NRLAIHNTLL -NTMALENTRM -NTLALDNTRM -NTMALENTRM -NTLALENTRM -NTQALENTRM -NTLALENTRM -NTLALENTRM -NVAALENTRM -KTISTLNTRL -NDVAKINTKM -RRNGVRNSAL
339 352 339 333 308 334 322 290 311 293 320 272 293 605 240 418 565 324 612 175 525 561 401 882 174 162 162 273 168 400 554 409 397 178 174 170
[ 582] [ 595] [ 579] [ 576] [ 540] [ 580] [ 615] [ 564] [ 550] [ 484] [ 502] [ 466] [ 489] [1036] [ 445] [ 669] [ 758] [ 524] [ 802] [ 372] [ 720] [ 764] [ 597] [1090] [ 405] [ 999] [ 493] [1069] [1017] [1246] [1474] [1289] [1299] [ 578] [ 478] [ 353]
21
gi|73946411 gi|118104149 gi|47226593 gi|125817787 gi|156392397 gi|118094562 gi|109476883 gi|114556629 gi|93003164 gi|110735731 gi|7019641 gi|17554128
1018 1006 630 739 74 932 954 897 794 24 60 1026
EDQAREHIRQNLESFIRQEF EDQAREHIRQNLENFIRLEF EMGVRELILKDLETFIRRQL ELKVREHILQDFESFLRCQV EGQFRQEVLRNLEDYIREVY EQQNREQILASLERFIRKEY EQHNREQILIGLEKFIQKEY RDQPGQHASQNLEEMGKKDF EVQERNKICEALMNYIQRKY DRDTRITVIDQLRDVLQSVE DYNTRKELVKNLNTMALDIY RLKMLDHKIDELQSFLRKNY
( ( ( ( ( ( ( ( ( ( ( (
3) 2) 2) 2) 2) 3) 3) 2) 3) 4) 7) 3)
TKLSLFGSSK-N TKLNLFGSSK-N ARLQLFGSSK-N AKLVLFGSSK-N ACLYLFGSSV-N ARLCLFGSSK-N ARLCLFGSSK-N SQLCLFGSSK-N CQMNLFGSSR-N ATVQPFGSFV-S PVLEAYGSFV-M VTLTTFGSVM-T
( ( ( ( ( ( ( ( ( ( ( (
6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6)
SDLDVCMTI SDLDICMTM SDLDICMVL SDLDICMTL SDLDICMTL SDLDICMTL SDLDICMTL SDLDICMTL SDLDICMTF GDLDISVDL SDLDVSINF SDIDICLRF
( 9) DCVRTIE-ELARVLK ( ( 9) DSSMQLI-KLLHLVT ( ( 9) DCIALIE-SLARLLR ( ( 9) DSMAIIE-SLAKALR ( ( 8) DPIKVIH-DLSKKLK ( ( 9) NCKEIIE-GLAKVLK ( ( 9) NCKEIIE-NLAKILK ( ( 9) NCKEIIE-NLAKILK ( ( 9) DFVSIIT-DVAKCLR ( ( 12) QKQILLG-HLLR--- ( ( 10) KKLEILK-RFAKKLR ( ( 10) TAKEVIQ-KTESVLR (
-MKPPFQEALGIIQQLKQHG -MKPPFQQALGIIRQLNRHG --MKRFKRAGAIIETLKEHG NMNPVFLKAAPVLEKIETAG -MLPPFQTALPIIRTLEDAG -MEKVFIKALPVLRILIEAG -MEDLFLKALPLLRELKKHG TIPNEFKEAAPVIREINAQG QLPLEYQKAIPVLKKLENAG QLPAEFQAAKPIIETIEAAG HLPEEFEMARPVLQTIEQAG NIPDEFKKALPILEKIREAG YLPSEFQKALPVLEKIKAAG NLPSEFQEALPILEKIKAAG TMPSEFQKALPILTKIKEAG NLPNIFTKAMPVLQRLEDAG NLPEVFTAALPVLKEINEAG RITEVFTQAMPVLEKLEEAG QLPQEFIDAQPILTKLEDAG QNNKEWQTAYSVIEQLEQAG -DKSLFEQARPILEQIQDNG RNMKMLESANQIIQTIEAAG MEIRMPENAKKVIETLEAAG --FEFPENAGYVIKKLNEAG MKIELPRKVVLIIKNLQRHG KNIEMPKNVALIIDRLLENG INIEIPKKVDYIIKELEKNG IKINMPKEVKYIIDILEEHG IKIEIPKGVKYIINTLQENG IKIQIPKAVEQILDIFSING HRADIPRPILDVLQRLRELG EQARFPEAILDVLRRLAAAG MRYPVSKKMSEIASVFFNAG M--PIKQNAIEIVKTLQDKG FTPTLPKEVLYILESLQSAG FQSLFTEGLKSLTELFVKEN NLGKNNQNIIKIGKIFKKNN -MIKLKDEPLQVIKRLNQHN IPKIFLEDILHITQIIRKEG QKAQFSRYAVNIVERLQGAG
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
YDAYFVGGAVRD YEAYFVGGAVRD HEAYFVGGSVRD YEAYFVGGSVRD FQAFFVGGSIRD HQAYFVGGAVRD WQAYFVGGAVRD FEAYFVGGSVRD YEAYFVGGSVRD YEAYFVGGCVRD YEAYFVGGSVRD FEAYFVGGSVRD FEAYFVGGSVRD YEAYFVGGSVRD YEAYFVGGSVRD FEAYFVGGSVRD YEAYFVGGSVRD FEAYFVGGCVRD FEAYFVGGSVRD FEAVIVGGAVRD FEAYYVGGSVRD GEAYIVGGAVRD YEAYIVGGCVRD YDAYLVGGCVRD YDAYAVGGCVRD YEAYMVGGCVRD YEAYIVGGCVRD YEAYAVGGCIRD YEAYIVGGAVRD YEGYIVGGCVRD FAVFLVGGCVRD HRSWLVGGAVRD FSAYLVGGAVRD FDALFAGGCVRD YEAYIVGGCVRD HELRIAGGAVRD YELYLVGGALRD FKAYLVGGCLRD GECYLVGGSVRD YQAYLVGGCVRD
( 7) GDVDIATSA ( ( 7) GDVDIATSA ( ( 7) GDIDIATSA ( ( 7) SDVDIATSA ( ( 7) HDVDIATSA ( ( 7) GDVDIATDA ( ( 7) GDIDIATDA ( ( 7) HDVDIATSA ( ( 7) HDVDIATSA ( ( 7) HDVDIATSA ( ( 7) HDVDIATSA ( ( 7) HDVDIASSA ( ( 7) HDVDIASSS ( ( 7) HDVDIATSS ( ( 7) HDVDIATSS ( ( 7) HDIDITTSA ( ( 7) HDVDIATSA ( ( 7) HDVDIATSA ( ( 7) HDVDIASSA ( ( 7) HDVDVATNA ( ( 7) HDIDITTSA ( ( 7) GDYDLATSL ( ( 7) GDWDITTAA ( ( 7) YDWDITTNA ( ( 7) EDWDITTSA ( ( 7) KDWDITTNA ( ( 7) NDWDITTSA ( ( 7) HDWDITTDA ( ( 7) NDWDITTSA ( ( 7) QDWDICTNC ( ( 7) KDFDVASSA ( ( 9) TDFDVATPA ( ( 7) KDYDIATDA ( ( 7) ADYDIATNA ( ( 13) NDYDITTSA ( ( 7) QDIDFATTA ( ( 7) CDFDFATNA ( ( 7) QDFDIATDA ( ( 7) KEFDLTTSL ( ( 7) KDFDVATSA (
27) 29) 27) 27) 27) 27) 27) 27) 27) 30) 30) 30)
LEVDISLYLEVDISLYLEGDISLYLEGDISLYREGDISLYLEGDISLYLEGDISLYLEGDISLYLEGDISLYISCDISIDVECDLSVEIDVDISYY-
( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
-NTLALHNTRL -NTLALHNTRL -NTLALHNTRL -NTLALHNTQL -NTLALENSRM -NTLAQHNTRM -NTLAQHNTRM -NTLAQHNTRM -NLLAQKNTAM -NLDGLLKSRF -NKDGILNSQI -NILAIYNTAL
1134 1123 745 854 188 1048 1070 1012 910 144 184 1146
[1496] [1543] [1066] [1207] [ 418] [1608] [1618] [1573] [1410] [ 511] [ 690] [1425]
113 113 112 117 113 113 113 118 118 118 118 118 118 118 118 119 118 118 118 115 114 152 141 118 114 116 118 123 123 122 120 123 114 111 160 167 115 112 126 159
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
Group II COG0617 gi|73619752 gi|138895759 gi|152975060 gi|89099892 gi|124522747 gi|16079302 gi|52080753 gi|29376121 gi|69250632 gi|28378528 gi|116333429 gi|90961683 gi|116627321 gi|81096799 gi|94990240 gi|58337270 gi|42519026 gi|104774167 gi|116617867 gi|126654221 gi|49483646 gi|68054095 gi|156862915 gi|118725182 gi|154505528 gi|150016764 gi|126700083 gi|28210893 gi|153938706 gi|106893666 gi|19572329 gi|86157905 gi|42526543 gi|91199952 gi|32267207 1ou5_A gi|111115537 gi|146297248 gi|24214670 gi|77461030
1 1 1 4 1 1 1 5 5 5 5 5 5 5 5 6 5 5 5 2 2 39 28 7 1 3 5 6 6 7 7 8 1 1 41 51 2 1 13 34
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
LPEDVM-AIFPKTI LPDEVM-AIFPKTI LPEEVM-KLFPKHV TPMEIK-GIFPRTA KPEEVK-ALFKHTV APDQVE-RLFQRTV SPEEIE-AIFPKTV YPEEIK-QIFKRTV FPEEIK-QLFPKTI FPAEVK-QLFKRTV YPDEIK-HLFKRTV YPEEIK-QIFSKTV YPEETK-QIFERTV YPQETK-QIFSRTI YPEETK-AIFNRTV YPEEVK-ELFEKSI YPMEIK-QIFKKTI YPEEVK-ETFAKSI FPEEVK-SLFHNTV LPEEVK-TVFQRTV TPDEIE-SIFSHTI LPEAVM-HLFPVVI RPEQVK-ALFRRTI LPIDIK-SIFDRTY KPEQVK-RIFRRTV KPLEVV-ELFDKVI RPEVVV-ELFEKTI KGEQVI-KIFKSLD NPQEVV-NIFENLG TPEKMM-ELLSGFK LPQEVQ-GAFKKVI TPQQVM-ALFRRVI EPKEVQ-ALFRKTI LPHDVM-NLFKKTF LPQEVM-KLFAHTI TPTQMK-EMFQSAG TPEEII-KLFPNNI KPEDVM-KLFEKTI LPEKIL-SLFKRTI TPEQVR-AEFRNAR
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 19) 19) 17) 15) 15) 15) 15) 15) 20) 15) 15) 15) 15)
KAYEVTTFK TAYEVTTFR EPYEVTTFR DSYEVTTFR NQYEITTFR ETYEVTTFR ESYEVTTFR QQYEVTTFR EQYEITTFR NGYETTTFR TGYETTTFR TGYEITTFR GEYEVTTFR KEYEITTFR GEYEITTFR ESYEITTFR ESYEITTFR SSYEITTFR TGYEITTFR GPVEVTTYR ENYEVTTFR VPFEVTTFR EAYEVTTYR MCLEVTTYR DGFEVTTYR ESYEITTYR EPFEVTTYR IGYEITTYR IGYEVTTFR VGYEVTTYR NHVEVTTFR EKVEVTTFR EKIECTTFR HNFEVATFR QSYEVTTFR ENFEITTLR KIFEITTYR VKIEVTTFR RAYEITTFR EIIEVATFR
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 19) 18) 19) 17) 19) 18) 19) 31)
LEE-DLKRRDF LEE-DLKRRDF LEE-DLKRRDF LDE-DLKRRDF LLE-DLKRRDF LEE-DLKRRDL LKE-DLKRRDL LKE-DLKRRDF LSE-DLKRRDF LKE-DLKRRDF LAE-DLKRRDF LEE-DLKRRDL LEE-DLKRRDF LEE-DLKRRDF LEE-DLKRRDF LDE-DLKRRDF LSE-DLKRRDF LDE-DLKRRDF LEE-DLKRRDF LQD-DLQRRDF LYE-DLQRRDF LEE-DLTRRDF LEE-DLKRRDF LRE-DLARRDF LEE-DLKRRDF LKE-DLARRDF IVK-DLSRRDF LKE-DLSRRDF LRE-DLKRRDL LRE-DLKRRDF ITQ-DLSRRDF LEA-DLARRDF IEE-DLSRRDF AEE-DVRRRDF LAQ-DLQRRDF WQK-DAERRDL LIK-DLKRRDF LYE-DLKRRDF LSE-DLKRRDF LEE-DAQRRDF
404] 404] 397] 400] 400] 397] 397] 406] 249] 407] 397] 404] 402] 403] 402] 399] 398] 396] 401] 392] 400] 421] 473] 448] 444] 443] 448] 451] 450] 450] 423] 436] 448] 437] 468] 448] 410] 394] 432] 466]
22
gi|146283614 gi|88705696 gi|54294419 gi|134095805 gi|115421815 gi|74318068 gi|114332256 gi|37680945 gi|149190136 gi|153825676 gi|75238848 gi|77976052 gi|90406862 gi|145642144 gi|152979389 gi|113461167 gi|154707640 gi|78224373 gi|148262375 gi|118580041 gi|29840548 gi|15605135 gi|24215701
37 1 13 33 32 32 35 40 30 30 29 92 14 85 1 33 34 20 17 19 37 38 35
HRNEISRHAVSVVERLQQAG ------------MARLRDKE SKADISNNALKVLNRLISHG DPKLVSANAIRVTSTLQEAG DRRNVSRHAIKVCEVLRQHG RREQLDDCALKVCETLAQAG SRSSISAGSLKVALTLQQAG SRKQISDHALKVLYRLHGAG SRKQISENALKVLYRLNGAG SRQQISENALKVLYRLHGAG SRKDISENALKVMYRLNKAG SRRDISDNALKVLYRLNKSG SRANIDDNALKVLYRLHNAG SPRDFSRNALTVVEKLQRQG ---MISKNALTVVEKLNRNG MPRMISRNALSVVEKLHRNG SRSDISPNALRVLYRLSKSG SRSQVSPNALRVLYRLKDNG SRRWLSPNAVKVLYRLKDNG SRKLVSPNALRTLYRLKDNG KLKDFSPHALSVVKTLRKAG DLQSFSTHALSVVRTLKKAG RKNMIDEDAVKIIHRLNKFG
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
YEAYVVGGCVRD HQAYLVGGAVRD FQAYIVGGSVRD FKAFLVGGAVRD YEAYIVGGAVRD FKGYLVGGAVRD YSAYIVGGAVRD FDAFLVGGGVRD YDAYLVGGGVRD FEAFLVGGGVRD YEAWLVGGGVRD YEAYLVGGGVRD FRALLVGGAVRD FEAYIVGGCIRD YEAYIVGGCLRD YEAYIVGGCLRD YEAYLVGGGVRD CIAYLVGGCVRD FTAYLVGGGVRD FIGYLVGGCVRD HKAYIVGGCIRD YEAYIVGGCIRD FKAYIVGGGVRD
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7)
KDYDVATSA KDFDIATDA KDFDIATNA KDFDIATNA KDFDVATNA KDFDVATDA KDYDIVTDA KDFDIATNA KDFDIATNA KDFDVATNA KDFDVTTNA KDFDITTSA KDFDITTNA KDFDVATNA KDFDVATNA KDFDVATNA KDFDIATNA KDFDVATNA KDFDIVTDA KDFDVVTNA KDFDISTSA KDFDISTSA KDFDVVTNA
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
TPEQVR-AEFRNAR TPEEVH-ALFGNSR TPNEVK-NLFKNAR TPEQVK-KLFRRAF TPEQIR-PLFRRAR TPEEVR-RLFRRSR TPEEVR-AIFRHSR TPEQLK-QLFRNCR TPEQIK-KLFRNCR TPEQLR-QLFRNCR TPEQVR-KLFRNCR TPEQVR-KLFRNCR TPEEIK-ALFRNCR RPEQIQNIFQRQCR RPEQIQTVFQRQCR KPEQVQAIFQRQCR RPHEIR-KLFKNSR TPNQVK-RLFRNCR TPGQIK-RMFRNCR TPNQVK-RIFRNCR KPEEIK-AVFKNCI KPEEVK-TLFKNCI TPNQIK-KIFNNCR
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15)
EIIEVATFR EVIEVTTFR EIIEVATFR DLLEVTTFR EIIETSTFR ETIEVTTFR ETVEVSTFR EIIEVATFR DIIEVATFR DIVEVATFR EIIEVATFR EIIEVATFR EVIEVATFR DIIEVATFR DVIEVATFR DIIEVATFR EIIEVSTFR EIIEVATFR EIIEVATFR EIIEVATFR QIIEVSTFR QIIEVATFR KVIEVSTFR
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
31) 28) 25) 23) 23) 39) 31) 27) 27) 29) 29) 32) 30) 28) 28) 28) 31) 48) 47) 40) 18) 18) 26)
LED-DAQRRDF LEE-DAVRRDL LDE-DAWRRDF QHE-DALRRDF HEE-DAARRDF MAD-DAARRDF QEE-DVRRRDF IDE-DAERRDF IDE-DAERRDF IDE-DAERRDF IEE-DAQRRDF IED-DAQRRDF LEE-DAERRDF IEQ-DAARRDF LEQ-DAERRDF IEQ-DAERRDF IEE-DARRRDF PEE-DAVRRDF PEE-DAIRRDF PEE-DALRRDF AEE-DVLRRDF AEE-DVLRRDF PQE-DAARRDF
162 111 132 150 149 165 160 161 151 153 152 218 138 208 121 156 159 162 158 153 149 150 155
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
466] 403] 423] 453] 452] 462] 462] 463] 449] 457] 454] 521] 448] 509] 421] 457] 439] 457] 454] 455] 428] 425] 487]
-----------MLYRLKKAG SRANISRNALKVLYRLNKAG SRKDISENALKVMYRLNKAG SRSLVSPNALRVLYRLRDNG RESQLR-NLLLDVANFVNES IENTLR-SLLLDVAEYIREQ LENTLK-ELLLDVANYIRER KETRIC-NLLKDYTAHYNSL TEEKIR-NVLVGYCDYYNKT VEKRIF-DRLLATLRFFNLQ SENRLF-NLLNECVDFYDLQ AHSNIF-AFLLRVNEERRIF FQSLFTEGLKSLTELFVKEN FQSLFTEGLKSLTELFVKEN FQSLFTEGLKNLAELFSKEK FRGLFTPELKQLAELFEKYQ FKSIFTPELNDLVALFKKYD FRTLFTPQLLKLRDLFAKRN TTSMISKPTRIVLNGLKSKG SSSMIAKSTRKVLNGLKSKG ---MIAKPTRYVLNGLKKKG QRSMIPDSTRMVLNKLKKKG
( 0) FSAYLVGGCVRD ( ( 0) YAAYLVGGGVRD ( ( 0) YEAWLVGGGVRD ( ( 0) FIAYLVGGCVRD ( ( 6) VELRWAGGWVRD ( ( 13) LVLRFTGGWVRD ( ( 13) MVLRFTGGWVRD ( ( 6) LTLRITGGWVRD ( ( 4) LELRITGGWVRD ( ( 0) THLRVAGGWVRD ( ( 0) MDLRVVGGWVRD ( ( 0) ATLRVAGGWVRD ( ( 0) HELRIAGGAVRD ( ( 0) HELRIAGGAVRD ( ( 0) YELRIAGGAVRD ( ( 0) YELRIAGGAVRD ( ( 0) YELRIAGGAVRD ( ( 0) YELRIAGGAVRD ( ( 0) YDVYLVGGCVRD ( ( 0) HDVYLVGGCVRD ( ( 0) YEVYLVGGCVRD ( ( 0) FQVYLVGGCVRD (
7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7) 7)
KDFDVTTDA KDFDVATDA KDFDVTTNA KDFDVATDA HDIDIAINV QDIDVAINT HDIDVGISS HDLDIAINI HDIDIAVNH YDIDIALDK KDIDIAIPK QDIDIAIES QDIDFATTA QDIDFATTA QDVDFATTA QDVDFATIA KDIDLATTA ADVDFASTA KDFDILTSA KDFDILTSA KDFDIITSA KDFDVITTA
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 2) 2) 2) 2) 2) 2) 2) 7) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
RPEQIR-DLFRSCR RPEQVK-QLFRNCR TPEQVR-KLFRNCR TPNQIK-RIFRNCR GQNFGL-KVREYLE GYQFGM-RLKEYLD GYQFGM-ALKDYLD GEEFAT-GLNGYLL GEEFVN-GLHDYLR GTEFVD-KVREYLL GIKFCE-YLNNFTK GELFAR-EVSAYQE TPTQMK-EMFQSAG TPTQMK-EMFQSAG TPAQMK-EMFQAAG TPDQMK-EMFTKEG TPDQMK-QMFEKEE TPTQMK-EMFEEDK ELREVV-RSFSRCE ELREVV-RTFPRCE ELKEVL-RAFPRCE ELKEVR-KVFPGCQ
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
15) 15) 15) 15) 28) 34) 34) 23) 21) 19) 19) 19) 20) 20) 20) 21) 21) 21) 15) 15) 15) 15)
EIIETATFR EIIEVATFR EIIEVATFR EIIEVATFR KQLETATTN KHLETVTTK KHLETVTTK KHLETATTK KHLETCTTK KHLETARMR KHLETATMN KHIETATVC ENFEITTLR ENFEITTLR ENFEITTLR ENFEITTLR ENFEVTTLR ENFEITTLR DMIEVSSFS DLIEVSSFS TIVEVSSFS IIIEVSSFS
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
22) 29) 30) 50) 30) 30) 30) 30) 30) 31) 30) 31) 17) 17) 17) 17) 17) 17) 27) 28) 27) 26)
EQ--DAWRRDF EE--DALRRDF EE--DAQRRDF GE--DALRRDF EE--DALRRDA EE--DAMRRDA EE--DALRRDA EE--DALRRDA EQ--DAVRRDA EE--DAYRRDL LE--DAMRRDF LE--DALRRDF WQK-DAERRDL WQK-DAERRDL WQK-DAERRDL WRT-DAERRDL WQL-DANRRDL WQL-DANRRDL FN--NCLQRDF LN--NCLQRDF WR--NCLQRDF WK--NCLQRDF
105 184 163 159 154 164 273 158 164 201 276 150 167 167 155 125 185 133 192 196 118 176
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
401] 475] 465] 455] 516] 522] 633] 531] 547] 560] 605] 526] 448] 448] 436] 411] 477] 436] 881] 527] 455] 505]
Group II KOG2159 gi|77164400 gi|88810481 gi|89107024 gi|39998340 gi|154299760 gi|115387873 gi|146323072 gi|50293743 gi|68466285 gi|1139585 gi|71027783 gi|71423978 1ou5_A gi|30750040 gi|126336363 gi|115640822 gi|21357337 gi|71995920 gi|15229038 gi|22327016 gi|157359927 gi|42571667
1 62 40 16 11 8 117 20 30 72 148 16 51 51 39 8 68 16 72 75 1 57
Group III PF04439 2pbe_A gi|56962634
2 1
SLRSEQEMMDIFLDFALNDE ( -MRSEQEMMTLFLDFAKNDE (
2) RLVTLEGSRTNR ( 2) RLVTLEGSRTNK (
7) QDYDISYFV ( 7) QDYDISYFV (
1) DVESFK-ENDQWLE ( 34) NKLDLTLIP ( 1) DIDSFK-ENDQWLD ( 34) NKLDLTLIP (
2) EAEDYFANND2) EVKDYFSQSD-
120 118
[ 294] [ 284]
23
gi|42761424 gi|110556097 gi|106894096 gi|38049244 gi|89098156 gi|57118014 gi|157150112 gi|32261251 gi|4959487 gi|32261257 gi|32448570 gi|29377329 gi|73663233 gi|42781114 gi|52842365 gi|156869363 gi|152975576 gi|15612885 gi|149181905 gi|89894067 gi|150391095 gi|153952904 gi|56965390 gi|52078682 gi|89099969 gi|106888177 gi|37525410 gi|285308 gi|22538164
1 1 1 1 1 1 1 1 4 5 4 1 1 1 3 1 1 1 1 1 1 1 1 1 2 1 1 1 1
-MRTEKEILNLVSEFAYQRS -MRSEKEMMDLVLSLAEQDE -MRTEQEIMNLMLDIAKQDE -MRSEQEMMNLILSIAKKDD -------MMDLGMNFALNCE -MRSEKEVYDIVLNFAKTDK -MKTETEMFDVILQTAKALQ -MRTEPEMLDLILQTAKTLK NMRTETEMLDVVLKTAETLQ NMRAETEMLDLILQTAKTLQ NMRAETEMLDLILQTAKTLQ -MRTEEEMFQLIMDVAKQEE -MRTEKEMLNLILNIAKQDK -MRTEKEMLDLIINTAKEDE NRRDEQTMLTLIIKIANDDE -MRSQEEMLAIILKKAKQEE -MRSEKEMFDLIIGFAQRDE -MRTEQEMMDVILTIAKKEE MLRTEQEMMNLILNTANEDE -MRSEKEMMEIILSTAKKDE -MRTEKEMMDRILNTAKEDE -MRSEKEMMDLIIAVANNDV -MRTYNEMINLLLGVAKSDE -MRTEQEIIDLVLKVAREDS NKLTYETIIAGFMEMAKNDE -MNRFELIINNFMKWGNRTD -MEDPVILLEKILTFAHNDP -MKVREEKLRTIIEWSEKNE -MRDEQEIYNLVLNIANQDK
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 1) 1) 1) 1) 1) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2)
KIIALEGSRTNE RIVTLEGSRANI RIMILEGSRTNV RMVTLEGSRTNK RLFTLEGSLTNT RMVTLEGSRTNT YAVAMSGSRSNP EAVALSGSRTNQ TAVAMSGSRTDT KAVAMSGSRTNP KAVAMSGSRTDT RAVGMVGSRTNV KAVCMNGSRVNV RAVIMNGSRVNP KIVIMNGSRASP RAALLQGSRVSK RAVYMNGSRTNS RAVYMNGSRTNS RAVYMNGSRTNP RAVYMNGSRTNP RAVVMNGSRVGN RAAYLEGSRANP RAVCMNGSRTNR RAVGMNGSRTNP RAALIVGSRART HAALMIGSQARN DTVIQTGSRARN RVLLLTSSLVNP EAVLLNGSRANP
( 7) QDYDFAFFV ( ( 7) QDYDITYFV ( ( 7) QDYDMTYFV ( ( 7) QDYDVTFFV ( ( 7) QDYDFSYFV ( ( 7) QDFDITFFV ( ( 7) QDYDVVYIV ( ( 7) QDYDVVYVV ( ( 7) QDYDVVYVV ( ( 7) QDYDVVYVV ( ( 7) QDYDVVYVV ( ( 7) QDFDIVYIV ( ( 7) QDFDIVYIV ( ( 7) QDYDIIYVV ( ( 7) QDYDIVYLV ( ( 7) QDYDVVFLV ( ( 7) QDYDIVYVV ( ( 7) QDYDIVYVV ( ( 7) QDYDIVYVV ( ( 7) QDYDIVFVV ( ( 7) QDYDIVYFV ( ( 7) QDYDIVYVV ( ( 7) QDYDIVYIV ( ( 14) RDYDIVYVV ( ( 7) SDLDLVAIV ( ( 7) SDLDIIMVV ( ( 6) SDLDIELIG ( ( 7) SDLDIEFVF ( ( 7) QDYDIVFVT (
1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1)
DIEYFT-HEESWLS DIEPFI-SNDDWLN DIKSFI-SNDEWLK EMDTFK-TSDEWLS DMDYFK-KSDDWLS DMDSFT-SDDKWLD DLHALV-ADLAWLE DLDNLT-SDLAWLD NLDELI-TDLSWLD DLANLT-SNLSWLD DFDNLT-SDLYWLD PCAEFF-ETATWIA NLEDII-ADLEWIN DIRSFT-SNHNWIH EIESFV-HDKNWIN AMEPFL-SNPRWID DTASFI-EDQRWIT DIAPFI-QEQQWLK ETAPFI-EEKEWIG ETDSFL-ADKDWIG DFEYFA-CNHSWID ETRSFR-EDKAWID DVDSFV-SDPGWID DMQSFL-DEPGWVD NPSAFL-NDTDWLG DPNFFL-QSDHWLE GTDELI-GNDLWFK DNTNYI-SDKSWTL FIEDII-SDTNYHK
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
34) 33) 33) 34) 34) 33) 26) 26) 26) 26) 26) 34) 34) 34) 34) 28) 37) 37) 37) 37) 34) 34) 34) 34) 28) 27) 31) 30) 32)
IKMDITLIN NKIDLTLLP NKIDLTLLP NRIDLTLIP TKIDLTLIP VKIDLTLLP NRIDLTLCP NRIDLTLCP NRIDLTLCP NRIDLTLCS NRINLTLCP QRIDLTLCP NRIDLTLLS NRIDLTLVP NRIDLTLLH NRIDLTLVS NRIDLHIET NRIDLTLQT NRIDLQIQS NRLDLVIEI NRIDLTFAS NRLDLHVKT NRIDLTLCP SRIDLILVP LDVDFAFFP LDVDFVILS RKVDFTLAS VKVDFKLYS TRIDLRLIK
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 1)
DLNRYFSDSDELDNYLK-GDELEDYLN-GDEKDDYFSKSDEYEDYFKNSDLIDEYFTW-DHIKEWVDS-EYIKEWVES-EQIQEWVDS-EHIQEWVDS-EHIKEWVDS-EEKDNWHEG-DKLSEYLAE-DLIKKFVGQ-DQLETMPR--DARDTDP---RAMLKDYGK-DTMREVYEN-DCMNERYLS-DEAQKNFIG-DNIDTVIEN-DNVLNSL----RRATWNQG-DEKLEYSRE-DALPELEQD-PE SLENAIRN-NE RLKDMKQR-GL KFIKETQE-KE EFLEDYLD-D-
118 116 116 118 112 116 108 108 112 113 112 117 117 117 119 109 120 120 121 120 117 114 117 124 114 111 114 114 114
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
282] 302] 283] 284] 280] 288] 277] 270] 276] 278] 290] 287] 261] 290] 288] 281] 293] 292] 294] 289] 296] 287] 289] 296] 291] 281] 283] 287] 284]
Group IV 1KAN 1kan_A gi|56962049
8 10
TREERMKIVHEIKERILDKY ( SQKERLQTCHEIAKRLHEVY (
4) KAIGVYGSLGRQ ( 4) LAIGVYGSVSRG (
5) SDIEMMCVM ( 5) SDIEMFCVL (
0) -------------- ( 15) WKVEVNFYS ( 12) SDWPLTHGQFF 0) -------------- ( 17) WKAEVNVCS ( 12) DRWPLTHGPYF
104 108
[ 253] [ 256]
DLETLRARREAVLSLCARHG LIGPVSEQREKILSVAGEHG SIAGVEIDRARLAAICARYG IYELIAQKRPDILALADRFA MHPVIETHREELRALARRYG -MKNIHLSPECIAAFCRKHG VSSLSDVSFQALEAFCRRRH LDEALTKLRA-AKPLLDRYG LEDVIAILRA-HEAFYRAKG GQRLSERIKEKILPILKKYG VEER-EDLFRKISSFLKKYG GRTLTRMARETIIAILTRND IEPIIQQLQNILPQLREEYD KQEIIDIIRHSKPEIEARYG TIQFMAILRQNLPEISRKYK LDHILEHLRAIQPELRRRYP
1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1)
3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3)
6) 6) 6) 6) 6) 8) 8) 7) 8) 8) 6) 6) 6) 6) 6) 6)
94 118 93 92 214 92 108 92 97 117 100 92 100 92 102 98
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
Group V COG1669 1wot_A gi|94265949 gi|86738875 gi|154150397 gi|56477651 gi|148656575 gi|148655002 gi|16127310 gi|83312696 gi|52548423 gi|20090159 gi|126179112 gi|148656651 gi|148262492 gi|20088999 gi|144899593
5 29 3 3 125 1 17 3 7 26 12 3 11 3 13 9
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
VRVRVFGSVARG VNVRLFGSVARG AELQVFGSQARG KNIRIFGSVARK RSIKVFGSMARD RQLAVFGSALRD RQLSLFGSVVRG ARVGVFGSTARG IHMAVFGSVARG KKAALFGSFARG TKVSVFGSYVRG EWIAVFGSYARG ERLGVFGSYVRN MRVGLFGSYVRE SYLGIFGSYVRG RSLGVFGSYARG
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
EDSDLDLLV PESDIDLLV ADSDIDILY EKSDIDFLV VESDVDLLV PDSDVDILV PDSDIDVLV PDSDVDVLV PDSDVDLVI PDSDIDILV PESDIDVLV PSSDIDILV AESDIDILV KKSDIDILV PESDLDILV ADSDVDLLV
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
---TLL-DHARLKL ---SLL-DLGGALI ---RLGWDIEQLAD ---SLI-THAAFQQ ---SGF-TLGALLM ---TLF-DIAGMEQ ---GFL-ALSRMQR ---PGL-DFFRLQD ---SLL-TLCGVQN ---TLL-DLVGLEL ---SLL-TLVNIEL ---SLF-SLVRIED ---GLL-KYISLEQ ---DLF-EFIDLQE ---GFF-EYIQLED ---DLL-AYAGLQQ
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
8) 8) 8) 8) 8) 9) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8)
RVDIVSERG KVDIVTERG PVDLVSARG SVDVASANG RVDIVTPAS TVDLRTPED PVDLVPRAG EIDLVTPDG KVDMLTWDG KVDVLTYNS KVDLLTEKS KVDLVTENA RVDLVMESA KVDLVMEKA KVDLVMKSA PVDLVEREA
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2)
-PRLREQVLRE -WYLRDKIINE -PLLRSSVLAE -EHFRRIVLQE -PAIRARVLRE -PYFRSQVIAE -PTIRAHILEQ -RLVRDRVLRE -PRLDERVRQE -PLLKDYILKE -PYLIDGIKKE -PYLADAIYRD -PEIGARIRTE -PAIGKRIISE -PAIGKHILEE -PRIAAQVLAE
98] 122] 100] 96] 218] 100] 116] 96] 102] 125] 106] 113] 104] 96] 106] 102]
24
gi|88809744 gi|118579580 gi|2128139 gi|20807236 gi|126660252 gi|55980984 gi|148657659 gi|46199450 gi|126657863 gi|134045668 gi|154149914 gi|109646961 gi|67937903 gi|88603950 gi|21226289 gi|150021566 gi|76258736 gi|20089001 gi|77919380 gi|37520733 gi|16330630 gi|154149919 gi|83309845
9 20 8 4 39 14 161 4 12 4 12 4 4 17 10 4 5 6 3 19 3 12 74
IRQRLEALKPQVLEVARRYG LETMLQSLKESLPELQARYG LSEIKEILRKHKKELKEKYK LTEIIKVLKEHKEELKERHK LEEIKEILINYKPFLVEKFK LEEIRRILKAHKAELAAQYG LTEIRNIIRQQSDILADKYG LGEILSILARHKPELRARFG LQTFKTCLQAQKADLKQQYH LSEFKTILHENRDLLIKKYK TAGILDQLRGMRHELEEQYH KNALEARLREYKPILEKRYS IDELKEIIAQNRTILEQKYK KDLILKTIRELSPEFRTVYK VNQIRLLILERKDEIKEKFK LNEIISIIKDLKKEIEQKYK ANDIVSRLRELKPIISAHYK FVESLNILKSHVEVIHQKFG REQVIRILSQHMEEIRQKFD RQGVLECLRCHWPEI-QSYG CQSVLQLLSQSKPDLQSRFG KSPVFMRLGHAVPSLRSRFG REEVIGRLQ-SRADIMERFG
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1)
SNLRIYGSIATG KSLGIFSSYVHG KSIAIFGSYARN KQIRVFGSYVRG SELGIFGSYVKE QRIAVFGSYARG AVVAIFGSYARG RELAIFGSYARG KELGIFGSYVRG KTLGIFGSYVRN KRIGIFGSYAKK KKIGVFGSYARN KSLAFFGSFVRG KRLGLFGSYASG EIIGIFGSYARG EILGIFGSFARG KELGLFGSFVRG KRIGIFGSFARG QSLSLFGSVARG KTIAVFGSTARD TQLALFGSTARD KRIGIFGSFARS KNLYLFGSVVRE
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3)
PASDLDLLV KGSDLDLLV ETSDIDILI ETSDIDIIV QDSDVDILV PESDLDLLV GESDLDLLV PVSDVDILV PNSDLDLLI ETSDIDVLV DESDLDLVV EDSDIDLIV DGSDIDVMV EHSDIDILA EGSDIDVLV ETSDVDILV ANSDIDILA EDSDLDVLV PDSDLDILV PDSDVDLLV PHSDVDILV RTSDVDILV PTSDIDLMV
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
6) 8) 6) 6) 6) 6) 6) 6) 8) 7) 6) 6) 6) 6) 6) 6) 6) 7) 6) 8) 6) 7) 8)
---SLL-GLISLRQ ---TLV-GFVRLER ---SLL-KLIELEN ---TFI-EFIKIQE ---TLF-DLVEIEY ---GLL-KFVELER ---SLL-ELVGAEI ---GW--EIVDLRD ---GLL-TFCHLEN ---NFD-NYMGLKY ---GML-AFVHLKD ---GF--QFVELKL ---GLL--FIHLAD ---DIW-DLSGLKI ---SLF-ELVGLGD ---TLF-DFIGLSI ---DLF-DLVGLTL ---TFD-NYMDLKF ---GMF--KYLDLK ---TFY-QFCDLQD ---TSH-RYFGVQF ---TFD-NFMQLVY ---GLF-AYVELKH
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 9) 8) 8) 8) 8)
PVDLTEAEN KVDLVERDT KVDLITKNS RVDLLTEES PVDLVDKES KVELVTPNA KADLVPKRD PVDLLTGTP NVDLVIKDS DVDLVIKDD RIDLVTPDG KVDLVTPNA KVDLLTPEA SVDVTTVSA SVDVVSERA DVDIVPQDA QVDIVPKRA EVDLVTEKA PVDLVTMKRVDLGTFDS AVDLATEKA KVDLLTVGS PVDLITSGN
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 2) 1) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2)
-PLIRTQILEQ -PVIGKRILEE NPYVKKSIEED -PFIKQYIKEV -PYIAENILNE -PHPHR-----AELRESILKE ---SKRSLSRA -PKIGQQILSE -VEIRDKILKE -PLIRDRVMHE -PQIKENILRE -LNRKKHIMSE -PEMRDSILAN -PMMKDDVLRE -KELREKINQE -AELQAAVLRE -PQLEDIIMKE -KKQLRDKILG -PRTAAKILKE -PELRSQIEQE -KYIRSRVERE -PRLKKRILEE
98 111 98 93 128 98 250 89 103 94 101 92 92 106 99 93 94 96 91 109 92 102 164
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
102] 115] 102] 96] 132] 98] 254] 90] 107] 101] 105] 96] 96] 110] 103] 97] 98] 100] 97] 113] 96] 108] 176]
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
589] 592] 569] 570] 570] 570] 573] 574] 570] 569] 570] 571] 567] 569] 573] 559] 563] 580] 574] 598] 588] 589] 650] 574] 575] 516] 335] 337] 322]
Group VI COG1796 gi|153004169 gi|86158982 gi|108803083 gi|16079911 gi|154686994 gi|157693277 gi|65321919 gi|56421229 gi|21282755 gi|57866653 gi|153201936 gi|56964441 gi|147678282 gi|134299491 gi|89211285 gi|27383138 gi|110636887 gi|156740035 gi|74317464 gi|82702030 gi|84359973 gi|78062648 gi|76817244 gi|108803335 gi|46199091 gi|13541687 gi|148747154 gi|51860136 gi|20067683
147 148 143 144 144 144 147 146 144 144 146 144 147 147 150 150 146 151 150 150 150 150 150 150 149 152 151 151 151
RRMRERATERRVLLADALAVGE ARLHAREADRRVLLAEALAAAG RAHGARE--RRMLLDEATAFGE GEAGKQP--ERFPIGYALRIAR GEAGKQP--ERFPIGYALSIAS EEAGKQP--ERLPIGFALDVAE DQVGSRP--ERLPIAMVLPIAG EKAGKRP--ERLPLARVLAIAA KQLGAKK--DRYPIDQMRRLNQ KAIGAKK--DRYPIELMRGLNQ REMGERP--DRYPLNDVLPIIQ AVWNVRP--ERLPVADVLPFSE EMREDRA--GRVLLATARELAG EMIRNRR--GRTRLSVSRELAE EKYESFK--NTYLLKEALGILD AIAKSGE--GRLHLHRAAALLE QFVMKQS--DKHLYGDIEALAF SAAERQE--RRMLLAHAIDSAE EAHASKT--QRFKLAVAAQYAD EAHTDQK--GRFKLAVAAQYAE DERLQREP-QRFLLPDAARSLM DDRLKREP-QRFLLPDAAHSLL GARLRRKS-QRFLLSFATQYLT GR--KERP-QRFRLSAAEPAAR ALAQAAG--KRRPLGAVLSLAR EMAERVS--ERIPIWKAYPEAL PREEMLQ--MQDIVLNEVKKLD PRVEMEK--MEVLILGELKKID PREEMLQ--MQDIVLNEVKKLD
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 13) 11) 13) 12) 12) 12) 12) 12) 12) 12) 12) 2) 2) 4)
LDVAVAGSTRRG EAVELAGSARRG ERAEIAGSLRRQ IKFSRAGSLRRA QKYSRAGSLRRA IRFSRAGSLRRA IRFSRAGSLRRV IRFSRAGSLRRL DQYSSAGSFRRF KQYSTAGSFRRY ETFAQAGSLRRL IRFSRAGSLRRM SRVEAGGSLRRW VEIEVAGSTRRW SRVKMTGEARRK KRATFAGDFRRG PQIQLCGDVRRM SQAAYAGSLRRG TKVAVAGSFRRM KEAIVAGSYRRM VEPVPAGSFRGR GKAVPAGSFRRR SEAVAAGSFRRR REAVVAGSFRRR ERAELCGSARRY LIVEIAGSTRRM YIATVCGSFRRG YIGTICGSYRRG ATVCGSGSLRRE
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4)
ADLDVVAAS GDVDLVAAS GDLDIVAAS KDLDYIIAT KDLDYIIAT KDLDYIIAT KDLDFIIAT KDLDYVIAT KDLDFIIST KDLDFIIST KDLDYVIAT KDLDYIIAT GDIDLVVAA GDLDLVAAA NSLEILVAS GDLAIVAEA DCISFVVPS GDLDILAAA GDLDIVATA GDLDIVVTA GDLDILVTA GDLDILVTA GDLDIVVTS GDLDVVACS GDLDFLVAS GDIDILAAS GDMDVLLTH GDIDILLTH NDVDLLIIV
( 0) --RDPG-ALSVRLV ( ( 0) --RDPA-ALAAALT ( ( 0) --DDPK-ALTDAFA ( ( 0) --DHPA-EVREQLL ( ( 0) --DHPE-AVREQLL ( ( 0) --DNHE-QVREQLV ( ( 0) --TEPA-AVREHLL ( ( 0) --DRPA-EVRDGLL ( ( 0) --DNPK-AVQQQLL ( ( 0) --SEPK-KVQQQLL ( ( 0) --EKPE-EVQKALL ( ( 0) --DNPS-SVKDQIL ( ( 0) --SEPE-PVIQAAV ( ( 0) --EEPA-PLLRALA ( ( 0) --KEKE-SVLEYLR ( ( 0) --AKPD-KTSTPP- ( ( 0) --TDFA-ETVSVLQ ( ( 0) --DDAP-AVVRAFT ( ( 0) --AADS-PVIARLT ( ( 0) --ASGS-PVMERFT ( ( 0) --RDPA-AVGHAFV ( ( 0) --RDPV-AVAEAFV ( ( 0) --GDPA-KVSARFV ( ( 0) --QEGR-RVIEHFV ( ( 0) --REGE-RAVEGFV ( ( 0) --EYPN-ALMDKFV ( ( 11) --KLLH-RVVEQLQ ( ( 11) --KLLH-AVVDHLE ( ( 3) --KLLK-HVLP--- (
23) 23) 23) 25) 25) 25) 25) 25) 26) 26) 25) 26) 23) 23) 23) 3) 25) 23) 23) 23) 23) 23) 23) 23) 23) 23) 33) 35) 31)
LQVDLRVVP LQVDLRVVP VEADLRVVS TSVDFRLVT MSVDFRLVT ISVDFRLVT ISIDFRLVK IAADFRLVG IGVDFRLIE IGVDFRLIE ISVDFRLVE ISVDFRLVP ISVDLEIVP IAVDLQVVS IRIKVFIVP LQIRVSDRK IPVEIYLTT MQADLIAVP LQVDLRVVA LQVDLRVVA LQVDLRVVD LQVDLRVVD IQADLRVVS LEADLRVVP LQVDLRVVP TTCDLRIVS RRIDIRLIP RRIDIRLIP RRIDIRLIP
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 1) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2)
DFPTLLHHFTG DFPTLLHHLTG AFGSLLHHFTG QFPTTLHHFTG QFATTLHHFTG QFATTLHHFTG EFITTLHHFTG EFATALHHFTG AFYHTLQHFTG AFYHTLQHFTG AFATTLHHFTG AFATALHHFTG LFVQALFRSTG EFVATLHRSTG MFCFYLFYLTG -FGAALLFATG KFGSELFKHSS MWGSALQHFTG AFGAALHYFTG SYGAALHYFTG AFGAALVYFTG AFGAALVYFTG ALGAALVYFTG SFGTALLYFTG SYGAGLQYLTG SFGAAMQYFTG QYYCGVLYFTG QYYCGVLYFTG EKPYAIFHFTG
261 262 255 258 258 258 261 260 259 259 260 259 259 259 262 240 259 264 262 262 263 263 263 261 261 264 274 276 263
25
gi|20067709 gi|156370197 gi|34014748 gi|146078013 gi|71666008 gi|60594052 gi|47220317 gi|67521542 gi|51535806 2ihm_A gi|58865402 gi|113913509
151 151 487 1172 155 148 335 454 358 155 291 328
PREELIQ--GKKIVNHLRSRLA PREEMIK--LRDIVLVHVKKQD PHAEVRL--HEAFLKLRLRKYL PHEEGRL--HEAFMKLRMRKYL PMHESVL--HENFLRESVQARL PREEATE--IEQTVQKAAQAFN PRGEAAA--IEKVVKDAALAVD PRTEVQA--HGNFVRRVVRMES PRHEVSE--MEKLLQEVGTDIL RRADAEA--LQQLIEAAVRQTL SRAEAEA--LQQLVEAAMREIL SRNECFA--HLEKVQNALSEID
( 12) KNIVAVGSLRRE ( ( 2) LTATVCGSFRRG ( ( 3) YELAICGSYRRL ( ( 3) YELVVCGSYRRV ( ( 3) YEIQVCGSYRRH ( ( 2) LLCVACGSYRRG ( ( 2) LVAMACGSYRRG ( ( 2) MQVIIGGSYRRG ( ( 2) VIIVCGGSYRRG ( ( 2) ATVTLTGGFRRG ( ( 2) ATVTLTGGFRRG ( ( 2) CQVELQGSYNRG (
4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4)
NDVDLLIIV GDIDILLSH GDIDVLITR GDIDVLITH GDVDAILAR GDVDVLITH GDVDVLISH GDIDLIITR GDMDIIITH HDVDFLITH HDVDFLITH GDIDLLFFK
( ( ( ( ( ( ( ( ( ( ( (
3) 13) 33) 15) 13) 7) 7) 10) 7) 7) 7) 10)
--KLLK-HVLPNIR --SLLH-GLVHCLE --QEVL-AAFVSAL --SEVL-GSFLAGL --TGVL-GTLVDYL --GIFS-RLLDSLR --GVFT-KVLQSLH --LMLD-NVVPKLF --GFLP-KFVQRLK --GLLP-KVMSCLQ --GLLP-RVMRCLQ --KIME-TLCIKLY
( 31) ( 32) ( 40) ( 53) ( 37) ( 34) ( 34) ( 32) ( 38) ( 61) ( 60) (102)
RRIDIRLIP RRIDIRLIP RRLDIRFVE RRLDVRYVD RRVDIRLIE RRLDIIVVP RRLDIIVVP RRIDLLFVP HRIDLKVYP VRVDLVVTP VRVDLVVTP RRLDFFCCK
( ( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2)
QYYCGVLYFTG 274 [ 335] QYYCGILYFTG 275 [ 336] CFPAALLYFTG 640 [ 813] SFPAAMLYFTG 1320 [1481] SVPTALLTFTG 285 [ 403] EFACALLYFTG 268 [ 335] ELACALMYFTG 455 [ 521] ELGAALIYFTG 575 [ 625] RHAFGLLAWTG 482 [ 549] QFPFALLGWTG 302 [ 360] QFPFALLGWTG 437 [ 495] ELGAGRIHYTG 519 [ 582]
Group VI KOG2534 gi|60827766 gi|148232172 gi|51860136 gi|20067709 gi|20067701 gi|20067705 gi|20067677 gi|115707284 gi|156370197 gi|91085421 gi|145239427 gi|154281005 gi|46107476 gi|39945190 gi|50550535 gi|113913509 gi|62079061 gi|114632435 gi|126273242 gi|149690123 gi|149412246 gi|118092900 gi|154147652 gi|33416901 gi|47220317 gi|156369028 gi|118197144 gi|30681747 gi|47232548 gi|118380982 1jms_A gi|112734847 gi|109090097 gi|149689800 gi|74136509 gi|149642513 gi|45382381 gi|33860211 gi|147899762 gi|40218593
151 151 151 151 151 151 149 163 151 903 511 528 612 659 527 328 386 399 377 389 402 388 391 380 335 215 606 338 361 643 175 304 280 303 306 304 304 300 300 301
PREEMLQMQDIVLNEVKKVD PRKEMLQMQEIILDKVNNLD PRVEMEKMEVLILGELKKID PREELIQGKKIVNHLRSRLA PREEMLQMQDIVLNEVKKLD PR--LIQGKKIVNHLRSRLA ----LIQGKKIVNHLRSRLA PREEVTRLEGILK-SQVAAE PREEMIKLRDIVLVHVKKQD PRNEIKQIENLIR---GHLE PRSEVEAHGEIVRKAVQTAD PRAEVEAHGAIVKELLFKVD PRSEVEALGLVVKRTAQHID PRSEVEALGSLVQREAAKID PRDEVTRHFMVVQKAAHEID SRNECFAHLEKVQNALSEID PREEAAEIEQMVRVSAQAFN PREEATEIEQTVQKAAQAFN PREEAAEIEKTVRETAHTLN PREEATEIEQTVQKSAQAFN PREEAAEIEETVRAAAQVLN PREEAAEIEQTVRQAALALK PRDEAGKIEQTVREAAHAVN PRSEANAIEKTVKDAAHSVD PRGEAAAIEKVVKDAALAVD PREEAGKIGEVVKAATEIID PREEAKAIYDIIKPIALSLD PRQEVQEMEQLLQRVGEETL PRHEVSEMEKLLQEVGTDIL PREEATLIVNEVIKGFKELY NRPEAEAVSMLVKEAVVTFL NRPEAEAVSMLVKEAVVTFL TRAEAEAVSVLVKEAVQAFL TRPEAEAVSVLVKEAVWAFL SKAEADAVSLLVQDAVWTFL AKEEADAVYLIVKEAVRAFL SKAEADAVSSIVKNTVCTFL RKAEADAVAMVVRDAVWTFL SRAEAETTEQLIKSIVWKFV TRPEAEAVAQIIETIVHNYA
( 2) YIATVCGSFRRG ( ( 2) YIATVCGSFRRG ( ( 2) YIGTICGSYRRG ( ( 13) NIVAV-GSLRRE ( ( 5) TVCGSFGSLRRE ( ( 13) NIVAV-GSFRRG ( ( 13) NIVAVGGSFRRG ( ( 3) YVATVCGSYRRG ( ( 2) LTATVCGSFRRG ( ( 2) IHLTICGSYRRG ( ( 2) MQVIIAGSYRRG ( ( 2) VKVIIGGSYRRG ( ( 2) VELIIGGSYRRG ( ( 2) VELLVGGSYRRG ( ( 2) VDAHIMGSYRRG ( ( 2) CQVELQGSYNRG ( ( 2) LLCVACGSFRRG ( ( 2) LLCVACGSYRRG ( ( 2) LLSVACGSYRRG ( ( 2) LLCVACGSYRRG ( ( 2) LLAVSCGSYRRG ( ( 2) LVCVACGSYRRG ( ( 2) LICVACGSFRRG ( ( 2) LLAMACGSYRRG ( ( 2) LVAMACGSYRRG ( ( 2) LLCITCGSYRRG ( ( 2) LFVEIMGSYRRG ( ( 2) VNIVCGGSYRRG ( ( 2) VIIVCGGSYRRG ( ( 6) YDIIACGSYRR- ( ( 2) ALVTMTGGFRRG ( ( 2) ALVTMTGGFRRG ( ( 2) AFVTMTGGFRRG ( ( 2) AFVTMTGGFRRG ( ( 2) ALVTITGGFRRG ( ( 2) ALVTLTGGFRRG ( ( 2) ALVTITGGFRRG ( ( 2) AVVTLTGGFRRG ( ( 2) AIVTLTGGFRRG ( ( 2) AIVTLTGGFRRG (
4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 3) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4)
GDMDVLLTH GDMDILLTH GDIDILLTH NDVDLLIIV NDVDLLIIV GDMDVLLTH GDMDVLLTH GDMDVLLTH GDIDILLSH GDIDALITH GDIDLLITK GDIDLIITK GDIDLIITK GDVDFIVTR GDIDMIFTK GDIDLLFFK GDVDVLITH GDVDVLITH GDVDVLVTH GDVDVLLTH GDVDVLVTH GDVDVLVTH GDVDVLVTH GDVDVLITH GDVDVLISH GDVDVLVSH GDIDILITR GDLDIVVTH GDMDIIITH GDVDILICR HDVDFLITS HDVDFLITS HDVDFLITS HDVDFLITS HDVDFLITS HDVDFLISD HDIDFLITS HDVDMLITS HDVDILITC HDVDFLISC
( 9) QPKLLH-QVVEQLQ ( 33) RRIDIRLIP ( ( 9) QPRLLH-QVVQCLE ( 32) RRIDIRLIP ( ( 9) QPKLLH-AVVDHLE ( 35) RRIDIRLIP ( ( 1) EKKLLK-HVLPNIR ( 31) RRIDIRLIP ( ( 1) EKKLLK-HVLPNIR ( 31) RRIDIRLIP ( ( 9) QPKLLH-RVVEQLQ ( 33) RRIDIRLIP ( ( 9) QPKLLH-RVVEQLQ ( 35) DLFTA--LP ( ( 9) KPELLV-SVVKRMI ( 35) RRLDIRLIP ( ( 11) KKSLLH-GLVHCLE ( 32) RRIDIRLIP ( ( 10) KQNLLK-NVVTALQ ( 28) RRLDIRLTP ( ( 8) RTIMMG-SVVPKLL ( 38) RRIDLLFVP ( ( 8) CALMTD-VVIPALF ( 34) RRIDFLYVP ( ( 8) LRPFLD-SLVQRLE ( 42) RRIDFLLVP ( ( 8) LVPFLD-RLVERLT ( 53) RRVDFLLVP ( ( 8) -QPFLK-ELVHKLT ( 28) RRIDFLLVP ( ( 8) LAKIME-TLCIKLY (102) RRLDFFCCK ( ( 6) -QGIFS-PLLDSLR ( 34) RRLDIIVVP ( ( 6) -RGIFS-RLLDSLR ( 34) RRLDIIVVP ( ( 6) -QGIFS-QLLDALR ( 34) RRLDIIVVP ( ( 6) -QGILS-RLLDSLR ( 34) RRLDILVPP ( ( 6) -QGVFS-RLLDGLR (144) KALSLKSTL ( ( 6) -RGLFS-KLLDSLH ( 34) RRLDIIVVP ( ( 6) -RGVFS-KLIDGLK ( 34) RRLDIIVVP ( ( 6) -KGVFS-KILHLLH ( 34) RRLDIIVVP ( ( 6) -RGVFT-KVLQSLH ( 34) RRLDIIVVP ( ( 6) -HGVMG-PLLCELK ( 33) RRLDIIVVP ( ( 8) -AGVLG-RLIQELH ( 36) RRIDFLTVP ( ( 5) HKGFLT-KFVKRLK ( 38) RRIDFKVYP ( ( 5) HVGFLP-KFVQRLK ( 38) HRIDLKVYP ( ( 7) PKKLMI-NLICKLE ( 33) RRIDLKYYP ( ( 8) -QQLLH-KVTDFWK ( 60) IRVDLVMCP ( ( 8) -QQLLH-KVTDFWK ( 60) IRVDLVMCP ( ( 8) -QQLLQ-KVMNLWE ( 60) IRVDLVMCP ( ( 8) -QELLS-KVINLWE ( 60) IRVDLVVCP ( ( 8) -DQLLQ-KVTNLWK ( 66) IRVDLVVCP ( ( 6) -EQLLP-NIIKLWE ( 66) IRVDLVMCP ( ( 6) -DELLH-K------ ( 64) IRVDLVITP ( ( 6) -KELLH-KVINLWK ( 66) IRVDLVFCP ( ( 6) -KNILH-NTMSVLK ( 63) VRLDLVITP ( ( 6) -N-FLR-KIVNKLD ( 78) IRVDLVIVP (
2) 2) 2) 2) 6) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2)
QYYCGVLYFTG 274 QYYCGVLYFTG 273 QYYCGVLYFTG 276 QYYCGVLYFTG 274 EKPYAIFHFTG 271 QYYCGVLYFTG 282 QYYCGVLYFTG 279 QYHCGTLYFTG 288 QYYCGILYFTG 275 QYFCAVLYFTG 1019 EIGAALIYFTG 638 EIGAALIYFTG 651 ERGAALLYFTG 743 EYGAALIYFTG 801 EKGAAFIYFTG 643 ELGAGRIHYTG 519 EFACALLYFTG 506 EFACALLYFTG 519 EFACALLYFTG 497 LFSCGLGPQCP 509 SFGTSCRSFSQ 632 EFACALLYFTG 508 EFACAIMYFTG 511 EFACALLYFTG 500 ELACALMYFTG 455 EWACAIVYFTG 334 SRGAALIYYTG 730 IYSFGLIAWTG 462 RHAFGLLAWTG 485 LYGYALLYFTG 766 RRAFALLGWTG 323 RRAFALLGWTG 452 RRAFALLGWTG 428 NHAFALLGWTG 451 RYAFALLGWSG 460 QYAYALLGWTG 456 QYAYALLGWTG 448 QYAFALLGWTG 452 QYPYALLGWTG 449 QFAYALLGWTG 464
[ 336] [ 334] [ 337] [ 335] [ 329] [ 335] [ 340] [ 349] [ 336] [1080] [ 705] [ 718] [ 810] [ 868] [ 710] [ 582] [ 573] [ 586] [ 564] [ 632] [1156] [ 575] [ 577] [ 566] [ 521] [ 400] [ 800] [ 529] [ 552] [ 832] [ 381] [ 530] [ 485] [ 508] [ 518] [ 514] [ 506] [ 510] [ 507] [ 522]
26
gi|40037389 gi|74136105 gi|6094445 gi|62414130 gi|17366888 gi|58865402 gi|7019493 gi|57231410 gi|110645438 gi|72138850 gi|47228542 gi|19113889 gi|85102863
303 293 295 291 291 291 291 297 330 307 141 317 382
KKSEAEAVIQIIGDIVGQCA SKAEARALTKAIGETVQAIT SKAEAKAVGCIIEDTFHWIA SRAEAAALKMMMEEALLFIN RRADAEALQQLIEAAVRQTL SRAEAEALQQLVEAAMREIL LRSDVDALQQVVEEAVGQAL TRAEAELIMAIVEAAVNSVL TREEAGTVEQLVKGALQSFV TKEEAKWIHDAVSQEVAAIQ TRAEADHIGEIVRRVVLCVL TIEEATEIYETIVSRMPDGI ARAEVESIANIILEHANKIH
( ( ( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 1) 2)
AKVTLTGGFRRG ALLALTGGFRRG AILALTGGFRRG ATVTITGGFRRG ATVTLTGGFRRG ATVTLTGGFRRG ATVTLTGGFRRG CQIQLMGGFRRG VRVTMTGGFRRG STVVMTGGFLRG AQLTLIGGFRRY IQSCLVGGFRRG FQMVIVGGYRRG
( 4) HDVDLLITC ( ( 4) HDVDIIFTT ( ( 4) HDVDFLLTM ( ( 4) HDVDFIIKA ( ( 4) HDVDFLITH ( ( 4) HDVDFLITH ( ( 4) HDVDFLITH ( ( 4) HDVDFLITH ( ( 4) HDVDFLITH ( ( 4) HDVDFLISH ( ( 37) HDVDFLITH ( ( 4) ADVDMVLSP ( ( 4) GDVDVVLSH (
6) 6) 6) 5) 6) 6) 6) 6) 6) 6) 6) 6) 6)
-EGVLH-KAISKLD -ENLLL-AVIKSLE -EGLLL-HVIDRLK -DRILP-AVIKRFK -VGLLP-KVMSCLQ -VGLLP-RVMRCLQ -AGLLP-RVMCRLQ -EGLML-KIISWLE -NGLLR-KAVAWLD -KGILG-TLLQALT -VGLLP-KVVSLLK -KHLVD-VLLRILD -RGFVE-QIVVALE
( 78) ( 61) ( 63) ( 53) ( 61) ( 60) ( 59) ( 66) ( 64) (122) ( 91) ( 35) ( 62)
IRVDLVIVP VRVDLVSPP VRVDLVAPP VRVDLVAPP VRVDLVVTP VRVDLVVTP VRVDLVVAP VRVDLVVSP VRVDLVVCP RRVDFVIAP VRVDLVVSP RRVDIIVVP RRVDIIISP
( ( ( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2)
QFAYALLGWSG RYAFALLGWTG RYAFALLGWTG RFPYALLGWTG QFPFALLGWTG QFPFALLGWTG QFPFALLGWTG QFAFATLGWTG EYFYALLGWTG QYAFALLGWTG QFAFALLGWTG YIGSAVLGWSG TAGCAVLGWTS
467 440 444 429 438 437 436 449 480 515 351 437 530
[ [ [ [ [ [ [ [ [ [ [ [ [
525] 498] 501] 487] 496] 495] 494] 507] 538] 573] 451] 506] 602]
LVLHVNKIMEEYLRRHS--LVGHVYKIMEEYLRRHS--LVEVVKRLVKEHLRRHN--LVEHVNRLMEEYLRRHN--LVVHVNKLMSEYLRRHN--LVKNVNKLMEEYLRRHN--LVCNVNSLMEEYLRRHN--LVTNVNLLMEEYLRRHN--LVGYVISMLKEYLKRHN--DVINICKLLENILKTILVIK LAESIKKILEEILKILLIIK
( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 1) 1)
NSCICYGSYSLH NSCLCYGSYSLH KRCVCYGSYALH KSCLCYGSYSLH RTAICYGSYSLH KSCICYGSYSLY KSCICYGSYSLH KSCICYGSYSLY KSAFCHGSYSLH HECLVYGSFTCF NDCVAYGSFTCY
( ( ( ( ( ( ( ( ( ( (
8) 8) 8) 8) 8) 8) 8) 8) 8) 8) 8)
GDIDILQTGDIDVLQTGDIDMVQTGDIDILQTGDIDILQTGDIDILQTGDIDILQTGDIDIMQTGDIDMLQTNDIDLYSINDIDLYST-
( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
---NAR-IFLINIA ---NAR-TFLINIA ---NAR-PFLINLA ---NSR-TFMINLA ---NAR-LFLINLA ---NSR-TFLIDLA ---NSR-TFLIDLA ---NSR-IFLINLA ---NSR-TFLINLA --NAYK-LMIFFMS --DAYR-ILIFFMI
( ( ( ( ( ( ( ( ( ( (
31) 31) 31) 31) 31) 31) 31) 31) 31) 29) 29)
HVMDSFNIK HVMDTFNIR HILDSFNVR HIVDSFNVR HIIDSFNIR HIIDSFNIR HIIDSFNIR HIIDSFNVR HIVDSFQVS VLIDCIFLP FIIDCIFLD
( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2)
TMDKIPKIMID TMNMIPKIMID TLRALPTLLVD TISQVPKILVD TMQNVPKVLVD TMNVVPKIFID TMQHIPKVLID TMHSIPKILID IFDRIPKILIN IIDNIPKVMIN IINVINKSLIN
263 263 260 260 261 261 261 261 262 269 276
[ [ [ [ [ [ [ [ [ [ [
472] 472] 472] 470] 470] 469] 470] 470] 473] 571] 573]
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
19) 19) 19) 19) 19) 19) 19) 13) 17) 17) 21) 19) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16)
WIVLGMGKFGAR WILLGMGKLGAH LIILGLGKLGAG LIVLGMGKLGAR LFVIAMGKLGAR MVVLAMGKMGAG AFALAMGKMGAG YFVLTLGKHGTR MVVLGMGKLGAV MVILGMGKLGAV MVVLGMGKLGGR MLVLAMGKMGAE LLILGMGKLGGG LLILGMGKLGGG MLIIGMGKLGGG MLIIGMGKLGGG LYILGMGKLGGF LLILGMGKLGGR FFIFAMGKLGGK LYIFVMGKLGGR LVILAMGKLGGG LLILGMGKLGGR LMVLGMGKLGGQ
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5)
SDIDLIVFI SDIDLVVFF SDIDLIVFI SDIDAVVFF SDIDLIVFF SDIDLIVFF SDIDLIMLF SDIDLVIAY SDIDLIFGY SDIDLIFAY SDIDLIFAF SDIDLIFAY SDIDLIFAW SDIDLIFAY SDIDLIFTY SDIDLIFTY SDIDLIFTY SDIDLIFTY SDIDLIFTY SDIDLIFCY SDIDLIFTY SDIDLIFTF SDIDLIFAY
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
13) 13) 13) 13) 13) 13) 13) 13) 15) 15) 15) 20) 15) 15) 15) 15) 19) 15) 15) 15) 15) 15) 15)
VETFSR-LTRRLVR TELFSR-LTRRLVR VDVFSK-MVRRLIR TENFGR-MMRRLVR VATFSK-IVRRLVR QPFFVR-VTQGLAR RASLIR-AVRKMSS LKLFSR-IAQKLSN QEFFTR-LGQKLIK QEFFIR-LGQRLIK QQFFIR-LGQRLIQ QEFFTR-LGQKLIA AQFFTR-MGQRLIK AQFFTR-LGQRLIK AQFFTR-LGQRIIK AQFFTR-LGQRLIK GKFFTR-LGQRLIS SKFFTR-MAQRLIK QSFFTK-LGQRIIG QPFFIK-LGQRLIA QKFFQR-LGQRLIA QQFFIR-MGQRLVN QVFYTK-VAQKLIT
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11)
FRVDLRLRP FRTDLRLRP FRLDFRLRP FRTDLRLRP FRTDLRLRP FRVDLRLRP FRTDLRLRP FRVDLRLRP FRVDMRLRP FRVDMRLRP FRVDMRLRP FRVDMRLRP YRVDMRLRP YRVDMRLRP YRVDMRLRP YRVDMRLRP YRTDMRLRP YRTDMRLRP YRVDMRLRP YRVDMRLRP YRVDMRLRP FRVDMRLRP FRVDMRLRP
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22)
--QNWERAAMI --QNWERAAMI --QNWERAAMI --QNWERAAYI --RNWERAAMI --RTWERAAMI --RTWERAAHI --QNWERAAYA --RDWERYAMI --RDWERYAMI --RDWERYAMV --REWERFAMV --RDWERYAMV --RDWERYAMV --RDWERYAMV --RDWERYAMI --RDWERYAMI --RDWERYAMI --RDWERYAMI --REWERYAMV --RDWERYAMV --RDWERYAMV --RDWERYAML
286 289 281 289 291 289 276 263 280 278 276 287 259 259 264 263 280 253 261 261 273 308 257
[ 980] [ 986] [ 974] [ 988] [1007] [ 988] [ 947] [ 959] [ 982] [ 979] [ 966] [ 975] [ 440] [ 951] [ 949] [ 948] [ 981] [ 963] [ 946] [ 954] [ 957] [ 990] [ 931]
Group VII PF03296 gi|40556067 gi|9634772 gi|41018505 gi|88769941 gi|9628963 2ga9_D gi|115503107 gi|38229195 gi|115531718 gi|9631476 gi|9964352
156 156 153 153 154 154 154 154 155 159 166
Group VIII COG1391 gi|153009990 gi|13476391 gi|49474038 gi|17934889 gi|114706875 gi|90423354 gi|84684938 gi|83945609 gi|116053162 gi|70734014 gi|120553709 gi|90415513 1v4a_A gi|16120978 gi|84394152 gi|15642434 gi|42632038 gi|57012867 gi|119943974 gi|90407219 gi|149907758 gi|24375254 gi|77361510
145 148 140 148 150 148 135 128 139 137 131 139 119 119 124 123 136 113 121 121 133 168 117
LTDLAEACTGAAVRFLLLDA LSDLADACTRAAVDFLLRDA LTRLGEAALGVALRFLLREA LSEMADASLSAAIDHLLLSA LSRLAESAIRAALRFCLREA LTDLAVASVQCALRFLLRQE LTRLADIATDKALKFHVGRE LTDFADAAVQASLACAVRSH LSGLADACIDLACEWLHRRQ LSDLADACIDQAYQWLYLRH TSAFADTAIDGALNWLYERA LTWMAEAAITASLDRLYPIT LSYLAETLIVAARDWLYDAC LSTLAESMIIAARDWLYQVC LSMLAEAMIFETYQWQYDIC LSQLAEALIFESYQWLYQRC LSQLAESLIIAARDWLYHQA LSDLAESLILAARDWLFQRC ISYLADQLILQCMSWLYKKQ LSYLAEQLLEQSLNWLYLKM TSHLADRLIEGALDWLYQLQ LSALAEALVIGARDWLYKEM VSQLADNLIESANQWAYTQV
27
gi|114770623 gi|85713289 gi|54296704 gi|84356880 gi|121527807 gi|121606786 gi|120613288 gi|115423458 gi|30249308 gi|15609358 gi|118617029 gi|108800293
119 133 117 98 115 110 96 97 101 184 184 178
VSWLADTLILQAYEFAYAGF YSELADTLIIESLHWLEQRF WSDLADSIILHTLKYIGFTL MTDLAEVAVQRSLALLSAEL MTDLAEFAVRTAVSVIGQEL MTELAELALDVAMRHSREAL TTELAELALDEALRQARQDL MTSLADLSVAAAYRCVADEL MTALADTTIRFALEFLHTAM LADAADAALAAALRVAEASV LADIADAALAAALRLAEKTV LSDLADAALASALEVAMSSV
( ( ( ( ( ( ( ( ( ( ( (
15) 16) 16) 16) 17) 16) 16) 16) 19) 8) 8) 8)
LYILGMGKLGGK LYVLGMGKLGGR LYALAMGKLGGR LGVVGMGKLGGR LVVVGMGKLGGR MWVVGMGKLGAR LWIVGMGKLGAR MLIVGMGKLGGR LLIVAMGKLGGG LAVIAMGKCGAR LAVIAVGKCGAR LAIIAMGKCGAR
( ( ( ( ( ( ( ( ( ( ( (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5)
SDIDLIFVY SDIDLIFYY SDIDLIFAF SDIDLIFVY SDIDLIFLY SDIDLIYVY SDIDVIYVY SDIDLVMLY SDVDLIFIY SDVDVIFVA SDVDVIFVA SDVDVIFVV
( ( ( ( ( ( ( ( ( ( ( (
15) 15) 14) 16) 15) 18) 18) 14) 14) 5) 5) 7)
FLTPYKQAVEELKVKFKGTR FLTPYKQAVEELKVKLKGIR FLAPYQIAVDELKVKLKAIR FLVPYRQAVEELKVKLKGIR FLAPYKQAVEELKVKLKGMR FLGPYKQAVDELKIKLKGMR FLAPYKQAVEELKIKLKGMR FLSPYKQAVDELKIKLKGLR FLTPYKQAVDELKVKLKGMR FLSPYKQAVDELKVKLKGMR FLAPYKQAVEELKVKLRGIR FLAPYHQAVAELKVKLKGMR TLAPYKQVVEELKVKLKGMR FLDPYIQAVGELKIKLRGIR FLDPYVQAVGELKVKLRGIR FLDPYIQTVGELKIKLRGIR FLDPYIQAVGELKIKLRGIR FLDPYIQAVGELKIKFRGIR FLDPYVQTVGELKIKFRGVR FLLPYQQAVDELKVKLRGMR FLLPYSQAVSELKVKLRGMR FFIPYIQTVSELKVKLRGLR FLMPYEQAVGELKIKFRGMR FLMPYEQAVSELKIKLRGMR FLAPYEQTVSELKVKLRGIR FLNPYEQTVNELKLKLREMR FLWPYNEAANELKVKFRALR FLWPYNEAVRELKVKFRSLR FLWPYQQAVSELKVKFRSLR FLIPYEQAVEELKVKLRSIR FLIPYEQAVEELKVKFKSIR FRIPYEQAVEELKVKFKSIR FLMPYEQAVDELKVKLKSIR LLIPYDNAVEELKVKFKGIR TLIPYEQAVEELKVKFKSIR ILSPYEQAVDELLLKFKHII ILCPYELAVKELMVKFEHII ILAPYEHAVEELKIKFKNIR FMLEHRFGMDEIVTKLTILR -MATYQFAIMEIETKVSILQ
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12)
EFVTGRVKPVSS EFVTGRVKPISS EFVTGRVKPVKS EFVTGRVKPVAS EFVTGRVKPIAS EFVTGRVKPIAS EFVTGRVKPVAS EFVTGRVKPMTS EFVTGRVKPIAS EFVTGRVKPLAS EFVTGRVKPIAS EFVTGRVKSVAS EFVTARVKPIPS EFVTGRVKPIES EFVTGRVKSIES EFVTGRVKSIES EFVTGRVKSVAS EFVTGRVKPIES EFVTGRVKRRES EFVTGRVKPVDS EFVTGRVKPVES EFVTGRVKSQAS EFVTGRVKPVSS EFVTGRVKPIDS EFVTGRVKPVDS EFVTGRIKSVDS EFVVGRVKTVDS EFVIGRVKTVDS EFVVGRVKTVDS EFVTGRVKEISS EFVTGRVKEISS EFVTGRVKELSS EFVTGRVKEVSS EFVTGRVKKVSS EFVTGRVKKISS ENVTGRVKRISS EQVSGRVKSLPS EFVTGRTKKISS ESVSSRLKSPES EHIISRLKSPES
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
20) 20) 20) 19) 20) 20) 20) 20) 20) 23) 20) 20) 20) 20) 20) 20) 20) 20) 19) 20) 20) 20) 20) 20) 20) 19) 20) 20) 20) 20) 20) 20) 20) 20) 20) 20) 20) 18) 26) 22)
DLAGVRIVT DLAGLRIVT DIAGLRIMC DIAGLRIMC DIAGLRMMC DIAGIRLMC DIAGLRMMC DIAGLRLMC DIAGLRMMC DIAGLRIMC DIAGLRIMC DIAGLRMMC DIAGVRVVC DIAGLRVMV DIAGLRIMV DIAGLRIMV DIAGLRVMV DIAGLRIMV DIAGVRVMV DIAGLRIMC DIAGVRVMC DIAGVRIMT DIAGLRIMC DIAGLRIMC DIAGLRIMC DIAGVRIMC DIAGIRIMC DIAGIRIVT DIAGIRIMC DIAGIRVMC DICGIRIMC DIAGIRIMC DIAGIRIMC DIAGIRIMC DIAGIRIMC DIAGIRIMC DIAGIRIIC DIAGIRIMC DIAGVRVVC DIAGVRVVC
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5)
QQFFIK-VAQKLIH SVFFTR-LAQALIA QHYFSK-MVQTFVH QEYFTR-LGRRLIG HEYFTK-LGRRLIN HEYFAR-QVRAVFS QEYFGR-AVKAIHA HEFYGR-LTRRMMP HEFFVR-LGRKLIA -------RNARVAS -------ISIRVAG ----IR-TTTRVAG
( ( ( ( ( ( ( ( ( ( ( (
11) 11) 11) 11) 11) 11) 11) 11) 11) 10) 10) 10)
YRVDMRLRP FRVDMRLRP FRVDLRLRP FRVDMRLRP FRVDMRLRP FRVDLALRP FRMDLALRP FRTDLRLRP FRVDMRLRP FEVDAALRP FEVDAGLRP FEVDAALRP
( ( ( ( ( ( ( ( ( ( ( (
22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22)
--RSWERFALL --RDWERYAMV --RDWERYAMV --REWERYAWI --REWERYAWI --REWERFAWL --REWERFAWL --REWERYAWL --REWERHAWI --KTWEFQALL --KTWEFQALL --KTWEFQALM
258 273 256 239 256 253 239 236 243 299 299 297
[ [ [ [ [ [ [ [ [ [ [ [
954] 958] 913] 935] 947] 929] 918] 923] 929] 994] 995] 991]
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
207] 211] 216] 211] 212] 216] 222] 211] 211] 223] 221] 212] 202] 226] 221] 223] 241] 221] 226] 220] 204] 191] 223] 224] 234] 214] 212] 210] 208] 262] 262] 262] 277] 268] 267] 266] 268] 270] 247] 223]
Group IX COG2357 gi|56964285 gi|15615412 gi|68056392 gi|16078225 gi|89098121 gi|149182745 gi|116872367 gi|73663088 gi|15923996 gi|126654171 gi|138894381 gi|152974726 gi|23098675 2be3_A gi|157149687 gi|15675105 gi|55821443 gi|24379484 gi|116511221 gi|28378821 gi|116492964 gi|116617790 gi|116494409 gi|81429053 gi|69250163 gi|90962279 gi|104773694 gi|58336971 gi|42518748 gi|28210095 gi|15893930 gi|118444636 gi|150019412 gi|106895006 gi|150391408 gi|156865648 gi|153810855 gi|126697917 gi|119714541 gi|119716419
6 7 9 9 7 7 7 7 7 7 17 8 6 11 6 8 25 6 14 7 8 6 8 8 8 6 10 8 7 9 9 9 23 10 9 9 9 9 16 1
IDKVIELMRAR IETVVQLIRSR IIHVLELLRSR IQIVKEMLFAR IKLVVEMLRRR IHAVVNLLRAR IEVVVRLLRQR IDIVVNLLRQR IDVVVNILRQR IATVTELIRQR IKTVVKLLRQR IKPVVEYLRKR IYTIVDMLHHR VKEVVDILHKR ITEVLKVLHQR VEEVLALLRQR VDEVLDLLRHR VNDVLELLRQR VWDVLELLRKR IYQVVDLLRKR IYEVVDLIRKR IYTVVDLLRQR IYQVVALLRQR IHEIVGLLHQR IHQVVEIIRKR IYQVVSDLRKR IYRVVDLLHAR IYKVVDLIHAR IYRVVDLIHER IEKVVDIIRNR VTKVVNIIRSR IYRVVDLIRKR IDTVVQILRER IYTVVDYIRER IYTVVDYIRER IYKVVEIIRNR IDTVASIIRSR IYAIVDLIKVR VYRVFDMLVEQ VYRVQQVLCSQ
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 41) 41) 39) 39) 39) 39) 39)
VLVELQIRT ILVELQIRT TLVEIQIRT VLVEIQIRT ILVEIQIRT ILAEIQIRT ILAEIQIRT ILAEIQIRT ILAEIQIRT VLAEIQIRT ILAEIQIRT VLVEIQIRT VLAEIQIRT ILAEIQIRT VLAEIQIRT VLAEIQIRT ILAEIQIRT IMAEIQIRT VNAEIQIRT ILAEIQVRT ILAEIQVRT VLAEIQVRT ILAEVQIRT ILAEIQIRT ILAEIQIRT IKAEIQIRT LLAEIQIRT LIAEIQIRT IIAEIQIRT ILAEFQIRT IIAEIQIRT ILAEFQIRT ILAEFQIRT ILAEIQIRT ILAELQIRT IQAEIQIRT LQAEIQIRT IICEIQIRT VLVEVQFRT VPVEMQFRT
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2)
MNFWATIEHSL MNFWATIEHSL MNFWATIEHSL MNFWATIEHSL MNFWATIEHSL MNFWATIEHSL MNFWATIEHSV MNFWATIEHTL MNFWATIEHTL MNFWASIEHSL MNFWATIEHSL MNFWATIEHSL MNFWATNEHSL MNFWATIEHSL MNFWATIEHSL MNFWATIEHSL MNFWATIEHSL MNFWATIEHSL MNFWATIEHSL MNFWATIEHSL MNFWATIEHSL MNFWATVEHDL MNFWATIEHSL MNFWATIEHSL MNFWATIEHSL MNFWATIEHSL MNFWATVEHTL MNFWATVEHTL MNFWATVEHTL MNFWATIEHSL MNFWATIEHSL MNFWATVEHSL MNFWATIEHSL MNFWGTIEHSL MNFWAIIEHSL MDFWATIEHSL MNFWATIEHSL MNFWATIEHSL MDFWASLEHKI QDFWASLEHKI
155 156 158 157 156 156 156 156 156 159 166 157 155 160 155 157 174 155 162 156 157 155 157 157 157 154 159 157 156 158 158 158 172 161 160 158 158 156 171 151
28
gi|50955835 gi|145295317 gi|111025072 gi|157694241 gi|154687967 gi|52082365 gi|54023222 gi|106893514 gi|150389950 gi|15614448 gi|15426419 gi|56964303 gi|126654197 gi|153815059 gi|106886023 gi|116872180 gi|150017195 gi|15896583 gi|154483555 gi|89896553 gi|156865437 gi|153813047 gi|153855160 gi|116511698 gi|89206308
38 30 27 21 21 29 29 22 23 21 22 21 20 23 16 15 34 25 28 38 40 29 39 21 14
FIMPYKFGIDEISTKVSILR LMLNYQFGIDEILTKINILK FVLPYQCAIATLTTKVQILR ELLVYKFALDEMDTKFSIIS ELLVYKFALDELDTKFSIIS ELLVYKFALDQMDTKFSIIS FMLGYKFAIDEITTKINILR FLMVYKFGLDEMNTKIHILR FMMSYKFAMDELNTKIDILK FMMSYKFALQELNTKIDILK FMMMYKFALDEMNTKINILQ FMMMYKFALDEMETRISILQ FFLAYKFALQEVETKINILQ IIFLYNAALKEVGTKLEILN ALLIYDAALKEVNTKLEILN VMLLHRFALEEVNTKLKILN TLLVYRSAIKEVKTKLDILD MIMKYSAAIKEVKTKLEILD LMTYYECAILEVKTKLDVLN MQQVYSAAIREVSTKLEILD LMMMYSCAIKEVQTKLDVLN LMMMYRCAIREIQTKLEVLD QMSYYQCAIMEVETKFKVLN KLVRYECALDVVRTQLSNLN FVLPYTFALEELKTKFEIMN
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12) 12)
EHISSRLKSAES EHVSSRLKTPES EHVSARVKSPDS EHTKSRIKSFES EHTKSRVKSFES EHTKSRVKSFES EHVRSRLKSVES ENVKSRLKSPES EHVESRLKTPES EHVSSRVKSPES EHTKSRLKSPES EHVKTRLKSPES EHISTRVKSPKS EHIKTRIKTPES EHITSRVKTPQS EHLKSRVKSLES EYMKSRVKTPSS EYMKDRVKDPKS ETIKTRIKSPAS HHMESRLKQPQS EFIQSRIKKPVS SFIKTRIKKPNS EDIKSRVKSMES EHIKHRLKSPES EHIKTRLKQPES
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22) 22)
DIAGIRIVC DIAGIRIVC DIAGVRITC DIAGVRIIC DIAGVRIIC DIAGLRIVC DIAGVRVVC DIAGIRITC DIAGLRITC DIAGIRITC DIAGIRISC DIAGIRITC DIAGARITC DIAGVRLIC DVAGIRIIC DIAGIRITC DIAGIRVIC DIAGIRVVC DVAGIRVIC DIAGIRVIC DVAGLRVIC DVAGVRIVC DIAGVRVIC DIAGIRIIC DIIGIRITC
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5)
VYRVFQLLTAQ AYAIADMLTNQ TYRVADLIGGQ IYNIIDVLRQR IYNVAEVLKQH IYNMVEVLKEH ITTIRDMLVSQ IYKISEMLQKQ IYELSEMLRNQ IYTLSEQLMQQ IYIISEMLQKQ IYRLEEMLRKQ IYKVSAMLQAQ IYRLAEMIGNQ IYRIADLIRKQ IFRIHEMLAGQ IYDIANMLIRQ IYEIADMLKNQ IYKVADMLTNQ IYTIAEVLLKQ IYAVAEKLISQ IYMISDLITQQ IYMLADCLLSQ IYEIVEIIKSQ IYHLKEVIENR
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 39)
KRREREALVDDIVTKIKSYT KRREREALVDEVVTKLEEYT KRAEREEYLDEVMTGIREKL KRDARERYLHDVIDGVNENL KRDEREAYVAEAVEDIRLAT KRSEREGYIADAISTLKQTL KRSEREAYIANAIDKIQTEM KRSEREEYIQNAINQIQSDL RRVERIAYIELIVNELTKSL KRKDRENFISDIIETLREKL RLEHREEFMSQLIAEVAGYI RKSERQEFVNNIVKQVKQHM KRAEREVQINEVISQLSKRL KRRERENYVNDLVEQIKVGL RRDARERLIHRVIQRLRQVL RAPKRDEYLAVVTDEVQQDL RAPAREEYLATVRASVEGDL RAPSRDQFLAEVIAQVEADL RAPSRDRYLKEIIDQVTGGL RRDEREAFISARAAELKRYL KRSEREQRLGVTVGLLNERL KRSDREKRLDNTLNLMKENL KRTAREEKLARATNMLQERL KKKERESYVDEVRKIIVDKL TAKERDKYIHEVSKVLAGKL KREKRDEYTEEVKSIIQAKL KRKEREKRTNEYISLLKSAL
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 7) 7) 7) 6) 6) 6) 6)
GDVYGRPKHIYS GKIYGRPKHIYS PEISGRPKHIYS ADISGRPKHIYS AEIYGRPKHIYS YEIYGRPKHIYS GEISGRPKHIYS GDINGRPKHIYS SDIEGRPKHFYS NEITGRPKHFYS ATIDGRVKHYFS AQVDGRIKHFFS ADISGRPKHFYS ADIKGRPKNIFS ADITGRPKHIYS ATVTGRPKHYYS ATVTGRPKHYYS ASVTGRPKHYYS AGVLGRPKHYWS AEVRGRVKHFYS CEVSGRPKHLYG FEITGRPKHLYG QDISGRPKHLYS GEVFGRSKHLYS VDVTGRAKHLYS GVVEGRAKHFWS AIVEGRYKHYYS
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16)
DLIAIRCVM DLIAIRCIL DLLAVRVVV DLLAVRIVV DLLAIRVIV DLLAVRVIV DLLAIRVIV DLLAIRIIV DLTAIRILV DFMAVRVIV DVFAIRIIV DLFAVRILV DLTAVRVIV DKIAIRVLV DQLAVRVIV DLVGIRVLV DLVAVRVLV DLVGIRVLV DLVGIRILV DLAGLRVIV DVAALRIIT DVAALRIIV DLAALRIIV DLIAIRVMV DVIAFRVLV DLTAFRIIL DLIALRVIV
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 7) 5)
VYAMVG-YIHELWR VYAMLG-YVHEFWK CYAVLG-IIHTCWK CYAVLG-IIHTRWK CYAVLG-AIHTRWT CYAVLG-AVHTKWK CYAILG-LVHTLWK CYAVLG-LVHTLWK CYAALG-IVHTMYK CYGVLG-VVHTIWK CYGALG-VIHEKYT CYAALG-VIHEMYK CYGALG-IIHTMWK CYGVLG-IIHTMWK CYRVLG-LVHATWP CYAALG-TVHARWN CYAALG-AMHARWT CYTVLG-VLHSRWN CYAAIG-VVHSLFN CYGALG-VIHSVWK CYRALA-VVHDTFR CYKALA-VVHDTFR CYRALA-VVHDAFR CYEVLG-LIHSTWK CYATLG-VIHSQWT CYEALS-IVHSLWI CYTVLG-IVHNIWK
VPVEIQLRT VNVEVQIRT VPVELQIRT VKVEIQVRT VKAEIQIRT AKVEIQIRT MPVEIQIRT TYVEVQIRT VYVEVQIRT VYVEVQIRT VKVEVQIRT VFVEIQIRT VYTEIQIRT TKVEIQIRT TKVEIQIRT MTVEIQIRT VRVEVQIRT VRVEVQIRT MRVEVQIRT VPVEVQIRT MRVEVQIRT MRVELQIRT MTVEIQLRT CRVEIQLRT VFAEIQLRT
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2)
MDFWASLEHKI MDFWASLEHKI MDFWASLEHKI MDFWASLEHKI MDFWASLEHKI MDFWASLEHKI MDFWASLEHKI MDFWASLEHKI MDFWASLEHKI MDFWASLEHKI MDFWASLEHKI MDFWASLEHKI MDFWASLEHKI MDFWASLEHKI MDFWASLEHKM MDFWASLEHKI MDFWASLEHKL MDFWASLEHKL MDFWASLEHEI MDFWASLEHRL MDFWASLDHQL MDFWATLEHQL MDFWASLEHKL MDFWATLEHKV MDFWASLEHKL
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6)
189 181 178 172 172 180 180 173 174 172 173 172 171 174 167 166 185 176 179 189 191 180 190 172 165
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
240] 251] 286] 210] 210] 219] 254] 253] 246] 251] 245] 253] 243] 239] 313] 212] 226] 217] 221] 231] 248] 226] 246] 224] 216]
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
393] 740] 727] 738] 737] 749] 729] 729] 731] 724] 759] 776] 726] 734] 771] 847] 766] 743] 760] 742] 776] 769] 757] 716] 724] 723] 751]
Group IX KOG1157 1vj7_A gi|111657862 gi|75760574 gi|14325225 gi|29376496 gi|42519264 gi|70726288 gi|73662434 gi|28211806 gi|106895755 gi|154482685 gi|153810165 gi|89895197 gi|114566344 gi|156741205 gi|1515319 gi|84496349 gi|119716619 gi|62390535 gi|108804177 gi|78185706 gi|123965455 gi|23124721 gi|78223528 gi|50403843 gi|85859448 gi|15643492
210 210 210 210 210 212 210 210 203 204 233 247 203 204 256 307 239 218 235 222 251 245 226 205 205 205 259
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26)
GPKG-PIEI GPKG-PIEF GPKGDPLEV GPQGEPLEV GPKGNPVEV GPGGKPLEI GPNGDPLEI GPNGDPLEI GPQGKPFEI GPQGDPFEI GPNGQPFEI GPNGQPFEI GAHGEPFEI GRGGEPFEV IPGGQPCEI GPGGKPVEL GPEGKPVEI GPQGKPVEL GPGGKPLEV SNEGKLLEI GRH-RPIEV GKH-RPIEV GLTGRPLEV GPHGERMEV GPGRERIEI GPYGQRVEV TGYGEPLEI
MHQVAEYGVAA MHEVAEYGVAA MHEIAEFGIAA MHQIAEYGVAA MHQIAEFGVAA MHQVAEYGVAA MHEIAEHGVAA MHEIAEHGVAA MHKTAEYGIAA MHQIAEYGIAA MHKTAEYGIAA MHKTAEYGIAA MHRTAEYGIAA MHRTAEYGIAA MHEIAERGIAA MHRRAEYGIAA MHRRAEYGVAA MHRRAEYGVAA MHYNAEFGIAA MDRTAEYGIAA MHQVAEFGIAA MHQIAEFGIAA MHHIAEYGIAA MHGVAEAGIAA MHRVAEQGIAA MHEWAEEGIAA MHREAEYGLIA
341 341 342 342 342 344 342 342 335 336 365 379 335 336 388 439 371 350 367 354 383 377 359 337 337 339 391
29
gi|88797492 gi|28867313 gi|121997760 gi|146309756 gi|91224995 gi|117922244 gi|148827158 gi|134301665 gi|39935759 gi|71083748 gi|145336732 gi|125603648 gi|125605857
205 205 224 205 205 205 205 205 239 206 402 485 820
QIGARKETVAHIEGALQARL ARGNRKELVNKIEESLSHCL QRGRRGVILDTVCDQVRTRL ARGNRKEMIQKILSEIEGRL ARGNRKEMIQRIHSEIEGRL ARGNRKELIQGIEAAVLTRL ARSNRQDLIERISQEIKVRL VEKNKEKVFYEVKEALAEKL LAERNRNLIGEIESQLSANL IKEDKVNSFNSISLQLSELL SFD--EAMITSAIEKLEQAL SFD--EALLTSTLDKLDKGL SFD--EVLITSAVDKLDRGL
( ( ( ( ( ( ( ( ( ( ( ( (
6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 7) 7) 7)
ANVNGRQKHLFS GDVSGRQKHLYG AEVVGREKHLWS CRVSGREKHLYS ARVVGREKNLFS GKVKGREKNLYS ARVWGREKHLYK EDIKARKKTLYS AEVTGRRKRPFS AEIFGREKTPFS HVVSGRHKSLYS HSLSGRHKSLYS HNLSGRHKSLYS
( ( ( ( ( ( ( ( ( ( ( ( (
16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16)
DVYAFRIIT DVYAFRIIV DVYGFRVVV DIYAFRVIV DIYAFRVVV DIYAFRVIV DIYAFRVIV DMYAYKIIV DIYGFRVVL DIIGFRVIL DIHGLRLIV DIHGLRLVV DIHGLRLVF
( ( ( ( ( ( ( ( ( ( ( ( (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5)
CYRILG-IVHNLYK CYRVLG-AVHNLYK CYRVLG-ILHGLYK CYRVLG-QMHSLYK CYRVLG-QVHSLYK CYRVLG-AMHGLYK CYRVLG-QMHNLYK CYVALG-KVHELYK CYRALG-VVHTTWP CYKALG-IFHQHWN CYKALG-VVHKLWS CYQALD-IVHKLWP CYRALD-VVHELWP
( ( ( ( ( ( ( ( ( ( ( ( (
26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26) 26)
GANGVPMEI GMHGVPIEI GPHRIRLEA GPHGVPVEV GPHGVPVEV GPHGVPVEI GPKGVPVEV GPYNIPLEI GPGQQRVEL GPNRRPIEI GDGTIPLEV CEGIHPFEV SENVHPFEV
( ( ( ( ( ( ( ( ( ( ( ( (
6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6)
MEDMANHGIAA MEEMANNGIAA MHAVAESGIAA MDQMAEMGVAA MDQMADKGVAA MDQMADKGVAA MEQVAEMGITA MDRQAEYGIAA MNQIAEYGIAA MHEFAERGIAS MHLQAEFGFAA MHLQAEYGFAA MHLQAEYGFAA
337 337 356 337 337 337 337 337 371 338 533 616 951
[ 700] [ 701] [ 729] [ 703] [ 706] [ 700] [ 704] [ 704] [ 760] [ 581] [ 715] [ 901] [1130]
EYLRVLRKIYDRLKNE---EYLCVLRKIYDRLKSK---AHLKVLRKLYERLKDS---RHLRALRAVYEALSGR---IYEEAFRYFLNKIINI---PWEQALTEFCRKAQGT-----FRTLSNIGENLNKE-----LKALSYIGNKFNDL-----ESVISLLSPLFASA---KKMDVLQKIAAEFNKD----
( ( ( ( ( ( ( ( ( (
1) 1) 1) 1) 1) 1) 1) 1) 1) 1)
VNWVVTGSLSFA VNWVVTGSLGFA VNWVVTGSLGFA VRWALTGSLSFA IDWALTGSYRLY IDWWLTGSCAAC ILWAVGASIVLN IVWGAGGSVLLS VRWGIGGSVLLA LLWAIGASLLLY
( ( ( ( ( ( ( ( ( (
8) 8) 8) 8) 9) 8) 8) 8) 8) 8)
HDIDIQTDHDIDIQTDHDIDIQTDDDIDIQTDSDIDIITDHDVDIMVDNDIDLLVDNDISIFVDNDVDLLFHHDIDIMVD-
( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
-EEGAY-EIERIFS -KAGAY-EIECLFS -KEGAY-EIERLFS -REGAY-EIQRALH -NKGIE-LISSIFH -SRCID-EITEVFS -VQDME-KADRILQ -VNDII-KIDEMFK -PDDFA-VAERILK -EKDVE-RCKNILL
( ( ( ( ( ( ( ( ( (
26) 26) 26) 28) 26) 25) 26) 26) 26) 26)
IKVEIMGDI IKVEIMGDI IKVEIMGDI VRVEVMGDV TTFDIMSDV -RIDIASDP FDVDVMAGF SEVNIIAGL VELDLIVDF VEIDVIAGF
( ( ( ( ( ( ( ( ( (
35) 35) 34) 35) 33) 36) 35) 35) 35) 35)
YEYQAYLK--YEYQAYLK--YEYQAYLK--YEAAAYRR--SELELCNH--LQLNVNRR--DWYILYQL--DWYVLYQL--DWYVFYLL--LWRQYYEW---
151 139 138 141 138 145 134 134 135 175
[ [ [ [ [ [ [ [ [ [
Group X 2EWR 2ewr_A gi|157364415 gi|14520626 gi|108803471 gi|91214520 gi|21226634 gi|158321085 gi|150017300 gi|68053742 gi|160941021
17 5 5 5 5 12 2 2 3 41
170] 168] 157] 168] 170] 161] 190] 181] 185] 200]
Group XI COG2320 gi|49082092 gi|15807518 gi|115374018 gi|115371899 gi|157403336 gi|118728110 gi|138893985 gi|124520550 gi|89896432 gi|15614451 gi|56965735 gi|134100148 gi|30249891 gi|84496903 gi|119497481 gi|121703443 gi|83774858 gi|67538794 gi|115402269 gi|46139919 gi|108758453 gi|118471248 gi|62425330 gi|21225492
28 37 24 45 125 37 37 36 45 39 39 39 42 33 30 68 33 34 35 37 39 39 54 32
-DP-AWPQHFAAEAEAIRTAL -DPGRWAARFDRHRWRIRLAL -DP-RWPETFEAEKHRIHGAL -DP-SWPVLFERERQRLQKAL -DP-AWPQAFGDEKQRLLAAL -DP-SWSDLFEQEANRIRSVL -DP-RWPELFEREAKRIRSVL -DP-NWPKLFEQEADRIRSIL -DP-GWPKAFEREAHRIRSIL -DP-RWPKLFDREAKRIRSVL -DP-EWPNQFDREASRIRSVL -DP-RWPELFRREADRVRGAL -DL-RWPSMFSVAADKIRSAL -DP-LWQSLFASHAAAIRSAL -DP-SWPDAFASIAQRIKSAL -NP-LWPAHFSGIADRIRAAL -DP-TWPESFAIIARRIADAL -DP-AWPAAFAVIEARIKAAL -NP-EWKPRFQDISQKIQDAL -DP-SWPHHFDLAKTRIETAL -DA-GWPTRYAELAVAIQGVM -DP-RWPARYRRLAEQIRAAL -DS-SWNEQYSHLSELILSAL -DA-RWAETYLRHRRRILDAL
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
4) 2) 5) 5) 5) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 3) 4) 5) 4) 4) 4) 4) 4) 3)
LGIEHVGSTAVP ARTEHIGSTAIG VDIQHIGSTSIA IDIQHVGSTSIP AGIEHIGSTAIP LQIEHVGSTSVP LQIEHVGSTSVP LQIEHVGSTSVP IQLEHVGSTSVP LQVEHVGSTSVP LQLEHVGSTSIP LRVEHVGSTSVP LLVEHVGSTSVP CAVDHVGSTSVP LSVSHVGSTSIP LAIHHVGSTSVF LSIEHVGSTSVP LYIQHVGSTSVP LSVAHVGSTSVP VSINHVGSTSVP MQVEHVGSTAVP LDLDHVGSTSVP LGIEHVGSTSVP VDVEHIGSTSVP
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5)
P-IIDILLLP D-VVDILVGE N-IVDLAVAV N-IVDLAVAL D-ILDIVVAM P-IIDMLLVV P-IIDMLLVV P-IIDMLLVV P-IIDMLLVV P-IIDILLVV P-IIDILLVV P-VIDMLLVV L-IIDMLLGV P-IIDILLQV A-VIDVDLVV A-VIDIDLVV A-VIDVDVLV A-VIDVDVVV D-VIDVDVVV R-VIDIDLTV P-IIDIDLIV D-VIDIDLTV P-IIDIALVV P-IVDIVVAV
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5)
PQR--L-VA-P-LEG--L-A------AQ-A-LVS--A-SVP--Q-TE-A-LLG--L-TDS--Q-KE-R-LES--L-IER--V-AR-T-LVV--L-ELS--Y-VP-A-LES--A-EPS--Y-VP-A-LEA--A-EPS--Y-VP-V-LES--A-EMS--Y-VP-A-LEA--A-ETT--Y-VP-D-LEK--V-ETT--Y-VP-N-LEK--A-EPS--Y-VP-A-LEA--A-EKS--Y-VL-P-LEQ--Q-EAA--Y-VP-A-LTG--L-EDE--Y-VP-A-LEA--A-EAE--F-VP-A-LQA--A-EDS--Y-VP-A-LEA--A-EES--Y-VP-A-LEN--A-EET--Y-VP-A-LEA--A-EAS--Y-VQ-A-LEN--A-EDD--Y-VP-A-LES--L-ESA--Y-VP-A-LQN--L-EIA--Y-RP-H-LHS--I-EED--Y-LD-A-LLA--A--
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
26) 21) 25) 23) 25) 21) 21) 21) 21) 21) 21) 21) 23) 26) 21) 21) 21) 21) 21) 22) 21) 21) 29) 18)
RT--HH--VHVMP RE--AV--VHVVI GA--FV--VHVCG TA--LV--VHLCD RS--FV--VHTCA TD--IN--LHVFS TD--IN--LHVFS TD--IN--LHVFS ID--IN--LHVFS TD--IN--LHVFS TD--IN--LHVFS TA--VN--LHVFR ME--WH--LHVFS HD--IN--LHVLS PY--AN--VHVFG PY--AN--VHVWG PY--AN--IHVFG PY--AN--IHVFA PH--AN--IHVFG VP--TN--LHVWG PR--VN--LHVFG PR--AN--LHVFG PL--CN--LHVFG RD--VH--VHVYE
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 3) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1)
---AQADRYLL GG-DIWHQRLL QS-PWFSHLIH RA-AWYTNLLH DN-PRLAEIKN GT-SEIDRMLR GT-SEIDKMLR GT-PEIDRMLR GT-SEVERMLR GT-SEIDRMLR GA-SEVTRMLR GC-AEAARMLR EC-EEIDRMLA GA-QEIARMLG GA-AELSRHQR DA-VKLIRHQR NS-AEHVRHRL DS-PELVRHRL DS-PELVRHRL DC-PEAARHKI DC-PEVIRHLM DC-PEVVRHRM RC-PEVARMRM GA-AAVHEYLL
136 138 134 153 235 142 142 141 150 144 144 144 149 145 135 172 138 140 140 143 144 144 167 133
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
243] 188] 190] 216] 293] 406] 196] 193] 209] 195] 196] 196] 217] 198] 194] 236] 203] 266] 284] 215] 199] 198] 225] 186]
30
gi|152966703 gi|148271835 gi|119856079 gi|26989704 gi|152985589 gi|70729277 gi|126739417 2nrk_A gi|153938414 gi|126179943 gi|153814077 gi|73663616 gi|66824357 gi|147668808 gi|157403583 gi|16079424 gi|154686611 gi|52080887 gi|148378927 gi|153940685 gi|119487289 gi|67078121 gi|68053563 gi|89099038 gi|149181650 gi|54295912 gi|89207366 gi|152976508 gi|29653728 gi|73670296 gi|76786937 gi|150424973 gi|87121950 gi|91794109 gi|126175339 gi|88800339 gi|152970272 gi|127512637 gi|156934016 gi|114570307 gi|71282203 gi|88797828 gi|88799864 gi|153831951 gi|75762695 gi|52142732 gi|89207408 gi|149181928 gi|157150164 gi|24379783 gi|126698187 gi|148323409 gi|153940414 gi|2498428 gi|149179944 gi|153834620 gi|89100697
65 32 11 11 11 11 8 12 9 25 11 9 12 11 11 160 160 159 16 16 10 10 10 15 11 17 10 10 14 11 12 12 12 9 9 11 10 11 13 9 16 16 16 28 15 15 15 15 25 25 25 25 25 15 15 16 16
-DE-RWADTYRWHRARIEEAL -DH-GWADSYLAHRRRILEAV -NP-TWPARFLADKPLIASAF -DP-TWPARFLTDKPLIASAF -DE-RWPIRFQVARAQIAGAF -DD-HWPAAFAGEKARIAKGF -DP-TWPAQAQVEADRWRGAD -QP-AWVEQFEEEAQALKQIL -NS-EWPNLYLEEAEKIKNIL -DP-VWPDLFRTEARRIEDIL -NP-LWPKKYEEEALLIKDIL -QK-QWIMQFENEKQKILSIL -DP-RWKELFLEEEKQIKEVI -NP-CWAEQYDAEAVRLQALL -NP-RWITLYEAEAERLRKLL -NE-KWAECFDEEKERLKLVF -KP-EWKEMFEREKADLIRIF -KE-EWKGEFFKEKKQLEALF -NP-RWKIGYEKEANKIYNIM -NS-KWKIEYEKEADKIYNIM -NP-NWKQDFQKESQQIANIF -NE-EWSKMFEEEATEINAIF -QS-SWPDAFQQAKEQLETIF -SR-SWADSFQEEKQLLAAVF -NP-QWKEQFFYEKERLDKVF -DA-NWSMQFEQEAERIKKAL -EN-HWSEKFQMEAERLKSAM -EN-HWGEKFQAEAKRLKEAM -DP-DWSNQFEKEALLLKLIF -NE-VWVDLYKIEKELLESIF -NN-KWPEKYQEIKEELTKLL -QS-SWADDFNREKALICNAL -NP-NWVNEFEKEKALILSQF -RL-GWRKDFEIEKEVLIKYF -NS-VWPQKYESEKKLLLSVV -DP-AWKADFQIEKRLLQHFL -DE-MWPTLFENERTLLQMTL -QM-SWHQAFKVEKAQLLTAL -NP-AWPAQFAEEEKRVREVL -DP-GWPARFTDERKRLQRAL -DP-NWKNIFEIEKVALTQAI -SE-QWPIEFEKERSVLLELT -ST-KWPKAYEQERLRLLNVA -VH-TWSAAYDLIVSAIAPNL -TE-EWEIEFMKEKQIIEEQI -TE-EWELEFRKEKQFIEKQI -SE-EWEFEFSKEKQLIEEHI -TE-EWEMEFLSEKEMIEDKL -QL-EWKDWYEEERLRLLSFL -KE-EWEDWYEEEKERLLHLL -KE-CWKNWFKEEQYRLTGIL -KD-YWKEWYREEESLLKKLL -NP-MWEKWYLEEKKLIMSII -SE-NWKRLFHKEKSLLETII -SK-DWADLFQKEKQVLMEIL -DP-AWRQEFVRCRFELALMT -SP-GWKTEFQKVRKNLLENT
( 4) VEIEHIGSTSVP ( ( 4) LAVEHIGSTSVP ( ( 4) IAIQHIGSTSVP ( ( 4) IAIHHVGSTSVP ( ( 4) LAIHHVGSTAVP ( ( 4) MDIHHVGSTAVP ( ( 4) IAVHHIGSTSVP ( ( 4) LKVEHIGSTSVP ( ( 4) VDIYHIGSTSVV ( ( 4) VQIFHIGSTSVP ( ( 4) VAIYHIGSTSVE ( ( 4) IEIHHIGSTSVP ( ( 27) VNVFHCGSTSVP ( ( 4) LEIHHIGSTAVP ( ( 4) LEIHHIGSTAIP ( ( 4) IAIHHIGSTSIP ( ( 4) HSVSHIGSTSIP ( ( 4) LHIHHIGSTSIP ( ( 4) IKIYHIGSTSIE ( ( 4) VQIYHIGSTSIE ( ( 4) IDIHHIGSTSIS ( ( 4) INIHHIGSTAIP ( ( 4) LAVHHIGSTSVP ( ( 3) APIHHIGSTSIP ( ( 3) AAIHHIGSTSVE ( ( 4) IEIHHIGSTSVP ( ( 3) VKVHHIGSTSVP ( ( 3) VKIHHIGSTSVP ( ( 4) VAIHHIGSTSVP ( ( 4) IDIQHFGSTSIK ( ( 4) IECHHFGSTSIK ( ( 5) VEVHHIGSTSVE ( ( 11) FNIFHIGSTSVV ( ( 4) SEIHHIGSTSVI ( ( 4) DNIHHIGSTSIE ( ( 4) CQIHHIGSTSIP ( ( 4) SRIHHIGSTSVP ( ( 4) VSLEHIGSTSVP ( ( 4) LAVHHIGSTSVP ( ( 4) ARIEHVGSTSVP ( ( 4) VKIDHIGSTSVI ( ( 4) SRIEHIGSTSVP ( ( 4) TRIEHIGSTSVP ( ( 2) VAIYHIGSTAIP ( ( 4) LAIHHIGSTSIP ( ( 4) LAIHHIGSTAIP ( ( 4) LAIHHIGSTSIP ( ( 4) VAVHHIGSTAVK ( ( 5) VCLSHIGSTSVE ( ( 5) KSISHIGSTAVP ( ( 5) VRISHIGSTSVS ( ( 5) IRISHIGSTAIN ( ( 5) ERINHIGSTSVN ( ( 4) KDIQQFGSTAIK ( ( 4) VDFEHFGSTAIC ( ( 4) TQIEHIGSTAIM ( ( 4) SHIAHIGSTAIT (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5)
P-IVDIVVAV P-IVDVVVVV P-EIDVLIEV P-ELDVLIEV P-EIDLLVVV P-EIDLLVVV P-IIDLLPEF P-IIDFLVIV P-IIDIMPVV P-IIDIMPVV P-IIDIMAAV P-IIDILPVV P-VIDILLVV P-IIDIMPVV P-IIDIMSVV P-IIDMLIEV P-VIDILAEV P-IIDILIEV P-IIDILVEV P-IIDILVEV P-IIDLLVVV P-IIDILIEV P-ILDILPVV P-IIDILAEA P-IIDFLIEV P-IIDMIPVV P-IIDMIMEV P-IVDMIMEV P-TIDIILEV P-IIDIMIVV P-IIDVLIFV P-IIDILLEV P-IIDILLEV P-IIDIILEV P-IIDIIIES P-IIDMLLEC P-VIDILIEV P-IIDMLLEV P-VIDMLIEV P-VIDILVGT P-IVDILIEV P-IIDIGIEV P-IIDIGLEV P-IIDIGIAY P-IIDIAIEL P-IIDIAIEL P-IIDIAMEL P-IIDIAVEI P-IVDIMLEI N-IVDILLEV P-IVDILIEI P-TIDILIEI P-TIDILLEI P-IIDILVGV P-IIDILAGV P-IIDMVLGI P-VIDVMAAV
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 6) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5)
EED--H-LD-A-LLA--A-EED--H-VE-R-LVA--A-ASA--R-DE-V-LMG--L-TST--R-DE-V-LMA--L-EVR--R-DE-A-LRD--L-EAA--R-NA-F-MAT--L-MDA--A-RP-R-IEA--L-VDL--L-QW-E-FER--I-VDK--Y-NK-E-FED--L-VDE--R-SG-Q-FEA--I-VDD--A-AE-A-FSK--I-IDH--F-NE-A-MAN--I-LDE--M-NE-R-FES--S-IDR--L-QK-T-FER--A-VDL--L-QK-T-FEQ--A-VSQ--F-DE-Q-MKA--N-ADR--F-SP-Q-LEA--L-AAG--F-EK-G-MKM--L-VDN--Y-NE-E-MKS--L-VDN--Y-NK-K-MKS--L-VDE--K-NA-E-MES--F-VDE--F-IN-G-MEQ--I-IEA--F-DA-A-MEQ--I-FDE--A-AP-K-LQG--W-ADD--K-TA-E-LED--L-VDS--A-NA-A-MQA--L-VDH--W-NE-R-FKE--L-VDE--W-NK-L-FEK--L-VDK--F-NE-P-MEK--L-VDN--Y-NE-L-MIE--K-VDT--Y-NN-S-MLN--S-LDE--Q-SY-L-MES--L-LDR--D-SD-K-FAA--I-LDS--E-LN-I-FQG--L-LDS--C-KD-Q-FEL--L-LDR--F-ND-Q-LAS--I-LDS--L-NQ-A-MEG--V-LDA--Q-GE-H-LRA--L-LDG--C-NA-V-MQA--L-LDG--R-NA-E-MIA--A-LDA--A-DK-N-IAA--L-LNE--I-IK-A-LPN--D-LET--L-AK-R-LPP--D-LKQ--N-IL-A-LES--L-GIQ--C-IA-K-LEL--L-GLN--C-IN-K-LEL--L-GLK--C-IT-N-LEQ--L-GDL--C-IA-P-LED--M-MAV--M-RD-L-LLQ--N-LQD--F-KK-R-LLD--A-MKD--I-KD-L-IES--N-ISK--I-KN-I-LLE--N-LKF--L-VN-V-LEE--N-VEK--FNNE-R-LKE--A-IEQ--FDTI-R-LKE--E-VSP--KLID-A-LKS--V-IDK--QLIA-G-LRA--A--
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
18) 18) 22) 22) 22) 22) 25) 22) 22) 22) 22) 22) 23) 23) 23) 22) 22) 22) 22) 22) 24) 22) 22) 24) 24) 22) 23) 23) 22) 26) 25) 22) 22) 22) 22) 22) 22) 42) 22) 24) 22) 23) 23) 23) 20) 20) 20) 21) 25) 25) 25) 25) 28) 26) 26) 25) 25)
RD--VH--VHLYE RD--VH--VHVYE RT--HK--LHACA RT--HK--LHVCT RT--HK--IHVCT RT--HK--VHVCV RL--VQ--AHCYA RT--HH--VHIYQ RT--HQ--IHVFE RT--HQ--IHAFQ RT--HQ--IHIFQ RT--HH--VHAFQ IW--VN--MHAFQ RF--AH--VHIFG RL--AQ--VHIFG RT--HH--VHMYE RT--HH--VHIYE RT--HH--VHLYE RT--HH--VHIFE RT--HH--VHIFE RT--HH--VHVFA RS--HH--IHVFE RT--HH--IHLYA RL--VH--LHGYP RL--YH--IHVFP RT--HH--AHIFE RS--YH--LHVFE RS--YH--LHVFE RT--NH--VHTFQ HT--HH--IHIYQ HL--VH--LHFYE RT--HQ--VHAFL RT--HH--IHAFL RT--HQ--IHAFE RT--YQ--IHAFV HS--HH--LHAYL RS--HH--IHAFT RT--HH--LHAFQ RT--HH--IHAFV RL--VH--VHAYL RS--HH--VHAFQ RT--HY--IHVSF RT--HY--IHVSL RFDYIH--VHAYE RT--HQ--IHMYE RT--HQ--IHMYE RT--HQ--IHMYE RT--HQ--IHMYE RV--FH--LHLRY KV--FH--LHLRY RV--FH--LHIRY KV--FH--IHLRY KV--FH--LHVRY KT--HI--LHVVE KT--HI--LHVVE KT--HY--LHITQ KT--HF--LHLVE
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1)
DD-PAVGEYLL GD-PAVEAYLL GH-LAITQMLG DH-LTITQMLG GH-AQIDRMLR GH-EQVGRMLR GS-SEITRHLA DNTQEILRHLA SNSNDIERHLA DNVQNITRHLA EDWNNIGRHLA ENHYEINRHLA DNITDIERHLT SS-SDIERHLA SS-PDIKRHLA GN-PAIERHLL GH-PDVKRHLL GH-PDIRRHLA GD-SEIERHIN GD-SEIERYIN GS-TEIKRHLA GN-EEIIRHLA GN-PEIMRHVV GH-PEIGRHLL GS-KEIERHLV GN-SEIERHLK GN-PEIVRHLA GN-PEIIRHLA GD-REIARHLY GN-THITDELM ED-PKGLQELQ GS-HEAQRHIA GS-DHAIRHLA GD-ANIDRHIV DD-KNVHRHIA DD-PNVDRHLA GD-AQIIKHLA GS-QQLIAHRA GS-HHVTRHLA GS-EDFRRHLV SD-LNLHRHKV GT-SQLRDYVK ES-DRLERYLI GD-SKLRRHIQ GN-KYLIEQLQ GN-TYLIEQLQ GN-PYLIEQLK GN-KYLLQQLK GD----HDELY GD----NDELY GD----NDELF GN----NNELY GD----WDELY QG-DWWNEHIS EG-DWWKEHIT QG-QLWRDWIQ EG-VLWKNLLF
167 134 117 117 117 117 117 119 116 132 118 116 144 118 118 266 266 265 122 122 118 116 116 122 118 123 116 116 120 121 121 119 125 115 115 117 116 137 119 117 122 123 123 135 119 119 119 120 132 132 132 132 135 126 126 126 126
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
221] 186] 171] 181] 169] 167] 177] 173] 171] 193] 299] 170] 204] 176] 176] 343] 316] 323] 177] 197] 171] 176] 171] 185] 175] 319] 174] 174] 176] 183] 176] 173] 180] 170] 171] 178] 172] 192] 173] 178] 176] 170] 174] 183] 169] 169] 169] 169] 190] 201] 189] 188] 195] 174] 174] 173] 179]
31
gi|49482996 gi|27467439 gi|70727151 gi|73663286 gi|94493620 gi|145955446 gi|16517139 gi|39996671 gi|52840759 gi|148360238 gi|55380348 gi|16554477 gi|118478029 gi|30020784 gi|110601999 gi|118579907 gi|111223165 gi|148262043 gi|149375561 gi|118593506 gi|89209335 gi|20092342 gi|21226552 gi|89210794 gi|116872047 gi|47564636 gi|89206581 gi|149181932 gi|150385443 gi|152966309 gi|116253178 gi|86358593 gi|116249929 gi|86355813 gi|111056696 gi|15601119 gi|37680371 gi|84386840 gi|148978122 gi|90414152 gi|54302974 gi|87122945 gi|121586065 gi|39935907 gi|91975897 gi|90424234 gi|118593310 gi|83859664 gi|32474089 gi|87308925 gi|15608769 gi|118617253 gi|15827721 gi|41407424 gi|120404327 gi|145224143 gi|108799970
14 14 14 14 106 15 10 9 211 181 15 16 14 11 29 24 18 10 21 14 27 27 15 1 12 11 11 12 184 57 13 11 10 10 9 41 14 10 10 10 10 10 10 17 19 28 16 18 28 10 217 217 217 217 217 220 216
-TN-CFQSQYTEIESLLFNLL -VK-QFTYEYNKVKDVLFTIL -QS-EYDTLYQTTKTSLFNLL -DN-YFADQYQNLKKMLFNLL -NT-NWTKLAEAEIKAIKRIS -QM-IWDKSAKDVIILLKSIW -SE-EWPNIFRQVRQELLSVF -SS-AWERAFLAERKRLLGTA -NT-EWPLLAKAEMVKLRASF -DS-QWPKMAEVEIKKLRKIL -QE-EWKHHYENEVAHLKAVA -DS-AWHDAYEREAQRLHELV -EK-EWQEEYVKEKEKFISLF -QK-EWVEEYIKEKEKFCSLL -DP-RWRDLFEEERRHLLSCL -DA-CWPLLFEEEKAYLLAMF -DP-GWAGRFAEQRDRVTDIL -DP-AWPGMFETEARLLKRVL -DP-KWPSMFEVERSRLFSLF -DP-RWADLYESEVTLIAEAL -NP-DWQDKGRCEEQQLYELL -DP-EWKTEFLKIKAMIVDCT -DP-EWKTEFLKIKAMIVDCA -------MEFKKIKNKILKYI -DE-NWPKEFQRIEAAILNTI -NI-KWESEFNKLQTLINDVM -NT-QWESEFCKLQTLINNVM -NA-NWGVEFESIKGVIKNRL -NP-KWPERFNRLKELLAPVL -DP-HWREDFDMLRSCLHPLV -DP-QWPQAFQRIRERLLALL -DR-EWPEAFQRIRANLLALL -DP-SWPRLFAEISAEVSALL -DP-SWPLLFAEISAEVATSL -DP-AWADDLAQIKKELEAGL -QA-SCENLYRKYELEIAALL -QA-SCEQRYQKYKSEIEALL -QA-ACNDLFIRYERDIKKLI -QA-ACHEMFERYERDIKKRL -QA-SCEELFLRYEREILKLL -QA-SSEELFLRYEREIQKLL -QA-SCEALFLIYKDKVKALL -QG-KCQERFLRYQSEIQELL -RD-AAERLFAVVKQQLIAVL -RA-AAERLFAAVQQQLHALL -RR-AAERLFGEVTRELGSML -SL-RAERLFNEVRLLVAAAV -RR-RAEAAFETWRRQHGRQL -DA-RWKQEFEQTRSSLLQSC -DP-RWPQAFEQMKSSLRFAA -DP-SWPDQARRIVNRLKIAC -DP-SWPEQAQRIVARLKTTC -DP-IWLGQAKRIVARLKTTC -DP-EWPAQAQRIVNRLKTAS -DP-SWPAQAQRILARLRTAC -DP-QWAAQAQRILARLRTAC -NP-AWAADAQRIVNRIKAAA
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
4) 4) 4) 4) 5) 4) 4) 4) 5) 5) 4) 4) 4) 4) 5) 5) 5) 5) 4) 4) 5) 4) 4) 4) 4) 4) 4) 4) 4) 4) 3) 3) 4) 4) 5) 2) 2) 2) 2) 2) 2) 2) 2) 3) 3) 3) 2) 3) 4) 4) 4) 4) 4) 4) 4) 4) 4)
KYTKHIGGTNHF KYTQHIGGTCHF KFAQHIGGTSHF KTTLHIGGTSHF VSIKHLGSTAVP IDIQHIGSTSIP VSIEHIGSTSVP AHVEHIGSTAVT IDIQHVGSTAIP IDIQHVGSTAIP LDFEHIGSTAIE LDFQHVGSTAVD IAIEHIGSTSVQ IAVEHIGSTAVE KRIEHFGSTAVP RRVEHFGSTSVP GAVEHIGSTAVA VRMEHIGSTAVP AAIEHFGSTAVP FEIDHIGSTAVP SKVIHIGSTSIP TGVEHVGSTAVE IDVVHVGSTAIE LSVEHVGSTSVE LRVEHVGSTSVR LSIEHVGSTSIK FSIEHVGSTSVK IAIEHVGSTAVP CRLEHVGSTSVP LSIEHVGSTAVP LSIDHIGSTSIP LAIDHVGSTAIP LSIDHIGSTSVP LSVDHIGSTSVP VSIEHVGSTSIP ASIEHIGTSSIP AVVEHVGASSIP ARVEHVGASSIP ARVEHVGASSIP ARVEHVGASSIP ARVEHVGASSIP ARVEHIGASSIP ARIEHVGASSIP SEFLHIGATSVP AEVLHVGATAVP AEVLHVGATAVP ADIRHIGATAVP ARIDHVGSTAIE TQIDHVGSTSIS SAVEHVGGTAIP LRVDHIGSTAVS SRVDHIGSTAVA LRVDHIGSTAVP LRVDHVGSTALP SRIDHIGSTAVP VRVDHIGSTAVP LRVDHVGSTSVD
( 5) P-ILDILVGV ( ( 5) P-ILDILVGV ( ( 5) P-ILDVLIGV ( ( 5) P-ILDILVGV ( ( 5) P-IIDIFIAV ( ( 5) P-IIDIVVGV ( ( 5) S-VIDVLLGA ( ( 5) P-VIDIMIGL ( ( 5) P-IIDIQIAV ( ( 5) P-IIDIQITV ( ( 5) P-IIDILAVV ( ( 5) P-VIDVLAVV ( ( 5) P-LIDMMIGV ( ( 5) P-LIDMMIGV ( ( 5) P-IVDILVEV ( ( 5) P-VIDMLIEV ( ( 5) P-VVDLLAPV ( ( 5) P-VIDILAGV ( ( 5) P-IIDILAGV ( ( 5) P-ILDIAVRS ( ( 5) P-IIDLMAET ( ( 5) P-IIDIDVVI ( ( 5) P-IIDIDVVI ( ( 5) P-IIDIDVVI ( ( 5) P-IIDIDIVI ( ( 5) P-ILDIDIVI ( ( 5) P-ILDVDVVI ( ( 5) P-ILDIDIVI ( ( 5) P-VIDADLIL ( ( 5) P-IIDVDVVV ( ( 5) P-LIDIDIVL ( ( 5) P-LIDIDIVL ( ( 5) P-KIDLAAVM ( ( 5) P-KIDFDVVA ( ( 5) P-ILDIDVVA ( ( 5) G-DLDILVGV ( ( 5) G-DLDIYVGV ( ( 5) G-DLDIFVGV ( ( 5) G-DLDIFVGV ( ( 5) G-DLDIFVGV ( ( 5) G-DLDIFVGV ( ( 5) G-DLDIFIGV ( ( 5) G-DLDIFVGV ( ( 5) G-DLDIVVRV ( ( 5) G-DLDIVVRV ( ( 5) G-DLDIVVRV ( ( 5) G-DLDIVVRV ( ( 5) G-DLDIAVRV ( ( 5) P-IIDVVALV ( ( 5) DGVVDLLIGV ( ( 8) D-VIDIQVTV ( ( 5) D-VIDIQVTV ( ( 11) D-IIDIQITV ( ( 8) D-VIDIQITV ( ( 5) D-VIDVQVTV ( ( 5) D-VIDVQVTV ( ( 5) D-VVDIQVTV (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 2) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 5) 5) 5) 5) 5) 5) 5) 5) 5)
ITS--LDEK-R-LNY--V-ITS--LDEK-R-LNY--A-ITA--LDEK-R-LNY--L-ITA--LDEK-R-LNY--E-AKQ--W-IK-P-LET--F-AKL--Y-LE-R-LEQ--C-VES--K-IE-P-LGR--L-LDV--L-VG-P-VRS--L-MKI--IAVP-I-LQK--L-IKQ--TAID-V-LKA--Y-VRD--L-IP-A-FEE--S-ADD--L-AV-V-LAA--H-IEK--W-ID-D-LLK--I-AEK--W-IE-V-LSE--I-TQK--L-IAPI-LEA--Q-TKV--E-IAQV-LER--K-ARG--A-LP-A-LAV--D-ART--I-AS-PLLAP--N-ADS--L-IK-P-LCR--S-ETQ--I-AA-A-LTG--L-VLD--I-SA-K-LAI--Y-FPA--V-KD-R-LSE--I-FPA--V-KD-R-LAK--I-LPD--I-IK-G-LEK--A-FEA--V-KT-G-LLS--L-FPE--I-VK-K-LET--I-FPK--V-IN-R-LEQ--V-FPK--V-VF-A-LER--L-LGE--L-KA-R-LEP--L-SRA--A-VH-A-LAS--G-IED--A-TH-K-LLA--E-IES--A-TQ-V-LLS--Q-LPA--A-IE-I-VRA--A-LPP--A-IE-L-IRA--A--QP--V-LE-A-LQHPDL-LEN--A-VK-L-LST--L-LES--A-VK-I-LKG--L-FED--V-IE-R-LAT--L-LEE--A-VE-R-LTT--L-LEC--A-AQ-L-LTT--L-LEN--A-AQ-L-LMV--L-FEV--S-MQ-R-LKV--L-FGL--A-IQ-R-LMT--L-FQV--T-EA-A-LAA--R-FQA--V-EA-A-LAA--R-FLA--V-ES-R-FAK--F-FEQ--A-DQ-A-LSL--L-FGA--A-RD-Y-LDR--T-LDA--A-SL-H-IEG--L-LAE--V-AP-Y-IEG--L-ADE--L-AE-P-LLA--A-ADE--L-AE-G-LRS--A-ADE--L-AD-P-LLS--A-ADE--L-VE-P-LLA--A-ADG--L-AD-A-LLG--A-ADE--L-TD-A-LQA--A-ADE--I-AG-A-LAA--V--
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
25) 25) 25) 25) 26) 24) 25) 25) 26) 26) 23) 23) 22) 22) 29) 29) 24) 26) 25) 21) 25) 24) 24) 24) 26) 32) 32) 32) 23) 22) 22) 22) 21) 22) 22) 21) 21) 21) 21) 21) 21) 21) 21) 20) 20) 20) 20) 20) 24) 26) 44) 44) 44) 44) 42) 43) 31)
QQ--VR--LHIIQ QE--VR--LHIIQ QT--IR--LHIIQ QI--AR--LHIIQ RT--HH--IHIVQ RT--HH--IHVVE LR--IH--LHAVE VS--FH--LHAVV RT--HH--VHIVE RT--HH--VHIIE RT--HY--LSLAE RT--HY--LSLVT GT--HH--LHVYI GT--HH--LHIYI RT--HH--IHMIE RT--HH--LHMVE RT--HH--LHVLA RT--HH--LHMAL RT--HH--LHLVV RT--HN--LHLYP RV--AH--LHLMV MP--YH--LYVCP MP--YH--LYICP MK--HH--LYVCP ME--HH--IYVCP ME--HH--LYVCD ME--HH--LYVCD MQ--HH--LYVCD FP--HN--FYVCL PY--HH--LYVVI PA--ER--VYLCP PA--ER--VYLCP YG--FR--LYLCG YG--FR--LYLCG PV--RN--LYVCI -G--EDVAFQVVA -N--DDVAFQVVA -G--DAVALQVVA -S--DDVALQVVA -T--DDVAFKVVA -V--DDVALQIVA -T--EDVALQVVA -G--DDVAFQIVV CM--PHLGIQLTV RT--PHLGIQLTA RT--PHLGIQLTT KA--PHLGVQLVA YA--LEVGVQLVV TT--HQ--VLLMV PA--YR--LLLTP RP--TN--VHLRV RP--TN--VHIRV RP--TY--VHIRV RP--TN--VHIRV RP--TN--VHVRV RP--TN--VHVRV RP--AN--IHLRV
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1)
NT-QTFESYLA ET-SLFKQYQT DT-TLFKQYIE DS-NLYKAYIQ GN-STIEHSVL NS-VAWNNYIN GS-RFWQEHLA ET-AFWQDHLL SS-RHWKGKTF KS-KRGHDRII DS-DFYDEKIA DS-DCHREQIA NS-DEWKNNLL NS-DEWRNNIL DF-EHWD-RLL HF-EQWD-RLL AD-PHARALLA DG-A-MWDTLV GG-SQWQRRLA ND-TDCLDQIA GT-ERWEQQLL DG-KGYLEHIA DG-KGYLEHIA DG-KGYLEHIA NS-AELHRHLT ES-EELRRHIA NS-EELRRHVA QS-SELAKHIA GA-APLRNHLE GS-QPHLDHVL GN-GTHRNRLA GN-GTHGNRLA DN-RAHRGRIL DN-RAHQERLL GS-AALRNHLA GS--EFEFFVG GS--EFECFLA GS--EFECFLR GS--EFECFLA GS--EFECFLM GS--EFECFLV GS--EFECFLQ GS--EFECFLV GG--EFDVFHR GG--EWDVFHR GG--AFDHFHR DG--PHDFFHQ GD--EQDRLLD GN-PAHQRVIQ SG-DFWRNAVR GW-PNQQFALL GW-PNQQFALL GW-PNQQFGLL GW-PGQQFALL GR-PGQQFALL GW-PNQQFALL GW-PNQRFALL
124 124 124 124 217 123 119 118 323 293 122 123 120 117 143 138 127 121 130 116 137 135 123 104 122 127 127 128 291 163 118 116 115 116 117 143 116 112 112 112 112 112 112 120 122 131 118 121 136 121 348 345 351 348 343 347 331
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
172] 175] 171] 172] 268] 169] 174] 365] 373] 601] 173] 174] 173] 168] 207] 197] 187] 175] 184] 192] 183] 186] 174] 158] 174] 175] 178] 181] 363] 220] 171] 167] 167] 168] 386] 188] 160] 157] 157] 157] 157] 157] 157] 175] 173] 187] 167] 165] 187] 175] 407] 407] 410] 407] 402] 410] 385]
32
gi|111017992 gi|119881263 gi|54023867 gi|134101928 gi|119961325 gi|116670616 gi|84496578 gi|62426309 gi|152967776 gi|119714258 gi|50545679 gi|83859581
216 252 217 214 219 219 219 219 11 23 9 12
-NS-QWPAQADRLIARVKLVC -DP-TWPEQYERLAARIYRAV -DA-ERQAQAQRLVARLAVAG -DP-EWPRTARRLLARVERAA -KG-EWAAQAERLAERILAVA -DP-TWGTQAARLAGRIAAAV -RA-EWSAEAARLIARLQRAL -GH-DWSLQAQRIIAKLHHGL -DP-AWPGRAAELLAAVRRAL -DP-AWAPRAAGYLARARAAL -SP-SWPAEYETLRAKIASVA -DP-AWAEEGLAWADAICRAL
( ( ( ( ( ( ( ( ( ( ( (
4) 4) 4) 4) 4) 4) 4) 3) 6) 8) 3) 4)
VRVDHIGSTAVE LRIDHIGATAVP ARIEHVGPTAVP VRADHVGSTSVP LAVDHIGSSSVP LAVDHIGSTAVP VSVDHIGSTAVP FRIDHIGSTAVP ESHDHIGSTSVP ALYDHIGSTSVP AQIDHIGSTSVS LRVDHVGSTAVP
( ( ( ( ( ( ( ( ( ( ( (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5)
D-VIDVQVTV D-VIDIQLTV D-LLELQIVV D-VVDLQLTV D-VIDLQVAV D-IIDLQLTV D-VIDLQIGV D-IIDLQLLV A-YVDLQVRV P-YVDLQIRI N-IIDIQMSV D-VIDIQALV
( ( ( ( ( ( ( ( ( ( ( (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5)
ADD--L-AE-S-LAD--A-ADGP-L-AE-L-LAE--A-ATG--L-RD-A-LGA--A-ADG--L-AE-P-LAR--A-ADR--I-SP-L-LAA--S-ADR--I-AP-L-LAA--A-ADAPEF-VE-A-LQR--S-ARD--L-SD-R-LAE--L-P-EV-L-DR-A-LAP--A-HDE--L-GH-R-LTP--L-VD-----IE-A-LRT--I-PG---L-IS-A-MKA--A--
( ( ( ( ( ( ( ( ( ( ( (
38) 31) 25) 34) 34) 34) 30) 28) 43) 40) 36) 37)
RP--VN--IHLRV RP--AR--LHVRA RP--AV--VAVRV RY--AN--LHVRV RP--VN--LHVRV RP--VN--LHIRP RV--AH--IHVRE QA--VN--LHVRT GA--SI--LHVRL ES--AI--LHIRR RP--SH--LHVRE RR--LH--IHVRR
( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
RVPSYL-IELALGD RVPSYL-IELALEG VIPSYK-VEVALDG VVSSVT-VSLCMEP PVEPYK-VEAAL-PVPPGL-VEEALEA --PDML-KIELIP--PNIL-GLDLIS--PNIV-WLDLIE--PNQI-VLDTLNTVPSYI-IEYTLES IVPSYL-VELALER PVSPHV-VDAFMDA CVAPSY-VTGLLEE
TVTFFLTNP SVTFPLLEP TVSFPLVRM TVTFPLISP VVSFPLGKL VVSLPLARL VISFPLNKL VISFPLSKL VISFPLGKL VISFPLGRL NISFPLAKL VVSFPLAKL EVTFPITPP KVTLPVTDL
( ( ( ( ( ( ( ( ( ( ( (
1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1)
GW-PGQQFALV GS-PGWRYALL GT-PEQRFASA GG-PGWRYALL GS-PEWRYALL GS-PGWRFALS DG-PGYRWALD DS-VGAAFAWK AS-PWGRYTVW DS-PWGRYTVW GR-FNQKYPLL GA-AGARNTLL
338 368 326 332 337 337 335 330 140 151 126 132
[ [ [ [ [ [ [ [ [ [ [ [
403] 426] 383] 394] 412] 430] 388] 386] 197] 209] 188] 202]
Group XII COG2413 gi|14520225 gi|57640880 gi|11499561 gi|147920747 gi|156937217 gi|118430937 gi|15922402 gi|70605919 gi|146304906 gi|15897051 gi|126465785 gi|124027766 gi|20094418 gi|119719561
25 24 43 23 10 12 12 12 12 1 24 24 15 21
ILWEKR--EKALKIMELLKDFYLWEKR--ENALKIMERLKDFILAEKR--ERAKEVMESLLSFG LLEEKR--ARAIEVLGLLEACQ LLRRKR--EKAIRVMERLA--LLREKR--RKALEIVRALSRTV ILKNKR--ERAIEILRQIKSLG VLIEKR--KIAMEILNYLTKLG ILKTLR--GWALDMLNLLEQGS ---------------------ILRELR--KEAINILETLAKRG LLGQLR--EEAARIMKPLREAG LLRTLR--DRARKVVETLSQFG VFRELR--EKARVALRCLDVLR
( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 1) 2) 0) 0) 0) 0) 0) 0) 0) 0)
-DPHVYGSVARG -DPVLYGSVARG IESVVYGSVARG LDGFVFGSVARG FSPVVHGSVARG GLAVVHGSVARG MEGYIYGSVARG MEGFVYGSVARG MRGFVYGSVARG MEGYVYGSVARG IQGLVHGSIARG LAPIIHGSIARG LEGWVHGSVARG IPVYVVGSVARG
( ( ( ( ( ( ( ( ( ( ( ( ( (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5)
SDIDIVIPY SDIDIFIPI SDVDIFIPE SDIDIFIPA SDVDVAILF SDVDVAIVE SDIDIIIFN SDIDIVIFN SDVDVIVFA SDVDIIVFN SDIDVFIPY SDIDVLIPS SDVDVFIPT SDVDLFLER
( ( ( ( ( ( ( ( ( ( ( ( ( (
26) 26) 27) 27) 30) 31) 27) 27) 27) 27) 31) 31) 29) 29)
( ( ( ( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2)
RELEFYKWGGM RELEFYRWGGA REMDFYRFGGC VELEFYRFGGS KEREFYAFGGE REREFYMWGGE KEIEFYYFGGI KETEFYYFGGL VEEDFYRFGGL NEIEFYSFGGL REYEFYKFGGI TEYEFYYFGGA RELEFFDFSGK AEKEFPRFAGS
129 128 150 130 116 125 116 116 116 85 135 135 124 130
[ [ [ [ [ [ [ [ [ [ [ [ [ [
233] 234] 252] 234] 220] 230] 218] 221] 223] 187] 239] 243] 229] 235]
190] 183] 189] 185] 186] 211] 189] 187] 197] 193] 191] 191] 194]
Group XIII COG4913 gi|124028295 gi|126465593 gi|156937630 gi|118431075 gi|70606561 gi|15920383 gi|15897121 gi|146302987 gi|18312848 gi|126459248 gi|119873199 gi|145590981 gi|159041075
3 3 3 2 2 27 5 2 11 7 5 5 5
YTLNDLSWLFRKLQEHG--YTLEDLAVVLGKLSSYG--FTLDDLIYVMKKLQEAG--VSRRGVASSLRVLLDRG--IHFSKVGEILSEIRELT--IPFSKIGDILAEIKGLT--IPFSKVGEVLHELQDLT--IAFEKIGEILSQIKEKM--KYKTALRKVAKSLNEKG--KYKTALKLVSTELSKRD--KYRYALKYVAEAFNKRG--KYKTALKKVAESLEKRG--VYSLALLRVGGVLGGRG---
( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
IKGVVIGSTVIE VKFVVIGDTVVQ LDAVIIGGTSVE FRFTIIGGTVVE -DFVIIGDTVVD -DFVIIGDTIVD -KFIIIGDTIVD -DFVIIGDTVVD VEFVLIGSAVLP IEHVLIGSAVLP IEYILVGSAILP VEFVLVGSAVLP VGFVVVGSLILP
( ( ( ( ( ( ( ( ( ( ( ( (
9) 9) 9) 9) 9) 9) 9) 9) 9) 9) 9) 9) 9)
DDVDVFALE GDVDLFVYE GDLDLFPTN DDVDLFGEE SDVDLFVLS SDIDLFILG SDVDLFALD SDVDLFPTN RDVDLFIIN RDVDLFILN KDVDLFILN GDVDLFILN HDVDLFLTN
( ( ( ( ( ( ( ( ( ( ( ( (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5)
EEDFYR-SVAERED EEDFYR-EIVEKEN EEDFFR-TIADEEG EEEYYG--VAHELG DEDKIR-DFAFERG DEDKIR-EFAFERG DDDKIR-NFAYERG EYDVLR-DLADERG DNELFE-EIAEEND DNEMFE-EIAREND DNELFE-EIAREYD DNELFE-DVAQEND DAEFFE-GIAREFD
( ( ( ( ( ( ( ( ( ( ( ( (
22) 21) 21) 21) 21) 21) 21) 21) 21) 21) 21) 21) 21)
VIVEFYENI IIVELYENF VEIEFYDNI VPLEFFDNV LQVDMYENI LQIDMYENI LQIDLYENI LQVDFYENI VRVDLLENI VRVDLLENI VRVDLLENI VKVDLLENI VRVDLMENL
( ( ( ( ( ( ( ( ( ( ( ( (
29) 29) 29) 29) 29) 29) 29) 29) 29) 29) 29) 29) 29)
DYIVLKAKAAR HYFVLKARQGV DYVVLKSRSED DHIVLKAHAGR DYLLLKANAFR DYILLKTNAFR DYILLKANAFR DYILLKANAFR QLLALKAKIAT ELLVLKAKIAT ELLVLKAKIAT QLLVLKAKIAT ALIVLKAREAT
138 137 137 135 135 160 138 135 145 141 139 139 139
[ [ [ [ [ [ [ [ [ [ [ [ [
0) IRKALHLFGKS 0) LRKALKLLSKS 0) IKKALQLFAKS
88 88 105
[ 255] [ 419] [ 277]
Group XIV COG3541 gi|124004969 gi|161326955 gi|75759596
1 1 14
----MHTTIQNELAKLEESQ ( ----METKILEKLNEIERDK ( SCYEMREKIELELERIEKEN (
4) LYACESGSRAWG ( 4) LFAVESGSRAWG ( 4) LFAVESGSRAWG (
5) SDYDVRFFY ( 5) SDYDIRFVY ( 5) SDYDVRFIY (
0) -RHSRD-KYLSIHP ( 10) KDLDFNGWE ( 0) -KHKTD-WYLNLWE ( 10) DELDGSGWD ( 0) -IHPVE-WYLSIHD ( 10) DDLDISGWD (
33
gi|45658062 gi|118744845 gi|127511533 gi|118074192 gi|159879000 gi|15894776 gi|149197362 gi|150018378 gi|149179917 gi|37528139 gi|34496493 gi|154684812 gi|118051354 gi|83643081 gi|29135173 gi|149278331 gi|119878942 gi|13473604 gi|86139414 gi|114568723 gi|114326961 gi|89099830 gi|15668174 gi|15894775 gi|125972552 gi|75907739 gi|124002747 gi|149278330 gi|149197361 gi|159038439 gi|21220256 gi|149173584 gi|161166010 gi|149923839 gi|32475975
2 8 2 1 1 6 1 1 1 7 10 1 82 1 7 1 43 1 5 10 10 1 1 2 5 1 1 1 1 17 4 74 4 3 1
ILPEIKKKIQDRLVEIESEF VNEVMKISISEQLSNIETAH ITENIKKEILRRIRNAEKEH ----MRTEILENLKTLEIKE -----MDRILQKLEKIEKEN FEGTIEELVNMKLDEIEKKE ----MKEHIQKTLSDLAENE ----MDKVIQAQLRDIEEKN --MLMDDLVLSELNRMEREY ISDIMKQHICQKLAQLETEL LDDGVRRRVMEELAALERRH ----MKQRIIEELKRIEAQH ISGAMRSAVLAQLKALEREH ----MEKRIQETLERIEWEH VNDEMRVTINVELDRIEKKY ----MKTIILDRLRSIEQEH IAPDKRAALLSTLAGIEQRH ----------------MRGQ VSAGMRDLIQAKLKEIEAEE ISPAMHDTILNQLRQIETRE IPSSIRSVIQERLDRIVAEN --------MKEWLQDLEKKH ---------------MEIFM ISKDILKNKEYDFIKTNKHL DIKDRLNTDKYEFLRTDPRL --------------MKRIEV ----------MLMDTAYLKE ------------MTIQEMKD --------MFKNLDDLLKSK GDQALLGDAPVASLGMTELV ----------EAPGSHELLV AQYKEGEIGNSEINANRSDL QVLEAKQQEVAARVIAEEEA DALTPDQRAATDQLFAEELP -MIDARIIDYDKMLPHIRAA
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 3) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4)
LLAIESGSRAWG LYAVESGSRAWG LYAVESGSRAWG LYACESGSRAWG LYACESGSRAWG LHAVESGSRAWG LYACESGSRAWG LYAIESGSRGWG LYAVQAGSRAWG LYACESGSRGWG LYACESGSRAWG LYAVESGSRAWG LFACESGSRGWG LYACESGSRAWG IYACESGSRGWG HFACESGSRGWQ LLACESGSRGWG LFAIESGSRAWG LFAAESGSRAWG LFAIESGSRAWG LLAVESGSRAWG YYACEAGSRAWG IFVVISGSDLYG ILLGLSGSYSYG LILTTAGSIAYG ILVGLAGSHGYG LLDSISGSKAYG LFECVSGSRAYG LYEAIAGSHAYG VLAVVVGSRAYG VYACVMGSRAFG IFKCIIGSRAFS LVIALSGAHAYG LLVSLAGAHAYG LFATISGAHLYG
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 4) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5)
SDYDIRFIY SDYDVRFIY SDYDVRFIY SDYDVRFVY SDYDIRFIY SDYDVRFIY SDWDVRFIY SDFDVRFIY SDWDIRFIY SDYDVRFIY SDYDVRFLY SDYDVRFLY SDYDVRFVY SDYDVRFIY SDYDVRFIY SDYDVRFVY SDYDARFIY SDYDCRFVY SDYDVRFVY SDYDARFVY SDYDVRFVY SDYDVRFIY SDVDIRGAH SDIDVRGVA SDIDIRGVT SDLDFRGVF SDTDIRGVF SDTDIKGVY SDKDIKGIF SDYDRRGVF SDTDRRGVF SDTDYRGIY SDLDLKAVH SDVDLKGIW SDFDLRGVH
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
-KHKTE-WYLSVLP -IHIPD-WYLSIRE -AHPKD-WYVSVDL -IRNKD-WYLRVDY -VNRLG-YYLSILP -VRPRE-YYLRLED -VNQLN-WYLNLEK -IHPVE-WYLSVFE -KHSLD-WYLSLEK -VHAPS-WYLRVDP -VHRPQ-WYLQVEP -VPKKE-WYFSIEP -VPRLP-WYLRTRA -QRPLS-HYLSLGK -VNHPS-HYVRVDN -TRPMD-NYLSVMP -VHRQP-WYLSVNE -IRPVS-DHLVLQQ -SRPVS-WHLRLDG -ARPRD-WYLSLEP -VRNRD-EYLRLHA -KYNEVRSYLALKP -ILDRE-LFIKNCL -LNRKS-DLIGMTS -IETKQ-DLLGLSS -IAPKR-YYLGFDH -ILPKK-TLYGMEY -YLSRD-QFFGLTD -IFPAT-YYLLEVD -AVPTR-AFWHLDK -LAPTA-LFWRFDK -LPPAD-LQWSLYG -VEPTE-KLLGLHR -LAPTR-RLLGLGS -LLPLQ-TVVGLED
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
FRTDIEKALLRCFPSESHLF PTTLSGDLFRRFFPTAQATA TEKDILKRERLFKKTFHIRN PDDVQRQWVNKLTNASCDEP HDEAQKSYLNDLENKLDHFI PDETQQEYLNDLEDKWGSPF PNETQQDYLSEVEAKSGAPL PDEKQQQYLDSIALRWGQFD PDEKQQHYLDSVELRWGELS PDETQRHYLNELELYRGMSP PDETQRAWLDDIGRQAGAAV ANDAQQAFIEDLCQNANCLN PDEQQQDYLTSLNVALDDDK VSAFQQQFIDDCGLGAQSLI VTDVQQQFVDDCAHSAELSV ANKKQREWMAQLTLRYPLST LNELQQQFLDDTQLTIGQAL
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
13) 13) 14) 11) 10) 10) 10) 10) 10) 10) 10) 9) 7) 12) 12) 6) 8)
DALVLMGSVGTI LSLKTIGSIGTI EGIYTIGSVGTI TGIYSMGSTSSI TGIYSMGSTSSI TGVYSMGSTSSI TGVYSMGSTSSI TGIYSMGSTSSI TGVYSMGSTSSI TGVYSMGSTSSV TGVYSMGSTSSI QGLYSMGSTSSI YSLYAMGSTGSI TGLYAMGSTSSI TGLYAMGSTSSM LGLYSMGSTSSI LGLYTMGSTSSI
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5)
SDFDFWVCI SDCDYWVSV SDCDLWICI SDIDIWVCH SDLDIWVCH SDLDIWVCH SDLDIWVCH SDLDIWVCH SDLDIWVCH SDLDIWVCH SDLDIWVCH SDLDIWVCH SDLDIWVCH SDLDIWVCI SDLDIWVCI SDLDIWVCI SDLDVWVCI
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
2) 2) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1)
KQF-TS-TALSLLQQ QQL-DG-QGLELLQQ KTLYDE-EAFDLLTL SWL-DQ-DERARLQR SWL-DN-DERLLLQR SWL-DN-EERNRLQQ AWL-DT-DERHRLQQ SWL-DS-EERQLLQK SWL-DN-EERQ-LQQ SWL-DS-EERQLLQR SWL-DN-EERAQLQR AGL-SQ-ERLELLDL PAM-SD-EDAQLLEQ TTL-SV-SCREKLDA TSL-TP-EERSALDS ENL-PA-EKKALLQQ PEM-DT-SQRELLTN
10) 10) 12) 12) 9) 10) 10) 10) 10) 10) 10) 10) 10) 10) 11) 10) 17) 10) 10) 10) 10) 8) 13) 6) 6) 18) 6) 6) 7) 7) 7) 7) 13) 23) 13)
DLMDCSGWD NDLDISGWD DEIDINGWD DLLDISGWD ETFDFAGWD ETLDINGWD LELDLGGWD EVLDVNGWD EDLEISGWD NELDICGWE DELDISGWE DMLDISGWE AELDVSGWE DELDVNGWD KELDLAGWS AELDVYGWD EELDVSGWD GEIDAGGWD DELDLSGWE GDLDINGWD DEIDLNGWD SPADSAGWD GKCDFVSFE DNTDTCIYA RITDTVIYG GNKDTVIYE ESNDITYYE ERNDKVYYE DKNDIVYYS PAAEQFSWE PGEEQFSWE HETQETYWE VEIDYTSNE VEVDYTVNE LEIDLVTHD
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
LRKSFFLMNKS LRKALGLFSKS IRKALQLFSKS LKKALKLLRKS LKKALSLLRKS LNKALRLFHKS LQKALRLFASC LRKALKLMYRS LRKALNLFKKS LRKTLGLLKRA LRKALQLLNRS LRKALRLFKKS LRKALQLMQAS ISKALTLLRSG ITKTLQLAYKS LRKVLRLMMKS LRKALRLVSKS LRKALLLALSG LGKALKLACNS IKKALNLLLKP IRKALGLLLKS LFKAFSLLEKS LGKFLRELLKP FNKIINLLLNC LKKFISLCLDS LRKILQLLSGA LKRFIELLAKN LGRFVELLLRN LKRFIELASKA FERFCLLALQA LERFCELALRA LQKFLVLALKA LHPVLIGVLQG LGQAVHGVLKG VGKFFGLMLRR
93 99 95 90 86 97 88 88 90 98 101 88 173 88 99 88 141 76 96 101 101 83 78 89 92 86 78 76 81 105 82 162 98 107 94
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
258] 269] 258] 259] 209] 259] 253] 259] 256] 276] 275] 261] 354] 251] 277] 251] 309] 249] 266] 270] 269] 249] 243] 339] 336] 364] 421] 353] 362] 254] 253] 316] 259] 277] 269]
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
661] 656] 653] 865] 856] 850] 866] 852] 851] 871] 850] 844] 834] 857] 851] 815] 886]
Group XV PF01295 gi|88793996 gi|94268641 gi|85859161 gi|2492890 gi|37528460 gi|157368428 gi|77957818 gi|50123107 gi|729245 gi|157144434 gi|85060325 gi|117618443 gi|149908503 gi|90414242 gi|89074446 gi|59710674 gi|159172578
79 65 90 66 66 66 83 66 66 89 66 66 66 66 66 39 110
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
12) 11) 12) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11)
FGIEVHFFL RGVEVHFFL LKMPVCFFL LGIDVTFFL LGIEVNFFL MGVEVSFFL MGVEVSFFL QGAEVSFFL QGVDVSFFL LGVEVSFFL RGVDVNFFL RGVDLNFFL FNVEVNFFL QGVEANFFL QGVEANFFL FYIEANFFL QGVEANFFL
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1)
EIDKVKQNDFG DITQTRENSFD DVEDIRACRFG DENRFRHHASG DENRFRHNASG DENRFRHNESG DENRFRHNASG DESRFRHNESG DENRFRHNESG DENRFRHNESG DENRFRHNESG PEDKFRQRNDA AEDKFRKVNNA DENRFRDNFSE DENRFRHNFSE EQNRFRDAYSE DEERFRSNYSE
185 170 197 168 167 167 184 167 166 190 167 166 164 169 169 136 209
34
gi|27364544 gi|84393563 gi|149191477 gi|153803407 gi|119943854 gi|90408989 gi|16272546 gi|15603676 gi|46156421 gi|152978045 gi|52424332 gi|126208526 gi|33151801 gi|153091210 gi|157377264 gi|118071323 gi|161331810 gi|127511253 gi|157960203 gi|160877197 gi|120600511 gi|114049124 gi|119776385 gi|114561628 gi|120553429 gi|149376985 gi|83643198 gi|94499874 gi|119856247 gi|104784261 gi|26991898 gi|77461714 gi|70733292 gi|66043447 gi|146305314 gi|15600465 gi|152989680 gi|146280911 gi|119475313 gi|90023240 gi|88800954
66 66 66 66 74 66 65 65 65 65 65 90 57 90 66 66 66 65 66 65 65 65 69 67 82 82 81 76 119 81 81 81 81 81 81 81 81 81 81 82 118
LNDIQQQFVDDTELTLGQSL PNEFQKQFVSDTELTLGGKL ANDVQRQFLDDIALALGQPL ANPIQQQFIDDAQLTLGEPL VNKKQKKQLAELFPYQEEPQ ITAKQTQLLSQFLNVDKLYK ASDYQKKWLTNEYGIHYADH ISPYQKQYLLTTVPSLEANQ LSKYQQEFLTTHLPQGDFEQ LTDYQRHYLHEILAQSSDIQ LSDYQKNFLAQQFPTGFDFV LSEYQANYLSELSLSHLAIR LSAYQQAYLANLMVSDDVVE LSDYQKQFLVSYRFDEAISA PSESELAACDTLDLVKPTQH ASESEYSACSTLGLVPPNLH PSEQELLACDTLGLVRPMVH PSDDALEACRTLGLSEPQVI TDSETLDACETLEIDLSTEV LSEIESQACDVFSLPFINQE EGKNASLACDVFKLPFIPQE PDSLEAQACDVFKLPFIANE PQEPHLEAARVLSLGLPSLR LSDAIVSACDALSLSIPEMI PGQEVLDAARRYARSFHYRD ASESTLNAARRLARTFSLKD PDKNVVAQAQRLSRSFNYRE PSKDVLRSASTLSMGFQYQQ PDANLVAEGQRLARSFTYKT PDTELLAEAQRLTRSFTYKA PSAELVADAQRLARSFTYKA PDADVLAEAQRLTRSFSYKP PDPQALAEAQRLTRSFSYKP PDTLALVEAQRLTRSFSYKA PDDEVLAEVQRLTRSFVYKP ADDEVLAEAQRLARSFVYKP PDPQALAEAQRLARSFCYKP PDAEQLAEAQRLSRSFAYKP PTKKDVEKTQRLSRSFTYRR PDKDILQLGRRVARSFTLGY PSDTVIRLAQSITRGQVGKR
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
8) 8) 8) 8) 38) 7) 18) 14) 16) 12) 16) 8) 8) 9) 7) 7) 7) 7) 7) 6) 6) 6) 7) 7) 8) 8) 8) 8) 9) 9) 9) 10) 9) 9) 9) 10) 10) 9) 8) 8) 8)
LGLYTMGSTSSI LGLYTMGSTSSI LGLYTMGSTSSI LGLYTMGSTSSI IGLYAMGSTASI LGLYAMGSTSSI LGVYVMGSFGSI LGVYVMGSIASI LGVYVMGSIGSI DGVYVMGSIASI YGVYVMGSIASI DGLYSMGSTGTI DALYSMGSTGSI DALYSMGSTGTI EGVYSMGSMASF EGIYSMGSMASF EGVYSMGSMSSF YGVYTMGSTASF EGIYSMGSTSSF DGVYAMGSTGSF EGVYAMGSTASF EGVYAMGSTASF EGIYAMGSTGSF EGIYAMGSTASF DSLFLMGSPGTL EALFLMGSPGTL ASLYLMGSTGSI RSLFIMGSCGSL HGLFLMGSLGTL HGLFLMGSLGTL HGLFLMGSLGSL HGLFLMGSLGTL HGLFLMGSLGTL HGLFLMGSLGTL HGLFLMGSLGTV HGLFLMGSLGSL HGLFLMGSLGSL LGLFLMGSLGTA NGIFLMGSSGSV YGIYVMGSVGTI SALYLMGSSGSI
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5)
SDLDIWVCI SDLDIWVCV SDLDIWVCI SDLDIWVCV SDLDIWICY SDFDIWICY SDLDTWICV SDLDTWVCH SDLDIWVCH SDLDIWVCL SDLDTWVCH SDIDLWLCY SDLDLWLCY SDLDLWLCY SDIDVWLVH SDVDLWLVH SDIDVWLVH SDIDVWLVY SDIDVWVVH SDVDVWLVH SDVDVWLVH SDVDVWLVH SDVDVWVIH SDVDIWLVY SDLDVWLCH SDLDIWLCH SDFDVWLCH SDFDIWLCH SDIDLWVCH SDMDLWVCH SDMDLWVCH SDMDVWVCH SDMDVWVCH SDMDVWVCH SDMDLWLCH SDLDLWVCH SDLDLWVCH SDLDVWVCH SDLDIWLCH SDLDIWLCH SDLDIWVCY
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1)
TQM-GH-DERESLAN PDM-DG-ASRDSLTN PEM-PT-EAREKLSN AHM-SC-EARDNLSS HQM-DK-QRVQLLKN HQI-DS-RRIQLLNE DGL-SL-DEYTLLTQ DDL-ST-KEKEALQR EDL-RL-EDKEQLSK EGL-KA-REREKLQQ PDL-TP-YALNKLQQ DRL-NA-LEYQLVEQ DRL-TA-VEYQLIEQ NHF-TI-EQYKLMEL RLL-TK-EECRLLED RLL-SP-NECRLLEE RLL-SK-EKCRLLEE PTL-AD-DEIASLQD EQL-SK-DQCELLAN ADL-SL-EELALIRI PQL-CD-EDLALIRT AQL-CD-EDLALIKL PNL-SA-DDIQALAD SKL-TD-DQLKLIEY QDL-PA-SGIRCLER PDL-SE-RGVRCLER PDL-DA-MALMELQR SNL-DE-ESMALLQN PGL-GD-SQLDELRR PGL-DE-QALAELRR PGL-PE-QLLGELRR PDL-SD-SDLAELRK PDL-GE-NELAELQK SEL-EP-EAIAELRK PTL-TA-QQLQELRK PEL-EP-GARQELRR ADL-DA-AARRELRR PEL-DE-RQLEELRR PI--SP-DETLELKQ PGL-SK-QAVDELEK SEL-ND-RDIDRLQT
( ( ( ( ( ( ( ( ( ( (
3) 3) 3) 3) 3) 9) 3) 3) 4) 4) 4)
SRFGVTGSLLPE KEFGVTGSLLLG KNMGVSGSTVLK KKMGVSGSTVLK ENMGVSGSTVLK ENMGVSGSTIPK GCMGVSGSTVIK KSMGVSGSLLLK ENLGISGSILPG DNMGITGSTLAR RSMGVSGSILPG
( ( ( ( ( ( ( ( ( ( (
6) 5) 5) 5) 5) 5) 5) 5) 6) 6) 6)
SDLDLVVYG SDIDLVVYA SDIDFVIYG SDIDFVIYG SDIDFVVYG SDIDFVIYG SDIDFVIYG SDIDFVIYG SDIDFVIFG SDIDFIVFG SDIDFVVYG
( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2)
EFLAVRDAILRIQE TFDAARTVVAEATA NHKNARDILSETFK NHKDARDILSETFK NHKFAREILNMAFQ NHKIARKILKECFE NHKKARGILKSIFD MHKKAREALKQAFE NHRKAMKAFRENKG NHKKARQLYGRLKD NHRRAMEAFGELKD
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11) 11)
HGVEANFFL LGVEANFFL LGVEANFFL QGVEANFFI YGVELNFFL FNVELNFFL FNVEINFYL FNIEINFYL LGVEINLYL YEVDVNLFL FNTDITLFL LGVDVSFYL FGVEINFYL FQIDINIYL YGLEVNIYL YGLEVNIYL YGLEVNFYL HDLEVNIYL FDFEVNFYL YQFEVNFYL YHFEVNFYL YQFEVNFYL FEFEVNFYL FEFEVNFYL LGVELHVFV LGVELHVFV MGSEMHIFL LGIELHFFL LGAEAHCFL QGAEAHLFL LGAEAHFFL QGAEAHFFL QGAEAHFFL MGAEAHFFL QGAEVHVFL QGSEVHCFL QGCEAHCFL QGSEAHFFL VGLEVHFFL LRLEMHFFT FNLEVHFFL
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1)
DEDRFRSNRSE DEQRFRSNHSE DEERFRSNISE NEERFRHNHSE PDNKFRIDNKS PDNKFRVKNNA DQQRFRNEHYA DQKRFRCFRYA DQQRFRSFRYA DQNRFRNFQSS DEFYFNHYRYS NPAHFKAHLYH NPSHFKAHLDH NPEQFKSQTYN HPEQFSKASIS HPEQFIKSSDS HPEQFSGGSAD HPKQFISSSRD HPEQFIRDQGE HPQQFCGDQTL HPQQFSGGQKG HPLQFSGDKSQ HPFQFRADKSY HPMQFRECSAF SAADWRAGRQR SAADWRAGRQR DAEAFRSGRHD DAKRFVRGERN DPQSFAQGQRD EPQGFVQGERD DTQGFAQGQRD DPVRFVKGERD DPNRFVRGERD DPQRFKSGERD DPQRFTQGARE DVARFGQDERE DVERFGRGEDD DPERFTQGQRD EDEKFRLGERE DYEEFKRGTLS DAHRFSGGEQQ
165 165 165 165 203 164 174 170 172 168 172 189 156 190 164 164 164 163 164 162 162 162 167 165 181 181 180 175 219 181 181 182 181 181 181 182 182 181 179 181 217
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
844] 842] 735] 845] 869] 830] 843] 838] 837] 846] 834] 842] 804] 845] 807] 807] 807] 806] 807] 803] 804] 804] 809] 819] 949] 959] 971] 952] 992] 954] 951] 947] 948] 948] 952] 950] 944] 938] 966] 954] 985]
[ [ [ [ [ [ [ [ [ [ [
338] 317] 352] 351] 354] 387] 382] 340] 357] 344] 340]
Group XVI COG1665 gi|20094049 gi|53803142 gi|134045406 gi|159905826 gi|150399295 gi|158972258 gi|150401305 gi|15669274 gi|148643118 gi|84490216 gi|15679242
106 102 119 119 119 134 118 116 113 106 105
RDDLEEMAARAVLELAEDSG SRDAIEAKARRLLGVFAGEG QNTEAEIKCAKLADTLHDYG QNTEAEIKCAKLADTLHEYG QNTNAEKKCAKLAEILNNYG PQTLLEEKCKKLAEKLKECG PTNEFEIKCKKIAEILNKNG NLNELEEKCRKLALILEDYG KNPEIMAKLMDVSDFFHYVA SPNPFYEKVRLLANIFHNEA PSDPLLKKAVKIADTLHDAA
( ( ( ( ( ( ( ( ( ( (
47) 43) 45) 45) 45) 48) 72) 45) 55) 47) 47)
RVCDVLLVR TKFDLTLVL VMFDLLATR VMFDLLATR VMFDLLATR TMFDLLATR TMFDLLATR TMFDLLFTR TLFDILCTK TLFDILSTM TLFDILAAR
( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
-DESELSDVRW -PGSWDSFQAA -GWDEINEKYG -AWDEINEKYG -DWNEINESYG -EWDEINDNYG -EWNEININYG -EWDEITEKYG -NYDEISGEWG -NSEDLSIEEN -DWSEIDGHWG
237 228 247 247 247 271 273 244 253 238 237
35
gi|20090059 gi|91772132 gi|116753774 gi|147919403 gi|126178387 gi|124485442 gi|154151515 gi|88601864 gi|11499549 gi|87309566 gi|15920449 gi|70606295 gi|146302910 gi|15897161 gi|126465365
101 98 97 98 99 89 102 99 99 88 103 103 103 103 96
-----DSRVRAIVKVLDLAG -----DERVKAVYEVLHAAG -----DERVGRIAEILEEHG -----DSRVKKMVEALSA------NRRVARLLAHLD-------NPRVARLFSHFD-------HDRVRKLLHLFD-------HPKVAQLIKALH--------AEVRKIVEFFSE-KPVGLLQKSLDFANLLASTL ---SENKLESLALEIYDY---PSQSKQELTLLEIIQY----ERKVDLAPLFSLLDK---SVSDSIIYALISFIEEYPRDKLEELTANLSSILYLS-
( ( ( ( ( ( ( ( ( ( ( ( ( ( (
3) 3) 3) 3) 3) 3) 3) 3) 3) 4) 3) 3) 3) 3) 3)
TSMGVTGSMLAG EKMGVTGSFLPG RCIGITGSMLLG CDMGITGSKLVG GSFGCTGSLLCG MTFGCTGSLLAG GTIGCTGSLLCG CTFGVTGSLLCG EKMGVTGSRLIG EACGLTDSLLWG KNIGITGSLLAG SRLGVSGSLLTN IRLGITGSYLVG NDIGVTGSILLG NALGVTGSLLIR
( ( ( ( ( ( ( ( ( ( ( ( ( ( (
5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 5) 6)
SDIDFVVYG SDVDFVVYG SDIDLVVYG SDVDFIVYG SDIDLVVYG SDIDMVVYG SDIDMVVYG SDIDGVVYG SDVDFIVYG SDIDLVIYG SDIDFVIYG SDLDIVIYG SDVDMVIYG SDIDIIIYG SDIDMIVYG
( ( ( ( ( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2)
WFRA-RDAITIAKQ WFNA-RDVIAVAKA WWKA-RDVIADAKR WYKA-RDELQKAIA WFVA-QRRLRDLVE WFLA-REQLMAAVK WFDA-QAQVKRGID FRHA-QKQLVRAIQ WFEA-RERLKGGIE RCRTWLDQAERVFA DTIDFLESF----ESLNVIESF----DAYDFLSTF----SALDFIESF----ESLNIIEFINENKD
( ( ( ( ( ( ( ( ( ( ( ( ( ( (
44) 44) 43) 43) 43) 43) 43) 43) 43) 38) 38) 39) 38) 36) 42)
TYFDLLFVR TYFDLLFVR TYFDLLFTR ALTDLLYVR TYFDLLYTR TYFDLLYSR TYFDILYTR TYFDLLYTR TYFDLLYVR TRFSVRGVR IKYSFLFVN VKYSILFVD MKYSFLFVD IKISVLFAD RDYSIIYNN
( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
-EWDQIKEPLL -DWDQISLPIS -DWDEI-EPVP -DYDQIGPALP -DYGNLDAIPA -GYDNIRSVPP -TYNDCKGVPA -GYQDLPGFTM -DYDELSRNVP -AWNEFPQPLP -DKVEKYCNI-DKPQRYCED-DKPHPYCRD-DKPWRYCND-GV-YKHLDTC
222 219 216 216 216 206 219 216 216 210 213 215 213 213 220
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [
316] 309] 311] 308] 308] 298] 310] 309] 334] 291] 305] 304] 296] 299] 317]
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
EQLMLVGAQCRD GAILLIGAEARD ENIMLVGARCRD ---MLVGARCRD IPYMIVGAMARD IHFFVVGAFARD IAIVITGAFARD AAFVVAGATARD AAFVVAGATARD AAFVVAGATARD AKFVLAGATARD LRYLVVGATARD LDVYVVGALARD DEVVYVGGAMVS DEVVFVGGCAAG DRLVLVGGWAFR
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 15) 14) 14) 13)
NDTDIAGTL TDVDIGIAL KDVDFALAL KDIDFALAL RDVDFGICV EDIDIGVEV EDIDFGLAV RDVDVAVCA RDVDVAVCA RDVDVAVCA RDVDVAVCA PDIDFAIAV SDLDVAIAL KDIDLTFEL ADVDVIVEL KDTDFAVEL
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4)
HFEAIR-ATFRALG AYEGVR-QAFVPVG DFNLLK-QQFSPTG LFRALK-QRFPSTT QFHNLA-TNLIAIG TFRYLT-DSLIDRD AFVALK-ERLTASG FHERLV-DALVATG FHAGLV-DALVATG FHDRLV-DTLVATG SHDALI-ELLVQTQ AFDGLR-AALLQEP QFELLS-ENLLKNN KLEELR-EELYNRG DYHCLS-EKLRNRG AFQEIS-NHLLANG
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
10) 10) 10) 10) 20) 17) 17) 22) 22) 22) 22) 21) 20) 17) 18) 24)
RAVDALPFG MAVDVVPFG IPVDLVPFG ITLDIIPFG WELDIVPFG IVVDIIPYG LPVDLVPFG TELDLVPFG TELDLVPFG TELDLVPFG GELDIVPFG FPVDLVPFG YEIDIVPFG LKVDVMATE VKVDLMPTD KWIEILPVG
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
20) 21) 19) 19) 20) 21) 21) 21) 21) 21) 21) 21) 20) 8) 8) 11)
HGCTDAYLRAD FGFVEVMRRAW AGFKEVFEQAE RGMQEVFEHAQ LGFQEALDSSW LGFEEAFQSAM FGFREALATAH LGFQEAVDTAL LGFQEAVDTAL LGFQEAVDHAL LGFQEAVDTAQ AGYAEALDSAV KCFRDVMNIAD RWFKLGFDKAN RWYADAILSAR KEYRLAFDDNE
131 110 124 99 144 142 124 204 153 153 153 144 143 119 120 143
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
248] 254] 269] 142] 281] 290] 262] 352] 302] 310] 313] 287] 291] 233] 233] 279]
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
LRGILIGTVAFQ LRGILIGTIAFH LRAVLIGSVAFS LRALLVGSVAFS LRGVLVGTVAFQ LRGVLIGTVAYQ LRGVLVGTIAFQ LRAAVVGTVAFQ LGGTIVGTNAFR LGGTLIGTNAFR LGGTLVGTAAYA LGGTVVGTQAFR LGGTLVGTQAFR LGGVLVGTHAFR AGGILGGTQAFR AGSILIGTHAFL
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
17) 17) 17) 17) 17) 17) 17) 17) 18) 18) 18) 18) 18) 18) 18) 16)
GDADIAQDY GDADVAQDF GDADFAQDF GDADFAQDF GDADFAQHY GDADFAQFH GDLDLAQDY GDLDIAQDY GDIDIAQFE GDVDIAQFE GDIDFASFE NDVDIASFE DDIDIASFE SDLDIASFQ GDVDLIAAN MDVDFAHAG
( 9) ( 9) ( 9) ( 9) ( 9) ( 9) ( 9) ( 9) ( 9) ( 9) ( 9) ( 9) ( 9) ( 9) ( 9) ( 10)
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
364] 362] 358] 357] 282] 380] 348] 337] 349] 271] 339] 410] 338] 344] 346] 349]
Group XVII COG4849 gi|15840249 gi|148271572 gi|19553981 gi|38234745 gi|153824564 gi|119356126 gi|121605631 gi|118710309 gi|84361789 gi|118718571 gi|126454170 gi|17548488 gi|156110107 gi|88805938 gi|114776770 gi|67917971
9 1 3 1 12 12 1 69 18 18 18 12 11 3 3 18
PVLLAWVTPIVTALADVVPA --------------MTGADP NSPNADIILVVNKLSKFIDI -------------------FPKGLTELYADVNSEAQKLA IDPERVTLLRDIKEVADGLA -------MVADVEAVAAPMG LEPAAVALLQAVGAACARLD LEPAALALLQAVGTACARVD IEPATVALLRAVRDACADVD VEALTIALLRDVKHACAQLG EEK--QAVLIIMDRVARQAN QNNLLYDTLEALSKVMNDLQ HSFINREATKKVAKALGRLN LDDPNIAMLQLAADGLGPLL AIDFENDELYAACKAVEPFL
Group XVIII COG5397 gi|21492951 gi|16262563 gi|116255410 gi|15890131 gi|36959046 gi|86748711 gi|114798432 gi|16125195 gi|149204565 gi|85705423 gi|77404639 gi|13488492 gi|69934822 gi|114797750 gi|146280108 gi|145588476
101 102 101 101 21 105 101 100 110 29 101 170 100 106 98 106
APDAMSGDIVEALADGGLFR APDRMSGDVIEALASGGLFR GPDPFAGDVTKALADAGLFR APDRFAGEVTKALADAGLFR GPERFTGDVVAAMGAAGIFR RPLMTSGQVVEALAKAGFFR EPDPLTGAVIEAFAEAGVFR APDNRTGRVLQVLAQAGAFR PTDRATGSILSAMAAAGTFR PVDRGTGGLLLAMAKVGTFR GTDRETGSLLLAFARAGVFR GVDAATGSILHAFASAGVFR MTDRGSGQVISAMARAGVFR ALDMQNGKTLRALAQAGTFR APDMTTGKLLAAIAKTGFFT STLTKHFRVVNRLDQYGFFR
L-PS-IL-ELLQGVD M-PP-IL-QLLQSVD L-PP-VL-EVLRSID L-PP-IL-DLLRSID L-PP-VI-DILKSVD L-PP-IL-DVLRSVD LDRS-LI-DILRSVD LDQT-FL-DILRGAD VDPG-LA-ETFSALK VEPS-LA-QTFSALK VEEE-PG-DILQALK VSAP-LE-SVLKDFS VTEP-LA-EVFRDLR TEPE-LP-EVMHALG GDPKGLA-LRLQKLG VKIS-VH-DALNSLE
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
23) 23) 23) 23) 23) 23) 23) 22) 21) 22) 21) 21) 21) 21) 22) 24)
YRVEFLTSN YKVEFLT-N YRVEFLTGN YRVEFLTTN YRVEFLTPN FKVEFLTPN YRVDVLTTN YAVDVLTTS QLVEFLTPA TMIEFLTPS AMVEFLTPA TLVEFLTPS TLVEFLTPC FVVDFLSPS IELEFLSSM LRLDFVTPQ
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
53) 53) 53) 53) 52) 53) 50) 50) 48) 49) 48) 49) 49) 49) 49) 49)
RYAVHKLIVAS RYAVHNLIVAS RYAVHKLIVSS RYAIHKLIVAS RYAVHKLIVAF RYAVHKLIVGS RYAAHKLIVSS RYAVHKLIISR RYAIHKLIVAD KFAIHKLIVAD RFAIHKLIVAD RFAIHKLIVAD RYAIHKLIVAD RFAIHKLIVST RYALHKLIVAQ RYAVHKLIIYG
275 275 275 275 194 279 273 271 279 200 270 340 270 276 270 278
36
gi|91789417 gi|118743492
104 105
PALPKHVRAVRRLSDYGFFR ( TASHPVARVIRELADSAVFR (
0) AGGVLVGTHAFI ( 16) TDVDFAHAG ( 10) VQID-VH-AALTSFE ( 24) FQIDFLTSA ( 51) RYAVHKLLIVG 0) VGGVLIGTHAFG ( 17) LDIDVAAER ( 9) -RAD-IP-KALESLS ( 24) LRVDLLTPK ( 49) VFGLHKLIVSQ
278 276
[ 353] [ 342]
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 2) 1) 1) 0) 1) 1) 1) 1) 0) 6) 0) 1) 0) 0) 0) 0) 1) 0) 0) 0) 0) 1) 1) 1) 0) 0) 0) 0) 0) 0) 0) 0) 0) 1) 1)
157 156 158 160 159 156 161 159 156 155 190 217 219 219 235 218 218 212 227 224 230 234 232 209 201 193 179 175 176 185 191 191 182 192 179 172 172 145 188 188 187 188 183 178 181 186 191 184 202
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
Group XIX COG2253 gi|84687346 gi|116254914 gi|114045858 gi|149377222 gi|37523907 gi|153821459 gi|56130647 gi|134288436 gi|100123181 gi|94310504 gi|109648174 gi|154246975 gi|28558846 gi|33595089 gi|152973184 gi|154483055 gi|124485712 gi|150383990 gi|113935977 gi|46446123 gi|154252297 gi|153890141 gi|53714634 gi|24375280 gi|145226088 gi|89075003 gi|126007628 gi|110668071 gi|10803613 gi|30249776 gi|150025585 gi|121609792 gi|94263491 gi|23466016 gi|118194765 gi|153891417 gi|85714532 gi|78186398 gi|68549151 gi|67919277 gi|154493753 gi|156110978 gi|154500813 gi|77406078 gi|153811079 gi|111025002 gi|153003937 gi|77163543 gi|82659492
6 6 7 9 7 7 5 5 5 5 17 32 34 32 38 31 31 32 33 31 36 47 31 17 31 24 25 22 22 28 22 27 26 33 21 23 23 1 32 32 29 30 27 25 27 34 33 19 34
-YEAQVALLVRVLPHVAGE-YISQLELLVQALPMIADE-YYRQVQLLLRIIPFVAQH-YQNQVTLLLQLVPFVAKH-FSRQVRLLVSLLPQVAKQ-YYKQVSLLIRMLPVVATE-YLDTVRLLIDIAPPVFDT-YVDTVRLMLGIAPIIFDT-FADTVRLLLRIAPDVFTN-YLDTARLLTQVAPLVFVDEKDYYLTLLLWELSQKQGAEKDVWVVWALATLYAAPLGEKDIWVVWVLDALFGSDLGEKDIWVVWTLRALFASPLAEKDLWVTEILRLLFDEGFLEKDFWVCFTLDYLFHRSPWEKDFWVCWMLDYLFHDCPWEKDFWVCWMLKQLFDSSLREKDYWVVWLLGLLFDPARSEKDYWVVWVLERLFSLQKLEKDFWVCWTLDALFNGLADEKDFWVCWTLRDLFNLPGWEKDWWVTTVLYALFHTSVSEKDFYVTCILYILYVDIAPK EKDYWVCEVLRTIAASHR-EKDYWVVALLAQLEKLTIDEKDYLQEIVLNSLFYISPVEKNYVNSWLLWGIFTSDCGEKNYVNSWILWGIFTSDFGERDYVLAWFLTGLAGHPLRDKDWMLGHFIAAIFNEPELEKELLHHEIFQALDDAGLLEKELLHHDILRLLNDAGLLEKELLHYRILDAMMREGFFERDYVLTNLLSVMVDFPKIEQDLIICRALVALFSDTFLEQDLLIARALVNLFEDAFL---MIICRSLVELYSHPVALQYYAMERFLYRLSISSFALQYYAMERFLYRLSVSSFALARYFNERLLYRVSVSQYKLTRYFQERLLYRISQTHYRLQLFCQEEFLRRLEKSQYAMTYYFLEVILKKLSQSSYSMRNYMMERFLERISLSEYKRRRFVIARFLTRVFDADP-RQLLVFDRYLARLVRVLG-FRQ-AVHTILTAIAGTPSLMRQ-AVHIILKSISIEESL-
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
GVFALKGGTAIN PVFALKGGTAIN DCFALKGGTAIN QCFALKGGTAIN ECFALKGGTALN RVFALKGGTAIN DKFAMKGGTAIN PLFAMKGGTALN DHFVMKGGTAIN DTFALKGGTAIN LPAYFKGGTALY EHLVFKGGTSLS EHLVFKGGTSLS ADLTFKGGTSLS YTVAFKGGTSLS ESITFKGGTSLS NTLTFKGGTSLS DTVIFKGGTSLS GSMIFKGGTSLS PHLTFKGGTSLS PRLLFKGGTSLS GHLTFKGGTSLS EYLLFKGGTSLS IPFVFKGGTSLS DEIVFKGGTSLQ HQLVFSGGTALA NDIIFIGGTAIS DNLLFKGGTALS ENLMFKGGTALS DVLAFKGGTALR ETLIFKGGTCLK KHLVFQGGTSLR AGLTFIGGTCLR SSLVFQGGTSLR DKMVFKGGTSLK KELRFRGGTALN DELRFRGGTALN ANLVFRGGTALF DRFILKGALLLR DRFILKGALLLR DKFLLKGGSLLY ENFYLKGGALMY ENLVLKGGLFIY NHYIFKGGFLLS NQFILKGGMLVA DGWILKGGIGMM DAAILKGGLVLE TAMIMKGGVLLA EMMVMKGGTLLG
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 9) 10) 9) 11) 11) 10) 10) 9) 9) 9) 10) 11) 11) 10) 10) 10) 10) 10) 10) 10) 10) 9) 9) 9) 9)
VDIDLTYLP VDIDLTYLP VDIDLVFLP VDIDLVFLP VDIDLAYLP VDIDLAYLP VDIDVVFTE VDIDVVVVD VDIDVVYVP VDLDLVFPD EDIDLTVEI EDVDLTYDI EDVDLTYDI EDIDLTYDI EDIDLSIHW EDIDLILDW EDIDLILDW EDIDLILNW EDIDLAFDK EDIDLSIER EDIDVTVFR EDIDVVIDR EDIDLVLSR EDIDLSFSM EDLDLLVLG EDVDIKLIP EDLDFIFDK EDLDFGVEG EDLDFGVEG EDLDFTLIR EDLDFTSKS EDLDFAGGR EDLDFSGGH EDLDFAGGT EDLDFGCLE EDIDLVRTT EDIDLARTK EDIDLVQKE MDIDMLGRT MDIDMLGRT VDVDFMADR LDIDFLGTH VDVDFLLRK VDIDFLFHQ MDLDATIKG KDIDLLAAT RDIDLRLVG SDIDFSTAT KDIDFSTEK
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
6) 6) 6) 6) 6) 6) 7) 7) 7) 7) 5) 17) 17) 16) 15) 14) 14) 12) 14) 15) 15) 10) 15) 14) 5) 7) 6) 4) 4) 2) 5) 3) 3) 3) 3) 2) 2) 2) 2) 2) 3) 3) 3) 3) 3) 1) 1) 5) 5)
SLDEIN-AAMDRIA SLAEID-AALDRIA ALQTIK-SNLDTLA ALDTIR-KNLSELA SLAAID-QALEKIR ALINVR-AALQRIT ALQSIG-EELARAK AIAAIS-NELDRAR ALAAIQ-QELAAIE ALARIN-EAVRQAA --SQGK-KRLETAT TRSEEK-RWSSEVR SRSQAQ-KWTEAVR SRSQAS-KWTQAVR KASTRS-RSQNQRF SNTKQD-AFNKEAN TKNQQD-HFNKEAN SNAQQA-KFNEELD SKKKAA-ELIDDLA SKKKQN-AIIDNLS SGKKRG-KALDAIK SNKQIE-KLRAECS SNTQIH-NLREKGQ SRKIMR-AEATAID -RTAKR-AMLTMLE SRSKKK-TVRKSLL VENRVK-KAIDEIN SKQELR-TVLDTIA SEDDLR-DVLDTVT ---TLD-EILAGLN TSGHLK-FITKHIE -ADKMQ-GIEECIE -RESMA-GLKAALE -MDTLK-GLGSCIS -EEFKD-YLRHSIR ---PI-KPVIDRIR ---AS-KPIWDRIH ---PI-GPVMDAIK APESIT-TIIRQVL APEKIT-TIIHQVL DRDFLA-RVFQEIL DGERIA-EAFREIC TPEKLK-KVLEEII SEETVK-QQLKEIL SVEDVE-MIISKII -LSDS-V-DELRRA -PDGL-LARLQRAG NLDDF-RGKLEASL DLGDF-FNRFNDSM
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
33) 32) 33) 33) 34) 31) 36) 34) 31) 30) 45) 52) 52) 55) 62) 55) 55) 51) 62) 60) 62) 59) 67) 55) 46) 46) 38) 40) 40) 43) 50) 51) 43) 45) 44) 35) 35) 34) 44) 44) 44) 44) 42) 40) 41) 40) 47) 57) 57)
IKIET-SPVT IKIET-SPVA IKIEL-SPVL IKIEL-SPVL VKIEV-TPVL IKIEV-SPVA VKVEV-NYVF VKVEV-NYVF VKIEV-NTVF VKVEV-NFVM VKIEA-TSFT VMLEF-GARS VLLEF-GARS VTLEF-GGRA ILLEF-GGRN IRLEI-GALA IRLEI-GPLA ILLEI-GAKA VRLEI-GARS VKIEI-GARS VKIES-GAKS VKIEM-GARS VKIEI-SILS VLLET-GSLS VLIEL-GQAG IKLEL-METI IKLDI-NLYV TSINV-MVDE TSLDV-MVDE VKVDI-TIDE IKIEI-ILYE IKLEI-ANIP INIDI-CAVP IKLEV-ASIP VDIDM-NLSG LKIEI-NTWE LKVEV-NEVE LKVEI-NSRE IQLDI-GFGD IQLDI-GFGD MSVDI-GFGD MTMDI-GFGD FSIDF-GVGD IHLDI-ATGD LKIDI-STGD FSLDVVGSRR FGVDV-AFGD ISLDY-SLNE INIDF-SFNE
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
31) 31) 31) 31) 31) 31) 32) 32) 32) 32) 43) 34) 34) 34) 36) 35) 35) 35) 35) 35) 34) 35) 37) 34) 40) 33) 29) 26) 27) 33) 31) 30) 30) 31) 30) 31) 31) 31) 28) 28) 29) 29) 29) 28) 28) 32) 31) 22) 25)
DLFAGKLHAAL DLFGGKMHAAV DLYAGKICAAL DLYAGKICAAL DLYAGKLCAAL DLYGGKLCAAM ELYGSKLVAAM ELYGSKLVAAL ELYGGKLVAAL DVYGGKLVAAL RIFADKILAAE RTFWEKATAIH RTFWEKATAIH RTFWEKATAAH YILWEKITALH RTFWEKATILH RTFWEKILILH RSFWEKITILH RTFWEKATILH RTFWEKATILH RTFWDKVVILH RTFWEKFMAVH RTFLEKAFLLN RTLVEKMFGVH RTLIEKLLRVN ATLAEKIISML EFAAEKIKAIM EIFAEKLRAIY EIFAEKLRAIF EIVIEKLLALS EVLSEKIRALI EILADKIVAFP EILADKLLALA EILADKLLSYA ETMAEKVSALT ELMATKLRALL EVLSTKIRALL ELLGTKLRALY SMIAEKFEVMA SMIAEKFEVMA TVIAEKFHTMI TVIAEKMHAII TTIAEKIDAIL TILAEKLQTIY TVLAEKLETVI DQIADKICALY THVAEKLHAYT EMVAEKFRALL DLMAEKYRSVL
305] 305] 304] 306] 305] 312] 316] 318] 306] 306] 323] 338] 341] 338] 362] 332] 337] 326] 348] 340] 361] 354] 351] 335] 326] 313] 264] 266] 267] 280] 285] 337] 296] 317] 280] 243] 282] 256] 312] 306] 291] 289] 301] 289] 265] 304] 299] 285] 303]
37
Group XX COG3575 gi|23129930 gi|119510602 gi|17227968 gi|146280612 gi|77459764 gi|148547966 gi|84323565 gi|152988832 gi|86145703 gi|148975778 gi|151939271 gi|89072259 gi|37680370 gi|153802910 gi|149911730 gi|156977705 gi|149189975 gi|113947505 gi|120599509 gi|117921217 gi|24373240 gi|157374607 gi|124002348 gi|114562511 gi|91793941 gi|119774507 gi|77956835 gi|123441553 gi|77974564 gi|22125393 gi|157371674 gi|88797212 gi|152996757 gi|90412612 gi|83649206 gi|117619227 gi|145299784 gi|88794368 gi|107028317 gi|34499669 gi|50086607 gi|149011130 gi|125716945 gi|157151528 gi|24378804 gi|146317729 gi|55820637 gi|77406324 gi|28378161 gi|116332743 gi|149180411 gi|56965761 gi|75762489
8 1 9 9 22 1 9 9 11 11 36 6 10 32 6 8 27 28 32 12 12 10 22 19 16 21 9 9 9 11 9 22 9 24 7 31 22 1 9 13 43 12 9 10 8 30 6 6 9 9 9 10 15
QMILADTPIDTVLPAIAQLN --------MGIVLPAIAQLN SQVLS-AMTSSRLETISQVN DIIAADPLRMRCLAHVRALA ALMASDRARMQILEIVRGLQ ---------------MRSLG ALIAADSSRLHLLRVLREVG ALVAADAWRMGLLRALREVG TLIKQDPLRMQVLDCVDQLD ALISQDPLRMKVLECVSQLE ELVKQDPIRVEALNCVSELG ELIKQDSVRTEALYYVSLLG ALIKEDRMRTEALGHVAELS DLIRKDPIRLEALECVYQLE TLIEKDELRVKALDCVQQLD QLLAEDPRRIQALECVRSLA KLLVQDENRIQLLQTVQALS EWIVQDCERVRALELALQCA EWITQDYERVRALELAFLCA TWLSQERGRMRALELVQQCA TWLSQDNERMRALELVLQCA QWINESPRHIRALLAA---QLMNNDVLRKDILFAVREVQ LWLQQDETRLAALMVCQKVM HWLQHDEERMYALHAALNIA SLVREDAVAMTCLKAAREVM QWLYADSYRMQALSIARELG EWLRADPYRMQALSIARELQ QWLQADPYRMRALSIARELG QWLRTDPYRMQALYTARELG QWLQQDNERMTILRTARRLG HWLTESSLHWQALIEARKLN EWIESDRDRMQALELASTLG QLLLSDTLRMECLRAVKSLN VLLKKDLLRMRCLAAARSLG ALLRADEQRMDCLRAARELA RLLRADRQRMACLQAAAELA ---------MDCLRALSALN EIARQSSWCMSALSAARAMG GLVLASPWLMRALRAARALG YMVFSNQALYQRLKALYSIH NAFRENSDMMTILTIIRDLG DQLGQDPDIRAILEIIRSLE DLFYADQHLMTIVKIIRELN TLLLANEPISRILTIIRDLK RCLLADKNILVILDIMDRLN KLIKQNSELMALLKIIHSFQ HMILCNSDIMKILAIIKSLP HIIKTTPALMTILQLIQDCH AIIEETPALMEILRLIQACH MVIKEDKWMMDILKAAQTLD KLIQSDKKMMQIIKTTSSLD RLIENDEWMMNVLQMAKSLE
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 4) 6) 6) 6) 6) 6) 6) 6) 6) 6) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 1) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2)
NWWLAGGAVRNT NWWLAGGAVRNT NWWLAGGAVRNT DCWVAAGFVRSA DCWVAAGFVRSL DCWIGAGFVRNA GAWIGAGFVRNA DAWIGAGFVRNA QCYVAAGFVRNL QCYVAAGFVRNL QCYIAAGFVRNL QCYIAAGFVRNL QCYIAAGFVRNL QCYIGAGFVRNL HCYLAAGFVRNL DCYIAAGFVRNL RLYVAAGFVRNL QWCLAAGFVRNL QWCLAAGFVRNL QWCLAAGFVRNL QWCIAAGFVRNL QWMLAAGFVRNL LLYVSAGFVRNL DWLISAGFIRNL HWLIAAGFVRNL EYLLAAGFVRQR QWCLAAGFVRNL QWCLAAGFVRNL QWCLAAGFVRNL QWCLAAGFVRNL DWCLGAGFVRNL DWCLAAGFVRNL DWCLAAGFVRNL DCFLGAGFLRNA DWYLAAGFLRNA DWALGAGFIRNL DWALGAGFIRNL QGYLGAGFVRNA SWCIGAGAIRNL DWCIGAGAVRGL EAYLAAGIIRQL DSWLAAGSIRNF DSWLAAGCVRNF DSWLAAGSVRNF DAWLCAGTLRNF DCWLCAGTIRNF DCWLCAGTLRNY DCWLCAGTLRNF QGALAAGSIRNT QGALAAGSIRNT DWWICAGFVRSK DWWICAGFVRSK DWWVCAGFVRSK
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 12) 14) 14) 14) 14) 13) 13) 13) 13) 13) 14) 13) 19) 13) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 15) 14) 14) 14) 14) 14) 10) 16) 13) 13) 13) 14) 14) 14)
KDFDIAFFD KDFDIAFFD KDFDIAFFD ADIDVIWFD SDVDVIWYD TDVDVIWFD ADIDVLHFA ADIDVLYFA NDIDVIYFD NDIDVIYFE NDVDVIYFD NDIDVIYFD NDVDVIYFD NDIDVIFFD NDVDVIYFD NDLDVVYFD NDIDVIYFD NDIDLIYFC NDIDLIYFC NDIDLIYYC NDIDLIYYC TDIDVIYFD NDVDVVYFD NDIDVIYWR VDVDFIYFC ADVDLIYWG NDIDLVYFD NDIDLVYFD NDIDLVYFD NDIDLVYFD NDIDVIHFD NDIDLIYFN NDIDLIYFD NDIDVVYFD NDVDLVYFD NDIDLIYLD NDIDLIYLA NDIDVVYFS ADVDLAYFD SDLDLVYFD MEIDVVFYD TDVDVIFFD TDVDVIFFD TDIDVIFFD TDIDVVFFD TDVDLVFFD SDIDVIFFD SDIDVVFFD SDIDVVFFD SDVDVVFFD PDVDVVYFD PDVDVIYFD PDVDVIYYD
( 3) NRSQEL-AAKATLT ( 5) DEFDVKNQA ( 13) STEDAITEWLH ( 3) NRSQEL-AAKAQLT ( 5) YQFDVKNQA ( 13) STENGIQDWLH ( 3) NREQEL-AAENYLT ( 5) YLFDVKNQA ( 13) STEEGIENWLH ( 4) DAAVDA-EIEARLR ( 5) LEWSVKNQA ( 12) TAADAMCHWPE ( 4) SPERDA-QLESLLM ( 5) ISWSVKNQA ( 12) CATEAMMFWPE ( 4) TPEQDE-ALEAVLR ( 5) VLWSVKNQA ( 12) SATDAMRYWPE ( 4) DPQADA-DFEAALR ( 4) VPWSVKNQA ( 12) DCADALCHWAE ( 4) DPRADV-AFEAALR ( 4) APWSVKNQA ( 12) DCADALCHWPE ( 4) FYESGL-RYEALLQ ( 5) LNWQVRNQA ( 12) SSLDAMRYWPE ( 4) DYELNL-QYEAQLL ( 5) LNWQVRNQA ( 12) STLNAMGYWPE ( 4) NTNTYL-EYEAQLK ( 5) LNWQVRNQA ( 12) SSVDAMSYWPE ( 4) NPNAYL-DYEAELN ( 5) LNWQVRNQA ( 12) STMNAMSYWPE ( 4) NPDANL-QYEAHLK ( 5) FNWQVRNQA ( 12) SAVDAMRYWPE ( 4) DSDYEK-SLELKLS ( 5) LNWQVKNQA ( 12) STLDAMSYWPE ( 4) NSDNYR-LIESELS ( 5) LNWEVRNQA ( 12) NIIHAMSYWPE ( 4) DEQAYL-KYEAQLN ( 5) LNWQVRNQA ( 12) STLDAMSYWPE ( 4) GLEIQR-TIEKELS ( 5) YHWQVKNQA ( 12) SLQDAMSYWPE ( 4) SPERDL-AIETYLN ( 5) LPWSVKNQA ( 12) SCVDAMAYWPE ( 4) RPERDR-AIEAYLY ( 5) LPWSVKNQA ( 12) SCIDAMTYWPE ( 4) RPERDR-AIEAYLH ( 5) LPWSVKNQA ( 12) STQDAMCYWPE ( 4) RPERDW-AIEACLR ( 5) LPWSVKNQA ( 12) SSSDAMAYWPE ( 4) SREAER-RYEAKLR ( 5) YPWSVKNQA ( 12) STLDAMGYWPE ( 4) SKKQDQ-QLESILY ( 5) VNWSVKNQA ( 12) SLHHALSNWVE ( 4) SSAEDQ-AIEALLV ( 4) LPWSVKNQA ( 12) SCADAMSFWPE ( 4) AKARDV-DLEQQFK ( 5) LAWSVKNQS ( 12) DLKDAMGHWPE ( 4) --ALEA-QLTEALC ( 5) TPWEVKDQR ( 12) NLVDAMAAWPE ( 4) SERHDL-QLEAQLR ( 9) FPWSVKNQA ( 12) STEDAISYWVE ( 4) SEQHDL-QLEEQLN ( 9) FPWSVKNQA ( 12) STQDAISYWVE ( 4) SEQRDL-QLEKQLL ( 9) FPWSVKNQA ( 12) NMEDAISYWVE ( 4) SEQHDL-QLEALLL ( 24) FPWSVKNQA ( 12) STEDAISYWVE ( 4) EAERDQ-MLEARLQ ( 4) QPWSVKNQA ( 12) NSEDAISYWTE ( 4) TEATDR-ALEAELR ( 4) FPWSVKNQA ( 12) STEDAMSYWVE ( 4) DENADL-DYEARLK ( 4) FPWSVKNQA ( 12) STADSMSYWVE ( 4) SCKTEQ-ELTAKLK ( 5) MNWEVRNQA ( 12) NTTDAISRWVE ( 4) SEERDK-EIEAKLR ( 5) VKWEVKNQA ( 12) DTADAISHWVE ( 4) TGLQEA-EHEAWLA ( 5) QQWEVRNQA ( 12) SSLSALSHWVE ( 4) QGLAES-EHEQWLT ( 5) QRWEVRNQA ( 12) SSLEALSHWVE ( 31) AKAQEK-AFEHELA ( 5) ANWQVKNQA ( 12) SCSEAISYWIE ( 4) SSIRDA-EIQHSLT ( 5) IPWEVTNQA ( 18) SLTQAVASWPE ( 4) SPDREA-ALQRRLA ( 5) LPWEVTNQA ( 18) SLEAALATWPE ( 3) KHISEK-DIQAALH ( 5) NVWDVVNQA ( 18) SLIDALATWPE ( 3) SYEETL-SLEKKLR ( 5) YQWELKNQV ( 13) SSRDAMSKYPE ( 3) AEAETI-ELEAKLK ( 5) YRWELKNQA ( 13) NACEAMSKYPE ( 3) SYEETL-QIEKRLR ( 5) YRWEVKNQV ( 13) SSQDAMSRYPE ( 3) SYEKTL-RIEKELQ ( 5) YQWELKNQI ( 13) NSCDAIAKYPE ( 3) SYEETM-EIESNLY ( 5) YQWELKNQV ( 13) SSRDAIEKFPE ( 3) SYEQTV-EIENQIK ( 5) YNWEIKNQY ( 13) SSTDAVSKFPE ( 3) SYEETV-ALEQQLK ( 5) YDWELKNEF ( 13) SSKDAISKFPE ( 3) PASDDL-KIYQQLT ( 5) YQWQIKNEV ( 14) SVTDAIGHFVE ( 3) PVEADL-QIYTALR ( 5) YQWQVKNEV ( 14) SVSDAIGHFVE ( 4) DEAHEK-KLENCLK ( 5) VPWSVKNEA ( 12) STVDAISKFPE ( 4) DENVEK-GLESNLK ( 5) IPWSVKNQA ( 12) SSEDAISKFPE ( 4) DEIYEQ-SLETKLM ( 5) IPWSVKNQA ( 12) SSVNAISKFPE
118 103 118 119 132 96 118 118 121 121 146 116 118 142 116 118 139 141 145 125 125 119 136 131 135 132 123 123 123 140 118 131 118 134 117 141 132 129 125 130 157 122 119 120 118 136 118 115 119 119 119 120 125
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
178] 163] 179] 185] 199] 171] 188] 188] 195] 184] 208] 177] 180] 204] 178] 180] 201] 235] 218] 195] 195] 197] 200] 200] 201] 195] 186] 186] 188] 203] 182] 194] 181] 200] 183] 206] 197] 192] 188] 194] 223] 190] 189] 192] 187] 204] 187] 183] 186] 187] 186] 187] 195]
38
gi|42781312 gi|89207881 gi|89097677 gi|126650546 gi|68053821 gi|13474215 gi|153008128 gi|118589625 gi|16127452 gi|113935307 gi|148252405 gi|146343482 gi|8708900 gi|154251148 gi|126359659 gi|104782690 gi|15806957 gi|118050780 gi|114568687 gi|46109886 gi|145256616 gi|71280642 gi|84385612 gi|149908946 gi|148976111 gi|83644846 gi|88800820 gi|15673070 gi|125624291
11 11 11 12 3 45 24 31 11 14 8 8 49 8 47 11 8 9 22 17 35 22 8 8 7 13 10 9 9
RLIENDEWMMNVLQLAKSLE HLIENDEWMMNVLQLAKSLE KAIQEDGWMMDILKAAEAAG SLIAADDWMMNVLEAVEKVH QSLYENETLREQLILVDRLG EIVSRDSLVREALARARTLG QMILESPLLTDALHRLHELG QIIRSIPHVMEILTTVRDLD EIVRGVPTTMHVLKTVRELD DIIRGVPTTMQVLRTIRDLD AAALRNPINTAIMHVLRETD TAALRNPINAAILHVIRRID ALALRNPVNVTIIDELARLA SLALTNEKNRTILQHLPELK ATIQENPINRALLAILPTLD AIALENATNRALLALLPKLD ALVRHNPVNAALLDRLPQLA ADILQNVNNRTILERWKDLN DAVMADPLARIVLERAHGLD ETVKHNKTLMVVLSRAAEMK EAISQNATLTKILSQALDLD EFVMTIDGMPQILEKINH-NLIQQVPELIETANVCREVG HLIQQVPELIETAEACREVG YLIHQVPELVETAKVCREVG ALLQSVPEIMETMEACANYG DILVTVPEIQQVLEAIEQLG KILKKNSELMFILDEVSVLN DILTKNSDLMMILDEISELK
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 4) 2) 2) 2) 2) 2) 2) 2) 2) 2) 4) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2)
DWWICAGFVRSK DWWICAGFVRSK DWWICAGFVRSK DCWVCAGFIRSK DAWICAGYIRSL DWLVVSGALYNS KFWLVSGALYNT DAWLVSGGIYQT DWMIFSGAVYQP DAMVFSGAVYQP DAWIVSGCLVQT DAWLVSGCLVQT DAWLVAGCLVQT DAWLVSGSLFQT HSMLTAGALFQT NCMLTAGCLFQT GALLVAGSLFGT DGWLVAGCLFQT DWALMAGAVYKA NWYLAAGALTQT EWYLASGCIFQT NAWVGAGIIFQN NFYIAGGVITQI NFYIAGGAITQI NFYIAGGAITQV DYYLAGGALTQA DAFLAGGAITQC NYYLAAGSVFQT NWYVAAGSVFQS
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
28) 37) 37) 37) 37) 36) 38) 38) 36) 38) 45) 42) 39) 45) 34) 47) 24) 33) 42) 42) 42) 42) 42)
REP-YLIGAAATG AQV-TVVGSYLLG SQI-TVVGSYLLD TQV-TVVGSYLLG SSI-KVVGSYLLG ASV-SLVGSYPLG LEV-KIIGSYLLG AAV-NIVGSYQLK SDI-AIIGSYMFD SSF-TVAGSYALE EKS-EIFGLDAIN HQV-RVIGGCDGM RIV-HQIGSTRLG VKI-DSLGSWRIC DKV-LKTGSAAIG TGV-KVVGSFPLG IRL-MN-----VE SQI-DVVGSFMTR VNV-NVVGSFSLR TNV-NVVGSFVLR SDI-NVVGSFALR TNI-NVVGSFASK TNI-NVVGSLALK
14) 14) 14) 14) 9) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14) 14)
PDVDVIYFD PDVDVIYYD PDIDVIFFD ADIDVIYYD QDIDVVYYD KDVDLFYFD KDLDIIYFD KDFDIIYFD KDYDVAYHD KDYDVAYHD LDYDVFYSD LDYDVFYSD ADYDVFYFD KDYDVFYFD KDYDIAYFD KDYDIAYFD KDYDLFYWD KDYDLFYFD NDYDLGYFD DDYDLIYFD ADYDLVYYD KDIDVLYWD KDFDVVYFN KDFDVVYFD KDFDIVYFD KDFDVVYFA KDFDIIYFD HDIDIVYFD HDIDLVYFD
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 3) 3) 3) 3) 3) 3) 3) 4) 4) 4) 4) 4) 2) 2) 2) 2) 2) 7) 7)
DEIYEQ-LLEKKLI DEAYEQ-SLKQKLA GEASEK-EAEQLLG TEEKEK-HYEQELL SEATEK-QYKEQLK SYEAED-AVIRRAA SWEAED-AVITRGA SYEAED-AIIGKVN SYEAED-VVIKRVA SYEAED-VVIRRVA SWEAED-AVIKLVA TWEAED-AVIKRVA SWEAED-AVIRKLQ GWDAED-AAIRRCR SWEAED-RVIAKVQ SWEAED-RVIARVR SYEAED-AVIRRAA SETAEQ-KVQAHAD GAAAEA-AVVGPAE SWEAED-AVIQKGK SWAAED-AAIQKGQ SWQSEN-SYIQALT SAITED-EFKNRIR CIFTED-EFKSRIS QLTSED-EFKRRIC ASALEK-AHENTLN PTQSEA-DIEDHLT SAKNDK-KLEKVLS SVSNDK-QLEKSLS
( 5) IPWSVKNQA ( 12) SSVDAISKFPE ( 5) IPWSVKNQT ( 12) SSVDAISKFPE ( 5) VPWSVKNQA ( 12) SSIDAVSKFPE ( 5) EPWSVKNQA ( 12) SSLDGISHFPE ( 4) LPWSVKNQA ( 12) STCDAIAHFPE ( 7) LPVEVRNQA ( 18) NASESISYFAS ( 7) LPVQIRNQA ( 18) SATESIERYST ( 7) SLLEVRNQA ( 18) CAMDSLTTYAS ( 8) DLVEVRNQA ( 20) DSAAALKRFVA ( 8) DLVEVRNQA ( 20) SSADALKRFVA ( 7) AKVEVRNQA ( 18) RSTDGIDRFLT ( 7) AKVEVRNQA ( 18) CSTDGIDRFLT ( 7) VKVEIRNQA ( 18) RSTDGIDRFLT ( 7) IDVELRNQA ( 18) SSAEGIDRFLA ( 7) VNVEVRNQA ( 18) KVTDGIDRYLI ( 7) VNVEVRNQA ( 18) SVTEGIDRYLI ( 7) APIELRNQA ( 18) SVQEGIDQFLI ( 7) IRVEVTNQA ( 18) SSSDGIDRFLI ( 7) VPVEVCNQA ( 18) DTNDALRHFAS ( 7) VEVEIRNQA ( 18) SSEAAMTAWGT ( 25) TPVEIRNQA ( 18) SAEAAIATFPT ( 7) IPFDVKNIA ( 18) SVQESISTWPV ( 6) VDVDVKNQA ( 18) RVEQGIESWLS ( 6) VDVDVKNQA ( 18) RVEQGIESWLS ( 6) VDIDVKNQA ( 18) CVEQGIESWLS ( 6) IPIDVKNQA ( 18) SVEAGISSWLP ( 6) RPVDVKNQA ( 18) SSEDGLRMWLP ( 4) YQFDIHNEA ( 14) NTENAIERWIA ( 4) YRFDVHNEA ( 14) NTENAIERWIA
121 121 121 122 107 163 144 149 132 135 125 125 166 125 164 128 127 127 140 135 171 138 123 123 122 128 125 123 123
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
220] 191] 188] 189] 174] 225] 215] 214] 194] 198] 189] 189] 375] 187] 222] 186] 185] 189] 204] 198] 262] 199] 185] 185] 184] 152] 189] 192] 192]
Group XXI KOG2054 gi|62484300 gi|62900709 gi|118026904 gi|126333897 gi|62900684 gi|68361230 gi|115675586 gi|156382059 gi|110751151 gi|156548202 gi|91085895 gi|158298261 gi|157105710 gi|158603717 gi|156087062 gi|95007283 gi|66359678 gi|66802672 gi|115402991 gi|83773141 gi|121704620 gi|71000741 gi|145245609
109 101 110 76 101 107 14 67 49 103 102 150 141 39 66 153 92 165 78 88 46 80 96
YTDFIENWLESFTAFTRQLK KKDRIDAFLREVNQRVVRVP KKERIDNFLKEVTKRIQKVP KQHRIDTFLHEIKQRILSVP RRKTIDGFLHEINALLGTIP RKKLVDSFVQQVTEFLDCVP KKKGLLPILQSLEEILRKLP -NEALEDTLHALNEVLLAIP YKRLFDIWFKNFKKNIESIK YQKLFQAWYEKFETHVKSIK HRNAFKDWYQSFESFLNELP VGRFSHVWVEQFKQFLHTIE VVNFVEQWLADFRKFLRTVK ERCKLEALAETVKSAIYASK TVMQLNGFIDTIIDFIKNVP ETEAIAAFLVYLRQMLREAP RENSKTMLFADILKLDEEFKLTSLESALHQLKSVIEKIP QISKLQDILHKLKNAIESLP HVSKLQDTLHRLKEVIDNIP QTARAQDILHKLKDLIERLP QVSRVQDTLHRVKEAIEQLH QVSRLQDTLHKLKSVIECLP
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 8) 6) 6) 6) 6) 6) 6) 8) 8) 8) 8) 8)
IVVDVALEM INVDVALTM INVDVAVTM VNVDVVLTM INVDLAVTM VCVDLAVII LTIDMAVQI FNVDVSINM ITVDIMIEM ASIDILVEM LRVNINLTM LVVDLLMVM LKVDLLMEI PVLDLIIII PTIDLSIEL KNVDVAVEM SVVDLVFDL ENVDLMIEI YTIDLAVTM YTVDLAVTM HTIDLAVTM YTIDLAVTM LTVDLAVTM
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17)
KRALYLTYVTERMM KRALYLAHLAHHLA KRALYLAHLAYHLA KRALYIAYLGYHLS KRALYLAHIASHLT KRALYLAGLAQHLS KRALYCMYLVKHLK KRAAYLCVVAHHVI KKAIYLAFIAFNIT KKAIYLAYITSKIG KRYYFLAYIFYHLK KRAHYLCTIAERLL KRAHFLCHLAESLG KRAHYICQVARILI KRNAWLCKLYDDMQ KRSAYVERLYHHVR KRGALVMDLFHKLL KRNLYMLKIFNSVS KRAYYIACLAAGIK KRAYYIACLAAGIK KRAYYIACIAVGIR KRAYYIACLAAGLR KRAYYIASLAAGIR
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
36) 33) 33) 33) 33) 33) 31) 33) 29) 29) 28) 38) 37) 31) 39) 84) 84) 37) 39) 39) 39) 38) 39)
LQVRLFITA VTVRLHPCP VTVRLLPCP VTVRLYPCP VTVRIHICP VTLRIHAIP ATVRLIPCL CTVRIHPVI MNVFIHISA FVVHVHVVV ITIKIFATP VHFQLHVVA VVFVVHVVP GFLRIHFAP FLIRIGAHI WTVRLLFTP FRFRILPSI FQIRIIPTI SQIRVITAI SQIRIFTAI SRIRIITAV SQIRVITAI YQIRIITAV
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
40) 42) 39) 42) 42) 42) 39) 39) 40) 41) 37) 44) 37) 40) 44) 47) 48) 67) 36) 36) 36) 36) 36)
NVLFDLTLSEN WVLQDTVLESH WILQDVALETH LLLFDEVLESH TILSDLTLEHH TVLGDHLPLSH SILRDMSLERH LILRDMVLGGH IILHDLTM-KI AILHDLTALKA SLAHDATLTLN SILYDVRLVKN SILYDLRLLKN KILIDMLREEI SISEDLQRDAI LVLEDLRMQVR AILSNTKNQML SILEDIFINDH ALRSEATVGLF ALRSEATVAPF ALRSEATVASY ALRSEATVGSY TLRSEAAVAQY
310 310 316 285 310 315 219 273 250 308 309 373 351 252 280 428 339 399 294 304 262 295 312
[1193] [1146] [1152] [1121] [1147] [1153] [1023] [1143] [1088] [1151] [1115] [1188] [1154] [1058] [1094] [1628] [1301] [1278] [1099] [1110] [1075] [1117] [1116]
39
gi|67525995 gi|119195467 gi|154281257 gi|85090796 gi|116203599 gi|145608234 gi|46123925 gi|156042404 gi|111055725 gi|50418831 gi|150951287 gi|68466831 gi|149247283 gi|146412656 gi|6321527 gi|156839623 gi|50288129 gi|50307475 gi|45187902 gi|50554475 gi|71018559 gi|159105096 gi|116504072 gi|134110986 gi|19113115 gi|145337144 gi|157348130 gi|162686619 gi|108862244 gi|116058956 gi|159464747 gi|146084045 gi|154335575 gi|71748928 gi|71412160
95 83 71 56 54 58 55 98 139 116 113 117 154 101 120 117 85 118 121 139 241 99 111 142 79 28 29 24 30 101 110 38 38 38 59
QLSTIKDTLHELKDLIESMP QLRFVQQPLRRLKEIIEGIP LLARVEKPLRTLKSIIEKIP ALEGVDSLLHKIKGSIEAIE ALEGADSLLHRIKGVIEGIK SLAGIEELLHQVKSGVEALQ ALKGVDEHLHRLKEAIDTAT RAAGIKSALHQLKNVIEGIE KEAAAENAMRTLKALIEQIP HVTKIEKVLHRLHDFINKIP HVAKIEKVLHRLHDLIAKVP HEEKIEKVLHRLHDLIKQVP HEEKMEKVLHRLHDLIKQIP HITRLEKVLHRLHGLIEQIT HVLKVEKFLHKLYDILQEIP HVLKVEKFLHKLYDMIQLVP HILKMEKVLHKFYDMVQQIP HIVRVEKFLHKLYDMIQEVP HILRVVKFLHRLYDLLQGVP HTKRIGKMLHRVHSIIAEIP KAGALETVLRRLHQLFEALS HKQAIEHLLRSLQACIMCVP RIPPLEKFLLNLHTFLLKIP PHAALKDLLTAIHSRILGMP YFRHANTFVEKIKDLIFKTP LRKLVDDTVSSIKEAIDGIP TTKLVDDTVSAIKQAIDTIP RCRIVDSAVAAVKERLLSLP ALRAASEAADAVAGLVKRIP PRDRRRRRMDRVNVVEGEGR KGSGVEAVLARLKEALMSMP -YPTEVESRHAVTAEAAAAA -YPTEVETRHVAIIQRAVTG -YSTASCTGAQALDSVSEPI -YSAVTHTTATVEE--DEEQ
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
42) 41) 42) 42) 42) 42) 42) 42) 41) 40) 40) 40) 40) 40) 42) 42) 42) 42) 41) 39) 45) 44) 41) 44) 42) 30) 29) 30) 30) 23) 29) 20) 19) 5) 5)
ANI-NVVGSFALR TAI-SVVGSFALK VNI-NVVGSFAVK SQF-NVVGSYVSK AQF-NVVGSYVSK STF-NVVGSYVLK ASY-NVVGSYVAK SGI-NVVGSYALG ASI-NATGSYPLK EDV-SLVGSFGLK EEV-SLVGSFGLK GDL-SLVGSYGLK DEV-SLVGSYGLK QDI-SLVGSFGLK D-I-SLIGSFALK S-V-SLIGSFALK G-V-DLIGSFALK D-V-SLIGSFGLK S-V-ALIGSFALK SDV-NVVGSFALG SAM-HLVGSWPLK EAL-NLVGSWPLH SEI-TLVGSWANK EEV-VIGGSWSVV KTV-TPGIFSCSN NGF-NLCGSYSIC KLF-EIGGSYSIR DGV-EIVGSYAVQ EVV-RVAGSHAAG TRV-RTIGSHAEG SAV-TVVGSYAAR DVV-SAAGSFLLR DVV-SPAGSFLLR DPV-QAAGSFLLR GPI-SSVGSFLLR
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
50) 50) 27) 26) 30) 28) 27) 21) 21) 26) 31) 43) 43) 43) 43) 38) 24)
KGVMRVGLLAKG KGVMRVGYLAKG RGVMRVGNLAKG KGVLRVGVLAKG KGVMRVGVLAKG KGVMRVGILAKG KGVMRVGLLAKG KGVMRVGILAKG KGVVRVGILAKG KGVMRVGILAKG KGVMRIGVLAKG RGVMRVGLVAKG RGVMRVGLVAKC RGVMRVGLVAKG RGVMRVGLVAKG RGVMRVGLVAKG HGVMRVGLVAKG
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
8) 8) 7) 8) 8) 8) 8) 8) 8) 8) 8) 8) 6) 8) 8) 8) 8) 8) 8) 10) 8) 8) 10) 10) 8) 6) 6) 6) 7) 12) 6) 16) 16) 16) 16)
YVVDMSVTM PTVDLAVTI TTIDLAITI HAVDMVVVI YAVDMVIVL LAVDMIVEM FGIDMVVQM LCVDMIVNM FSIDLVVTM MSIDIALTM SSIDVALTM QSIEVALTM SSIEMAVTM TAIDVALTM SSIDTLLTM SAIDVLLTM SNVDILLTM SAVDVLLTM SSIDVLLTM VAVDVSVTM VDVDIAVVM MDVDMEVVM YGVDVAVEM GGIDLVVAM WSYDLFLEI TSVDLLVHL VDIDLFVRL QTVDLAVRL VSADLLVRL IVVDVALEM PTVDVALQL VGADLVLSI VGADLVLSI VGADLVLTV VGADIAIKI
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 17) 56) 17) 17) 17) 17) 17) 17)
KRAYYIACIAAGIK KRAYYISCIAAGIK KRAYYIACIAAGIK KRAYFLGVVAAALQ KRAYYLAVVAAALR KRSYFLAHVAAAVR RRAYYIAYIAASLK KRAYYLATIAAGLH KRAFYLACLTSGIK KRAFYLAYVADNLI KRAFYLAHLAEHLV KKSFYLAYLGENLI KRAFYLAYLADHLM KRAFYIAYVAEHLI KRSVYLAYLTHHLL KRSVYLAYLTHHLS KRSVYLAYFTHHFS KRSVYLAYLTHHIS KRSVYLAYLTYQLS KRAFYLTVLAAALK KKAFYLATLAHAIQ KRAFYLAVILAYVE KKAFYLATIAQAIQ KRIFYLAVIFSELQ KRAFYLTCIAKHLL KRCLYLCVIEKHLL KRFLYLCIIKKYLN KRALYLAVLKKAIT KRCLYLHVIEKSLR KRAVYLEVLLKALS RRAVYLVAVAHHLR RRQSFVDEIQAYLQ RRQSFLREIQEYLE RRHEFLLKVERFLR RRHVFLLKVEKFLK
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
38) 39) 39) 39) 45) 48) 54) 37) 40) 45) 45) 45) 45) 45) 49) 51) 49) 45) 45) 37) 45) 39) 41) 41) 38) 35) 35) 36) 37) 31) 33) 69) 84) 59) 53)
FQIRIITAV LIVRVITAV SRIRILIAI YRVRIIPCA FRIRILPCA PKVRVIPCA YTIRLIPCA FEVHIIPAA CAINILVAV FTINIIAGF FTINLIVAF IAINLIVAF YTINLIAAF FYINILVGF FSINLLIGF FSINLIVGF FSIKFIIGF FSINLIVGF FSVNLIVGF FYIRILPVI CTIRIHPSL TVVRIHAAH ARVCIIPTL VDIRIHASI FTVFLIPTV FSIRLIPSA LSVRIIPTA FVIRIIPTI FYVRIIPTA RSLRLLLTI FRLRLLPAL QILQLRFRR QILQLRFRR YIIRIVFLR DILRICFLK
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
35) 38) 46) 37) 37) 37) 36) 38) 39) 34) 34) 34) 34) 33) 36) 36) 36) 32) 32) 32) 42) 40) 40) 38) 33) 33) 34) 82) 34) 35) 62) 40) 40) 40) 40)
CLRSEATVALY TVRAESSVAEY AIRSEASVSAF TIVSESCFFPY TVVAESCYLPY TVKSDSLYFAY TLKAEETFISY SLSADCNFESY TVQSDANVTAY SILSLTSYDYY SIISMSAYGYY SVLSQTSYDYY SIITQSSYDYY SILSMTAYDHY SVLSSSTHENY SVLSSSTHEHF AVQSSTTFSSY SVLSSSCHEHY SGLSSCTHEHY AILADMTTTRY AILADTLHLPH CIAAESLRLAH ALLYSLTPKPH SILHDTLHKPH SVLEEQNLLFY SILEDMFLEEN SILEDMFLEDN SILEDMAVEST SILEDMFLEEN TIAEDMYANVH GILMDMLLPVH CILEDYLMPHY CILEDYLMPHY LVLEDALMTTH LVQEDALMTVH
309 300 296 273 277 284 286 314 358 334 331 335 370 318 345 344 310 335 337 348 472 321 334 366 291 223 224 269 268 293 331 273 287 248 261
[1107] [1098] [1060] [1135] [1086] [1127] [1102] [1073] [1206] [1180] [1176] [1175] [1217] [1161] [1237] [1230] [1191] [1212] [1216] [1157] [1395] [1227] [1252] [1238] [1097] [1053] [1083] [1111] [1036] [1068] [1429] [1257] [1268] [1236] [ 574]
Group XXII PF07528 gi|157118116 gi|125978459 gi|66510594 gi|149027296 gi|123227937 gi|160395566 gi|119894960 gi|126323186 gi|109480165 gi|47225665 gi|115970594 gi|119604541 gi|5762315 gi|29126828 gi|514259 gi|62510785 gi|47220271
510 536 587 676 735 606 558 754 498 738 585 27 27 17 27 33 17
ELQTIQRIVSHTERALKLVS ELQTIQRIVSHTERALKLVS ELLAVQKIVSHTEKALKFVS ELQAVQKIVSITERALKLVS ELQAIQKIVSITERALKLVS ELLAVQRAVSHAERALKLVS ELLAVQKAVSHAERALKLVS ELQAIQKVVSHSERALKLVS ELQAIQTAVSHTERALRLVS ELQAVQRIVSHTERALKLVS ELQLVQNMVSQMEKALKGVS ELEAVQNMVSHTERALKAVS ELEAVQNMVSHTERALKAVS ELEAVQNMVSHTERALKAVS ELEAVQNMVSHTERALKAVS ELEGVQNMVSHTERALKLVS ALEAVQTIISDVERALKNVS
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6)
NTVQLVVLC NAVELVVLC NHVCLVVLC RNVNLVLLC KEVNLVLLC RNVRLALLC RAVQLILLC RNVQLILLS HSVQLTLLC RNVQLILLT LNMGLVILC LDLELVLLC LDLELVLLC LDLELVLLC LDLELVLLC LDLELVLLC SSLELVLLC
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4)
TTSLLK-RVVTELP TVGLLQ-RVATKLP TRSLLN-KVAEILP SKSLLS-RIAENLP TKNLLT-RIVEHLP THSLLR-RIAQQLP TRALQR-RVAEQLP TQSLLQ-KITEQLP THSLLQ-RIKQELP TVSLLN-SIAKQLP TRTLLE-RIANLLP TTALLD-KVADNLA TTALLD-KVADNLA TISLLK-RVADNLV TISLLK-RVADNLV TITLLK-KVSDNLA TVSLLK-DVAEKLT
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
29) 31) 29) 31) 31) 30) 30) 31) 30) 31) 29) 31) 31) 31) 31) 32) 30)
ITVKISLTS VSVKITLTS VTVSVTLTS MQVTITLTS MQVTITLTS MQVTISVTS IQVAVSITS MQVTVSVTS VKVTVSATS MQVTISLTS MTMTVSLTS LSLTIHLTS LSLTIHLTS LTLHIRLTS LTLNIRLTS LTLTIHLTS LTLTIHVTS
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
31) 35) 34) 31) 31) 30) 31) 31) 30) 39) 28) 31) 31) 31) 31) 31) 37)
QALAALRHAKW RALADLRHAKW EALAALRHAKW DALAALRHAKW DALAALRHAKW ESLAALRHARW QALAALRHAKW ESLAALRHAKW ETLAALRHAKW EYLAALRHAKW EALAALRHAKW AALASLRHAKW AALASLRHAKW AALASLRHAKW AALASLRHAKW TALASLRHAKW SALASLRHAKW
703 735 760 847 910 777 729 920 662 917 756 215 215 205 215 217 191
[ 884] [ 915] [ 928] [1010] [1074] [ 939] [ 892] [1083] [ 822] [1082] [ 936] [ 765] [ 894] [ 854] [ 695] [ 833] [ 570]
40
gi|71834292 gi|109110278 gi|68367102 gi|47209094 gi|47227654 gi|148233852 gi|71996882 gi|92081460 gi|21357109 gi|91076380 gi|157112084 gi|66531196 gi|114051277 gi|54261787 gi|118344266 gi|156367032 gi|158598844 gi|76162585 gi|17563154
124 27 27 10 27 27 276 622 70 70 76 72 73 70 62 16 76 61 58
QLDAVQNMVSHIEFAMKAVS ELEAVQNMVSTVECALKHVS DLEAVQSLVSTVECALKHVS ELEAVQALVSTVERALKHVS ELEAVQTLVSTVEGALKKVS ELEAVQSMVSTVECALKHVS FISNVDRLISDINESLKYVS ELEAVQSTVQTVETAFKDVS EQTAIGNLVTKVQAVLDNLV EQTAILNLVTKIQTVLDNLV EQTAISNLVTKVQAVLDNLV EQTSILNLVTKLQSVLDNLI DQAAVLSLVTKLQTVLDNIV EQASILSLVTKINNVIDNLI DQTTVLGLITKCTTALDSLV EQALVQNLVTKVTSVLDNLI EQTAVLGLISKIKAAIEKIS SQYALSNLSNRICEILDNII IRKRINEYAKKVIVALEKEK
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
22) 36) 52) 46) 37) 35) 31) 27) 11) 11) 11) 11) 11) 10) 11) 10) 11) 11) 10)
RGVLRVGLVAKG CGVMRIGLVAKG CGVMRIGLVAKG CGVMRIGLVAKG CGVTRVGLVAKG CGVMRIGLVSKG LGCSRVGIIAKG TAVVRVGELSKG EEVRQVGSFKKG EEVRQVGSFKKG DEVRQVGSFKKG EEVRQVGSFKKG EEVRQVGSYKKG EEVRQVGSYKKG EEYRQVGSFKKG EEVRAVGSHKKG EEFREVGSFRKG DQVRSVGSFKLN ATISHVGSFVTD
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6) 6)
LDLELVLLT MDLELVLMC MDLELVLMC MDLDLVLLC MDLELVLMC MDLELVLLC RCAEVVLTC LHVALVLMC NVADVVVIL NVADIVVIL NVADIVIIL NVADIVVIL NVADIVVIM NVADLVVIL LIADVAVIF PIADLTVIL NVADIVVVL CISDLTCVF DKSDVVVQL
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4) 4)
TSSLLR-LVSGKLS TETLLN-TVKDNLP TKTLLS-TVCEHLP TQTLLD-TVCHRLP TKPLLY-TISTNLP STSLLA-TVAENLP TSGLVE-QIRRLFG TKTMLN-RVAGHLP TKEAVD-ALAKKVE TREAVE-ALGNKVK TKDVAE-VLGKKVE TKTAVE-ALGTKVN TKEAVE-GLSNKVN TLEAVA-ALGNKVV TKESVK-LLADKIV TEADII-ALATRVQ TVEAVS-ALGQKIV TLEAVE-NLANFVR SYETVA-ELGRKVV
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
31) 31) 31) 31) 31) 31) 27) 31) 33) 33) 33) 27) 30) 27) 27) 24) 26) 30) 27)
LKLRIHLTS LTLKVILTS LTLKVTLTS LTLKVTLTS LSLKITLSS LILKVILTS MKCRILITS VTIRITLTS AKVRILIAT ACVRVMVTT ARVRCLIAT ATVRVLITT AAVRVLITT ATVKILITT TTVNVLIAT ALVRVLITT AIVRLLITT YVVQVLITT CQVRLLITI
( 31) ( 30) ( 30) ( 31) (127) ( 31) ( 22) ( 32) ( 20) ( 20) ( 20) ( 20) ( 20) ( 20) ( 20) ( 20) ( 20) ( 20) ( 20)
ASLASLRQAKR NALASLRHAKW VALASLRHAKW LALAALRHAKW AALASLRHAKW NALASLRHAKW YALASIRNTKW DALAALRRAKW SHLAAIRHTRW SHLAAIRHSRW SHLASIRHARW GHLAAIRHSRW SHLAAIRHSRW SALAAIRHARW ASLAAIRHVRW GALATIRHSRW SHMAALRHARW TALASIRHLRW INFFSTRHITW
291 207 223 201 305 207 439 795 217 217 223 213 217 210 203 153 216 205 198
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
820] 619] 690] 673] 850] 483] 608] 947] 396] 393] 398] 383] 382] 387] 367] 339] 484] 241] 370]
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
RYFLFHGTLLGA RYWLTRGALLGA TYFIAFGTLLGA DYWLEGGTLIGA RYWLDSGTLLGA PYWLSSGTLIGA SYWLDGGTLLGA QYWLDSGTLLGA DYWLDYGTLLGA NYIINYGTLIGA PYFLSYGTMLGA KYFVNYGTLLGS EYSLGGGSLLGA EYTLAGGSLIGA EYSLAGGTLLGA RYALIYGTLIGA RYSLGGGTLIGA HYSLGGGTLIGA QYSLGGGTLIGA PWYLSGGSLLGA TFFAIYGTLLGA RYMLDAGTLIGA TYYALGGTLLGA TYYALGGTLLGA KYFASGGSLLEA RYFALGGSLLGA DFFLRGGSVLGA RYYIMAGTMLGA TYYIAFGTLLGA LCYLCGGGAIGA LCYLCGGGAIGS TCYFCGGGCIGA DYFLIGGSLLGA
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10)
NDIDIAIPIDLDIALPDDLDIYINDDIDLSIMDDIDIIIMDDLDIEMLDDIDIGMTDDIDISMPTDTDVGMLDDIDLSMPDDIDISLYDDIDISMYDDIDLMLEDDIDIELTDDIDIMMPDDVDIMMPDDIDVYMHDDIDVYMHDDIDVYMHDDIDLMMYDDIDVVMPDDADVAFTDDIDIGMPDDIDIGMPDDMALGLPDDMDLGIPDDMDIAVPDDLDIGMPDDVDVCMPDDLDFFMPDDLDFFMPDDLDFFMPDDIDIGMR-
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
-EEDFH-KLQAAAE -EEDYF-RLLYLIK -DKGLK-KLRKIIN -RKDYE-KLIKVLP -QEDAK-FLKENYK -RSDYL-RLLKVLP -LEDMQ-AFMKVAP -LEDYL-KFKELGA -RSDYA-LFLEKGV -REDYQ-RFINIFQ -REDYE-RLLKIIE -REDYE-RFQKAVI -RSQYE-RLMKALA -RPHYD-HLISVLM -REDYE-YLLQNFA -RPDYD-SLLQYLK -RDEYQ-RFVDVWF -RDEYQ-KFINAWL -RDHYQ-KFVNIWK -RKDFE-RLCEVAS -RKDFE-KFKKMAV -RNQYE-AFMKVAP -REDYE-KFKKIAS -REDYE-KFKKVAP -RKHFE-KFINEID -RNDFD-KFTNGID -REAYD-KLPSVFK -RADYD-LLMTNAK -REDYD-RFIREGS -RKDYE-KLAELWP -REDYE-KLALLWN -RPDYE-KLKVLWP -RHDYQ-RFIAVAN
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
49) 83) 45) 58) 34) 46) 46) 47) 51) 43) 44) 44) 43) 44) 41) 45) 42) 42) 42) 47) 46) 51) 43) 43) 36) 36) 46) 43) 45) 45) 46) 46) 45)
LMVDVSVLP LMLEIYPVK VQIDLFPYG VKIDLFPKD LDIDIFLVT IYIDIFPLE LYVDIFPFI VFVDIFPML LQLDLFIYD IFIDIFPID LFIDVFPID VFVDVFPID IFIDIFPLD VFLDIFPMD INIDIFPID VFIDIYPYD MFIDIFIYD IFIDIFIYD IFIDIFIYD IFIDIFPLD IWIDILALD IWVDLFILD LWIDIFPID LWIDIFPLD VCLNLFLLD VCLDIFPLD HLIDIIPLD VYIDVFPLD VFIDIFPLD LALDVLPLD IALDILPLD IPIDIFPLD VFVDIFPFD
( 46) ( 38) ( 92) ( 90) (109) ( 96) (100) ( 95) ( 46) (109) (110) (110) (115) (116) (110) (115) (113) (110) (110) (116) (113) (113) (117) (117) ( 70) (115) (116) (106) (107) (109) (110) (110) (109)
NYEKMLKQWYG DWRRVLRTTYG NPGLVLEVDYG DIEHTLEVQFG NYDTYLKKMYY QTDAVLRKIYG NVDAYLKDLYK NPSHYLDAIYS GYDAYLTRCYG KFDTILTQFYG KYDQFLTQMYG NYDKILTQFYG NEHAYLNQLYG NHEEYLTRQYG RYDDYLSMIYG EFDKILRHEYG GYHEHLTQYYG GYHEHLTQYYG GYHEHLTMYYG DYDPLLTKQYG GYKRVLEMTMG GWHQILTEVYG KYKDYLKAIYG KYKEYLIAIYG DDNVNI----NYDCYLKNIYC GYDRYLKRLYG EYDTYLSQKYG NYDEYLRVLYG GYDVYLRTAFG GYDGYLKTAFG GYKRYLTEVFG AYDTILTRMYG
253 331 256 296 235 243 245 234 187 243 245 246 249 251 242 251 246 243 243 275 309 256 249 249 185 244 249 240 263 246 247 250 242
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
280] 374] 280] 346] 261] 265] 271] 260] 208] 267] 269] 270] 274] 278] 269] 273] 268] 265] 266] 303] 454] 280] 273] 276] 185] 265] 268] 270] 296] 281] 281] 282] 269]
Group XXIII PF04991 gi|156346924 gi|156368811 gi|68643684 gi|148643572 gi|19704581 gi|150003461 gi|160891638 gi|149199197 gi|60683615 gi|15901133 gi|148998913 gi|160947073 gi|25011530 gi|15672196 gi|28898096 gi|68643822 gi|53732817 gi|46133414 gi|46156860 gi|104774621 gi|160936491 gi|160937625 gi|68644462 gi|68644601 gi|149006105 gi|68644065 gi|2209209 gi|153807282 gi|160946338 gi|148993075 gi|160947432 gi|29376683 gi|28377688
79 131 40 69 13 22 20 13 11 12 12 13 12 12 12 12 12 12 12 33 71 13 10 10 5 14 8 12 32 13 12 15 9
ILTRLLRVLALLCDKHGV-VLTRLLLAFDALFKRYGL-RMIDMLSFLNDICKENNI-LYIELLRFVDNVCKKHDI-KKLEILIDIAKFCNENKI-RMLELLEVIDVICRKHQI-KQLSILKEVDRICRKHKL-IMLDMLIEFDAVCKRNQL-RMLYLLQSFDTVCKKHDI-IELEILDYIDTLCKKHNI-IQLALLDYIDETCKKHDI-YELNILKFIDYVCKKYDI-VQLEMLAYIDKVARDNKI-IQLEQLKYIDRICRENGI-VQMSILDRVHLFCERHDL-VSLEILHTIASICEKQHL-VCLDILDYFHALCEKHQI-VCLNILDYFHALCERHQI-ASLDILKYFDQLCAKNDI-VELDLLDQLDRVCQKYDI-TQIDLVKVLEKICKKYNL-ANLTILKEIDRICRKYNI-KILEILKIFIETCEENNL-KILIILKEFINICEENNL-EKLKLLKEFIKICSKNKM-EELKLLKEFIRICSQNKI-VELDAIKEFKKICEENDI-RILKNLLAIDKVCKEHNL-KELEILKNFISCCEKMNL-RSLEMAEYFVAFCKEHDL-KSLEIGKYFVSFCEENNL-VVLRMSKYFVAFCEEHNL-VELRLMRTVIAICEAENI--
41
gi|116492338 gi|110800391 gi|68643136 gi|68642747 gi|68643227 gi|68643175 gi|68644524 gi|153854451 gi|153854450 gi|125718998 gi|88713802 gi|150865893 gi|156358426 gi|21314856 gi|111034980 gi|152003209 gi|18309604 gi|157828907 gi|67458620 gi|91205323 gi|74212177 gi|46395992 gi|148223756 gi|91087595 gi|156538256 gi|72389006 gi|71654695 gi|157873903 gi|156354208 gi|69937308
10 10 11 43 26 11 15 16 21 35 14 305 301 276 273 184 49 57 61 57 321 321 358 245 328 108 202 212 87 186
VEIEMLKVIIEICDKHDI-INLEMLLETKRICEKNNI-LELLLASEVLKICGKYNL-YLLKIMEDIVTVCEEEGL-ELLKMISDVFTFFDENGI-TLIEILDFVKEICEKHEL-LELKILKEIIRICKKEKI-VELLMLKDFMKLCDDNNI-VELEILKDFMDICDRHGL-AYLQMYKKFDSECRRHHI-QAEKLLIDVISIFETCKI-IFHRLSRAWLRFANSLNI-KLIELTSQIHSILNSIGL-RAKELLQLAAKTLKDLGV-RAKSLLHLASRVLFMLRV-EAAEKLAEFRDVLLTFNM-HAKDCLETIKENLDKNNL-ALYQLMKDTHELLGKNNI-ALYQLMKDTHELLGKNNI-SLYQIMKDTHELLTKHNI-ALRETARYVVGVLEAAGV-ALRETARYVVGVLEAAGV-ALRETTKYVINILESSGV-NLRKTAKHVFNSLDEAGI-GLRKVAYHVFDKLEEVGI-AMHGLLQEVIDVFQQAGI-LFQRTLLDFFGVTNSLNV-AFQEALLAVQEVLTAQKI-QLTELLNYWNKFTKNNNI-GAETALRDTLALLETGGF--
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
KYMLIGGTLLGA KYFLIGGSLIGA KIVMLAGTFLGA YYSLSGGSALGA AYSLSGGSILGA TYFLVFGTALGA EYFLIGGSALGA EYFAISGTAIGA DYFGIAGTGIGA HSYMVAGTLIGS EYWLEGGTLLGI STWLAHGTLLGW QHWLMYGSIIGA PFWLSSGTCLGW PFWLSSGTCLGW FAFLNGGTLLGW HFWLDYGTLLGA NYWIDGGTLLGA NYWIEGGTLLGA NYWIDGGTLLGA RYWLEGGSLLGA RYWLEGGSLLGA RYWLEGGSLLGA RYWLEAGSLLGA RYWLEANSLLGA RYWAAGGTLLGA PLFLCCGTALGA RFFLACGTALGA SYAIMWGSLLGL RPFILSGTLLGA
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10)
DDVDLGMTDDLDIGMLDDMDFGMPDDMDIFMLDDVDINIPDDVDIALPDDIDVGMTDDIDVALLDDIDVAMPDDIDLVMFNDLDISIHEDLDVQITHDVDLGARKDVDLGIFKDVDVGIWADIDLAMFLDIDLGMFDDLDIGIMDDLDIGIMDDLDIGIMYDVDLGIYYDVDLGIYYDVDLGIYHDVDVGFNHEVVVGINDDVDLAISNDIDVGVFEDIDLGILYDIDVIVSYDIDLGLF-
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 1)
-RENYD-EFARVAP -REDYE-KFLSVCK -REDFE-RFKELCV -GSERE-IFFQKFS -RESYD-KLFSLFE -REHYN-IFIDALS -RENYD-NFLRVAE -RKDYE-RFVKVMK -RDDFE-KLLPLVE -REDYN-RLQEIFA -EKEGN-KLAPLLK -MESMI-KLARNYN -REDFD-KLDR--K -IQDYK-PD-----IKDYR-PD-----AEDFH-PE-----YENQV-EE-----HEDEI-RLQQILP -HEEEI-HLQQILP -QEDEI-RLQQIFP -LEDVG-NCEQLRG -LEDVG-NCEQLRG -LEDVP-NCDYLKN -RDDLL-RSPWLKK -RDDLN-RSPWLVK -VEDEL-KLRSAFK -YEDLQ-RLGGTME -YADIA-VPVTEGD -LKDLV-KLRKLAK DETDLT-RLERLLH
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
45) 45) 46) 43) 46) 45) 45) 47) 45) 43) 43) 51) 31) 32) 31) 36) 31) 20) 20) 20) 32) 32) 32) 32) 32) 79) 43) 60) 61) 39)
IFIDIFPFD IFIDIVPFD IYIDIFPID FFIDIFWLE ISIDLYIVE IYIDIFPLD IYVDIFPFD ILMDIFIYD IFLDLYPLD LYLDMLPID VCLDVFI-FYVDITALA LSVDLFLFQ VKLDIFFFY VKLDIFFFY VNMDLFLMY LYFDIFYYF RCLDIFVFH ACLDIFIFH GCIDIFTFQ LHVDLWPFY LHVDLWPFY LHVDLWPFY IYVNLFPFY LHVNLLPFY YFVDIFIMR VALDLNVYY ARIDINLYY FFIDIYTYV IMIDIFLHY
(109) (113) (110) (122) (122) (112) (116) (118) (119) (121) ( 41) ( 88) ( 49) ( 42) ( 42) ( 46) ( 58) ( 44) ( 44) ( 44) ( 42) ( 42) ( 42) ( 42) ( 40) ( 57) ( 64) ( 73) ( 39) ( 36)
HYDRILKRMYG GYHNYLKKVYG NYRAYLKHMYG NIEHYLTVMYG WSEEYLEMFYG KLEDYLKCFYG DFHQYLTTHYG NYDEQLRKHMG NLHNNLTGMYG DADKYLTNRYG LTDDYLTYRYG NYEKILKREYA GELEILRHLYP ETVDYIEANYG DTEDYVRANYG SPQKMLVFEYG NPDGYLRENYG EYKENLNRQYP DPIGNLNRQYP NPKKNLNRQYP NYRRFLELKFG NYRRFLELKFG NHRAFLELKFG NIRDFLELKFG NIRDFLEIKYY YPLMHLHRLYG PPVSYLVEYFG PPERYLVENYG NPERLLEDWYG NAEQNLTENYG
243 247 246 287 273 247 255 260 264 278 175 523 458 424 420 340 212 200 204 200 474 474 511 398 479 323 388 424 266 342
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
270] 270] 273] 311] 298] 271] 284] 283] 293] 307] 194] 622] 473] 461] 457] 379] 246] 266] 254] 249] 494] 495] 536] 416] 495] 352] 414] 449] 296] 424]
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
429] 429] 362] 669] 351] 363] 359] 471] 378] 368] 365] 365] 365] 366] 368] 365] 359] 360] 364] 364] 348] 506]
Group XXIV KOG3963 gi|31982050 gi|157824039 gi|147742988 gi|73981150 gi|119889567 gi|118083484 gi|68366726 gi|126313543 gi|149470435 gi|148223978 gi|66511238 gi|91080531 gi|66511241 gi|118779375 gi|18857945 gi|118779373 gi|41282159 gi|72004700 gi|119625405 gi|25151341 gi|156386236 gi|118093007
90 90 23 68 12 23 23 131 38 23 32 32 32 32 32 32 26 26 26 26 15 154
QISQTMEEVQKIIHLLTTEI QISQTVEEVQKIVHLLTTEI QISQAVEEVQKVVHHLTTNI RISQIVEEVQRVVHHLTTEI WISQMVEEVQKVVHHLTTEI LVSKTVEEVQKIIQQLTTEI QVSKCVEDVQKIIKDLTTEV TISKTVEEVKKIIHELTSVI REWQEVLEVQSVIQKLTAEI RVSRMVEEVQKILLQVTSDI QVMKTIQEVCRVVQDVLKEV QVSKTIQEVCRVVQDVLKEV QVQKTIREVCKVVQEVLKEV QIHKTIQEVCRVVQDVLKEV KTATAIREICKIVQDILKEV NIALVLREICKIVQEVLREV AISKTIREVCKVVSDVLKEV NSNKTVREVCKIVTDVLKEV AIAKTIREVCKVVSDVLKEV RVTKTVQRIAKVVQEILKEV ETYQAIANVCRVVPEVLKHV DMARMREMVEGFADDLLEAL
( 7) ( 7) ( 7) ( 7) ( 7) ( 7) ( 7) ( 7) ( 7) ( 7) ( 7) ( 7) ( 7) ( 7) ( 7) ( 7) ( 7) ( 7) ( 7) ( 7) ( 7) ( 10)
EAVPVS-DTHNES EAVPAS-DTYNDS QAVPYS-DTYNEN QAVPYS-DTYNGN QAIPYC-DTYNEN QAISNS-GIHNEN QSIANA-GVHNAS QPISKS-DLYNEN QAVSDS-GAR--HCISNA-GIH--ISSLTD-YNGR-ISSLTD-YNGR-ISSLTE-CNGR-ISSLND-YNGR-ISSLVE-CNGR-ISSLVE-CNGR-ISSLNE-MDNR-ISSLVE-NETGRISSLSE-IDAR-INTLSE-TTTGRISSLKQ-LDGR-SGDSFP-PDEQG-
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
6) 6) 6) 6) 6) 6) 6) 6) 9) 9) 9) 9) 9) 9) 9) 9) 9) 9) 9) 9) 9) 9)
SLFHVTVTL SLFHVTVPL SQFLVTVPI SQFLVTVPV TQFLITVPM SQFLITVPL TLYLISVPL NQFLITVPL SQLLVPVPL SQFLVTVPL TEFEIVIYL VEFEVVIYL GEFEVVLYL TEFEIIIYL NEFEIVLYL TAFEVVLYL TEFEVVLYL NEFEVVLFL TEFEVVLYL SEYEAVLYL KRFEVCLYL DEDETGCIC
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
62) 62) 62) 62) 62) 62) 62) 62) 62) 62) 44) 44) 44) 44) 44) 44) 44) 44) 44) 44) 44) 39)
PAKVLQ-VFRTLVE PAKVLQ-IFQMLVE PAKVLL-VFRKLVE PAKVLQ-VFRKLVE PAKVLQ-VFRKLVE PAKVLV-VFRELVE PAKVVN-VFKDLLE PAKVLQ-VFRNLVD PAKVLA-VLRGLLE PARIIH-VFLEHLY ARKIRS-RFQTLVA ARKMRS-RFQTLVA ARKIRS-RFQTLVA ARKIRS-RFQTLVA SRKIRA-RFQTLVA ARKIRS-RFHTLVA ARKIRS-RFQTLVA ARKIRS-RFQTLVA ARKIRS-RFQTLVA ARKIRH-RFQNIVA ARKMRS-RFTSLVG ADQVMK-WFQIAVT
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
31) 31) 31) 33) 31) 32) 32) 32) 32) 32) 28) 28) 28) 28) 28) 28) 28) 28) 28) 28) 29) 34)
VELELAPTV VELELAPTV VELELVPAV VELELVPAV VELQLTPSV VEVELAPTV IEVKLVPTV VEVELVPVV VEVELVPTL VEVELVPVV YVVQITPAF FIVQITPAF YVVQITPAF IIVQITPAF FVVQITPAF YTVQITPAF YVVQITPAF FIVLITPAF YVVQITPAF YTVQITCAF YTVELIPAF IAFNLTPVV
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
39) 39) 39) 39) 39) 39) 39) 39) 39) 39) 52) 52) 52) 52) 55) 52) 52) 52) 50) 57) 51) 24)
AYHWQLSFSQA TYHWQLSFSQA NYHWQLSFLRA NYHWQLSFLQA SYHWQLSFLRA NYHWQLCFSRA NYHWLLSFSRA RFHWQLNFSQA RYHWQLSFGRA NYHWQLSFLRG GDAWALSFIDA GDAWVLSFIEA GDAWVLSFTEA GDAWVLSFTEA GDAWVLSFFEA GDAWLLHFTEA SDAWVLQFAEA GDAWVLSFTDA GRERRLGATVR ADAWAMKMHGA GDAWVLSFTDS STHWFLTFAVY
308 308 241 288 230 242 242 350 257 242 243 243 243 243 246 243 237 238 235 243 226 342
42
gi|156389018 gi|156390711 gi|109485035 gi|109071820 gi|114608195 gi|119901086 gi|126310381 gi|118088911 gi|68371332
23 2 166 211 210 298 172 72 239
AKQKALAAWQPPVQKILKFV KSPKKEQHTTKFVAALISKV EISAAAETVNKVVDQLLRRM DISKAAKVVNGVVGHLLRRL DISTAAGMVKGVVDHLLLRL EISVAAEVVNRLGDHLLRRL ERSEACGLVNKVLDHLQRQL DVSEASGLVNHVVSHLIQAV ERSKASRCVNEITEKVIAHL
( ( ( ( ( ( ( ( (
7) 7) 7) 6) 6) 7) 7) 7) 7)
KFDRLL-MTGSYDFSDII-LTGSVKGVEQL-NTGSYGGVEQL-HTGSYRGVGLL-NTGSYKGVDLL-RTGSYKGIVAL-GTGSYSSIRRL-GAGSYADIERL-RTGSY-
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 9) 9) 9) 9)
QDVRLNGSAAGH HDVRLNGSAAGH KDVRLNGSTASH KDVRLNGSTASH RDVRLNGSAASH RDVRLNGSAASH RDVRLNGSAASH REVRLNGSAASH HNVRLNGSAASH HNVRLNGSAASY HGVRLHGSAASH HSVRLNGSAASH RDVRLNGSAASH RDIRLNGSTASH KDIRLNGSTASH KDTRLNGSTASY KDTRLNGSTASY KDARLNGSVASY RCIRMNGGAASY RDIRLNGSTASY RDIRLNGGAASH RDIRLNGGAASH KDIRLNGGAASH KDIRLNGGAASH
( ( ( ( ( ( ( ( (
9) 9) 9) 9) 9) 9) 9) 9) 9)
DEFDLMIEI DEYDYLLIL NEFDVMFKL NEFDVMFKL NEFDVMFKL NEFDLMFTL NEFDVMLKL NEFDIMLVM DEFDVMLTV
( ( ( ( ( ( ( ( (
38) 31) 40) 40) 40) 40) 40) 41) 41)
ATGIKS-TASGLVN PLRLRK-RFSSALM ATKVLS-KFRELIK ASKMLS-KFRKIIK ASKMLS-KFRKIIK ASKMLF-KFRKIIK ASKLLS-KFRKIIK AFKMLE-DLRRIIK ASEMLS-EFRDGVK
( ( ( ( ( ( ( ( (
16) 37) 32) 30) 30) 32) 32) 33) 35)
YAVDLTFAF CDIDLVIAL ISVDIILAL ISVDITLAL ISVDITLAL ISVDIILAL ISVDIVLAL ISVDIILAL ISVDFVLGL
( ( ( ( ( ( ( ( (
50) 53) 47) 47) 47) 47) 47) 47) 53)
SLCWRYSFSTA IPMWRISFSLA GETWRLSFSHT EETWRLSFSHI EETWRLSFSHI EETWRLSFSHI ANTWRLSFSHI GNTWRLSFSHI RDSWRISFSHI
215 211 373 415 414 505 379 281 456
[ [ [ [ [ [ [ [ [
334] 521] 510] 555] 554] 645] 518] 417] 592]
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
644] 390] 388] 378] 523] 428] 393] 389] 345] 395] 475] 505] 393] 391] 390] 390] 390] 389] 468] 358] 428] 704] 608] 880]
[ [ [ [ [ [ [ [ [ [ [ [ [ [
332] 332] 331] 383] 524] 544] 508] 442] 537] 473] 457] 415] 374] 386]
Group XXV KOG3852 gi|114558778 gi|56118962 gi|45544634 gi|47221963 gi|6706655 gi|47216008 gi|125838708 gi|156739295 gi|118101544 gi|62857547 gi|149695038 gi|126328689 gi|118150624 gi|50745836 gi|126342630 gi|74007920 gi|123294632 gi|31873975 gi|115705900 gi|156351177 gi|110755666 gi|91076834 gi|157138163 gi|51092248
297 43 45 42 174 48 43 38 6 45 127 152 43 39 39 36 36 36 40 37 73 353 173 430
LEITLKDIVQTVRSRLEEAG LKITLKDIVQTVRSRLNEAG LEVRLKDIVQMVRNRLELRG LEVRLKDIVARVRSRLELSG LELQPSLIVKVVRRRLAEKR LEMQPRQIVKVVRTRMEEKQ LQMKPRQIVTAVRRRMREKS LEMKPRDIVKVVRCRMEERK ---------QVVRSRLEKKG LSCRPRDLVQVVRSRLEQKG LSVQPRQIVQVVRSRLEEQG LTVQPRQIVQVVQSRLEQLG LSVQPRQIVQVVRARLEERG LEVKPKDIIRVVKEQLIEKQ LEVKTKDIILVIQEELQKKE LEVKPKDIIQVVKEQLIEQG LEVKPKDIIHIVKDQLIKQG MEVKPKDIIHVVKDQLIGQG LDVTLLDLVENVREKLVGSN LDVHLRELINVVSSGLKDEN LEVRLRDLVNVVRSKLESDP LEVRLRDLVNVVRKKLETDT LEVKLTDLVNIVREKLEADI LEVKLKDLVNLVRRKLEAEV
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 10) 9) 9) 9) 9)
KDLDLIFHV KDLDLIFQV KDLDVIFRV KDLDVIFRV KDLDLIFCA KDLDLIFCA KDLDLIFRV KDLDLIFCA KDLDLIFGV KDLDLIFMV KDLDLVFRV KDVDLIFRV KDLDLIFGV KDLDIIFGV KDIDIIFGI KDLDVIFGV KDLDIIFGV KDLDVIFGV NDLDLIFGV NDIDLIFGV NDLDLIFAV NDLDLIFGV NDLDLIFAV NDLDLIFAI
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3) 3)
TEAEFQ-LVRDVVL SEAEFQ-LVRDVVL REEEFQ-LIKDVVL CEEDFH-VSKDVVL GEGEFQ-TVKDVVL GESDFQ-IVKDIVL GETEFQ-IVKNIVL GEVEFQ-TVKDIVL SEDVFQ-QVKDVVM GPDAFQ-VVKHAVL SEASFQ-LTKEVVL DEAAFQ-LVREAVL DDQAFR-VVKDVVL SELEFQ-VVKEAVL GEHEFQ-VVKETVL SDQEFQ-VVKDAVL SDEEFH-VVKDAVL GNEEFQ-VVKDAVL KTENLD-IIRDAVF SHTHLQ-QIKSVVL SGRNYD-KVKAAVL NVRNFD-RVKSAVL SNRHFD-RVKSAVL SPRVFD-RVKVAVL
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
49) 49) 49) 49) 49) 79) 49) 49) 49) 49) 49) 49) 49) 49) 49) 49) 49) 49) 49) 49) 50) 52) 52) 52)
KNVELKFVD RNVELKFVD RNVELKFVD RNVELKFVD KNVELKFVD KNVELKFVD KNVELKFVD KNVELKFVD KNVELKFVD KNMELKFVD KNVELKFVD KNVELKFVD KNVELKFVD KNVELKFVN KNMEFKFVS KNVELKFVN KNVELKFVN KNLELKFVS KNIELKFVD KNIELKFVD RNVELKFVD RGVELKFVD KNVELKFVD KNVELKFVD
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
16) 16) 16) 11) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16) 16)
LDSLLFFYDCS LDSLLFYYDYS LDSVLSYYDLS LDSMLAHYELP LDSLLLFYECS LDSLLLFYECS LDSLLLFYECS LDSLLLFYECS LDSLLLFGECS LDSMLMFSQCS LDSLLLFGQCS LDSLLLYGQCS LDSLLLFDRCS LDSILNVYRET LSNLLDVYRDA LDPMLEFYSDK LDSMLDFYSVP LDPMLEFYSDK LDSLLFFRKIS LDSLLGFYKLS LDSLLLFYECS LDSLLLFYECS LDSLLLFYDCT LDSLLLFYDCA
451 197 199 191 328 232 197 192 151 199 281 306 197 193 193 190 190 190 194 191 233 515 335 592
Group XXVI KOG2986 gi|157338577 gi|79314581 gi|77551384 gi|19112600 gi|119495106 gi|116195182 gi|46120902 gi|111065036 gi|39940662 gi|83770757 gi|145248425 gi|50425159 gi|146421841 gi|150865530
2 2 6 54 132 137 108 73 115 107 120 62 53 44
EKEKKAELASLL-KFLPPV ETTQKDELSSFL-SVLPPV EVARAAALAGPLGELLPPV EEELKENLTKVVNYFQAPI NQEFKEALRQILWQFRAPI DAEFKEVLKAIPWQFRAPI NYELKEALRLMLRQFNAPI NEEFKRSLRGILRQF-PPI NEEFKKALGDIVKSFQAPI KPEFKHALQQITQQFRAPI QELFEEELNYIVAKF-PPV --KNDEQLSQLIANFKSPI --DNDDDLEALARRFDGPV SVVRQEELRGIIDTFNSSI
( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
EFCCVYGSALHP DFCCVYGSTLHP DFCCAYGSTLLH DVAVGYGSGVFR RYAFAYGSGVFP RYAFAYGSGVFP VYSFAYGSGVFP TYAFAYGSGVFP IYAFAYGSGVFP SYSIAYGSGVFP AHAFAYGSGVFP KFSIGYGSGVFE EFTFGYGSGVFD KVSIGYGSGVLP
( ( ( ( ( ( ( ( ( ( ( ( ( (
6) 6) 6) 9) 27) 33) 32) 29) 33) 28) 9) 9) 9) 25)
TMVDYILGV KMVDYILGV SMVDYILGV PMIDFIFQV KMIDFIFGV KMIDFIFGV KVLDFIFGV KMIDFVLTP KMIDFIFGV KMIDFIFGI SMIDFILGV PQIDMIHMV PQIDMIHVV VQIDFINIV
( ( ( ( ( ( ( ( ( ( ( ( ( (
1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1)
DPMQWHSQNLKMN DPIKWHSANLKMN DPLQWHSENLERN DPVKWHKINLQQN YTQHWHALNLSQH HTQHWHSLNMKQH HVEHWHSINMKQN FSEHFHSLNLRSH HTQHWHSLNMRQN HAHTWHTINLQQH CAEEWHGLNLQQN KPADFHEQNLEQF DACNFHTTNLRQF DNQTFHKQNLVKN
( ( ( ( ( ( ( ( ( ( ( ( ( (
38) 38) 37) 36) 36) 36) 35) 35) 36) 37) 35) 40) 40) 31)
EMFKYGVVR RKLKYGVVR KRIKYGVVR NIIKYGVTS TLIKYGVVN ILIKYGVVQ MLIKYGVTS TMIKYAVVD TLIKYGVVN ILIKYGVVN VLIKYGVIS NIIKYGIIS HLLKYGVTT RLVKYGTMS
( ( ( ( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2)
DLVEDVLNWET DLVQDILDWKR DLAMDVLTWDR DVYEDLKNWNT TLCRDLSQWDT TLEKDLTQWDT NLVHDLSSWDS TLLRDLTEWDT TLCRDLTEWDT TLRRDLVGWNT TLCQDLATWDT KSLIDLSEWSS LCLRDLCEWNS ASLLDLCEWTS
120 120 124 174 270 281 250 211 259 247 238 184 175 175
43
gi|149237543 gi|68470330 gi|50304697 gi|156848141 gi|71064034 gi|45199184 gi|50288203 gi|50548571 gi|80751155 gi|118096799 gi|82183634 gi|119584517 gi|114150032 gi|126336427 gi|72011038 gi|156389436 gi|156546851 gi|118783673 gi|157135775 gi|91083107 gi|45553371 gi|17510507 gi|71023059 gi|116502976 gi|58260144 gi|66818457 gi|116058688 gi|145348884 gi|56752953 gi|118370652 gi|145549684 gi|88810915
75 134 113 20 106 97 33 148 6 6 6 6 6 1 4 3 4 4 4 6 1 1 109 82 42 61 4 4 6 10 2 32
EYEDQEKLEGIVKSFHAPM SPIQQDQLQEIVNSFDAPI DKSLERDLQSILGHFKSPI EKELQAELDGIVNSFKAPV DKELQKELDGVMSSFKAPC GREIDAELRGIMGHFQAPV ESGLQRELDGILNSFDAPI DADLRTTLRQVLWTFKAPI LQNSAVFYRRIINRFPQDF LSSSGVKFRRVLAHFPQEL LQGTGFQFRRILSFFPQDI LQSSWVTFRKILSHFPEEL LHSSGVGLRRILAHFPEDL ----MATFRRILANFPDNL AVRITKLYNRILGHFPREI DESNLNYFRDIVSRFPDGI KISQVSHFKNILQEFPRNM RGNTAPMFFRLLSKFPQGF KGNTAPMFYRILSRFPPNI PMIVAPIYARILSKFPQNF ---MLDLYRRTVARFPLGS ----MDEYRELISVLPLET SDETHHRLHSILACFDAPV SNSTRALLESIVAKFDAPI KPTAYDRLRPVISTFQAPI TPETQERINELLKLF-PPI RTLAR-RVVDALSNA-PEC NARARERIAGVLAAL-PPV LTGKIKRLVDALNIDRTNY SLVEELQLDSIIQRDFPKI QQIPMNLLKQFIQ-TLPQS EPALHRLCERMIARFGAAV
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0) 1) 1) 0) 0) 0) 0) 0) 0) 0) 0) 0) 0)
ETAIGYGSGVLP DVSIGYGSGILP KYAFGYGSGVFQ KYAFGYGSGVFQ RFAFGYGSGVFE RYAFGYGSGVFE TYAFGYGSGVFK RYSFAYGSGVFS SLAFAYGSAVFR SLAFAYGSGVFR SLAFTYGSGVFR SLAFVYGSGVYR SLAFAYGSAVYR SLAFAYGSAVYR SLAFAYGSGAFK TLAFAYGSGVFK KFCFAYGSGVFK SFCFAYGSGVKQ TFCFAYGSGVKQ TFCFAYGSAVKK SYMFAYGSGVKQ EYAFAYGSGAIQ RFAFAYGSGVFS RYAFAYGSGVFE DWAAAYGSGVLP KYGFAYGSGVIS EHVLAYGSAVLR EHALAYGSAVLA VASFAYGSVVFP DFAFGYGSGVFR ELSFAYGSAVQP AGVIFYGSCLRS
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
50) 18) 10) 10) 10) 10) 9) 9) 11) 11) 11) 10) 10) 10) 8) 8) 7) 11) 11) 8) 18) 8) 18) 14) 13) 24) 8) 8) 8) 10) 10) 5)
KQMDFIFVV KQLDFMFLV PQIDMIFGV PQIDMIFGV PQIDIILGV PQMDLILGV PQIDLILAV PQVDLIFGV NMLDFVFAV NMLDFVFAV KMLDFVFAV AMLDFVFTV PMLDLVFTV IMLDFVFSV NMLDFIFVV NMIDFVFVV NMLDLIFVV NMIDLIYVV NMIDLIYTV NMIDLIYCV TVVDLVFCV KMVDFVIVT KMIDFVMAV PMLDFMFAV PLTDLLIST PMIDLIFAV SALDILCVV RALDVLVAV SLIDLILIV PMIDLIFGV PMIDLIIAV GVVDLYVVV
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1) 1)
NSASFHKENLLQN DCGKFHQENLKQN HPEHFHSLNMRQN HPSHFHSLNLRQN YPSHFHSINMRQN HPEHFHSLNMRQN DPVEFHSRNIKQN YPNHWHSLNLKQN DPVTWHTMNLIEN DSVTWHMMNLLKN DPVTWHTMNIIQN DPVAWHSKNLKKN DPVAWHAMNLKKN DPVTWHSKNLQKN NPEEWHRKNIQEN NPYDWHSKNLSRF NPYQWHAENMNKN NAHRWHTANLDQN NAHRWHASNLERN NPSAWHEANMKEN DARGFHAENLHRH NAQEFHRDNILKN HPHHWHSLNMTQH HPAHFHSINMHQF DAEAFHKINLEQN NSTKWHSLNLVNN NVQEWHATNVHRN DPAAWHDANATRN NPVEWHRKNISDN DADEWHRMNLIKN NVEEWHMQNIQIN RYAA---------
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
30) 31) 35) 35) 35) 35) 35) 35) 36) 35) 36) 36) 36) 36) 36) 36) 38) 38) 38) 36) 38) 36) 36) 20) 37) 35) 38) 38) 47) 42) 36) 30)
HLTKYGIIE KLVKYGVIS HDVKYGVVS HEVKYGVIS HDVKYGVVS HQVKYGVVS NEVKYGIVS LKIKYGVVS RLIKYGVIS RMIKYGVIS RLIKYGVVS RLIKYGVIS KLIKYGVIS KLIKYGVIS RLIKYGVVK RKIKYGIIS RNIKYGVIS VIIKYGVVS IIIKYGVVC VWIKYGVIA ITIKYGVVS RKIKYGVIS ELIKYGVIS VTIKYGVTT VNVKYGVIS IKFKYGVIE EPFKYGVAS RAYKYGVVD LSFKYGVVA LNIKYGVVD LKLKYGVVS LRAKCAVLS
( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( ( (
2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2) 2)
SAMMDLSEWSS SALMDLSEWHS RLLKDLATWDT NLLKDLATWDS TLLKDIATWNT NLLKDLATWDT RILNDLKNWES TLSRDLSDWNK ALIDDLLHWKT TLIDDLLHWKT TLLQDLLHWRT VLIEDLLNWNN TLIEDLLNWNN TLIKDLLTWDT TLVQELINWNN NFIKDLNDWEW SLVEDLLDWNT DLLEDLTDWRC DLLDDLTTWSN DLVTDLLEWSD ELLEDLLDWRH NVKQDLLDWRW DLCADLLDWET NLCADLLNWRT TLEKDLKEWTT DLIDDLKNWKT DVVRDLERWEY DVVDDLERWKH SLLTDLLNWSH QLSKDLLKWNI ELIKDLENWKW DLELGTAQWFH
230 258 233 140 226 217 152 267 128 127 128 127 127 118 123 122 124 128 128 125 130 117 238 191 167 194 123 124 136 137 122 133
[ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [ [
437] 469] 393] 298] 385] 376] 310] 508] 338] 337] 338] 337] 337] 325] 325] 326] 375] 335] 335] 339] 342] 321] 552] 399] 393] 427] 337] 321] 370] 340] 319] 326]
44
SUPPLEMENTARY TABLES Supplementary Table S1. Reactions catalyzed by NTase fold superfamily enzymes REACTION: X + (d)NTP → X-(d)NMP + Y Biological process
Substrate X
(d)NTP
Product Y
mRNA polyadenylation
mRNA
ATP
PPi
Type of substrate
tRNA maturation
tRNA
ATP, CTP
PPi
mRNA editing
mRNA
UTP
PPi
chromatin remodeling, DNA repair, immunoglobulin gene rearrangement
DNA
dATP, dCTP, dGTP, dTTP
PPi
DNA
antibiotic resistance
kanamycin, streptomycin
ATP
PPi
antibiotics
signal transduction (during viral infection)
2’-5’-oligoadenylate
ATP
PPi
(p)ppGpp synthesis
GTP
ATP
AMP
regulation of GS activity (C-terminal NTase domain)
GS
ATP
PPi
RNA
NTP
proteins
EXCEPTIONS Biological process
Reaction
Type of substrate
regulation of GS activity (N-terminal NTase domain)
GS-AMP + Pi → GS + ADP
proteins
3’-5’-cAMP synthesis
ATP → 3’-5’-cAMP + PPi
NTP
The columns are as follows: "Biological process", biological function the reaction contributes to; "Substrate X", substrate to which (deoxy)nucleoside monophosphate is attached; "(d)NTP", (deoxy)nucleotide used in the reaction; "Product Y", reaction product; "Type of substrate", type of substrate X. Known exceptions to the general reaction scheme carried out by NTase fold proteins are presented at the bottom of the table.
45