signal peptide Additional data file 1. Alignment of the ...

6 downloads 0 Views 177KB Size Report
Additional data file 1. Alignment of the B6 and S7 MUPs. Each predicted coding sequence from both strains as annotated in Figure 1 is included. The signal ...
signal peptide 129S 7gene9 129S 7gene10 C57B l6gene1 7 C57B l6gene1 129S 7gene1 C57B l6gene2 129S 7gene2 C57B l6gene1 6 129S 7gene8 C57B l6gene4 129S 7gene4 C57B l6gene1 2 C57B l6gene6 129S 7gene7 129S 7gene6 C57B l6gene1 5 C57B l6gene8 C57B l6gene3 129S 7gene3 C57B l6gene7 C57B l6gene5 C57B l6gene1 1 C57B l6gene1 0 C57B l6gene1 3 129S 7gene5 C57B l6gene9 C57B l6gene1 4 C57B l6gene1 8 129S 7gene11 C57B l6gene1 9 129S 7gene12

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1

MK-LLV--L LLCLGLTLVCVHA EEASSMERN FN VE KI NG EW Y TI ML AT DK RE KI E EH GS M MK-LLV--L LLCLGLTLVCVHA EEASSMERN FN VE KI NG EW Y TI ML AT DK RE KI E EH GS M MK-LLV--L LLCLGLTLVCVHA EEASSMERN FN VE KI NG EW Y TI ML AT DK RE KI E EH GS M MK------L LLCLGLTLVCIHA EEATSKGQN LN VE KI NG EW F SI LL AS DK RE KI E EH GS M MK------L LLCLGLTLVCIHA EEATSKGQN LN VE KI NG EW F SI LL AS DK RE KI E EH GS M MK-L----L LLCLGLILVCVHA EEASSMGRN FN VE KI NG EW Y TI IL AS DK RA KI E EH GI M MK-L----L LLCLGLILVCVHA EEASSMGRN FN VE KI NG EW Y TI IL AS DK RA KI E EH GI M MK-LL---L LLCLELTLVYVHA EEASSEGQN LN VE KI NG KW F SI LL AS DK RE KI E EH GT M MK-LL---L LLCLELTLVCVHA EEASSERQN FN VE KI NG KW F SI LL AS DK RE KI E EH GT M MKML----L LLCLGLTLVCVHA EEASSTGRN FN VQ KI NG EW H TI IL AS DK RE KI E DN GN F MKML----L LLCLGLTLVCVHA EEASSTGRN FN VE KI NG EW H TI IL AS DK RE KI E DN GN F MKML----L LLCLGLTLVCVHA EEASSTGRN FN VE KI NG EW H TI IL AS DK RE KI E DN GN F MKML----L LLCLGLTLVCVHA EEASSTGRN FN VE KI NG EW H TI IL AS DK RE KI E DN GN F MKML----L LLCLGLTLVCVHA EEASSTGRN FN VE KI NG EW H TI IL AS DK RE KI E DN GN F MKML----L LLCLGLTLVCVHA EEASSTGRN FN VE KI NG EW H TI IL AS DK RE KI E DN GN F MKML----L LLCLGLTLVCVHA EEASSTGRN FN VE KI NG EW H TI IL AS DK RE KI E DN GN F MKMM----L LLCLGLTLVCVHA EEASSTGRN FN VE KI NG EW H TI IL AS DK RE KI E DN GN F MKML----L LLCLGLTLVCVHA EEASSTGRN FN VE KI NG EW H TI IL AS DK RE KI E DN GN F MKML----L LLCLGLTLVCVHA EEASSTGRN FN VE KI NG EW H TI IL AS DK RE KI E DN GN F MKML----L LLCVGLTLVCVHA EEASSTGRN FN VE KI NG EW H TI IL AS DK RE KI E DN GN F MKML----L LLCVGLTLVCVHA EEASSTGRN FN VE KI NG EW H TI IL AS DK RE KI E DN GN F MKML----L LLCLGLTLVCVHA EEASSTGRN FN VE KI NG EW H TI IL AS DK RE KI E DN GN F MKML----L LLCLGLTLVCVHA EEASSTGRN FN VE KI NG EW H TI IL AS DK RE KI E EH GN F MKML----L LLCLGLTLVCVHA EEASSTGRN FN VE KI NG EW H TI IL AS DK RE KI E EH GN F MKML----L LLCLGLTLVCVHA EEASSTGRN FN VE KI NG EW H TI IL AS DK RE KI E VN GN F MKMLL---L LLCLGLTLVCVHA EEASSTGRN FN VE KI NG EW H TI IL AS DK RE KI E DN GN F MKMLL---L LLCLGLTLVCVHA EEASSTGRN FN VE KI NG EW H TI IL AS DK RE KI E DN GN F MKLLLPLLL LLCLELTLVCIHA EESSSMERN FN VE QI SG YW F SI AE AS DE RE KI E EH GS M MKLLLPLLL LLCLELTLVCIHA EESSSMERN FN VE QI SG YW F SI AE AS DE RE KI E EH GS M MKLLL---L LLCLGLTIVCIQA EEYSSMGRN FN VE QI SG YW F SI AE AS DE RE KI E EH GS M MKLLL---L LLCLGLTIVCIQA EEYSSMGRN FN VE QI SG YW F SI AE AS DE RE KI E EH GS M

129S 7gene9 129S 7gene10 C57B l6gene1 7 C57B l6gene1 129S 7gene1 C57B l6gene2 129S 7gene2 C57B l6gene1 6 129S 7gene8 C57B l6gene4 129S 7gene4 C57B l6gene1 2 C57B l6gene6 129S 7gene7 129S 7gene6 C57B l6gene1 5 C57B l6gene8 C57B l6gene3 129S 7gene3 C57B l6gene7 C57B l6gene5 C57B l6gene1 1 C57B l6gene1 0 C57B l6gene1 3 129S 7gene5 C57B l6gene9 C57B l6gene1 4 C57B l6gene1 8 129S 7gene11 C57B l6gene1 9 129S 7gene12

58 58 58 55 55 56 56 57 57 57 57 57 57 57 57 57 57 57 57 57 57 57 57 57 57 58 58 61 61 58 58

RVFVEYIHV LENSLALKFHIII NEECSEIFL VA DK TE KA GE Y SV TY DG SN TF TI L KT DY D RVFVEYIHV LENSLALKFHIII NEECSEIFL VA DK TE KA GE Y SV TY DG SN TF TI L KT DY D RVFVEYIHV LENSLALKFHIII NEECSEIFL VA DK TE KA GE Y SV TY DG SN TF TI L KT DY D RVFVEHIHV LENSLAFKFHTVI DGECSEIFL VA DK TE KA GE Y SV MY DG FN TF TI L KT DY D RVFVEHIHV LENSLAFKFHTVI DGECSEIFL VA DK TE KA GE Y SV MY DG FN TF TI L KT DY D RLFVEHIHV LENSLGFKFHTVI DEECSEIFL VA DK TE KA GE Y SV TY DG FK KF TV L KT DY D RLFVEHIHV LENSLGFKFHTVI DEECSEIFL VA DK TE KA GE Y SV TY DG FK KF TV L KT DY D RVFVEHIDV LENSLAFKFHTVI DEECTEIYL VA DK TE KA GE Y SV TY DG FN TF TI L KT DY D RVFVEHIDV LENSLAFKFHTVI DEECTEIYL VA DK TE KA GE Y SV TY DG FN TF TI L KT DY D RLFLEQIHV LENSLVLKFHTVR DEECSELSM VA DK TE KA GE Y SV TY DG FN TF TI P KT DY D RLFLEQIHV LENSLVLKFHTVR DEECSELSM VA DK TE KA GE Y SV TY DG FN TF TI P KT DY D RLFLEQIHV LENSLVLKFHTVR DEECSELSM VA DK TE KA GE Y SV TY DG FN TF TI P KT DY D RLFLEQIHV LENSLVLKFHTVR DEECSELSM VA DK TE KA GE Y SV TY DG FN TF TI P KT DY D RLFLEQIHV LENSLVLKFHTVR DEECSELSM VA DK TE KA GE Y SV TY DG FN TF TI P KT DY D RLFLEQIHV LENSLVLKFHTVR DEECSELSM VA DK TE KA GE Y SV TY DG FN TF TI P KT DY D RLFLEQIHV LENSLVLKFHTVR DEECSELSM VA DK TE KA GE Y SV TY DG FN TF TI P KT DY D RLFLEQIHV LEKSLVLKFHTVR DEECSELSM VA DK TE KA GE Y SV TY DG FN TF TI P KT DY D RLFLEQIHV LENSLVLKVHTVR DEECSELSM VA DK TE KA GE Y SV TY DG FN TF TI P KT DY D RLFLEQIHV LENSLVLKVHTVR DEECSELSM VA DK TE KA GE Y SV TY DG FN TF TI P KT DY D RLFLEQIHV LENSLVLKFHTVR DEECSELSM VA DK TE KA GE Y SV TY DG FN TF TI P KT DY D RLFLEQIRV LENSLVLKVHTVR DEECSELSM VA DK TE KA GE Y SV TY DG FN TF TI P KT DY D RLFLEQIRV LENSLVLKFHTVR DEECSELSM VA DK TE KA GE Y SV TY DG FN TF TI P KT DY D RLFLEQIHV LENSLVLKVHTVR DEECSELSM VA DK TE KA GK Y SV TY DG FN TF TI P KT DY D RLFLEQIHV LENSLVLKVHTVR DEECSELSM VA DK TE KA GE Y SV TY DG FN TF TI P KT DY D RLFLEQIHV LENSLVLKVHTVR DEECSELSM VA DK TE KA GE Y SV TY DG FN TF TI P KT DY D RLFLEQIHV LENSLVLKFHTVR DEECSELSM VA DK TE KA GE Y SV TY DG FN TF TI P KT DY D RLFLEQIHV LENSLVLKFHTVR DEECSELSM VA DK TE KA GE Y SV TY DG FN TF TI P KT DY D RAFVENITV LENSLVFKFHLIV NEECTEMTA IG EQ TE KA GI Y YM NY DG FN TF SI L KT DY D RAFVENITV LENSLVFKFHLIV NEECTEMTA IG EQ TE KA GI Y YM NY DG FN TF SI L KT DY D RAFVENITV LENSLVFKFHFIV NEECTEMTL IG EE TE KA GI Y YL NY DG FN TF TI L KT DY D RAFVENITV LENSLVFKFHFIV NEECTEMTL IG EE TE KA GI Y YL NY DG FN TF TI L KT DY D

129S 7gene9 129S 7gene10 C57B l6gene1 7 C57B l6gene1 129S 7gene1 C57B l6gene2 129S 7gene2 C57B l6gene1 6 129S 7gene8 C57B l6gene4 129S 7gene4 C57B l6gene1 2 C57B l6gene6 129S 7gene7 129S 7gene6 C57B l6gene1 5 C57B l6gene8 C57B l6gene3 129S 7gene3 C57B l6gene7 C57B l6gene5 C57B l6gene1 1 C57B l6gene1 0 C57B l6gene1 3 129S 7gene5 C57B l6gene9 C57B l6gene1 4 C57B l6gene1 8 129S 7gene11 C57B l6gene1 9 129S 7gene12

118 118 118 115 115 116 116 117 117 117 117 117 117 117 117 117 117 117 117 117 117 117 117 117 117 118 118 121 121 118 118

NY IMIHLINKKDGET FQLMELYGREPDL SS DI KE K FA QL SE EH GI VR E NI ID LT NA NR CL NY IMIHLINKKDGET FQLMELYGREPDL SS DI KE K FA QL SE EH GI VR E NI ID LT NA NR CL NY IMIHLINKKDGET FQLMELYGREPDL SS DI KE K FA QL SE EH GI VR E NI ID LT NA NR CL NY IMFHLINEKDGKT FQLMELYGRKADL NS DI KE K FV KL CE EH GI IK E NI ID LT KT NR CL NY IMFHLINEKDGKT FQLMELYGRKADL NS DI KE K FV KL CE EH GI IK E NI ID LT KT NR CL NY IMFHLINEMNGET FQLMSLYGREPDL NS DI KE K FV KL CE EH GI IR E NI ID FT KT NR CL NY IMFHLINEMNGET FQLMSLYGREPDL NS DI KE K FV KL CE EH GI IR E NI ID FT KT NR CL NY IMFHLINKKDEEN FQLMELFGREPDL SS DI KE K FA KL CE EH GI VR E NI ID LS NA NR CL NY IMFHLINKKDEEN FQLMELFGREPDL SS DI KE K FA KL CE EH GI VR E NI ID LS NA NR CL NF LMAHLINEKDGET FQLMGLYGREPDL SS DI KE R FA QL CE EH GI LR E NI ID LS NA NR CL NF LMAHLINEKDGET FQLMGLYGREPDL SS DI KE R FA QL CE EH GI LR E NI ID LS NA NR CL NF LMAHLINEKDGET FQLMGLYGREPDL SS DI KE R FA QL CE EH GI LR E NI ID LS NA NR CL NF LMAHLINEKDGET FQLMGLYGREPDL SS DI KE R FA QL CE EH GI LR E NI ID LS NA NR CL NF LMAHLINEKDGET FQLMGLYGREPDL SS DI KE R FA QL CE EH GI LR E NI ID LS NA NR CL NF LMAHLINEKDGET FQLMGLYGREPDL SS DI KE R FA QL CE EH GI LR E NI ID LS NA NR CL NF LMAHLINEKDGET FQLMGLYGREPDL SS DI KE R FA QL CE EH GI LR E NI ID LS NA NR CL NF LMAHLINEKDGET FQLMGLYGREPDL SS DI KE R FA QL CE EH GI LR E NI ID LS NA NR CL NF LMAHLINEKDGET FQLMGLYGREPDL SS DI KE R FA QL CE KH GI LR E NI ID LS NA NR CL NF LMAHLINEKDGET FQLMGLYGREPDL SS DI KE R FA QL CE KH GI LR E NI ID LS NA NR CL NF LMAHLINEKDGET FQLMGLYGREPDL SS DI KE R FA QL CE KH GI LR E NI ID LS NA NR CL NF LMAHLINEKDGET FQLMGLYGREPDL SS DI KE R FA QL CE EH GI LR E NI ID LS NA NR CL NF LMAHLINEKDGET FQLMGLYGREPDL SS DI KE R FA QL CE EH GI LR E NI ID LS NA NR CL NF LMAHLINEKDGET FQLMGLYGREPDL SS DI KE R FA QL CE EH GI LR E NI ID LS NA NR CL NF LMAHLINEKDGET FQLMGLYGREPDL SS DI KE R FA QL CE EH GI LR E NI ID LS NA NR CL NF LMAHLINEKDGET FQLMGLYGREPDL RS DI KE R FA QL CE EH GI LR E NI ID LS NA NR CL NF LMAHLINEKDGET FQLMGLYGREPDL SS DI KE R FA QL CE EH GI LR E NI ID LS NA NR CL NF LMAHLINEKDGET FQLMGLYGREPDL SS DI KE R FA QL CE EH GI LR E NI ID LS NA NR CL NY IMIHLINKKDGKT FQLMELYGREPDL SL DI KE K FA KL CE EH GI IR E NI ID LT NV NR CL NY IMIHLINKKDGKT FQLMELYGREPDL SL DI KE K FA KL CE EH GI IR E NI ID LT NV NR CL NY IMIYLINEKDGET FQLMELYGREPYL SL DI KE K FA KL CE EH GI IR E NI ID LT NV NR CL NY IMIYLINEKDGET FQLMELYGREPDL SL DI KE K FA KL CE EH GI IR E NI ID LT NV NR CL

1 29 S7 ge n e9 1 29 S7 ge n e1 0 C 57 Bl 6g e ne 17 C 57 Bl 6g e ne 1 1 29 S7 ge n e1 C 57 Bl 6g e ne 2 1 29 S7 ge n e2 C 57 Bl 6g e ne 16 1 29 S7 ge n e8 C 57 Bl 6g e ne 4 1 29 S7 ge n e4 C 57 Bl 6g e ne 12 C 57 Bl 6g e ne 6 1 29 S7 ge n e7 1 29 S7 ge n e6 C 57 Bl 6g e ne 15 C 57 Bl 6g e ne 8 C 57 Bl 6g e ne 3 1 29 S7 ge n e3 C 57 Bl 6g e ne 7 C 57 Bl 6g e ne 5 C 57 Bl 6g e ne 11 C 57 Bl 6g e ne 10 C 57 Bl 6g e ne 13 1 29 S7 ge n e5 C 57 Bl 6g e ne 9 C 57 Bl 6g e ne 14 C 57 Bl 6g e ne 18 1 29 S7 ge n e1 1 C 57 Bl 6g e ne 19 1 29 S7 ge n e1 2

17 8 17 8 17 8 17 5 17 5 17 6 17 6 17 7 17 7 17 7 17 7 17 7 17 7 17 7 17 7 17 7 17 7 17 7 17 7 17 7 17 7 17 7 17 7 17 7 17 7 17 8 17 8 18 1 18 1 17 8 17 8

EA R E EA R E EA R E KA R E KA R E QA R E QA R E QA R E QA R E QA R E QA R E QA R E QA R E QA R E QA R E QA R E QA R E QA R E QA R E QA R E QA R E QA R E QA R E QA R E QA R E QA R E QA R E EA R E EA R E EA R E EA R E

Additional data file 1. Alignment of the B6 and S7 MUPs. Each predicted coding sequence from both strains as annotated in Figure 1 is included. The signal peptide common to all MUPs is indicated; note that certain MUPs are identical over the mature peptide sequence whilst differing from one another in their signal peptide. All amino acid positions in the text are numbered relative to the mature protein, i.e. beginning [EEAS...].