cotransporter (lac permease) in Escherichia coli (1) and the. Na+/glucose cotransporter in intestinal brush borders (2). Two prokaryote Na+ cotransporters have ...
Proc. Natl. Acad. Sci. USA Vol. 86, pp. 5748-5752, August 1989 Biochemistry
Homology of the human intestinal Na+/glucose and Escherichia coli Na /proline cotransporters (symport/amino acid sequences/structure-function relations)
MATTHIAS A. HEDIGER, ERIC TURK, AND ERNEST M. WRIGHT Department of Physiology, University of California at Los Angeles School of Medicine, 10833 Le Conte Avenue, Los Angeles, CA 90024-1751
Communicated by H. Ronald Kaback, May 3, 1989 (received for review March 17, 1989)
vol) formamide, and the filter was washed at high stringency [650C in the presence of 0.1 x SSC (1 x SSC is 0.15 M sodium chloride/0.015 M sodium citrate, pH 7)/0.1% SDS].
ABSTRACT Cotransport proteins are responsible for the active accumulation of organic substrates in cells. Na+ gradients provide the driving force for uptake of most substrates into eukaryotes and for a few substrates in some prokaryotes. We report here the cloning and sequencing of the human intestinal Na+/glucose cotransporter (SGLTI) and compare its structure with other cloned transporters. At the DNA level and the predicted amino acid and secondary structure levels, dose homology is evident between the human and rabbit intestinal Na+/glucose cotransporters, and a sigificant homology is found between these and the Escherichia coli Na+/proline cotransporter (putP). No homology is detectible with other known proteins. We infer from these results that the mammalian Nat/glucose and prokaryote Na+/proline cotransporters share a common ancestral gene.
RESULTS AND DISCUSSION A 2.6-kb cDNA clone for the human intestinal Na+/glucose cotransporter was selected from the cDNA library using the rabbit intestinal Na+/glucose cotransporter cDNA clone (6) as a probe. mRNA synthesized in vitro from the clone and injected into Xenopus oocytes increased the Na'-dependent, phlorizin-sensitive sugar uptake >70-fold above that of control water-injected oocytes. In the presence of sodium, the rate of 50 AM a-methyl-D-[14C]glucopyranoside uptake increased from 0.7 ± 0.1 to 74 ± 19 pmol per oocyte per hr in six experiments. Transport was inhibited >80% by 25 AM phlorizin and by 25 mM D-galactose. The DNA and deduced amino acid sequences of the clone are presented in Fig. 1. There is 94% similarity between the human and rabbit amino acid sequences (6), 84% of the residues being identical and another 10% being conservative substitutions. Among the notable differences are substitutions for lysine at positions 139, 596, 616, and 622 and for tyrosine at positions 20, 246, and 660. This result indicates that these residues are not among the important lysine and tyrosine residues previously identified at the glucose and Na+ binding sites (7, 8). At the DNA level, an 82% similarity is evident between the sequences of the rabbit and human clones. This similarity is reflected in the results of Northern blot analysis of human and rabbit mRNA using the full-length cDNA coding for the human intestinal Na+/glucose cotransporter as a probe (Fig. 2). The probe hybridized with 2.3-kb rabbit intestinal mRNA, which matches the size of mRNA coding for the cotransporter (6). Three bands of human mRNA (2.2, 2.6, and 4.8 kb) hybridized to the probe. We have isolated clones with inserts of these three sizes from the human intestinal library and found all to contain the same coding region with different lengths of the 3' noncoding region. The deduced amino acid sequence was analyzed for potential membrane-spanning regions using hydrophobic moment analysis (Fig. 3). There are 11 predicted membranespanning regions and 2 highly charged hydrophilic regions (residues 408-420 and 567-631). These regions are similar to those predicted for the rabbit intestinal Na+/glucose cotransporter (6). An additional putative membrane segment, 7a, is indicated for the human sequence; it is due to the replacement of Met-387 in rabbit by a less polar leucine. Another hydrophobic segment, 9a, is also shown, but its hydrophobic moment is relatively high (it contains an arginine at residue 499), and it is probably a membrane surface component. There are two potential N-linked glycosylation sites (AsnXaa-Thr/Ser) at positions 248 and 306. We have evidence that only Asn-248 is glycosylated in the rabbit (10), thereby
In both prokaryotes and eukaryotes the active accumulation of organic substrates occurs by cotransport (symport) systems. These systems are driven mainly by electrochemical potential gradients of H+ in enteric bacteria and of Na+ in eukaryotes. The archetypal transporters are the H+/lactose cotransporter (lac permease) in Escherichia coli (1) and the Na+/glucose cotransporter in intestinal brush borders (2). Two prokaryote Na+ cotransporters have been identified, cloned, and sequenced-the E. coli melibiose (meiB) and proline (putP) carriers (3, 4). There is no homology between meiB and putP and between these Na+ cotransporters and H+ cotransporters such as lac permease. We have cloned the human intestinal Na+/glucose cotransporter (SGLTJ)* and find clear homology with the putP, but not the meiB transporter, indicating an evolutionary link between bacterial and human Na+-cotransport proteins.
METHODS A human intestinal cDNA AgtlO library was constructed from ileum mRNA (5), and clones were selected using the rabbit intestinal Na+/glucose cotransporter cDNA (6) as a probe. A positive clone with a 2.6-kilobase (kb) insert was subcloned into Bluescript plasmid for in vitro transcription and into M13mpl8 and -mpl9 vectors for Sanger dideoxynucleotide sequencing. RNA synthesized from the Bluescript plasmid in vitro was injected into Xenopus oocytes, and the expression of the Na+/glucose cotransporter was tested by measuring the Na+-dependent uptake of a-methyl-D-['4C]glucopyranoside. All methods including DNA sequencing by the Sanger method were performed as described (6). For Northern (RNA) analysis, the RNA gel was run in the presence of 2.2 M formaldehyde (2.5 ug of RNA per lane). The gel was capillary blotted to a nitrocellulose filter (Schleicher & Schuell). Hybridization was performed in 50% (vol/ The publication costs of this article were defrayed in part by page charge payment. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. §1734 solely to indicate this fact.
*The sequence reported in this paper has been deposited in the GenBank data base (accession no. M24847). 5748
Proc. Nati. Acad. Sci. USA 86 (1989)
Biochemistry: Hediger et al. 10 Rabbit Human
L M
D
S S T W S P
L K
T T
S TA A A V T R
R L
I
R
N
A
A
I
D
S
V I
I
V
I
Y
F
L V
110
90
70
60
50
40
30 E
70
50
30
10
20 S Y L P.V E T H
5749
VV MA V GL WAMF ST N RG TV GG F FLAG R SM VW W PI GA SL FA S
350
330
310
290
270
250
V
I M
T
150
140
130
120
230 110
100
90
80
210
190
170
150
130
K
R
I
I
PIYIKAGVVTMPECAGATACILrKAGAOrrGGQR YIC QV
LL
L
470
450
430
410
390
370
LS
190 180 170 G V V I I T D I Q I SA DI F SGA IFINL A L GL NL YL A I FL LLA I TA L YTI T GGL 160
530
510
490
570
550
590
230
220
210
200
E T V H A A A VIYT.D TL Q TVI MLV G SLI L TG FA F HEV GG Y DAF ME KY M
250 S I P
240
SQ
R
730
670
650
630
610 I
270 A I
A
E
770
750
870
A L M SHV KG G CIL C
910
890
V
N
L I
990
970
V
350 D
1010
V
R
1030
370
360
950
930
340
330
320
830 310
L V G FI FG M SIL TLW YW C T DQV I VQR C L SA K
850
I
810
790
300
290
280
710
690
260
1070
1050 390
380
M R F K V G CT NIAY P TLV VE L MP NGL RG L ML SVNL A SL MS SL T SI
1090
1130
1110
1230
K
1270
1350
1370
1510
1490
530
I I
1570
C GV H Y L Y
F
560
L F
V A
I
I1SF
1630
1610
1710
1810
1830
1850
650
640
H V V I K P L WRTV LHNV NGI
OKI
D V
590 A T D--
E
1790 630 L
A
L
1910
1890
660
670
L A Y IL V TV AV F CHA Y FA ~~~ ~~~~zzAT~~~~~~~~c~ATAT~rKCCTI0AGrcrc''I= 1990 2010 1970
AA~rcA0a~G 2030
1930
1950
2050
2070
2090
2110
2130
2170
2190
2210
2230
2250
2270
2290
2310
2330
2350
2370
2390
Cl
L Y
K
1770 K
1870
P IP
1670
620 I
F
A
D
1750
1730 610
F L TK
1650
580 G
600
550
V V I TI V V I SL
570
1690
1550
1530
540
AlIl
1590
1430 510
F
1470
520
1410
500
V
1450
1310 470
1390
490
480
F
1290
460
450
1330
P T
I
1250
440
430 H
420 T
1210
1190
1170
1150
410
400
CI
A~rTGCTG GAAAAC 2150
G~cGccC~~cCA¶I~r~rrfAAAAGAAAGCIGrrrlICCcCrCAn 2410
2430
2450
FIG. 1. Nucleotide and deduced amino acid sequences of the human brush border Na'/glucose cotransporter. Above the amino acid sequence are shown those residues that differ between the rabbit and human Nal/glucose cotransporters. *, First stop codon TGA.
5750
Biochemistry: Hediger et al.
Proc. Natl. Acad. Sci. USA 86 (1989)
..-
C
,
o
kb 9.5 7.5 -
4.4-
W._ i..-
2.4 -
I.
1.4 -
FIG. 2. Northern blot analysis of intestinal RNA from rabbit and human. The blot was probed with the full-length cDNA coding for the human intestinal Na+/glucose cotransporter (Fig. 1).
suggesting that the N terminus of the protein is on the cytoplasmic face of the membrane (Fig. 3). The GenBank database (Release 57.0) was searched for sequences related to the human Na+/glucose cotransporter using the program TFASTA (11). We found a significant similarity to the E. coli Na+/proline cotransporter. The sequence alignment of the Na+/proline cotransporter and the human Na+/glucose cotransporter, obtained using the program BESTFIT (12), is presented in Fig. 4. There is 28% identity between the amino acid sequences of the two transporters, and there are conservative substitutions at another 25% of the residues. The significance of this alignment was evaluated using the program RDF (14) and is nine SD units. Similar values were obtained using the programs SEQDP (15) and ALIGN (16). Comparison of the human Na+/glucose and E. coli melibiose transport proteins yielded a SD of -0.66. Because the SD must be >3-5 to show significant similarity between two sequences (17), we conclude that the Na+/ glucose transporter is homologous to the Na+/proline trans-
9
06 8
porter but not to the Na+/melibiose transporter. At the DNA level, there is to 44% similarity between the Na+/glucose and Na+/proline transporter clones. This degree of similarity is comparable to that noted previously between the mammalian facilitated glucose transporter and the E. coli H+/arabinose and H+/xylose cotransporters (18). The three sequences have -30% amino acid identity; taking into account conservative substitutions, -40% of the residues are similar. The E. coli Na+/proline cotransporter is 24% smaller than the human Na+/glucose cotransporter (502 vs. 664 residues). This cotransporter lacks the C-terminal region (residues 593-664), including part of the highly charged region near the C terminus (567-631). Except for segments 3 and 4, the predicted membrane-spanning sequences for the Na+/ proline transporter have positions very similar to those predicted for the Na+/glucose cotransporter. The Na+/ proline cotransporter has shorter hydrophilic segments between the membrane-spanning segments. Membrane segment 11 may represent a non-membrane-spanning region, owing to its short length and/or high hydrophobic moment, and this region corresponds to the hydrophobic surface region 9a of the Na+/glucose cotransporter. The Na+/ glucose transporter model in Fig. 3 shows regions of particular sequence homology to the Na+/proline transporter as thick line segments. The transporters have two charged regions in common at residues 130-140 and 408-420. The similarity is further emphasized when comparing the secondary structural elements (19), such as the occurrence of an a-helix in front of membrane-spanning regions 4. Earlier we showed that a tyrosine residue is located at or near the Na+-binding site of the rabbit brush border Na+/ glucose and Na+/proline cotransporter (8, 20). Of the 25 Na+/glucose cotransporter tyrosine residues, only 131, 191, 410, and 528 are located in the regions conserved with the E. coli proline carrier (Fig. 3). One or more of these may be involved in Na+ binding. If lysine residues are involved in glucose and proline binding, only three ofthese (residues 415, 420, and 549) are conserved in the rabbit, human, and E. coli transporters. In the case of the putP proline transporter, experiments with mutants (21) suggest that Arg-257 is related to the structure of the Na+ binding and Na+ coupling sites. This residue is conserved in the Na+/glucose cotransporter
(Fig. 4).
Recently, it has become clear that proline uptake into E. coli is coupled with the electrochemical gradient of Na+ (or Li'), but not H+ (see refs. 13 and 22-24). The similarities between the kinetics of the proline and glucose cotransporters, together with the close homology between the primary and secondary structures of the proteins, strongly suggest similar transport mechanisms. The structures in common (Fig. 3) offer important clues about the essential structures involved in the cotransport of Na+ and substrates across
09 29
NH2-
FIG. 3. Proposed model of the membrane orientation of the human intestinal Na+/glucose cotransporter. The method of Eisenberg et al. (9) was used with windows of 21 and 11 amino acids to predict membrane-spanning segments 1-11. Segment 7a is shown with dotted lines and represents an additional possible membrane-spanning segment. CHO indicates an N-glycosylation site. The region labeled Sur is a surface-seeking membrane region and was predicted using the Eisenberg method. Heavy lines in the sequence indicate regions of particular sequence homology with the E. coli Na+/proline transporter (4). Clusters of negative and positive charges in the hydrophilic regions of the sequence are represented as - and +.
~~~~
Proc. Nati. Acad. Sci. USA 86 (1989)
Biochemistry: Hediger et al.
10
1
.A S T P M L V T F C ........ putP: . SGLT1: M D S S T W S P K T T A V T R P V E THELR N A A D I S I 10 1 20 30
IUVI
30 K
.
GIG
N F D D Y I
.
40 R S|L G P F V T A
60
2
Y
F I W R L WAM F T 5,0
40
t0
50
[27
.G M I L I V V V M A V
Y I
.
5751
70 2
GrAHSD M S G W L L M FG7L PnGA V F L rS TG I1 S E S W
LrSA
902
80
70
120 110 100 _ 90 80 1 r IAIL T L G AWINWKLVA GR LRVHTEYNNNALTLD FTGRRFEDKSRLRI I Y I K A GVV V . LRIQVY LVV |.JL G W FL JL R K R F G G * * * WJF E W N A LQL.
I
100
3
130
110
JP
3
140
VIJMMJE 1 130
120
150
140
170 4
160
FE S TF GMS Y ET RLW AG AA A TIfL YT F I LqL F F T I Y C A, S GnVA G AIRL IFINA NY FLLIALTI LlsJLMSLLLYIFTKISADOFSjjGJ 1 150 160 1 80
IrS
A
V I
4
1
180
190
200
5
1
210
220
L. I F AL I L TT PVI V I I S IV G GI F G D S L. V InQ K S I E N LF7S W| DD T|VnAS rFGjF G YE2 T LMTV IL V G SIL I L TIG F A F H Eii JL AII YfT
21
200
5220
230
230
V S JG N T T ..
.
250
PH
LA VY Q
Q
MO TK FQ0E K C Y T P R A D S F H I F R D P T~D
..
300
..
..
..
..
..
..
..
.
..
260
260 F M A AD SHH N M C L
SLmK
270
270 A RR H VK G G C 310
IVH
6
240
250
N F V A I Y F S L G W G LG G Q ..................... P W P G F IF GM S I LT L W Y W C T D
280
1
7 [ 8 S MT W M I C L A GA. L C G Y L K M P M F I M
320
240
.
1
290
6
1
290 300 FFGIAYFNDHP T E K P8M I S R I L Y 340 730 V M
Ijj
8
310
320 330 340 I ELAQI LFNPW I AG I LLA ILAAVMTE MSVMI SLIM SISE CVVPSECEKYC]TKVGCTN I AYPTLVVE JMPNGLR LL SAL
LAnGAVNQNAERVF
....
350
360
380
370
390
-
-
-
7o
-
-
380 9 350 360 VW VIV M S C Q L L VC[S A I E LIK[F L1HIA A L A.V V ..I.. Q M[IW K V M I A G RL F R E T S I F N S A JT L F IML VJL I G I S JW V P1 400 410 420 8440 430
Vfu7V
390 L. A
400 N P E N R V L G L VY A WAG F
Q4G50QLD
IE460
41010
420
A A F G PVV
430
SRM T R N G
F S V M
L A
M
I
A
YLPPA470FEAF~4~NPBFEB 9I
440
45046 . . . . G [E IPG F IF G SfGIV F A ILASF ITIVV S R MITE F A Y G T G SCM E PS N C PT II C G V H YL AY
[qTVrVVWKQFGWL .......... I G
500
510
.
47071480 P AAMQKRF. KEERIDLD IISLL TWPIPDVHLYRLCWSLRN 550 560 570
.............. FrF|SLLGA
5540
520
490
H
500
ADAHYHSAPPSRLQnS* EE
NIQEGPKETIEI 580
TQ
590
VPEKKKGIFRRAYDLFCGLEQHGAPKMTEEEEKAMKMK M T DTSEKPLWRT 640 630 620 600 610
1
V L N VN G I I L VT V A V F C H A Y F A * 660 650 1
FIG. 4. Alignment of the human brush border Na+/glucose cotransporter and the E. coli Na+/proline cotransporter amino acid sequences using the program BESTFIT (10). The membrane-spanning regions for both transporters were predicted (13) and are indicated in the sequences. The Na+/proline cotransporter is numbered according to Nakao et al. (4). Identical amino acids are boxed.
5752
Biochemistry: Hediger et al.
plasma membranes; for example, the four conserved tyrosines (131, 191, 410, and 528) may be involved in Na' binding. The structural and functional similarity between the human Na+/glucose cotransporter and the E. coli Na+/proline cotransporter is certainly too extensive to have arisen by convergent evolution. An example of convergent evolution is evident in the Na+/glucose and Na+/melibiose cotransporters where there is no detectible similarity at either the DNA or amino acid levels. The homology of the Na+/glucose to the Na+/proline cotransporter suggests, therefore, that Na'cotransport proteins, or an ancestor thereof, were present in organisms before the divergence of prokaryotes and eukaryotes. We thank Hsin-Shung Lee for carrying out oocyte experiments, B. Blumberg for advice on the evaluation of the significance of the sequence homology, and M. Stelzner and E. W. Fonkalsrud for providing human ileum to prepare the cDNA library. This work was supported by grants from the U.S. Public Health Service (DK 19567 and DK 17328). 1. Kaback, H. R. (1986) Annu. Rev. Biophys. Chem. 15, 279-319. 2. Hopfer, U. (1987) in Physiology of the Gastrointestinal Tract, ed. Johnson, L. R. (Raven, New York), pp. 1499-1526. 3. Yazyu, H., Shiota-Niiya, S., Shimamoto, T., Kanazawa, H., Futai, M. & Tsuchiya, T. (1984) J. Biol. Chem. 259, 4320-4326. 4. Nakao, T., Yamato, I. & Anraku, Y. (1987) Mol. Gen. Genet. 208, 70-75. 5. Gubler, U. & Hoffman, B. (1983) Gene 25, 263-269. 6. Hediger, M. A., Coady, M. J., Ikeda, T. S. & Wright, E. M. (1987) Nature (London) 330, 379-381. 7. Peerce, B. E. & Wright, E. M. (1984) J. Biol. Chem. 259, 14105-14112.
Proc. Nadl. Acad. Sci. USA 86 (1989) 8. Peerce, B. E. & Wright, E. M. (1985) J. Biol. Chem. 260, 6026-6031. 9. Eisenberg, D., Schwarz, E., Komaromy, M. & Wall, R. (1984) J. Mol. Biol. 179, 125-142. 10. Mendlein, J., Hediger, M., Coady, M. & Wright, E. M. (1988) FASEB J. 2, A1021. 11. Pearson, W. R. & Lipman, D. J. (1988) Proc. Natl. Acad. Sci. USA 85, 2444-2448. 12. Devereaux, J., Haeberli, P. & Smithies, 0. (1984) Nucleic Acids Res. 12, 387-395. 13. Hanada, K., Yamato, I. & Anraku, Y. (1988) J. Biol. Chem. 263, 7181-7185. 14. Lipman, D. J. & Pearson, W. R. (1985) Science 227, 14351441. 15. Kanehisa, M. I. (1982) Nucleic Acids Res. 10, 183-196. 16. Dayhoff, M. O., Barker, W. C. & Hunt, L. T. (1983) Methods Enzymol 91, 524-545. 17. Doolittle, R. F. (1981) Science 214, 149-159. 18. Maiden, M. C. J., Davis, E. O., Baldwin, S. A., Moore, D. C. M. & Henderson, P. J. F. (1987) Nature (London) 325, 641-643. 19. Chou, P. Y. & Fasman, G. D. (1978) Annu. Rev. Biochem. 47, 251-276. 20. Wright, E. M. & Peerce, B. E. (1984) J. Biol. Chem. 259, 14993-14996. 21. Ohsawa, M., Mogi, T., Yamamoto, H., Yamato, I. & Anraku, Y. (1988) J. Bacteriol 170, 5185-5191. 22. Stewart, L. M. D. & Booth, I. R. (1983) FEMS Microbiol. Lett. 19, 161-164. 23. Chen, C.-C., Tsuchiya, T., Yamane, Y., Wood, J. M. & Wilson, T. H. (1985) J. Mol. Biol. 84, 157-164. 24. Chen, C.-C. & Wilson, T. H. (1986) J. Biol. Chem. 261, 2599-2604.