function as a regulator of a different CheY homologue. Although each pair of proteins appears to produce a special- ized response to a distinct type of stimulus, ...
Proc. Natl. Acad. Sci. USA Vol. 85, pp. 1403-1407, March 1988 Biochemistry
CheA protein, a central regulator of bacterial chemotaxis, belongs to a family of proteins that control gene expression in response to changing environmental conditions (Salmonella typhimurium/Escherichia col/receptors/sensory transduction)
ANN STOCK, THOMAS CHEN, DOUGLAS WELSH, AND JEFF STOCK Department of Molecular Biology, Princeton University, Princeton, NJ 08544
Communicated by Saul Roseman, October 26, 1987
ABSTRACT During bacterial chemotaxis, the binding of stimulatory ligands to chemoreceptors at the cell periphery leads to a response at the flagellar motor. Three proteins appear to be required for receptor-mediated control of swimming behavior, the products of the cheA, cheW, and cheY genes. Here we present the complete nucleotide sequence of the Salmonella lyphimurium cheA gene together with the purification and characterization of its protein product. The protein is a 73,000 M, cytoplasmic constituent. Amino acid-sequence comparisons indicate that it belongs to a family of bacterial regulatory proteins including the products of the cpxA, deWf, envZ, ntrB, phoR, phoM, and virA genes. Each member of this family has a conserved domain of -200 residues within its C terminus. We have previously shown that another chemotaxis protein, CheY, represents a domain of protein structure that has been conserved within a second large family of bacterial regulatory proteins. Each protein of the CheA family seems to function as a regulator of a different CheY homologue. Although each pair of proteins appears to produce a specialized response to a distinct type of stimulus, the relationships in primary structure suggest that a similar molecular mechanism may be involved.
During bacterial chemotaxis, receptor proteins at the cell surface modulate motor behavior in response to binding of extracellular ligands (for reviews, see refs. 1-3). The receptors communicate indirectly with the flagellar motor via a signal transduction system within the cytosol. Genetic studies have shown that the products of three genes, cheA, cheW, and cheY, are required for this process. CheY protein appears to interact directly with proteins at the motor, whereas CheA and CheW appear to be necessary for receptor-mediated control of CheY activity (4-8). The amino acid sequence of CheY indicates that this component is homologous to a family of bacterial regulatory proteins, each of which functions to generate adaptive responses to changing environmental conditions (3, 9, 10). These include proteins such as OmpR, which controls porin expression (11), NtrC, which controls responses to nitrogen depletion (12), PhoB, which controls phosphate assimilation (13), and SpoOA, which controls sporulation in response to nutrient deprivation (14). Most members of this family appear to be regulated by a second component that functions as a sensor for environmental change (10). Each sensory component contains a similar domain of =200 residues within its carboxyl terminus. Thus, in bacteria, two primordial domains of protein structure have apparently evolved to provide a core processor for the transduction of sensory information into appropriate responses.
In this communication we report the nucleotide sequence of the Salmonella typhimurium cheA gene.* The predicted product of cheA is a 73,000 Mr protein that is similar to the family of sensory proteins identified in other bacterial regulatory systems. We have also purified and characterized the CheA protein. Unlike most members of the sensor family, CheA does not appear to be membrane associated. During chemotaxis CheA probably receives information from a variety of different membrane chemoreceptor proteins and transduces it via CheY into a signal that controls motor behavior. Thus, proteins CheA and CheY may provide the central sensory processing mechanism for signal transduction in chemotaxis.
MATERIALS AND METHODS Strains and Plasmids. Part of the cheA gene used for sequencing was from pME1 (15), an S. typhimurium-pUC12 hybrid containing an 8.4-kilobase (kb) Pst I genomic DNA fragment from S. typhimurium strain ST1 spanning a region between cheA and flaM (Fig. 1). M13 phage subclones, M13molA and M13molB, were created by inserting a 3.1-kb Pst I-Sma I fragment of pME1 modified with BamHI linkers in both orientations into the BamHI site of M13mpl0 (16). The remainder of cheA was obtained from an S. typhimurium-pUC12 hybrid plasmid, designated pMOl. This plasmid was generated by ligating 8-kb EcoRI-Sst I genomic DNA fragments from S. typhimurium PSI into pUC12, transforming into Escherichia coli HB101 (17), and screening by the colony hybridization procedure of Grunstein and Hogness (18). A 0.9-kb Sst I-Pst I fragment of pME1 labeled with 32P by nick-translation (17) was used as a probe. M13 phage subclones, M13mo2A and M13mo2B, were created by inserting a 2.7-kb HindIII fragment of pMOl in both orientations into the HindIII site of M13mplO. The E. coli recA strain, JM109 (obtained from J. Messing, Rutgers), was used as host for all sequencing procedures. The expression vector used to produce CheA for purification was constructed in two steps. A Nae I-Pst I pMO1 fragment spanning the last 40 bp of motB and first 184 bp of cheA was first ligated into the HincII-Pst I sites of pUC12 to create a plasmid, designated pMO3, with an in-frame fusion of the pUC12 lacZ' gene and the 3' end of motB; then the 1.9-kb Pst I-EcoRI pME1 fragment encoding the remainder of cheA was ligated into the Pst 1-HindIII sites of pMO3 to yield the complete expression vector, pMO4. Determination of the Nucleotide Sequence of the S. typhimunium cheA Gene. Processive deletions through the inserts in M13molA, M13molB, M13mo2A, and M13mo2B were *The sequence reported in this paper is being deposited in the EMBL/GenBank data base (Bolt, Beranek, and Newman Laboratories, Cambridge, MA, and Eur. Mol. Biol. Lab., Heidelberg) (accession no. J03611).
The publication costs of this article were defrayed in part by page charge payment. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. §1734 solely to indicate this fact. 1403
1404
Biochemistry: Stock et al. motABI
(f laKE) E
H
cheA
NpH Ss P
Proc. Natl. Acad. Sci. USA 85 (1988)
lIWI E
tar
I[ R
S Sm
B IMIl fHM l Ia
pI
lkb
FIG. 1. Organization of the che genes in S t.yphimurium. The region shown spans -16 kb at -40 min on S. typhimurium genome. Restriction sites used in cloning and subconing are cated: E, EcoRI; H, HindIII; N, Nae I; P, Pst I
lhe
.
I; Ss, Sst I.
Smdi-
generated using exonuclease III (New England Biolabs) digestion as described elsewhere (19). Thee complete nucleotide sequence of cheA was determined on both strands by the dideoxynucleotide chain-termination Xprocedure of Sanger et al. (20) using [a-32P]dATP, DNA pxolymerase I Klenow fragment (Bethesda Research Laboiratories), an M13 universal primer (New England Biolabs), , and the deletion phage plus strands as templates. The X-ASTP and RDF programs of Lipman and Pearson (21) were used to search for homologies and help align sequences. Purification of CheA. E. coli HB101 corstaining the CheA expression vector pMO4 was grown at 37''C in L broth (17) supplemented with ampicillin at 40 ,ug/ml to a density of 109 cells per ml, harvested by centrifugation, resuspended in 0.1 M sodium phosphate, pH 7.0 (3 ml/g of cSells, wet weight), and lysed with a Heat Systems/Ultrasonic s (Plainview, NY) W-225 sonicator. The resulting extract w{as centrifuged at 100,000 x g for 90 min to remove membiranes, and ammonium sulfate was added to 40% saturationn. The pellet was collected by centrifugation, resuspended inl20 mM Tris1HCl, pH 7.5, containing 1 mM dithiothreitol, anid dialyzed against the same buffer. This fraction was then Esubjected to highperformance liquid chromatography using a 5 x 50 mm Mono Q HR 5/5 column (Pharmacia) aind eluting with a 30-ml linear gradient of 0.25-0.75 M NaCl in 20 mM Tris-HCl, pH 7.5, at a flow rate of 1 ml/muin. CheA, eluting at 0.54 M NaCI, was collected and si objected to highperformance liquid chromatography using a 7.1 x 300 mm GF-200 column (Sota Chromatography, Crompond, NY) equilibrated in 0.1 M sodium phosphate, pH 7.0. All ste except chromatography were performed at CheA ted 0-40C. automated from the Mono Q column was subject ed to automated Edman degradation after blotting onto poly difluoride membranes according to the method o fMatsudaira (22).
fvinylidene
RESULTS Nucleotide Sequencing of the cheA GelIe. The complete nucleotide sequence of the cheA gene andiflankng regions, together with the deduced amino acid sequ CheA protein, is shown in Fig. 2. The presunned site of CheA initiation is a GTG codon just distal to the TGA termination codon of the motB gene (23). The GTG is preceded by a ribosomal binding site (24, 25) comprises of 8 bases with perfect complementarity to a sequence at t the 3'end of 16s rRNA. This initiation site is followed by an open reading frame extending 2013 bp to a TGA termina tlion codon just 20 bp proximal to the ATG initiation codon of the cheW gene (15). The predicted product of cheA woiuld have a Mr of 73,000. This corresponds well with the estimated Mr of 76,000 obtained from NaDodSO4/polyacr yhA m26e, 27ec A trophoresis of the 35S-labeled product of cheA 27). a 268 second cheA product with an apparent Mr of 68,0 has also been observed. It has been proposed th Lat this protein ' produced from a secondary site of transla Ltionai inutiation at the 5' end of the gene (28). We can detect only one possible candidate for such a site, an ATG initiatio on codon preceded by a Shine-Dalgarno ribosomal binding consensus. The predicted product initiated from this site w'ould have a M, of 64,000.
encite ofthe
The nucleotide sequence of a third of the E. coli cheA gene encoding the C terminus of the protein has previously been reported (29). The region corresponds to the S. typhimurium sequence extending from Ile-447 to Ala-671. The E. coli and S. typhimurium amino acid sequences exhibit only about a 15% difference, with most changes resulting in conservative substitutions. Similar results have been obtained in comparing other chemotaxis genes in the two species (9, 15, 29, 30, 31). There is one anomalous region, however. Between Gly-564 and Lys-583 the E. coli and S. typhimurium amino acid sequences appear completely different. An analysis of codon usage within the corresponding nucleotide sequences indicates that this apparent divergence is due to a shift in reading frame. A base is missing in the E. coli sequence at the position of G-1693 in the S. typhimurium gene, and the S. typhimurium reading frame is restored in the reported E. coli sequence by the introduction of a guanine just after the base corresponding to A-1752 in the S. typhimurium sequence. Relationship Between CheA and a Family of Bacterial Sensory Transduction Proteins. The predicted amino acid sequence of CheA was compared to the collection of 4791 protein sequences in the National Biomedical Research Foundation Protein Sequence Databaset as well as to a set of over 7000 protein sequences translated from GenBankJ1 The results clearly indicate that a sequence of 150 residues in CheA extending roughly from Ser-382 to Leu-529 is related to a corresponding region in the family of bacterial regulatory proteins that include the products of the cpxA, dctB, envZ, ntrB, phoM, phoR, and virA genes from a wide range of different bacterial species (Fig. 3). Each of these proteins functions in the regulation of a protein that is homologous to CheY of the chemotaxis system: EnvZ regulates OmpR (11), PhoR regulates PhoB (13), PhoM may regulate ORF2 (34), CpxA regulates SfrA (37), VirA may regulate VirG (38), NtrB regulates NtrC (39), and CheA is required for the regulation of CheY (8), as well as another homologous protein, CheB (40). It seems likely that the conserved domain shared by all the CheA homologues corresponds to a structure that interacts with the corresponding CheY homologue. There appear to be four highly conserved sequences shared by all members of the CheA family (Fig. 4). These to a in CheA around His-393; the correspond sequence beginningsequence at Asn-415, N X X X N; the sequence beginning at Asp-437, D X G X G; and the sequence beginning at Gly-488, G X G X G. The proteins tend to be relatively variable between these sequences with several insertions and deletions. It seems likely that at least some of the highly conserved residues participate directly in the interaction between CheA and CheY. Most members of the CheA family appear to be membrane proteins. This has been directly demonstrated for EnvZ (41), VirA (36), and CpxA (personal communication, P. Silverand inferred from hydropathy analyses of the seman), quences of PhoM (34), PhoR (35), and DctB (personal communication, C. Ronson). Predicted membrane spanning sequences are indicated in Fig. 3. NtrB is an exception in that it is clearly a cytoplasmic constituent (42). A hydropathy analysis of the CheA protein indicates that it, like NtrB, has no extended hydrophobic sequences that could span the cytoplasmic membrane. Identification and Purification of the CheA Protein. The CheA protein was tentatively identified with a band that appeared on Coomassie blue-stained NaDodSO4/polytProtein Identification Resource (1987) Protein Sequence Database (Natl. Biomed. Res. Found., Washington, DC), Release 12.0. tEMBL/GenBank Genetic Sequence (1987) GenBank (Bolt, Beranek, and Newman Laboratories, Cambridge, MA), Tape Release 48.0.
Proc. Natl. Acad. Sci. USA 85 (1988)
Biochemistry: Stock et al.
1405
motB
GA=GATAGC
-1
GTGAGCATGGATATTAGCGATTTTTATCAGACATTTTT>TGATGAGCTGACGAGCTGTTGGCTGATATGGAGCAGCATCTGCTGGATTTG F F D E A D E L L A D M E Q H L L D L
90
GTGCCGGAGTCCCCGGACGCTGAACACTGAACGCCAmTTTC(CGCGCGGCGCACTCCATTAAGGCGGCGCGGGCACATTCGGCTTCACT I K G G A G T F G F T
180 60
ATTCTGCAGGAACCACGCATTAAGGAACCTGCTGGAGAGCCCGACGCQgMcAGCTC AATACCGACATTATCAACCTG L L N T D I I
270 90
F L E T K D I M Q E Q L D A Y K N S E E P D A A S F E Y I C
TTTTGGAACGAAAGATATTATGCA(GAACAGCTCGACGCCTATAAACTCCGAAGAGcCGGATG;CCGCAAGCTTCGAATATATCTGC
360 120
AATGCGCTACGCCAGTTG'r'CGCTGGAGGCGMAAAGCGAAACCACGCCGGCGG;TCGTAGzAACCGCGGCACTAAGCGCGGCGATCCAGGAA T A A L S A A I Q E
450 150
GAGAGCGTAGCCGAACCGAATCGCCTCGTGACGAGAGTAACTGCGCATTGTACTTTCACGGTTGAAGCCAATGAGGTCGATCTGCTT L
540 180
GAAGAGAGTTGGGCAATCTGGCTACTCTGACGGATGTCGTTAAGGGGGCGGATTCACTGTCGGCAACTCTGGATGGCAGCGTCGCGGAA V A E
630 210
GATGACATTGTCGCGGTGCTC1i-l.GTCATCGAGGCCGATCAGTCGCTTTTGAAAGTGGTCGCCGCGCCGMGAAAAGCGCAA
720 240
cheA >
-- -
X S N D I S D F Y Q T
V P E S P D A E Q L N A I F R A A H S
N
I L Q E T T H L M E N L L D E A R R G E M Q
N A L R Q
E
E
S
E
D D
V A E E
I
L G
L A
T
N
V A V
E
L E A K G
S
P R D
L A T
L C
E
E T T
S
P A V V
K L R
I
V L
L T D V V K G A D
F V
I
E A
D Q
I
A
F
S
E
S
R
L S
L K A N
A
T
E V D
L D G
E R V V A A
P V
L
S
E
K A Q
GAGAAACCGAGGTAGCGCCGGTAGCGCCTCCGGCAGTCGTTGCGCCAGCCGCCAAGCGCGGCCCATGAACATCATGCGGGGCGTGAA E K T
E V A P V A
P
P A V V A
P A A K
S
A A H
E H
H A G
R E
AAACCTGCGCGTGAACGCGAGTCCACCAGCATTCGCGGCGAGAGAAGTGTCAGCTCATAAACCTGGTCGGCGAGT-TAGTGATT K P A R
E R
E
S
T
S
I
R V A V
E K V
D Q
L
I
N
L V
G
E
L V
I
ACCCAGTCAATGTTGGCCCAGCGTTCTAACGAGCTGGACCCGGTAACCACGGCGATCTCATTACCAGTATGGGTCAGTTACAACGTAAC T Q
S
N L A Q R S
N
E
L D P V N H G
D
L I
T
S
M G Q
L Q R N
GCCCGCGACTTACAAGAGTCGGTGATGTCGATTcGTATGATGCCTATGGAGTACGTCTTCAGCCGTTCC~CCGTCTGGTACGCGATTA A a
D
L Q
E
S
V N
S
I
R N N P N
E Y V
F
S
R
F
P R
L V R D L
GCGGGCAMACTGGGCAAGCAGGTAGAGCTGACGCGTGAGCTCCACTGMACTTGATAAGCCTGATCGMACGCATTATCGACCCG A G
K
L G K Q V
E
L T
L V
S
G
S
T
E
L D R
S
L
I
E
R
I
I
D
P
CTGACGCACCTGGTGCGCAACAGTCTCGACCACGGGATTGAATGCCGGAAACGCCTTGAGGCGGAAAAATGTTGTCGGCAATCTG L T H
L V R N
S
L D H G
I
E M P
E K R
L E A G
K N V V G N
L
ATTCTTTCCGCAGAACATCAGGGCGGGAATATTGTATTGAAGTGACCGATGACGGCGCGGGGCTTAACCCG~AGCGTATCCTGGCGAAG I
L S
A
E H Q G G N
I
C
I
E V T D D G A G
L N R
E R
I
L A K
GCCATGTCGCAGGGGATGGCGGAGAACATGACCGACGATGMAGTAGGCATGTTGATCTTCGCGCCGGGATTCTCAACTGCGGAA A N
S
Q G N A V N E N N T D D E V G N L I F A P G F S T A E
30
810 270 900 300 990 330
1080 360 1170 390
1260 420 1350 450
1440 480
CAGGTCACGGATGTCTCCC~rTCGCGGCGTTGGCATGGATGTMTAACGTAACATTCAGGAGATGGCGGTCACGTTGAGATCCAGTCA 1530 Q V T D V S G R G V G H D V V K R N I Q E N G G H V E I Q S 510
AAACAAGGCTCCGGCACCACGATTCGTATTCTGCTGccGCTGACGCTGGCCATCCTCGATGrG TG;TCGGTTCGCGTGGGGGGGTT 1620 540 K Q G S G T T I R I L L P L T L A I L D G N S V R V A G E V TTATCCTGCCGTATGCGGTGATGGAGTCGCTGCAA CCGCGGAAGA GTCTTCATCCGCTGGCGGGMCAAGGTCGA 1710 F I L P L N A V N E S L Q P R E E D L H P L A G G E R V L E 570
GTCG;CGGCGAGTATTTGCCGCTGGTCGAATTGTGGAAATGTTTGCG;TGGACGGGGCGAAACCGAAGCCACG;CAGG;GCATTGTT(GTG V R G
E Y L P L V
E
L V K V
F D V D G A K T
E A T Q
G
I
V V
ATCCTGCAAMccGCGCG~cCGTTACGCGCTGCTGGCGATCAGCTCATCGGTCAGCACCAGGTM;TGCAA TCT(GAAGCT I
L Q
S A G R R Y A L L V D Q L I
G
Q H Q V V V K N L E S N
1800 600
1890 630
TATCGCAA GTACCGGGATTCCGCCGCGACGATCCTGCACGGCAGCGTCGCGCTGATT'TATGTTrCCGCATTGCAGGTTA 1980 660 Y R K V P G I S A A T I L G D G S V A L I V D V S A L Q G L
AACCGCGAACAACGTATGGCGATCACAGCCGCCTGATTGAGTGAGMQQATA -----> N R E Q R M A I T A A cheW
FIG. 2. Nucleotide sequence of cheA and the predicted amino acid sequence of its protein product. The nucleotide sequence is numbered relative to the first base of the cheA-coding region. The amino acid sequence is numbered with respect to the initiator methionine. The TGA tennination codon of motB is indicated, and critical bases involved in the initiation of cheA and cheW are underlined. A possible secondary site of cheA initiation is also underlined.
acrylamide gels of E. coli cells transformed with a cheA expression vector, pM04, but the protein was missing in
cells transformed with the parent plasmid, pUC12 (Fig. 5). This material was purified by ammonium sulfate fractionation, ion exchange, and molecular sieve chromatography. Amino terminal-sequence analysis confirmed its identity with the predicted product of the cheA gene. Two overlap-
ping sequences were detected in approximately 1:1 stoichiometry, Ser-Met-Asp-Ile-Ser-Asp-Phe-Tyr-Gln-Phe and
Met-Asp-Ile-Ser-Asp-Phe-Tyr-Gln-Phe-Phe. The first se-
quence confirms the GTG initiation codon indicated in Fig. 2. The second could result from proteolytic cleavage of the N-terminal serine during the purification procedure, or from ambiguity in translational initiation. The purified protein
1406
Proc. Natl. Acad. Sci. USA 85 (1988)
Biochemistry: Stock et A
Che A
_II
CpxA
o
U ._.-U---L] LII I =I
Dct B
EnvZ
I
U--L
-
Nt r B
x
U----
PhoR
-
3
4
5
6
I
-I[=rIILI o UIZ
PhoM
2
10-3
I
[I1 ]
U LI* LII 11
m
Mr
r
94
_
67
_
L=L [L=I LI
-~- -LI
=
43
VirA
YiII
-. 5_
_
4W
= OLEZ-
100 amino acids
FIG. 3. Regions of similarity between CheA and a family of bacterial regulatory proteins. Homologous regions are indicated by white boxes with similar sequences aligned as indicated in Fig. 4. In these regions the number of identical residues at corresponding positions in CheA, and each similar protein ranges from 16 to 25%. Black boxes indicate predicted membrane-spanning sequences. Sequences were obtained from the following sources: CpxA (10, 49); DctB (50); EnvZ (32); NtrB (33); PhoM (34); PhoR (35); and VirA
(36). exhibited an apparent native Mr of -240,000 during molecular sieve chromatography. It seems likely, therefore, that under these conditions CheA is a trimer or tetramer.
DISCUSSION and E. coli only six genes exhibit a Che In S. typhimurium null phenotype: cheA, cheB, cheR, cheW, cheY, and cheZ. CheAst 382 SLIERIIDI LTHLVRNSLDH CPXAEc 238 TSQQRLLS SHELRTPLTR DctBRl 404 AILGQVAA AHEINQPVATI EnvZEc 232 DDRTLLMAG VSHDLRTPLTRI NtrBBp 138 RSVIALA) AHEIKNPLSG] 128 IAARDLVR EIKNPLGG NtrBKp 254 NYIEQYVY THELKSPLAAI PhoMEc PhoREC 202 GARRNFF SHELRTPLTV VirAAt 468 EAVGTLAG AHEFNNILGSI VirAAt 468 EAVGTLA AHEFNNILGVI _______
-
VirAAt
All have now been sequenced, and their protein products
have been purified and characterized (9, 15, 29-31, 43). Two of these components, CheR and CheB, appear to serve a peripheral role as modulators of receptor activity (44). CheZ also seems to be peripheral because mutants totally lacking
this protein still respond to attractant stimuli (45). Studies of the effects of plasmids that produce varying levels of CheZ
------------------------------------------------------------------------
LGTALLRRRSGESKELERIE-------TEAQRLDSMINDLLVMSRNQQKNALVSETIKANQLWSEVLDNAAF YADNARTFLDRGQTAPAGENL-ESIAALTERIGSITEELKTFARKGRGSAEPTGLKDVIE-GAVMLLPTRF
LATEMMSEQDG---------YLAESINKDIEECNAI IEQFIDYLRTGQEMPI4EMADLNAVL-GEVIAAESGAAQLLEQQASSEDRLL----T-RLICDEADRIVTLVDRMEVFGDDRPVARGPVNIHSVLDHVKRLAQSGFA GAAQLLSKALPDPALME---YT-KVI IEQADRLRNLVDRLL--GPQHPGMHVTESIHKVAERVVKLVSMELP
GAAEILREGPPPEVVAR---FT-DNILTQNARMQALVETLLRQARLENRQEVVLTAVDVAALFRRVSEARTV YLEMMNEQPLEGAVREKALHT-MREQTQ--RMEGLVKQLLTLSKIEAAPTHLLNEKVDVPMMLRVEREAQT HAELAQNSVSRTSVTRR--YI-DYI ISSGDRAMLI IDQILTLSRKQERMIKPFSVSELVTEIAPLLRMALP YA-MAQNILHRRTYARH--YI-DRINAESHRARLIIDQILALSRRRERTARPFNLSALVREIAPSLRVALP
LAHE-R-PL--I -------------------------------------------------------------------------
VS L K SV IT D N I CheAst 404 CPXAEC 325 DCtBRl 496 EnVZEC 315 NtrBBP 227 NtrBKp 212 PhOMEc 344 PhoREC 294 VirAAt 559
FIG. 5. NaDodSO4/polyacrylamide gel electrophoresis of fractions from the CheA purification. Lanes: 1, Mr standards; 2, 50 lg of 100,000 x g supernatant from E. coli HB101 containing pUC12; 3, 50 ,.g of 100,000 x g supernatant from E. coli HB101 containing the cheA expression vector, pMO4; 4, 50 jtg of the 40% ammonium sulfate precipitation fraction; 5, 10 ,ug of the 0.54 M NaCl peak from the Mono Q HR 5/5 column; 6, 10 j.g of 240,000 M, peak after molecular sieve chromatography. Samples were electrophoresed in 7% polyacrylamide according to the method of Laemmli (48), and the gel was stained with Coomassie blue.
L
------------EMPEKRLEAG~ VN]-------------------LSAEHQGGNiCI. ~DGAGRYSHTKIEVGFAVDKDGITI1 DDDGPG EAEQMGRSLTVNFPPGPWPLYGNPNALESALE iVRN ---------------------
ADNGPGI AGRMDTLDID--LPPDELQVMGNRIRLEQVLI LLQN EAVAP--------------------KAGEGRVEIRTSTDAG -------------------------RYGNGWIKVSSGTEPNRAWF VEDDGPG> YEREIETALY----PGSIEVKMHPLSIKRAV RNVRFIEDYDPSLPPVL----ANQDQLIQVF LVKN EAVADLGTDAEIQLTTAFR------PGVRLSVPGKKSRVSLPLEF KDNGSG DNVKLVRDYDPSLPELP----HDPDQIEQVL IVRNA ALGPEGGEITLRTRTAFQLTLHGVRYRLA-----------ARI EDNGPGI QLAEK-KITL-HVTPTEVNVAAEPALLEQAL LLDN D-----------------------FTPESGCITLSAEVDQEHVTL LDTGSGI LSQK--KQTFTFEIDNGLKVSGNEDQLRSAIS LVYN N---------------------- HTPEGTHITVRWQRVPHGAEF' VEDNGPGI SDNGGGI PNI ---ELSFRFDQMQSVIE-GSPLELQQVLI ICKN Q--------AMTANGQIDIIISQAFLPVKKILAHGVMPPGDYVL EDNGGGI LCKN E--------ACIGTGRIEDSVYRSFVWKHKVLANGTIPAGDYIL 559 SEV---EVDFNIQSAQMIVE-GHPLEIEQI ____________________________--_ N-V-NA --____-__-------------------------------------V-DNG-GIV I D L L L F T c
CheAst 444 RERILAKAMSQGMAVNENMTDDEVGM FAF STAEQVTDV-' RGVGMDVVKRNIQEMGGHVEIQSKQGSG TIRILLPLTLAILDGMSV... LVIWLPLYKRS* CPXAEC 394 PEDREQ--------------------- FRP RTDEARDRES TGLGIAIVETAIQQHRGWVKAEDSPLG DctBR1 568 TEIRKG--------------------- FTP TSKE------S LGLGLVISKDIVGDYGGRMDVASDSGG FIVQLRKA* IRAWLPVPVTRAQGTTKEG* EnVZEC 380 PEQRKH---------------FQP VGDSARTI--S TGLGLAIVQRIVDNHNGMLELGTSERG
NtrBBp
311 NtrBKp 296 PhoMEc 413 PhoREc 363 VirAAt 651 VirAAt 651
EDLLPN--------------------SHLQDT-------------------DYALSR -------------------PEHIPR--------------------EAVLPH-------------------QAALPH------------------.____ ___ __ ______________
FRVLDADVQRRQATRPKQPR* KQ------ SGLGLALVAKIVGDHGGIIECESQPR FDP FSV-YLPIRK* FYP SGRE------ TGLGLSIARSLIDQHSGKIEFTSWPG FER 'SLPRANGQ-- SGLGLAFVSEVARLFNGEVTLRNVQEG LASLRLHRHFT* ER RVDKARSRQT( SGLGLAIVKHAVNHHESRLNIESTVGKz RFSFVIPERLIAKNSD* FEP STRARN----G TGLGLASVHGHISAFAGYIDVSSTVGHz RFDIYLPPSSKEPVNPDS ... FEP RTRAQC----C TGLGLSTVHGHVSAMAGFVDVISTVGR' RFDIYLPTSAKKPVNSES ... -Pw ------------_-GLGL--V --------G ---------G---------------------IT RG
M
S
V M
S
S
T
A
FIG. 4. Amino acid sequence similarities between family of related bacterial regulatory proteins. The most highly conserved regions are boxed with conserved residues indicated beneath the boxes. Sequences are from references cited in Fig. 3.
Biochemistry: Stock et al. protein (8, 46) indicate that CheZ acts to antagonize the effects of CheY. Thus, the only components that appear to be absolutely essential for signal transduction are proteins CheA, CheW, and CheY. Both CheY and CheW are small compared to CheA with Mr values of 14,000 and 18,000, respectively (9, 15). Several lines of evidence suggest that CheY interacts directly with the flagellar motor and that CheA acts to modulate this activity (4-9). CheW may serve as a link between CheA and the membrane chemoreceptor proteins (5, 8, 15). As a regulator of CheA, the function of CheW would be analogous to that of the protein, designated PI,, that interacts with proteins NtrB and NtrC to regulate the expression of glutamine synthetase and the nitrogen fixation system (47). In comparing the amino acid sequences of the Che signaling proteins to components of cognate regulatory systems it is apparent that only the CheY domain and a portion of CheA have been conserved through evolution. These structures apparently provide an essential sensory transduction function. In all the systems except chemotaxis, the output involves positive transcriptional regulation (10). This finding suggests a fundamental relationship between sensory-motor regulation and control of gene expression. With the availability of pure component proteins it should be possible to begin to unravel the molecular mechanisms that underlie this relationship. Note Added in Proof. Recent results have established a common mechanism of action between the CheA and CheY proteins and the corresponding homologues of the nitrogen regulation system, the NtrB and NtrC proteins. Previous studies showing that NtrB is a protein kinase that phosphorylates NtrC (39) have been extended to the chemotaxis system. We have shown that the purified CheA protein is a CheY kinase (51).
We thank Paul Matsudaira for determining the N-terminal sequence of our purified CheA protein and Kenneth Moss for his help with the homology searches. This work was supported by grants from the Public Health Service (AI20980) and the American Cancer Society (NP-515). A.S. was supported by postdoctoral fellowships from the National Cancer Institute and the Damon Runyon-Walter Winchell Fund. T.C. was supported in part by a fellowship from the National Science Foundation. 1. Parkinson, J. S. & Hazelbauer, G. L. (1983) in Gene Function in Prokaryotes, eds. Beckwith, J., Davies, J. & Gallant, J. (Cold Spring Harbor Lab., Cold Spring Harbor, NY), pp. 293-313. 2. Macnab, R. (1987) in Escherichia coli and Salmonella typhimurium, ed. Neidhardt, F. C. (Am. Soc. Microbiol., Washington, DC), Vol. 1, pp. 732-759. 3. Stock, J. (1987) BioEssays 6, 199-203. 4. Parkinson, J. S., Parker, S. R., Talbert, P. B. & Houts, S. E. (1983) J. Bacteriol. 155, 265-274. 5. Clegg, D. 0. & Koshland, D. E., Jr. (1984) Proc. Nati. Acad. Sci. USA 81, 5056-5060. 6. Yamaguchi, S., Aizawa, S.-I., Kihara, M., Isomura, M., Jones, C. J. & Macnab, R. M. (1986) J. Bacteriol. 168, 1172-1179. 7. Ravid, S., Matsumura, P. & Eisenbach, M. (1986) Proc. Natl. Acad. Sci. USA 83, 7157-7161. 8. Wolfe, A. J., Conley, M. P., Kramer, T. J. & Berg, H. C. (1987) J. Bacteriol. 169, 1878-1885. 9. Stock, A. M., Koshland, D. E., Jr., & Stock, J. B. (1985) Proc. Natl. Acad. Sci. USA 82, 7989-7993. 10. Ronson, C. W., Nixon, D. T. & Ausubel, F. M. (1987) Cell 49, 579-581. 11. Hall, M. N. & Silhavy, T. J. (1981) J. Mol. Biol. 151, 1-51. 12. Gussin, G. N., Ronson, C. W. & Ausubel, F. M. (1986) Annu. Rev. Genet. 20, 567-593.
Proc. Natl. Acad. Sci. USA 85 (1988)
1407
13. Wanner, B. L. (1987) in Escherichia coli and Salmonella typhimurium, ed. Neidhardt, F. C. (Am. Soc. Microbiol., Washington, DC), Vol. 1, pp. 1326-1333. 14. Ferrari, F. A., Trach, K., LeCoq, D., Spence, J., Ferrari, E. & Hoch, J. A. (1985) Proc. Natl. Acad. Sci. USA 82, 2647-2651. 15. Stock, A., Mottonen, J., Chen, T. & Stock, J. (1987) J. Biol. Chem. 262, 535-537. 16. Messing, J. (1983) Methods Enzymol. 101, 20-78. 17. Maniatis, T., Fritsch, E. F. & Sambrook, J. (1982) Molecular Cloning: A Laboratory Manual (Cold Spring Harbor Lab., Cold Spring Harbor, NY). 18. Grunstein, M. & Hogness, D. (1975) Proc. Natl. Acad. Sci. USA 72, 3961-3%5. 19. Stock, A. (1986) Dissertation (Univ. of California, Berkeley,
CA).
20. Sanger, F., Nicklen, S. & Coulson, A. R. (1977) Proc. Natl. Acad. Sci. USA 74, 5463-5467. 21. Lipman, D. J. & Pearson, W. R. (1985) Science 227, 1435-1441. 22. Matsudaira, P. (1987) J. Biol. Chem. 262, 10035-10038. 23. Stader, J., Matsumura, P., Vacante, D., Dean, G. E. & Macnab, R. M. (1986) J. Bacteriol. 166, 244-252. 24. Shine, J. & Dalgarno, I. (1974) Proc. NatI. Acad. Sci. USA 71, 1342-1346. 25. Steitz, J. A. & Jakes, K. (1975) Proc. Natl. Acad. Sci. USA 72, 4734-4738. 26. Silverman, M. & Simon, M. (1977) J. Bacteriol. 130, 1317-1325. 27. DeFranco, A. L. & Koshland, D. E., Jr. (1981) J. Bacteriol. 147, 390-400. 28. Smith, R. A. & Parkinson, J. S. (1980) Proc. Natl. Acad. Sci. USA 77, 5370-5374. 29. Mutoh, N. & Simon, M. 1. (1986) J. Bacteriol. 165, 161-166. 30. Simms, S. A., Stock, A. M. & Stock, J. B. (1987) J. Biol. Chem. 262, 8537-8543. 31. Stock, A. M. & Stock, J. B. (1987) J. Bacteriol. 169, 3301-3311. 32. Mizuno, T., Wurtzel, T. & Inouye, M. (1982) J. Biol. Chem. 257, 13692-13698. 33. Nixon, B. T., Ronson, C. W. & Ausubel, F. M. (1986) Proc. NatI. Acad. Sci. USA 83, 7850-7854. 34. Amemura, M., Makino, K., Shinagawa, H. & Atsuo, N. (1986) J. Bacteriol. 168, 294-302. 35. Makino, K., Shinagawa, H., Amemura, M. & Nakata, A. (1986) J. Mol. Biol. 192, 549-556. 36. Leroux, B., Yanofsky, M. F., Winans, S. C., Ward, J. E., Ziegler, S. F. & Nester, E. W. (1987) EMBO J. 6, 849-856. 37. Silverman, P. M. (1985) BioEssays, 2, 254-259. 38. Winans, S. C., Ebert, P. R., Stachel, S. E., Gordon, M. P. & Nester, E. W. (1986) Proc. Natl. Acad. Sci. USA 83, 8278-8282. 39. Ninfa, A. J. & Magasanik, B. (1986) Proc. Natl. Acad. Sci. USA 83, 5909-5913. 40. Springer, M. S. & Zanolari, B. (1984) Proc. Natl. Acad. Sci. USA 81, 5061-5065. 41. Liljestrom, P. (1986) FEMS Microbiol. Lett. 36, 145-150. 42. Hunt, T. P. & Magasanik, B. (1985) Proc. Natl. Acad. Sci. USA 82, 8453-8457. 43. Simms, S., Keane, M. & Stock, J. (1985) J. Biol. Chem. 260, 10161-10168. 44. Stock, J. & Stock, A. (1987) Trends Biochem. Sci. 12, 371-375. 45. Block, S. M., Segall, J. E. & Berg, H. C. (1982) Cell 31, 215-226. 46. Kuo, S. C. & Koshland, D. E., Jr. (1987) J. Bacteriol. 169, 1307-1314. 47. Magasanik, B. & Bueno, R. (1985) Curr. Top. Cell. Regul. 27, 215-220. 48. Laemmli, U. K. (1975) Nature (London) 227, 680-685. 49. Albin, R., Weber, R. & Silverman, P. (1986) J. Biol. Chem. 261, 4698-4705. 50. Ronson, C. W., Astwood, P. M., Nixon, B. T. & Ausubel, F. M. (1987) Nucleic Acids Res. 15, 7921-7950. 51. Wylie, D. C., Stock, A., Wong, C. Y., Stock, J. (1988) Biochem. Biophys. Res. Commun., in press.