Crystal structure of elongation factor P from Thermus thermophilus HB8

2 downloads 0 Views 600KB Size Report
Jun 29, 2004 - I, II, and III are colored green, red, and blue, respectively. (C) Ribbon presentation of the T. thermophilus EF-P (molecule A) crystal structure ...
Crystal structure of elongation factor P from Thermus thermophilus HB8 Kyoko Hanawa-Suetsugu*, Shun-ichi Sekine†, Hiroaki Sakai*, Chie Hori-Takemoto*, Takaho Terada*†, Satoru Unzai*‡, Jeremy R. H. Tame*‡, Seiki Kuramitsu†§, Mikako Shirouzu*†, and Shigeyuki Yokoyama*†¶储 *RIKEN Genomic Sciences Center, 1-7-22 Suehiro-cho, Tsurumi, Yokohama 230-0045, Japan; †RIKEN Harima Institute at SPring-8, 1-1-1 Kouto, Mikazuki-cho, Sayo, Hyogo 679-5148, Japan; ‡Protein Design Laboratory, Yokohama City University, 1-7-29, Suehiro-cho, Tsurumi, Yokohama 230-0045, Japan; §Department of Biology, Graduate School of Science, Osaka University, Toyonaka, Osaka 560-0043, Japan; and ¶Department of Biophysics and Biochemistry, Graduate School of Science, University of Tokyo, 7-3-1 Hongo, Bunkyo-ku, Tokyo 113-0033, Japan

Translation elongation factor P (EF-P) stimulates ribosomal peptidyltransferase activity. EF-P is conserved in bacteria and is essential for cell viability. Eukarya and Archaea have an EF-P homologue, eukaryotic initiation factor 5A (eIF-5A). In the present study, we determined the crystal structure of EF-P from Thermus thermophilus HB8 at a 1.65-Å resolution. EF-P consists of three ␤-barrel domains (I, II, and III), whereas eIF-5A has only two domains (N and C domains). Domain I of EF-P is topologically the same as the N domain of eIF-5A. On the other hand, EF-P domains II and III share the same topology as that of the eIF-5A C domain, indicating that domains II and III arose by duplication. Intriguingly, the N-terminal half of domain II and the C-terminal half of domain III of EF-P have sequence homologies to the N- and C-terminal halves, respectively, of the eIF-5A C domain. The three domains of EF-P are arranged in an ‘‘L’’ shape, with 65- and 53-Å-long arms at an angle of 95°, which is reminiscent of tRNA. Furthermore, most of the EF-P protein surface is negatively charged. Therefore, EF-P mimics the tRNA shape but uses domain topologies different from those of the known tRNA-mimicry translation factors. Domain I of EF-P has a conserved positive charge at its tip, like the eIF-5A N domain.

T

ranslation elongation factor P (EF-P) was found as a protein that stimulates the peptidyltransferase activity of the 70S ribosome in Escherichia coli (1). EF-P enhances dipeptide synthesis with N-formylmethionyl-tRNA and puromycin in vitro, suggesting its involvement in the formation of the first peptide bond of a protein (2). E. coli EF-P is encoded by the efp gene and consists of 188 amino acid residues (3). The efp genes are universally conserved in Bacteria (4). Gene interruption experiments in E. coli revealed that the efp gene is essential for cell viability and is required for protein synthesis (3). The amount of EF-P in E. coli cells is ⬇1兾10th of that of EF-G; 800–900 molecules of EF-P exist in a cell, an amount consistent with 1 EF-P per 10 ribosomes (5). EF-P reportedly binds to both the 30S and 50S ribosomal subunits (6). Antibiotic sensitivity and footprinting studies have indicated that EF-P binds near the streptomycin-binding site of the 16S rRNA in the 30S subunit (6). EF-P also interacts with domains II and V of the 23S rRNA, i.e., near the peptidyltransferase center (PTC) (6, 7). Ribosome reconstitution experiments have shown that the L16 ribosomal protein or its N-terminal 47-residue fragment was required for EF-P-mediated peptide bond synthesis, whereas L11, L15, or L7兾L12 were not (8–10). Eukarya and Archaea seem to lack EF-P, although a similar function may be mediated by eukaryotic initiation factor 5A (eIF-5A) (4, 6). The eIF-5A protein is composed of ⬇140 amino acid residues and is shorter than EF-P by ⬇40 residues. Complete intracellular depletion of eIF-5A results in cell growth inhibition; however, protein synthesis seems to be only slightly reduced (11). eIF-5A is a unique cellular protein that contains the unusual amino acid hypusine [N␧-(4-aminobutyl-2-hydroxy)L-lysine], which is formed by posttranslational modification of a specific lysine residue. The enzyme that modifies lysine to hypusine in eIF-5A is essential for yeast viability (12). On the www.pnas.org兾cgi兾doi兾10.1073兾pnas.0308667101

other hand, hypusine is not found in bacteria (13). The structures of eIF-5A from three Archaea, Methanococcus jannaschii, Pyrobaculum aerophilum, and Pyrococcus horikoshii, have two domains, which are composed of several ␤-strands (14–16). Crystallizations of EF-P from E. coli and Aquifex aeolicus have been reported (7, 17). In this article, we report the crystal structure of EF-P from Thermus thermophilus HB8 at a 1.65-Å resolution. The EF-P structures are ␤-rich and is divided into three ␤-barrel domains (domains I, II, and III). Domains II and III of EF-P share a very similar topology. The structures of domains I and II of EF-P are superposable on the structures of the M. jannaschii, P. aerophilum, and P. horikoshii eIF-5A proteins. The overall structure of EF-P is strikingly similar to the L-shaped structure of tRNA. Materials and Methods Protein Preparation and Crystallization. The DNA fragment encod-

ing EF-P, the protein TT0860 (DNA Data Base in Japan, accession no. AB103477), was isolated from the T. thermophilus HB8 genome and was cloned into the expression plasmid, pET11a (Novagen). E. coli BL21(DE3) was transformed with the vector, and T. thermophilus EF-P was overexpressed. The protein was purified by successive chromatography steps on Q Sepharose and HiLoad Superdex 75 columns (Amersham Biosciences). Hampton Research Crystal Screen (18) was used to determine the initial crystallization conditions for EF-P. The final crystallization conditions, 100 mM Hepes-Na buffer (pH 7.6) and 1.35 M lithium sulfate at 16°C yielded high-quality crystals suitable for x-ray diffraction data collection. They belong to the space group P212121, with unit cell dimensions a ⫽ 55.8, b ⫽ 78.4, and c ⫽ 138.9 Å. The crystallographic asymmetric unit contains two nearly identical EF-P monomers.

Data Collection and Structure Determination. The crystal structure

of EF-P was solved by the multiple isomorphous replacement method. Three heavy-atom derivatives were prepared by soaking the EF-P crystals for 12 h in reservoir solutions containing 2 mM potassium tetrachloroaurate (III), 2 mM sodium ethylmercurithiosalicylate, and 2 mM mersalyl acid, respectively (Table 1). All the native and heavy-atom derivative data sets were collected from frozen crystals at 90 K by using synchrotron radiation at the SPring-8 beam lines (Hyogo, Japan). The data were processed with the program HKL2000 (19). Determination of the heavy atom positions and calculation of the This paper was submitted directly (Track II) to the PNAS office. Abbreviations: EF, elongation factor; eIF-5A, eukaryotic initiation factor 5A; rmsd, rms deviation. Data deposition: The atomic coordinates and structure factors have been deposited in the Protein Data Bank, www.pdb.org (PDB ID code 1UEB). 储To

whom correspondence should be addressed. E-mail: [email protected]. ac.jp.

© 2004 by The National Academy of Sciences of the USA

PNAS 兩 June 29, 2004 兩 vol. 101 兩 no. 26 兩 9595–9600

BIOCHEMISTRY

Edited by Paul R. Schimmel, The Scripps Research Institute, La Jolla, CA, and approved May 13, 2004 (received for review December 26, 2003)

Table 1. Crystallographic data

Data collection Resolution,* Å Observed reflections, n Unique reflections, n Completeness,† % Rsym,‡ % I兾␴(I)§ Heavy atom refinement and phasing Heavy atom sites, n Riso,¶ % Phasing power (centric–acentric)㛳 Rcullis** Mean FOM†† Refinement Resolution, Å Reflections, n Rcryst,‡‡ % Rfree,‡‡ % Protein atoms, n Water atoms, n rmsd bonds, Å rmsd angles, ° rmsd improper angles, °

Native

K2AuCl4

EMTS

Mersaryl acid

50–1.65 (1.71–4.65) 451,416 62,691 99.9 (100.0) 7.7 (49.6) 26.4 (3.0)

50–2.37 (2.45–2.37) 103,399 21,379 98.8 (92.6) 3.4 (9.1) 44.9 (22.1)

50–2.11 (2.19–2.11) 133,338 30,131 99.9 (99.8) 5.5 (14.6) 26.4 (7.9)

50–2.11 (2.19–2.11) 156,921 30,123 99.4 (98.8) 8.1 (26.9) 18.9 (5.9)

1 6.5 0.58–0.75 0.69

1 15.0 0.42–0.50 0.73

1 24.9 0.33–0.34 0.80

0.46 40–1.65 62,587 21.3 24.1 2797 407 0.005 1.2 0.83

EMTS, sodium ethylmecurithiosalicylate. *Resolution range of the highest shell is listed in parentheses. †Completeness in the highest-resolution shell is listed in parentheses. ‡R sym ⫽ 兺兩Iobs ⫺ 具I典兩兾兺Iobs, where Iobs is the observed intensity of reflection. Rsym in the highest-resolution shell is listed in parentheses. §I兾␴(I) in the highest-resolution shell is listed in parentheses. ¶R iso ⫽ 兺兩Fder ⫺ Fnat兩兾兺Fnat, where Fnat and Fder are the native and derivative structure factor amplitude, respectively. 㛳Phasing power ⫽ (兺F2 兾兺(F (obs) ⫺ F (calc))2)1/2, where F represents the calculated heavy atom structure factor amplitude. PH PH H H **Rcullis ⫽ 兺储FPH ⫾ FP兩 ⫺ FH兩兾兺兩FPH ⫾ FP兩. ††Mean figure of merit (FOM) ⫽ 具F best兾F典. ‡‡R cryst,free ⫽ 兺兩Fobs ⫺ Fcalc兩兾兺Fobs, where the crystallographic R factor is calculated including and excluding refinement reflections. In each refinement, free reflections consist of 5% of the total number of reflections.

multiple isomorphous replacement phases were carried out by using the program SOLVE (20). The experimental phases were improved by using the RESOLVE program (20) and further refined by using the ARP兾WARP program (21) to 1.65 Å. The improved electron density map was of high quality, which allowed the ARP兾WARP program to automatically build an almost complete model of one of the EF-P monomers (molecule A) in the asymmetric unit. Because the structure of the other EF-P molecule in the asymmetric unit (molecule B) is practically identical with that of molecule A, the molecule B model was readily produced by fitting the molecule A model to the electron density. The models were manually adjusted to the electron density by using the O program (22). Because no clear electron density was observed for the loop region (amino acid residues 139 –145) of molecule B, these residues were excluded from the coordinates. The refinement was carried out with several rounds of conventional molecular dynamics protocols with the CNS program (23), with all data in the resolution range of 40 –1.65 Å. The refinement converged to an R factor of 21.3% (Rfree ⫽ 24.9%) at a 1.65-Å resolution (Table 1). The final model has 91.8% and 8.2% of the amino acid residues in the most favored and additional allowed regions, respectively, of the Ramachandran plot, as indicated by the program PROCHECK (24). Graphic figures were created with the programs MOLSCRIPT (25) and RASTER3D (26) or GRASP (27). Determination of Molecular Weight in Solution. The molecular

weight of T. thermophilus EF-P in solution was estimated by light scattering (DynaPro 99, Protein Solutions, Charlottesville, VA)

9596 兩 www.pnas.org兾cgi兾doi兾10.1073兾pnas.0308667101

and by analytical ultracentrifugation (Optima XL-1, Beckman Coulter). Light scattering was performed at 20°C in 20 mM Tris䡠HCl buffer (pH 7.5) containing 150 mM NaCl and 1 mM DTT. Analytical ultracentrifugation was performed at 20°C in 20 mM Tris䡠HCl buffer (pH 7.5) containing 150 mM NaCl and 5 mM 2-mercaptoethanol. Results and Discussion Overall Structure. In the present study, we determined the crystal structure of EF-P from T. thermophilus at a 1.65-Å resolution by the multiple isomorphous replacement method. The crystallographic data are summarized in Table 1. The atomic coordinates have been deposited in the Protein Data Bank (PDB ID code 1UEB). In the crystal, two molecules (A and B) are in the asymmetric unit (Fig. 1A). The EF-P protein is a ␤-rich protein containing 16 ␤-strands and is made up of three ␤-barrel domains (domains I, II, and III) (Fig. 1B). The ␤-strands of molecule A are designated as ␤1–␤16, whereas the corresponding ␤-strands of molecule B are ␤1⬘– ␤16⬘, to discriminate between the two molecules. In the protein crystal, the ␤3-strand forms an antiparallel ␤-sheet with the ␤3⬘-strand of the other monomer (Fig. 1 A). This interaction connects the two monomers in the asymmetric unit, to form the dimer. A large, flat, six-stranded ␤-sheet is formed by ␤3, ␤4, ␤5, ␤3⬘, ␤4⬘, and ␤5⬘. The two monomers are related by a pseudo 2-fold axis, which is perpendicular to the ␤-sheet and passes near the carbonyl oxygen of His-27 on ␤3 (or ␤3⬘). The buried surface area between the EF-Ps was 539 Å2, which is ⬇5% of the monomer surface of EF-P. The T. thermophilus EF-P exists as a Hanawa-Suetsugu et al.

monomer (20,224 Da) under physiological conditions, as shown by analytical ultracentrifugation (20.8 kDa) and light-scattering experiments (23.2 kDa) (Fig. 2). This finding suggests that the monomer is the major functional unit of EF-P, although we cannot exclude the possibility that EF-P dimerizes during some functional stage. The overall shape of the EF-P monomer (Figs. 1C and 3 A and B) is remarkably similar to the L shape of the tRNA molecule (Fig. 3C). One arm of the L, made by domains I and II (Fig. 1C), is ⬇65 Å long and 23 Å wide (Fig. 3 A and B), whereas the other arm, formed by domains II and III (Fig. 1C), is ⬇53 Å long and 25 Å wide (Fig. 3 A and B). The angle made by the two arms of EF-P is ⬇95° (Fig. 3 A and B). In the yeast tRNAPhe L-shaped structure, with two arms at ⬇90° (28–30), the acceptor-T arm is ⬇65 Å long and 22 Å wide, whereas the anticodon-D arm is ⬇70 Å long and 20 Å wide (Fig. 3C). The overall shapes and the sizes of tRNA molecules are well conserved. The shape and the size

Fig. 2. A plot of the sedimentation equilibrium data with the residuals from the best fit to a single ideal species. This plot shows the data with protein at 0.5 mg䡠ml⫺1 and a speed of 20,000 rpm. The estimated partial specific volume of the protein is 0.74, and the solvent density was calculated to be 1.005 g䡠ml⫺1. All nine data sets (three speeds, three concentrations) were fitted together.

Hanawa-Suetsugu et al.

of the EF-P molecule are similar to those of tRNA molecules. Notably, EF-P is an acidic protein (calculated pI ⫽ 4.6), and most of its surface is negatively charged (Fig. 3 A and B). Therefore, its overall shape is reminiscent of that of tRNA, although it is currently unclear which arm of EF-P corresponds to the acceptor or anticodon arm of tRNA (Fig. 3 A and B). The structures of the two EF-P monomers in the asymmetric unit are quite similar to each other, with an rms deviation (rmsd) of 0.79 Å over all the protein atoms. Nevertheless, their interdomain orientations differ slightly. Although the domain I structures of molecules A and B are practically the same (they can be superposed on each other with an rmsd of 0.57 Å), the relative orientation of domain I to domain II in the two molecules differs by ⬇4°. Therefore, this arm is slightly flexible. On the other hand, the difference in the relative orientation of domains II and III between molecules A and B is negligibly small. Both of the domain I–II and II–III interfaces are formed by hydrophobic side chains with high surface complementarities. Therefore, the L shape of EF-P is likely to be a native conformation, rather than an artifact due to crystal packing.

Fig. 3. Structure comparison of EF-P with tRNA and ribosome-binding proteins. (A and B) EF-P from T. thermophilus (PDB ID code 1UEB). (C) tRNAPhe from Saccharomyces cerevisiae (PDB ID code 1EVV). (D) EF-G from T. thermophilus (PDB ID code 1EFG). (E) Ribosome recycling factor from E. coli (PDB code 1EK8). (F) Release factor 2 from E. coli (PDB ID code 1GQE). PNAS 兩 June 29, 2004 兩 vol. 101 兩 no. 26 兩 9597

BIOCHEMISTRY

Fig. 1. The structure of T. thermophilus EF-P. (A) Ribbon diagram showing the two molecules (A and B) in the asymmetric unit. The arrows represent ␤-strands. The three domains of each molecule are colored green, red, and blue, in dark and light tones for molecules A and B, respectively. The pseudosymmetry axis is marked in magenta. (B) Topology diagram of the T. thermophilus EF-P structure. The arrows represent ␤-strands and the ellipses represent 310-helixes. Domains I, II, and III are colored green, red, and blue, respectively. (C) Ribbon presentation of the T. thermophilus EF-P (molecule A) crystal structure (stereoview). Color coding is as in B.

Fig. 4. Alignment of the amino acid sequences of EF-P and eIF-5A. bTth, bEco, and bBsu are the bacterial EF-P proteins from T. thermophilus, E. coli, and Bacillus subtilis, respectively. eHum, eSce, ePae, and eMja are the eIF-5A proteins from Homo sapiens, S. cerevisiae, P. aerophilum, and M. jannaschii, respectively. The secondary structure of EF-P from T. thermophilus and M. jannaschii are indicated with arrows for ␤-strands and coils for helices. The amino acid residues conserved throughout the EF-P proteins are highlighted in yellow. The amino acid residues completely conserved throughout EF-P and eIF-5A are shown with white letters highlighted in red, and those well conserved in EF-P and eIF-5A are shown with red letters.

Several proteins possess domain(s) similar to a portion of tRNA. The C-terminal domain of EF-G, protruding from the globular GTPase domain, reportedly has a shape similar to that of the anticodon-stem loop in the EF-Tu–tRNA–GDPNP ternary complex (31, 32). Ribosome recycling factor, eukaryal release factor 1, and release factor 2 each possess a protruding domain (33–37). The entire structure of EF-P mimics the overall shape of a tRNA molecule, like these tRNA-mimicking proteins (Fig. 3). However, the ‘‘tRNA mimicry’’ does not necessarily mean that the proteins bind to the tRNA-binding sites on the ribosome as tRNA molecules do (36). Based on a cryo-electron microscopy analysis, it is reported that release factor 2 is incorporated in the ribosome by assuming a different shape from that observed in the crystal structure and by changing its interdomain orientations (38, 39). A biochemical analysis revealed that the binding mode of ribosome-recycling factor is different from that of tRNA (40). It has been hypothesized that the tRNA mimicry might allow the proteins to pass through the entrance of the ribosome (36). In this context, biochemical data suggested that EF-P binds to the A site of the ribosome (6, 7). Ganoza et al. (6) proposed that the EF-P-binding domain is very near the site of EF-Tu and EF-G binding on both the 30S and 50S subunits. However, the ribosome-binding manner of EF-P is unknown and must be studied further with the ribosome-bound state structure of the EF-P. Domain Architectures. The N-terminal domain I (amino acid

residues 1–64) contains six ␤-strands (␤1–␤6) and a one-turn 310-helix (Fig. 1 B and C). The strands ␤2, ␤3, ␤4, and ␤5 form a large antiparallel ␤-sheet, whereas ␤1 and ␤6 form a smaller, curved antiparallel ␤-sheet. The larger ␤-sheet is flat on one 9598 兩 www.pnas.org兾cgi兾doi兾10.1073兾pnas.0308667101

side, but its other side is curved, and, together with the smaller ␤-sheet, it is involved in a ␤-barrel with a hydrophobic core. The 310-helix is located between ␤1 and ␤2, forming the lid of the barrel. The ␤3-, ␤4-, and ␤5-strands are much longer than the other strands, and therefore, the region connecting ␤3 and ␤4 (amino acid residues 28–36) protrudes outward. The conserved basic residues, Arg-8, Lys-29, Arg-32, Lys-40, and Lys-42, are clustered on the ␤3-side surface of domain I (Fig. 3A), implying the significance of this region in EF-P function, such as nucleic acid binding. On the other hand, the opposite surface of domain I (Fig. 3B) is more negatively charged, as Asp-6, Asp-16, and Glu-61 are highly conserved among the EF-P sequences (Fig. 4A). It is remarkable that central domain II and C-terminal domain III of EF-P possess the same fold (Fig. 1 B and C). The two ␤-barrel domains are tandemly arranged along the axis of one arm of the L-shaped EF-P molecule. Domain II (amino acid residues 65–126) contains five ␤-strands (␤7–␤11). The ␤7-, ␤8-, and ␤9-strands form a curved antiparallel ␤-sheet on one hand, whereas ␤7, ␤10, and ␤11 form a similar antiparallel ␤-sheet on the other, and thus together they construct a typical ␤-barrel structure with a hydrophobic core. Domain III (amino acid residues 127–184) also possesses a ␤-barrel architecture consisting of five ␤-strands (␤12–␤16), in which two antiparallel ␤-sheets, consisting of ␤12, ␤13, and ␤14, and ␤12, ␤15, and ␤16, respectively, face each other. Thus, the two domains possess the same topology for the strand connectivities. Domains II and III were superposed on each other with an rmsd of 1.2 Å for 31 C␣ atoms. These domains share partial sequence similarity (10 of 58 residues are identical), which implies that they originated from a single domain, probably by a duplication event. It should be Hanawa-Suetsugu et al.

noted here that the sequence of domain III is much better conserved than that of domain II in the EF-P proteins (Fig. 4). According to a DALI-based protein–structure comparison, the fold composed of domains II and III is similar to that of the so-called oligonucleotide-binding fold, observed in E. coli coldshock protein, the RNA-binding domains of E. coli polyribonucleotide nucleotidyltransferase, E. coli transcription factor Rho, Pyrococcus kodakaraensis aspartyl-tRNA synthetases, and so forth. Thus, it is possible that domains II and兾or III of EF-P are involved in RNA (or DNA) binding. However, because almost the entire surface of domain II is negatively charged (Glu-76, Glu-78, Asp-84, Glu-89, and Glu-106 are conserved), this domain probably does not bind nucleic acids. In contrast, one surface of domain III has a patch of conserved basic residues, including Arg-140, Lys-149, Arg-176, and Arg-183, which are probably favorable for nucleic acid binding. The other surface of domain III is negatively charged, and Asp-134, Glu-154, Glu-166, and Glu-169 are conserved. Comparison Between EF-P and eIF-5A. eIF-5A is an archaeal兾 eukaryal paralog of EF-P. Thus far, three eIF-5A crystal structures have been reported, from M. jannaschii, P. aerophilum, and P. horikoshii (14–16). These structures revealed that eIF-5A consists of only two ␤-barrel domains. These N and C domains appear to correspond to domains I and II (or III), respectively, of EF-P. The overall shape of the two-domain eIF-5A is a straight bar, in contrast to the L-shaped structure of the threedomain EF-P (Fig. 5A). Intriguingly, slight flexibility in the relative orientation of domain I to domain II has also been found for the M. jannaschii and P. horikoshii eIF-5A proteins (⬇7°) (14, 16), which is very similar to the internal-domain flexibility of EF-P (⬇4°) described above. The structure of the EF-P domain I superposed well on those of the N domains of M. jannaschii eIF-5A (rmsd ⫽ 1.3 Å per 61 C␣ atoms), P. aerophilum eIF-5A (rmsd ⫽ 1.2 Å per 60 C␣ atoms), and P. horikoshii eIF-5A (rmsd ⫽ 1.4 Å per 63 C␣ Hanawa-Suetsugu et al.

Concluding Remarks. The overall tRNA-like shape of the EF-P

molecule and its charge distribution seem to be suitable for this protein to bind to the ribosome by spanning the two subunits (6). EF-P may bind to the tRNA-binding site(s) on the ribosome by mimicking the tRNA shape. eIF-5A corresponds to domains I and II of EF-P. It is interesting to note that, in this context, eIF-5A might correspond to a minihelix or anticodon helix of tRNA. Although eIF5A is not thought to be essentially involved in translation in yeast (11), it is also possible that, in Archaea and Eukarya, some other protein or RNA factor(s) compensates structurally or functionally for the missing third domain. Many questions about EF-P still remain. How does EF-P interact with the ribosome? Which arm of the L corresponds to the acceptor or anticodon arm of tRNA? Does it form a ternary complex with EF-Tu䡠GTP? How can it activate the peptidyltransferase of the ribosome? To answer these questions, further functional and structural studies are needed. PNAS 兩 June 29, 2004 兩 vol. 101 兩 no. 26 兩 9599

BIOCHEMISTRY

Fig. 5. Structure comparison of EF-P and eIF-5A. (A) Superimposition of the ribbon diagrams of T. thermophilus EF-P (blue) and M. jannaschii eIF-5A (yellow). (B) Amino acid residues conserved in EF-Ps and eIF-5As color-coded on the surface of T. thermophilus EF-P.

atoms). Consistent with the structural information, the domain I amino acid sequences of the known EF-P and eIF-5A proteins share some similarity (Fig. 4). According to a structure-based sequence comparison, 42% of the T. thermophilus EF-P amino acid residues are conserved or semiconserved in eIF-5As. In particular, the amino acid residues corresponding to Lys-29, Gly-31, Gly-33, and Ala-35 of the T. thermophilus EF-P are absolutely conserved in the EF-P兾eIF-5A superfamily, and they are located on the loop connecting ␤3 and ␤4 in both the EF-P and eIF-5A structures. The eIF-5As conserve a Lys residue at the tip of the loop (Lys-40, Lys-42, and Lys-37 for the M. jannaschii, P. aerophilum, and P. horikoshii eIF-5A proteins, respectively). This Lys residue is modified posttranslationally to a hypusine to generate the mature eIF-5A (12). It is remarkable that a Lys or Arg residue is also strictly conserved at the corresponding position of the EF-P proteins in bacteria (Fig. 4). For T. thermophilus EF-P, the corresponding basic residue is Arg-32. In the present EF-P crystal structure, the Arg-32 side chain protrudes toward the solvent at the end of the domain I arm. The bacterial EF-P reportedly lacks hypusine (13). Nevertheless, the conservation of the amino acid residues in the ␤3–␤4 connective linker and the basic residue at the tip of the loop implies the significance of this region for EF-P function. The eIF-5A C domain was compared with the EF-P domain II (Fig. 5A). The C domains of the M. jannaschii, P. aerophilum, and P. horikoshii eIF-5As overlapped well on the EF-P domain II, with rmsd values of 1.4 Å, 1.5 Å, and 2.0 Å, for 53, 58, and 56 C␣ atoms, respectively. The similarity between the EF-P and eIF-5A sequences exists only in the N-terminal half of domain II (the ␤7–␤9 region in EF-P) and not in the C-terminal half of domain II (Fig. 3). On the other hand, as the EF-P domains II and III share the same folding topology, the eIF-5A domain II also superposed well on the EF-P domain III (rmsd values of 2.1 Å, 2.4 Å, and 2.2 Å, for 41, 41, and 43 C␣ atoms of the eIF-5A proteins from M. jannaschii, P. aerophilum, and P. horikoshii, respectively). It is remarkable that the amino acid sequence of the C-terminal half of the eIF-5A C domain is similar to that of the C-terminal half of EF-P domain III (the ␤15–␤16 region in EF-P) (Fig. 5B). Thus, a gap appears in the eIF-5A sequences, as compared with the EF-P sequences. This implies the possibility that eIF-5A originated from an ancestral three-domain protein common to EF-P by a deletion event, in which the ancestral eIF-5A might have lost the region corresponding to the EF-P ␤10–␤14 region. Because the missing region topologically corresponds to a single ␤-barrel domain, the resultant eIF-5A C domain retains the same folding topology as domains II and III of EF-P. On the other hand, it is also possible that the sequence of the EF-P ␤10–␤14 region diversified after domains II and III were formed by duplication.

We thank R. Ushikoshi, H. Tanaka, and Y. Kamewari for preparation of the T. thermophilus EF-P protein, H. Nakajima and Drs. Y. Kawano and N. Kamiya for supporting our data collection at beamline 45PX at SPring-8, and Dr. S. Yokobori (Tokyo University of Pharmacy and Life Science) for helpful discussions about the phylogenetic analysis. This

work was supported in part by a grant from the Organized Research Combination System of the Science and Technology Agency of Japan and by the RIKEN Structural Genomics兾Proteomics Initiative and the National Project on Protein Structural and Functional Analyses, Ministry of Education, Culture, Sports, Science and Technology of Japan.

1. Glick, B. R. & Ganoza, M. C. (1975) Proc. Natl. Acad. Sci. USA 72, 4257–4260. 2. Glick, B. R., Chladek, S. & Ganoza, M. C. (1979) Eur. J. Biochem. 97, 23–28. 3. Aoki, H., Dekany, K., Adams, S. L. & Ganoza, M. C. (1997) J. Biol. Chem. 272, 32254–32259. 4. Kyrpides, N. C. & Woese, C. R. (1998) Proc. Natl. Acad. Sci. USA 95, 224–228. 5. An, G., Glick, B. R., Friesen, J. D. & Ganoza, M. C. (1980) Can. J. Biochem. 58, 1312–1314. 6. Ganoza, M. C., Kiel, M. & Aoki, H. (2002) Microbiol. Mol. Biol. Rev. 66, 460–485. 7. Aoki, H., Adams, S. L., Turner, M. A. & Ganoza, M. C. (1997) Biochimie 79, 7–11. 8. Ganoza, M. C., Zahid, N. & Baxter, R. M. (1985) Eur. J. Biochem. 146, 287–294. 9. Baxter, R. M., Ganoza, M. C., Zahid, N. & Chung, D. G. (1987) Eur. J. Biochem. 163, 473–479. 10. Glick, B. R. & Ganoza, M. C. (1976) Eur. J. Biochem. 71, 483–491. 11. Kang, H. A. & Hershey, J. W. (1994) J. Biol. Chem. 269, 3934–3940. 12. Park, M. H., Wolff, E. C. & Folk, J. E. (1993) Trends Biochem. Sci. 18, 475–479. 13. Magdolen, V., Klier, H., Wohl, T., Klink, F., Hirt, H., Hauber, J. & Lottspeich, F. (1994) Mol. Gen. Genet. 244, 646–652. 14. Kim, K. K., Hung, L. W., Yokota, H., Kim, R. & Kim, S. H. (1998) Proc. Natl. Acad. Sci. USA 95, 10419–10424. 15. Peat, T. S., Newman, J., Waldo, G. S., Berendzen, J. & Terwilliger, T. C. (1998) Structure (London) 6, 1207–1214. 16. Yao, M., Ohsawa, A., Kikukawa, S., Tanaka, I. & Kimura, M. (2003) J. Biochem. (Tokyo) 133, 75–81. 17. Kristensen, O. & Laurberg, M. (2002) Acta Crystallogr. D 58, 1039–1041. 18. Jancarik, J. & Kim, S. H. (1991) J. Appl. Crystallogr. 24, 409–411. 19. Otwinowski, Z. & Minor, W. (1997) Methods Enzymol. 276, 307–326. 20. Terwilliger, T. C. & Berendzen, J. (1999) Acta Crystallogr. D 55, 849–861. 21. Perrakis, A., Morris, R. & Lamzin, V. S. (1999) Nat. Struct. Biol. 6, 458–463. 22. Jones, T. A., Zou, J. Y., Cowan, S. W. & Kjeldgaard, M. (1991) Acta Crystallogr. A 47, 110–119.

23. Brunger, A. T., Adams, P. D., Clore, G. M., DeLano, W. L., Gros, P., Grosse-Kunstleve, R. W., Jiang, J. S., Kuszewski, J., Nilges, M., Pannu, N. S., et al. (1998) Acta Crystallogr. D 54, 905–921. 24. Laskowski, R. A., MacArthur, M. W., Moss, D. S. & Thornton, J. M. (1993) J. Appl. Crystallogr. 26, 283–291. 25. Kraulis, P. J. (1991) J. Appl. Crystallogr. 24, 946–950. 26. Merritt, E. & Bacon, D. J. (1997) Methods Enzymol. 277, 505–524. 27. Nicholls, A., Sharp, K. A. & Honig, B. (1991) Proteins 11, 281–296. 28. Robertus, J. D., Ladner, J. E., Finch, J. T., Rhodes, D., Brown, R. S., Clark, B. F. & Klug, A. (1974) Nature 250, 546–551. 29. Suddath, F. L., Quigley, G. J., McPherson, A., Sneden, D., Kim, J. J., Kim, S. H. & Rich, A. (1974) Nature 248, 20–24. 30. Jovine, L., Djordjevic, S. & Rhodes, D. (2000) J. Mol. Biol. 301, 401–414. 31. Czworkowski, J., Wang, J., Steitz, T. A. & Moore, P. B. (1994) EMBO J. 13, 3661–3668. 32. Nissen, P., Kjeldgaard, M., Thirup, S., Polekhina, G., Reshetnikova, L., Clark, B. F. & Nyborg, J. (1995) Science 270, 1464–1472. 33. Selmer, M., Al-Karadaghi, S., Hirokawa, G., Kaji, A. & Liljas, A. (1999) Science 286, 2349–2352. 34. Kim, K. K., Min, K. & Suh, S. W. (2000) EMBO J. 19, 2362–2370. 35. Song, H., Mugnier, P., Das, A. K., Webb, H. M., Evans, D. R., Tuite, M. F., Hemmings, B. A. & Barford, D. (2000) Cell 100, 311–321. 36. Vestergaard, B., Van, L. B., Andersen, G. R., Nyborg, J., Buckingham, R. H. & Kjeldgaard, M. (2001) Mol Cell. 8, 1375–1382. 37. Brodersen, D. E. (2003) Nat. Struct. Biol 10, 78–80. 38. Rawat, U. B., Zavialov, A. V., Sengupta, J., Valle, M., Grassucci, R. A., Linde, J., Vestergaard, B., Ehrenberg, M. & Frank, J. (2003) Nature 421, 87–90. 39. Klaholz, B. P., Pape, T., Zavialov, A. V., Myasnikov, A. G., Orlova, E. V., Vestergaard, B., Ehrenberg, M. & van Heel, M. (2003) Nature 421, 90–94. 40. Lancaster, L., Kiel, M. C., Kaji, A. & Noller, H. F. (2002) Cell 111, 129–140.

9600 兩 www.pnas.org兾cgi兾doi兾10.1073兾pnas.0308667101

Hanawa-Suetsugu et al.