MAPS: An interactive web server for membrane

0 downloads 0 Views 383KB Size Report
Jayasinghe et al10. The fraction ASA values fASA (i) ... 1—(a): A schematic diagram of the protein-lipid bilayer ... and fASA = 1 was used for H(i) values (Eq. 1)] ...
Indian Journal of Biochemistry & Biophysics Vol. 48, April 2011, pp. 106-110

MAPS: An interactive web server for membrane annotation of transmembrane protein structures Jitender Cheema* and Gautam Basu* Department of Biophysics, Bose Institute, P-1/12 CIT Scheme VIIM, Kolkata 700 054, India. Received 07 October 2010; revised 11 March 2011 The exact positioning of the membrane in transmembrane (TM) proteins plays important functional roles. Yet, the structures of TM proteins in protein data bank (pdb) have no information about the explicit position of the membrane. Using a simple hydrophobic lipid-protein mismatch energy function and a flexible lipid/water boundary, the position of lipid bilayer for representative TM proteins in pdb have been annotated. A web server called MAPS (Membrane Annotation of Protein Structures; available at: http://www.boseinst.ernet.in/gautam/maps) has been set up that allows the user to interactively analyze membrane-protein orientations of any uploaded pdb structure with user-defined membrane flexibility parameters. Keywords: Web server MAPS, Transmembrane proteins, Membrane flexibility

Transmembrane (TM) protein structures in the pdb1 have no explicit information about the position of the membrane. It is not very difficult to come up with a physically reasonable orientation of the protein with respect to the membrane using residue-based hydrophobicities. A simple score based on hydropathic mismatch energy can be made more complex by adding lipid perturbation contribution2 or a structure factor3. In addition, the simple description of the membrane as a planar slab with a constant thickness can be changed, especially with respect to the water lipid boundary. A target function more sophisticated than simple hydrophobic mismatch energy, a non-rigid lipid bilayer model and an exhaustive search for the protein-membrane orientation can yield more reliable membrane-protein orientations. Even with such exhaustive searches using sophisticated objective functions, the final membraneprotein boundary is no more than just a prediction. Results from two different prediction methods differ in their physical meaning, which depends on the algorithm and the models used. In other words, given a number of such predictions, it is up to the user to —————— * Corresponding author E-mail: [email protected] (JC) [email protected] (GB) +Present address: Department of Computational and Systems Biology Ohn Innes Centre, Norwich Research Park Colney, Norwich, NR4 7UH, UK

choose the prediction that uses the most appropriate model and parameters for the specific problem. Thus, the availability of more than one publicly accessible web servers employing different methodologies for membrane annotation of TM proteins is to the advantage of the user. Two web servers currently available towards this end are PDBTM4 and OPM5. However, they do not: i) provide options for membrane flexibility, and ii) allow interactive sessions in where the user can change parameters and redo the calculations. We have set up a freely available web server MAPS (Membrane Annotated Protein Structures), where a flexible membrane boundary algorithm is implemented to annotate TM proteins and which allows a user to interactively change parameters and obtain different membrane annotations for the same protein. In addition to being complementary to the existing servers, the interactive nature of MAPS can provide additional benefit to students and new learners of structural bioinformatics. Results and Discussion Formulation of the problem

Given a protein structure, all residues are first assigned a fraction solvent accessible surface area (ASA) weighted hydrophobicity H(i)

H(i) = f ASA h(i)

… (1)

107

CHEEMA & BASU: MAPS: AN INTERACTIVE WEB SERVER

The hydrophobicity of residues h(i) is user defined and current choices are WW6,7 , KD8 and GES9 scales. KD and GES scales are adjusted by 1.0 kcal mol-1 as compensation for per-residue backbone dehydration cost as suggested by Jayasinghe et al10. The fraction ASA values f ASA (i) are calculated by dividing the DSSP-generated11 ASA by the maximum ASA (estimated from DSSPgenerated ASA of the residue when the residue is present with extended conformation in the tripeptide Gly-Xxx-Gly). The protein-bilayer orientation is described schematically in Fig. 1a. The lipid bilayer is considered to be a rectangular slab of equilibrium half width w h0 and the protein is considered as a rigid body. During the calculation, the bilayer is fixed and is centered at the origin with its normal parallel to the z-axis. The protein is placed in the bilayer such that the z-component of its geometric center is 0. During the calculation the protein is translated along the z-axis (by z0) and tilted from the z-axis (θ) and rotated around the z-axis (φ) to produce new bilayerprotein orientations. Each new orientation is assigned a score as discussed below. The lipid bilayer with equilibrium half width w h0 can fluctuate by ±w f without any energy penalty (Fig. 1b). However, if the boundary stretches such that w h > (w h0 + w f ) or it shrinks such that

w h < (w h0 − w f ) , then there is an energy penalty for stretching ( E + ) or shrinking ( E− ): E + = km [w h − (w h0 + w f )]2

E + and E− are shown as a function of (w h0 + w f ) and (w h0 − w f ) respectively in Fig. 2. For a given bilayer model ( w h0 , w f and ( w h0 , w f and km ), the energy E(z0,θ, φ ) of a given lipidprotein orientation (Fig. 1a) is given by a sum of residue-based (residue index i) energy contributions, R{z(i), H (i), z 0,θ , φ}, where z(i) is the value of the z Cartesian coordinate of the i-th residue, as:

{

E ( z0, θ, φ, w 0h , w f , k m ) = ∑ R z(i ), H (i), z 0, θ, φ, w 0h , w f , k m i

}

… (3) Contributions from individual residues, R(i) in Eq. 3 are given by:

Fig. 2—Membrane distortion energy ( w f = 0.5 Å,

Fig. 1—(a): A schematic diagram of the protein-lipid bilayer system [Starting from the original configuration (solid lines) the protein is translated along the z-axis (by z0), rotated about the z-axis (by angle) and tilted from the z-axis (by angle) to produce a new configuration (dotted lines); and (b): Six possible positionings of hydrophobic (filled circles) and hydrophilic (open circles) residues with respect to the membrane (shaded) boundary and the corresponding residue contributions to the total mismatch energy (Eqs 3 and 4).

… (2)

E− = km [(w h0 − w f ) − w h ]2

kd

= 0.1

kcal.mol-1.Å-2; Eq. 2) displayed (black broken line) as a function of perpendicular distance from the membrane boundary. [Membrane stretching ( E + ) and contraction ( E − ) corresponds to positive and negative distances, respectively. The stretching energy E + changes to E + + H(i) (blue curves) when a hydrophobic residue gets incorporated in the membrane as result of stretching. The hydrophobic mismatch energies H(i) of hydrophilic residues are displayed (red lines) along with the E − curve for comparison. The membrane can contract only when E− − H(i) < 0 (see Eq. 4). The WW6-7 hydrophobicity scale, and fASA = 1 was used for H(i) values (Eq. 1)]

108

INDIAN J. BIOCHEM . BIOPHYS., VOL. 48, APRIL 2011

 I. [H(i)] if z(i) ≤ (w 0 + w ) and H(i) < 0 h f   II. [E + + H(i)] if z(i) > (w 0h + w f ) and E + + H(i) < 0 R(i) =  0  III. [E − ] if z(i) ≤ (w h − w f ) and E − − H(i) < 0  IV. [H(i)] if z(i) ≤ (w 0h − w f ) and E − − H(i) > 0

… (4) For a hydrophobic residue ( H(i) < 0 ) when z(i) ≤ (w h0 + w f ) , the residue is considered to be always inside the membrane with an energy contribution H(i) (Type I in Eq. 4 and Fig. 1b). When z(i) > (w h0 + w f ) , the residue is within the membrane with an energy contribution E + + H(i) , only if the unfavorable membrane stretching energy E + is less than the favorable hydrophophic mismatch energy H(i) ; (Type II in Eq. 4 and Fig. 1b). On the other hand, if the unfavorable membrane stretching energy exceeds the favorable hydrophobic mismatch energy, the residue is considered to be outside the membrane with no energy contribution (Type IIa in Fig. 1b). For a hydrophilic residue ( H(i) > 0 ), when z(i) ≤ (w h0 − w f ) , the residue is considered to be inside the membrane only if the unfavorable membrane contracting energy E− is less than the unfavorable mismatch energy H(i) . Such a residue will contribute E− to the total energy and is covered in Type III in Eq. 1 and Fig. 1b. On the other hand, when the unfavorable membrane contracting energy exceeds the unfavorable hydrophobic mismatch energy H(i) , the residue is considered inside the membrane with an energy contribution H(i) (type IV in Fig. 1b and Eq. 4). When z(i) > (w h0 − w f ) , the residue is always outside the membrane with no energy contribution (shown as type V in Fig. 1b). In addition, all residues for which H(i) = 0 are considered to be inside the membrane, if they lie within w h0 Å from the membrane center, although they do not contribute to E(z0) . For a given set of w h0 , w f and km, the accepted membrane-protein orientation corresponds to that {z0,θ, φ} for which E(z0,θ, φ ) (Eq. 4) is the minimum.

Fig. 3—The interactive page of MAPS, where a structure is uploaded by the user or chosen from MAPS database, parameters for the calculation are selected and a job is submitted

up the web server MAPS (www.boseinst.ernet. in/gautam/maps). From the homepage, the option ‘Interactive’ allows the user to start the program (Fig. 3). The first step is choosing a PDB file, which can be chosen from the existing searchable database (currently 980 pdb files) or uploaded by the user. Visualization (3D) of protein structures in MAPS is implemented using Jmol (http://jmol.sourceforge.net/) Applets that can be viewed in any Java-enabled browser with Java runtime environment support. The membrane-annotated sequence database, generated using the default parameters on a select set of pdb structures can be downloaded. Membrane-annotated pdb files (membrane in/out assignments are shown in the Cα B-factor column: 96 (in), 3 (out)), sequences in fasta format (where upper case and lower case letters signify amino acids inside and outside the membrane, respectively) and rasmol figures (gif) can also be downloaded for any pdb structure in the results section in the interactive mode (Fig. 4). Program parameters Hydrophobicity scale

The MAPS server

The choices for h(i) are: WW6-7, ii) KD8 (default) and GES9 scales. KD and GES scales are adjusted by 1.0 kcal mol-1 for compensation for per-residue backbone dehydration cost10. The program also allows a user defined hydrophobic scale.

Options and features

Residue centering

Java, JavaScript, C and Python languages have been used to implement the program and setting

The position of each residue is assigned either by: i) CA (The position of its Cα atoms), or ii) SC

CHEEMA & BASU: MAPS: AN INTERACTIVE WEB SERVER

109

Fig. 4—MAPS membrane annotation page for human aquaporin 1 (pdb code: 1H6I) [The color-annotated structure is shown on the left, the color-annotated sequence is shown on the right and parameters used for the calculation are shown below the sequence. The user can change parameters and redo calculations and download the figure, sequence and annotated pdb structure from this page]

(geometric center of its side chain). Option i) is the default and the only option, if the pdb contains only Cα atoms. ASA weighing

Two options are available: i) ON and ii) OFF. Option i) is the default. Option ii) cannot be used, if the pdb contains only Cα atoms. Membrane flexibility

The membrane can be rigid or flexible (default). Default Parameters: i) wh = 26.0 Å (DMPC bilayer width at 30°C);12 ii) wf and km are 0.25 Å and 0.1 kcal.mol-1.Å-1, the latter derived by fitting a Gaussian distribution to the equilibrium DMPC carbonyl density along the bilayer normal generated from a long MD simulation of DMPC bilayer13. Initial orientation

Options for initial protein orientation are: i) pdb (Cartesian z-axis of the input pdb), and ii) MAPS (this option automatically assigns a suitable protein axis

similar to that described in TMDET3 with minor modifications). Utility and uniqueness The primary aim of MAPS is to provide the user with an interactive environment for membraneannotation of TM proteins based on a simple model that includes membrane flexibility. All TM proteins are associated with “natural” membrane partners (endoplasmic reticulum, plasma membrane etc.) with unique width and flexibility. The web server MAPS allows the user to adjust membrane parameters that best fits the problem in an interactive fashion. The uniqueness of MAPS is its interactive nature and the inclusion of membrane flexibility. We hope MAPS will not only be beneficial to active scientists in the field, but also be used by students to appreciate the subtleties of the problem by interactively using the server and changing the parameters that suit their purpose.

110

INDIAN J. BIOCHEM . BIOPHYS., VOL. 48, APRIL 2011

Acknowledgements Funding for this work was provided by MEXT, Japan, and DBT, India. We thank Sanjib K Gupta and Madhurima Das for their assistance. GB likes to acknowledge critical comments by Nobuhiro Go and Takeshi Kawabata. References 1 Berman H M, Westbrook J, Feng Z, Gilliland G, Bhat, T N, Weissig H, Shindyalov, I N & Bourne P E (2000) Nucleic Acids Res 28, 235-241 2 Basyn F, Charloteaux B, Thomas A & Brasseur R (2001) J Mol Graph Model 20, 235-244 3 Tusnady G E, Dosztanyi Z & Simon I (2004) Bioinformatics 20, 2964-2972

4 Tusnady G E, Dosztanyi Z & Simon I (2005) Bioinformatics 21, 1276-7 5 Lomize M A, Lomize A L, Pogozheva I D & Mosberg H I (2006) Bioinformatics 22, 623-625 6 Wimley W C & White S H (1996) Nat Struct Biol 3, 842-848 7 Wimley W C, Creamer T P & White S H (1996) Biochemistry 35, 5109-5124 8 Kyte J & Doolittle R E (1982) J Mol Biol 157, 105-132 9 Engelman D M, Steitz T A & Goldman A (1986) Annu Rev Biophys Chem 15, 321-353 10 Jayasinghe S, Hristova K & White S H (2001) J Mol Biol 312, 927-934 11 Kabsch W & Sander C (1983) Biopolymers 22, 2577-2637 12 Nagle J F & Tristram-Nagle S (2000) Biochim Biophys Acta 1469, 159-195 13 Gorfe A A, Pellarin R & Caflisch A (2004) J Am Chem Soc 126, 15278