The crystal structure of Umbelopsis vinacea -gal-. actosidaseI, which belongs to glycoside hydrolase family 27, was determined at 2.0AË resolution. The.
Biosci. Biotechnol. Biochem., 73 (10), 2360–2364, 2009
Communication
The Tetramer Structure of the Glycoside Hydrolase Family 27 -Galactosidase I from Umbelopsis vinacea Zui F UJIMOTO,1; y Satoshi K ANEKO,2 Wook-Dong K IM,2; * Gwi-Gun P ARK,2;3 Mitsuru M OMMA,1 and Hideyuki K OBAYASHI2; ** 1
Protein Research Unit, National Institute of Agrobiological Sciences, 2-1-2 Kannondai, Tsukuba, Ibaraki 305-8602, Japan 2 Food Biotechnology Division, National Food Research Institute, 2-1-12 Kannondai, Tsukuba, Ibaraki 305-8642, Japan 3 Department of Food and Bioengineering, Kyungwon University, Seongnam, Gyeonggi, 461-701, Republic of Korea Received August 13, 2009; Accepted September 8, 2009; Online Publication, October 7, 2009 [doi:10.1271/bbb.90604]
The crystal structure of Umbelopsis vinacea -galactosidase I, which belongs to glycoside hydrolase ˚ resolution. The family 27, was determined at 2.0 A monomer structure was well conserved with those of glycoside hydrolase family 27 enzymes. The biological tetramer structure of this enzyme was constructed by the crystallographic 4-fold symmetry, and tetramerization appeared to be caused by three inserted peptides that were involved in the tetramer interface. The quaternary structure indicated that the substrate specificity of this enzyme might be related to the tetramer formation. Three N-glycosylated sugar chains were observed, and their structures were found to be of the high-mannose type. Key words:
crystal structure; -galactosidase; glycoprotein; Umbelopsis vinacea; tetramer
-Galactosidase (E.C. 3.2.1.22) is an exoglycosidase that hydrolyzes an -linked galactosyl residue from galactooligosaccharides, polymeric galactoglucomannans, and galactolipids. -Galactosidases are widely distributed in animals, plants, and microorganisms, and in plants, -galactosidase is one of the key enzymes in the degradation of the cell wall galactomannan during germination.1) -Galactosidases can be classified into glycoside hydrolase family (GH) 4, 27, 36, 57, 97, or 110, according to CAZY database.2,3) -Galactosidases from eukaryotes have high amino acid sequence homologies and are generally classified into GH27, whereas prokaryotic -galactosidases are mostly classified into GH36. To date, the crystal structures of -galactosidases from rice (Oryza sative), human (Homo sapiens), and fungi (Hypocrea jecorina), which belong to GH27, have been determined.4–6) These enzymes consist mainly of two domains, an N-terminal domain consisting of a ð=Þ8 -barrel with a catalytic site and a C-terminal structural domain. They have been analyzed in complex y
with D-galactose, and details of the substrate-binding mechanism were identified. Besides the -galactosidases, the crystal structures of two -N-acetylgalactosaminidases from chicken (Gallus gallus) and Bacillus halodurans, and -L-arabinopyranosidase from Streptomyces avermitilis are available for GH27.7–9) We have purified -galactosidases from several species and have determined the substrate specificities of each of them.10–13) Umbelopsis vinacea (filamentous fungi, earlier we used the synonym Mortierella vinacea) possesses two types of -galactosidases, and one of which, -galactosidase I (UvGalI), has a molecular mass of 50–56 kDa and 240 kDa when analyzed by SDS– PAGE and gel filtration respectively. The broadness of the SDS–PAGE band suggests that UvGalI might be a heterogeneous glycoprotein.14) In addition, gel filtration data suggest that it might exist as a tetramer in solution. Enzymatic investigation indicated that it showed different substrate specificities toward galactomannan,14,15) as compared to the other -galactosidases. UvGalI is specific only for the terminal -galactosyl residue that is linked to the non-reducing end mannose of mannan, while U. vinacea -galactosidase II and rice -galactosidase are specific for the -galactosyl side-chain of galactomannooligosaccharides, as well as for the terminal -galactosyl residue.11,12) A crystallographic analysis has been started in order to obtain the structural basis of the catalytic mechanism and substrate specificity of the enzyme. The strain of U. vinacea that we used was kindly donated by the Hokkaido Sugar (Tokyo). UvGalI was purified and crystallized as previously reported.16) Crystallization trials of UvGalI yielded two different crystal forms, type 1 and type 2, and structure analyses succeeded for both crystal forms. Their overall structures were similar, and hence we describe the structure analysis here for the type-1 crystal. Diffraction data for native crystals were obtained at ˚ ). beamline BL41XU, Spring-8, Harima ( ¼ 1:0313 A
To whom correspondence should be addressed. Fax: +81-29-838-7877; E-mail: zui@affrc.go.jp Present address: Department of Biotechnology, Graduate School of Engineering, Osaka University, 2-1 Yamadaoka, Suita, Osaka 565-0871, Japan ** Present address: Koibuchi College of Agriculture and Nutrition, 5965 Koibuchi, Mito, Ibaraki 319-0323, Japan Abbreviations: GH, glycoside hydrolase family; UvGalI, Umbelopsis vinacea -galactosidase I; PEG400, polyethylene glycol 400 *
Structure of Umbelopsis vinacea -Galactosidase I
A UvGalI crystal was mounted in a quartz glass capillary 0.3 mm in diameter and then flash-frozen in a nitrogen stream at 95 K. Diffraction data were collected using the MarCCD X-ray detector (Rayonix, Evanston, IL). The native data set was processed and scaled using ˚ resolution. The crystals DPS/MOSFLM17) up to 2.0 A belonged to the tetragonal space group I422, with cell ˚ , c ¼ 130:6 A ˚ (Table 1). dimensions of a ¼ b ¼ 142:5 A Structural analysis was performed by the molecular replacement method with the program MOLREP.18) A structural model of rice -galactosidases (Protein Data Bank code 1UAS, 39% identity with the UvGalI amino acid sequence) was used as a search model.4) The solution was subjected to a cycle of maximum likelihood simulated annealing refinement using the program Refmac5, and model building was done using the auto-modeling program ARP/wARP.19,20) Further model rebuilding was conducted manually with the program QUANTA2000 (Accelrys Software, San Diego, CA) and COOT,21) and the model was refined for several cycles with the program Refmac5.19) During the course of refinement, Fobs {Fcalc maps showed the presence of sugar chains attached to the three N-glycosylation sites. Similarly, bound waters, a Tris (tris(hydroxymethyl)aminomethane) molecule, and polyethylene glycol 400 (PEG400) molecules were identified. The structural refinement statistics are summarized in Table 1. The stereochemistry of the model was analyzed with the program PROCHECK.22) The figures of the ribbon model were drawn with the program Raster3d.23,24) The atomic coordinates of the UvGalI structure have been deposited in the Protein Data Bank through the Protein Data Bank Japan, Osaka University, under code name 3A5V. The crystal structure of UvGalI was determined at ˚ resolution. The asymmetric unit contained one 2.0 A UvGalI peptide chain of 397 amino acid residues, 12 sugar moieties of three N-linked carbohydrate chains, 556 water molecules, and one Tris and four PEG400 molecules. Since the enzyme was produced as a secretory fungal protein, the analyzed structure was solved as a glycoprotein attached by N-linked carbohydrates. The final 2Fobs {Fcalc electron density map was Table 1. Data Collection and Structure Refinement Statistics for UvGalI Values in parentheses are for the highest resolution shell. Data collection Cell parameter (I422) ˚) a, b (A ˚) c (A ˚) Resolution (A No. of unique reflections Multiplicity Mean I=ðIÞ Rmerge (%) Completeness (%) Structure refinement ˚) Resolution (A R-factor Rfree -factor Root-mean-square deviations ˚) Bond lengths (A Bond angles ( )
142.5 130.6 50–2.0 45,293 7.7 8.4 7.5 99.9
(2.11–2.00) (6,451) (6.1) (2.4) (27.9) (99.9)
50–2.0 0.142 0.158
(2.05–2.00) (0.170) (0.204)
0.014 1.291
The Rfree -factor was calculated using 5% of the unique reflections.
2361
observed throughout the peptide chain. Four disulfide bridges (Cys21–Cys53, Cys101–Cys131, Cys201– Cys215, and Cys203–Cys208) were localized in the catalytic domain, and one cis-peptide (Leu349–Pro350) was observed in the C-terminal domain. In a Ramachandran plot, 89.4% of the non-glycine residues were resolved in the most favored regions and 9.7% of the residues were plotted in the additional allowed regions, whereas Cys53 was plotted in the generously allowed region and Asp240 and Asn374 were in the disallowed region, although the electron densities were clearly observed for these residues. After the initial phase was calculated, model building was started according to the amino acid sequence of U. vinacea -galactosidase deduced from the DNA sequence (DDBJ databases accession no. S79440),25) but in the course of refinement, there was some inconsistency between the electron density of the model and the amino acid sequence. Amino acids for the model were assumed according to the electron density, and 60 amino acids were finally modeled to be different from the deduced amino acid sequence. The cause of inconsistency is still unknown. We have made efforts to clone the gene encoding UvGalI, but they have not yet succeeded. The structural model, therefore, contains ˚ some uncertain residual models, due to the 2.0 A resolution electron density. Figure 1 shows the differences between the sequences, aligned with the sequence of rice -galactosidase. Figure 2 shows the structure of UvGalI. The monomer structure was conserved to those of known GH27 -galactosidases. The catalytic domain was comprised of a ð=Þ8 -barrel, and the active site pocket was found on the C-terminal side of the central -strands of the catalytic domain. The C-terminal domain was made up of eight -strands, forming two tandemly repeated Greek-key motives. Compared to rice -galactosidase,4) most of the basic secondary structures in the catalytic domain and the C-terminal domain were well conserved (Fig. 1). The root-mean square difference of C-atoms between UvGalI and rice -galactosidase was calculated ˚ . These structures were found to be in good to be 1.1 A topological agreement and well superimposed, but some insertions and deletions were observed in the loop regions. Among the differences, UvGalI had three remarkable inserted peptides: peptides 137–141 (Ins1) and 196–215 (Ins2) were located around the active cleft of the ð=Þ8 -barrel, and peptide 346–351 (Ins3) was located in the C-terminal domain. The largest peptide, 196–215, which was located after the 6th -sheet of the ð=Þ8 -barrel, was composed of 20 amino acids and contained two disulfide bridges within the kinked peptide (Cys201–Cys215 and Cys203–Cys208), forming a small spherical domain located in front of the catalytic pocket. The catalytic pocket of the enzyme was located on the C-terminal side of the central -barrel of the catalytic domain. The Tris molecule occupied the active site of UvGalI (Fig. 2). Two catalytic residues, Asp129 and Asp189, which act as a nucleophile and an acid/base catalyst respectively, were located on the upper and the lower side of the catalytic pocket with a distance of ˚ inbetween. Most amino acids in the catalytic 6.3 A pocket were conserved with those of rice -galactosi-
2362
Z. F UJIMOTO et al. Catalytic domain
*
Ins1
*
Ins2
#
#
C-terminal domain
Ins3
#
Fig. 1. Sequence Alignment of UvGalI with Rice -Galactosidase (OsGal). Inconsistent amino acids of the deduced sequence from DNA no. S79440 are shown. -Helices and -strands are indicated by coils and allows respectively. The catalytic residues and N-glycosylated asparagine residues are indicated by and # respectively. Three inserted peptides of UvGalI as compared to rice -galactosidase are shown as Ins1, Ins2, and Ins3.
dase. Based on the structure of rice -galactosidase, Trp16 and disulfide bridge Cys101–Cys131 appeared to form hydrophobic surfaces of the pocket, and Cys166, Trp168, and Arg185 appeared to recognize the C-2 hydroxyl group of the galactose molecule. Previous biochemical analyses have indicated that UvGalI appears to form a tetramer structure,11) and preliminary crystallographic studies of it also supported this hypothesis.16) The crystal of UvGalI belonging to space group I422 contains one molecule in the asymmetric unit, but the tetramer can be constructed by four crystallographic 4-fold symmetry-related molecules of itself around the 4-fold axis (Fig. 2B). The tetramer ˚ structure is flat-cylindrical with dimensions of 100 A ˚ in thickness. The tetramer along two edges and is 75 A interfaces are located mainly between the catalytic domains of the adjacent molecules, and are made up by the van der Waals interactions, nine direct hydrogen bond interactions, and some water-mediated interactions. Among the hydrogen bonds, three bonds were observed between the carbohydrate chain N-linked to Asn155 and the adjacent protein molecule. The other N-linked sugar chains were also located on the surface of the tetramer, and they extended to the solvent. These three sugar chains were located around the tetramer interface. The catalytic pocket was buried in the tetramer interface. The side-chain of Trp168, contributing to
make the catalytic pocket, was involved in tetramer ˚ to Lys230 of formation, being in the proximal of 4.5 A the adjacent molecule. Near the catalytic pocket, there was an inserted peptide, 196–215 (Ins2), from which nine residues participated in tetramer interface formation. As shown in Fig. 2B, this insertion gathered around the 4-fold axis and interacted with the self of the adjacent molecules, forming the tetramer interface. Similarly, the other two insertions were also involved in the tetramer interface. Thus, these insertions appear to play an important role in forming a stable tetramer for this enzyme. There was a concave surface in the tetramer interface, and this concave maintained the accessibility of the catalytic pocket. Thus, the enzyme appears to have catalytic activity as a tetramer form, but its substrate specificity might be limited because the catalytic pocket is located at the bottom of the concave surface. Since the structure of the catalytic pocket is well conserved with the other -galactosidases, the unique substrate specificity of UvGalI, that this enzyme is active only for the terminal -galactosyl residue of the galactomannan, can be attributed to the structure of the concave surface. Further structural studies of the enzyme should reveal its catalytic mechanism in detail. Three carbohydrate chains were observed in the UvGalI molecule to be N-linked carbohydrates. NGlycosylated asparagine residues were Asn155, Asn276,
Structure of Umbelopsis vinacea -Galactosidase I
2363
A N155
N155
Ins1
N276
Ins1
N276
Ins2
Ins2
Ins3
Ins3
Cterm
Cterm
N371
B
N371
N276 N371
N155
Ins1
Ins2
Ins3
C C53
C131
C101
C131
D52
D129
C53 C101
D129 Y93
K127 W168 C166 R185
Y93 K127
D51 TRIS
W16
D240
W168 C166 R185
D189
C201
D52
M241
C215
C201
D51 TRIS
W16
D189 D240
M241
C215
Fig. 2. Structure of UvGalI. A, Stereoview of the ribbon model of UvGalI. The catalytic domain and the C-terminal domain are shown in green and yellow respectively. The three inserted peptides are shown in blue. The catalytic residues, disulfide bridges, and N-linked carbohydrate chains are shown in ball-andstick drawings. The bound Tris molecule in the catalytic pocket is shown in a CPK model. B, Tetramer structure of UvGalI. C, Stereoview of the catalytic pocket of UvGalI. The amino acid residues and the bound Tris molecule are shown in ball-and-stick drawings. Hydrogen bonds between UvGalI and Tris are shown as a broken cyan line.
and Asn371 (Figs. 1 and 2). Two of these were located in the catalytic domain, and Asn371 was in the Cterminal domain. The electron density of the carbohydrate chain linked to Asn155 is shown in Fig. 3A, and the derived sugar chain structures are shown in Fig. 3B. The carbohydrate moieties in the sugar chain attached to Asn155 were observed best among the three chains, and the sugar chain structure was determined to be of the high-mannose type. The sugar chain attached to Asn155
was located on the reverse side of the ð=Þ8 -barrel to the catalytic site, extending to the adjacent symmetryrelated molecule, and was involved in the tetramer formation through three hydrogen bond interactions. Sugar chains attached to Asn276 and Asn371 extended to solvent regions, and only two or three carbohydrate moieties were identified. Glycosylation sometimes interferes with crystallization of the protein because of its heterogeneity and
2364
Z. F UJIMOTO et al.
B
A
Man
Man α1,6 α1,3
Man
Man
Man
Man α1,6 α1,3
Man
β1,4
GlcNAc
β1,4
GlcNAc
β1,N
Asn155
Man Man
Man
GlcNAc
β1,4
GlcNAc
β1,N
Asn276
GlcNAc GlcNAc
Man
β1,4
GlcNAc
β1,4
GlcNAc
β1,N
Asn371
Asn155
Fig. 3. Structure of N-Linked Sugar Chain. A, 2Fobs {Fcalc electron density map contoured at 1 for the Asn155-linked carbohydrate chain. B, Structure of the N-glycosylated carbohydrate chain of UvGalI.
flexibility. In this study, however, UvGalI could be crystallized, probably because UvGalI has a stable tetramer structure having a 4-fold symmetry that is applicable to crystal packing. Furthermore, the tetramerization of UvGalI appears to change its substrate specificity and enhance stability. It remains to determine the biological implications of the tetramerization of UvGalI.
9)
10)
11)
12)
Acknowledgments This study was supported in part by a Grant-in-Aid for Scientific Research from Japan Society for the Promotion of Science. We also thank the beamline staff members of SPring-8, Harima, and the beamline staff members of the Photon Factory, Tsukuba, for help with diffraction experiments.
13)
14)
15) 16)
References 1) 2) 3)
4) 5) 6)
7) 8)
Reid JS, Edwards ME, Gidley MJ, and Clark AH, Biochem. Soc. Trans., 20, 23–26 (1992). Henrissat B and Davies G, Curr. Opin. Struct. Biol., 7, 637–644 (1997). Coutinho PM and Henrissat B, ‘‘Recent Advances in Carbohydrate Bioengineering,’’ eds. Gilbert HJ, Davies G, Henrissat B, and Svensson B, The Royal Society of Chemistry, Cambridge, pp. 3–12 (1999). Fujimoto Z, Kaneko S, Momma M, Kobayashi H, and Mizuno H, J. Biol. Chem., 278, 20313–30318 (2003). Garman SC and Garboczi DN, J. Mol. Biol., 337, 319–335 (2004). Golubev AM, Nagem RA, Brandao Neto JR, Neustroev KN, Eneyskaya EV, Kulminskaya AA, Shabalin KA, Savel’ev AN, and Polikarpov I, J. Mol. Biol., 339, 413–422 (2004). Garman SC, Hannick L, Zhu A, and Garboczi DN, Structure, 10, 425–434 (2002). Fujimoto Z, Ichinose H, Harazono K, Honda M, Uzura A, and Kaneko S, Acta Crystallogr. F, 65, 632–634 (2009).
17) 18) 19) 20)
21) 22) 23) 24) 25)
Ichinose H, Fujimoto Z, Honda M, Harazono K, Nishimoto Y, Uzura A, and Kaneko S, J. Biol. Chem., 284, 25097–25106 (2009). Shibuya H, Kobayashi H, Park GG, Komatsu Y, Sato T, Kaneko R, Nagasaki H, Yoshida S, Kasamo K, and Kusakabe I, Biosci. Biotechnol. Biochem., 59, 2333–2335 (1995). Shibuya H, Kobayashi H, Sato T, Kim WS, Yoshida S, Kaneko S, Kasamo K, and Kusakabe I, Biosci. Biotechnol. Biochem., 61, 592–598 (1997). Kim WD, Kobayashi O, Kaneko S, Sakakibara Y, Park GG, Kusakabe I, Tanaka H, and Kobayashi H, Phytochemistry, 61, 621–630 (2002). Ishiguro M, Kaneko S, Kuno A, Koyama Y, Yoshida S, Park GG, Sakakibara Y, Kusakabe I, and Kobayashi H, Appl. Environ. Microbiol., 67, 1601–1606 (2001). Shibuya H, Kobayashi H, Yoshida S, Kaneko S, Park GG, and Kusakabe I, Biosci. Biotechnol. Biochem., 63, 1096–1099 (1999). Suzuki H, Li S, and Li Y, J. Biol. Chem., 245, 781–786 (1970). Fujimoto Z, Kim WD, Kaneko S, Park GG, Momma M, Kobayashi H, and Mizuno H, Acta Crystallogr. D Biol. Crystallogr., 59, 2289–2291 (2003). Rossmann MG and van Beek CG, Acta Crystallogr. D Biol. Crystallogr., 55, 1631–1640 (1999). Vagin A and Teplyakov A, J. Appl. Crystallogr., 30, 1022–1025 (1997). Murshudov GN, Vagin AA, and Dodson EJ, Acta Crystallogr. D Biol. Crystallogr., 53, 240–255 (1997). Cohen SX, Morris RJ, Fernandez FJ, Ben Jelloul M, Kakaris M, Parthasarathy V, Lamzin VS, Kleywegt GJ, and Perrakis A, Acta Crystallogr. D Biol. Crystallogr., 60, 2222–2229 (2004). Emsley P and Cowtan K, Acta Crystallogr. D Biol. Crystallogr., 60, 2126–2132 (2004). Laskowski RA, MacArthur MW, Moss DS, and Thornton JM, J. Appl. Crystallogr., 26, 283–291 (1993). Kraulis PJ, J. Appl. Crystallogr., 24, 946–950 (1991). Merritt EA and Bacon DJ, Methods Enzymol., 277, 505–524 (1997). Shibuya H, Kobayashi H, Kasamo K, and Kusakabe I, Biosci. Biotechnol. Biochem., 59, 1345–1348 (1995).