International Journal of Biotechnology and Biochemistry. ISSN 0973-2691 Volume 7 Number 4 (2011) pp. 463-480 © Research India Publications http://www.ripublication.com/ijbb.htm
Enhanced Biological Activity of Recombinant Human Interferon Alpha Produced in Pichia pastoris using a codon-optimized synthetic cDNA Imen Rabhi1 and Dahmani M. Fathallah1,2* 1
Molecular Biotechnology Group, Institute Pasteur, Tunis, Tunisia 2 Biotechnology Program, King Fahd Chair for Biotechnology, Arabian Gulf University, Manama, Bahrain *Corresponding Author E-mail: Dahmani M. Fathallah:
[email protected],
[email protected] E-mail: Imen Rabhi:
[email protected],
[email protected] Abstract
The development of biotechnologies has spurred the production of recombinant proteins and the development of the biopharmaceutical industry. This industry is now increasingly faced with the challenge of supplying large amounts of high quality recombinant protein-based drugs. Higher production efficiency and the consequent lower costs of the final products are necessary to produce widely affordable products. The strategy consisting in fine-tuning of codon usage has been shown to improve the production yield of recombinant proteins while more evidences are showing that this strategy may as well affect the biological activity of the recombinant proteins. We hypothesized that if these two issues can be achieved simultaneously, i.e. the production of a given protein at a higher yield with enhanced biological activity, this would result in the production of more doses of drug from a given amount of material then the same amount that has a lower specific activity. Therefore the manufacturers of the protein-based drugs can significantly lower their costs and the drug price decrease consequently. Toward this end we have developed and carried out a gene optimization strategy to enhance the production yield and the biological activity of the human IFND in the yeast Pichia pastoris. Using a PCR assembly approach we have modified the native human interferon D2a sequence and constructed several clones according to the preferred codon-usage of P. pastoris. Comparison of the production yields from the modified sequences and the native cDNA showed a 3-fold increase when using a fully synthetic interferon
464
Imen Rabhi and Dahmani M. Fathallah cDNA with the preferred P. pastoris codon bias. Interestingly, recombinant interferon Dproduced from this clone exhibited a specific activity (4.48x108 IU/mg) that was 2.85 times higher than that of recombinant interferon Dproduced from the native cDNA sequence. Our data show that by using a synthetic cDNA sequence optimized for the P.pastoris codon bias, higher product yield can be achieved, and, more importantly, recombinant human interferon D has higher biological activity. This biotechnology-based strategy can contribute significantly to lowering the price of the IFND drug and similarly other recombinant protein-based biopharmaceuticals. Keywords: codon-bias, recombinant Interferon, biological activity, Pichia pastoris
Introduction Most of the 600 biopharmaceuticals currently on the market are recombinant proteins. A variety of strategies are being used by designers and manufacturers of biopharmaceuticals to optimize production yield. One such method is the use of recombinant protein production systems such as those involving the bacterium Escherichia coli and the yeasts Saccharomyces cerevisiae and Pichia pastoris [1] . Early in the drug development phase, optimization strategies use genetic engineering and molecular biology techniques, whereas in later development stages, they use approaches such as high cell density cultivation, manipulation of the culture environment, downstream processing and engineering of the fermentation process [2]. Gene optimization strategies are based on the codon bias found in almost all species. This bias exists among the 61 amino acid codons found in mRNA molecules, and the levels of aminoacyl-tRNAs seem to correlate with the frequency of codon usage [3,4]. Therefore, one would expect to observe the impaired translation of an abundant mRNA species containing an excess of rare codons with low aminoacyl-tRNA levels. Such a situation arises after the initiation of the transcription of cloned heterologous genes in E. coli and yeast [5, 6]. Codons that are common in some species may be rare in others; therefore, expression can be limited by the amount of available aminoacyltRNA in the host cell. The premature termination of translation can occur when specific aminoacyl-tRNAs are depleted, and transcription can be terminated if the DNA has a high proportion of A and T bases [5, 6]. The E. coli and P. pastoris expression systems are currently the most widely-used systems for the large-scale production of various recombinant proteins of pharmaceutical interest. Recent developments with respect to the P.pastoris system have impacted not only the protein expression levels that can be achieved but also the quality of the heterologous proteins produced [7, 8]. When expressed in yeast, however, the codons of a human gene may not be optimal for the high expression of recombinant protein. Gene optimization strategies call for alterations in codon-usage and an increase in the proportion of G and C bases to improve expression levels. The consensus sequence that initiates translation also needs to be optimized.
Enhanced Biological Activity of Recombinant
465
Gene optimization strategies have been used successfully to express several human genes in Escherichia coli and P. pastoris including huIFND[9,10] The expression of the hepatitis B virus e antigen using a cDNA with a codon bias reflecting that of P. pastoris, resulted in 5 times higher protein expression levels [11]. The codon optimization of the human DNA sequence encoding glucocerebrosidase, a protein used to treat Gaucher disease, resulted in a 10.6-fold increase in the amount of protein expressed in P. pastoris [12]. In another report, the optimization of a mycobacterial cDNA enhanced the Immunogenicity of a DNA Vaccine Encoding for antigen Ag85B [13]. While the use of gene optimization to improve production yield is well documented, there are reports showing the effects of synonymous codon change on the biological activity of proteins [14,15,16]. Here, we describe a gene optimization method in P. pastoris that improved not only the production yield but also the biological activity of recombinant human interferon D2a (huIFND2a). The yields achieved with partially optimized cDNAs were approximately 1.5 to 1.7 times higher than that achieved with the wild type cDNA, while a fully synthetic (FS) cDNA displayed a 3-fold higher yield and a 2.85fold higher specific activity than the wild type cDNA.
Materials and Methods Strains and plasmids The Top10fF’ strain of E. coli (recA- , endA-) (Invitrogen, Groningen, Netherlands) was used as the host strain for cloning experiments. The P.pastoris host strain was MutS KM71H (aox1::ARG4). The pPICZDA plasmid (Invitrogen, Groningen, Netherlands) was used for expression in P. pastoris. Composition of bacterial medium Low-salt (< 90 mM) Luria-Bertani (LB) medium containing 1% tryptone, 0.5% NaCl, 0.5% yeast extract and Zeocin at a final concentration of 25 µg/ml was used at a pH of 7.5. Composition of media for shake flask production Yeast extract-peptone-dextrose (YPD) medium was used containing 2% peptone, 1% yeast extract and 2% dextrose. To make YPDS medium, YPD was supplemented with 1 M sorbitol and Zeocin at a final concentration of 100 µg/ml. Buffered minimal glycerol-complex medium (BMGY) was prepared using 2% peptone, 1% yeast extract, 1% glycerol, 1.34% yeast nitrogen base (YNB) with ammonium sulfate but without amino acids and biotin in 100-mM potassium phosphate buffer at pH 6.0. Buffered minimal methanol-complex medium (BMMY) had the same composition as BMGY medium, except 1% methanol was used instead of glycerol. Construction of the recombinant pPICZD$rWHuIFND2 expression vector The human interferon D2b cDNA was cloned as described by Rabhi-Essafi et al., 2007. The cDNA was amplified using the F/IFND2 forward primer designed to
466
Imen Rabhi and Dahmani M. Fathallah
introduce an EcoRI site at the 5’ end of the gene (Table 1) and the R/IFND2 reverse primer designed to introduce a NotI site and a TGA stop codon at the 3’ end of the gene (Table 1). The 498-bp RT-PCR product was purified and cut with EcoRI and NotI, then inserted into the 3.6-kb plasmid pPICZD$ (Invitrogen, Groningen, the Netherlands) to generate the pPICZD$rhuIFND2a expression vector. Transformed E. coli Top10F’ (recA-/endA-) clones were selected on low-salt LB medium with 25 µg/ml Zeocin. The pPICZD$rhuIFND2a recombinant plasmid containing the cDNA sequence encoding human IFND2 was isolated by colony PCR using the 5’ F/IFND2a forward and 3’ R/IFND2a reverse primers according to procedures described by Ausubel et al., 2002 and by restriction analysis using the BglII restriction enzyme as recommended by the manufacturer (Amersham Biosciences, Athens, Greece). Finally, the nucleotide sequences of the positive clones were confirmed by DNA sequencing using an ABI PRISM 377 DNA sequencer (Perkin Elmer Applied Biosystems) and the Aox1/F and Aox1/R primers from the EasySelect Pichia expression kit (Invitrogen). Table 1: Primers used for the construction of optimized human interferon alpha cDNA sequences. Primer designation 1. FIFND2b 2. RIFND2b 3. Fa1 4. Fb1 5. Fc1 6. FfgsI 7. RfgsI 8. Fa2 9. Fb2 10. Fc2 11. FfgsII 12. RfgsI
Sequence (5’ end
…
….3’ end) TGGAATTCTGTGATCTGCCTCAAACCCA ATTCTGCGGCCGCTCATTCCTTACTTCTTAAACTTTC TGGAATTCTGTGATTTGCCTCAAACCCACTCCTTGGGT TCCAGAAGAACCTTGATGTTGTTGGCACAGATGAGAA AAATCTCTTTGTTCTCC AAACTCCTCCTGTGGAAATCCAAAGTCATGTCTGTCCTTCAA GCAGGAGAACAAAGAGAT CCACAGGAGGAGTTTGGCAACCAGTTCCAAAAGG CTGAAACCATCCCTGTCTTGCATGAGATGATCCAGCAGATCTTC TGGAATTCTGTGATTTGCCTCA GAAGATCTGCTGGATCATCTC CAGATCTTCAATTTGTTCTCCTCCACAAAGGACTCTTCTG CTGCTTGGGATGAGACCTTGTTGGACAAATTCTACACTGA ATTGTACCAGCAGTTGAATGAC GTATTTTCTAACAGCCAAAATGGAGTCCTCCTTCATCAA TGGAGTCTCTGTAACACCAACACCCTGGATAACACAGGC TTCCAAGTCATTCAACTGCTG GCTGTTAGAAAATACTTCCAAAGAATCACTTTGTATTT GAAAGAGAAGAAATACTCCCCTTGTGCCTGGGAGGTTG TCAGAGCAGAAATCATGAGATCT TGAAGATCTTCAATTTGTTCTCCAC TGAAGATCTCATGATTTCTGCTCTGA
Enhanced Biological Activity of Recombinant
467
Design of P. pastoris codon-optimized human interferonD2 cDNA clones (WS, SW and FS) The wild-type IFND2a nucleotide sequence was analyzed with Graphical Codon Usage Analyzer software, which uses the nearest-neighbor method, available at http://www.geneart.degcua.schoedl.de/ or from ExPASy Proteomics tools at www.Swissprot.org (Fig. 1).
Figure 1: Codon analysis of the native (wild-type) cDNA sequence encoding human IFND2 using Graphical Codon Usage Analyzer software. The human IFND2a sequence was used as a template to design the first huIFND2a synthetic fragment by PCR assembly using the synthetic primers Fa1, Fb1 and Fc1 (Table 1) and two external primers, 5’ forward FfgsI and 3’ reverse RfgsI. The Fa2, Fb2 and Fc2 primers and two external primers, 5’ forward FfgsII and 3’ reverse
468
Imen Rabhi and Dahmani M. Fathallah
RfgsII, were used to generate the second synthetic human IFND2 cDNA fragment. The purified 200-bp DNA fragment corresponding to the first huIFND2a synthetic fragment was cloned into the recombinant pGX4T1/rhuWIFND2a plasmid (Fig. 2A), which was previously cut at the 5’ EcoRI and 3’ BglII restriction sites, generating a plasmid containing only the 48 bp corresponding to the 16 amino acids of the huIFND2a C-terminus. According to the codon bias of P. pastoris, the 16 C-terminus amino acids of huIFNDa do not contain any rare codons. Isolation of the recombinant P1 plasmid (pGEX4T1/huIFND2a, containing the first synthetic fragment) was performed by restriction analysis using BglII. Finally, sequences of the positive clones were confirmed by DNA sequencing using the ABI PRISM 377 DNA sequencer (Perkin Elmer Applied Biosystems, PaloAlto,CA ,USA) and the 5’ pGEX primer.
Figure 2A: P1 construct (8 bp EcoRI+183 bp synthetic IFND2 Fragment 1+9 bp BglII= 200 pb The second 267-bp huIFND2a synthetic fragment was introduced into the P1 recombinant plasmid cut with BglII. Isolation of the P2 recombinant plasmid clone in pGEX4T1, which contained the huIFND2a synthetic fragment 2 inserted in the correct orientation, was performed by colony PCR using the 5’ FfgsI forward and 3’ RfgsII reverse primers (Table 1) and by restriction analysis using EcoRI and XhoI as recommended by the manufacturer (Amersham Biosciences, Athens, Greece). Full-
Enhanced Biological Activity of Recombinant
469
length synthetic huIFND2a was cut with EcoRI and NotI and inserted into the pPICZD$ plasmid to generate the pPICZD$FS-huIFND2a expression vector. Transformed E. Coli Top10F’ (recA-/endA-) clones were selected on low-salt LB medium with 25-µg/ml Zeocin. Isolation of the pPICZD$synthetic huIFND2 recombinant plasmid containing the cDNA sequence encoding synthetic human IFND2a was performed by colony PCR using the 5’ FfgsI forward and 3’ RIFND2 reverse primers (Table 1) as described in reference [17] and by restriction analysis using EcoRI and NotI as recommended by the manufacturer (Amersham Biosciences, Athens, Greece). Finally, the nucleotide sequences of positive clones were confirmed by DNA sequencing using the Aox1/F and Aox1/R primers from the EasySelect Pichia expression kit (Invitrogen, Groningen, the Netherlands). To generate chimeric clones with a half synthetic and half wild-type IFND nucleotide sequence (Fig. 2D), the pGEX4T1/SynrhuIFND2a synthetic full-length construct (P2 vector) was used as a template as follows. For the SW clone, the 267-bp DNA fragment corresponding to the second rhuIFND2 wild-type fragment was cut with BglII and inserted into the pGEX4T1/SynrhuIFND2a synthetic full-length recombinant vector (P2 vector) cut with the same restriction enzyme. For the WS chimeric clone (Fig. 2D), the second 267-bp huSynIFND2 synthetic fragment was cut with BglII and cloned into the pGEX4T1/rWhuIFND2a wild-type full-length vector cut with the same restriction enzyme.
Figure 2D: Construction of the chimeric SW IFN and WS IFND clones.
470
Imen Rabhi and Dahmani M. Fathallah
Shake flask production of secreted rhuIFND2 A single colony of pPICZD$rHuIFND2 MutS KM71H P.pastoris transformants was used to inoculate 50 ml of BMGY medium in a 500-ml flask and then incubated at 30°C with shaking at 250 rpm for 24 h. Cells were pelleted by centrifugation at 3,000 g for 10 min, resuspended in 5 ml of BMMY medium containing 0.5% methanol and incubated at 30°C with shaking at 250 rpm. This induction was repeated every 24 hours at a concentration of 1% methanol directly after addition. After 48 h of induction, cells were harvested by centrifugation at 10,000 g for 20 min at 4°C, and the supernatant was collected and stored at -20°C for protein expression analysis. Electroporation of pPICZD$rhuIFND2 cDNAs into P. pastoris MutS KM71H Recombinant pPICZD$huIFND2a expression vectors were propagated in Top10F’ (recA-endA-) E. coli in the presence of 25-µg/ml Zeocin. Plasmids were isolated from the transformed E. coli clones using a Qiagen plasmid miniprep kit. (Heidelberg, Germany) Each of the pPICZD$-modified huIFND2 plasmids (FS, WS and SW) was cut according to the manufacturer’s instructions using SacI, which does not cut within the modified IFND2 cDNA. pPICZD$ containing the wild-type rW-huIFND2 was cut with BstX1. The cut fragments were purified by phenol/chloroform/IsoAmyl Alcohol purification following standard protocols. Ten micrograms of linearized, recombinant plasmid DNA was used to transform the KM71H (aox1::ARG4) P. pastoris strain by electroporation (Bio-Rad Gene Pulser, 1500 volts charging voltage, 25-µF capacitance, and 200 ohms resistance) as described in the EasySelectTM Pichia Expression Kit manual (Invitrogen). After 3 days of incubation at 30°C on solid YPDS medium containing Zeocin at 100 µg/ml, 20 KM71H MutS transformants were retained for further study. Analysis of rhuIFND protein expression Culture supernatants were analyzed by SDS-PAGE. Electrophoresis was performed in a 15% SDS-polyacrylamide gel that was then stained with Coomassie Brilliant Blue. Recombinant fusion proteins were specifically detected by western blot analysis using an ECL kit (Amersham Biosciences, Athens, Greece) according to the manufacturer’s instructions with a 1:400 dilution of an anti-human IFNĮ monoclonal antibody (Endogen Searchlight, U.S.) followed by a peroxidase-conjugated anti-goat/sheep IgG monoclonal secondary antibody (Sigma Aldrich, Germany). Image J software was used to compare hurIFND2a protein expression after codon optimization. Finally, the concentration of rWT-huIFND2a compared to synthetic rhuIFND2 expressed from several Pichia pastoris recombinant clones was determined by a quantitative ELISA developed in-house. Dilution series containing 0 to 570 pg of HPLC-purified soluble IFNĮ2 produced in our laboratory were included in each assay to generate a standard curve. Recombinant proteins were detected using a biotin-labeled anti-human IFNĮ monoclonal antibody (ENDOGEN Searchlight, US) and a colorimetric detection system with a streptavidin-horseradish peroxidase (HRP) conjugates (Amersham Biosciences, Athens, Greece).
Enhanced Biological Activity of Recombinant
471
Biological activity of rhIFND2 The biological activity of recombinant huIFND2a was determined using an antiviral assay as described in reference [17]. This assay is based on the ability of huIFND2a to inhibit the cytopathic effects of encephalomyocarditis virus (EMCV) on the glioblastoma cell line 2D9. HEK 293P cell lines stably transfected with an IFNinducible promoter sequence (ISRE) linked to the secreted alkaline phosphatase (SEAP) gene were used to perform a reporter gene assay. One unit of activity was defined as the amount of recombinant hIFND2a required to produce antiviral activity equivalent to that of 1 IU of the huIFND2 reference standard (code: 95/566; Division of Immunobiology; National Institute for Biological Standards and Control,[NIBSC] Potters Bar, UK). IFN potency values were calculated using an in-house parallel line displacement program with the IFN 95/566 standard as the primary calibrator. Potency values were statistically analyzed using Prism software developed at NIBSC. For each recombinant IFN preparation, the assay was performed in duplicate.
Results The design and construction of modified synthetic codon-optimized human interferonD2a cDNAs The design strategy for the fully synthetic clone (FS-huIFND2) The sequence of the wild-type human IFND2 cDNA (WT-IFND2a) was analyzed using Graphical Codon Usage Analyzer software, which uses the nearest-neighbor method. This analysis allowed the identification of codons in the native IFND sequence rarely used by P. pastoris, equaling 25% of the total number of codons (Fig. 1). To construct a fully synthetic human IFND2a cDNA, we designed a cloning approach based on synthetic overlapping primers and a PCR assembly strategy. The detailed steps of the cloning strategy are shown in Fig. 2A, B and C.
Figure B: The P2 recombinant plasmid construction strategy (7 bp BglII+254 bp synthetic IFND2 Fragment 2+6 bp BglII= 267 bp)
472
Imen Rabhi and Dahmani M. Fathallah
Figure 2C: Pichia pastoris plasmids for optimized expression of interferon D cDNA Data from the codon sequence analysis were used to design synthetic primers Fa1, Fb1, and Fc1 and to generate the first huIFND2a synthetic fragment (fragment 1) by PCR assembly (Fig. 2A). Primers Fa2, Fb2, and Fc2 were used to generate the second synthetic human IFND2 cDNA fragment (fragment 2) (Fig. 2B). Gene synthesis and fragment assembly were carried out using the PCR method described in Fig. 2A and B. The purified 186-bp DNA fragment corresponding to the first huIFND2 synthetic fragment was cloned into pGX4T1/rhuIFND2a (wild type cDNA sequence) previously cut at the 5’ EcoRI and 3’ BglII restriction sites to generate a plasmid containing only the 48 bp corresponding to the 16 residues at the huIFND2 COOH terminus. Based on the codon usage bias of Pichia pastoris, the coding sequence for these 16 residues does not contain rare codons. The isolation of the P1 recombinant plasmid (pGEX4T1 containing the synthetic fragment corresponding to the C-terminal half of human IFND2) (Fig. 2A) was performed by restriction digestion with BglII. Ten clones integrated the first synthetic fragment. Finally, the sequences of the positive clones were confirmed by DNA sequencing. The second 268-bp huIFND2a synthetic fragment obtained by PCR assembly was introduced into the P1 recombinant plasmid cut with BglII. The recombinant clones that had integrated this fragment in the correct orientation were identified by colony PCR using the 5’ F/fgsI and 3’ R/fgsII primers as described by Ausubel et al., 2000. The integrity of the full-length rhuSynIFND2a sequence was checked by DNA sequencing and restriction analysis using EcoRI and XhoI. Full-length synthetic huSynIFND2a was cut with EcoRI and NotI and inserted into the pPICZD$ plasmid (Invitrogen) to generate the pPICZD$rSynHuIFND2a expression vector (Fig. 2C). Chimeric clones: A half synthetic, half wild type (SW and WS) design strategy The SW chimera was designed to have an optimized synthetic N-terminal half sequence (up to codon TTC192) and a wild-type C-terminal half sequence (Fig. 2D). Clone SW was obtained by cloning the SW chimeric sequence between the EcoRI and
Enhanced Biological Activity of Recombinant
473
NotI restriction sites in the pPICZD$ plasmid (Invitrogen), creating the pPICZD$rSWhuIFND2a expression vector. The WS chimera was designed to have a wild-type N-terminal half sequence (up to codon TTC192) and a synthetic C-terminal half sequence (Fig. 2D) and was obtained by cloning the WS chimeric sequence between the EcoRI and NotI restriction sites in the pPICZD$ plasmid, creating the pPICZD$rWShuIFND2a expression vector. The selection of the pPICZD$rSW-huIFND2 and pPICZD$rWS-huIFND2a recombinant plasmids (SW and WS, respectively) containing the cDNA sequence encoding the human IFND2a chimera was performed by DNA sequencing using the 5’ AOXF and the 3’ AOXR primers, respectively. Isolation of clones producing high protein levels To isolate clones producing high protein levels, we used a rapid and direct method based on growth on increasing concentrations (500, 1000 and 2000 µg/ml) of Zeocin [18]. We assumed that clones resisting the highest concentration of antibiotic [2000 µg/ml of Zeocin] have similar copy number of IFND cDNA as consensually admitted by the community of P.pastoris experts [5, 18,19, 20]. The clones producing the highest protein levels were selected by the visual comparison of the band intensities of the culture supernatants subjected to either SDS-PAGE and Coomassie Brilliant Blue staining (Fig. 3A) or western blot analysis (Fig. 3B). Three rWT-huIFNDD clones 7, 13 and 14 were able to grow on plates with 2,000 µg/ml Zeocin. These clones were identified as the highest producing clones in comparison to 12 other clones that were resistant to a high concentration of Zeocin [clones 1, 2, 3, 4, 6, 7, 9, 10, 12, 13, 14, and 20]. One of the three clones producing the highest rFW-huIFND protein levels, clone 7, was selected to be used as a wild-type huIFND2 expression reference clone. Clones expressing the highest levels of human IFNDa protein from the fully synthetic FS, SW and WS chimeras were selected in a similar fashion. For the clones producing high protein levels, expression levels correlated with Zeocin resistance.
Figure 3: Screening for P. pastoris clones producing high levels of recombinant wildtype (rW) huIFND.
474
Imen Rabhi and Dahmani M. Fathallah
To analyze the effect of gene dosage on the protein expression level of rhuIFND2a, a total of nine recombinant P. pastoris clones were picked from plates containing 2,000-µg/ml Zeocin. These clones included three fully synthetic FSrhuIFND2a clones (10S, 11S, and 12S), three SW rhuIFND2a clones (4SW, 11SW, and 18SW) and three WS rhuIFND2a clones (8WS, 14WS, and 20WS). Clones C8S and C15S were only resistant to 100 µg/ml Zeocin and showed the lowest expression levels. Clone C8S, one of the clones with the lowest protein expression, was used as an internal ELISA control. A visual comparison of the band intensities was performed by a western blot analysis (Fig. 4) of the culture supernatants from FS-rhuIFND2a (clones 10S and 11S), SW rhuIFND2a (clones 11SW and 18SW) and WS rhuIFND2a (clones 8WS and 20WS).
Figure 4: Western blot analysis of the effects of gene modifications on rhuIFND2 protein levels. Quantitative analysis of the effects of gene modifications on rhuIFND2 expression We evaluated the effects of codon optimization on protein expression levels using the clones with the highest protein expression from each category (WT, FS, WS and SW). The levels of rhuIFND2a protein expression were determined using a quantitative ELISA developed in-house (Fig. 5). HPLC-purified, soluble IFNĮ2a produced in our laboratory was used to generate a standard curve. In addition, two internal ELISA controls were used. Western blot analysis (Fig. 6A) was performed for rhuIFND2a protein expressed from the wild-type cDNA (clone C7W) and the optimized cDNAs (FS, WS and SW) and analyzed by quantitative densitometry using ImageJ software (Fig. 6B).
Enhanced Biological Activity of Recombinant
475
Figure 5: Analysis of the effects of gene modification on the expression level of recombinant rhuIFND2 by quantitative ELISA.
Figure 6: Image J analysis of the effects of gene modifications on rhuIFND2 protein expression levels. Both the quantitative ELISA and ImageJ western blot analysis showed the same significant improvement in the rhuIFND2 expression level after optimizing the cDNA sequence for the codon bias of P. pastoris. The yields from the WS and SW clones were 1.5 and 1.7 times higher, respectively, than the yield from the wild-type clone, and the FS clone gave a 3-fold higher yield (750 mg/L) than the wild-type clone.
476
Imen Rabhi and Dahmani M. Fathallah
Biological activity of soluble huIFND2 expressed in a P. pastoris host The biological activity of each purified recombinant huIFND2a protein (rWThuIFND2a and rSynhuIFND2a) produced in P. pastoris was determined using antiviral and reporter gene assays [18]. The two recombinant IFN preparations were calibrated against the WHO IFNĮ international standard (code: 95/566), [17]. The results of the reporter gene assay fit a sigmoidal dose-response curve with IFN concentration (the log of the reciprocal of the IFN dilution) plotted against absorbance. Using the linear portion of the curve, a parallel line displacement program determined the concentration of interferon in a sample by comparing the responses for the test and reference solutions using statistical methods for parallel line assays. Prism software was used to calculate and compare the standard error values of the IFN (95/966) standard preparation and recombinant huIFNalpha2 for each experiment. The results consistently showed that compared to the IFN WHO international standard (a specific activity of 1.4x108 IU/mg (Meager, 2002), the recombinant IFNDa produced from the fully synthetic clone (FS) had an average specific activity of 4.48 x108 IU/mg (+/- 0.140), and the recombinant IFN produced from the native sequence (FW) had an average specific activity of 1.57x108 IU/mg (+/- 0.214). The specific activity of the rSynhuIFND2a clone was 2.85 times higher than that of the rWThuIFND2a clone and 3.20 times higher than that of the WHO IFNĮ international standard, (p 0.05).
Discussion Several strategies have been developed to increase the production of recombinant proteins in different host expression systems, particularly for P. pastoris and E. coli [5, 9, 19, and 20]. The concept that organisms display a non-random pattern of synonymous codon-usage has been confirmed by the explosion of sequence data available from recent genome sequencing projects. Indeed, all organisms investigated to date have shown a general bias toward a subset of the 61 possible sense codons. Although there are several hypotheses to explain the origin of this bias, a model involving a selection for translational efficiency has been well-supported in prokaryotes, unicellular eukaryotes, and, to a lesser extent, insects [21, 22, and 23]. The optimization of coding sequences toward the codon bias of the host has led to an increase in heterologous protein production in a variety of host cell types [24, 25, and 26]. However the effect of such codon optimization strategy on the biological activity of recombinant human IFND, has not been investigated previously. In our study, we used cDNA codon optimization to improve the yield of rhuIFND in P. pastoris. We adjusted the codons in the human interferon D2a cDNA according to the codon bias of P. pastoris and examined the effects on the biological activity of the recombinant protein. We designed and constructed four IFND2a cDNA, one identical to the native sequence [wild-type], one fully synthetic sequence [FS] adjusted to the Pichia pastoris codon bias and two partially adjusted chimeric clones (SW and WS). In the SW clone, the synthetic covered half of the IFN coding sequence starting at the 5’
Enhanced Biological Activity of Recombinant
477
end, whereas in the WS clone, the synthetic sequence covered the coding sequence starting at codon TTC192 and extending to the stop codon. The recombinant IFND produced from the different clones had the same molecular weight as that produced from the native human cDNA sequence and was specifically recognized by a monoclonal anti-human IFND antibody by western blot analysis. The latter result indicates that the antigenic determinants of the protein were conserved. The comparisons of the production yields showed that the yields of the partially modified IFND clones (SW and WS) were 1.5 to 2 times higher than that of the wildtype clone. In addition, the fully synthetic clone gave a yield that was 3 times higher (750 mg/L) than that of the wild-type clone. Compared to data in the literature, a 3fold improvement in expression may not appear noteworthy, probably due to the prior optimization of the sequence used to initiate translation and the relatively high amount (250 mg) of IFNDa produced by the native cDNA in shake flasks. However, these results represent a significant improvement in expression, especially considering that further improvement may be achieved using optimization strategies at later stages. The biological activity of these rhuIFN2Da preparations was determined and compared to the WHO IFND international standard. Surprisingly, the IFNDproduced from the fully synthetic cDNA had consistently shown a higher specific activity (4.48 x108 IU/mg) than that of the IFNDproduced from the native cDNA (1.570x108 IU/mg). Thus, within the accuracy limits of the biological activity test, codon optimization improved not only the production yield but also the specific activity of IFN. Additional data regarding the efficiency of transcription and translation and their eventual effects on rIFND folding are needed, however, to further examine this difference in specific activity. Both high structural similarity between a recombinant protein and its native counterpart and high biological activity are of paramount importance in the approval of biosimilar drugs (or follow-up proteins) by the pharmaceutical regulatory authorities.
Conclusion
Here, we demonstrated that using an interferon D cDNA sequence that fully complies with the P. pastoris codon-usage can improve the production yield and, more importantly, the biological activity of recombinant human interferon D. This biotechnology-based strategy can be applied in the design and manufacturing of other recombinant protein-based biopharmaceuticals to lower the production costs and thus the prices of this important class of new drugs.
List of abbreviations P. pastoris E. coli IFND huIFND=
Pichia pastoris Escherichia coli Interferon alpha human interferon alpha
478 rhuIFND= YPD BMGY BMMY rhuWIFND2 huSynIFND2 SW WS FS WT W S Aox1 EMCV ISRE SEAP gene
Imen Rabhi and Dahmani M. Fathallah recombinant human interferon alpha Yeast extract-peptone-dextrose medium minimal glycerol-complex medium buffered minimal methanol-complex medium recombinant human interferon alpha 2 wild-type cDNA sequence human synthetic interferon alpha 2 5’ half synthetic-3’ half wild-type human interferon alpha 2 cDNA sequence 5’ half wild-type-3’ half synthetic human interferon alpha 2 cDNA sequence fully synthetic human interferon alpha 2 cDNA sequence Wild-type human interferon alpha 2 cDNA sequence Wild type Synthetic Alcohol oxidase 1 encephalomyocarditis virus IFN-inducible promoter sequence secreted alkaline phosphatase gene
Acknowledgement We thank Dr Anthony Meager, Immunobiology Division, National Institute for Biological Standards and Control, Potters Bar, UK, for his help in analyzing the biological activity of rhIFND2. The detailed IFN production process, for the work reported in this paper is described in the International patent N°: WO 2007/099462 A2.
References [1] Schmidt, F.R., 2004, "Recombinant expression systems in the pharmaceutical industry," App, Microb. Biotech., 65, (4), pp.363-72. [2] Graumann, K.P., and Premstaller, A., 2006, "Manufacturing of recombinant therapeutic proteins in microbiological system," Biotech J.,1, pp.164-86. [3] Shah, P., Gilchrist, M.A. 2010, "Effect of correlated tRNA abundances on translation errors and evolution of codon usage bias," PLoS Genetics, 6 (9) pp. e1001128 [4] Shardiwal, R.K., Sohrab, S.S., 2010, "A more elaborative way to check codon quality: an open source program," Inter. J. of Bioinfor. Res. App., 6 (3), pp.223-229. [5] Daly, R., Hearn, M.T.W., 2005, "Expression of heterologous protein in Pichia pastoris: a useful experimental tool in protein engineering and production," J. Mol. Recog., 18 (2), pp.119-138.
Enhanced Biological Activity of Recombinant
479
[6] Sreekrishna, K., "et al1", 1997, "Strategies for optimal synthesis and secretion of heterologous proteins in the methylotrophic yeast Pichia pastoris. Gene 190, pp.55-62. [7] Benabdesselem, C., "et al", 2006, "Enhanced Patient Serum Immunoreactivity to Recombinant Mycobacterium tuberculosis CFP32 Produced in the Yeast Pichia pastoris Compared to Escherichia coli and Its Potential for Serodiagnosis of tuberculosis," J. Clin. Microb., 44, (9), pp.3086–3093. [8] Benabdesselem, C., "et al", 2007, "High Level Expression of Recombinant Mycobacterium tuberculosis Culture Filtrate Protein CFP32 in Pichia pastoris. Mol. Biotech., 35, pp.41-49. [9] Rabhi-Essafi I., "et al", 2007, "A strategy for high-level expression of soluble and functional human interferon alpha as a GST-fusion protein in E.coli," Prot.Eng. Des. Sel. 20(5), pp.201-9. [10] Retnoningrum, D.S., "et al", 2010, "Construction of synthetic open reading frame encoding human interferon alpha 2b for high expression in Escherichia coli and characterization of its gene product," J. Biotech., 145(2), pp.193-8. [11] Li, Z.,"et al ", 2008, "Optimization of the expression of hepatitis B virus e gene in Pichia pastoris and immunological characterization of the product" J. Biotech,. 138 (1), pp.1-8. [12] Sinclair, G., Choy, F.Y.M., 2002, "Synonymous codon usage bias and the expression of Human gluccocerebrosidase in the methylotophic yeast, Pichia pastoris," Prot Expr.Purif., 26, pp.96-105. [13] Ko H.J., "et al", 2005, "Optimization of codon usage enhances the immunogenicity of a DNA vaccine encoding mycobacterial antigen Ag85B," Infect and Immun., 73(.9), pp.5666-5674. [14] Komar, A.A.,"et al", 1999, "Synonymous codon substitutions affect ribosome traffic and protein folding during in vitro translation. FEBS Lett., 462, pp.387391. [15] Kimchi-Sarfaty, C., "et al", 2007, "A "Silent" Polymorphism in the MDR 1 Gene Changes Substrate Specificity," Science, 315, pp.525-528. [16] Sauna, Z.E., "et al", 2007, "Silent Polymorphisms Speak: How They Affect Pharmacogenomics and the Treatment of Cancer," Cancer.Res., 67(20), pp.9609-9612. [17] Ausubel, F.M.,"et al", 2002, "Current protocols in Molecular Biology," WileyInterscience ISBN 978-0-471-50338-5. [18] Meager, A., 2002, "Biological assays for interferons," J.Immun.Meth., 26, pp.121-136 [19] Higgins, D.R.,"et al", 1998, "Small vectors for expression based on dominant drug resistance with direct multicopy selection," Meth.Mol.Bio., 103, pp.41-53. [20] Cregg, J.M.,"et al", 2000, "Recombinant protein expression in Pichia pastoris," Mol. Biotech. ,16(1), pp.23-52. [21] Sorensen, H.P., Mortensen, K.K., 2005, "Soluble expression of recombinant proteins in the cytoplasm of Escherichia coli," J. Biotech., 115, pp.113-128. [22] Bartolomé, C., Charlesworth, B., 2006," Genet.,174 (4), pp.2033-44.. [23] Vicario, S., "et al", 2008, " Molecular Biology Evolution. 25 (11), pp.2269-77.
480
Imen Rabhi and Dahmani M. Fathallah
[24] Supek, F., Muc, T., 2010,"On relevance of codon usage to expression of synthetic and natural genes in Escherichia coli.,"Genet., 185 (3), pp.1129-34. [25] Cid-Arregui, A., "et al", 2003. "A synthetic E7 gene of human papillomavirus type 16 that yields enhanced expression of the protein in mammalian cells and is useful for DNA immunization studies. J. Virol.,77 (8), pp.4928-37. [26] Lee S.F., "et al", 2009, "," Microbio., 155, pp.3581-8. [27] Rosano, G.L., Ceccarelli, E.A., 2009, "Rare codon content affects the solubility of recombinant proteins in a codon bias-adjusted Escherichia coli strain," Micro.Cell Fact. 8 (41) , pp.86-94