The Evolution of the Hh-Signaling Pathway Genes - Semantic Scholar

2 downloads 39 Views 507KB Size Report
Presumably, the appearance of precisely this novel, in principle, evolutionary strategy was the cause of the Cambrian explosion of Eumetazoa and especially.
In Silico Biology 7 (2007) 333–354 IOS Press

333

The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study Konstantin V. Gunbin a,∗ , Dmitrij A. Afonnikov a,b and Nikolay A. Kolchanova a b

Institute of Cytology and Genetics SB RAS, Lavrentyev Ave. 10, Novosibirsk, 630090, Russia Novosibirsk State Unversity, Pirogova Str. 2, Novosibirsk, 630090, Russia

Edited by S. Rodin (guest editor) and N. Kolchanov; received 20 January 2007; revised and accepted 29 March 2007; published August 2007

ABSTRACT: Positive selection of genes that comprise signaling cascades and play the paramount role in the development of multicellular organisms is critical to our understanding of the reasons for the evolution of embryonic development. In this work, we analyze the evolution of 9 genes involved in the function of the Hh signaling cascade. We demonstrated that positive selection is a characteristic feature of the protein domains, encoded by gene regions, whose functions are related to the molecular mechanisms of development. We also found that the positive selection of Hh-signaling cascade transcription factors, morphogens, their development-related receptors and intracellular signal transduction factors are related to the divergence of the Bilateria taxonomic types. KEYWORDS: Hh-signaling cascade, positive selection, Bilateria, compensatory changes

INTRODUCTION The evolution of the fundamental molecular mechanisms underlying the embryonic animal development has become a key issue in the modern theory of evolution [Sole et al., 2003; Shu et al., 2006]. Allometric growth and symmetry of the bilaterian body plans, which appeared at about the same time (∼570 MYA), have suggested that the bilaterian ancestors possessed a new type of embryogenesis (compared with the one that arose in the vendobionts, the antique multicellular animals) giving rise to the formation of new tissues and organs [Knoll and Carroll, 1999; Erwin and Davidson, 2002]. The embryonic type of development specific to the now existing bilaterian animals has supposedly passed during the Precambrian evolution of Eumetazoa (∼50–80 MYA geological period) and morphologically it involved the following crucial events [Davidson et al., 1995; Peterson et al., 2000]: 1) the emergence of molecular mechanisms for cell differentiation; 2) the rise of primitive embryogenesis with cell and tissue differentiation immediately after fertilization; 3) the integration of the molecular mechanisms of cell proliferation – differentiation and the appearance of cells capable of embryonic induction; 4) the development a common molecular mechanism for the differentiation of these cells. It has been assumed that the rise of new tissues and organs during evolution was associated with a drastic reorganization of the molecular developmental mechanisms and also with the appearance of genes expressed in particular ∗

Corresponding author. E-mail: [email protected].

Electronic publication can be found in In Silico Biol. 7, 0047 , August 2007. 1386-6338/07/$17.00  2007 – IOS Press and Bioinformation Systems e.V. and the authors. All rights reserved

334

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

organs and tissues [Sole et al., 2003; Salazar-Ciudad et al., 2003; Salazar-Ciudad and Jernvall, 2004]. Thus, it appears that the adaptive mode of the evolution of genes defining the formation of new tissue and organ may generate the type of Bilateria embryogenesis and give rise to different bilaterian body plans. An important molecular mechanism that provides cell differentiation is signal transduction [PiresdaSilva and Sommer, 2003]. The signal transduction pathways ensure the reception of the concentration gradients of morphogens and their transformation into the differentiated state of cells within tissues and organs. Hence it may be assumed that at the molecular level the key molecular rearrangements may be related to changes in genes that participate in signal transduction pathways. A study of the events that had occurred during the stages of evolution corresponding to the emergence of Bilateria may provide useful information on the evolution of the molecular mechanisms controlling cell differentiation [Raff, 1996]. The Hedgehog (Hh) cascade, its sensors, switches and routers have been amply described and studied [Lum and Beachy, 2004]. It is involved in diverse morphogenetic processes [Ingham and McMahon, 2001; Nybakken and Perrimon, 2002]. A set of proteins accomplishing versatile functions can be distinguished in the Hh signal cascade. These include the Hh morphogen, the Ptc and Smo cell receptors, the Cos2, PKA, Slmb, Su(Fu) and Fu proteins that make up the high molecular weight complex transducing the Hh signal within the cell nucleus, and the Ci transcription factor. It is known how these proteins interact in the functioning Hh-cascade and the interactions can be described as a gene network. A computer-based description has been made and deposited in the Gene Net database [Gunbin et al., 2004a]. In the current study, we analyze the evolution of proteins involved in the function of the Hh-cascade at the steps that match with the evolutionary events at the morphological level related to the appearance of the Bilateria taxa and the divergence of the Bilateria taxonomical types on the phylogenetic tree (the divergence between Ecdysozoa, Lophotrochozoa and Deuterostomia groups of taxonomical types) [Sepkoski and Markov, 1999; Peterson et al., 2004; Benton and Donoghue, 2006]. The aim of the present study was to identify positive selection in the various functional regions of proteins involved in the Hh-cascade, on the one hand, and on phylogenetic tree branches that match with the divergence of the main Bilateria taxonomical types during the evolution of the multicellular organisms, on the other hand. Our analysis is based on identification of gene regions showing an excess of nonsynonymous over synonymous fixations [Kimura, 1983]. An excess of nonsynonymous substitutions over synonymous substitutions is an unambiguous indicator of positive natural selection in the various regions of genes and it may be connected with functional changes in protein domains [Kimura, 1983; Suzuki, 2004; Merl et al., 2005]. There are many approaches for searching of the positive selection mode. One approach defines the ratio of the fixation rate of the nonsynonymous (K a ) to the synonymous (K s ) mutations in various gene regions or in discrete codons. From the results it may be concluded that positive selection acts with different pressures on the different gene regions. Another approach defines the K a /Ks ratios on separate branches of the phylogenetic tree, allowing to answer the question: At what evolutionary step genes subject to positive selection (as a rule, this selection might have been related to the acquirement of a new function by a protein after gene duplication)? [Yang and Nielsen, 2000; Tzeng et al., 2004; Tang and Wu, 2006]. Here we search and analyze the events of Darwinian positive selection in the genes involved in the Hh-cascade. We started by searching regions in genes which are participants in the cascade and for which selection is positive at the codon level. We proceeded on the assumption that the protein domains, encoded by these gene regions, may be of functional importance with regard to morphogenesis and cell

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

335

differentiation. Next, we searched positive selection of the protein domains at the evolutionary step matching the divergence of Bilateria taxonomical types. We also assumed that the genes under positive selection at this stage of evolution may be related to the emergence of the diverce bilaterian body plans. As a result, we demonstrated that positive selection is a characteristic feature of the protein domains, encoded by gene regions whose functions are related to the molecular mechanisms of morphogenesis. Our salient finding was that amino acid substitutions become fixed at those stages of evolution that match with the divergence of the Bilateria taxonomical types [Gunbin et al., 2005]. SUBJECT OF INQUIRY The classical model of the Hh signaling pathway in invertebrates includes the following series of events (Fig. 1) [Ingham and McMahon, 2001; Nybakken and Perrimon, 2002; Lum and Beachy, 2004]: The Disp gene codes for the enzyme responsible for exocytosis of the Hh morphogen [Burke et al., 1999; Gallet et al., 2003; Peters et al., 2004]. In the absence of Hh, the Ptc morphogen receptor inhibits the transduction of the signal into the cell through the Smo protein. The Cos2, Fu, and Ci proteins form a high-molecular-weight complex. In the absence of Hh, the complex is bound to the cytoskeleton, and the Ci transcription factor is phosphorylated by PKA protein kinase and cleaved within this complex. After the cleavage, the N-terminal Ci fragment is transported to the nucleus and represses a gene cassette. Interaction between Hh and its receptor Ptc eliminates the inhibitory effect of Ptc on Smo activity. The activated Smo triggers a series of reactions resulting in removal of Fu/Cos2/Ci from the cytoskeleton and hyperphosphorylation of Fu and Cos2. This results in stabilization of the full-sized Ci and its transport to the nucleus. The Su(Fu) protein is bound to Fu and Ci. Su(Fu) is involved in retaining Ci in the cytoplasm. The Slmb protein is responsible for ubiquitination of Ci. We have compiled a computerized description of the gene network for this pathway, it includes a description of the interactions of 33 genes and 46 proteins and protein complexes, based on analysis of 150 publications [Gunbin et al., 2004a]. Here we present the results of an analysis of sequences of the following gene families: Hh, Ptc, Smo, Disp, PKA, Slmb, Su(Fu), Fu, and Ci (data is available from the authors upon request). The Cos2 family was omitted from consideration, because there were no experimentally annotated protein sequences of vertebrate species [Katoh, Y. and Katoh, M., 2004]. We also examined published experimental data on the functions of domains of the proteins encoded by genes under study (protein domains are indicated with the same numerals in Table 1 and in Fig. 2). METHOD Analysis of amino acid and nucleotide sequences included the following steps: (1) search for similar sequences for protein families involved in the operation of the Hh pathway; (2) alignment of amino acid sequences; (3) construction of phylogenetic trees; (4) search for regions of genes for the proteins found evolving under adaptive selection; (5) identification of the positive selection mode for branches of the phylogenetic tree corresponding to the stage of divergence of the major groups of Bilateria phyla. Sampling of Amino Acid and Nucleotide Sequences Amino acid sequences of a protein family were retrieved from the GenBank database (http://www.ncbi. nlm.nih.gov/Genbank/index.html) [Pruitt et al., 2005] using PSI-BLAST (http://www.ncbi.nlm.nih.gov/

336

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

BLAST/) [Altschul et al., 1997] and from the Ensembl database (http://www.ensembl.org/) [Hubbard et al., 2005] using Ensembl options for detection of ortholog clusters [Enright et al., 2002]. The queries included experimentally annotated amino acid sequences of Drosophila, mouse, and Xenopus. Data Processing (Multiple Alignment and Construction of Phylogenetic Trees) Preliminary alignment of protein sequences was performed using algorithms CLUSTALW (http://www.ebi.ac.uk/clustalw/) [Thompson et al., 1994], T-COFFEE (http://www.tcoffee.org/Projects home page/t coffee home page.html) [Notredame et al., 2000], and MAFFT (http://align.bmr.kyushuu.ac.jp/mafft/software/) [Katoh, K. et al., 2002; 2005]. The alignments were improved with the RASCAL package (ftp://ftp-igbmc.u-strasbg.fr/pub/RASCAL) [Thompson et al., 2003]. The quality of the resulting preliminary alignments was assessed as follows. Regions corresponding to conserved domains of proteins stored in NCBI CDD (http://www.ncbi.nlm.nih.gov/Structure/cdd/cdd.shtml) [Marchler-Bauer et al., 2005] were identified in the sequences of proteins aligned; column score was estimated for these regions as in [Thompson et al., 1999]. The alignment with the highest score was chosen from three preliminary alignments constructed with different programs. The best alignment was used for the subsequent analysis and as the source for constructing aligned cDNA sequences for positive selection analysis. Phylogeny was reconstructed from the multiple alignment of amino acid sequences by the maximum likelihood method implemented in the PHYML program (http://atgc.lirmm.fr/phyml/) [Guindon and Gascuel, 2003]. We used the replacement matrix that satisfied best the AIC criterion implemented in the PROTTEST program (http://darwin.uvigo.es/software/prottest.html) [Abascal et al., 2005]. For all families, it was the WAG matrix [Whelan and Goldman, 2001] that satisfied the criterion. For tracing errors of phylogeny reconstruction, which could result from sequencing errors or nonuniform evolution of protein regions, we compared the topologies of the phylogenetic trees with those of trees stored in the Tree of Life database (http://www.tolweb.org/tree/) [Maddison and Schulz, 1996-2006]. Sequences whose positions on the tree we reconstructed did not match with the taxonomic positions of the corresponding species in the Tree of Life were excluded as recommended in [Nei et al., 1998; Sanderson and Shaffer, 2002]. Detection of Positive Selection in Gene Sequences Detection of gene regions evolving under positive selection was an essential part of our analysis. First, this analysis allows determination of protein regions where amino acid replacements accumulated faster than expected. This may be related to functional divergence of the regions and, as a consequence, to their specific role in the operation of the Hh signaling pathway. Second, comparison of such gene regions appeared promising in revealing selection events on phylogenetic tree branches. There are many methods allowing search for gene regions under adaptive selection. However it is known that different methods are differently sensitive to specific features of nucleotide sequences and to their similarity degree [Suzuki, 2004; Merl et al., 2005]. We sought for regions evolving in the adaptive mode using algorithms: PSWIN (available from S.D. Reid) [Whittam and Nei, 1991], ADAPTSITE (http://www.cib.nig.ac.jp/dda/yossuzuk/welcome.html) [Suzuki et al., 2001; Suzuki, 2004], PLATO (http://evolve.zoo.ox.ac.uk/software.html?id=plato) [Grassly and Holmes, 1997], and PAML (http://abacus.gene.ucl.ac.uk/software/paml.html) [Yang, 1997]. These programs implement various measures and algorithms for identification of gene regions of one codon and more in length, where nucleotides are replaced in the course of positive selection. The final conclusion on the existence of

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

337

Fig. 1. Schematic presentation of the Hh signaling pathway. (A) Exocytosis of the Hh morphogen. Succession of Hh-pathway events: (B) without Hh; (C) with Hh.

338

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

Fig. 2. Detection of regions evolving under positive selection in nine genes involved in the Hh pathway. X-axis: amino acid numbers in the protein multiple alignments, starting from the N-end. Y -axis: for the ADAPTSITE (black line), PAML (black dotted line), and PSWIN (grey line) programs, the number of adaptively evolving codons in a 30 aa window; for the PLATO program (grey dotted line), the Z-score for a 30 codons stretch, significantly deviating from random values. Below the X-axes, gene regions considered here to evolve under positive selection are indicated in red, protein domains are indicated in grayscale and labeled with numbers (see Table 1).

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

339

positive selection was based on the voting rule. If three of four programs detect this evolution mode in a 30-codon long stretch of a gene, its evolution is considered to have been driven by positive selection. In our view, the approach allows a more reliable detection of positive selection in gene sequences, being less sensitive to drawbacks of single methods. We ran the programs with default parameters for both in multiple cDNA alignments (gapless) and phylogenies. We chose the length of 30 codons as the approximate size of the functional protein domain. Detection of Adaptive Selection on Phylogenetic Tree Branches We detected positive selection on phylogenetic tree branches using an approach based on pairwise comparison of coding sequences without reconstruction of ancestral codons. As in the previous case, we used several methods, and the final result was obtained by voting. The methods used were: 1) NG86 (estimation of the PN /PS ratio, the program DISTS2 (http://ftp.dna.affrc.go.jp/pub/unix/syn/)) [Nei and Gojobori, 1986; Ina, 1995], 2) YN00 (estimation of the K a /Ks ratio, the PAML package (http://abacus.gene.ucl.ac.uk/software/paml.html)) [Yang, 1997; Yang and Nielsen, 2000], 3) TW06 (estimation of the Ka /Ks ratio, the program KAKH (available from C.-I. Wu)) [Tang and Wu, 2006], and 4) the logarithm of the WAG/NED4 ratio [Peltier et al., 2000], where NED4 was calculated as the prevalence of synonymous codons for amino acids A, G, L, P, R, S, T, and V with identical positions third of the codons, and WAG was the evolutionary distance evaluated from the WAG matrix with the TREE-PUZZLE program (http://www.tree-puzzle.de/) [Schmidt et al., 2002]. The statistical significance of the results obtained using these programs for long evolutionary distances typical of the considered proteins was additionally tested by simulation of sequence evolution. Simulation of the molecular evolution of genes (proteins) using the evolver program (Monte Carlo simulation of nucleotide or amino acid substitutions) allows to take into account the effect of saturation of substitutions in the deep branches on the outcome of methods used to search positive selection [Yang, 1997]. Multiple alignments of amino acid sequences were done with the evolver program of the PAML package (http://abacus.gene.ucl.ac.uk/software/paml.html) [Yang, 1997] on the basis of phylogenies reconstructed using the WAG amino acid replacement matrix and taking into account the prevalences of amino acids in the sequences. The parameter of the gamma distribution shape, α, was taken to be unity, and the number of categories for the gamma distribution was 8. The coding nucleotide sequences were aligned on the basis of the resulting multiple alignments of amino acid sequences using their reverse translation to codons. We applied this method, because 1) direct generation of sequences at the codon level followed by their translation to amino acid sequences yielded considerable deviations of the amino acid composition of the simulated sequences from that observed in real proteins and 2) saturation of synonymous changes in the deep branches of the phylogenetic trees (use of the different synonymous codons tends to be random). For each family, 150 alignments of gene and protein sequences were generated. The positive selection measures, described above, were calculated for sequence pairs from simulated alignments to define boundaries of the 95% confidence and tolerance intervals in the assumed absence of positive selection. The estimates of positive selection measures for real sequence pairs were compared with the boundaries of the resulting confidence intervals to provide the basis for calculation of the significance index for each measure. This index took the values: 1 (observed within the tolerance interval but below the lower boundary of the confidence interval of mean values), 2 (observed within the confidence interval of mean values), 3 (observed within the tolerance interval but above the upper boundary of the confidence interval of mean values), and 4 (observed above the upper boundary of the tolerance interval of values). Positive selection was considered to be detected if the total of the indices

340

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study Table 1 Functional domains of studied proteins

Protein Domain protein composition name Hh 1) N-terminal 2) C-terminal, intein

Domain function

References

Ptc receptor binding Self-excision; binding of cholesterol to Hh N-terminal domain Membrane anchoring; possibly, secretion of the Hh determined by two extracellular domains, located between loops 1 and 2 (region adjacent to the N-end of domain 5) and loops 7 and 8 (region between domains 5 and 6) Intracellular vesicular transport Possibly, ion transport

Bijlsma et al., 2004 Hall et al., 1997; Bijlsma et al., 2004 Burke et al., 1999; Kuwabara and Labouesse, 2002

Fuse et al., 1999

6) Exporters of RND superfamily (predicted)

Membrane anchoring; the Hh contacts with two extracellular domains, located between loops 1 and 2 (region adjacent to the N-end of domain 5) and loops 7 and 8 (region between domains 5 and 6) Intracellular vesicular transport; interaction with the Smo protein Possibly, ion transport; Contact with Smo

7 transmembrane loops

Membrane anchoring

Disp

12 transmembrane loops

Ptc

3) Sterol-sensing 4) Exporters of RND superfamily (predicted) 12 transmembrane loops

5) Sterol-sensing

Smo

7) N-terminal, extracellular 8) G-receptors 9) C-terminal

Tseng et al., 1999

Martin et al., 2001; Karpen et al., 2001 Tseng et al., 1999; Karpen et al., 2001; Jia et al., 2004 Jia et al., 2003; Lum et al., 2003; Ogden et al., 2003; Ho et al., 2005

Possibly, binding to Ptc G proteins binding Binding to the high-molecular-weight complex Fu/Cos2/Ci PKA 10) Serine/threonine kinases, catalytic Protein phosphorylation Petretti and Prigent, 2005 Slmb 11) WD-domain Protein-protein interactions; Jackson and Eldridge, binding to the SCF ubiquitin-ligase complex 2002; Cardozo and Pagano, 2004; Nakayama and Nakayama, 2005 12) F-box Fu 13) Protein kinase catalytic domain Protein phosphorylation Murone et al., 2000; Ascano et al., 2002; Monnier et al., 2002; Ascano and Robbins, 2004 C-terminal Inhibition of Hh-signaling; interaction with Su(Fu) and Cos2 Su(Fu) 14) SUFU-domain Interaction with Ci Dunaeva et al., 2003; Merchant et al., 2004; Barnfield et al., 2005 Ci (Gli) 15) and 17) Cytoplasm anchoring (at C- Ci protein retention in cytoplasm (only in Monnier et al., 2002; end and N-end from Zinc finger) insects) Wang and Jiang, 2004 16) Zinc finger DNA-binding Alexandre et al., 1996 18) Transcription cofactors binding Binding to Ci transcription cofactors Akimaru et al., 1997; Chen et al., 2000

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

341

Fig. 3. The phylogenetic tree of Hh gene family. The topology indicated by asterisk is in agree with [Maddison and Schulz, 1996–2006; Bourlat et al., 2006]. Branches supposedly affected by adaptive evolution events are shown by thick black line. Genes analyzed are boxed. Vertebrate paralogs are indicated with circled numerals.

of the four methods exceeded 10. Positive selection for a tree branch was considered to be detected if positive selection was detected for all sequence pairs of the subtrees. In consideration of genes belonging to vertebrate species, priority was given to species of the classes Actinopterygii, Sarcopterygii, Amphibia, and Reptilia. We focused attention on the phylogenetic tree branches corresponding to the divergence stage of the major phylum groups of Bilateria (insects and crustaceans – mollusks and annelids – chordates and echinoderms). The detailed results of the various analysis steps (multiple alignments, phylogenetic trees, positive selection detection) are available from the authors upon request. RESULTS Detection of Protein Domains Evolving under Positive Selection At the first step, we identified regions of genes of the Hh pathway evolving under positive selection. The results for nine genes are given in Fig. 2. We found that proteins involved in the Hh pathway and

342

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

their different domains are differently subject to positive selection (Fig. 2). – In the Hh gene, positive selection occurs in the region coding for the intein domain, in agreement with previous studies [Kumar et al., 1996]. – In the Disp gene, evolution under positive selection manifests itself in the region coding for the N-terminal peptide, whose function is as yet unknown. – In the Ptc gene, positive selection was detected for the region coding for the protein domains binding to the Hh morphogen. – Positive selection in the Smo gene was detected in the regions coding for: (1) the domain binding to the high-molecular-weight Fu/Cos2/Ci complex, (2) the extracellular domain, and (3) the domain binding to G proteins. – In the PKA gene and its homologs, the effect of positive selection is observed in small region (∼15 codons) coding for the N-terminal peptide. – No positive selection was found for the Slmb gene. – In the F u gene, positive selection affects the region coding for the C-terminal peptide occurring outside the catalytic domain of the Fu protein. – In the Su(Fu) gene, positive selection affects the region coding for the middle portion of the SU(FU) domain. – In the Ci gene and its homologs Gli, the effect of positive selection is observed in regions coding for domains responsible for retaining Ci(Gli) in the cytoplasm and for binding to transcription cofactors. Correlation between protein regions evolving under positive selection and their function indicates that positive selection occurred mainly for proteins and their domains involved in protein-protein interactions within the Hh signaling pathway, whereas proteins of “universal” functions (PKA and Slmb) were under negative selection. Neither was positive selection found for the catalytic domains of the proteins under study; this may indicate that this function was probably conserved during the evolution of the Hh pathway. Detection of Phylogenetic Tree Branches Evolving under Positive Selection The results of the analysis performed to detect phylogenetic branches presumably evolving under positive selection are shown in Figs 3–11 and in Supplementary Information. They indicate that positive selection unevenly affected the genes and proteins under study. No positive selection was found on branches of the phylogenetic trees for the Ptc, PKA, and Slmb genes. This result was expected for PKA and Slmb, because we found no regions evolving under positive selection. In contrast, such regions were found in Ptc, but their analysis did not allow definite localization of positive selection stages on the phylogenetic tree, probably, because of the very weak positive selection signal observed in proteins of this family. The stages of positive selection in the Hh family genes correspond to the divergence of the genera Artemia, Patella, Ciona, Lytechinus and Strongylocentrotus, and, possibly, Branchiostoma (appearance of arthropods and vertebrates) and to the formation of paralogous groups of the Hh gene in vertebrates, in agreement with previous studies [Kumar et al., 1996; Shimeld, 1999]. In the Disp genes, positive selection was detected for the tree branches passing from the genera of the class Insecta to those of the Craniata group. This corresponds also to the appearance times of arthropods and vertebrates. In the Smo gene family, positive selection is typical of branches passing from Insecta to the genera Danio, Tetraodon, and Fugu, corresponding to the appearance times of arthropods and vertebrates. According to our analysis, for the Su(Fu) gene family positive selection manifests on tree branches passing from the genera of class Insecta to those of the Craniata group. As for the above families, this corresponds to the

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

343

Table 2 Correlation between adaptive modes of gene evolution and appearance of major animal taxa Gene

Functional group Events of positive selection (developmental gene or in genes correlated with housekeeping gene) appearance of major animal taxa Hh Developmental gene + Ptc Developmental gene − Smo Developmental gene + Disp Developmental gene + Cos2 Developmental gene +* PKA Housekeeping gene − Slmb Housekeeping gene − Su(Fu) Developmental gene + Fu Developmental gene + Ci Developmental gene + Designations: “+”, gene families found to evolve in the adaptive mode; “−”, gene families found to evolve under negative selection; “*”, absence of functional homologous proteins from insects and vertebrates, low degree of similarity in vertebrate proteins.

Fig. 4. The phylogenetic tree of Disp gene family coding for transmembrane proteins. Designations as in Fig. 3.

appearance of arthropods and vertebrates. At the same evolutionary stages, positive selection affected genes of the Fu family: positive selection corresponds to the branch passing from the genus Danio and Gallus to the insect genera, corresponding to the appearance times of arthropods and vertebrates. For the genes of the Ci(Gli) family, positive selection was detected for the branch passing from the

344

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

Fig. 5. The phylogenetic tree of Ptc gene family coding for transmembrane proteins. Designations as in Fig. 3. Paralogs of genes are indicated with circled numerals.

genus Branchiostoma to the genera of the class Insecta and the group Tetrapoda, corresponding to the appearance stage of major taxa of bilateral organisms (arthropods and vertebrates). Thus, we conclude that the main stages of positive selection in the evolution of the Hh-pathway genes corresponded to the divergence of main phylum groups of Bilateria: Ecdysozoa (insects and crustaceans), Lophotrochozoa (mollusks and annelids), and Deuterostomia (chordates and echinoderms). This indicates that the evolutionary stages under positive selection and the appearance of major animal taxa are correlated (Table 2). DISCUSSION Rearrangement of the Hh-Cascade Structure During Evolution and Positive Selection We have demonstrated that certain gene regions that encode the domains of proteins whose function may be specific precisely to developmental processes (“the developmental genes”) are under positive selection. It is of importance that these domains are involved in the internal interactions of the Hh-cascade

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

345

Fig. 6. The phylogenetic tree of Smo gene family. Designations as in Fig. 3.

proteins, either self-processing and interacting with their own domains, like in Hh, or with other proteins of the cascade, such as Ptc, Smo, Fu, Su, Ci and, presumably, Disp. As to the developmental genes of the Hh-cascade, the time span when positive selection acted matches with the divergence of the two main bilaterian groups, Ecdysozoa and Deuterostomia. This indicates that amino acid substitutions were rapidly accumulating at the time of the fundamental structural changes in the Hh-cascade. In contrast, positive selection is not characteristic of the PKA and Slmb genes that encode the proteins involved in the interaction with the other signaling pathways (phosphorylation and protein degradation processes) or the catalytic domains of the protein under study. These components perform possibly a constant function that provides the vital activities of the cells. Therefore, PKA and Slmb evolved under negative selection. The different selection pressures acted upon Hh-cascade genes can be explained by the structural changes in the Hh-signaling pathway via compensatory changes in the mechanisms of protein-protein interactions. These changes in the cascade were possibly compensated by alterations in the dynamics of the morphogen production or sensing of the morphogen. For example, the rate and the remoteness of the response to the morphogen may be altered by modification of the mobility of the Hh morphogen or of the morphogen receptors (Ptc and Smo) [Nybakken and Perrimon, 2002]. It is of interest that in the Hh protein positive selection occurred in the intein domain controlling the excision of the Hh part, which is a morphogen. It may be suggested that the spreading rate of the morphogen (and accordingly the distance

346

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

Fig. 7. The phylogenetic tree of the family of genes coding for protein kinases. Designations as in Fig. 3.

it acts) may be controlled by its excision efficiency from the zymogen. If so, the positive selection in the excision-controlling domain would provide the reorganization of the Hh-cascade functioning by the appropriate changes in the kinetics of the yield and spread of the active Hh form. The positive selection in the proteins of the middle part of the Hh-cascade might have ensured the reorganization of the cell response to the morphogen. It is known that the different Bilateria types differ in the composition of proteins involved in the work of the middle part of the Hh-cascade. To illustrate, the N-end part of the Drosophila Smo protein is required for transduction of Hh-signal, while in vertebrates its deletion does not produce significant changes in the Hh-cascade [Nakano et al., 2004].

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

347

Fig. 8. The phylogenetic tree of Slmb gene family. Designations as in Fig. 3.

In invertebrates, Cos2 and Fu proteins form a functional complex that controls the penetration of Ci transcription factor into the nucleus [Jia et al., 2003; Lum et al., 2003; Ruel et al., 2003; Ascano and Robbins, 2004]. However, the experimental evidence for the Cos2 presence in vertebrates is indirect only [Tay et al., 2005]. Positive selection in transcription factors Ci(Gli) may provide changes in the cell response type (for example, passage from the unstable-stable response) to the Hh morphogen. Bilateria taxonomic types differ greatly in the composition and function of the Ci(Gli) transcription factors [Huangfu and Anderson, 2006]. In contrast to invertebrates, vertebrates have three Gli proteins that activate the transcription of the Hh-cascade effector genes, (Gli1, Gli2 and unprocessed Gli3) and two proteins inhibiting the transcription of Hh-cascade effector genes (modified Gli2 and processed Gli3); activating Gli enhances the expression of not only the effector genes, but also the Gli1 and Gli2 genes, thereby forming the positive feedback circuit nonexistent in invertebrates [Dai et al., 1999; Aza-Blanc et al., 2000; Ikram et al., 2004; Tyurina et al., 2005]. In vertebrates transcription factor Gli3 is probably the most important of the Gli proteins because only this one, in the absence of active Smo, can be subject to proteolysis like Ci vertebrate protein [M¨uller and Basler, 2000; Aza-Blanc et al., 2000; Tyurina et al., 2005]. Thus, vertebrates have additional feedback circuits that establish a definite expression level of the Hh-cascade effector genes. The presence of the transcription factors Gli1 and Gli2 in vertebrates broadens the range of genes whose expression can regulate the Hh-cascade, thereby affording more opportunities of obtaining specific responses of

348

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

Fig. 9. The phylogenetic tree of Fu gene family. Designations as in Fig. 3.

tissues to the same morphogen [Huangfu and Anderson, 2006]. Thus, this comparative analysis of the structure of the Hh-cascade proteins (or their functional domains) in different Bilateria suggests that the drastic changes in the Hh-cascade pathway occurred at the time of the divergence of the main groups of Bilateria taxonomic types (Ecdysozoa – Deuterostomia divergence). Evolution of the Molecular-Genetic Systems that Define the Morphogenesis and the Evolution of Hh-Signaling Cascade Genes During embryogenesis of the primitive animals (sponge, coelenterates) the adult tissue differentiation is determined immediately after fertilization and, consequently, fertilization directly determined adult morphogenesis [Davidson et al., 1995; Peterson et al., 2000]. This development type is the most primitive and supposedly the ancestral multicellular animals possessed it [Salazar-Ciudad et al., 2003; SalazarCiudad and Jernvall, 2004]. In contrast, Bilateria form embryonic tissue types missing later in the adult. Bilateria embryonic tissues are capable, as a rule, to produce morphogens that are capable of giving rise to other tissues capable, in turn, of also producing morphogens [Kolchanov et al., 2004; Suslov et al., 2004; Gunbin et al., 2004b; 2004c]. The ultimate result is that changes in the growth parameters of embryonic tissues, no matter how small, can generate tissues of new types, change in morphogenetic patterns, novel adult traits. Thus, Bilateria use a novel evolutionary strategy (in comparison with the primitive animals) improving their chances of coping with their environment. This evolutionary strategy of Bilateria has been studied in detail in the exemplary case of the rodent digestive system by Vorontsov, 1967. He has shown that the different organs of the rodent digestive system, as a rule, are at different specialization levels in response to the same environmental agents.

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

349

Fig. 10. The phylogenetic tree of Su(Fu) gene family. Designations as in Fig. 3.

Specialization of one organ of the system may be not related to the specialization of another organ. If so, one organ of the system may compensate the function of another. More than that, the different specialization levels of the tissues and organs of an animal organism and the compensation of the function of a tissue/organ by another may be universal for the Bilateria evolution [Hartman et al., 2001]. In this way, the uneven rates of the evolutionary reorganization of tissues and organs and the compensation of the function of a tissue/organ by another may provide the rapid and parsimonious strategy of the adaptability of bilaterian species. In the genes that are expressed in one tissue the fixation of nonsynonymous mutation is 3–5-fold greater than in the genes that are expressed in at least 16 tissues [Duret and Mouchiroud, 2000]. Therefore, the uneven rate of evolutionary reorganization is also a feature of the bilaterian genes that carry different functional loads. It would appear that the genes for tissue differentiation were expressed in a restricted tissues set during the emergence of Bilateria [Sole et al., 2003; Salazar-Ciudad et al., 2003; Salazar-Ciudad and Jernvall, 2004]. Clearly, the uneven rates of the evolutionary reorganization of tissues and the compensation of a tissue function by another could provide: 1) parsimonious and rapid adaptability of Bilateria species to various environmental agents and 2) rapid emergence of wide diversity of Bilateria, including the appearance of various Bilateria body plans (the appearance of Bilateria taxonomic types). Presumably, the appearance of precisely this novel, in principle, evolutionary strategy was the cause of the Cambrian explosion of Eumetazoa and especially of Bilateria. However, with the increasing morphological complexity, the number of developmental stages and embryonic tissues increased and a need for an evolutionary stagnation of many molecular-genetic systems arose. For these reasons, the adaptive evolution (positive selection) of the genes involved in the moleculargenetic developmental systems derived from other fundamental molecular machineries halted. This is

350

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

Fig. 11. The phylogenetic tree of Ci(Gli) gene family. Designations as in Fig. 3.

what we observed. Presumably, for these reasons, the adaptive evolution of the signaling pathway genes appears to be suspended in our days. CONCLUSION Here, we searched and analyzed the selection mode in gene families involved in the work of the Hh-signaling cascade. We used a number of methods to detect the positive selection mode both in gene regions and on phylogenetic tree branches. The combined computer-assisted approach allowed us to increase reliability as compared with the case when each method is used alone. The resulting data were related to those on the structural-functional organization of the proteins under analysis and also related to their interactions in the functioning Hh-cascade. The results demonstrated that the function of the analyzed proteins was subject to positive selection during the short span, in geological terms, corresponding to the stages when large bilaterian taxa (Ecdysozoa, Lophotrochozoa and Deuterostomia) formed. This was manifested as an increase in the accumulation rates of amino acid substitutions in many protein domains that control the specific interactions within the signaling cascade. The results suggested that changes in substitution accumulation are causes for the structural reorganization of the Hh-cascade at the formation steps of Ecdysozoa and Deuterostomia. Their compensatory nature is presumably dual: changes in the dynamics of the Hh-cascade function due to adaptive substitutions in the Hh, Ptc and Smo proteins and/or changes in the protein domains that are responsible for the protein-protein interactions

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

351

within the intracellular part of the Hh-signaling pathway and related to the structural reorganization of these interactions. ACKNOWLEDGEMENTS The work has been supported by the Ministry of Education of the Russian Federation grant “Development of the Higher School Scientific Potential” 2.1.1.4935, Russian Foundation of the Basic Research (05-04-49141-a, 05-07-98012-p), SB RAS integration projects 49, No. 10104-34/II-18/155-270/110506-001/28/2006-1. The computation was performed in part at the High Performance Computing Center, SB RAS. REFERENCES • Abascal, F., Zardoya, R. and Posada, D. (2005). ProtTest: selection of best-fit models of protein evolution. Bioinformatics 21, 2104-2105. • Akimaru, H., Chen, Y., Dai, P., Hou, D.-X., Nonaka, M., Smolik, S. M., Armstrong, S., Goodman, R. H. and Ishii, S. (1997). Drosophila CBP is a co-activator of cubitus interruptus in hedgehog signalling. Nature 386, 735-738. • Alexandre, C., Jacinto, A. and Ingham, P.W. (1996). Transcriptional activation of hedgehog target genes in Drosophila is mediated directly by the cubitus interruptus protein, a member of the GLI family of zinc finger DNA-binding proteins. Genes Dev. 10, 2003-2013. • Altschul, S. F., Madden, T. L., Schäffer, A. A., Zhang, J., Zhang, Z., Miller, W. and Lipman, D. J. (1997). Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 25, 3389-3402. • Ascano, M. Jr., Nybakken, K. E., Sosinski, J., Stegman, M. A. and Robbins, D. J. (2002). The carboxyl-terminal domain of the protein kinase fused can function as a dominant inhibitor of hedgehog signalling. Mol. Cell. Biol. 22, 1555-1566. • Ascano, M. Jr. and Robbins, D. J. (2004). An intramolecular association between two domains of the protein kinase Fused is necessary for Hedgehog signalling. Mol. Cell. Biol. 24, 10397-10405. • Aza-Blanc, P., Lin, H.-Y., Ruiz i Altaba, A. and Kornberg, T. B. (2000). Expression of the vertebrate Gli proteins in Drosophila reveals a distribution of activator and repressor activities. Development 127, 4293-4301. • Barnfield, P. C., Zhang, X., Thanabalasingham, V., Yoshida, M. and Hui, C. C. (2005). Negative regulation of Gli1 and Gli2 activator function by Suppressor of fused through multiple mechanisms. Differentiation 73, 397-405. • Benton, M. J. and Donoghue, P. C. (2007). Paleontological evidence to date the tree of life. Mol. Biol. Evol. 24, 26-53. • Bijlsma, M. F., Spek, C. A. and Peppelenbosch, M. P. (2004). Hedgehog: an unusual signal transducer. Bioessays 26, 387-394. • Bourlat, S. J., Juliusdottir, T., Lowe, C. J., Freeman, R., Aronowicz, J., Kirschner, M., Lander, E. S., Thorndyke, M., Nakano, H., Kohn, A. B., Heyland, A., Moroz, L. L., Copley, R. R. and Telford, M. J. (2006). Deuterostome phylogeny reveals monophyletic chordates and the new phylum Xenoturbellida. Nature 444, 85-88. • Burke, R., Nellen, D., Bellotto, M., Hafen, E., Senti, K.-A., Dickson, B. J. and Basler, K. (1999). Dispatched, a novel sterol-sensing domain protein dedicated to the release of cholesterol-modified hedgehog from signalling cells. Cell 99, 803-815. • Cardozo, T. and Pagano, M. (2004). The SCF ubiquitin ligase: insights into a molecular machine. Nat. Rev. Mol. Cell Biol. 5, 739-751. • Chen, Y., Goodman, R. H. and Smolik, S. M. (2000). Cubitus interruptus requires Drosophila CREB-binding protein to activate wingless expression in the Drosophila embryo. Mol. Cell. Biol. 20, 1616-1625. • Dai, P., Akimaru, H., Tanaka, Y., Maekawa, T., Nakafuku, M. and Ishii, S. (1999). Sonic Hedgehog-induced activation of the Gli1 promoter is mediated by GLI3. J. Biol. Chem. 274, 8143-8152. • Davidson, E. H., Peterson, K. J. and Cameron, R. A. (1995). Origin of adult bilaterian body plans: Evolution of developmental regulatory mechanisms. Science 270, 1319-1325. • Dunaeva, M., Michelson, P., Kogerman, P. and Toftgard, R. (2003). Characterization of the physical interaction of Gli proteins with SUFU proteins. J. Biol. Chem. 278, 5116-5122. • Duret, L. and Mouchiroud, D. (2000). Determinants of substitution rates in mammalian genes: expression pattern affects selection intensity but not mutation rate. Mol. Biol. Evol. 17, 68-74. • Enright, A. J., Van Dongen, S. and Ouzounis, C. A. (2002). An efficient algorithm for large-scale detection of protein families. Nucleic Acids Res. 30, 1575-1584.

352

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

• Erwin, D. H. and Davidson, E. H. (2002). The last common bilaterian ancestor. Development 129, 3021-3032. • Fuse, N., Maiti, T., Wang, B., Porter, J. A., Hall, T. M., Leahy, D. J. and Beachy, P. A. (1999). Sonic hedgehog protein signals not as a hydrolytic enzyme but as an apparent ligand for patched. Proc. Natl. Acad. Sci. USA 96, 10992-10999. • Gallet, A., Rodriguez, R., Ruel, L. and Therond, P. P. (2003). Cholesterol modification of hedgehog is required for trafficking and movement, revealing an asymmetric cellular response to hedgehog. Dev. Cell. 4, 191-204. • Grassly, N. C. and Holmes, E. C. (1997). A likelihood method for the detection of selection and recombination using nucleotide sequences. Mol. Biol. Evol. 14, 239-247. • Guindon, S. and Gascuel, O. (2003). A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst. Biol. 52, 696-704. • Gunbin, K. V., Omelyanchuk, L. V. and Ananko, E. A. (2004a). Two gene networks underlying the formation of the anteriorposterior and dorso-ventral wing imaginal disc compartment boundaries in Drosophila melanogaster. In: Kolchanov, N. A. et al. (eds.), Proceedings of the forth international conference on bioinformatics of genome regulation and structure, BGRS’2004. Institute of Cytology and Genetics Press 2, 56-59. http://www.bionet.nsc.ru/meeting/bgrs2004/tom2.pdf • Gunbin, K. V., Suslov, V. V. Kolchanov, N. A. and Omelyanchuk, N. A. (2004b). Genetic mechanisms of morphological evolution, part 1. Siberian Journal of Ecology 11, 599-610 (in Russian). • Gunbin, K. V., Suslov, V. V., Kolchanov, N. A. and Omelyanchuk, N. A. (2004c). Genetic mechanisms of morphological evolution, part 2. Siberian Journal of Ecology 11, 611-621 (in Russian). • Gunbin, K. V., Omelyanchuk, L. V. and Kolchanov, N. A. (2005). Aromorphoses and the adaptive molecular evolution: Hedgehog signalling cascade genes. International Workshop “Biosphere Origin and Evolution”. Abstracts. Boreskov Institute of Catalysis Press, p. OP-5. http://www-sbras.nsc.ru/ws/biosphere/index.en.html • Hall, T. M., Porter, J. A., Young, K. E., Koonin, E. V., Beachy, P. A. and Leahy, D. J. (1997). Crystal structure of a Hedgehog autoprocessing domain: homology between Hedgehog and self-splicing proteins. Cell 91, 85-97. • Hartman, J. L. 4th, Garvik, B. and Hartwell, L. (2001). Principles for the buffering of genetic variation. Science 291, 1001-1004. • Ho, K. S., Suyama, K., Fish, M. and Scott, M. P. (2005). Differential regulation of Hedgehog target gene transcription by Costal2 and Suppressor of Fused. Development 132, 1401-1412. • Huangfu, D. and Anderson, K. V. (2006). Signalling from Smo to Ci/Gli: conservation and divergence of Hedgehog pathways from Drosophila to vertebrates. Development 133, 3-14. • Hubbard, T., et al. (2005). Ensembl 2005. Nucleic Acids Res. 33, D447-D453. • Ikram, M. S., Neill, G. W., Regl, G., Eichberger, T., Frischauf, A.-M., Aberger, F., Quinn, A. and Philpott, M. (2004). GLI2 is expressed in normal human epidermis and BCC and induces GLI1 expression by binding to its promoter. J. Invest. Dermatol. 122, 1503-1509. • Ina, Y. (1995). New methods for estimating the numbers of synonymous and nonsynonymous substitutions. J. Mol. Evol. 40, 190-226. • Ingham, P. W. and McMahon, A. P. (2001). Hedgehog signalling in animal development: paradigms and principles. Genes Dev. 15, 3059-3087. • Jackson, P. K. and Eldridge, A. G. (2002). The SCF ubiquitin ligase: an extended look. Mol. Cell. 9, 923-925. • Jia, J., Tong, C. and Jiang, J. (2003). Smoothened transduces Hedgehog signal by physically interacting with Costal2/Fused complex through its C-terminal tail. Genes Dev. 17, 2709-2720. • Jia, J., Tong, C., Wang, B., Luo, L. and Jiang, J. (2004). Hedgehog signalling activity of Smoothened requires phosphorylation by protein kinase A and casein kinase I. Nature 432, 1045-1050. • Karpen, H. E., Bukowski, J. T., Hughes, T., Gratton, J. P., Sessa, W. C. and Gailani, M. R. (2001). The sonic hedgehog receptor patched associates with caveolin-1 in cholesterol-rich microdomains of the plasma membrane. J. Biol. Chem. 276, 19503-19511. • Katoh, K., Kuma, K., Toh, H. and Miyata, T. (2005). MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Res. 33, 511-518. • Katoh, K., Misawa, K., Kuma, K. and Miyata, T. (2002). MAFFT: a novel method for rapid multiple sequence alignment based on fast Fourier transform. Nucleic Acids Res. 30, 3059-3066. • Katoh, Y. and Katoh, M. (2004) KIF27 is one of orthologs for Drosophila Costal-2. Int. J. Oncol. 25, 1875-1880. • Kimura, M. (1983) The Neutral Theory of Molecular Evolution. Cambridge University Press, Cambridge. • Knoll, A. H. and Carroll, S. B. (1999). Early animal evolution: emerging views from comparative biology and geology. Science 284, 2129-2137. • Kolchanov, N. A., Suslov, V. V. and Gunbin, K. V. (2004). Biological evolution simulation: genetic regulatory systems and biological complexity encoding. Herald of the VOGIS 8, 86-99. http://www.bionet.nsc.ru/vogis/pict pdf/2004/29/07 Kolchanov.pdf (in Russian) • Kumar, S., Balczarek, K. A. and Lai, Z. C. (1996). Evolution of the hedgehog gene family. Genetics 142, 965-972. • Kuwabara, P. E. and Labouesse, M. (2002) The sterol-sensing domain: multiple families, a unique role? Trends Genet. 18, 193-201.

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

353

• Lum, L. and Beachy, P. A. (2004). The Hedgehog response network: sensors, switches, and routers. Science 304, 1755-1759. • Lum, L., Zhang, C., Oh, S., Mann, R. K., von Kessler, D. P., Taipale, J., Weis-Garcia, F., Gong, R., Wang, B. and Beachy, P. A. (2003). Hedgehog signal transduction via Smoothened association with a cytoplasmic complex scaffolded by the atypical kinesin, Costal-2. Mol. Cell. 12, 1261-1274. • Maddison, D. R. and Schulz K.-S. (eds.) (1996-2006). The Tree of Life Web Project. http://tolweb.org • Marchler-Bauer, A., et al. (2005). CDD: a Conserved Domain Database for protein classification. Nucleic Acids Res. 33, D192-D196. • Martin, V., Carrillo, G., Torroja, C. and Guerrero, I. (2001). The sterol-sensing domain of Patched protein seems to control Smoothened activity through Patched vesicular trafficking. Curr. Biol. 11, 601-607. • Merchant, M., Vajdos, F. F., Ultsch, M., Maun, H. R., Wendt, U., Cannon, J., Desmarais, W., Lazarus, R. A., de Vos, A. M. and de Sauvage, F. J. (2004). Suppressor of fused regulates Gli activity through a dual binding mechanism. Mol. Cell. Biol. 24, 8627-8641. • Merl, D., Escalante, A. and Prado, R. (2005). Comparison of Bayesian, maximum likelihood and parsimony methods for detecting positive selection. UCSC Department of Applied Math and Statistics Technical Reports, ams2005-03. www.ams.ucsc.edu/reports/ams2005-03.pdf. • Monnier, V., Ho, K. S., Sanial, M., Scott, M. P. and Plessis, A. (2002). Hedgehog signal transduction proteins: contacts of the Fused kinase and Ci transcription factor with the kinesin-related protein Costal2. BMC Dev. Biol. 2, 4. • M¨uller, B. and Basler, K. (2000). The repressor and activator forms of Cubitus interruptus control Hedgehog target genes through common generic gli-binding sites. Development 127, 2999-3007. • Murone, M., Luoh, S. M., Stone, D., Li, W., Gurney, A., Armanini, M., Grey, C., Rosenthal, A. and de Sauvage, F. J. (2000). Gli regulation by the opposing activities of fused and suppressor of fused. Nat. Cell Biol. 2, 310-312. • Nakano, Y., Nystedt, S., Shivdasani, A. A., Strutt, H., Thomas, C. and Ingham, P. W. (2004). Functional domains and sub-cellular distribution of the Hedgehog transducing protein Smoothened in Drosophila. Mech. Dev. 121, 507-518. • Nakayama, K. I. and Nakayama, K. (2005). Regulation of the cell cycle by SCF-type ubiquitin ligases. Semin. Cell Dev. Biol. 16, 323-333. • Nei, M. and Gojobori, T. (1986). Simple methods for estimating the numbers of synonymous and nonsynonymous nucleotide substitutions. Mol. Biol. Evol. 3, 418-426. • Nei, M., Kumar, S. and Takahashi, K. (1998). The optimization principle in phylogenetic analysis tends to give incorrect topologies when the number of nucleotides or amino acids used is small. Proc. Natl. Acad. Sci. USA 95, 12390-12397. • Notredame, C., Higgins, D. G. and Heringa, J. (2000). T-Coffee: A novel method for fast and accurate multiple sequence alignment. J. Mol. Biol. 302, 205-217. • Nybakken, K. and Perrimon, N. (2002). Hedgehog signal transduction: recent findings. Curr. Opin. Genet. Dev. 12, 503-511. • Ogden, S. K., Ascano, M. Jr., Stegman, M. A., Suber, L. M., Hooper, J. E. and Robbins, D. J. (2003). Identification of a functional interaction between the transmembrane protein Smoothened and the kinesin-related protein Costal2. Curr. Biol. 13, 1998-2003. • Peltier, M. R., Raley, L. C., Liberles, D. A., Benner, S. A. and Hansen, P. J. (2000). Evolutionary history of the uterine serpins. J. Exp. Zool. 288, 165-174. • Peters, C., Wolf, A., Wagner, M., Kuhlmann, J. and Waldmann, H. (2004). The cholesterol membrane anchor of the Hedgehog protein confers stable membrane association to lipid-modified proteins. Proc. Natl. Acad. Sci. USA 101, 8531-8536. • Peterson, K. J., Cameron, R. A. and Davidson, E. H. (2000). Bilaterian Origins: significance of new experimental observations. Dev. Biol. 219, 1-17. • Peterson, K. J., Lyons, J. B., Nowak, K. S., Takacs, C. M., Wargo, M. J. and McPeek, M. A. (2004). Estimating metazoan divergence times with a molecular clock. Proc. Natl. Acad. Sci. USA 101, 6536-6541. • Petretti, C. and Prigent, C. (2005). The Protein Kinase Resource: everything you always wanted to know about protein kinases but were afraid to ask. Biol. Cell. 97, 113-118. • Pires-daSilva A. and Sommer R.J. (2003). The evolution of signalling pathways in animal development. Nat. Rev. Genet. 4, 39-49. • Pruitt, K. D., Tatusova, T. and Maglott, D. R. (2005). NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins. Nucleic Acids Res. 33, D501-D504. • Raff, R. A. (1996). The Shape of Life: Genes, Development, and the Evolution of Animal Form. University of Chicago Press, Chicago. • Ruel, L., Rodriguez, R., Gallet, A., Lavenant-Staccini, L. and Th´erond, P. P. (2003). Stability and association of Smoothened, Costal2 and Fused with Cubitus interruptus are regulated by Hedgehog. Nat. Cell Biol. 5, 907-913. • Salazar-Ciudad, I. and Jernvall, J. (2004). How different types of pattern formation mechanisms affect the evolution of form and development. Evol. Dev. 6, 6-16.

354

K.V. Gunbin et al. / The Evolution of the Hh-Signaling Pathway Genes: A Computer-Assisted Study

• Salazar-Ciudad, I., Jernvall, J. and Newman, S. A. (2003). Mechanisms of pattern formation in development and evolution. Development 130, 2027-2037. • Sanderson, M. J. and Shaffer, H. B. (2002). Troubleshooting molecular phylogenetic analyses. Annu. Rev. Ecol. Syst. 33, 49-72. • Schmidt, H. A., Strimmer, K., Vingron, M. and von Haeseler, A. (2002). TREE-PUZZLE: maximum likelihood phylogenetic analysis using quartets and parallel computing. Bioinformatics 18, 502-504. • Shimeld, S. M. (1999). The evolution of the hedgehog gene family in chordates: insights from amphioxus hedgehog. Dev. Genes Evol. 209, 40-47. • Sepkoski, J. J. Jr. Fossil marine animals. Stratigraphic distribution; taxonomy. 1996. Programming: Markov A., 1999. http://macroevolution.narod.ru/dblist.htm (in Russian) • Shu, D. G., Morris, S. C., Han, J., Li, Y., Zhang, X. L., Hua, H., Zhang, Z. F., Liu, J. N., Guo, J. F., Yao, Y. and Yasui, K. (2006). Lower Cambrian vendobionts from China and early diploblast evolution. Science 312, 731-734. • Sole, R. V., Fernandez, P. and Kauffman, S. A. (2003). Adaptive walks in a gene network model of morphogenesis: insights into the Cambrian explosion. Int. J. Dev. Biol. 47, 685-693. • Suslov, V. V., Gunbin, K. V. and Kolchanov, N. A. (2004). Genetic mechanisms of biological complexity encoding. Ecological genetic 2, 13-26 (in Russian). • Suzuki, Y. (2004). New methods for detecting positive selection at single amino acid sites. J. Mol. Evol. 59, 11-19. • Suzuki, Y., Gojobori, T. and Nei, M. (2001). ADAPTSITE: detecting natural selection at single amino acid sites. Bioinformatics 17, 660-661. • Tang, H. and Wu, C. I. (2006). A new method for estimating nonsynonymous substitutions and its applications to detecting positive selection. Mol. Biol. Evol. 23, 372-379. • Tay, S. Y., Ingham, P. W. and Roy, S. (2005). A homologue of the Drosophila kinesin-like protein Costal2 regulates Hedgehog signal transduction in the vertebrate embryo. Development 132, 625-634. • Thompson, J. D., Higgins, D. G. and Gibson, T. J. (1994). CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, positions-specific gap penalties and weight matrix choice. Nucleic Acids Res. 22, 4673-4680. • Thompson, J. D., Plewniak, F. and Poch, O. (1999). A comprehensive comparison of multiple sequence alignment programs. Nucleic Acids Res. 27, 2682-2690. • Thompson, J. D., Thierry, J. C. and Poch, O. (2003). RASCAL: rapid scanning and correction of multiple sequence alignments. Bioinformatics 19, 1155-1161. • Tseng, T. T., Gratwick, K. S., Kollman, J., Park, D., Nies, D. H., Goffeau, A. and Saier, M. H. Jr. (1999). The RND permease superfamily: an ancient, ubiquitous and diverse family that includes human disease and development proteins. J. Mol. Microbiol. Biotechnol. 1, 107-125. • Tyurina, O. V., Guner, B., Popova, E., Feng, J., Schier, A. F., Kohtz, J. D. and Karlstrom, R. O. (2005). Zebrafish Gli3 functions as both an activator and a repressor in Hedgehog signalling. Dev. Biol. 277, 537-556. • Tzeng, Y. H., Pan, R. and Li, W. H. (2004). Comparison of three methods for estimating rates of synonymous and nonsynonymous nucleotide substitutions. Mol. Biol. Evol. 21, 2290-2298. • Vorontsov, N. N. (1967). Evolutsia Pishchevaritelnoy Systemy Mysheobraznykh Gryzunov. Nauka, (In Russian). English translation (1979). Evolution of the Alimentary System in Myomorph Rodents. Smithsonian Institution, Washington D.C. & Nat. Sci. Foundation (Translated by the Indian Documentation Centre, New Delhi) • Wang, G. and Jiang, J. (2004). Multiple Cos2/Ci interactions regulate Ci subcellular localization through microtubule dependent and independent mechanisms. Dev. Biol. 268, 493-505. • Whelan, S. and Goldman, N. (2001) A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol. Biol. Evol. 18, 691-699. • Whittam, T. S. and Nei, M. (1991). Neutral mutation hypothesis test. Nature 354, 115-116. • Yang, Z. (1997). PAML: a program package for phylogenetic analysis by maximum likelihood. Comput. Appl. Biosci. 13, 555-556. • Yang, Z. and Nielsen, R. (2000). Estimating synonymous and nonsynonymous substitution rates under realistic evolutionary models. Mol. Biol. Evol. 17, 32-43.

Suggest Documents