Modelling metabolic evolution on phenotypic fitness landscapes: a ...

13 downloads 2800 Views 178KB Size Report
*Institute for Computer Science, Heinrich Heine University, 40225 D üsseldorf, Germany ... metabolic fitness landscape structure with changing environments.
1172

Biochemical Society Transactions (2015) Volume 43, part 6

Modelling metabolic evolution on phenotypic fitness landscapes: a case study on C4 photosynthesis David Heckmann*1 *Institute for Computer Science, Heinrich Heine University, 40225 Dusseldorf, ¨ Germany

Abstract How did the complex metabolic systems we observe today evolve through adaptive evolution? The fitness landscape is the theoretical framework to answer this question. Since experimental data on natural fitness landscapes is scarce, computational models are a valuable tool to predict landscape topologies and evolutionary trajectories. Careful assumptions about the genetic and phenotypic features of the system under study can simplify the design of such models significantly. The analysis of C4 photosynthesis evolution provides an example for accurate predictions based on the phenotypic fitness landscape of a complex metabolic trait. The C4 pathway evolved multiple times from the ancestral C3 pathway and models predict a smooth ‘Mount Fuji’ landscape accordingly. The modelled phenotypic landscape implies evolutionary trajectories that agree with data on modern intermediate species, indicating that evolution can be predicted based on the phenotypic fitness landscape. Future directions will have to include structural changes of metabolic fitness landscape structure with changing environments. This will not only answer important evolutionary questions about reversibility of metabolic traits, but also suggest strategies to increase crop yields by engineering the C4 pathway into C3 plants.

Evolution of complex traits and underlying fitness landscapes The metabolism of multicellular organisms exhibits an outstanding complexity in both the number and the spatial arrangement of interacting components. Since many of those components appear to be essential for the adaptive function of the system as a whole, the question arises of how these complex systems were formed through adaptive evolution. The theory of fitness landscapes provides a framework for answering such questions. Genetic fitness landscapes map genetic states to corresponding fitness values, whereas phenotypic landscapes map a phenotypic domain to fitness [1] (Figure 1). In the theory of genetic fitness landscapes, the equivalent of the above problem would be the search for accessible evolutionary trajectories through the landscape that connect an ancestral state with a derived state. Whether such a trajectory exists will depend on the availability of beneficial mutations and the existence of epistatic interactions between them [2]. The resulting topology of the underlying fitness landscape thus determines the accessibility of evolutionary paths. Smooth, single-peaked, ‘Mount Fuji’-type landscapes allow adaptive changes from each state in the landscape, whereas rugged Key words: C4 photosynthesis, epistasis, evolution, fitness landscape, genotype-phenotype map, metabolic model. Abbreviations: PEPC, phosphoenolpyruvate carboxylase; Rubisco, ribulose-1,5-bisphosphate carboxylase-oxygenase. 1 To whom correspondence should be addressed (email [email protected]).

 C 2015

Authors; published by Portland Press Limited

topographies imply the existence of local optima that render additional changes deleterious and result in ‘dead end’trajectories [3]. A rugged topology in a genetic fitness landscape implies the existence of reciprocal sign epistasis between genetic loci; simultaneous genetic changes result in a fitness change that has the opposite sign of that of the individual changes [4]. Current techniques for experimental exploration of fitness landscapes are extremely limited by the combinatorial explosion of possible states. The most common approach is the selection of a small set of mutations that are either known to be relevant for organismal fitness a priori or are connecting two phenotypic states of interest. Using this approach, subsets of the intragenic landscapes of bacterial β-lactamase [5] and isopropylate dehydrogenase [6] were mapped experimentally. Both these single-gene studies yielded smooth ‘Mount Fuji’-type landscapes. Khan et al. [7] mapped a multigene landscape that arose in an experimental evolution experiment and also found a smooth genetic landscape. A complementary approach to an exhaustive evaluation of small landscapes is the random sampling of accessible trajectories through experimental evolution [8,9]. The fitness landscape of a complex trait is unlikely to be mapped using current experimental techniques because even genetic landscapes that are spanned by few mutations quickly become infeasible to evaluate experimentally. Systems models of complex metabolic traits allow us to investigate landscapes of high dimension, provided that we understand how genotypic or phenotypic parameters map to organismal fitness. Biochem. Soc. Trans. (2015) 43, 1172–1176; doi:10.1042/BST20150148

Metabolic Pathways Analysis 2015

Figure 1 Concepts of fitness landscapes A state transition (i.e. mutation) is shown on levels of genotype, phenotype and fitness. Black arrows indicate the genotype–phenotype (left) and the phenotype-fitness (right) map. The phenotype in this example is the concentration of an arbitrary compound A.

genotypic landscape

phenotypic landscape

...GATC...

[A] = 1.5 µM

...GCTC...

[A] = 2 µM

genotype

phenotype

Theoretical studies of fitness landscape are often restricted to the analysis of ‘artificial’ landscapes of given statistical properties [3,10]. Although such investigations are important for studying the general implications of landscape topologies and assumptions about the nature of adaptive walks, they are not based on real biological systems. Theoretical studies on actual genetic fitness landscapes are scarce due to the lack of models for accurate genotype–phenotype mapping. One exception is the prediction of RNA structure from primary sequence. Fontana and Schuster [11] simulated the folding of RNA towards an arbitrarily defined structure of maximum fitness and found discontinuous changes in phenotype space. Even though this RNA system provides a unique opportunity to study a genotype–phenotype map, the phenotype-tofitness map is not based on a real biological mechanism. Although feasible for the simple structural phenotypes of RNA molecules, the computational prediction of genotypefitness maps for complex traits is currently not possible. The closest we can get to make reliable predictions of complex trait evolution is through models of the phenotypic fitness landscape. Modelling evolutionary trajectories on the phenotypic fitness landscape makes important assumptions about the nature of the underlying genotype–phenotype map. We firstly have to ensure that phenotypic traits are able to evolve independently. Furthermore, and more importantly, we have to assume that the underlying genotype–phenotype map is a continuous function; the neighbourhood of a point in sequence space should be a subset of the neighbourhood of the corresponding point in phenotype space. Whereas this is not the case for RNA secondary structure, additivity of thermodynamic contributions in protein catalysis provides scenarios where this assumption is justified [6,12]. In cases where these assumptions hold, modelling evolution on the phenotypic fitness landscape might even

fitness

be preferable to the genotype-based approach. Genotype– phenotype mappings are often many-to-one and focusing on the phenotypic domain can yield a substantial dimensional reduction in the space that has to be explored. For simple metabolic systems, models were successfully used to understand experimental fitness landscapes in terms of phenotype-fitness mapping [6,13]. The use of optimality criteria for the prediction of metabolic states indicates the feasibility of predicting phenotype-fitness relationships for complex metabolic systems [14]. This makes metabolic systems a promising candidate for attempting to predict evolutionary trajectories in complex systems.

C4 photosynthesis: a complex metabolic solution for the oxygenase problem of Rubisco C4 photosynthesis is a complex metabolic add-on to the ancestral C3 pathway that involves changes in metabolic gene expression, enzyme kinetics, leaf anatomy and interaction of cell types. A clear connection between phenotype and fitness and the existence of evolutionary intermediates render C4 photosynthesis an ideal case study for the evolution of a complex metabolic trait. The most abundant enzyme on earth, Rubisco (ribulose1,5-bisphosphate carboxylase-oxygenase), fixes atmospheric CO2 into three-carbon compounds in the mesophyll cells of C3 plants. Due to the evolutionary history of Rubisco in low O2 environments and the chemical similarity of CO2 and O2 , molecular oxygen is a competing substrate for Rubisco. When Rubisco catalyses an oxygenation reaction, one molecule of 2-phosphoglycolate is created that needs to be recycled in the photorespiratory cycle. Apart from a significant energy usage, this metabolic pathway involves the release of CO2  C 2015

Authors; published by Portland Press Limited

1173

1174

Biochemical Society Transactions (2015) Volume 43, part 6

Figure 2 Schematic view of C4 photosynthesis In the leaf of a C4 plant, mesophyll cells are in contact with the intercellular gas phase. Carbon is fixed primarily by PEPC in the form of bicarbonate. Resulting C4 acids diffuse in to the gas-tight bundle sheath cells, where they are decarboxylated. This creates a high ratio of CO2 to O2 in the bundle sheath cells, where Rubisco is located. Operation of the C4 cycle thus suppresses the oxygenation reaction of Rubisco, but creates additional energetic demand in the form of ATP.

mesophyll cell

C4 acid

bundle sheath cell

C4 acid Carbohydrates

CO2

PEPC

Rubisco

CO2 PEP

C3 acid

AMP

ATP

+ 2 Pi

+Pi

by the glycine decarboxylase (GDC) complex, decreasing photosynthetic capacity by up to 30 % [15–17]. C4 plants suppress photorespiration by concentrating CO2 around Rubisco (Figure 2). In order to be operational, this energy-dependent CO2 pump requires changes in gene expression, leaf anatomy and biochemical features. Rubisco expression is shifted from the mesophyll to the bundle sheath tissue where cells possess thick cell walls that are assumed to reduce CO2 leakage [18]. Phosphoenolpyruvate carboxylase (PEPC), an enzyme lacking an oxygenation function, catalyses the primary fixation of CO2 as bicarbonate in the mesophyll. Resulting C4 acids diffuse to the bundle sheath cells and are decarboxylated, releasing CO2 in proximity to Rubisco. The result is a high ratio of CO2 to O2 in the bundle sheath cells that efficiently decreases photorespiration. Genes encoding for all enzymes required for the C4 cycle are found in C3 plant genomes, where they fulfil diverse metabolic functions [19]. Although only 3 % of all land plant species use the C4 cycle, they contribute about a quarter of primary biomass production on land [20,21]. Under tropical conditions, the ability to suppress photorespiration results in higher rates of photosynthesis and increased photosynthetic water- and nitrogen-use efficiency [22–24]. The most productive crop species are C4 plants, which possess high biomass production rates even in temperate regions [25]. Due to this high productivity, the engineering of the C4 trait into major C3 crops is a highly promising route towards meeting the growing demands on food production [26,27]. Since major features of the genetic blueprint of C4 photosynthesis are still lacking, combining directed breeding with genetic engineering approaches is considered a promising option towards achieving this goal [28]. Understanding the evolutionary  C 2015

Authors; published by Portland Press Limited

Calvin-Benson Cycle

trajectories that produced C4 photosynthesis will support such strategies [28,29]. Conceptual models on the evolution of C4 photosynthesis are based on the physiological, biochemical and genetic study of C3 , C4 and C3 –C4 intermediate species [21]. Such data on intermediate species is valuable for constructing the fitness landscape model, especially when deciding on the independence of parameters and continuity of genotype– phenotype mapping. Of course, caution is advised as to avoid circular arguments when using intermediate species for model validation. Despite the complexity of the trait, C4 photosynthesis has evolved independently in more than 60 angiosperm lineages [30]. This high level of polyphyly constitutes a striking example of convergent evolution and points towards a smooth genotypic fitness landscape between C3 and C4 photosynthesis. Environmental factors have a strong influence on the probability of C4 evolution. The atmospheric CO2 –O2 ratio, heat, aridity and high light are discussed as important abiotic factors promoting C4 evolution [15,16,31]. On the other hand, the high energy requirement of C4 metabolism leads to a disadvantage of C4 under low-light conditions, such as those that occur in shaded forest areas [32]. Due to these dependencies, the fitness landscape will exhibit a dynamic topology with changing environments, possibly inverting the direction of selection pressure. This dynamic property of fitness landscapes is stressed by the concept of the ‘fitness seascape’ [33]. A general pre-adaptation in the form of gene duplications is assumed to be a requirement for C4 evolution [34,35]. This expanded genetic repertoire allows neofunctionalization of genes and the distinct expression patterns that are found

Metabolic Pathways Analysis 2015

in C4 plants. This means that phenotypic parameters can change without affecting their original metabolic function. Both enzyme activity and kinetics can thus be treated as independent parameters in a landscape model.

Simulating C4 photosynthesis evolution Modelling evolutionary trajectories of C4 photosynthesis was first suggested by Peisker [36], but these early models lacked a fitness-relevant output. Heckmann et al. [37] extended steady state gas exchange models of C3 –C4 intermediates developed by von Caemmerer [38] to map the phenotypic fitness landscape connecting C3 and C4 photosynthesis in a C4 -promoting environment. The main assumption underlying this study is the proportional increase in fitness with CO2 assimilation rate for a given nitrogen budget. Although difficult to verify, this assumption is justified by the advantages of C4 photosynthesis discussed above. Similar assumptions are made for applying flux balance analysis in bacteria [14], although the connection between the growth rates is more direct than in the case of a plant. The choice of evolving phenotypic parameters has to ensure their independence. For example, kinetic parameters of Rubisco are unable to evolve independently [39] and such interplay has to be included into the model [37]. In case of C4 photosynthesis, the underlying genotype–phenotype map is not well elucidated. As discussed above, continuity of the genotype–phenotype map has to be assumed to predict evolutionary trajectories solely in phenotype space. Although recent studies on individual parameters justify this assumption [40], it is not possible to rule out large phenotypic jumps in response to small genetic changes. Under these assumptions, the topology of the phenotypic landscape of C4 photosynthesis is of the Mount-Fuji type, similar to some results on small bacterial landscapes [5,6,37]. This result might explain the high level of convergence in C4 evolution [30]. In the in silico equivalent of an experimental evolution experiment, Markov Chain Monte Carlo simulations were used to predict evolutionary trajectories on this landscape [37]. These predicted trajectories lie close to all C3 – C4 intermediate species for which parameters were available. This example shows the importance of experimentally accessible intermediate phenotypes for validating evolutionary predictions for complex traits. Due to the high number of assumptions that go into these models, such a validation is essential for studying fitness landscapes. The predicted trajectories show no signs of diminishing returns [37], which are often observed in bacterial experimental evolution studies [8,9]. The reason for this observation in the C4 model lies in the epistatic effects between landscape parameters; the fitness benefit of late mutations is increased when early changes are present [37]. Furthermore, the predicted trajectories were similar to those that resulted from a purely statistical approach for the inference of trajectories towards C4 photosynthesis [41]. Further details of the early steps of C4 evolution were elucidated by combining kinetic and genome-scale

metabolic models [42]. C2 photosynthesis is a weak carbon-concentrating mechanism that operates by releasing photorespiratory CO2 close to Rubisco in a specific cell type [16]. The fact that many plants in the phylogenetic neighbourhood of C4 photosynthesis utilize this mechanism indicates that C2 photosynthesis increases the probability of C4 evolution. Only when modelling C2 photosynthesis using a detailed genome-scale metabolic model, a variety of nitrogen recycling pathways became apparent [42]. These pathways use components of the C4 cycle to operate, thus indicating that C2 photosynthesis works as a metabolic preadaptation to C4 photosynthesis.

Outlook: modelling environments and engineering strategies Future directions in studying the fitness landscape of C4 evolution will have to integrate knowledge on environmental responses [43]. Conceptual models predict C3 photosynthesis to be advantageous in shaded environments, whereas hot high light environments result in a Mount Fuji landscape that predicts strong selection towards the C4 pathway [37]. The shape of the fitness landscape can thus be expected to transform drastically when environments change. Integrating quantitative physiological knowledge into fitness models will result in a space of environments that will allow mapping this ‘seascape’ of C4 evolution [33]. Such an advanced model will allow for predictions on the evolvability and reversibility of C4 photosynthesis across relevant environments. Furthermore, this will permit the prediction of optimal strategies towards engineering C4 photosynthesis in C3 plants, possibly through the combined use of metabolic engineering and experimental evolution [28]. The preferred way of validating predictions on the fitness landscape is the comparison with intermediate species. Additional data on the biochemistry, gene expression, anatomy and growth environments will allow the validation and falsification of assumptions on the current models of C4 fitness landscapes. In the greater picture of the predictability of the evolution of complex metabolic traits, mapping from genotype to phenotype will be the strongest limitation to the development of robust models. In cases where the genotype–phenotype mapping is continuous, this constraint can be relaxed. In cases where this relaxation is not possible, predicting evolution solely based on phenotype-fitness models is likely to fail. Understanding and predicting the evolution of metabolic systems remains a challenging task, but integration of the available biochemical, physiological and evolutionary data on well-chosen systems will continue to produce robust quantitative fitness models.

Acknowledgements I would like to thank Professor Martin Lercher for is insightful comments on this manuscript.

 C 2015

Authors; published by Portland Press Limited

1175

1176

Biochemical Society Transactions (2015) Volume 43, part 6

Funding This work was supported by the Deutsche Forschungsgemeinschaft [grant number IRTG 1525] to D.H.

References 1 Orr, H.A. (2005) The genetic theory of adaptation: a brief history. Nat. Rev. Genet. 6, 119–127 CrossRef PubMed 2 Wagner, G.P. and Altenberg, L. (1996) Perspective: complex adaptations and the evolution of evolvability. Evolution 50, 967–976 CrossRef 3 Franke, J., Klozer, ¨ A., de Visser, J.A.G.M. and Krug, J. (2011) Evolutionary accessibility of mutational pathways. PLoS Comput. Biol. 7, e1002134 CrossRef PubMed 4 Poelwijk, F.J., Tanase-Nicola, ˘ S., Kiviet, D.J. and Tans, S.J. (2011) Reciprocal sign epistasis is a necessary condition for multi-peaked fitness landscapes. J. Ther. Biol. 272, 141–144 CrossRef 5 Weinreich, D.M., Delaney, N.F., DePristo, M.A. and Hartl, D.L. (2006) Darwinian evolution can follow only very few mutational paths to fitter proteins. Science 312, 111–114 CrossRef PubMed 6 Lunzer, M., Miller, S.P., Felsheim, R. and Dean, A.M. (2005) The biochemical architecture of an ancient adaptive landscape. Science 310, 499–501 CrossRef PubMed 7 Khan, A.I., Dinh, D.M., Schneider, D., Lenski, R.E. and Cooper, T.F. (2011) Negative epistasis between beneficial mutations in an evolving bacterial population. Science 332, 1193–1196 CrossRef PubMed 8 Lenski, R.E. and Travisano, M. (1994) Dynamics of adaptation and diversification: a 10,000-generation experiment with bacterial populations. Proc. Natl. Acad. Sci. U.S.A. 91, 6808–6814 CrossRef PubMed 9 Elena, S.F. and Lenski, R.E. (2003) Evolution experiments with microorganisms: the dynamics and genetic bases of adaptation. Nat. Rev. Genet. 4, 457–469 CrossRef PubMed 10 Kauffman, S. and Levin, S. (1987) Towards a general theory of adaptive walks on rugged landscapes. J. Ther. Biol. 128, 11–45 CrossRef 11 Fontana, W. and Schuster, P. (1998) Continuity in evolution: on the nature of transitions. Science 280, 1451–1455 CrossRef PubMed 12 Aita, T., Iwakura, M. and Husimi, Y. (2001) A cross-section of the fitness landscape of dihydrofolate reductase. Protein Eng. 14, 633–638 CrossRef PubMed 13 Chou, H.H., Delaney, N.F., Draghi, J.A. and Marx, C.J. (2014) Mapping the fitness landscape of gene expression uncovers the cause of antagonism and sign epistasis between adaptive mutations. PLoS Genet. 10, e1004149 CrossRef PubMed 14 Ibarra, R.U., Edwards, J.S. and Palsson, B.O. (2002) Escherichia coli K-12 undergoes adaptive evolution to achieve in silico predicted optimal growth. Nature 420, 186–189 CrossRef PubMed 15 Ehleringer, J.R., Sage, R.F., Flanagan, L.B. and Pearcy, R.W. (1991) Climate change and the evolution of C4 photosynthesis. Trends Ecol. Evol. 6, 95–99 CrossRef PubMed 16 Sage, R.F., Sage, T.L. and Kocacinar, F. (2012) Photorespiration and the evolution of C4 photosynthesis. Annu. Rev. Plant Biol. 63, 19–47 CrossRef PubMed 17 Sage, R.F. (2013) Photorespiratory compensation: a driver for biological diversity. Plant Biol. 15, 624–638 CrossRef PubMed 18 Kiirats, O., Lea, P.J., Franceschi, V.R. and Edwards, G.E. (2002) Bundle Sheath diffusive resistance to CO2 and effectiveness of C4 photosynthesis and refixation of photorespired CO2 in a C4 cycle mutant and wild-type Amaranthus edulis. Plant Physiol. 130, 964–976 CrossRef PubMed 19 Aubry, S., Brown, N.J. and Hibberd, J.M. (2011) The role of proteins in C3 plants prior to their recruitment into the C4 pathway. J. Exp. Bot. 62, 3049–3059 CrossRef PubMed 20 Still, C.J., Berry, J.A., Collatz, G.J. and DeFries, R.S. (2003) Global distribution of C3 and C4 vegetation: carbon cycle implications. Global Biogeochem. Cy. 17, 6–14 CrossRef 21 Sage, R.F. (2004) The evolution of C4 photosynthesis. New Phytol. 161, 341–370 CrossRef

 C 2015

Authors; published by Portland Press Limited

22 Brown, R.H. (1978) A difference in N use efficiency in C3 and C4 plants and its implications in adaptation and evolution. Crop. Sci. 18, 93–98 CrossRef 23 Taylor, S.H., Hulme, S.P., Rees, M., Ripley, B.S., Ian Woodward, F. and Osborne, C.P. (2010) Ecophysiological traits in C3 and C4 grasses: a phylogenetically controlled screening experiment. New Phytol. 185, 780–791 CrossRef PubMed 24 Vogan, P.J. and Sage, R.F. (2011) Water-use efficiency and nitrogen-use efficiency of C3 -C4 intermediate species of Flaveria Juss. (Asteraceae). Plant Cell Environ. 34, 1415–1430 CrossRef PubMed 25 Beale, C.V. and Long, S.P. (1995) Can perennial C4 grasses attain high efficiencies of radiant energy conversion in cool climates? Plant Cell Environ. 18, 641–650 CrossRef 26 Hibberd, J.M., Sheehy, J.E. and Langdale, J.A. (2008) Using C4 photosynthesis to increase the yield of rice – rationale and feasibility. Curr. Opin. Plant Biol. 11, 228–231 CrossRef PubMed 27 von Caemmerer, S., Quick, W.P. and Furbank, R.T. (2012) The development of C4 rice: current progress and future challenges. Science 336, 1671–1672 CrossRef PubMed 28 Sage, R. and Sage, T. (2007) Learning from nature to develop strategies for the directed evolution of C4 rice. In Charting New Pathways to C4 Rice (Sheehy, J.E., Mitchell, P.L. and Hardy, B., eds), pp. 195–216, International Rice Research Institute, Metro Manila 29 Slewinski, T.L. (2013) Using evolution as a guide to engineer Kranz-type C4 photosynthesis. Front. Plant Sci. 4, 212 PubMed 30 Sage, R.F., Christin, P.A. and Edwards, E.J. (2011) The C4 plant lineages of planet earth. J. Exp. Bot. 62, 3155–3169 CrossRef PubMed 31 Edwards, E.J., Osborne, C.P., Stromberg, ¨ C.A. and Smith, S.A. (2010) The origins of C4 grasslands: integrating evolutionary and ecosystem science. Science 328, 587–591 CrossRef PubMed 32 Ehleringer, J.R., Lin, Z.F., Field, C.B., Sun, G.C. and Kuo, C.Y. (1987) Leaf carbon isotope ratios of plants from a subtropical monsoon forest. Oecologia 72, 109–114 CrossRef 33 Mustonen, V. and Lassig, ¨ M. (2009) From fitness landscapes to seascapes: non-equilibrium dynamics of selection and adaptation. Trends Genet 25, 111–119 CrossRef PubMed 34 Monson, R.K. (2003) Gene duplication, neofunctionalization, and the evolution of C4 photosynthesis. Int. J. Plant Sci. 164, 43–54 CrossRef 35 Wang, X., Gowik, U., Tang, H., Bowers, J., Westhoff, P. and Paterson, A. (2009) Comparative genomic analysis of C4 photosynthetic pathway evolution in grasses. Genome Biol 10, 1–18 CrossRef 36 Peisker, M. (1986) Models of carbon metabolism in C3 -C4 intermediate plants as applied to the evolution of C4 photosynthesis. Plant Cell Environ 9, 627–635 CrossRef 37 Heckmann, D., Schulze, S., Denton, A., Gowik, U., Westhoff, P., Weber, A.P. and Lercher, M.J. (2013) Predicting C4 photosynthesis evolution: modular, individually adaptive steps on a Mount Fuji fitness landscape. Cell 153, 1579–1588 CrossRef PubMed 38 von Caemmerer, S. (2000) Biochemical models of leaf photosynthesis, Collingwood, Csiro Publishing. 39 Savir, Y., Noor, E., Milo, R. and Tlusty, T. (2010) Cross-species analysis traces adaptation of Rubisco toward optimality in a low-dimensional landscape. Proc. Natl. Acad. Sci. U.S.A. 107, 3475–3480 CrossRef PubMed 40 Schulze, S., Mallmann, J., Burscheidt, J., Koczor, M., Streubel, M., Bauwe, H., Gowik, U. and Westhoff, P. (2013) Evolution of C4 photosynthesis in the genus flaveria: establishment of a photorespiratory CO2 pump. Plant Cell 25, 2522–2535 CrossRef PubMed 41 Williams, B.P., Johnston, I.G., Covshoff, S. and Hibberd, J.M. (2013) Phenotypic landscape inference reveals multiple evolutionary paths to C4 photosynthesis. Elife 2, e00961 PubMed 42 Mallmann, J., Heckmann, D., Brautigam, ¨ A., Lercher, M.J., Weber, A.P., Westhoff, P. and Gowik, U. (2014) The role of photorespiration during the evolution of C4 photosynthesis in the genus flaveria. Elife 3, e02478 CrossRef PubMed 43 Beer, K.D., Orellana, M.V. and Baliga, N.S. (2013) Modeling the evolution of C4 photosynthesis. Cell 153, 1427–1429 CrossRef PubMed Received 30 June 2015 doi:10.1042/BST20150148