Reciprocal Sign Epistasis between Frequently Experimentally Evolved Adaptive Mutations Causes a Rugged Fitness Landscape

Download PDF České info

The fitness landscape captures the relationship between genotype and evolutionary fitness and is a pervasive metaphor used to describe the possible evolutionary trajectories of adaptation. However, little is known about the actual shape of fitness landscapes, including whether valleys of low fitness create local fitness optima, acting as barriers to adaptive change. Here we provide evidence of a rugged molecular fitness landscape arising during an evolution experiment in an asexual population of Saccharomyces cerevisiae. We identify the mutations that arose during the evolution using whole-genome sequencing and use competitive fitness assays to describe the mutations individually responsible for adaptation. In addition, we find that a fitness valley between two adaptive mutations in the genes MTH1 and HXT6/HXT7 is caused by reciprocal sign epistasis, where the fitness cost of the double mutant prohibits the two mutations from being selected in the same genetic background. The constraint enforced by reciprocal sign epistasis causes the mutations to remain mutually exclusive during the experiment, even though adaptive mutations in these two genes occur several times in independent lineages during the experiment. Our results show that epistasis plays a key role during adaptation and that inter-genic interactions can act as barriers between adaptive solutions. These results also provide a new interpretation on the classic Dobzhansky-Muller model of reproductive isolation and display some surprising parallels with mutations in genes often associated with tumors.

Published in the journal: . PLoS Genet 7(4): e32767. doi:10.1371/journal.pgen.1002056
Category: Research Article
doi: https://doi.org/10.1371/journal.pgen.1002056

Summary

Introduction

Introduced by Wright, the fitness landscape describes the possible mutational trajectories by which lineages evolve in a stepwise manner from genotypes that lie in regions of low fitness to ones of higher fitness [1], [2]. When viewed as a whole, this metaphorical landscape represents a species' possible paths of adaptive evolution towards the optimal genotype in a particular environment. An outstanding question is whether the selective surface of fitness landscapes is smooth, containing a single global fitness optimum, or rugged, where selective constraints on differing mutational trajectories create multiple local fitness optima [3]–[7]. If the landscape is smooth, any path leading to the optimal genotype that continuously increases the population's fitness will be selectively favored, and the population will reach the global optimum on the landscape. However, if the landscape is rugged, adaptation will be constrained by the mutations available to increase the population's fitness [3]. This ruggedness can hamper the efficacy of natural selection compared to a smooth landscape by slowing the rate of adaptation due to pervasive genetic constraint [8].

Genetic constraint on fitness landscapes is due to fitness epistasis, where a mutation's adaptive value depends on the genetic background in which it arises [6]. Epistasis is a key component in such processes as reproductive isolation and speciation [9], [10], the evolution of sex and recombination [11], [12], as well as human diseases [13]. Theory shows that a form of epistasis called “sign epistasis” is necessary to constrain mutational trajectories on fitness landscapes [14]. Sign epistasis occurs when mutations are beneficial within the context of some genetic backgrounds, but detrimental within others. However, it is an extreme form of sign epistasis, recently dubbed “reciprocal sign epistasis”, that is necessary to create the local peaks and valleys on fitness landscapes [7], [14], [15]. Reciprocal sign epistasis occurs when the mutational path between two genotypes is selectively inaccessible due to intermediate, low-fitness genotypes. Such valleys are less likely to be crossed by natural selection alone, depending on the mutation rate and population size [16]. Therefore, genotypes that reside at local fitness optima are likely dead-ends for natural selection: even if a higher fitness peak exists elsewhere on the landscape, the neighboring fitness valley impedes adaptation to the global fitness optimum.

Experimental studies aimed at testing mutational constraint on fitness landscapes due to epistasis have focused on testing engineered, biased amino acid substitutions, such as mutating residues at enzymes' active site(s) [17], [18], engineering likely evolutionary intermediates between ancestral and adapted versions of a single protein [19]–[21], or quantifying interactions between mutations in different genes that display an observable phenotype [22], [23]. Such work suggests that genetic constraint due to sign epistasis is prevalent and that adaptation can take surprisingly few mutational paths to the optimal genotype on the landscape. Some genotype-phenotype mapping studies using molecular data have inferred a multi-peaked landscape using proxies for fitness, but the extent of the role played by local optima during adaptation was either unknown [23] or limited [17], [24].

Here we describe a rugged fitness landscape that arose during an experimental evolution. We identify the molecular nature of the mutations resulting from the evolution, and describe which mutations are individually adaptive. Adaptive mutations in two genes appear several times in different adaptive lineages, and we determine that they are selectively mutually exclusive due to reciprocal sign epistasis. The genetic constraint between these two mutations causes a rugged fitness landscape, as both mutations occur multiple times during the evolution and are highly adaptive individually, while highly maladaptive in concert. This work shows that inter-genic interactions can act as barriers between adaptive solutions and adds to the mounting experimental evidence that the constraint caused by epistasis is of central importance in evolutionary biology.

Results

Whole-genome sequencing of experimentally evolved clones reveals novel mutations

We have further characterized a previously described population of asexually-propagated haploid S. cerevisiae that were experimentally evolved under glucose limitation in continuous culture for 448 generations [25]. In that study, the chemostat was seeded with equal quantities of three otherwise isogenic haploid S288c strains that each expresses a different fluorescent protein constitutively (GFP, YFP or DsRed), and the proportions of the three colored lineages were tracked over time using flow cytometry. Expansions and contractions of the colored subpopulations were monitored, and a total of five adaptive clones (M1–M5), were isolated from the various colored subpopulations at generations 56, 91, 196, 266 and 385, respectively (see Materials and Methods for additional details on experimental design). That study identified a total of 12 independent mutations in these clones, using tiling microarrays [25]. However, through analysis of progeny of these clones, we discovered that some of them harbored additional, adaptive mutations of unknown identity, so we performed whole-genome sequencing on all five clones and their ancestor in order to identify these mutations. Sequence coverage of the nuclear genome ranged from 21× to 45× (Table S1). As expected, we discovered additional mutations: a total of five additional single nucleotide polymorphisms (SNPs) in M2, M3 and M5 (Figure 1). All SNPs, indels and copy number variants found previously using tiling microarrays [25] were detected with this genome sequencing approach, though our sequence data were not able to discover the LTR insertion in GPB2 in M5 that we previously characterized, which is likely a limitation of the single-end sequencing.

**Fig. 1. Mutations in adaptive clones M1–M5.**

We also estimated the copy number of the HXT6/7 amplifications in M4 and M5 using real-time quantitative PCR (qPCR), targeting the nearly identical HXT6 and HXT7 coding regions as well as the HXT7 promoter. These data were normalized to an ancestral strain without the amplification, which has one copy each of the HXT6 and HXT7 ORFs flanking the HXT7 promoter (see Figure S1 for the region's structure). Given the model of mitotic recombination proposed by [26] to explain the HXT6/7 amplification, the number of HXT6/7 ORFs should be one greater than the number of HXT7 promoter regions, regardless of the number of amplifications that have occurred. The qPCR results indicate that there are ten HXT6/7 ORFs and nine HXT7 promoters in M4, while M5 has 8–10 HXT6/7 ORFs and 7–9 HXT7 promoters (Figure S2). These data suggest there were a minimum of four mitotic recombination events for M4 and three for M5 to produce each array of HXT6/7 genes. Comparing these results to the sequencing coverage of the HXT6/7 region show that the coverage-based analysis underestimated the copy number for both clones (Figure S1), which may be due to the mapping algorithm used.

Fitness characterization of individual mutations in evolved clones

Each mutation could be the result of positive natural selection, or alternatively, could be a neutral or slightly deleterious mutation that hitchhiked along with one or more adaptive mutations. Thus, we segregated the mutations and determined each one's fitness effect in genetic isolation using competition experiments [27], where the only genetic difference between the mutant and wild-type competitor strain was the single mutation. Surprisingly, these data indicate that only 1–2 mutations per clone confer a significant fitness increase when considered singly, regardless of the total number of mutations in an adaptive clone (Figure 2). This increase in the number of seemingly non-adaptive mutations in later clones (M4 and M5) could be due to a lack of sensitivity in our single mutation fitness assay, an accumulation of neutral or deleterious hitchhiking mutations (Muller's ratchet [28]), or to non-additive fitness effects between mutations (positive, synergistic epistasis [6]). Given the 1.2×10⁷ bp size of the S. cerevisiae genome, a mutation rate of 5.12×10⁻¹⁰ bp⁻¹ generation⁻¹ [29], and the number of generations passed for these evolved clones, the expected number of neutral mutations is 1.6 for M4 and 2.4 for M5, while the probability of accumulating 4 or more neutral mutations in M4 is 0.084 and 5 or more in M5 is 0.092 (Poisson distribution). While these probabilities are low, they do not allow us to reject the null hypothesis that these mutations are neutral. Thus, our results are consistent with there being 1–2 adaptive mutations per clone, with the remaining mutations being neutral, though we cannot unequivocally rule out there being additional adaptive mutations in M4 and M5. Notably, we have also observed a nonsense mutation in MUK1 in an independent evolution experiment performed under glucose limitation (Wenger and Sherlock, unpublished), suggesting that it may be adaptive either singly or in concert with another mutation.

**Fig. 2. Relative fitness of individual mutations derived from competition experiments.**

To test for evidence of additional adaptive mutations, we calculated the sum fitness effect of all the singly adaptive mutations from a particular clone to determine if that sum recapitulates the overall fitness of the clone. If additional adaptive mutations exist, the sum effect will be less than the fitness of the clone, assuming a no epistasis model. We found that the additive fitness effects of the individual adaptive mutations recapitulated the fitness of adaptive clones M1–M3 and M5 (Figure 3). In M4, the additive fitness effect of the two adaptive mutations is significantly larger than the fitness of the adaptive clone. This suggests that negative, antagonistic epistasis between two mutations in M4 causes a reduction in its overall fitness. This may be because the fitness effect of each individual mutation is so large that when combined using an additive model, the additive fitness effect is larger than the upper bound of fitness in the given environment.

**Fig. 3. Adaptive mutations recapitulate fitness of adaptive clones M1–M3 and M5.**

The genes containing singly adaptive mutations are enriched [30] for the GO terms “hexose transport” and “negative regulation of Ras protein signal transduction” (p = 1.36e-4 and p = 5.8e-3, respectively; FDR<0.01%), suggesting that adaptation was due to increased glucose transport and signaling through the Ras/cAMP pathway, as previously hypothesized [25]. We compared the mutations found in our experiment with S. cerevisiae polymorphism data [31], and found that MTH1 and RIM15 both have naturally-occurring premature stop codons alleles in environmental isolates. This suggests that the mutations leading to premature stop codons we see in these genes during experimental adaptation to limiting glucose are also ecologically-relevant mutations that may provide a fitness advantage in nature. Furthermore, we have also observed variation in the copy number of the HXT6/HXT7 locus in a survey of ∼70 yeast strains (B. Dunn and G. Sherlock, unpublished), again suggesting ecological relevance of these mutations.

Adaptive mutations in MTH1 and HXT6/7 are selectively mutually exclusive

The within-population reproducibility of adaptation between different lineages was striking -⁠ three independent mth1 adaptive nonsense mutations arose in clones M1–M3 (hereafter referred to as mth1-1, mth1-2 and mth1-3), while the amplification of the tandemly arrayed glucose transporter genes HXT6 and HXT7 arose independently in clones M4 and M5, as well as within the green subpopulation by the end of the evolution experiment (see Figure 3 in [25]). These repeated independent changes suggest that the presumptive loss of MTH1 function or increased HXT6/7 copy number -⁠ both of which result in increased HXT expression [26], [32] -⁠ are effective mechanisms by which yeast can adapt to limiting glucose. Strikingly, despite the common occurrence of these two mutations independently, none of the five clones we characterized had them both. To further investigate this trend, we genotyped 22 randomly picked clones from the yellow subpopulation at generation 266 for the mth1-3 allele and HXT6/7 amplification. While the subpopulation was heterogeneous, none of the 22 clones had both mutations (Table S2). Additionally, when examining our estimates of allele frequencies throughout the evolution experiment, we found that the mth1-3 allele began decreasing in frequency concurrent with the increase in frequency of the HXT6/7 amplification in the yellow subpopulation, further suggesting that the two mutations did not co-exist within the same clonal lineage (see generations 200–300, Figure 4). We also genotyped 24 random clones isolated from the generation 448 terminal green subpopulation, and again the two mutations were never seen to co-exist, with all 24 clones carrying the HXT6/7 amplification but not the mth1-1 allele (Table S2). Since the MTH1 coding sequence has a large capacity for nonsense mutations (169/434 codons differ by only one nucleotide from stop codons), and since we were only assessing the specific mth1-1 allele within the terminal green population, we sequenced the full-length MTH1 coding sequence in 4 of the 24 terminal green subpopulation clones, and found that all four had the wild-type coding sequence (data not shown). Taken together, these results suggest that the mth1 mutation and HXT6/7 amplification did not exist together in the same clone to reach detectable frequencies during our experiment, suggesting they may be selectively mutually exclusive.

Allele frequencies of <i>mth1-3</i> and <i>HXT6/7 amplification</i> in the yellow subpopulation. — **Fig. 4. Allele frequencies of *mth1-3* and *HXT6/7 amplification* in the yellow subpopulation.**

Reciprocal sign epistasis for fitness between mth1 and HXT6/7 amplification

Reciprocal sign epistasis for fitness can constrain evolutionary trajectories and even create multiple peaks on a fitness landscape [7], [14], [15], a situation that would create selectively mutually exclusive mutations. Thus, we hypothesized that reciprocal sign epistasis underlies mutual exclusivity between the observed mth1 mutations and HXT6/7 amplification. To test this hypothesis, we constructed double mutant strains (as outlined in Figure S3) containing either an mth1-2 or mth1-3 nonsense mutation and an HXT6/7 amplification allele, and competed this double mutant against either a wild-type strain, an mth1 mutant, or an HXT6/7 amplification mutant. As controls, we also competed single mutants and wild-type spores from the same dissection against a wild-type strain. As expected, the single mutants were again more fit than the parental wild-type strain. However, the double mutant was significantly less fit than the wild-type strain as well as both the mth1 and HXT6/7 amplification single mutant strains (Figure 5A and Figure S4). This shows that within the genetic contexts we tested, a clone with both a nonsense mutation in mth1 and an amplification in HXT6/7 is highly maladaptive, and is thus unlikely to reach an appreciable frequency during glucose-limited evolution.

**Fig. 5. Competition experiments to test for epistasis between singly adaptive mutations.**

Pervasive negative epistasis between other adaptive mutations

To determine if sign epistasis during our evolution experiment was a common phenomenon, we constructed pairwise combinations of non-co-existing adaptive mutations from M1–M5 and competed them against the wild-type parental strain. While we saw no further evidence of reciprocal sign epistasis between these pairs of mutations, the RIM15 and GPB2 mutations do show standard sign epistasis, where the double mutant is less fit than one of the single mutants (GPB2) but more fit than the other (RIM15) (Figure 5B). In addition, negative epistasis is also prevalent, defined as the double mutant having a fitness effect that is less than the additive effects of the single mutants, but still greater than the fitness effects of each single mutant. The pairs IRA1/(HXT6/7)_yellow, IRA1/RIM15, MTH1/GPB2 all have significant negative epistasis between the single mutations. Since negative epistasis between adaptive mutations results in a double mutant with a fitness greater than both single mutants, it is selectively favorable and does not act to constrain the fitness landscape [7]. As with the inferred negative epistasis between mutations found in M4 (Figure 3), it is possible that the calculated negative epistasis between these mutations is due to the large magnitude of their individual fitness effects exceeding the maximum fitness, and the seemingly pervasive negative epistasis may be non-specific, possibly occurring between any two mutations of individually large enough fitness effect.

Discussion

We have characterized the fitness of individual mutations as well as genetic interactions between mutations that arose during experimental evolution of yeast under glucose limitation. Our exhaustive analysis of single mutation fitness determined that only one to two mutations per adaptive clone individually result in a fitness gain, regardless of the total number of mutations in an adaptive clone (Figure 2). The remaining mutations' effects are consistent with the null hypothesis of neutrality. Our data also show that, given an additive model, the identified singly adaptive mutations are sufficient to explain the fitness of adaptive clones M1–M3 and M5, and in one case (M4), there is evidence for negative epistasis between mutations (Figure 3). The effect of negative epistasis may act like the idea of diminishing returns –⁠ adaptive mutations of large individual effect result in a smaller fitness effect when they occur together. The idea of diminishing returns of adaptation over time in a constant environment is further supported by data from long-term E. coli evolution experiments, where the rate of fitness improvement was initially fast but quickly decreased over time and then remained low [33], [34]. Perhaps this decrease in the adaptive value of accumulated mutations is due to a maximum intrinsic fitness for a particular environment. This makes intuitive sense in the light of the mutations in our study having high relative fitness coefficients (1.1 to 1.45), which when combined under an additive model would result in extremely large fitness coefficients. This non-specific, negative epistasis-like phenomenon is further exemplified by the interactions between non-co-existing adaptive mutations (Figure 5B). Thus, it is possible that the pervasive negative epistasis observed is genetically promiscuous and not a result of the specific interactions between the two mutations, and any two mutations with large enough fitness increases will display such interactions. To our knowledge, this phenomenon has not been addressed in the literature, and there is as yet no distinction between true negative epistasis between specific sets of alleles and a non-specific, negative epistasis-like phenomenon due to a fitness upper bound, even though both scenarios would be classified as negative epistasis by its mathematical definition.

We have also determined that reciprocal sign epistasis between mutations in MTH1 and HXT6/7 constrains adaptation, which causes these two mutations to be mutually exclusive during the evolution experiment, a result that is consistent with data from an independent glucose-limited yeast evolution experiment [35]. This constraint can be represented in an empirical fitness landscape, which maps out the mutational evolutionary trajectories available for natural selection, and our data make possible the construction of this landscape using direct fitness effects of combinations of mutations. Reciprocal sign epistasis between the adaptive mutations in MTH1 and HXT6/7 can be represented as a simple two-locus fitness landscape (Figure S5). Here, the mth1 mutation is at a local optimum, as the only mutational steps available lead to a decrease in fitness, and it is not the fittest genotype on the landscape (Figure 5A and Figure S4). The HXT6/7 amplification mutation is the fittest genotype and is therefore the global optimum for this landscape. A consequence of this landscape is that lineages with an adaptive mutation in MTH1 are stuck on a local adaptive peak and may not be able to reach the higher fitness peak where the HXT6/7 amplification mutation lies. This is the likely reason why the mth1 and HXT6/7 mutations remain mutually exclusive for the duration the experiment.

The issue of how populations move from one peak to another in nature (the “peak shift” problem) has been disputed since Wright conceived of the fitness landscape metaphor [1], [2], [36]–[38]. Furthermore, the existence of multi-peaked fitness landscapes themselves has recently been questioned at the theoretical level due to their immense multidimensionality causing neutral ridges connecting genotypes of high fitness on the landscape (the holey landscape model) [4], [39]. These ridges thereby eliminate the classical peak shift problem, and there is some experimental support for such ridges (e.g. [21], [40]). Since the adaptive mutations in mth1 and HXT6/7 remain mutually exclusive in our experiment, even after sampling several clones in different lineages, this supports the argument of an adaptive valley on the fitness landscape rather than a ridge connecting the two peaks. Alternatively, a ridge connecting the two peaks might be long and circuitous, and our experiment was not performed for sufficient evolutionary time for neutral evolution on a fitness ridge to occur. This is likely the case in [41], where an unresolved potentiating mutation is thought to have occurred 20,000 generations into the evolution experiment, allowing an innovative phenotype to evolve. In either case, the constraint is such that it would be difficult to adapt from one peak to the other.

In reality, all possible genotypes are present on a fitness landscape. In our case, while it is impossible to experimentally test all combinations of all possible mutations in conjunction with the mth1/(HXT6/7) double mutant to prove that this portion of the fitness landscape does indeed have two peaks [3], the large population size of the culture (2×10⁹) means that a large spectrum of mutations should be sampled by natural selection often (∼10⁷ new SNP mutations per generation, based on the previously measured mutation rate [29]). This suggests that natural selection may be rejecting the double mutant in myriad genomic contexts. Determining whether further evolution of an mth1 single mutant strain ever results in an HXT6/7 amplification mutation will shed additional light on the feasibility of adaptation from one peak to the other. If the HXT6/7 amplification can indeed appear on the mth1 background in such an evolution experiment, this would suggest that a compensatory mutation or mutations provide a fitness ridge between the peaks. While observing how an evolution experiment proceeds cannot indisputably prove the shape of the fitness landscape, it can inform the most relevant and repeated paths of adaptation.

It has been understood for quite some time that epistasis is a fundamental component of adaptation [6], but only have recent technological developments facilitated the discovery and testing of individual nucleotide changes and how they interact with one another to create function and fitness [6]–[8], [18]–[22]. We speculate that for the reciprocal sign epistasis between mth1 and HXT6/7, the fitness defect may be caused by an overabundance of hexose transporter proteins in the cell, due to the fact that both individual mutations act to increase hexose transporter transcription [25], [26], [32]. It is possible that the devotion of too many resources to the production of hexose transporters may take resources away from other essential functions. For example, the secretion machinery by which hexose transporters are localized to the plasma membrane may be overwhelmed by their overabundance. Alternatively, the large number of hexose transporters may take up space on the membrane's surface, preventing other transporters from being correctly localized or negatively impacting the fluidity of the membrane. Another scenario is that the overabundant hexose transporters may aggregate and form plaques in the cell due to their 12 hydrophobic trans-membrane domains. Understanding the mechanism of the reciprocal sign epistasis between mth1 and HXT6/7 will be important for understanding the underpinnings of molecular fitness landscapes and is worthy of further investigation.

The rugged mth1/(HXT6/7) fitness landscape has far-reaching evolutionary implications. In the Dobzhansky-Muller model of postzygotic reproductive isolation, two species are separated from each other by a pair of genomic loci that interact negatively to create a hybrid organism that is of lower fitness compared to its parents [42], [43]. This model bears striking similarity to the epistatic interaction we see between mth1 and HXT6/7 amplification (also see [23]). In our case, if two lineages became fixed for the mth1 or HXT6/7 mutations, the low fitness of hybrid double mutant offspring may lead the two lineages on a path to reproductive isolation, though in contrast to a typical D-M pair, clones containing the individual mutations are more fit than wild-type clones. This is in contrast to a recent finding of a D-M pair between two strains of S. cerevisiae that were experimentally adapted to different environmental conditions [9], [44]. In this case, the two mutations were adaptive in the conditions in which they evolved, but maladaptive when made to co-occur in one of the conditions, which is the traditional way in which D-M pairs are thought to arise.

Mutually exclusive mutations are also known to exist in cancers [45], [46]. Of specific relevance is the recent observation that mutually exclusive mutations in the Ras pathway are individually adaptive under limiting glucose in colorectal cells, an environmental condition believed to be relevant to cancers in vivo [46]. Intriguingly, these mutations lead to an increased expression of the glucose transporter GLUT1 resulting in increased glucose uptake by cancer cells [46]. These results from human cancers closely parallel the mutations we see in the glucose sensing and Ras/cAMP pathways, which lead to an increase expression of the hexose transporters. Thus, human cancers and yeast may respond to the same selective pressures by mutating the same pathways, and these parallels beg the thought that reciprocal sign epistasis might be the mechanism by which these cancer mutations are mutually exclusive.

Materials and Methods

Source of adaptive clones

The details of the experimental setup yielding the adaptive clones used in the present study have been described previously [25]. Briefly, three strains of haploid S288c that are isogenic, except that each expresses a different fluorescent protein constitutively (GFP, YFP or DsRed), were seeded in equal quantities in a 20 ml chemostat device. The population was evolved for 448 generations at steady state under glucose limitation (0.08%) at a dilution rate of 0.2 h⁻¹. During this evolution, the proportions of the three colored lineages were tracked using flow cytometry. At five points throughout the evolution experiment, the population was sorted into its component colored subpopulations using fluorescence activated cell sorting (FACS), and 7 clones from the visibly adaptive subpopulation were isolated, competed to determine the clones' fitnesses, and the most fit clone was selected for subsequent investigations. The five resulting clones are labeled M1–M5 (Table S3 and Figure 1).

Strains and growth conditions

Strains used and constructed in this study are shown in Table S3. All batch and competitive chemostat cultures were grown as described previously [25].

Sequencing and mutation determination

Adaptive clones M1–M5 (GSY1171, GSY1180, GSY1194, GSY1200, GSY1208) and an ancestral strain (GSY1135) were single-end sequenced using the Illumina Genome Analyzer (GAI or GAII). Single-end sequencing libraries were constructed for each clone using the Illumina Genomic DNA sample prep kit from 5 µg of genomic DNA and each library was sequenced on 2 flow cell lanes. Sequence analysis was performed as follows, using default parameters unless otherwise noted. Reads were mapped to the S288c reference genome (downloaded from SGD on Dec 3, 2008) using bwa-short in BWA v0.5.7 [47] and variants were found with SAMtools v0.1.7-6 [48] and filtered with SAMtools varFilter (-d 5; -D 100,000; -S 20; -i 50). For each genomic position, if the ancestral strain and adaptive clone shared the consensus genotype, the variant was eliminated. The resulting variants were filtered unless they passed the following heuristic filters. SNP: proportion of “N” bases covering position <0.1; majority non-reference base must be >80% of all non-reference bases at the position; when comparing ancestral to evolved, the proportion of non-reference bases must be <0.1 in one strain and >0.5 in the other. Indels: coverage >10; indel calls must make up >50% of coverage; proportion of the sum of two most frequent indel calls >0.8; proportion of reference matches <0.5; when comparing ancestral to evolved, the difference in proportion for shared alleles must be >0.3. All novel SNPs were confirmed by Sanger sequencing (primers in Table S4).

Three SNPs were excluded from further consideration because they were present in both the ancestral strain and each evolved strain of a particular color: chr02 : 353579 (C to A) in GSY1136 and M1; chr09 : 275382 (PKP1, G to T) in GSY1135, M2 and M4; and chr16 : 401704 (MOT1, G to C) in GSY1137, M3 and M5. In addition, the COX18 mutation found in M2 and M4 as reported in [25] was also excluded, because it was determined the mutation was already fixed in the red subpopulation at the earliest sampling. Thus the mutation was not the result of evolution in the chemostat but was most likely a random mutation in the colony used to initiate the red population in the chemostat; we have also determined that this mutation is not adaptive (data not shown).

Assessing the copy number of the HXT6/7 array

Real-time quantitative PCR (qPCR) of genomic DNA (gDNA) was used to determine the copy number of the HXT6/7 coding region, as well as the HXT7 promoter, using a previously described protocol [49] and primers listed in Table S4. Results were normalized to the copy number of UBP1, a non-varying locus also on chromosome 4, to control for slight differences in input gDNA amount. Experiments were performed in triplicate. 95% confidence intervals were calculated as mean ± 1.96 * SEM.

HXT6/7 copy number was also visualized using sequencing coverage. Adaptive clone coverage was normalized to the average coverage of the ancestral strain and this was divided by the ancestral strain coverage. A running median was calculated over this proportion to smooth the data.

Construction of strains for competition experiments

M1–M5 were backcrossed to a wild-type S288c ancestral strain containing the same fluorescent protein (GSY1221–1223). Diploids were sporulated and dissected, and resulting spores were genotyped for mutations using allele-specific colony PCR (primers in Table S4). The spores were also tested for mating type by cross-stamping two tester strains (GSY2476 and GSY2670) onto YPD master plates of the dissections, grown overnight, followed by replica plating and selection for mated diploids on SC-ura+G418 plates. Backcrossing was repeated as often as necessary to get a single mutation segregating per cross (Figure S6). Double mutants for epistasis experiments were constructed by mating haploid single mutant strains and the resulting diploids were sporulated, dissected and genotyped.

Pairwise competition experiments

Pairwise competitive chemostats were performed as described [25], but were sampled every 6 h over 20–25 generations. Single mutation competition experiments were performed in at least biological triplicate. As a control, we also performed competition experiments in at least biological triplicate of wild-type sister spores derived from the same backcross. Selection coefficients were calculated as described [25], and normalized by subtracting the wild-type mean selection coefficient from the mutant mean selection coefficient for each mutation. Fitness was calculated relative to the competing wild-type strain, such that . The mth1/(HXT6/7) epistasis competitive chemostats were performed in at least biological duplicate, and the remaining epistasis competitive chemostats were performed once. For each of these experiments, a wild-type strain from the same dissection was used as a control. The linear phase of growth was determined, and the selection coefficient was taken as the slope of the linear regression line as described [35] and normalized as above. 95% confidence intervals were inferred for the slope of the regression using the confint() function in R. Analysis of covariance (ANCOVA) was used to determine if the selection coefficient of the mutant strain was significantly different than the wild-type control. In this analysis, a model in which the mutant and wild-type experiments were allowed independent slopes was compared to a model in which they had a common slope. Epistasis was quantified as , where s is the normalized selection coefficient and x and y are mutant alleles of two different genes [6]. Error of epsilon was calculated by the method of error propagation: [22], and a 95% confidence interval of epsilon was calculated as . If the value of epsilon fell outside this confidence interval, we considered there to be significant epistasis between alleles x and y.

Assessing the additive effects of individual adaptive mutations

For each adaptive clone, the selection coefficients of all individually adaptive mutations derived from that clone were added and a pseudo relative fitness was calculated from the selection coefficient as above. This was compared to the relative fitness data for each clone from [25]. To make this comparison quantitatively, we used the epistasis framework above to calculate a value and confidence interval for epsilon. If indeed there are additional adaptive mutations, this would manifest itself as a positive value for epsilon, as the additive effects of the individual mutations would be less than the fitness of the clone.

Population allele frequencies

Quantitative Sanger sequencing [35] and the software PeakPicker [50] with default settings were used to determine the frequency of mth1-3 in triplicate throughout the evolution.

Data availability

The Illumina sequence data are available at the NCBI Sequence Read Archive under accession SRA020606.1.

Supporting Information

Zdroje

1. WrightS 1932 The roles of mutation, inbreeding, crossbreeding and selection in evolution. Proceedings of The Sixth Congress on Genetics 356 366

2. WrightS 1988 Surfaces of Selective Value Revisited. The American Naturalist 131 115 123

3. WhitlockMCPhillipsPCMooreFBGTonsorSJ 1995 Multiple Fitness Peaks and Epistasis. Annu Rev Ecol Syst 26

4. GavriletsS 2004 Fitness Landscapes and the Origin of Species; LevinSAHornHS Princeton, NJ Princeton University Press

5. BrodieED 2000 Why Evolutionary Genetics Does Not Always Add Up. WolfJBBrodieEDWadeMJ Epistasis and the Evolutionary Process New York Oxford University Press

6. PhillipsPC 2008 Epistasis–the essential role of gene interactions in the structure and evolution of genetic systems. Nat Rev Genet 9 855 867

7. PoelwijkFJKivietDJWeinreichDMTansSJ 2007 Empirical fitness landscapes reveal accessible evolutionary paths. Nature 445 383 386

8. PovolotskayaISKondrashovFA 2010 Sequence space and the ongoing expansion of the protein universe. Nature 465 922 926

9. AndersonJBFuntJThompsonDAPrabhuSSochaA 2010 Determinants of divergent adaptation and dobzhansky-muller interaction in experimental yeast populations. Curr Biol 20 1383 1388

10. PresgravesDC 2010 The molecular evolutionary basis of species formation. Nat Rev Genet 11 175 180

11. BartonNHCharlesworthB 1998 Why sex and recombination? Science 281 1986 1990

12. de VisserJAElenaSF 2007 The evolution of sex: empirical insights into the roles of epistasis and drift. Nat Rev Genet 8 139 149

13. CordellHJ 2009 Detecting gene-gene interactions that underlie human diseases. Nat Rev Genet 10 392 404

14. WeinreichDMWatsonRAChaoL 2005 Perspective: Sign epistasis and genetic constraint on evolutionary trajectories. Evolution 59 1165 1174

15. PoelwijkFJTanase-NicolaSKivietDJTansSJ 2011 Reciprocal sign epistasis is a necessary condition for multi-peaked fitness landscapes. J Theor Biol 272 141 144

16. WeissmanDBDesaiMMFisherDSFeldmanMW 2009 The rate at which asexual populations cross fitness valleys. Theor Popul Biol 75 286 300

17. LozovskyERChookajornTBrownKMImwongMShawPJ 2009 Stepwise acquisition of pyrimethamine resistance in the malaria parasite. Proc Natl Acad Sci U S A 106 12025 12030

18. LunzerMMillerSPFelsheimRDeanAM 2005 The biochemical architecture of an ancient adaptive landscape. Science 310 499 501

19. OrtlundEABridghamJTRedinboMRThorntonJW 2007 Crystal structure of an ancient protein: evolution by conformational epistasis. Science 317 1544 1548

20. WeinreichDMDelaneyNFDepristoMAHartlDL 2006 Darwinian evolution can follow only very few mutational paths to fitter proteins. Science 312 111 114

21. LunzerMGoldingGBDeanAM 2010 Pervasive cryptic epistasis in molecular evolution. PLoS Genet 6 e1001162 doi:10.1371/journal.pgen.1001162

22. TrindadeSSousaAXavierKBDionisioFFerreiraMG 2009 Positive epistasis drives the acquisition of multidrug resistance. PLoS Genet 5 e1000578 doi:10.1371/journal.pgen.1000578

23. de VisserJAParkSCKrugJ 2009 Exploring the effect of sex on empirical fitness landscapes. Am Nat 174 Suppl 1 S15 30

24. PoelwijkFJKivietDJTansSJ 2006 Evolutionary potential of a duplicated repressor-operator pair: simulating pathways using mutation data. PLoS Comput Biol 2 e58 doi:10.1371/journal.pcbi.0020058

25. KaoKCSherlockG 2008 Molecular characterization of clonal interference during adaptive evolution in asexual populations of Saccharomyces cerevisiae. Nat Genet 40 1499 1504

26. BrownCJToddKMRosenzweigRF 1998 Multiple duplications of yeast hexose transport genes in response to selection in a glucose-limited environment. Mol Biol Evol 15 931 942

27. ElenaSFLenskiRE 2003 Evolution experiments with microorganisms: the dynamics and genetic bases of adaptation. Nat Rev Genet 4 457 469

28. MullerHJ 1964 The Relation of Recombination to Mutational Advance. Mutat Res 106 2 9

29. LangGIMurrayAW 2008 Estimating the per-base-pair mutation rate in the yeast Saccharomyces cerevisiae. Genetics 178 67 82

30. BoyleEIWengSGollubJJinHBotsteinD 2004 GO::TermFinder–open source software for accessing Gene Ontology information and finding significantly enriched Gene Ontology terms associated with a list of genes. Bioinformatics 20 3710 3715

31. LitiGCarterDMMosesAMWarringerJPartsL 2009 Population genomics of domestic and wild yeasts. Nature 458 337 341

32. LafuenteMJGancedoCJauniauxJCGancedoJM 2000 Mth1 receives the signal given by the glucose sensors Snf3 and Rgt2 in Saccharomyces cerevisiae. Mol Microbiol 35 161 172

33. BarrickJEYuDSYoonSHJeongHOhTK 2009 Genome evolution and adaptation in a long-term experiment with Escherichia coli. Nature 461 1243 1247

34. de VisserJALenskiRE 2002 Long-term experimental evolution in Escherichia coli. XI. Rejection of non-transitive interactions as cause of declining rate of adaptation. BMC Evol Biol 2 19

35. GreshamDDesaiMMTuckerCMJenqHTPaiDA 2008 The repertoire and dynamics of evolutionary adaptations to controlled nutrient-limited environments in yeast. PLoS Genet 4 e1000303 doi:10.1371/journal.pgen.1000303

36. CoyneJABartonNHTurelliM 1997 A Critique of Sewall Wright's Shifting Balance Theory of Evolution. Evolution 51 643 671

37. GavriletsS 1997 Evolution and Speciation on Holey Adaptive Landscapes. Trends in Ecology and Evolution 12 307 312

38. WhitlockMC 1995 Variance-Induced Peak Shifts. Evolution 49 252 259

39. GavriletsS 1999 A Dynamical Theory of Speciation on Holey Adaptive Landscapes. American Naturalist 154 1 22

40. WhibleyACLangladeNBAndaloCHannaAIBanghamA 2006 Evolutionary paths underlying flower color variation in Antirrhinum. Science 313 963 966

41. BlountZDBorlandCZLenskiRE 2008 Historical contingency and the evolution of a key innovation in an experimental population of Escherichia coli. Proc Natl Acad Sci U S A 105 7899 7906

42. DobzhanskyT 1951 Genetics and the Origin of Species New York Columbia University Press

43. MullerHJ 1942 Isolating mechanisms, evolution and temperature. Biol Symp 6 71 125

44. DettmanJRSirjusinghCKohnLMAndersonJB 2007 Incipient speciation by divergent adaptation and antagonistic epistasis in yeast. Nature 447 585 588

45. VogelsteinBKinzlerKW 2004 Cancer genes and the pathways they control. Nat Med 10 789 799

46. YunJRagoCCheongIPagliariniRAngenendtP 2009 Glucose deprivation contributes to the development of KRAS pathway mutations in tumor cells. Science 325 1555 1559

47. LiHDurbinR 2009 Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25 1754 1760

48. LiHHandsakerBWysokerAFennellTRuanJ 2009 The Sequence Alignment/Map format and SAMtools. Bioinformatics 25 2078 2079

49. HoebeeckJSpelemanFVandesompeleJ 2007 Real-time quantitative PCR as an alternative to Southern blot or fluorescence in situ hybridization for detection of gene copy number changes. Methods Mol Biol 353 205 226

50. GeBGurdSGaudinTDoreCLepageP 2005 Survey of allelic expression using EST mining. Genome Res 15 1584 1591

51. WengerJWSchwartzKSherlockG 2010 Bulk segregant analysis by high-throughput sequencing reveals a novel xylose utilization gene from Saccharomyces cerevisiae. PLoS Genet 6 e1000942 doi:10.1371/journal.pgen.1000942

52. WinstonFDollardCRicupero-HovasseSL 1995 Construction of a set of convenient Saccharomyces cerevisiae strains that are isogenic to S288C. Yeast 11 53 55