Abstract
Gene products that are beneficial in one environment may become burdensome in another, prompting the emergence of diverse regulatory schemes that carry their own bioenergetic cost. By ensuring that regulators are only expressed when needed, we demonstrate that autoregulation generally offers an advantage in an environment combining mutation and time-varying selection. Whether positive or negative feedback emerges as dominant depends primarily on the demand for the target gene product, typically to ensure that the detrimental impact of inevitable mutations is minimized. While self-repression of the regulator curbs the spread of these loss-of-function mutations, self-activation instead facilitates their propagation. By analyzing the transcription network of multiple model organisms, we reveal that reduced bioenergetic cost may contribute to the preferential selection of autoregulation among transcription factors. Our results not only uncover how seemingly equivalent regulatory motifs have fundamentally different impact on population structure, growth dynamics, and evolutionary outcomes, but they can also be leveraged to promote the design of evolutionarily robust synthetic gene circuits.
Similar content being viewed by others
Introduction
Much effort has been devoted to uncovering organizing principles of living cells in systems biology, and also to devising design guidelines to ensure the predictable behavior of cellular dynamics in synthetic biology1,2,3. As a result, we not only better understand the processes underpinning bacterial chemotaxis perfected by evolution4, but also how to implement integral feedback to ensure robust perfect adaptation5,6,7,8. These results are generally made possible by interpreting complex dynamical systems as a collection of core components wired together, each realizing a well-defined and highly-optimized information processing function9,10,11. While this view offers a powerful reductionist approach to design and analyze networks of daunting complexity, recent results also highlight its limits as recurring motifs can exhibit a wide range of dynamical responses depending on their biophysical parameters and context12,13,14,15,16,17,18,19,20,21.
Among common network motifs, activation and repression are the most fundamental building blocks. They are functionally equivalent (Fig. 1a), as the expression of a gene product can be regulated by relying on an inducer that either activates an activator (positive control), or relieves the repression of a repressor (negative control). Savageau proposed that according to the use-it-or-lose-it principle, positive/negative control emerges when gene products are often/rarely needed22,23,24, ensuring that cognate binding sites are occupied by the transcription factors (TFs) most of the time, thus minimizing the probability of fitness-reducing errors25. Conversely, as a mutated regulator represents a fitness cost only when it is needed, the wear-and-tear principle suggests that it may be evolutionary advantageous to instead minimize the usage of regulators to reduce the negative impact of eventually inevitable mutations26, motivated by the well-established population genetics concept of genetic robustness27,28,29,30.
While precise temporal control of a beneficial gene product may result in an advantage, the expression of the required regulator carries its own bioenergetic cost31. Crucially, for both positive and negative control, the regulator is only required when the inducer is present, otherwise its expression is gratuitous. Therefore, it may be advantageous to have the regulator under autoregulatory control to ensure it is expressed only when needed (Fig. 1b). Understanding the competition and evolutionary selection among the core regulatory motifs in Fig. 1 could shed light on organizing principles of living organisms as similar just-in-time regulation is a wide-spread feature in natural systems32,33,34,35,36, as well as guide the design of synthetic gene circuits when selecting among alternative modes of regulation37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52.
Motivated by the central role of autoregulation in systems and synthetic biology, we characterize how demand for a beneficial gene product, mutation rate of its regulator, population size, selection pressure, regulatory delay, and the timescale of environmental shifts together determine the optimal choice among the motifs in Fig. 1. We show that (i) autoregulation generally dominates, (ii) the dominant strategy typically agrees with the wear-and-tear principle, and (iii) while self-repression of the regulator curbs the spread of loss-of-function mutations, self-activation instead facilitates their propagation. We further demonstrate that the reduced bioenergetic cost of autoregulation may contribute to its ubiquitous nature in gene regulatory networks, and how our work could aid the design of evolutionarily robust synthetic gene circuits.
Results
Mathematical model
Building on a quantitative framework26 inspired by demand theory22,23,24, the mathematical model underpinning our analysis comprises the changing environment, random mutations, and fitness-based selection. Typical values of the model parameters are discussed in the “Methods”.
We consider an evolutionary scenario where cells are exposed to environmental variations. This is modeled via the concentration of the inducer I that varies periodically between high and low values (Fig. 2a). Within each period T, these correspond to the induced and non-induced phases lasting Ti and Tni, respectively. During the former, expression of a gene product P confers a fitness advantage, whereas during the latter its unnecessary synthesis a fitness cost. The fraction D = Ti/T hence measures the demand for the beneficial gene product.
In this evolutionary setting, loss-of-function mutations that affect the regulator R result in non-functional variants. The emergence of these non-binders occurs at rate ν− from a functional binder, whereas gain-of-function mutations happen at rate ν+ (Fig. 2b). We assume that these mutation rates are constant and independent of the mode of gene regulation. We consider the population size N to remain constant over time, and denote the size of the binder and non-binder sub-populations with Nb(t) and Nnb(t), respectively.
Finally, each period alternates between selection and neutral phases lasting Ts and Tn, respectively (Fig. 2c). During the former, non-binders suffer the cost sp > 0 due to either the presence of P during Tni for negative control, or its absence during Ti for positive control. Additionally, cells incur the expense sr > 0 when the regulator R is synthesized. Selection pressure hence stems from two sources: expression of P not matching the environmental condition (P-cost), and expression of R (R-cost). For non-autoregulated control, binders and non-binders suffer identical R-cost throughout the entirety of each period, thus selection against non-binders occurs solely based on their non-zero P-cost. Autoregulation of R reduces the time when the R-cost is suffered (Fig. 2c), thus holding the potential to provide an evolutionary advantage over non-autoregulated control (Supplementary Figs. 1 and 2). Therefore, we next quantify the average fitness cost of each core regulatory motif in Fig. 1 to compare their performance.
Fitness cost in large populations
We first quantify the performance of the control schemes in large populations, when sampling fluctuations are negligible. The evolution of the fraction x = Nnb/N of non-binders in the population is governed by the deterministic dynamics (Supplementary Section 2)
where s(t) = snb(t) − sb(t) is the relative selection pressure against non-binders, with sb(t) and snb(t) denoting the fitness cost encountered by the binders and non-binders, respectively (Fig. 2c). Thus, over each period the average fitness cost \(\bar{s}\) is
Considering only non-autoregulated control, activation/repression dominates at low/high demand for weak selection (Fig. 3a). For intermediate values of the demand (Supplementary Figs. 4–6), the two control schemes offer comparable performance for short periods and when control is expensive (sr ≈ sp). Selection pressure further amplifies this effect (Supplementary Figs. 7–9), and we recover the results presented in ref. 26: when the fraction of non-binders does not appreciably change as a result of frequent environmental shifts (Supplementary Section 1), positive and negative control perform similarly, otherwise activation/repression dominates for low/high demand, matching the wear-and-tear principle.
When considering all motifs in Fig. 1, non-autoregulated control is generally replaced by its autoregulated counterpart as the dominant strategy (Fig. 3b). However, two major differences do emerge. First, for low demand, while non-autoregulated activation and repression often have comparable performance (especially for strong selection, Fig. 3a), the parameter region where self-activation emerges as a clear winner expands significantly due to the elimination of the R-cost for non-binders (Fig. 2c). Second, when demand is high, control is expensive (sr ≈ sp), and the combined impact of mutations and selection is weak (Supplementary Section 1), self-activation emerges as dominant (red star in Fig. 3b). To understand this, consider the case when sr = sp = s0, yielding the fitness cost Ds0 for self-activation (Fig. 2) and Ds0 + 2x0(1 − D) > Ds0 for self-repression where x0 is the approximately constant value of x throughout the period (Supplementary Section 2). Thus, while the winner emerges according to the wear-and-tear principle when only non-autoregulated control is considered (Fig. 3a), the dominant strategy may be aligned with the use-it-or-lose-it principle in the presence of autoregulation (Fig. 3b). The region where this occurs shrinks with increasing demand (Supplementary Figs. 10–15).
Fitness cost in small populations
The impact of sampling fluctuations (genetic drift) becomes more pronounced as the population size decreases, thus we next characterize performance in the presence of stochastic effects. To this end, we first consider the standard Wright-Fisher model with constant population size N53,54, then its diffusion approximation55 as it significantly accelerates the computation of the fitness cost without compromising accuracy (Supplementary Fig. 16).
Focusing on non-autoregulated control first, it was previously reported that while the wear-and-tear principle dominates in large populations, it is replaced by the use-it-or-lose-it principle as N decreases26. Although this can happen, it only occurs when selection pressure is strong (sp ≫ ν−), and there is a reversal as the population size further decreases (Fig. 4a), which is not discussed in ref. 26. The source of these two transitions is the varying frequency of non-binders occasionally taking over during neutral periods due to sampling fluctuations (unlike in large populations). The prevalence of such events is inversely proportional to the population size, the duration of the selection phase, and the selection pressure, and they carry a significant penalty as binders only slowly re-emerge as a result of rare gain-of-function mutations.
To better understand this phenomenon, consider the low demand case (the high demand case can be analyzed similarly, with positive and negative regulation swapped). As population size starts to decrease, non-binders take over more frequently for activation than for repression due to the shorter selection period (Supplementary Fig. 18), giving rise to the region where the dominant strategy is consistent with the use-it-or-lose-it principle (blue star in Fig. 4a). A similar shift occurs for negative control as well, only at lower population size due to the longer selection phase, and the corresponding substantial fitness cost increase is what drives the reversal to positive control re-emerging as the dominant strategy, this time at the small population limit (Fig. 4a). These transitions happen only when selection pressure is sufficiently strong to ensure that non-binders are eliminated in large populations.
When considering all core motifs featured in Fig. 1, the picture that emerges in Fig. 4b is qualitatively similar when environmental shifts happen frequently (T ≪ 1/ν−) and when they occur rarely (T ≫ 1/ν−). For low demand, the trend echoes our previous findings in Fig. 4a when considering only non-autoregulated control. The region where the dominant strategy is underpinned by the use-it-or-lose-it principle expands with selection pressure, and it is sandwiched between population sizes where it is replaced by the wear-and-tear principle from both below (Fig. 4b) and above (Fig. 3b), only this time self-activation and self-repression dominate instead of their non-autoregulated counterparts due to their reduced R-cost (Fig. 2c). While in the high demand case the situation is similar (with self-activation and self-repression swapped), one crucial difference does emerge: in the small population limit non-autoregulated and autoregulated repression have comparable performance. To understand this, note that unlike for activation where autoregulation decreases the fitness cost by sr throughout the entire period, for repression there is no reduction in case of non-binders (Fig. 2c), yielding comparable performance for non-autoregulated and autoregulated control. This effect becomes more pronounced with decreasing selection pressure (Fig. 4b), which increases the probability of non-binders taking over the population due to stochastic fluctuations (Supplementary Figs. 19 and 20).
In summary, the presence of sampling fluctuations largely preserves the wear-and-tear principle behind the dominant strategy considering both non-autoregulated and autoregulated control. The use-it-or-lose-it principle emerges only in a narrow slice of the parameter space (which further decreases with selection pressure), for instance, within a confined range of the population size. Thus, in addition to providing a more complete picture about the competition between the two non-autoregulated motifs by revealing the reversal to the wear-and-tear principle in the small population limit, our results also significantly expand prior work26 by comparing the performance of all four core regulatory motifs featured in Fig. 1 in the presence of genetic drift.
Autoregulation can result in unwanted selection pressure
By eliminating the gratuitous expression of the regulator during the non-induced phase (Fig. 2c), autoregulation generally outperforms its non-autoregulated counterpart for both activation and repression (Figs. 3 and 4). For positive and negative control, however, autoregulation has drastically different impact on the population composition, as well as on how rapidly it changes.
To illustrate the differential impact of autoregulation on the fraction of non-binders x, consider first non-autoregulated control. Increasing sr represents an additional and identical fitness cost for both binders and non-binders (Fig. 2c), hence the population-level composition remains unaffected, since the evolution of x according to (1) depends on the difference s = snb − sb. Conversely, while autoregulation yields s = 0 during the neutral phase, it results in s = sw with sw = sp − sr < sp and sw = sp + sr > sp during selection for self-activation and self-repression, respectively (Fig. 5), compared to sw = sp for non-autoregulated control. This change in selection pressure thus means (i) more stringent elimination of the non-binders when relying on self-repression, and (ii) the accumulation of loss-of-function mutations in case of self-activation (Fig. 5). The latter is especially concerning considering the fitness gain of self-activation relative to its non-autoregulated counterpart: should these two variants compete, the former would eventually take over the population, however, it could easily result in one dominated by non-binders (unlike in the case of self-repression where the R-cost instead promotes the elimination of deleterious mutations).
Feedback also has a differential impact on how rapidly non-binders emerge and disappear in case of positive and negative autoregulation. To illustrate this, we define \({T}^{* }=\max ({T}_{b},{T}_{w})\) where Tb = 1/ν− and Tw = 1/(sw + ν+) are the timescales for the build-up and wipe-out of non-binders due to mutations and selection. For short periods (T ≪ T *), the fraction of non-binders remains approximately constant throughout the entire period: non-binders are eliminated if Tw < Tb, otherwise they take over the population (Supplementary Fig. 24). For long periods (T ≫ T *), if Tw < Tb then build-up of non-binders during Tn is wiped out during Ts, otherwise non-binders persist throughout the entire period as loss-of-function mutations dominate the combined impact of selection and gain-of-function mutations during Ts (Supplementary Fig. 24). Considering the typical range of model parameters (“Methods”), for all four regulatory schemes featured in Fig. 1 we have T * = Tb (Supplementary Section 4). Importantly, while Tw decreases with sr in case of self-repression, autoregulation has the opposite impact for positive control, further hindering the elimination of non-binders from the population.
Delay can cause non-autoregulated motifs to outperform autoregulation
For autoregulated motifs, the inducer also triggers the appearance and disappearance of the regulator. Since its synthesis and decay may take time, considerable delays could be introduced56, resulting in an increased fitness cost when compared to the idealized scenario outlined in Fig. 2c, especially when feedback is realized in the form of regulatory cascades57. The negative impact of delay on the performance of autoregulation is illustrated in Fig. 6.
For short periods (T ≪ T *), both selection and mutation play a negligible role throughout each period, thus the fraction of non-binders x remains approximately constant. Hence, in this case delay has no impact on the performance of autoregulated motifs: the dominant strategy in Fig. 6 remains unchanged when compared to the case without delay (Fig. 3b).
For long periods (T ≫ T *), the combined impact of mutation and selection can impact x, which may result in non-autoregulated control outperforming autoregulation. For instance, in case of low demand, the fitness cost of non-autoregulated and autoregulated activation is approximately sr and Dsp, respectively (Supplementary Fig. 24). Therefore, while self-activation offers superior performance when regulation is expensive (sr ≈ sp), non-autoregulated activation can dominate if control is instead affordable (sr ≪ sp). These results hold for both weak and strong selection (Fig. 6), and also in the presence of stochastic fluctuations due to small population size (Supplementary Fig. 25). The high demand case can be analyzed similarly.
In the special case when T ≈ T *, autoregulation consistent with the use-it-or-lose-it principle emerges as dominant when selection pressure is sufficiently strong (stars in Fig. 6). To understand why this happens, here we focus on the low demand case when regulation is affordable (sr ≪ sp), other scenarios can be analyzed similarly. For self-repression, non-binders are eliminated (x ≈ 0) during the entire period (Supplementary Fig. 26) as a result of strong and long selection (due to low demand), yielding the average fitness cost \(\bar{s} \, \approx \, D{s}_{r}\) (Fig. 2c). Conversely, for self-activation there is alternating build-up and wipe-out during Tn and Ts due to the shorter selection phase, resulting in \(\bar{s} \, \approx \, {\bar{x}}_{s}D{s}_{p}\) where \({\bar{x}}_{s}\) denotes the average of x during Ts (Supplementary Fig. 26). Crucially, \({\bar{x}}_{s}\) increases with the delay, which is the driving force behind self-repression offering superior performance compared to self-activation, and eventually emerging as the dominant strategy for sufficiently strong selection (blue star in Fig. 6). Delay thus has the greatest impact in the special case when T ≈ T * by triggering the emergence of regions where the dominant strategy is underpinned by the use-it-or-lose-it principle (marked by stars in Fig. 6). These regions expand with increasing selection pressure sp, and they can appear even in the presence of brief delays (Supplementary Fig. 31).
Feedback cost can cause non-autoregulated motifs to outperform autoregulation
In addition to introducing delay, autoregulation can also result in additional burden due to the increased expression of the regulator R that may be required to control its own expression14,15. To capture this, we next assume that while the R-cost for non-autoregulated control remains sr, for autoregulation it instead increases to asr with a ≥ 1 (Fig. 7a).
To understand how this change impacts the dominant strategy, we first focus on the case when mutations have minimal impact, so that the population consists entirely of binders. As a result, the P-cost is zero for all four schemes. Furthermore, positive and negative control have identical R-cost, both when considering non-autoregulated and autoregulated motifs (Fig. 2c). While autoregulation reduces the time when the R-cost is suffered as demand decreases, non-autoregulated strategies outperform their autoregulated counterparts once feedback becomes too expensive (Fig. 7a). These results hold even when mutations are considered. In particular, data in Supplementary Figs. 32–34 confirm that (i) the pattern of how the use-it-or-lose-it and the wear-and-tear principles emerge behind the dominant strategies remains largely unaffected, (ii) non-autoregulated strategies can outperform their autoregulated counterparts as a increases, and (iii) this transition happens at lower values of a as the demand increases.
As the R-cost for autoregulation increases with the duration of the induced phase Ti = DT (Fig. 2c), we expect that the sensitivity of the dominant autoregulated strategy to changes in a also increases with the demand D, confirmed in Fig. 7b. In particular, while the fitness cost of self-activation increases only slightly when compared to its non-autoregulated counterpart in the low demand case, performance of self-repression in the high demand limit quickly degrades. Furthermore, the results in Fig. 7b echo our finding in Fig. 5 about increasing the R-cost (this time via a instead of sr), giving rise to a differential impact on population composition: more stringent selection against non-binders when relying on self-repression, and the spread of loss-of-function mutations in case of self-activation. This also reveals a trade-off between the fraction of non-binders x and the margin in a for preserving the dominant strategy. For instance, when it is underpinned by the wear-and-tear principle, in the low demand case a can be increased substantially and self-activation would still emerge as dominant (e.g., it remains only about 15% as expensive as non-autoregulated positive control when a = 2 in Fig. 7b), although at the price of increased prevalence of non-binders. Conversely, in the high demand case negative autoregulation promotes the elimination of deleterious mutations, however, the margin in a is considerably smaller to preserve the dominance of self-repression (e.g., non-autoregulated negative control dominates its autoregulated counterpart once a > 1.07 in Fig. 7b).
Reduced bioenergetic cost may contribute to the prevalence of autoregulation in model organisms
We next turn our attention to the transcriptional regulatory network of B. subtilis, C. glutamicum, and E. coli, model organisms with the most comprehensive data availability58,59 (Supplementary Fig. 35). Our results suggest that self-activation and self-repression should be prominently featured among regulators, hence in what follows we concentrate on whether the reduced bioenergetic cost of autoregulated control schemes could indeed confer an evolutionary advantage, and thus contribute to their high prevalence60,61.
To this end, we first note that autoregulation is preferentially selected by evolution when choosing regulators. While the average prevalence of autoregulation is only 27% among TFs (each TF is only counted once), it is instead 35% among all regulators (each TF is counted as many times as it appears as an activator/repressor), with the overrepresentation ranging between 2–16 percentage points (Fig. 8a). This is not surprising, as autoregulated TFs offer a multitude of beneficial properties62,63,64,65,66,67. Importantly, if R-cost reduction was a negligible factor, we would expect identical prevalence among all regulators and among the subset that control gene targets synergistically (Fig. 8b), otherwise the frequency in the latter should be greater (Supplementary Section 7). Thus, for each organism our two samples of interest comprise regulators of genes that are either positively or negatively controlled, focusing on 86%, 50%, and 40% of target genes in B. subtilis, C. glutamicum, and E. coli, respectively (Supplementary Table 2).
Using the frequency of autoregulation among TFs as a baseline, the overrepresentation of this motif is greater among synergistic regulators (Fig. 8c) than among all regulators (Fig. 8a): the mean difference is 25 percentage points, with the overrepresentation ranging between 18–42 percentage points (Supplementary Table 2). Therefore, beneficial properties of autoregulation yield their preferential selection for regulators (Fig. 8a), but crucially, the frequency of autoregulation among synergistic regulators further exceeds this modified and elevated baseline by 17 percentage points on average (Fig. 8d), with the overrepresentation ranging between 6–28 percentage points (Supplementary Table 2). To quantify the significance of the differences, we performed standard two-sample location tests across all datasets to compute the probability of the modified baseline (autoregulation among all regulators) and the samples of interest (autoregulation among synergistic regulators) coming from the same distribution (leveraging not only the frequencies but also the sample sizes). The resulting p-values are all smaller than 0.001, suggesting that the differences are statistically significant across all three organisms for both self-activation and self-repression, and that the underlying distributions are likely different.
In summary, autoregulated motifs are overrepresented among synergistic regulators compared to their frequency among all regulators. This suggests that autoregulation likely offers additional beneficial properties in the former case, leading to their even stronger preferential selection. Our results highlight that reduced R-cost offers an appealing explanation as one of the factors that contribute to establishing the prominent role of autoregulation.
Discussion
Given the prevalence of autoregulation in transcription networks60,61, it is not surprising that this network motif offers numerous advantages. For instance, self-repression can accelerate temporal responses62,63 and reduce stochastic fluctuations64,65, whereas self-activation underpins cellular memory66 and helps cell populations to maintain a mixed phenotype to assure optimal performance in stochastic environments67. As autoregulation ensures that a TF is only expressed when needed, our results highlight that the abundance of this motif may also stem from its reduced bioenergetic and fitness cost relative to non-autoregulated control.
We confirm this both in small and large populations by demonstrating that non-autoregulated regulation practically never dominates its autoregulated counterpart except when the additional cost of feedback is substantial or in the presence of regulatory delays. The dominant strategy is generally in accordance with the wear-and-tear principle, the use-it-or-lose it principle emerges only in a narrow region of population size and timescale of environmental shifts. Our work thus further highlights that demand for a beneficial gene product and whether positive or negative regulation offers superior performance are tightly coupled. These results considerably expand our prior understanding focusing only on non-autoregulated control26, and may explain why self-activation and self-repression show strong clustering and preferential localization across functional subsystems (Supplementary Fig. 38) with different temporal demand profiles68.
As the performance of autoregulation degrades with delay, it is understandable that the transcription network of E. coli has an essentially feedforward structure, where feedback occurs primarily in the form of autoregulation69,70. Further strengthening the connection between evolutionary selection and autoregulation, the latter most likely emerged as a result of gene duplication71, a core factor in the origin of mutational robustness72 resulting in the accumulation of phenotypically cryptic genetic variation73 promoting evolvability74. This link between autoregulation and mutational robustness was confirmed by recent experimental advances, suggesting that regulatory feedback may be an important element of the network architectures that confer mutational robustness across biology75.
The results presented in this paper focus on the impact of autoregulation by putting the spotlight on fitness advantage, as this factor plays a pivotal role in establishing the dominant strategies that emerge as a result of competition and selection. In synthetic biology applications, however, while the fitness cost can be crucial in certain scenarios, e.g., when studying the cost of plasmid acquisition and maintenance76, often the rate at which non-binders emerge is instead of primary concern to avoid compromising the genetic stability of engineered synthetic circuits77. Importantly, our approach can be applied to characterize how autoregulation impacts the proliferation of deleterious mutations considering timescales and population sizes typical in synthetic biology applications (Supplementary Section 8) ranging from microfluidics experiments to bioreactor-based contexts78,79,80,81,82. Crucially, our work reveals that while self-activation facilitates the spread of non-binders, self-repression acts against this phenomenon, and these effects are amplified as the cost of regulation increases. Thus, our results not only uncover how genetic design choices alter population structure, growth dynamics, and evolutionary outcomes, but they can also be leveraged to minimize the prevalence of cells that harbor non-functional genetic modules to design evolutionarily robust synthetic gene circuits83,84.
Our model-based analysis raises several experimentally testable hypotheses. Does the wear-and-tear principle emerge in small populations and at the critical timescale of environmental shifts in case of sufficiently strong selection? Is it true that while negative autoregulation curbs the emergence of detrimental mutations, self-activation instead facilitates their spread? To test these predictions, all regulatory schemes featured here can be constructed using existing synthetic biology toolkits and parts85,86,87,88, and key variables can be conveniently tuned to reveal their role in establishing the dominant control scheme. Demand and period length can be varied by creating defined environments to control when the gene product is needed89,90 using automated cell culture systems81,82. Mutation rates can be adjusted by using UV radiation or CRISPR-guided mutagenesis91,92. The P-cost can be tuned by modulating selection pressure, for instance, via the composition of the growth media93,94,95,96, whereas the R-cost can be altered through codon (de)optimization97. Finally, delay can be modulated leveraging regulatory cascades57. By ensuring that experiments with maximal information content are selected98, it is possible to efficiently test whether the predicted regulatory mechanisms actually emerge as dominant strategies in each environment, thus to promote the design of biosystems that operate robustly under inevitable evolutionary forces99.
Biology has evolved powerful and creative solutions to control gene expression by selecting the optimal variant(s) among a wide array of competing control mechanisms. Our results reveal how the interplay of biophysical parameters and environmental factors together shape the emergence of dominant regulatory strategies. This can be leveraged both to shed light on evolutionary organizing principles underpinning the transcription networks of living organisms, and also to guide the design of synthetic gene circuits, for instance, when selecting among alternative modes of regulation to implement biomolecular controllers and insulation devices37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52 to facilitate the modular design of complex synthetic gene circuits.
Methods
Parameters
Wild-type E. coli grown under optimal conditions typically has mutation rates on the order of 10−3 mutations per genome per generation100. Considering the typical genome size of bacteria, this corresponds to approximately 10−9–10−8 mutations per base per generation101,102,103, though this rate may depend on population size104 and expression levels105. Furthermore, hypermutators with up to 104-fold greater mutation rates can occur under laboratory conditions, and more frequently in natural bacterial populations100. Assuming roughly 100 sensitive nucleotide positions, we thus estimate the rate of loss-of-function mutations to span the range ν− ≈ 10−7–10−3 per generation. Since gain-of-function mutations are assumed to be less probable, we consider ν+ = ν−/10 throughout the paper matching experimental estimates106. Selection intensity is notoriously hard to measure107,108,109,110, however, based on estimates for codon bias111, we consider sp/ν− = 10 and sp/ν− = 100 in case of weak and strong selection, respectively. Finally, we assume that sr < sp for a typical target gene, otherwise there would be no evolutionary selection pressure to regulate the expression of the product. Throughout the paper, the period T is measured in number of generations, whereas the mutation rates ν− and ν+ as well as the selection intensities sp and sr are all given in 1/generation.
Stochastic simulation of the evolutionary dynamics
The standard Wright-Fisher model assumes the following: discrete and non-overlapping generations with a constant population size N, with each member replaced in every generation53,54. Introduce s = snb − sb where snb and sb are the fitness cost incurred by the non-binders and binders, respectively, and let nt denote number of non-binders in the population in generation t (i.e., nt = 0, 1, 2, …, N). We first generate the number of gain-of-function and loss-of-function mutations m+ and m− from Poisson distributions with means ntν+ and (N − nt)ν−, respectively, so that the fraction of non-binders in the population becomes x = (nt + m− − m+)/N. The number of non-binders nt+1 in the next generation is drawn from a Binomial distribution with success probability \({x}^{{\prime} }=x-sx(1-x)/(1-sx)\) to account for the fitness difference s (thus the number of offsprings produced) between non-binders and binders26.
Periodic steady state distribution
Considering the periodic selection pressure in the evolutionary dynamics depicted in Fig. 2, the distribution P(x, t) of x approaches a periodic steady state distribution (subsequent to a transient that depends on the initial condition). This can be estimated for 0 < x < 1 considering \(\frac{\partial P(x,t)}{\partial t}=-\frac{\partial j(x,t)}{\partial x}\) with \(j(x,t)=-\frac{1}{2N}\frac{\partial }{\partial x}[x(1-x)P]+[{\nu }_{-}-({\nu }_{+}+{\nu }_{-}+s)x+s{x}^{2}]P\) using the diffusion approximation26. To compute the probabilities P(0, t) and P(1, t) at the boundaries x = 0 and x = 1, we consider the flux conditions \(\frac{\,{{\mbox{d}}}P(0,t)}{{{\mbox{d}}}\,t}=-j(0,t)\) and \(\frac{\,{{\mbox{d}}}P(1,t)}{{{\mbox{d}}}\,t}=j(1,t)\). After estimating the steady state distribution of P(x, t) using the algorithm developed in ref. 55 implementing the above steps, the average fitness cost \(\bar{s}\) during one period can be computed as in (2) with x(t) replaced by \({x}^{{\prime} }(t)=P(1,t)+\int\nolimits_{0}^{1}x(t)P(x,t)\,{{\mbox{d}}}\,x\), following the approach in ref. 26.
Statistical analysis
We perform two-sample location tests comparing the success rate (presence of positive/negative autoregulation) observed in a reference (regulators of genes) and our samples of interest (regulators of synergistically controlled genes). We assume that the underlying distributions are Binomial with nr and n trials (sample size) and θr and θ success rates in the reference and in the sample of interest, respectively. Therefore, the number of successes are given by Xr ~ Binom(nr, θr) and X ~ Binom(n, θ), respectively. With this, \({\hat{\theta }}_{r}={X}_{r}/{n}_{r}\) and \(\hat{\theta }=X/n\) are non-biased estimators of the unknown success rates.
With the null hypothesis H0 : θ = θr of identical success rates, we are interested in the probability \(p={{{{{\bf{P}}}}}}\left({{\Omega }}\ge \omega \,| \,{H}_{0}\right)\) of observing the value ω of the test statistic Ω (number of positively/negatively autoregulated regulators) at least as extreme as if the null hypothesis was true. As \({n}_{r}{\hat{\theta }}_{r},{n}_{r}(1-{\hat{\theta }}_{r}),\, n\hat{\theta },\, n(1-\hat{\theta }) > 5\) for all datasets we consider, from the Central Limit Theorem it follows that
Since θr and θ are unknown, and there are infinitely many choices that satisfy the null hypothesis, we follow the standard choice of using the pooled proportion \({\hat{\theta }}_{0}=({X}_{r}+X)/({n}_{r}+n)\) in place of both, as it satisfies the null hypothesis and it is consistent with our data. With this, we obtain that p ≈ 1 − Φ(z) where Φ( ⋅ ) is the cumulative distribution function of the standard normal distribution, together with \(z=(\hat{\theta }-{\hat{\theta }}_{r})/\sqrt{{\hat{\theta }}_{0}(1-{\hat{\theta }}_{0})({n}_{r}^{-1}+{n}^{-1})}\).
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
CoryneRegNet 7.0 data were downloaded from https://exbio.wzw.tum.de/coryneregnet/processToDownalod.htm58. PRODORIC data can be accessed using the API at https://www.prodoric.de/api/59. Data on the functional organization of the transcription regulatory network of E. coli were downloaded from https://www.pnas.org/doi/full/10.1073/pnas.1702581114112. Source data are provided with this paper.
Code availability
The manuscript does not rely on custom mathematical algorithms or software. Simulation data were generated and analyzed as described in the “Methods” using built-in MATLAB (version R2023a) functions. The MATLAB scripts used to obtain the results featured in the paper are publicly available at https://github.com/qbionet/evolutionary-selection. Additional information is available from the corresponding author upon request.
References
Hartwell, L. H., Hopfield, J. J., Leibler, S. & Murray, A. W. From molecular to modular cell biology. Nature 402, C47–C52 (1999).
Wolf, D. M. & Arkin, A. P. Motifs, modules and games in bacteria. Curr. Opin. Microbiol. 6, 125–134 (2003).
Wall, M. E., Hlavacek, W. S. & Savageau, M. A. Design of gene circuits: lessons from bacteria. Nat. Rev. Genet. 5, 34–42 (2004).
Yi, T.-M., Huang, Y., Simon, M. I. & Doyle, J. Robust perfect adaptation in bacterial chemotaxis through integral feedback control. Proc. Natl Acad. Sci. USA 97, 4649–4653 (2000).
Briat, C., Gupta, A. & Khammash, M. Antithetic integral feedback ensures robust perfect adaptation in noisy biomolecular networks. Cell Syst. 2, 15–26 (2016).
Aoki, S. K. et al. A universal biomolecular integral feedback controller for robust perfect adaptation. Nature 570, 533–537 (2019).
Khammash, M. H. Perfect adaptation in biology. Cell Syst. 12, 509–521 (2021).
Gupta, A. & Khammash, M. Universal structural requirements for maximal robust perfect adaptation in biomolecular networks. Proc. Natl Acad. Sci. USA 119, e2207802119 (2022).
Alon, U. Network motifs: theory and experimental approaches. Nat. Rev. Genet. 8, 450–461 (2007).
Hart, Y., Antebi, Y. E., Mayo, A. E., Friedman, N. & Alon, U. Design principles of cell circuits with paradoxical components. Proc. Natl Acad. Sci. USA 109, 8346–8351 (2012).
Gorochowski, T. E., Grierson, C. S. & Di Bernardo, M. Organization of feed-forward loop motifs reveals architectural principles in natural and engineered networks. Sci. Adv. 4, eaap9751 (2018).
Ingram, P. J., Stumpf, M. P. & Stark, J. Network motifs: structure does not determine function. BMC Genom. 7, 108 (2006).
Cardinale, S. & Arkin, A. P. Contextualizing context for synthetic biology - identifying causes of failure of synthetic biological systems. Biotechnol. J. 7, 856–866 (2012).
Del Vecchio, D., Ninfa, A. J. & Sontag, E. D. Modular cell biology: retroactivity and insulation. Mol. Syst. Biol. 4, 161 (2008).
Gyorgy, A. & Del Vecchio, D. Modular composition of gene transcription networks. PLoS Comput. Biol. 10, e1003486 (2014).
Gyorgy, A. et al. Isocost lines describe the cellular economy of genetic circuits. Biophys. J. 109, 639–646 (2015).
Ceroni, F., Algar, R., Stan, G.-B. & Ellis, T. Quantifying cellular capacity identifies gene expression designs with reduced burden. Nat. Methods 12, 415–418 (2015).
Gorochowski, T. E., Avcilar-Kucukgoze, I., Bovenberg, R. A. L., Roubos, J. A. & Ignatova, Z. A minimal model of ribosome allocation dynamics captures trade-offs in expression between endogenous and synthetic genes. ACS Synth. Biol. 5, 710–720 (2016).
Yeung, E. et al. Biophysical constraints arising from compositional context in synthetic gene networks. Cell Syst. 5, 11–24.e12 (2017).
Sechkar, K., Perrino, G. & Stan, G.-B. A coarse-grained bacterial cell model for resource-aware analysis and design of synthetic gene circuits. Preprint at bioRxiv https://doi.org/10.1101/2023.04.08.536106 (2023).
Di Blasi, R. et al. Resource-aware construct design in mammalian cells. Nat. Commun. 14, 3576 (2023).
Savageau, M. A. Genetic regulatory mechanisms and the ecological niche of Escherichia coli. Proc. Natl Acad. Sci. USA 71, 2453–2455 (1974).
Savageau, M. A. Design of molecular control mechanisms and the demand for gene expression. Proc. Natl Acad. Sci. USA 74, 5647–5651 (1977).
Savageau, M. A. Demand theory of gene regulation. I. Quantitative development of the theory. Genetics 149, 1665–1676 (1998).
Shinar, G., Dekel, E., Tlusty, T. & Alon, U. Rules for biological regulation based on error minimization. Proc. Natl Acad. Sci. USA 103, 3999–4004 (2006).
Gerland, U. & Hwa, T. Evolutionary selection between alternative modes of gene regulation. Proc. Natl Acad. Sci. USA 106, 8841–8846 (2009).
Gu, Z. et al. Role of duplicate genes in genetic robustness against null mutations. Nature 421, 63–66 (2003).
Stelling, J., Sauer, U., Szallasi, Z., Doyle, F. J. & Doyle, J. Robustness of cellular functions. Cell 118, 675–685 (2004).
Wagner, A. Robustness and Evolvability in Living Systems (Princeton University Press, 2007).
Plata, G. & Vitkup, D. Genetic robustness and functional evolution of gene duplicates. Nucleic Acids Res. 42, 2405–2414 (2014).
Lynch, M. & Marinov, G. K. The bioenergetic costs of a gene. Proc. Natl Acad. Sci. USA 112, 15690–15695 (2015).
Laub, M. T., McAdams, H. H., Feldblyum, T., Fraser, C. M. & Shapiro, L. Global analysis of the genetic network controlling a bacterial cell cycle. Science 290, 2144–2148 (2000).
Kalir, S. et al. Ordering genes in a flagella pathway by analysis of expression kinetics from living bacteria. Science 292, 2080–2083 (2001).
Ronen, M., Rosenberg, R., Shraiman, B. I. & Alon, U. Assigning numbers to the arrows: parameterizing a gene regulation network by using accurate expression kinetics. Proc. Natl Acad. Sci. USA 99, 10555–10560 (2002).
McAdams, H. H. & Shapiro, L. A bacterial cell-cycle regulatory network operating in time and space. Science 301, 1874–1877 (2003).
Zaslaver, A. et al. Just-in-time transcription program in metabolic pathways. Nat. Genet. 36, 486–491 (2004).
Franco, E., Giordano, G., Forsberg, P.-O. & Murray, R. M. Negative autoregulation matches production and demand in synthetic transcriptional networks. ACS Synth. Biol. 3, 589–599 (2014).
Darlington, A. P. S., Kim, J., Jiménez, J. I. & Bates, D. G. Engineering translational resource allocation controllers: mechanistic models, design guidelines, and potential biological implementations. ACS Synth. Biol. 7, 2485–2496 (2018).
Darlington, A. P. & Bates, D. G. Architectures for combined transcriptional and translational resource allocation controllers. Cell Syst. 11, 382–392.e9 (2020).
Ceroni, F. et al. Burden-driven feedback control of gene expression. Nat. Methods 15, 387–393 (2018).
Del Vecchio, D., Abdallah, H., Qian, Y. & Collins, J. J. A blueprint for a synthetic genetic feedback controller to reprogram cell fate. Cell Syst. 4, 109–120.e11 (2017).
Ng, A. H. et al. Publisher Correction: Modular and tunable biological feedback control using a de novo protein switch. Nature 579, E8–E8 (2020).
Hu, C. Y. & Murray, R. M. Layered feedback control overcomes performance trade-off in synthetic biomolecular networks. Nat. Commun. 13, 5393 (2022).
Gyorgy, A., Menezes, A. & Arcak, M. A blueprint for a synthetic genetic feedback optimizer. Nat. Commun. 14, 2554 (2023).
Darlington, A. P. S., Kim, J., Jiménez, J. I. & Bates, D. G. Dynamic allocation of orthogonal ribosomes facilitates uncoupling of co-expressed genes. Nat. Commun. 9, 695 (2018).
Mishra, D., Rivera, P. M., Lin, A., Del Vecchio, D. & Weiss, R. A load driver device for engineering modularity in biological networks. Nat. Biotechnol. 32, 1268–1275 (2014).
Liu, B., Cuba Samaniego, C., Bennett, M., Chappell, J. & Franco, E. RNA compensation: a positive feedback insulation strategy for RNA-based transcription networks. ACS Synth. Biol. 11, 1240–1250 (2022).
Anastassov, S., Filo, M., Chang, C.-H. & Khammash, M. A cybergenetic framework for engineering intein-mediated integral feedback control systems. Nat. Commun. 14, 1337 (2023).
Frei, T., Chang, C.-H., Filo, M., Arampatzis, A. & Khammash, M. A genetic mammalian proportional-integral feedback control circuit for robust and precise gene regulation. Proc. Natl Acad. Sci. USA 119, e2122132119 (2022).
Filo, M., Kumar, S. & Khammash, M. A hierarchy of biomolecular proportional-integral-derivative feedback controllers for robust perfect adaptation and dynamic performance. Nat. Commun. 13, 2119 (2022).
Frei, T. et al. Characterization and mitigation of gene expression burden in mammalian cells. Nat. Commun. 11, 4641 (2020).
Lillacci, G., Benenson, Y. & Khammash, M. Synthetic control systems for high performance gene expression in mammalian cells. Nucleic Acids Res. 46, 9855–9863 (2018).
Fisher, R. A.The Genetical Theory of Natural Selection (Clarendon Press, 1930).
Wright, S. Evolution in Mendelian populations. Genetics 16, 97–159 (1931).
Zhao, L., Yue, X. & Waxman, D. Complete numerical solution of the diffusion equation of random genetic drift. Genetics 194, 973–985 (2013).
Bratsun, D., Volfson, D., Tsimring, L. S. & Hasty, J. Delay-induced stochastic oscillations in gene regulation. Proc. Natl Acad. Sci. USA 102, 14593–14598 (2005).
Shopera, T., Henson, W. R. & Moon, T. S. Dynamics of sequestration-based gene regulatory cascades. Nucleic Acids Res. 45, 7515–7526 (2017).
Parise, M. T. D. et al. CoryneRegNet 7, the reference database and analysis platform for corynebacterial gene regulatory networks. Sci. Data 7, 142 (2020).
Dudek, C.-A. & Jahn, D. PRODORIC: state-of-the-art database of prokaryotic gene regulation. Nucleic Acids Res. 50, D295–D302 (2021).
Thieffry, D., Huerta, A. M., Pérez-Rueda, E. & Collado-Vides, J. From specific gene regulation to genomic networks: a global analysis of transcriptional regulation in Escherichia coli. BioEssays 20, 433–440 (1998).
Shen-Orr, S. S., Milo, R., Mangan, S. & Alon, U. Network motifs in the transcriptional regulation network of Escherichia coli. Nat. Genet. 31, 64–68 (2002).
Rosenfeld, N., Elowitz, M. B. & Alon, U. Negative autoregulation speeds the response times of transcription networks. J. Mol. Biol. 323, 785–793 (2002).
Camas, F. M., Blázquez, J. & Poyatos, J. F. Autogenous and nonautogenous control of response in a genetic network. Proc. Natl Acad. Sci. USA 103, 12718–12723 (2006).
Dublanche, Y., Michalodimitrakis, K., Kümmerer, N., Foglierini, M. & Serrano, L. Noise in transcription negative feedback loops: simulation and experimental analysis. Mol. Syst. Biol. 2, 41 (2006).
Becskei, A. & Serrano, L. Engineering stability in gene networks by autoregulation. Nature 405, 590–593 (2000).
Kramer, B. P. & Fussenegger, M. Hysteresis in a synthetic mammalian gene network. Proc. Natl Acad. Sci. USA 102, 9517–9522 (2005).
Arkin, A., Ross, J. & McAdams, H. H. Stochastic kinetic analysis of developmental pathway bifurcation in phage λ-infected Escherichia coli cells. Genetics 149, 1633–1648 (1998).
Martínez-Antonio, A., Janga, S. C. & Thieffry, D. Functional organisation of Escherichia coli transcriptional regulatory network. J. Mol. Biol. 381, 238–247 (2008).
Ma, H.-W., Buer, J. & Zeng, A.-P. Hierarchical structure and modules in the Escherichia coli transcriptional regulatory network revealed by a new top-down approach. BMC Bioinform. 5, 199 (2004).
Ma, H.-W. An extended transcriptional regulatory network of Escherichia coli and analysis of its hierarchical structure and network motifs. Nucleic Acids Res. 32, 6643–6649 (2004).
Cosentino Lagomarsino, M., Jona, P., Bassetti, B. & Isambert, H. Hierarchy and feedback in the evolution of the Escherichia coli transcription network. Proc. Natl Acad. Sci. USA 104, 5516–5520 (2007).
Fares, M. A. The origins of mutational robustness. Trends Genet. 31, 373–381 (2015).
Masel, J. & Siegal, M. L. Robustness: mechanisms and consequences. Trends Genet. 25, 395–403 (2009).
Masel, J. & Trotter, M. V. Robustness and evolvability. Trends Genet. 26, 406–414 (2010).
Denby, C. M., Im, J. H., Yu, R. C., Pesce, C. G. & Brem, R. B. Negative feedback confers mutational robustness in yeast transcription factor regulation. Proc. Natl Acad. Sci. USA 109, 3874–3878 (2012).
Ahmad, M. et al. Tradeoff between lag time and growth rate drives the plasmid acquisition cost. Nat. Commun. 14, 2343 (2023).
Ingram, D. & Stan, G.-B. Modelling genetic stability in engineered cell populations. Nat. Commun. 14, 3471 (2023).
Bennett, M. R. & Hasty, J. Microfluidic devices for measuring gene network dynamics in single cells. Nat. Rev. Genet. 10, 628–638 (2009).
Prindle, A. et al. A sensing array of radically coupled genetic ‘biopixels’. Nature 481, 39–44 (2012).
Rullan, M., Benzinger, D., Schmidt, G. W., Milias-Argeitis, A. & Khammash, M. An optogenetic platform for real-time, single-cell interrogation of stochastic transcriptional regulation. Mol. Cell 70, 745–756.e6 (2018).
Wong, B. G., Mancuso, C. P., Kiriakov, S., Bashor, C. J. & Khalil, A. S. Precise, automated control of conditions for high-throughput growth of yeast and bacteria with eVOLVER. Nat. Biotechnol. 36, 614–623 (2018).
Steel, H., Habgood, R., Kelly, C. L. & Papachristodoulou, A. In situ characterisation and manipulation of biological systems with Chi.Bio. PLOS Biol. 18, e3000794 (2020).
Sleight, S. C., Bartley, B. A., Lieviant, J. A. & Sauro, H. M. Designing and engineering evolutionary robust genetic circuits. J. Biol. Eng. 4, 12 (2010).
Renda, B. A., Hammerling, M. J. & Barrick, J. E. Engineering reduced evolutionary potential for synthetic biology. Mol. BioSyst. 10, 1668–1678 (2014).
Weber, E., Engler, C., Gruetzner, R., Werner, S. & Marillonnet, S. A modular cloning system for standardized assembly of multigene constructs. PLoS ONE 6, e16765 (2011).
Meyer, A. J., Segall-Shapiro, T. H., Glassey, E., Zhang, J. & Voigt, C. A. Escherichia coli “Marionette” strains with 12 highly optimized small-molecule sensors. Nat. Chem. Biol. 15, 196–204 (2019).
Moore, S. J. et al. A Multifunctional MoClo Kit for E. coli Synthetic Biology. ACS Synth. Biol. 9, 1225–1225 (2020).
Joshi, S. H.-N., Yong, C. & Gyorgy, A. Inducible plasmid copy number control for synthetic biology in commonly used E. coli strains. Nat. Commun. 13, 6691 (2022).
Elena, S. F. & Lenski, R. E. Evolution experiments with microorganisms: the dynamics and genetic bases of adaptation. Nat. Rev. Genet. 4, 457–469 (2003).
Ibarra, R. U., Edwards, J. S. & Palsson, B. O. Escherichia coli K-12 undergoes adaptive evolution to achieve in silico predicted optimal growth. Nature 420, 186–189 (2002).
Shibai, A. et al. Mutation accumulation under UV radiation in Escherichia coli. Sci. Rep. 7, 14531 (2017).
Halperin, S. O. et al. CRISPR-guided DNA polymerases enable diversification of all nucleotides in a tunable window. Nature 560, 248–252 (2018).
Baym, M. et al. Spatiotemporal microbial evolution on antibiotic landscapes. Science 353, 1147–1151 (2016).
Kohanski, M. A., DePristo, M. A. & Collins, J. J. Sublethal antibiotic treatment leads to multidrug resistance via radical-induced mutagenesis. Mol. Cell 37, 311–320 (2010).
Andersson, D. I. & Hughes, D. Microbiological effects of sublethal levels of antibiotics. Nat. Rev. Microbiol. 12, 465–478 (2014).
Matange, N., Hegde, S. & Bodkhe, S. Adaptation through lifestyle switching sculpts the fitness landscape of evolving populations: implications for the selection of drug-resistant bacteria at low drug pressures. Genetics 211, 1029–1044 (2019).
Burgess-Brown, N. A. et al. Codon optimization can improve expression of human genes in Escherichia coli: a multi-gene study. Protein Expr. Purif. 59, 94–102 (2008).
Gilman, J., Walls, L., Bandiera, L. & Menolascina, F. Statistical design of experiments for synthetic biology. ACS Synth. Biol. 10, 1–18 (2021).
Castle, S. D., Grierson, C. S. & Gorochowski, T. E. Towards an engineering theory of evolution. Nat. Commun. 12, 3326 (2021).
Swings, T. et al. Adaptive tuning of mutation rates allows fast response to lethal stress in Escherichia coli. eLife 6, e22939 (2017).
Drake, J. W. A constant rate of spontaneous mutation in DNA-based microbes. Proc. Natl Acad. Sci. USA 88, 7160–7164 (1991).
Ochman, H., Elwyn, S. & Moran, N. A. Calibrating bacterial evolution. Proc. Natl Acad. Sci. USA 96, 12638–12643 (1999).
Lee, H., Popodi, E., Tang, H. & Foster, P. L. Rate and molecular spectrum of spontaneous mutations in the bacterium Escherichia coli as determined by whole-genome sequencing. Proc. Natl Acad. Sci. USA 109, E2774–E2783 (2012).
Perfeito, L., Fernandes, L., Mota, C. & Gordo, I. Adaptive mutations in bacteria: high rate and small effects. Science 317, 813–815 (2007).
Park, C., Qian, W. & Zhang, J. Genomic evidence for elevated mutation rates in highly expressed genes. EMBO Rep. 13, 1123–1129 (2012).
Imhof, M. & Schlötterer, C. Fitness effects of advantageous mutations in evolving Escherichia coli populations. Proc. Natl Acad. Sci. USA 98, 1113–1117 (2001).
Schluter, D. Estimating the form of natural selection on a quantitative trait. Evolution 42, 849 (1988).
Orr, H. A. The genetic theory of adaptation: a brief history. Nat. Rev. Genet. 6, 119–127 (2005).
Shaw, R. G. & Geyer, C. J. Inferring fitness landscapes. Evolution 64, 2510–2520 (2010).
Lambert, G. & Kussell, E. Quantifying selective pressures driving bacterial evolution using lineage analysis. Phys. Rev. X 5, 011016 (2015).
Hartl, D. L., Moriyama, E. N. & Sawyer, S. A. Selection intensity for codon bias. Genetics 138, 227–234 (1994).
Fang, X. et al. Global transcriptional regulatory network for Escherichia coli robustly connects gene expression to transcription factor activities. Proc. Natl Acad. Sci. USA 114, 10286–10291 (2017).
Acknowledgements
This work was supported by research funds from New York University Abu Dhabi.
Author information
Authors and Affiliations
Contributions
A.G. conceived and designed the research, collected and analyzed the data, prepared the figures, and wrote the manuscript.
Corresponding author
Ethics declarations
Competing interests
The author declares no competing interests.
Peer review
Peer review information
Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Gyorgy, A. Competition and evolutionary selection among core regulatory motifs in gene expression control. Nat Commun 14, 8266 (2023). https://doi.org/10.1038/s41467-023-43327-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-023-43327-7
- Springer Nature Limited