Competition and evolutionary selection among core regulatory motifs in gene expression control

Gyorgy, Andras

doi:10.1038/s41467-023-43327-7

Competition and evolutionary selection among core regulatory motifs in gene expression control

Article
Open access
Published: 13 December 2023

Volume 14, article number 8266, (2023)
Cite this article

Download PDF

You have full access to this open access article

From

View current issue

Competition and evolutionary selection among core regulatory motifs in gene expression control

Download PDF

Andras Gyorgy ORCID: orcid.org/0000-0001-8103-260X¹

3247 Accesses
14 Altmetric
Explore all metrics

Abstract

Gene products that are beneficial in one environment may become burdensome in another, prompting the emergence of diverse regulatory schemes that carry their own bioenergetic cost. By ensuring that regulators are only expressed when needed, we demonstrate that autoregulation generally offers an advantage in an environment combining mutation and time-varying selection. Whether positive or negative feedback emerges as dominant depends primarily on the demand for the target gene product, typically to ensure that the detrimental impact of inevitable mutations is minimized. While self-repression of the regulator curbs the spread of these loss-of-function mutations, self-activation instead facilitates their propagation. By analyzing the transcription network of multiple model organisms, we reveal that reduced bioenergetic cost may contribute to the preferential selection of autoregulation among transcription factors. Our results not only uncover how seemingly equivalent regulatory motifs have fundamentally different impact on population structure, growth dynamics, and evolutionary outcomes, but they can also be leveraged to promote the design of evolutionarily robust synthetic gene circuits.

Robustness and Evolvability in Transcriptional Regulation

Evolutionary potential of transcription factors for gene regulatory rewiring

Article 10 September 2018

Evolution of new regulatory functions on biophysically realistic fitness landscapes

Article Open access 09 August 2017

Introduction

Much effort has been devoted to uncovering organizing principles of living cells in systems biology, and also to devising design guidelines to ensure the predictable behavior of cellular dynamics in synthetic biology^1,2,3. As a result, we not only better understand the processes underpinning bacterial chemotaxis perfected by evolution⁴, but also how to implement integral feedback to ensure robust perfect adaptation^5,6,7,8. These results are generally made possible by interpreting complex dynamical systems as a collection of core components wired together, each realizing a well-defined and highly-optimized information processing function^9,10,11. While this view offers a powerful reductionist approach to design and analyze networks of daunting complexity, recent results also highlight its limits as recurring motifs can exhibit a wide range of dynamical responses depending on their biophysical parameters and context^{12,13,14,15,16,17,18,19,20,21}.

Among common network motifs, activation and repression are the most fundamental building blocks. They are functionally equivalent (Fig. 1a), as the expression of a gene product can be regulated by relying on an inducer that either activates an activator (positive control), or relieves the repression of a repressor (negative control). Savageau proposed that according to the use-it-or-lose-it principle, positive/negative control emerges when gene products are often/rarely needed^22,23,24, ensuring that cognate binding sites are occupied by the transcription factors (TFs) most of the time, thus minimizing the probability of fitness-reducing errors²⁵. Conversely, as a mutated regulator represents a fitness cost only when it is needed, the wear-and-tear principle suggests that it may be evolutionary advantageous to instead minimize the usage of regulators to reduce the negative impact of eventually inevitable mutations²⁶, motivated by the well-established population genetics concept of genetic robustness^27,28,29,30.

**Fig. 1: Functionally equivalent core regulatory motifs in gene expression control.**

While precise temporal control of a beneficial gene product may result in an advantage, the expression of the required regulator carries its own bioenergetic cost³¹. Crucially, for both positive and negative control, the regulator is only required when the inducer is present, otherwise its expression is gratuitous. Therefore, it may be advantageous to have the regulator under autoregulatory control to ensure it is expressed only when needed (Fig. 1b). Understanding the competition and evolutionary selection among the core regulatory motifs in Fig. 1 could shed light on organizing principles of living organisms as similar just-in-time regulation is a wide-spread feature in natural systems^{32,33,34,35,36}, as well as guide the design of synthetic gene circuits when selecting among alternative modes of regulation^{37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52}.

Motivated by the central role of autoregulation in systems and synthetic biology, we characterize how demand for a beneficial gene product, mutation rate of its regulator, population size, selection pressure, regulatory delay, and the timescale of environmental shifts together determine the optimal choice among the motifs in Fig. 1. We show that (i) autoregulation generally dominates, (ii) the dominant strategy typically agrees with the wear-and-tear principle, and (iii) while self-repression of the regulator curbs the spread of loss-of-function mutations, self-activation instead facilitates their propagation. We further demonstrate that the reduced bioenergetic cost of autoregulation may contribute to its ubiquitous nature in gene regulatory networks, and how our work could aid the design of evolutionarily robust synthetic gene circuits.

Results

Mathematical model

Building on a quantitative framework²⁶ inspired by demand theory^22,23,24, the mathematical model underpinning our analysis comprises the changing environment, random mutations, and fitness-based selection. Typical values of the model parameters are discussed in the “Methods”.

We consider an evolutionary scenario where cells are exposed to environmental variations. This is modeled via the concentration of the inducer I that varies periodically between high and low values (Fig. 2a). Within each period T, these correspond to the induced and non-induced phases lasting T_i and T_ni, respectively. During the former, expression of a gene product P confers a fitness advantage, whereas during the latter its unnecessary synthesis a fitness cost. The fraction D = T_i/T hence measures the demand for the beneficial gene product.

**Fig. 2: The evolutionary setting combines environmental shifts, mutations, and fitness-based selection.**

In this evolutionary setting, loss-of-function mutations that affect the regulator R result in non-functional variants. The emergence of these non-binders occurs at rate ν₋ from a functional binder, whereas gain-of-function mutations happen at rate ν₊ (Fig. 2b). We assume that these mutation rates are constant and independent of the mode of gene regulation. We consider the population size N to remain constant over time, and denote the size of the binder and non-binder sub-populations with N_b(t) and N_nb(t), respectively.

Finally, each period alternates between selection and neutral phases lasting T_s and T_n, respectively (Fig. 2c). During the former, non-binders suffer the cost s_p > 0 due to either the presence of P during T_ni for negative control, or its absence during T_i for positive control. Additionally, cells incur the expense s_r > 0 when the regulator R is synthesized. Selection pressure hence stems from two sources: expression of P not matching the environmental condition (P-cost), and expression of R (R-cost). For non-autoregulated control, binders and non-binders suffer identical R-cost throughout the entirety of each period, thus selection against non-binders occurs solely based on their non-zero P-cost. Autoregulation of R reduces the time when the R-cost is suffered (Fig. 2c), thus holding the potential to provide an evolutionary advantage over non-autoregulated control (Supplementary Figs. 1 and 2). Therefore, we next quantify the average fitness cost of each core regulatory motif in Fig. 1 to compare their performance.

Fitness cost in large populations

We first quantify the performance of the control schemes in large populations, when sampling fluctuations are negligible. The evolution of the fraction x = N_nb/N of non-binders in the population is governed by the deterministic dynamics (Supplementary Section 2)

$$\dot{x}={\nu }_{-}-x\left({\nu }_{+}+{\nu }_{-}+s\right)+{x}^{2}s,$$

(1)

where s(t) = s_nb(t) − s_b(t) is the relative selection pressure against non-binders, with s_b(t) and s_nb(t) denoting the fitness cost encountered by the binders and non-binders, respectively (Fig. 2c). Thus, over each period the average fitness cost $\bar{s}$ is

$$\bar{s}=\frac{1}{T}\int\nolimits_{0}^{T}\left[x\left(t\right){s}_{nb}\left(t\right)+\left(1-x\left(t\right)\right) \, {s}_{b} \, \left(t\right)\right]\,{{\mbox{d}}}\,t.$$

(2)

Considering only non-autoregulated control, activation/repression dominates at low/high demand for weak selection (Fig. 3a). For intermediate values of the demand (Supplementary Figs. 4–6), the two control schemes offer comparable performance for short periods and when control is expensive (s_r ≈ s_p). Selection pressure further amplifies this effect (Supplementary Figs. 7–9), and we recover the results presented in ref. ²⁶: when the fraction of non-binders does not appreciably change as a result of frequent environmental shifts (Supplementary Section 1), positive and negative control perform similarly, otherwise activation/repression dominates for low/high demand, matching the wear-and-tear principle.

**Fig. 3: Evolutionary advantageous regulatory motifs in large populations.**

When considering all motifs in Fig. 1, non-autoregulated control is generally replaced by its autoregulated counterpart as the dominant strategy (Fig. 3b). However, two major differences do emerge. First, for low demand, while non-autoregulated activation and repression often have comparable performance (especially for strong selection, Fig. 3a), the parameter region where self-activation emerges as a clear winner expands significantly due to the elimination of the R-cost for non-binders (Fig. 2c). Second, when demand is high, control is expensive (s_r ≈ s_p), and the combined impact of mutations and selection is weak (Supplementary Section 1), self-activation emerges as dominant (red star in Fig. 3b). To understand this, consider the case when s_r = s_p = s₀, yielding the fitness cost Ds₀ for self-activation (Fig. 2) and Ds₀ + 2x₀(1 − D) > Ds₀ for self-repression where x₀ is the approximately constant value of x throughout the period (Supplementary Section 2). Thus, while the winner emerges according to the wear-and-tear principle when only non-autoregulated control is considered (Fig. 3a), the dominant strategy may be aligned with the use-it-or-lose-it principle in the presence of autoregulation (Fig. 3b). The region where this occurs shrinks with increasing demand (Supplementary Figs. 10–15).

Fitness cost in small populations

The impact of sampling fluctuations (genetic drift) becomes more pronounced as the population size decreases, thus we next characterize performance in the presence of stochastic effects. To this end, we first consider the standard Wright-Fisher model with constant population size N^53,54, then its diffusion approximation⁵⁵ as it significantly accelerates the computation of the fitness cost without compromising accuracy (Supplementary Fig. 16).

Focusing on non-autoregulated control first, it was previously reported that while the wear-and-tear principle dominates in large populations, it is replaced by the use-it-or-lose-it principle as N decreases²⁶. Although this can happen, it only occurs when selection pressure is strong (s_p ≫ ν₋), and there is a reversal as the population size further decreases (Fig. 4a), which is not discussed in ref. ²⁶. The source of these two transitions is the varying frequency of non-binders occasionally taking over during neutral periods due to sampling fluctuations (unlike in large populations). The prevalence of such events is inversely proportional to the population size, the duration of the selection phase, and the selection pressure, and they carry a significant penalty as binders only slowly re-emerge as a result of rare gain-of-function mutations.

**Fig. 4: Evolutionary advantageous regulatory motifs in small populations.**

To better understand this phenomenon, consider the low demand case (the high demand case can be analyzed similarly, with positive and negative regulation swapped). As population size starts to decrease, non-binders take over more frequently for activation than for repression due to the shorter selection period (Supplementary Fig. 18), giving rise to the region where the dominant strategy is consistent with the use-it-or-lose-it principle (blue star in Fig. 4a). A similar shift occurs for negative control as well, only at lower population size due to the longer selection phase, and the corresponding substantial fitness cost increase is what drives the reversal to positive control re-emerging as the dominant strategy, this time at the small population limit (Fig. 4a). These transitions happen only when selection pressure is sufficiently strong to ensure that non-binders are eliminated in large populations.

When considering all core motifs featured in Fig. 1, the picture that emerges in Fig. 4b is qualitatively similar when environmental shifts happen frequently (T ≪ 1/ν₋) and when they occur rarely (T ≫ 1/ν₋). For low demand, the trend echoes our previous findings in Fig. 4a when considering only non-autoregulated control. The region where the dominant strategy is underpinned by the use-it-or-lose-it principle expands with selection pressure, and it is sandwiched between population sizes where it is replaced by the wear-and-tear principle from both below (Fig. 4b) and above (Fig. 3b), only this time self-activation and self-repression dominate instead of their non-autoregulated counterparts due to their reduced R-cost (Fig. 2c). While in the high demand case the situation is similar (with self-activation and self-repression swapped), one crucial difference does emerge: in the small population limit non-autoregulated and autoregulated repression have comparable performance. To understand this, note that unlike for activation where autoregulation decreases the fitness cost by s_r throughout the entire period, for repression there is no reduction in case of non-binders (Fig. 2c), yielding comparable performance for non-autoregulated and autoregulated control. This effect becomes more pronounced with decreasing selection pressure (Fig. 4b), which increases the probability of non-binders taking over the population due to stochastic fluctuations (Supplementary Figs. 19 and 20).

In summary, the presence of sampling fluctuations largely preserves the wear-and-tear principle behind the dominant strategy considering both non-autoregulated and autoregulated control. The use-it-or-lose-it principle emerges only in a narrow slice of the parameter space (which further decreases with selection pressure), for instance, within a confined range of the population size. Thus, in addition to providing a more complete picture about the competition between the two non-autoregulated motifs by revealing the reversal to the wear-and-tear principle in the small population limit, our results also significantly expand prior work²⁶ by comparing the performance of all four core regulatory motifs featured in Fig. 1 in the presence of genetic drift.

Autoregulation can result in unwanted selection pressure

By eliminating the gratuitous expression of the regulator during the non-induced phase (Fig. 2c), autoregulation generally outperforms its non-autoregulated counterpart for both activation and repression (Figs. 3 and 4). For positive and negative control, however, autoregulation has drastically different impact on the population composition, as well as on how rapidly it changes.

To illustrate the differential impact of autoregulation on the fraction of non-binders x, consider first non-autoregulated control. Increasing s_r represents an additional and identical fitness cost for both binders and non-binders (Fig. 2c), hence the population-level composition remains unaffected, since the evolution of x according to (1) depends on the difference s = s_nb − s_b. Conversely, while autoregulation yields s = 0 during the neutral phase, it results in s = s_w with s_w = s_p − s_r < s_p and s_w = s_p + s_r > s_p during selection for self-activation and self-repression, respectively (Fig. 5), compared to s_w = s_p for non-autoregulated control. This change in selection pressure thus means (i) more stringent elimination of the non-binders when relying on self-repression, and (ii) the accumulation of loss-of-function mutations in case of self-activation (Fig. 5). The latter is especially concerning considering the fitness gain of self-activation relative to its non-autoregulated counterpart: should these two variants compete, the former would eventually take over the population, however, it could easily result in one dominated by non-binders (unlike in the case of self-repression where the R-cost instead promotes the elimination of deleterious mutations).

**Fig. 5: Autoregulation can result in unwanted selection pressure.**

Feedback also has a differential impact on how rapidly non-binders emerge and disappear in case of positive and negative autoregulation. To illustrate this, we define ${T}^{* }=\max ({T}_{b},{T}_{w})$ where T_b = 1/ν₋ and T_w = 1/(s_w + ν₊) are the timescales for the build-up and wipe-out of non-binders due to mutations and selection. For short periods (T ≪ T *), the fraction of non-binders remains approximately constant throughout the entire period: non-binders are eliminated if T_w < T_b, otherwise they take over the population (Supplementary Fig. 24). For long periods (T ≫ T *), if T_w < T_b then build-up of non-binders during T_n is wiped out during T_s, otherwise non-binders persist throughout the entire period as loss-of-function mutations dominate the combined impact of selection and gain-of-function mutations during T_s (Supplementary Fig. 24). Considering the typical range of model parameters (“Methods”), for all four regulatory schemes featured in Fig. 1 we have T * = T_b (Supplementary Section 4). Importantly, while T_w decreases with s_r in case of self-repression, autoregulation has the opposite impact for positive control, further hindering the elimination of non-binders from the population.

Delay can cause non-autoregulated motifs to outperform autoregulation

For autoregulated motifs, the inducer also triggers the appearance and disappearance of the regulator. Since its synthesis and decay may take time, considerable delays could be introduced⁵⁶, resulting in an increased fitness cost when compared to the idealized scenario outlined in Fig. 2c, especially when feedback is realized in the form of regulatory cascades⁵⁷. The negative impact of delay on the performance of autoregulation is illustrated in Fig. 6.

**Fig. 6: Evolutionary advantageous motifs in the presence of delay.**

For short periods (T ≪ T *), both selection and mutation play a negligible role throughout each period, thus the fraction of non-binders x remains approximately constant. Hence, in this case delay has no impact on the performance of autoregulated motifs: the dominant strategy in Fig. 6 remains unchanged when compared to the case without delay (Fig. 3b).

For long periods (T ≫ T *), the combined impact of mutation and selection can impact x, which may result in non-autoregulated control outperforming autoregulation. For instance, in case of low demand, the fitness cost of non-autoregulated and autoregulated activation is approximately s_r and Ds_p, respectively (Supplementary Fig. 24). Therefore, while self-activation offers superior performance when regulation is expensive (s_r ≈ s_p), non-autoregulated activation can dominate if control is instead affordable (s_r ≪ s_p). These results hold for both weak and strong selection (Fig. 6), and also in the presence of stochastic fluctuations due to small population size (Supplementary Fig. 25). The high demand case can be analyzed similarly.

In the special case when T ≈ T *, autoregulation consistent with the use-it-or-lose-it principle emerges as dominant when selection pressure is sufficiently strong (stars in Fig. 6). To understand why this happens, here we focus on the low demand case when regulation is affordable (s_r ≪ s_p), other scenarios can be analyzed similarly. For self-repression, non-binders are eliminated (x ≈ 0) during the entire period (Supplementary Fig. 26) as a result of strong and long selection (due to low demand), yielding the average fitness cost $\bar{s} \, \approx \, D{s}_{r}$ (Fig. 2c). Conversely, for self-activation there is alternating build-up and wipe-out during T_n and T_s due to the shorter selection phase, resulting in $\bar{s} \, \approx \, {\bar{x}}_{s}D{s}_{p}$ where ${\bar{x}}_{s}$ denotes the average of x during T_s (Supplementary Fig. 26). Crucially, ${\bar{x}}_{s}$ increases with the delay, which is the driving force behind self-repression offering superior performance compared to self-activation, and eventually emerging as the dominant strategy for sufficiently strong selection (blue star in Fig. 6). Delay thus has the greatest impact in the special case when T ≈ T * by triggering the emergence of regions where the dominant strategy is underpinned by the use-it-or-lose-it principle (marked by stars in Fig. 6). These regions expand with increasing selection pressure s_p, and they can appear even in the presence of brief delays (Supplementary Fig. 31).

Feedback cost can cause non-autoregulated motifs to outperform autoregulation

In addition to introducing delay, autoregulation can also result in additional burden due to the increased expression of the regulator R that may be required to control its own expression^14,15. To capture this, we next assume that while the R-cost for non-autoregulated control remains s_r, for autoregulation it instead increases to as_r with a ≥ 1 (Fig. 7a).

**Fig. 7: Feedback cost has a differential impact on positive and negative control.**

To understand how this change impacts the dominant strategy, we first focus on the case when mutations have minimal impact, so that the population consists entirely of binders. As a result, the P-cost is zero for all four schemes. Furthermore, positive and negative control have identical R-cost, both when considering non-autoregulated and autoregulated motifs (Fig. 2c). While autoregulation reduces the time when the R-cost is suffered as demand decreases, non-autoregulated strategies outperform their autoregulated counterparts once feedback becomes too expensive (Fig. 7a). These results hold even when mutations are considered. In particular, data in Supplementary Figs. 32–34 confirm that (i) the pattern of how the use-it-or-lose-it and the wear-and-tear principles emerge behind the dominant strategies remains largely unaffected, (ii) non-autoregulated strategies can outperform their autoregulated counterparts as a increases, and (iii) this transition happens at lower values of a as the demand increases.

As the R-cost for autoregulation increases with the duration of the induced phase T_i = DT (Fig. 2c), we expect that the sensitivity of the dominant autoregulated strategy to changes in a also increases with the demand D, confirmed in Fig. 7b. In particular, while the fitness cost of self-activation increases only slightly when compared to its non-autoregulated counterpart in the low demand case, performance of self-repression in the high demand limit quickly degrades. Furthermore, the results in Fig. 7b echo our finding in Fig. 5 about increasing the R-cost (this time via a instead of s_r), giving rise to a differential impact on population composition: more stringent selection against non-binders when relying on self-repression, and the spread of loss-of-function mutations in case of self-activation. This also reveals a trade-off between the fraction of non-binders x and the margin in a for preserving the dominant strategy. For instance, when it is underpinned by the wear-and-tear principle, in the low demand case a can be increased substantially and self-activation would still emerge as dominant (e.g., it remains only about 15% as expensive as non-autoregulated positive control when a = 2 in Fig. 7b), although at the price of increased prevalence of non-binders. Conversely, in the high demand case negative autoregulation promotes the elimination of deleterious mutations, however, the margin in a is considerably smaller to preserve the dominance of self-repression (e.g., non-autoregulated negative control dominates its autoregulated counterpart once a > 1.07 in Fig. 7b).

Reduced bioenergetic cost may contribute to the prevalence of autoregulation in model organisms

We next turn our attention to the transcriptional regulatory network of B. subtilis, C. glutamicum, and E. coli, model organisms with the most comprehensive data availability^58,59 (Supplementary Fig. 35). Our results suggest that self-activation and self-repression should be prominently featured among regulators, hence in what follows we concentrate on whether the reduced bioenergetic cost of autoregulated control schemes could indeed confer an evolutionary advantage, and thus contribute to their high prevalence^60,61.

To this end, we first note that autoregulation is preferentially selected by evolution when choosing regulators. While the average prevalence of autoregulation is only 27% among TFs (each TF is only counted once), it is instead 35% among all regulators (each TF is counted as many times as it appears as an activator/repressor), with the overrepresentation ranging between 2–16 percentage points (Fig. 8a). This is not surprising, as autoregulated TFs offer a multitude of beneficial properties^{62,63,64,65,66,67}. Importantly, if R-cost reduction was a negligible factor, we would expect identical prevalence among all regulators and among the subset that control gene targets synergistically (Fig. 8b), otherwise the frequency in the latter should be greater (Supplementary Section 7). Thus, for each organism our two samples of interest comprise regulators of genes that are either positively or negatively controlled, focusing on 86%, 50%, and 40% of target genes in B. subtilis, C. glutamicum, and E. coli, respectively (Supplementary Table 2).

**Fig. 8: Self-activated/self-repressed TFs are overrepresented among regulators of genes that are synergistically activated/repressed.**

Using the frequency of autoregulation among TFs as a baseline, the overrepresentation of this motif is greater among synergistic regulators (Fig. 8c) than among all regulators (Fig. 8a): the mean difference is 25 percentage points, with the overrepresentation ranging between 18–42 percentage points (Supplementary Table 2). Therefore, beneficial properties of autoregulation yield their preferential selection for regulators (Fig. 8a), but crucially, the frequency of autoregulation among synergistic regulators further exceeds this modified and elevated baseline by 17 percentage points on average (Fig. 8d), with the overrepresentation ranging between 6–28 percentage points (Supplementary Table 2). To quantify the significance of the differences, we performed standard two-sample location tests across all datasets to compute the probability of the modified baseline (autoregulation among all regulators) and the samples of interest (autoregulation among synergistic regulators) coming from the same distribution (leveraging not only the frequencies but also the sample sizes). The resulting p-values are all smaller than 0.001, suggesting that the differences are statistically significant across all three organisms for both self-activation and self-repression, and that the underlying distributions are likely different.

In summary, autoregulated motifs are overrepresented among synergistic regulators compared to their frequency among all regulators. This suggests that autoregulation likely offers additional beneficial properties in the former case, leading to their even stronger preferential selection. Our results highlight that reduced R-cost offers an appealing explanation as one of the factors that contribute to establishing the prominent role of autoregulation.

Discussion

Given the prevalence of autoregulation in transcription networks^60,61, it is not surprising that this network motif offers numerous advantages. For instance, self-repression can accelerate temporal responses^62,63 and reduce stochastic fluctuations^64,65, whereas self-activation underpins cellular memory⁶⁶ and helps cell populations to maintain a mixed phenotype to assure optimal performance in stochastic environments⁶⁷. As autoregulation ensures that a TF is only expressed when needed, our results highlight that the abundance of this motif may also stem from its reduced bioenergetic and fitness cost relative to non-autoregulated control.

We confirm this both in small and large populations by demonstrating that non-autoregulated regulation practically never dominates its autoregulated counterpart except when the additional cost of feedback is substantial or in the presence of regulatory delays. The dominant strategy is generally in accordance with the wear-and-tear principle, the use-it-or-lose it principle emerges only in a narrow region of population size and timescale of environmental shifts. Our work thus further highlights that demand for a beneficial gene product and whether positive or negative regulation offers superior performance are tightly coupled. These results considerably expand our prior understanding focusing only on non-autoregulated control²⁶, and may explain why self-activation and self-repression show strong clustering and preferential localization across functional subsystems (Supplementary Fig. 38) with different temporal demand profiles⁶⁸.

As the performance of autoregulation degrades with delay, it is understandable that the transcription network of E. coli has an essentially feedforward structure, where feedback occurs primarily in the form of autoregulation^69,70. Further strengthening the connection between evolutionary selection and autoregulation, the latter most likely emerged as a result of gene duplication⁷¹, a core factor in the origin of mutational robustness⁷² resulting in the accumulation of phenotypically cryptic genetic variation⁷³ promoting evolvability⁷⁴. This link between autoregulation and mutational robustness was confirmed by recent experimental advances, suggesting that regulatory feedback may be an important element of the network architectures that confer mutational robustness across biology⁷⁵.

The results presented in this paper focus on the impact of autoregulation by putting the spotlight on fitness advantage, as this factor plays a pivotal role in establishing the dominant strategies that emerge as a result of competition and selection. In synthetic biology applications, however, while the fitness cost can be crucial in certain scenarios, e.g., when studying the cost of plasmid acquisition and maintenance⁷⁶, often the rate at which non-binders emerge is instead of primary concern to avoid compromising the genetic stability of engineered synthetic circuits⁷⁷. Importantly, our approach can be applied to characterize how autoregulation impacts the proliferation of deleterious mutations considering timescales and population sizes typical in synthetic biology applications (Supplementary Section 8) ranging from microfluidics experiments to bioreactor-based contexts^{78,79,80,81,82}. Crucially, our work reveals that while self-activation facilitates the spread of non-binders, self-repression acts against this phenomenon, and these effects are amplified as the cost of regulation increases. Thus, our results not only uncover how genetic design choices alter population structure, growth dynamics, and evolutionary outcomes, but they can also be leveraged to minimize the prevalence of cells that harbor non-functional genetic modules to design evolutionarily robust synthetic gene circuits^83,84.

Our model-based analysis raises several experimentally testable hypotheses. Does the wear-and-tear principle emerge in small populations and at the critical timescale of environmental shifts in case of sufficiently strong selection? Is it true that while negative autoregulation curbs the emergence of detrimental mutations, self-activation instead facilitates their spread? To test these predictions, all regulatory schemes featured here can be constructed using existing synthetic biology toolkits and parts^85,86,87,88, and key variables can be conveniently tuned to reveal their role in establishing the dominant control scheme. Demand and period length can be varied by creating defined environments to control when the gene product is needed^89,90 using automated cell culture systems^81,82. Mutation rates can be adjusted by using UV radiation or CRISPR-guided mutagenesis^91,92. The P-cost can be tuned by modulating selection pressure, for instance, via the composition of the growth media^93,94,95,96, whereas the R-cost can be altered through codon (de)optimization⁹⁷. Finally, delay can be modulated leveraging regulatory cascades⁵⁷. By ensuring that experiments with maximal information content are selected⁹⁸, it is possible to efficiently test whether the predicted regulatory mechanisms actually emerge as dominant strategies in each environment, thus to promote the design of biosystems that operate robustly under inevitable evolutionary forces⁹⁹.

Biology has evolved powerful and creative solutions to control gene expression by selecting the optimal variant(s) among a wide array of competing control mechanisms. Our results reveal how the interplay of biophysical parameters and environmental factors together shape the emergence of dominant regulatory strategies. This can be leveraged both to shed light on evolutionary organizing principles underpinning the transcription networks of living organisms, and also to guide the design of synthetic gene circuits, for instance, when selecting among alternative modes of regulation to implement biomolecular controllers and insulation devices^{37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52} to facilitate the modular design of complex synthetic gene circuits.

Methods

Parameters

Wild-type E. coli grown under optimal conditions typically has mutation rates on the order of 10⁻³ mutations per genome per generation¹⁰⁰. Considering the typical genome size of bacteria, this corresponds to approximately 10⁻⁹–10⁻⁸ mutations per base per generation^101,102,103, though this rate may depend on population size¹⁰⁴ and expression levels¹⁰⁵. Furthermore, hypermutators with up to 10⁴-fold greater mutation rates can occur under laboratory conditions, and more frequently in natural bacterial populations¹⁰⁰. Assuming roughly 100 sensitive nucleotide positions, we thus estimate the rate of loss-of-function mutations to span the range ν₋ ≈ 10⁻⁷–10⁻³ per generation. Since gain-of-function mutations are assumed to be less probable, we consider ν₊ = ν₋/10 throughout the paper matching experimental estimates¹⁰⁶. Selection intensity is notoriously hard to measure^{107,108,109,110}, however, based on estimates for codon bias¹¹¹, we consider s_p/ν₋ = 10 and s_p/ν₋ = 100 in case of weak and strong selection, respectively. Finally, we assume that s_r < s_p for a typical target gene, otherwise there would be no evolutionary selection pressure to regulate the expression of the product. Throughout the paper, the period T is measured in number of generations, whereas the mutation rates ν₋ and ν₊ as well as the selection intensities s_p and s_r are all given in 1/generation.

Stochastic simulation of the evolutionary dynamics

The standard Wright-Fisher model assumes the following: discrete and non-overlapping generations with a constant population size N, with each member replaced in every generation^53,54. Introduce s = s_nb − s_b where s_nb and s_b are the fitness cost incurred by the non-binders and binders, respectively, and let n_t denote number of non-binders in the population in generation t (i.e., n_t = 0, 1, 2, …, N). We first generate the number of gain-of-function and loss-of-function mutations m₊ and m₋ from Poisson distributions with means n_tν₊ and (N − n_t)ν₋, respectively, so that the fraction of non-binders in the population becomes x = (n_t + m₋ − m₊)/N. The number of non-binders n_t+1 in the next generation is drawn from a Binomial distribution with success probability ${x}^{{\prime} }=x-sx(1-x)/(1-sx)$ to account for the fitness difference s (thus the number of offsprings produced) between non-binders and binders²⁶.

Periodic steady state distribution

Considering the periodic selection pressure in the evolutionary dynamics depicted in Fig. 2, the distribution P(x, t) of x approaches a periodic steady state distribution (subsequent to a transient that depends on the initial condition). This can be estimated for 0 < x < 1 considering $\frac{\partial P(x,t)}{\partial t}=-\frac{\partial j(x,t)}{\partial x}$ with $j(x,t)=-\frac{1}{2N}\frac{\partial }{\partial x}[x(1-x)P]+[{\nu }_{-}-({\nu }_{+}+{\nu }_{-}+s)x+s{x}^{2}]P$ using the diffusion approximation²⁶. To compute the probabilities P(0, t) and P(1, t) at the boundaries x = 0 and x = 1, we consider the flux conditions $\frac{\,{{\mbox{d}}}P(0,t)}{{{\mbox{d}}}\,t}=-j(0,t)$ and $\frac{\,{{\mbox{d}}}P(1,t)}{{{\mbox{d}}}\,t}=j(1,t)$. After estimating the steady state distribution of P(x, t) using the algorithm developed in ref. ⁵⁵ implementing the above steps, the average fitness cost $\bar{s}$ during one period can be computed as in (2) with x(t) replaced by ${x}^{{\prime} }(t)=P(1,t)+\int\nolimits_{0}^{1}x(t)P(x,t)\,{{\mbox{d}}}\,x$, following the approach in ref. ²⁶.

Statistical analysis

We perform two-sample location tests comparing the success rate (presence of positive/negative autoregulation) observed in a reference (regulators of genes) and our samples of interest (regulators of synergistically controlled genes). We assume that the underlying distributions are Binomial with n_r and n trials (sample size) and θ_r and θ success rates in the reference and in the sample of interest, respectively. Therefore, the number of successes are given by X_r ~ Binom(n_r, θ_r) and X ~ Binom(n, θ), respectively. With this, ${\hat{\theta }}_{r}={X}_{r}/{n}_{r}$ and $\hat{\theta }=X/n$ are non-biased estimators of the unknown success rates.

With the null hypothesis H₀ : θ = θ_r of identical success rates, we are interested in the probability $p={{{{{\bf{P}}}}}}\left({{\Omega }}\ge \omega \,| \,{H}_{0}\right)$ of observing the value ω of the test statistic Ω (number of positively/negatively autoregulated regulators) at least as extreme as if the null hypothesis was true. As ${n}_{r}{\hat{\theta }}_{r},{n}_{r}(1-{\hat{\theta }}_{r}),\, n\hat{\theta },\, n(1-\hat{\theta }) > 5$ for all datasets we consider, from the Central Limit Theorem it follows that

$$\hat{\theta }-{\hat{\theta }}_{r} \sim {{{{{\mathcal{N}}}}}}\left(\theta -{\theta }_{r},\sqrt{\frac{\theta (1-\theta )}{n}+\frac{{\theta }_{r}(1-{\theta }_{r})}{{n}_{r}}}\right).$$

Since θ_r and θ are unknown, and there are infinitely many choices that satisfy the null hypothesis, we follow the standard choice of using the pooled proportion ${\hat{\theta }}_{0}=({X}_{r}+X)/({n}_{r}+n)$ in place of both, as it satisfies the null hypothesis and it is consistent with our data. With this, we obtain that p ≈ 1 − Φ(z) where Φ( ⋅ ) is the cumulative distribution function of the standard normal distribution, together with $z=(\hat{\theta }-{\hat{\theta }}_{r})/\sqrt{{\hat{\theta }}_{0}(1-{\hat{\theta }}_{0})({n}_{r}^{-1}+{n}^{-1})}$.

Reporting summary

Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.

Data availability

CoryneRegNet 7.0 data were downloaded from https://exbio.wzw.tum.de/coryneregnet/processToDownalod.htm⁵⁸. PRODORIC data can be accessed using the API at https://www.prodoric.de/api/⁵⁹. Data on the functional organization of the transcription regulatory network of E. coli were downloaded from https://www.pnas.org/doi/full/10.1073/pnas.1702581114¹¹². Source data are provided with this paper.

Code availability

The manuscript does not rely on custom mathematical algorithms or software. Simulation data were generated and analyzed as described in the “Methods” using built-in MATLAB (version R2023a) functions. The MATLAB scripts used to obtain the results featured in the paper are publicly available at https://github.com/qbionet/evolutionary-selection. Additional information is available from the corresponding author upon request.

References

Hartwell, L. H., Hopfield, J. J., Leibler, S. & Murray, A. W. From molecular to modular cell biology. Nature 402, C47–C52 (1999).
Article CAS PubMed Google Scholar
Wolf, D. M. & Arkin, A. P. Motifs, modules and games in bacteria. Curr. Opin. Microbiol. 6, 125–134 (2003).
Article CAS PubMed Google Scholar
Wall, M. E., Hlavacek, W. S. & Savageau, M. A. Design of gene circuits: lessons from bacteria. Nat. Rev. Genet. 5, 34–42 (2004).
Article CAS PubMed Google Scholar
Yi, T.-M., Huang, Y., Simon, M. I. & Doyle, J. Robust perfect adaptation in bacterial chemotaxis through integral feedback control. Proc. Natl Acad. Sci. USA 97, 4649–4653 (2000).
Article ADS CAS PubMed PubMed Central Google Scholar
Briat, C., Gupta, A. & Khammash, M. Antithetic integral feedback ensures robust perfect adaptation in noisy biomolecular networks. Cell Syst. 2, 15–26 (2016).
Article CAS PubMed Google Scholar
Aoki, S. K. et al. A universal biomolecular integral feedback controller for robust perfect adaptation. Nature 570, 533–537 (2019).
Article CAS PubMed Google Scholar
Khammash, M. H. Perfect adaptation in biology. Cell Syst. 12, 509–521 (2021).
Article CAS PubMed Google Scholar
Gupta, A. & Khammash, M. Universal structural requirements for maximal robust perfect adaptation in biomolecular networks. Proc. Natl Acad. Sci. USA 119, e2207802119 (2022).
Article MathSciNet CAS PubMed PubMed Central Google Scholar
Alon, U. Network motifs: theory and experimental approaches. Nat. Rev. Genet. 8, 450–461 (2007).
Article CAS PubMed Google Scholar
Hart, Y., Antebi, Y. E., Mayo, A. E., Friedman, N. & Alon, U. Design principles of cell circuits with paradoxical components. Proc. Natl Acad. Sci. USA 109, 8346–8351 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Gorochowski, T. E., Grierson, C. S. & Di Bernardo, M. Organization of feed-forward loop motifs reveals architectural principles in natural and engineered networks. Sci. Adv. 4, eaap9751 (2018).
Article ADS PubMed PubMed Central Google Scholar
Ingram, P. J., Stumpf, M. P. & Stark, J. Network motifs: structure does not determine function. BMC Genom. 7, 108 (2006).
Article Google Scholar
Cardinale, S. & Arkin, A. P. Contextualizing context for synthetic biology - identifying causes of failure of synthetic biological systems. Biotechnol. J. 7, 856–866 (2012).
Article CAS PubMed PubMed Central Google Scholar
Del Vecchio, D., Ninfa, A. J. & Sontag, E. D. Modular cell biology: retroactivity and insulation. Mol. Syst. Biol. 4, 161 (2008).
Article PubMed PubMed Central Google Scholar
Gyorgy, A. & Del Vecchio, D. Modular composition of gene transcription networks. PLoS Comput. Biol. 10, e1003486 (2014).
Article PubMed PubMed Central Google Scholar
Gyorgy, A. et al. Isocost lines describe the cellular economy of genetic circuits. Biophys. J. 109, 639–646 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Ceroni, F., Algar, R., Stan, G.-B. & Ellis, T. Quantifying cellular capacity identifies gene expression designs with reduced burden. Nat. Methods 12, 415–418 (2015).
Article CAS PubMed Google Scholar
Gorochowski, T. E., Avcilar-Kucukgoze, I., Bovenberg, R. A. L., Roubos, J. A. & Ignatova, Z. A minimal model of ribosome allocation dynamics captures trade-offs in expression between endogenous and synthetic genes. ACS Synth. Biol. 5, 710–720 (2016).
Article CAS PubMed Google Scholar
Yeung, E. et al. Biophysical constraints arising from compositional context in synthetic gene networks. Cell Syst. 5, 11–24.e12 (2017).
PubMed Google Scholar
Sechkar, K., Perrino, G. & Stan, G.-B. A coarse-grained bacterial cell model for resource-aware analysis and design of synthetic gene circuits. Preprint at bioRxiv https://doi.org/10.1101/2023.04.08.536106 (2023).
Di Blasi, R. et al. Resource-aware construct design in mammalian cells. Nat. Commun. 14, 3576 (2023).
Article ADS PubMed PubMed Central Google Scholar
Savageau, M. A. Genetic regulatory mechanisms and the ecological niche of Escherichia coli. Proc. Natl Acad. Sci. USA 71, 2453–2455 (1974).
Article ADS CAS PubMed PubMed Central Google Scholar
Savageau, M. A. Design of molecular control mechanisms and the demand for gene expression. Proc. Natl Acad. Sci. USA 74, 5647–5651 (1977).
Article ADS CAS PubMed PubMed Central Google Scholar
Savageau, M. A. Demand theory of gene regulation. I. Quantitative development of the theory. Genetics 149, 1665–1676 (1998).
Article CAS PubMed PubMed Central Google Scholar
Shinar, G., Dekel, E., Tlusty, T. & Alon, U. Rules for biological regulation based on error minimization. Proc. Natl Acad. Sci. USA 103, 3999–4004 (2006).
Article ADS CAS PubMed Google Scholar
Gerland, U. & Hwa, T. Evolutionary selection between alternative modes of gene regulation. Proc. Natl Acad. Sci. USA 106, 8841–8846 (2009).
Article ADS CAS PubMed PubMed Central Google Scholar
Gu, Z. et al. Role of duplicate genes in genetic robustness against null mutations. Nature 421, 63–66 (2003).
Article ADS CAS PubMed Google Scholar
Stelling, J., Sauer, U., Szallasi, Z., Doyle, F. J. & Doyle, J. Robustness of cellular functions. Cell 118, 675–685 (2004).
Article CAS PubMed Google Scholar
Wagner, A. Robustness and Evolvability in Living Systems (Princeton University Press, 2007).
Plata, G. & Vitkup, D. Genetic robustness and functional evolution of gene duplicates. Nucleic Acids Res. 42, 2405–2414 (2014).
Article CAS PubMed Google Scholar
Lynch, M. & Marinov, G. K. The bioenergetic costs of a gene. Proc. Natl Acad. Sci. USA 112, 15690–15695 (2015).
Article ADS CAS PubMed PubMed Central Google Scholar
Laub, M. T., McAdams, H. H., Feldblyum, T., Fraser, C. M. & Shapiro, L. Global analysis of the genetic network controlling a bacterial cell cycle. Science 290, 2144–2148 (2000).
Article ADS CAS PubMed Google Scholar
Kalir, S. et al. Ordering genes in a flagella pathway by analysis of expression kinetics from living bacteria. Science 292, 2080–2083 (2001).
Article CAS PubMed Google Scholar
Ronen, M., Rosenberg, R., Shraiman, B. I. & Alon, U. Assigning numbers to the arrows: parameterizing a gene regulation network by using accurate expression kinetics. Proc. Natl Acad. Sci. USA 99, 10555–10560 (2002).
Article ADS CAS PubMed PubMed Central Google Scholar
McAdams, H. H. & Shapiro, L. A bacterial cell-cycle regulatory network operating in time and space. Science 301, 1874–1877 (2003).
Article ADS CAS PubMed Google Scholar
Zaslaver, A. et al. Just-in-time transcription program in metabolic pathways. Nat. Genet. 36, 486–491 (2004).
Article CAS PubMed Google Scholar
Franco, E., Giordano, G., Forsberg, P.-O. & Murray, R. M. Negative autoregulation matches production and demand in synthetic transcriptional networks. ACS Synth. Biol. 3, 589–599 (2014).
Article CAS PubMed Google Scholar
Darlington, A. P. S., Kim, J., Jiménez, J. I. & Bates, D. G. Engineering translational resource allocation controllers: mechanistic models, design guidelines, and potential biological implementations. ACS Synth. Biol. 7, 2485–2496 (2018).
Article CAS PubMed Google Scholar
Darlington, A. P. & Bates, D. G. Architectures for combined transcriptional and translational resource allocation controllers. Cell Syst. 11, 382–392.e9 (2020).
PubMed Google Scholar
Ceroni, F. et al. Burden-driven feedback control of gene expression. Nat. Methods 15, 387–393 (2018).
Article CAS PubMed Google Scholar
Del Vecchio, D., Abdallah, H., Qian, Y. & Collins, J. J. A blueprint for a synthetic genetic feedback controller to reprogram cell fate. Cell Syst. 4, 109–120.e11 (2017).
PubMed PubMed Central Google Scholar
Ng, A. H. et al. Publisher Correction: Modular and tunable biological feedback control using a de novo protein switch. Nature 579, E8–E8 (2020).
Article CAS PubMed Google Scholar
Hu, C. Y. & Murray, R. M. Layered feedback control overcomes performance trade-off in synthetic biomolecular networks. Nat. Commun. 13, 5393 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Gyorgy, A., Menezes, A. & Arcak, M. A blueprint for a synthetic genetic feedback optimizer. Nat. Commun. 14, 2554 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Darlington, A. P. S., Kim, J., Jiménez, J. I. & Bates, D. G. Dynamic allocation of orthogonal ribosomes facilitates uncoupling of co-expressed genes. Nat. Commun. 9, 695 (2018).
Article ADS PubMed PubMed Central Google Scholar
Mishra, D., Rivera, P. M., Lin, A., Del Vecchio, D. & Weiss, R. A load driver device for engineering modularity in biological networks. Nat. Biotechnol. 32, 1268–1275 (2014).
Article CAS PubMed PubMed Central Google Scholar
Liu, B., Cuba Samaniego, C., Bennett, M., Chappell, J. & Franco, E. RNA compensation: a positive feedback insulation strategy for RNA-based transcription networks. ACS Synth. Biol. 11, 1240–1250 (2022).
Article CAS PubMed Google Scholar
Anastassov, S., Filo, M., Chang, C.-H. & Khammash, M. A cybergenetic framework for engineering intein-mediated integral feedback control systems. Nat. Commun. 14, 1337 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Frei, T., Chang, C.-H., Filo, M., Arampatzis, A. & Khammash, M. A genetic mammalian proportional-integral feedback control circuit for robust and precise gene regulation. Proc. Natl Acad. Sci. USA 119, e2122132119 (2022).
Article CAS PubMed PubMed Central Google Scholar
Filo, M., Kumar, S. & Khammash, M. A hierarchy of biomolecular proportional-integral-derivative feedback controllers for robust perfect adaptation and dynamic performance. Nat. Commun. 13, 2119 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Frei, T. et al. Characterization and mitigation of gene expression burden in mammalian cells. Nat. Commun. 11, 4641 (2020).
Article ADS CAS PubMed PubMed Central Google Scholar
Lillacci, G., Benenson, Y. & Khammash, M. Synthetic control systems for high performance gene expression in mammalian cells. Nucleic Acids Res. 46, 9855–9863 (2018).
Article CAS PubMed PubMed Central Google Scholar
Fisher, R. A.The Genetical Theory of Natural Selection (Clarendon Press, 1930).
Wright, S. Evolution in Mendelian populations. Genetics 16, 97–159 (1931).
Article CAS PubMed PubMed Central Google Scholar
Zhao, L., Yue, X. & Waxman, D. Complete numerical solution of the diffusion equation of random genetic drift. Genetics 194, 973–985 (2013).
Article PubMed PubMed Central Google Scholar
Bratsun, D., Volfson, D., Tsimring, L. S. & Hasty, J. Delay-induced stochastic oscillations in gene regulation. Proc. Natl Acad. Sci. USA 102, 14593–14598 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Shopera, T., Henson, W. R. & Moon, T. S. Dynamics of sequestration-based gene regulatory cascades. Nucleic Acids Res. 45, 7515–7526 (2017).
Article CAS PubMed PubMed Central Google Scholar
Parise, M. T. D. et al. CoryneRegNet 7, the reference database and analysis platform for corynebacterial gene regulatory networks. Sci. Data 7, 142 (2020).
Article MathSciNet PubMed PubMed Central Google Scholar
Dudek, C.-A. & Jahn, D. PRODORIC: state-of-the-art database of prokaryotic gene regulation. Nucleic Acids Res. 50, D295–D302 (2021).
Article PubMed Central Google Scholar
Thieffry, D., Huerta, A. M., Pérez-Rueda, E. & Collado-Vides, J. From specific gene regulation to genomic networks: a global analysis of transcriptional regulation in Escherichia coli. BioEssays 20, 433–440 (1998).
Article CAS PubMed Google Scholar
Shen-Orr, S. S., Milo, R., Mangan, S. & Alon, U. Network motifs in the transcriptional regulation network of Escherichia coli. Nat. Genet. 31, 64–68 (2002).
Article CAS PubMed Google Scholar
Rosenfeld, N., Elowitz, M. B. & Alon, U. Negative autoregulation speeds the response times of transcription networks. J. Mol. Biol. 323, 785–793 (2002).
Article CAS PubMed Google Scholar
Camas, F. M., Blázquez, J. & Poyatos, J. F. Autogenous and nonautogenous control of response in a genetic network. Proc. Natl Acad. Sci. USA 103, 12718–12723 (2006).
Article ADS CAS PubMed PubMed Central Google Scholar
Dublanche, Y., Michalodimitrakis, K., Kümmerer, N., Foglierini, M. & Serrano, L. Noise in transcription negative feedback loops: simulation and experimental analysis. Mol. Syst. Biol. 2, 41 (2006).
Article PubMed PubMed Central Google Scholar
Becskei, A. & Serrano, L. Engineering stability in gene networks by autoregulation. Nature 405, 590–593 (2000).
Article ADS CAS PubMed Google Scholar
Kramer, B. P. & Fussenegger, M. Hysteresis in a synthetic mammalian gene network. Proc. Natl Acad. Sci. USA 102, 9517–9522 (2005).
Article ADS CAS PubMed PubMed Central Google Scholar
Arkin, A., Ross, J. & McAdams, H. H. Stochastic kinetic analysis of developmental pathway bifurcation in phage λ-infected Escherichia coli cells. Genetics 149, 1633–1648 (1998).
Article CAS PubMed PubMed Central Google Scholar
Martínez-Antonio, A., Janga, S. C. & Thieffry, D. Functional organisation of Escherichia coli transcriptional regulatory network. J. Mol. Biol. 381, 238–247 (2008).
Article PubMed PubMed Central Google Scholar
Ma, H.-W., Buer, J. & Zeng, A.-P. Hierarchical structure and modules in the Escherichia coli transcriptional regulatory network revealed by a new top-down approach. BMC Bioinform. 5, 199 (2004).
Article Google Scholar
Ma, H.-W. An extended transcriptional regulatory network of Escherichia coli and analysis of its hierarchical structure and network motifs. Nucleic Acids Res. 32, 6643–6649 (2004).
Article CAS PubMed PubMed Central Google Scholar
Cosentino Lagomarsino, M., Jona, P., Bassetti, B. & Isambert, H. Hierarchy and feedback in the evolution of the Escherichia coli transcription network. Proc. Natl Acad. Sci. USA 104, 5516–5520 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Fares, M. A. The origins of mutational robustness. Trends Genet. 31, 373–381 (2015).
Article CAS PubMed Google Scholar
Masel, J. & Siegal, M. L. Robustness: mechanisms and consequences. Trends Genet. 25, 395–403 (2009).
Article CAS PubMed PubMed Central Google Scholar
Masel, J. & Trotter, M. V. Robustness and evolvability. Trends Genet. 26, 406–414 (2010).
Article CAS PubMed PubMed Central Google Scholar
Denby, C. M., Im, J. H., Yu, R. C., Pesce, C. G. & Brem, R. B. Negative feedback confers mutational robustness in yeast transcription factor regulation. Proc. Natl Acad. Sci. USA 109, 3874–3878 (2012).
Article ADS CAS PubMed PubMed Central Google Scholar
Ahmad, M. et al. Tradeoff between lag time and growth rate drives the plasmid acquisition cost. Nat. Commun. 14, 2343 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Ingram, D. & Stan, G.-B. Modelling genetic stability in engineered cell populations. Nat. Commun. 14, 3471 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Bennett, M. R. & Hasty, J. Microfluidic devices for measuring gene network dynamics in single cells. Nat. Rev. Genet. 10, 628–638 (2009).
Article CAS PubMed PubMed Central Google Scholar
Prindle, A. et al. A sensing array of radically coupled genetic ‘biopixels’. Nature 481, 39–44 (2012).
Article ADS CAS Google Scholar
Rullan, M., Benzinger, D., Schmidt, G. W., Milias-Argeitis, A. & Khammash, M. An optogenetic platform for real-time, single-cell interrogation of stochastic transcriptional regulation. Mol. Cell 70, 745–756.e6 (2018).
Article PubMed PubMed Central Google Scholar
Wong, B. G., Mancuso, C. P., Kiriakov, S., Bashor, C. J. & Khalil, A. S. Precise, automated control of conditions for high-throughput growth of yeast and bacteria with eVOLVER. Nat. Biotechnol. 36, 614–623 (2018).
Article CAS PubMed PubMed Central Google Scholar
Steel, H., Habgood, R., Kelly, C. L. & Papachristodoulou, A. In situ characterisation and manipulation of biological systems with Chi.Bio. PLOS Biol. 18, e3000794 (2020).
Article CAS PubMed PubMed Central Google Scholar
Sleight, S. C., Bartley, B. A., Lieviant, J. A. & Sauro, H. M. Designing and engineering evolutionary robust genetic circuits. J. Biol. Eng. 4, 12 (2010).
Article PubMed PubMed Central Google Scholar
Renda, B. A., Hammerling, M. J. & Barrick, J. E. Engineering reduced evolutionary potential for synthetic biology. Mol. BioSyst. 10, 1668–1678 (2014).
Article CAS PubMed PubMed Central Google Scholar
Weber, E., Engler, C., Gruetzner, R., Werner, S. & Marillonnet, S. A modular cloning system for standardized assembly of multigene constructs. PLoS ONE 6, e16765 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Meyer, A. J., Segall-Shapiro, T. H., Glassey, E., Zhang, J. & Voigt, C. A. Escherichia coli “Marionette” strains with 12 highly optimized small-molecule sensors. Nat. Chem. Biol. 15, 196–204 (2019).
Article CAS PubMed Google Scholar
Moore, S. J. et al. A Multifunctional MoClo Kit for E. coli Synthetic Biology. ACS Synth. Biol. 9, 1225–1225 (2020).
Article CAS PubMed Google Scholar
Joshi, S. H.-N., Yong, C. & Gyorgy, A. Inducible plasmid copy number control for synthetic biology in commonly used E. coli strains. Nat. Commun. 13, 6691 (2022).
Article ADS CAS PubMed PubMed Central Google Scholar
Elena, S. F. & Lenski, R. E. Evolution experiments with microorganisms: the dynamics and genetic bases of adaptation. Nat. Rev. Genet. 4, 457–469 (2003).
Article CAS PubMed Google Scholar
Ibarra, R. U., Edwards, J. S. & Palsson, B. O. Escherichia coli K-12 undergoes adaptive evolution to achieve in silico predicted optimal growth. Nature 420, 186–189 (2002).
Article ADS CAS PubMed Google Scholar
Shibai, A. et al. Mutation accumulation under UV radiation in Escherichia coli. Sci. Rep. 7, 14531 (2017).
Article ADS PubMed PubMed Central Google Scholar
Halperin, S. O. et al. CRISPR-guided DNA polymerases enable diversification of all nucleotides in a tunable window. Nature 560, 248–252 (2018).
Article ADS CAS PubMed Google Scholar
Baym, M. et al. Spatiotemporal microbial evolution on antibiotic landscapes. Science 353, 1147–1151 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Kohanski, M. A., DePristo, M. A. & Collins, J. J. Sublethal antibiotic treatment leads to multidrug resistance via radical-induced mutagenesis. Mol. Cell 37, 311–320 (2010).
Article CAS PubMed PubMed Central Google Scholar
Andersson, D. I. & Hughes, D. Microbiological effects of sublethal levels of antibiotics. Nat. Rev. Microbiol. 12, 465–478 (2014).
Article CAS PubMed Google Scholar
Matange, N., Hegde, S. & Bodkhe, S. Adaptation through lifestyle switching sculpts the fitness landscape of evolving populations: implications for the selection of drug-resistant bacteria at low drug pressures. Genetics 211, 1029–1044 (2019).
Article CAS PubMed PubMed Central Google Scholar
Burgess-Brown, N. A. et al. Codon optimization can improve expression of human genes in Escherichia coli: a multi-gene study. Protein Expr. Purif. 59, 94–102 (2008).
Article CAS PubMed Google Scholar
Gilman, J., Walls, L., Bandiera, L. & Menolascina, F. Statistical design of experiments for synthetic biology. ACS Synth. Biol. 10, 1–18 (2021).
Article CAS PubMed Google Scholar
Castle, S. D., Grierson, C. S. & Gorochowski, T. E. Towards an engineering theory of evolution. Nat. Commun. 12, 3326 (2021).
Article ADS CAS PubMed PubMed Central Google Scholar
Swings, T. et al. Adaptive tuning of mutation rates allows fast response to lethal stress in Escherichia coli. eLife 6, e22939 (2017).
Article PubMed PubMed Central Google Scholar
Drake, J. W. A constant rate of spontaneous mutation in DNA-based microbes. Proc. Natl Acad. Sci. USA 88, 7160–7164 (1991).
Article ADS CAS PubMed PubMed Central Google Scholar
Ochman, H., Elwyn, S. & Moran, N. A. Calibrating bacterial evolution. Proc. Natl Acad. Sci. USA 96, 12638–12643 (1999).
Article ADS CAS PubMed PubMed Central Google Scholar
Lee, H., Popodi, E., Tang, H. & Foster, P. L. Rate and molecular spectrum of spontaneous mutations in the bacterium Escherichia coli as determined by whole-genome sequencing. Proc. Natl Acad. Sci. USA 109, E2774–E2783 (2012).
Perfeito, L., Fernandes, L., Mota, C. & Gordo, I. Adaptive mutations in bacteria: high rate and small effects. Science 317, 813–815 (2007).
Article ADS CAS PubMed Google Scholar
Park, C., Qian, W. & Zhang, J. Genomic evidence for elevated mutation rates in highly expressed genes. EMBO Rep. 13, 1123–1129 (2012).
Article CAS PubMed PubMed Central Google Scholar
Imhof, M. & Schlötterer, C. Fitness effects of advantageous mutations in evolving Escherichia coli populations. Proc. Natl Acad. Sci. USA 98, 1113–1117 (2001).
Article ADS CAS PubMed PubMed Central Google Scholar
Schluter, D. Estimating the form of natural selection on a quantitative trait. Evolution 42, 849 (1988).
Article PubMed Google Scholar
Orr, H. A. The genetic theory of adaptation: a brief history. Nat. Rev. Genet. 6, 119–127 (2005).
Article CAS PubMed Google Scholar
Shaw, R. G. & Geyer, C. J. Inferring fitness landscapes. Evolution 64, 2510–2520 (2010).
Article PubMed Google Scholar
Lambert, G. & Kussell, E. Quantifying selective pressures driving bacterial evolution using lineage analysis. Phys. Rev. X 5, 011016 (2015).
CAS PubMed PubMed Central Google Scholar
Hartl, D. L., Moriyama, E. N. & Sawyer, S. A. Selection intensity for codon bias. Genetics 138, 227–234 (1994).
Article CAS PubMed PubMed Central Google Scholar
Fang, X. et al. Global transcriptional regulatory network for Escherichia coli robustly connects gene expression to transcription factor activities. Proc. Natl Acad. Sci. USA 114, 10286–10291 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

This work was supported by research funds from New York University Abu Dhabi.

Author information

Authors and Affiliations

Division of Engineering, New York University Abu Dhabi, Abu Dhabi, UAE
Andras Gyorgy

Authors

Andras Gyorgy
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.G. conceived and designed the research, collected and analyzed the data, prepared the figures, and wrote the manuscript.

Corresponding author

Correspondence to Andras Gyorgy.

Ethics declarations

Competing interests

The author declares no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous reviewers for their contribution to the peer review of this work. A peer review file is available.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Reporting Summary

Peer Review File

Source data

Source Data

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Gyorgy, A. Competition and evolutionary selection among core regulatory motifs in gene expression control. Nat Commun 14, 8266 (2023). https://doi.org/10.1038/s41467-023-43327-7

Download citation

Received: 04 September 2023
Accepted: 07 November 2023
Published: 13 December 2023
DOI: https://doi.org/10.1038/s41467-023-43327-7
Springer Nature Limited

Competition and evolutionary selection among core regulatory motifs in gene expression control

Abstract

Similar content being viewed by others

Introduction

Results

Mathematical model

Fitness cost in large populations

Fitness cost in small populations

Autoregulation can result in unwanted selection pressure

Delay can cause non-autoregulated motifs to outperform autoregulation

Feedback cost can cause non-autoregulated motifs to outperform autoregulation

Reduced bioenergetic cost may contribute to the prevalence of autoregulation in model organisms

Discussion

Methods

Parameters

Stochastic simulation of the evolutionary dynamics

Periodic steady state distribution

Statistical analysis

Reporting summary

Data availability

Code availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Peer review

Peer review information

Additional information

Supplementary information

Source data

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation