Abstract
The emergence of an autocatalytic network from an available set of elements is a fundamental step in early evolutionary processes, such as the origin of metabolism. Given the set of elements, the reactions between them (chemical or otherwise), and with various elements catalysing certain reactions, a Reflexively Autocatalytic F-generated (RAF) set is a subset R\('\) of reactions that is self-generating from a given food set, and with each reaction in R\('\) being catalysed from within R\('\). RAF theory has been applied to various phenomena in theoretical biology, and a key feature of the approach is that it is possible to efficiently identify and classify RAFs within large systems. This is possible because RAFs can be described as the (nonempty) subsets of the reactions that are the fixed points of an (efficiently computable) interior map that operates on subsets of reactions. Although the main generic results concerning RAFs can be derived using just this property, we show that for systems with at least 12 reactions there are generic results concerning RAFs that cannot be proven using the interior operator property alone.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Discrete graph-theoretic models have been developed to describe the emergence and structure of self-generating autocatalytic reaction networks within a larger network. This approach was pioneered by Stuart Kauffman’s modelling of autocatalytic systems in a simple polymer-based origin-of-life model (Kauffman 1986, 1993), as well as independent results on the appearance of cycles in random directed graphs (Bollobás and Rasmussen 1989; Cohen 1988) motivated by their relevance to the emergence of living systems. Kauffman’s notion of a self-generating autocatalytic network was later formalised as a the concept of a Reflexively Autocatalytic and F-generated set (‘RAF’, defined shortly) (Hordijk and Steel 2004). The subsequent theory and algorithms concerning RAFs have been applied in a number of areas, ranging from the origin and structure of primitive metabolism (Xavier et al. 2020; Xavier and Kauffman 2022), to cognitive modelling in cultural evolution (Gabora and Steel 2017, 2020), to ecology (Gatti et al. 2018, 2020), and to economics (Gatti et al. 2020). The RAF concept is related to (but different from) Robert Rosen’s Metabolism-Replacement (M;R) systems in theoretical biology (Jaramillo 2010).
The task of determining whether or not a large network of ‘reactions’ contains a RAF and if so finding one, is made tractable (in polynomial time) by the property of a certain RAF map defined on the subsets of the full network of reactions. Here we generalize RAF maps to interior operators and investigate the properties of such operators, as well as the extent to which such operators (on arbitrary finite sets) can be realized as RAF maps. In particular, we show that there are generic results concerning RAFs that are not provable from just the basic properties of the RAF map as an interior operator. The significance of this result in applications is that certain generic properties of RAFs may require more detailed arguments than those that can be derived using interior operator properties alone.
We begin by defining interior operators on finite sets, listing some of their basic properties, and describing how they arise naturally from directed graphs. The results are then applied to self-generating autocatalytic networks.
2 Interior Operators and Their Fixed Sets
In this paper, we will assume that all sets are finite, and given a set Y, we write \(2^Y\) to denote the power set of Y. A function \(\psi : 2^Y \rightarrow 2^Y\) is an interior operator on the subsets of Y if it satisfies the following three properties (nesting, monotonicity, and idempotence) for all subsets \(X, X'\) of Y:
- (\(I_1\)):
-
\(\psi (X) \subseteq X\),
- (\(I_2\)):
-
\(X \subseteq X' \Rightarrow \psi (X) \subseteq \psi (X')\), and
- (\(I_3\)):
-
\(\psi (\psi (X)) =\psi (X)\).
The term ‘interior operator’ comes from topology, since the function that assigns to any subspace S of a topological space the interior of S (the union of all the open sets contained in S) satisfies the three properties \((I_1)\)–\((I_3)\).
Given an interior operator, \(\psi : 2^Y \rightarrow 2^Y\) and a subset X of Y, let
denote collection of subsets of X that are fixed by \(\psi\). We refer to the collection \(\{F_\psi (X): X \in 2^Y\}\) as the fixed sets of \(\psi\). Note that \(F_\psi (X) \ne \emptyset\) since \(\emptyset \in F_\psi (X)\) for any interior operator \(\psi\).
The following lemma summarises some basic and elementary properties of interior operators (a proof is provided in the Appendix).
Lemma 1
Let, \(\psi : 2^Y \rightarrow 2^Y\) be an interior operator, and let X be a subset of Y.
-
(i)
\(\psi (X) \in F_\psi (X)\) and \(\psi (X)= \bigcup _{U \in F_\psi (X)} U\).
-
(ii)
\(W \subseteq F_\psi (X) \Rightarrow \bigcup W \in F_\psi (X)\).Footnote 1
-
(iii)
An arbitrary collection \({\mathcal {C}}\) of subsets is the collection of fixed sets for some interior operator if and only if \(\emptyset \in {\mathcal {C}}\) and \({\mathcal {C}}\) is union-closed. Moreover, in that case, there is a unique interior operator \(\psi _{{\mathcal {C}}}\) that has \({\mathcal {C}}\) as its collection of fixed sets, and which is determined by:
$$\begin{aligned} \psi _{{\mathcal {C}}}(X)= \bigcup {\{U \in {\mathcal {C}}: U \subseteq X\}}, \end{aligned}$$(1)for all \(X \subseteq Y\).
Notice that Parts (i) and (ii) of this lemma imply that \(\psi (X)\) is the unique maximal fixed set contained within X.
Next, consider any function \(\lambda : 2^Y \rightarrow 2^Y\) that satisfies the properties (\(I_1\)) and (\(I_2\)) of an interior operator (but not necessarily \((I_3)\)). Define a function \(\psi _\lambda : 2^Y \rightarrow 2^Y\) as follows. For \(X \in 2^Y\), set
where \(H_0(X)=X\) and \(H_{i+1}(X) = \lambda (H_i(X))\) for all \(i\ge 0\). Notice that since Y is finite, this intersection is finite, and thus, \(\psi _\lambda (X) = H_n(X)\) for the first value of n for which \(H_n(X)=H_{n+1}(X)\).
Proposition 1
If Y is finite, and \(\lambda : 2^Y \rightarrow 2^Y\) satisfies the properties (\(I_1\)) and (\(I_2\)), then \(\psi _\lambda\) is an interior operator on \(2^Y\). Moreover, \(\psi _\lambda = \lambda\) if and only if \(\lambda\) satisfies (\(I_3\)).
Proof
For any \(X \in 2^Y\), we have \(\psi _\lambda (X) = H_n(X)\) for some value of n (dependent on X), and \(H_{n+1}(X) = \lambda (H_n(X)) = H_n(X)\). Thus,
Since all of the sets in this intersection equal \(H_n(X)\) we obtain \(\psi _\lambda (\psi _\lambda (X)) = H_n(X) =\psi _\lambda (X)\). For the second claim, if \(\lambda =\psi _\lambda\) then since \(\psi _\lambda\) satisfies (\(I_3\)), so does \(\lambda\). Conversely, if \(\lambda\) satisfies (\(I_3\)) then for every \(X \in 2^Y\) we have:
\(\square\)
2.1 Interior Operators Arising from Directed Graphs
Let \(D=(Y,A)\) be a finite directed graph with vertex set Y, and for any nonempty subset X of Y, let D|X be the induced sub-digraph on X (i.e. D|X has vertex set X and (u, v) is an arc of D|X if and only if \((u,v) \in A\) and \(u,v \in X\)). We let \(d_D^+(v)\) denote the in-degree of vertex v in D, and for \(v \in X\), we let \(d^+_{D|X}(v)\) denote the in-degree of vertex v in D|X. Let \(\left( {\begin{array}{c}Y\\ k\end{array}}\right)\) denote the subsets of Y of size k, and for \(k\ge 1\), let:
We say that \({\mathcal {C}}(D)\) is trivial if \({\mathcal {C}}(D)= \{\emptyset \}\). The following result (particularly Part (iii)) will play an important role in Sect. 3.3. The proof is provided in the Appendix.
Proposition 2
-
(i)
\({\mathcal {C}}(D)\) is union-closed; moreover, \({\mathcal {C}}(D)\) is nontrivial if and only if D contains a directed cycle.
-
(ii)
If \(U, W \in {\mathcal {C}}(D)\) with \(U \subsetneq W\), then either \(W\setminus U \in {\mathcal {C}}(D)\) or there is an element \(w \in W\setminus U\) for which \(U \cup \{w\} \in {\mathcal {C}}(D).\)
-
(iii)
Suppose that \(k \ge 3\), \({\mathcal {C}}_k(D) = \left( {\begin{array}{c}Y\\ k\end{array}}\right)\) and \({\mathcal {C}}_{j}(D)=\emptyset\) for all \(1\le j<k\). Then
$$\begin{aligned} |Y|~\le ~1+(k-1)(k-2). \end{aligned}$$
Remark
-
In Part (i), the claim that \({\mathcal {C}}(D)\) is nontrivial implies that D has a directed cycle was noted in Contreras et al. (2011).
-
Proposition 2(iii) fails for \(k=1\) or \(k=2\); in fact, Y can be arbitrarily large in these cases (e.g., for \(k=1\) take the arc set \(\{(v,v): v \in Y\}\) and for \(k=2\) take the arc set \(\{(u,v): u, v \in Y, u \ne v\}\)).
-
It follows from Proposition 2 that not every interior operator on \(2^Y\) can be realised as \(\psi _{{\mathcal {C}}(D)}\) for some digraph D. For example, if we let \(Y=\{a,b,c\}\) and take the union-closed set system \({\mathcal {C}}=\{\emptyset , \{a\}, \{a,b,c\}\}\) then \({\mathcal {C}}\) cannot equal \({\mathcal {C}}(D)\) for any digraph D by Proposition 2(ii). Alternatively, consider the union-closed set system \({\mathcal {C}}^+_k = \{X \in 2^Y: |X| \ge k\} \cup \{\emptyset \}\), where \(k\ge 3\). This satisfies the two assumptions in Proposition 2(iii) and so for any set Y with \(|Y| > 1+(k-1)(k-2)\) it follows that \({\mathcal {C}}^+_k \ne {\mathcal {C}}(D)\) for any digraph D on vertex set Y. Moreover, as Y becomes large, the proportion of interior operators on \(2^Y\) that can be realised as \(\psi _{{\mathcal {C}}(D)}\) for some D converges to zero as |Y| grows. To see this, observe that there are exactly \(2^{n^2}\) digraphs on a vertex set Y of size n, and each digraph uniquely determines \(\psi _{{\mathcal {C}}(D)}\) (though many digraphs produce the same interior operatorFootnote 2). By contrast, the total number of interior operators on \(2^Y\) grows much faster, as the following result shows (a proof is provided in the Appendix).
Proposition 3
For any set Y of size n, there are at least \(2^{\left( {\begin{array}{c}n\\ \lfloor n/2\rfloor \end{array}}\right) }\) interior operators on \(2^Y\).
3 Self-generating Autocatalytic Networks (RAFs)
A catalytic reaction system (CRS) is a quadruple \({\mathcal {Q}}=(X,R,C,F)\) consisting of a finite nonempty set X of elements (e.g., molecule types) and a finite set R of reactions; here a reaction \(r \in R\) refers to an ordered pair (A, B) where A and B are multisets of elements from X. In addition, C is a subset of \(X \times R\) where \((x,r) \in C\) has the interpretation that element x ‘catalyses’ reaction r. We will denote such a CRS by writing \({\mathcal {Q}}= (X, R, C, F)\). For each \(r \in R\), the subset of X consisting of those elements x for which \((x,r) \in C\) are called the catalysts of r, and a particular subset of X, namely a set F that has the interpretation as a set of elements that are freely available to the system. Accordingly, F is referred to as a food set. We write
to denote the reaction that has the reactants \(A=\{a_1, \ldots , a_k\}\), the products \(B=~\{b_1, \ldots , b_l\}\), and the catalysts \(\{c_1, \ldots , c_r\}\).
Let \(\rho (r)\) denote the set of reactants of r (i.e., A, ignoring multiplicities), and let \(\pi (r)\) denote the products of r (i.e., B, ignoring multiplicities).Footnote 3 Moreover, for a subset \(R'\) of R, it is convenient to let \(\pi (R') = \bigcup _{r\in R'} \pi (r)\) denote the set of products of the reactions in \(R'\).
A subset \(R'\) is F-generated if the reactions in \(R'\) can be placed in some linear order \(r_1, r_2, \ldots , r_k\) so that \(\rho (r_1) \subseteq F\) and for all j between 2 and k we have \(\rho (r_j) \subseteq F \cup \pi (\{r_1, \ldots , r_{j-1}\})\). In other words, the reactions in \({\mathcal {R}}'\) are F-generated if they can proceed in some order so that the reactant(s) of each reaction are available by the time they are first required. We call such an ordered sequence of \(R'\) an admissible ordering.
Finally, given a CRS \({\mathcal {Q}}= (X,R,C,F)\), we say that a subset \(R'\) of R is a RAF (Reflexively Autocatalytic and F-generated set) if \(R'\) is nonempty and is F-generated and, in addition, each reaction \(r \in R'\) is catalysed by at least one element in \(F \cup \pi (R')\). For any CRS \({\mathcal {Q}}\), let \({\mathcal {C}}^{\textrm{RAF}}_{\mathcal {Q}}\) denote the set of RAFs for \({\mathcal {Q}}\).
Example 1
Consider the CRS \({\mathcal {Q}}=(X,R, F, C)\) for which \(X=\{f, f', x,y,z\}\), \(F = \{f, f'\}\) and the set R of reactions (with a catalyst indicated in square brackets) is given by:
In this case, R has exactly two admissible orderings (\(r_1, r_2, r_3\) and \(r_1, r_3, r_2\)), and \({\mathcal {C}}^{\textrm{RAF}}_{\mathcal {Q}}= \{\{r_1\}, \{r_1, r_2, r_3\}\}\).
3.1 The maxRAF Interior Operator
A basic result is that when a CRS \({\mathcal {Q}}\) has a RAF, it has has a unique maximal RAF (which is the union of all the RAFs for \({\mathcal {Q}}\)), denoted \(\textrm{maxRAF}({\mathcal {Q}})\) (Hordijk and Steel 2004). For any subset \(R'\) of R, let \({\mathcal {Q}}|R'\) be the CRS \((X, R', C', F)\), where \(C'\) is the restriction of C to \(X \times R'\), and let \(\varphi _{\mathcal {Q}}:2^R \rightarrow 2^R\) be the following function:
To see how \(\varphi _{\mathcal {Q}}\) can be viewed as an interior operator, we first recall some further terminology. Given a subset \(R'\) of reactions R, a subset W of X is said to be \(R'\)-closed if the following property holds:
-
If a reaction r in \(R'\) has all its reactants in W (i.e. \(\rho (r) \subseteq W\)), then all the products of r are also in W (i.e., \(\pi (r) \subseteq W\)).
The union of two \(R'\)-closed sets need not be \(R'\)-closed; nevertheless, given a nonempty subset \(W_0\) of X, there is a unique minimal \(R'\)-closed set containing \(W_0\), denoted \(\textrm{cl}_{R'}(W_0)\). This can be computed in polynomial time in the size of the system by constructing a nested increasing sequence of subsets of elements
where:
We then have \(\textrm{cl}_{R'}(W_0) = W_k\) (note that k is the first value of i for which \(W_i=W_{i+1}\)). If we now take \(W_0=F\), it turns out that any subset \(R'\) of R is F-generated if and only if \(\rho (r) \subseteq \textrm{cl}_{R'}(F)\) for all \(r \in R'\) (Steel et al. 2019); moreover, \(R'\) is a RAF if \(R' \ne \emptyset\) and for each \(r \in R'\), the reactants of r and at least one catalyst of r is present in \(\textrm{cl}_{R'}(F)\). This allows us to express \(\varphi _{\mathcal {Q}}\) as an operator of the form \(\psi _\lambda\), where \(\lambda\) is a function on \(2^R\) that satisfies the interior operator properties \((I_1)\) and \((I_2)\).
Let \(\lambda _{\mathcal {Q}}: 2^R \rightarrow 2^R\) be the function defined by:
The function \(\lambda _{\mathcal {Q}}\) clearly satisfies conditions (\(I_1\)) and (\(I_2\)). If we recall the definition of \(\psi _\lambda\) from Eqn. (2), the maxRAF operator has a representation in the following result from Steel et al. (2020).
Proposition 4
For any CRS \({\mathcal {Q}}=(X,R,C,F)\), the map \(\varphi _{\mathcal {Q}}: 2^R \rightarrow 2^R\) is precisely the interior operator \(\psi _\lambda\) for \(\lambda = \lambda _{\mathcal {Q}}\).
This identity (\(\varphi _{\mathcal {Q}}= \psi _\lambda\)) allows for a polynomial-time algorithm to compute \(\varphi _{\mathcal {Q}}\) (c.f. Steel et al. (2020) and the references therein). In particular, a nonempty subset \(R'\) of R is a RAF if and only if \(\varphi _{\mathcal {Q}}(R') = R'\). Some new and interesting algebraic (semigroup) properties of the map \(\varphi _{\mathcal {Q}}\) were established recently in Loutchko (2023b) (see also Loutchko (2023a), which considers a more general notion than a RAF, corresponding to ‘pseudo-RAFs’ in the RAF literature, and which we do not explore further in this paper).
3.2 RAFs in Elementary CRS Systems
At this point, it is helpful to consider a very special type of catalytic reaction system. A CRS \({\mathcal {Q}}=(X,R,C,F)\) is said to be elementary if each of its reactions has all its reactants in the present food set (formally, \(\rho (r) \subseteq F\) for each \(r \in R\)).
Given an elementary CRS \({\mathcal {Q}}=(X,R,C,F)\), define a digraph \(D({\mathcal {Q}})\) to have vertex set R and an arc from r to \(r'\) (\(r \ne r'\)) if a product of r catalyses \(r'\); in addition, we place an arc from r to itself if either a product of r or an element of F catalyses r.
The following result is easily verified from the definitions [or see (Steel et al. 2019), Theorem 2.1] and describes the set of RAFs of an elementary CRS \({\mathcal {Q}}\) (i.e., \({\mathcal {C}}^{\textrm{RAF}}_{\mathcal {Q}}\)) in terms of the fixed sets of the interior operators arising from digraphs [from Sect. 2.1, and recalling the definition of \({\mathcal {C}}(D)\)]. This will be applied in the next section.
Lemma 2
\({\mathcal {C}}^{\textrm{RAF}}_{\mathcal {Q}}\cup \{\emptyset \}= {\mathcal {C}}(D({\mathcal {Q}}))\).
An immediate consequence of this lemma and Proposition 2(ii) is the following.
Corollary 1
If \({\mathcal {Q}}\) is an elementary CRS which has a RAF, then for any two RAFs of \({\mathcal {Q}}\) (say, \(R', R''\)) if \(R' \subsetneq R''\), then either \(R''\setminus R'\) is a RAF for \({\mathcal {Q}}\) or there is some reaction \(r \in R'' {\setminus } R'\) for which \(R''\cup \{r\}\) is a RAF for \({\mathcal {Q}}\).
Note that this corollary can fail without the assumption that \({\mathcal {Q}}\) is elementary; Example 1 provides a counterexample for the two RAFs \(R'=\{r_1\}\) and \(R'' = R= \{r_1, r_2, r_3\}\). If one removes the ‘elementary’ restriction on a CRS, the class of possible set systems that can be realised as RAFs of some suitably chosen CRS becomes larger and less tractable. We investigate this further in the next section, where we will apply Lemma 2 and the earlier Proposition 2(iii).
3.3 Representing an Interior Operator as a RAF Operator
The main results in RAF theory that are generic (i.e., which hold regardless of the particular choices or restrictions on F, X, R or C in \({\mathcal {Q}}\)) can be established by using only the property that the maxRAF operator \(\varphi _{\mathcal {Q}}\) is a (efficiently computable) interior operator [see Steel et al. (2020)]. This raises the question as to whether theorems that hold true for all RAFs can always be established from (just) this generic property. In other words, can every interior operator for every finite set Y be realised as the maxRAF operator associated with a suitably chosen catalytic reaction system \({\mathcal {Q}}= (X, R, C, F)\) in which Y is identified (via a bijection) with the set R of reactions in \({\mathcal {Q}}\). We show that the answer is ‘no’ by describing a generic result in RAF theory that is not a consequence of the interior operator property of the maxRAF operator.
More precisely, we say that an interior operator \(\psi\) on the subsets of Y has a RAF-realisation if there exists a CRS \({\mathcal {Q}}=(X,R, C, F)\) and a bijection \(b: Y \rightarrow R\) such that for each \(Y' \in 2^Y\) we have:
where \(R' =\beta (Y')\) and where \(\beta :2^Y \rightarrow 2^R\) is the natural bijection induced by b. In other words, the diagram shown commutes for each \(Y'\subseteq Y\). Note that no restriction is placed on the sets X, F, and C in \({\mathcal {Q}}\); in particular, they could be arbitrarily large sets.
We now show that such a realisation is not always possible, as described in Proposition 5(ii) below. For this result, an irreducible RAF (iRAF) for a CRS \({\mathcal {Q}}= (X, R, C, F)\) is a RAF \(R'\) with the property that it contains no (nonempty) RAF as a proper subset (i.e., \(\varphi _{\mathcal {Q}}(R')=R'\) and \(\varphi _{\mathcal {Q}}(R'{\setminus } \{r\}) = \emptyset\) for all \(r \in R'\)).
Proposition 5
-
(i)
For any integer \(k\ge 3\) and any CRS \({\mathcal {Q}}= (X, R, C, F)\) with \(|R|\ge k^3-3k^2+4k\), not all subsets of R of size k are iRAFs.
-
(ii)
For any finite set R of size at least 12, there exists an interior operator \(\psi\) on \(2^R\) that does not have a RAF-realisation.
Proof of Proposition 5:
Part (i): Let \(m=(k^2-3k+3)\), and suppose that \(|R| \ge km\) and every subset of R of size k is an iRAF; we will derive a contradiction. Since \(|R| \ge km\) there exist m disjoint subsets of R of size k, call them \(R_1, \ldots , R_m\). Since these are subsets of R of size k they are iRAFs for \({\mathcal {Q}}\). Now, any RAF requires at least one reaction to have all its reactants in the food set F (this can easily been verified by considering the first reaction in any admissible ordering of the reactions in a RAF). Thus, we can select one such reaction \(r_i\) from \(R_i\) (for each i), to obtain a set \(R_k=\{r_1, \ldots , r_m\}\) of m (distinct) reactions, with each reaction in \(R_m\) having all its reactants in F. Consider the CRS \({\mathcal {Q}}_m = (X, R_m, C_m, F)\) by restricting R to \(R_m\) and restricting C to \(C_m=\{(x,r) \in C: r \in R_m\}\). This is an elementary CRS, and so, by Lemma 2, the set of RAFs of \({\mathcal {Q}}_m\) is equal to \({\mathcal {C}}(D({\mathcal {Q}}_m)) \setminus \{\emptyset \}\) (where \(D({\mathcal {Q}}))\) is defined as Sect. 3.2). Since each subset of R of size k is an iRAF of \({\mathcal {Q}}\) (and noting that \(m \ge k\)), it follows that \({\mathcal {C}}(D({\mathcal {Q}}_m))\setminus \{\emptyset \}\) contains all subsets of \(R_m\) of size k, and no subsets of size less than k, and so we can apply Proposition 2(iii) (with \(Y=R_m\)) to deduce that \(m=|R_m| \le 1+(k-1)(k-2)\). But this contradicts the inequality \(m =|R|/k= k^2-3k+4 > 1+(k-1)(k-2)\).
Part (ii): Put \(k=3\) in Part (i) and consider the following map \(\psi : 2^R \rightarrow 2^R\):
It is easily verified that \(\psi\) satisfies properties (\(I_1\)), (\(I_2\)) and (\(I_3\)) and so is an interior operator, but \(\psi\) has no RAF-realisation by Part (i).
Remark
-
The condition that \(k\ge 3\) is required in Proposition 5(i) since for \(k\le 2\) it is easy to construct CRS systems with an arbitrarily large set of reactions and with all subsets of R of size k being iRAFs (based on the second remark following Proposition 2).
-
Note also that the value 12 in Proposition 5(i) (when \(k=3\)) can be reduced to 4 if one restricts to RAF representations within elementary CRS systems. However, without that restriction, Proposition 5(i) does not hold if 12 is replaced by 4. An example is provided by the CRS \({\mathcal {Q}}\) consisting of \(X=\{f, c_1,c_2, c_3, \gamma , x,y,z\}, F=\{f\}\) and R comprising the four catalysed reactions:
$$\begin{aligned} r_1: f [c_3,\gamma ]\rightarrow x+y+c_1 \\ r_2: f [c_1,\gamma ]\rightarrow y+z+c_2 \\ r_3: f [c_2,\gamma ]\rightarrow x+z+c_3 \\ r_4: x+y+z [w]\rightarrow w+\gamma \end{aligned}$$For this system, each of the four subsets of R of size 3 is an iRAF of \({\mathcal {Q}}=(X,R,C,F)\).
It is possible that the value of 12 in Proposition 5(i) (when \(k=3\)) could be reduced further (or that value of 4 provided by the example above could be increased), however this would require more elaborate arguments.
4 Concluding Comments
Proposition 2 provides set-theoretic necessary conditions for a union-closed collection of sets to be realisable by a digraph. A natural question is whether there is a set-theoretic characterisation of the class of union-closed sets to be realisable by a digraph. A more difficult task would be to characterise the set systems that are realisable as the RAFs of some CRS. Related to the (still open) union-closed conjecture (Balla et al. 2013), is the question of whether there is always a reaction that lies in at least half the RAFs (for either an elementary or general CRS). Although we have focused on applications of interior operators arising from digraphs to autocatalytic networks, other properties of interior operators realisable by graph-based processes may also be relevant to various applications [e.g. in investigating the fixed sets present within digraph models of neuronal networks of the type discussed in Grindrod (2017)].
Notes
For a collection W of sets we use the shorthand \(\bigcup W\) to denote \(\cup _{V \in W} V\).
For example, by Proposition 2(i), all acyclic digraphs return the trivial interior operator defined by \(\psi _{{\mathcal {C}}(D)}(X) = \emptyset , \forall X \subseteq Y\).
It is assumed that \(\rho (r), \pi (r) \ne \emptyset\) for all \(r \in R\).
References
Balla I, Bollobás B, Eccles T (2013) Union-closed families of sets. J Combin Theor A 120:531–544
Bollobás B, Rasmussen S (1989) First cycles in random directed graph processes. Discrete Math 75:55–68
Cazzolla Gatti R, Hordijk W, Kauffman S (2018) Biodiversity is autocatalytic. Ecol Modell 364:70–76
Cohen JE (1988) Threshold phenomena in random structures. Discrete Appl Math 19:113–128
Contreras DA,Pereira U, Hernández VC. Reynaert B, Letelier JC (2011). A loop conjecture for metabolic closure. Proceedings of ECAL 2011: The 11th European Conference on Artificial Life. Paris, France. (pp. 30). ASME. https://doi.org/10.7551/978-0-262-29714-1-ch030
Gabora L, Steel M (2017) Autocatalytic networks in cognition and the origin of culture. J Theor Biol 431:87–95
Gabora L, Steel M (2020) A model of the transition to behavioral and cognitive modernity using reflexively autocatalytic networks. J R Soc Interface. https://doi.org/10.1098/rsif.2020.0545
Gatti RC, Fath B, Hordijk W, Kauffman S, Ulanowicz R (2018) Niche emergence as an autocatalytic process in the evolution of ecosystems. J Theor Biol 454:110–117
Gatti RC, Koppl R, Fath BD, Kauffman S, Hordijk W, Ulanowicz RE (2020) On the emergence of ecological and economic niches. J Bioecon 22:99–127
Grindrod P (2017) On human consciousness: a mathematical perspective. Netw Neurosci 2(1):23–40
Hordijk W, Steel M (2004) Detecting autocatalyctic, self-sustaining sets in chemical reaction systems. J Theor Biol 227(4):451–461
Jaramillo S et al. (2010) (M,R) systems and RAF sets: common ideas, tools and projections. In Proc. ALIFEXII Conf., Odense, Denmark, August 2010, 17. pp. 94–100. Cambridge, MA: MIT Press
Kauffman SA (1993) The origins of order. Oxford University Press, Oxford
Kauffman S (1986) Autocatalytic sets of proteins. J Theor Biol 119:1–24
Loutchko D (2023) Semigroup models for biochemical reaction networks. J Math Biol 86:78
Loutchko D (2023) An algebraic characterization of self-generating chemical reaction networks using semigroup models. J Math Biol 86:76
Steel M, Hordijk W, Xavier JC (2019) Autocatalytic networks in biology: structural theory and algorithms. J Roy Soc Interface 16:20180808
Steel M, Xavier JC, Huson DH (2020) The structure of autocatalytic networks, with application to early biochemistry. J Roy Soc Interface 17:20200488
Xavier JC, Kauffman S (2022) Small-molecule autocatalytic networks are universal metabolic fossils. Phil Trans Roy Soc A 380:20210244
Xavier JC, Hordijk W, Kauffman S, Steel M, Martin WF (2020) Autocatalytic chemical networks at the origin of metabolism. Proc Roy Soc B 287:2019237. https://doi.org/10.1098/rspb.2019.2377
Acknowledgements
I thank Dimitri Loutchko for some helpful comments on an earlier version of this manuscript, and two anonymous reviewers for their constructive suggestions.
Funding
Open Access funding enabled and organized by CAUL and its Member Institutions.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The author declares no competing interests and that this work was not supported by any external grant funding.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendix
Appendix
Proof of Lemma 1
Part (i): The first claim follows immediately from Condition (\(I_3\)). For the second claim, observe that the term on the right is a subset of \(\psi (X)\) since every set U in \(F_\psi (X)\) is a subset of X and so we can apply Condition (\(I_2\)). However \(\psi (X)\) is also in \(F_\psi (X)\) (by Condition (\(I_3\))), so the \(\psi (X)\) is a subset of the right-hand side.
Part (ii): If \(A \in W\), then \(A=\psi (A)\subseteq \psi (\bigcup W)\) by \((I_2)\), so \(\bigcup W \subseteq \psi (\bigcup W)\). Since \(\psi (\bigcup W) \subseteq \bigcup W\) (by \(I_1\)) it follows that \(\psi (\bigcup W) = \bigcup W\) and so \(\bigcup W \in F_\psi (X)\), as claimed.
Part (iii): If \({\mathcal {C}}\) is a collection of fixed sets of some interior operator, then \(\emptyset \in {\mathcal {C}}\), and \({\mathcal {C}}\) is union-closed by Part (ii). Suppose that \({\mathcal {C}}\) has these properties, and consider \(\psi _{{\mathcal {C}}}\). This function clearly satisfies (\(I_1\)) and (\(I_2\)). To verify Condition (\(I_3\)), observe that the union closure condition on \({\mathcal {C}}\) implies that \(\bigcup \{U \in {\mathcal {C}}: U \subseteq X\}\) is a set \(U'\) in \({\mathcal {C}}\), and so
as required. Moreover, the fixed set of \(\psi _{{\mathcal {C}}}\) is \({\mathcal {C}}\), since if \(X \in {\mathcal {C}}\), then \(\psi _{{\mathcal {C}}}(X) = \bigcup \{U \in {\mathcal {C}}: U \subseteq X\} = X\), and if \(\psi _{{\mathcal {C}}}(X) =X\) then since \({\mathcal {C}}\) is union-closed, this implies that \(X \in {\mathcal {C}}\). For the uniqueness claim, suppose that \(\psi\) has \(F_\psi = {\mathcal {C}}\). From Part (i), \(\psi (X)= \bigcup _{U \in F_\psi (X)} U\) and \(F_\psi (X) = \{U \in {\mathcal {C}}: U \subseteq X\}\) thus \(\psi =\psi _{\mathcal {C}}.\) \(\square\)
Proof of Proposition 2
Part (i): Suppose that \(X, X' \in {\mathcal {C}}(D)\), and \(v \in X \cup X'\). Without loss of generality, we may suppose that \(v \in X\). Then v has strictly positive in-degree in D|X, and any arc of the form (x, v) with \(x \in X\) is also present in \(D|(X \cup X')\). Thus \(X \cup X' \in {\mathcal {C}}(D)\). For the second claim, if \(v_1, \ldots , v_r = v_1\) is a directed cycle in D, then the set of vertices in this cycle lies in \({\mathcal {C}}(D)\). Conversely, if D is acyclic, then so too is D|X for any subset X of Y, and since every finite acyclic directed graph has a vertex of in-degree 0, it follows that \(X\not \in {\mathcal {C}}(D)\).
Part (ii): Suppose there is no vertex \(w \in W\setminus U\) for which \(U \cup \{w\} \in {\mathcal {C}}(D)\). Then there is no arc from any vertex in U to a vertex in \(W\setminus U\). However, every vertex in \(W\setminus U\) has an incoming arc from some vertex in W, and therefore, it has an incoming arc from some vertex in \(W{\setminus } U\). Thus \(D|(W{\setminus } U)\) has the property that every vertex in this induced graph has in-degree at least 1, so \(W{\setminus } U \in {\mathcal {C}}(D)\).
Part (iii): Let \(D=(V, A)\), and suppose that \({\mathcal {C}}_k(D) = \left( {\begin{array}{c}Y\\ k\end{array}}\right)\) and \({\mathcal {C}}_{j}(D) = \emptyset\) for all \(1\le j<k\), where \(k\ge 3\). We first show that this implies that \(d^+_{D}(v) \le k-2\) for each vertex \(v \in Y\). To see this, suppose that \(d^+_{D}(v') \ge k-1\) for some element \(v' \in Y\); we will derive a contradiction. Observe that \((v',v') \notin A\) (otherwise \(\{v'\} \in {\mathcal {C}}_1(D) = \emptyset\)) and so there is a subset \(X'\) of size at least \(k-1\) for which \((x',v') \in A\) for each \(x' \in X'\). Let \(X''\) be any subset of \(X'\) of size exactly \(k-1\). Since \({\mathcal {C}}_{k-1}(D)=\emptyset\) and \(|X''|=k-1\), at least one element \(x'' \in X''\) has no incoming arc from any other vertex in \(X''\), which means that \((v',x'') \in A\), since \(X'' \cup \{v'\} \in {\mathcal {C}}_k(D)\). On the other hand, \((x'', v') \in A\) (by definition of \(X''\)), which implies that \(\{v', x''\} \in {\mathcal {C}}_2(D)\), providing the required contradiction since \({\mathcal {C}}_2(D)=\emptyset\) (since \(k\ge 3\)). Thus each vertex v in D has in-degree at most \(k-2\), as claimed.
If we now let \(n=|Y|\) then, since \(|A| = \sum _{v \in Y}d_D^+(v)\), we obtain:
Now consider the set \(\Omega\) of pairs (S, a) where S is a subset of k vertices from Y, and a is an arc between any two vertices of S. Formally,
We count this set in two ways. Since \(n= |Y|\), the number of choices for S is \(\left( {\begin{array}{c}n\\ k\end{array}}\right)\). Moreover, for each such set S, there are precisely k arcs that form a cycle involving k elements in S, since: (a) if any more arcs were present between the vertices of S then a set in \({\mathcal {C}}_{j}(D)\) for some \(j<k\) would appear, and (b) if no cycle was present involving all elements of k, then S would not lie in \({\mathcal {C}}_k(D)\), and both of these two possibilities are excluded by the two assumptions stated in Part (iii). In summary,
We can also count \(\Omega\) by first selecting an arc (u, v) from A and counting the number of sets \(S \in \left( {\begin{array}{c}Y\\ k\end{array}}\right)\) that contain u and v. By the assumptions in Part (iii), each subset of Y of size k induces a unique cycle through all the vertices (and with no other arcs present between the vertices), so the number of sets S that can be chosen for (u, v) is \(\left( {\begin{array}{c}n-2\\ k-2\end{array}}\right)\). Thus we have:
Combining Eqns. (4), (5) and (6) gives:
which simplifies to \(n \le 1+(k-1)(k-2)\), as claimed. \(\square\)
Proof of Proposition 3:
Let \({\mathcal {A}}=\{U \subset Y: |U|=\lfloor n/2\rfloor \}\), which is an antichain in the poset \(2^Y\) (partially ordered by set inclusion) of size \(\left( {\begin{array}{c}n\\ \lfloor n/2\rfloor \end{array}}\right)\) (\({\mathcal {A}}\) is also a largest antichain by Sperner’s theorem). Let S be a subset of \({\mathcal {A}}\), and let \({\mathcal {C}}[S]\) be the collection of subsets of Y consisting of \(\emptyset\), the sets in S, and all possible unions of the sets from S. In this case, \({\mathcal {C}}[S]\) satisfies the conditions of Lemma 1 (iii) and so there is a unique interior operator \(\psi _{{\mathcal {C}}[S]}\) that has the fixed set \({\mathcal {C}}[S]\). Moreover, the collection of minimal nonempty fixed sets of \(\psi _{C[S]}\) is precisely the sets in S, so if \(S \ne S'\), then \(\psi _{{\mathcal {C}}[S]} \ne \psi _{{\mathcal {C}}[S']}\). Since there are \(2^{\left( {\begin{array}{c}n\\ \lfloor n/2\rfloor \end{array}}\right) }\) choices for S, this completes the proof. \(\square\)
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Steel, M. Interior Operators and Their Relationship to Autocatalytic Networks. Acta Biotheor 71, 21 (2023). https://doi.org/10.1007/s10441-023-09472-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10441-023-09472-8