Case File
efta-efta01183812DOJ Data Set 9OtherPrimordial sex facilitates the emergence of
Date
Unknown
Source
DOJ Data Set 9
Reference
efta-efta01183812
Pages
7
Persons
0
Integrity
Extracted Text (OCR)
Text extracted via OCR from the original document. May contain errors from the scanning process.
Primordial sex facilitates the emergence of
evolvable protocells
Sam Sinai ' '
, Jason Olejarz , lulia A. Neagu
, and Martin A. Nowale.b."
"Program for Evolutionary Dynamics. HarvaM University. I Brame square. suite 6.02138; hDepanment of Organismic and Evolutionary Biology; `Department of Mathematics;
dOepariment of Physics. Harvard University
This manuscript was compiled on April x.2016
Membranes, forming protocells, are widely considered beneficial or
even essential to the maintenance of cooperation in early evolution
[1-5). Moreover, there are strong arguments from chemistry to sug-
gest that membranes played a critical role in pre-evolutionary dynam-
ics [6-9). In this study we propose a novel reason why membranes
are beneficial even before the presence of replication or selection.
We argue that the ability of lipid membranes to fuse and share their
contents. "primordial sex", improves the efficiency of finding mini-
mal evolvable protocells. We analyze and quantify a model of merg-
ing membranes that resembles a sexual repair mechanism known
as multiplicity reactivation in modern viruses [10). We then argue
that this mechanism could shorten the timescale and increase the
probability of finding evolvable combinations of simple functional
elements significantly. This in turn suggests that assembling com-
plicated sets of functions at random may not be as probabilistically
implausible as it first appears. Hence, in the presence of sex, large
assemblies and functional networks can form without requiring evo-
lution. Finally, we establish a quantitative framework to analyze how
parasites, thought to be a serious impediment in early life. affect
the accumulation of functions. We show that while parasites may
hurt the accumulation process, under most circumstances, the ben-
efits of sex massively outweigh the risks of exposure to parasitic
elements.
Origin of Life I Protocells I Origin of Sex I Multiplicity Reactivation
M
enthrones are ubiquitous across all domains of modern
life, yet their importance stretches far back to the origins
of the very first cells [6-8,
Prebiotic chemists [6, 7, 9], as
well as origin of life theorists [2-4, 8, 12), have been interested
in understanding the specific roles that membranes, in self-
organized lipid vesicles (also referred to as protocells), could
have played in early evolution.
The "RNA world hypothesis" concerns itself with how RNA
or similar bio-polymers gave rise to information-coding and
enzymatic activities that eventually lead to their central role in
living organisms [13-15]. However, well-mixed populations of
such molecules often stiffer from well-known pitfalls, including
the error catastrophe for replicates [16, 17] and parasitism for
cooperative enzymes [1, 2, 5, 12]. Further, despite decades of
effort in prebiotic chemistry, and some exciting piseess (e.g.
[18, 19]), building efficient, stable, and prebiotically plausible
replicases (sometimes called the "holy grail" of the RNA world)
in lab has remained a challenge [20]. Population assortment
through dividing membranes seems to alleviate the parasitism
problem [2-5, 12]. Apart from mathematical reasons, chemists
also argue that membranes play a crucial role (e.g. producing
an electro-chemical gradient) in maintaining a metabolism in
early cells [6, 7, 9, 21].
While early presence of membranes has many potential
benefits, it is prudent to consider whether they could have been
present in abiotic earth. There is good evidence in support. It
has been shown that amphiphilic molecules, like simple fatty
acids that are building blocks for the lipid-membrane, could
be produced in a prebiotically plausible manner [22]. These
molecules are able to spontaneously assemble into vesicles in
aqueous conditions [23-25]. Alternatively, lipids could have
been imported to earth by chondrite meteorites[26-28]. Hence,
it is commonly assumed that such molecules were present
in sufficient abundances [2-4, 6-9, 12, 24] and could have
produced lipid vesicles.
A "lipid world" may have preceded or coexisted with the
RNA world [6-9, 29-31]. In a lipid world, protocells can con-
tain and protect catalytic and information-bearing molecules.
After the onset of replication (on a molecular or cellular level),
a key step in the RNA world, protocells help selection for
cooperative polymers, in particular replicases [3, 4]. Because
of the potential benefits of protocells, a multitude of successful
experiments in the past decade have focused on the dynamics
of simple co-polymerization inside lipid vesicles [32-36].
There are several abstract properties of protocells that are
of interest. First and most obviously, the contents of protocells
are held near each other (are "co-localized"), and share the
same fate. This results in higher concentrations, increased
interactions within the protocell, and decreased interactions
with outside environment. It also means that the protocell can
house a "compositional genome", i.e. the information within
the protocell need not be stored in one (or few) contiguous
polymer [11, 37]. It may also dampen the effects of side
Significance Statement
Protocells are thought to play important roles in the origins of
life. Meanwhile, some propose that sex —sharing informational
content among protocells— provides benefits in early evolution.
We use mathematical modeling to suggest that even before
the emergence of replication (and evolution). sex could have
been enormously beneficial. In particular. we show that while
assembling protocells with a desired set of components is
nearly impossible if the number of components is large, sex
would improve the efficiency of making such cells by orders of
magnitude. We quantify how much this primitive sex (which
also appears in viruses) "speeds up the inception of evolvable
protocells. and once evolution begins can further increase the
speed at which complexity arises.
SS-J.0.. M.A.N. doomed rosaanth. S.S. 10- tA.N. MAN. paterned ear.oatch.S S JO I A.N
MANanalyzed data. S.S.. J.Q. IAN. AAR solo true papa..
The =has declare no cocain al intwesl
I S.S. and J.O. oanlrb.ned equal?, to thirintk.
2 Martin Nest [email protected](lu
‘vettreas.orgeodoviotoratteati0000000cxx
PNAS I Apr14,2016 I vol. )3fX I no. XX I 1-7
EFTA01183812
reactions for auto-catalytic cycles that may be required to start
and maintain a metabolism [1]. Seoond, protocells can divide
into daughter cells that inherit parts of their contents [38]. This
property is at the heart of many group selection models, like the
stochastic corrector [12], that alleviate problems arising from
parasitism[3, 4, 39-41]. Third, protocells are able to merge
and share their contents under certain conditions. While this
property of protocells has been scrutinized before [1, 42, 43], it
has received far less attention relative to division mechanisms.
Nonetheless, there has been recent experimental success in
protocell fusion models, suggesting that fusion may play a
role in the development of early cells [21]. Note that some
of these properties are also exhibited by other non-organic
boundaries. For instance, bubbles [44] and porous materials
[45] (where fluids flow through small holes and pipes) can
increase local interactions, divide material, and merge them.
In this study, we primarily focus on merging and its role in
constructing evolvable protocells, keeping in mind that our
results are general and are applicable to many processes that
exhibit such properties.
We assume that in order to be evolvable, a protocell needs
to contain a certain number of functions (molecules of various
complexity). In early life, these could be molecules as simple
as ions, co-factors, and nutrients, or more complicated poly-
mers, like oligo-peptides, and even elementary ribozymes and
simple unlinked genes [I, 19, 32-34, 46-48]. Similar models
of functional assemblies have been employed successfully in
the past [2, 11, 12, 37, 49-51] and simple examples have been
experimentally observed [21, 52]. We call the smallest set
of functions from which an evolvable protocol' can be made
a minimal evolvable protocell. More precisely, the target set
should result in an auto-catalytic network that results in an
evolvable cell with non-negligible probability.
In the absence of evolution through replication, a protocell
will need to collect all of those functions through some random
process. If the number of necessary functions that have to
co-occur in a protocell is large, this process is very inefficient
in a landscape where there is no evolution and replication.
The absolute worst case scenario would be that out of pure
luck, a membrane is formed around all the required functions
at once, and results in an evolvable protocell. As this is
incredibly unlikely, it is used as a criticism against approaches
that require many components (or in some scenarios "genes")
at inception [1]. However, as we show in the following section,
alternative random mechanisms of accumulation are made
possible by protocells. These mechanisms may reduce the
probabilistic burden significantly enough, that even under no
evolution, the target set of functions may be achievable for
large number of functions.
Model and Results
The goal of our study is to compare the efficiency of mecha-
nisms that lead to construction of a minimal evolvable proto-
cell, in terms of the information it contains. While we take an
algorithmic perspective (see [53] for a related discussion), the
results can be interpreted biologically. Our target set would
entail a lipid membrane that encloses all the necessary func-
tions for starting a simple metabolism (e.g. an auto-catalytic
cycle) and eventually a replication process.
We study the average-case trajectory of single cell in the
population of protocells until it accumulates all the necessary
Fig. I. Merging occurs between randomly assembled cal s. A pro-
tooel consisting of all of the necessary functionalibes could be constructed by (A)
random assembly or (8) merging. Each merging event can be seen as a bdwise
OR operation between strings of length n that represent protocell contents. Here
each color represents a functionality (and so does each position on the representative
stems). A cell arch all the functions would be represented by a string of all Is.
functionalit ies. We use the number of operations for each
process. relative to the number of functions we need to reach,
as our measure of complexity. A process that takes fewer steps
to finish is considered more efficient. This approach allows us
to analyze the processes in the same framework, and compare
their efficiency. It is possible to map this measure to physical
time or energy cost in a continuous chemical process, depend-
ing on the problem of interest. Here we concern ourselves with
a protocell's abstract properties.
In order to mathematically measure the number of opera-
tions, we represent the functional (or genetic) content of each
protocell as a binary string of length n (a beckon conjunction
used similarly in [54]). For simplicity, and without loss of gen-
erality, we ignore the redundancy (or dose) of each function in
the protocol', and are only concerned with their presence. If a
protocell contains a particular function i, then the string will
have a value of I at the ilk position and 0 otherwise. We as-
sume that the probability for the presence of each component
(component frequency) in a sample is p (and define q E 1 — p).
We also assume that p is identical across components, as this
simplifies our model but is not a key issue for our conclusions.
A process begins with an empty protocell that we track,
called the accumulator. The accumulator then collects func-
tions by "sampling" protocells and updating its own contents.
Sampling is defined as picking random strings from the en-
vironment. We can measure the average-case trajectory of
the accumulator protocell by calculating the expected num-
ber of independent samples required so that the accumulator
acquires all a functions. We can now model an accumulation
algorithm within this framework. The methods used here
are commonplace in analysis of random algorithms and have
also been applied to evolutionary processes [55]. The detailed
calculations for the following results are provided in the SI.
2 I weiwooes.orstgvoovio.vmenaam00000cxxx
Sinai el al
EFTA01183813
Random assembly. In a world in which protocells cannot fuse,
and material transport across the membrane is insignificant,
protocells are constructed by spontaneous formation of a mem-
brane around a random set of functions. In this world, proto-
cells keep the set of functions they already contain. A full set
is only generated when all the required functions happen to
be enclosed within a membrane upon formation. Under the
random assembly process, for each step, the accumulator takes
the value of the latest sample. Hence, each new sample repre-
sents a protocell brining around a random set of molecules. A
protocell produced in this way will contain X functions, where
X is a random variable, distributed binomially with param-
eters ut, p. Therefore, on average np functions are contained
inside the protocell when it is assembled at random. However,
if we need all a functions to co-occur in the same cell, then
TR(n), the expected hitting time, grows exponentially in n:
Ta(a) = (
"
1
[1]
This calculation lays the foundation for our model as it
provides a point of reference for the performance under the
worst-case scenario.
Merging. As we can see from above, most of the cells generated
by random assembly will contain a subset of n possible func-
tions. However, if merging is allowed, we can intersect their
contents to produce the desired set. In order to model this
process we merge the accumulator protocell with random sam-
ples taken from the environment while counting the number
of merges.
Table 1. Merging protocells efficiently produce cells with f, compo-
nents.
Number 04
Functions (n)
Random Assembly
(Err I.)
Merging
(Eqn 2.)
Merging
(Simulation)
10
1020
291.93
291.16 ± 5.24
25
1054
380.18
381.96 ± 5.47
SO
10100
448.17
448.48 ± 5.55
100
10200
516.64
516.22 ± 5.53
250
10500
607.51
608.32 ± 5.41
As an example. fixing the concentration parameter at p = 0.01. we
compare the number of steps it takes to accumulate a functions
through random assembly. and that of the merging process. While
finding sets of a functions by random assembly grows exponentially
fast in a. if those same randomly assembled protocells were able to
merge, target compositions can be found with very few merges. We
also verify the model numerically. The Monte Carlo simulation results
are the mean hitting times over 2000 trials (with corresponding 95%
confidence interval).
If protocells are able to merge with each other, and gener-
ate a new protocell that encloses all the functions from the
two original cells, then their contents are the union of the
parental cells that are generated by random assembly. Like
before, merging occurs with samples that contain X functions,
where X is binomially distributed. Hence, a sampled string
will on average contain np functions, not all of which are nec-
essarily new additions to the accumulator. Note that when
two protocells merge, the value of the resulting string at every
position i is simply determined by a bitwise Oft operation (an
Oft operation on the ith bit of the original protocells taken
together). Now, the problem can be seen as the probability of
finding a string that has a 1 at every position, by sampling
many strings and merging them with the current accumulator.
The hitting time of this process
En (1")'"
1—qt
boa
[2]
This function is O(logn) (intuitively, grows no faster than
At log n as n
oo for some constant k). This captures the idea
that you start from many protocells and merge them together
to arrive at all n functions. Further, it is possible to show that
the distribution of the number of merges is reasonably tight
around the mean (see SI).
Note that in the limit where p = 1/n, each new merging
protocell contains a single function on average. We call this
limit the "membrane transport" process as each operation
entails absorption of a single function from the outside envi-
ronment into the protocell. The reader may notice that this
characterization is identical to the coupon collector's problem
[58]. In this limit, the hitting time is:
Ts(a) = nHn
[3]
Here H„ is the a-th harmonic number. More intuitively,
this function is G(n logn). As a nice check, we can see that
the formula provided for the merging model (setting p = I/n)
is a good approximation to the hitting time predicted by
the coupon collector process. This is indeed the case, and a
verification is provided in the SI.
The membrane transport process is a special case of the
merging process. Both processes are far more efficient that
the prohibitively slow random assembly. As an aside, note
that our analysis of merging membranes easily maps to other
membrane transport phenomena such as heat-cycles, where
protocells become more permeable to surrounding material in
a periodic manner [59]. In such a case, T(n) would capture
the number of cycles that the protocell undergoes to capture
all n functions (assuming there is net inflow).
Loss of functions. We observed above that in an ideal setting,
where all samples can be incorporated into the accumulator
without interruptions, merging significantly reduces the num-
ber of steps it takes to reach the target set. Obviously, while
functions are being accumulated, membrane integrity may be
lost, the protocell may get infected by a parasite, or the proto-
cell may simply divide. Hence, the key test of the performance
of the merging process is to understand if it can accumulate
functions efficiently even in cases where it is regularly set back
by events like division or death.
To address this question, which is the main contribution
of our study, we consider the possibility of a restart in the
accumulator. A restart can be total or partial. A total restart
is equivalent to protocell death; i.e., all functions are lost, and
the accumulation of functions starts anew. A partial restart
occurs if a protocell divides and loses some—but not all—of
its functions. Obviously, division performs better than death
in the merging process, as the accumulator gains a head start
on the number of functions. Therefore, calculating the hitting
time by assuming a total restart (death) provides us with an
upper bound on the performance of the merging process with
protocell division.
I A read, Win Vdef oil in effpriinme may rectoint Ibis resun as the expeled help l01 a woes-
blis12 skp 151 arm n elemealS156.571
SEIM Nat.
PNAS I Apr114, 2016 I vol. XXX I na XX I 3
EFTA01183814
We introduce a new parameter, 8, which denotes the prob-
ability of death at any given step. The accumulator makes a
step (samples another protocell) with probability 1— cf. Given
48, we can revisit the mechanisms introduced above and incor-
porate the death parameter into them. For random assembly,
= 1, i.e. either all functions are accumulated on the first
step, or the process restarts. We now turn our attention to
the cases where 0 < 6 < 1 (with arbitrary 0 < p < I) and
extend the results that we obtained in the previous sections
for merging.
To calculate the hitting time, we define a sequence as a series
of merging events starting from a randomly assembled protocell
that terminates either by accumulating all a functionalities or
by being reset due to death. After each death event, a new
sequence begins. Denote by F the probability that a given
sequence results in all n functionalities being accumulated
without being reset. We have:
F =
(!— 8).-1 [(1- qz)" —
- ez-In
soot
Denote by P(z) the probability mass function for the num-
ber of samples, z, needed to accumulate all n functionalities
when starting with a randomly assembled protocell given that
all it functionalities are accumulated before death. We have:
P(z) = ( 1 — sr -1 [Ci — qt
— (1
r
i n
F
Similarly, denote by A(z) the probability mass function for
the number of samples, z, taken before the protocell is reset
to having no functionalities when starting with a randomly
assembled protocoll given that the protocell dies. We have:
A(z) — so - Sy-,o -
- eysi
1 — F
Hence, the expected number of samples needed to accumu-
late all n functionalities is given by the exact result:
z[FP(z) + (1 - F)A(z)]
T(n)
z=1
[4]
F
We show a numerical verification for Eq. (4) in Figure
2. We can calculate the expected number of steps exactly
through Eq. (4). However to understand the trade-off between
component frequency p, death S, and the number of functions n
better we provide the following approximations. For arbitrary
values p,S E (0.1) and large it. the expected number of steps
has the following asymptotic behavior:
T(n)
—(1 — 6) 40 — p)nk, where k =
[5]
log(1 — 6)
62F(k)
log(1 — p)
There are many possible cases to consider. For example,
we can simplifyEq. (5) further for small p,S (hence k
8/p),
and if 8 is not too large relative to p. In this case, we can
approximate the growth of T(n) by:
/Lk
T(n)
-
This equation is also plotted and verified via simulation in
Figure 2. Using Eq. (5) and Eq. (6) we can see that the ratio k
is the primary factor in determining the number of operations
[6]
0
yV
Comparison of the model, approximation, and simulation
data for p=0.0I and varying
— Model
x x Simulation
ono Approx (Eq. (5))
000 Approx (Eq. (6))
22
23
2a
25
26
Number of functions (n)
2'
23
Fig. 2. Numerical verification of the merging process with death.
For chtferenl values of me death parameler 6. we show the number of samples required
to read, a minimal evolvable prdocell. Simulation results and the approximations in
Eq. (5) and Eq. (6) are provided for comparison.
required to reach the target set of functions. Remarkably, the
merging process achieves a complete set of functions in low-
order polynomial time for a sizable segment of the parameter
space. For example. for .5 < p, the upper bound on the growth
of T(n) is O(n). As another example, for 1 — S > (1 — p)2,
the upper bound on the growth of T(n) is O(n2). As long as
< 1—i.e.. while there is merging of protocells—the merging
process accumulates a complete set of functions in polynomial
time. 718 clarify this further we provide a visualization for the
growth of T(n) with respect to p and tS in Figure 3.
Discussion
Membrane merging, and sharing of informational content,
could be seen as a primitive form of sex. The idea that
sex (or a similar fusion and genetic sharing mechanism) may
have existed since the RNA world has been discussed for
decades [1, 30, 42, 43, 60], but, to our knowledge, the time
complexity of this process has not yet been quantified. We
offer a simple model in the previous and use it to quantify
the time complexity of the accumulation processes that result
in functional or genetic assemblies (akin to compositional
genomes [11, 37] or auto-catalytic sets [50]). These results
establish the quantitative scale of improvement that is possible
through merging (and transport across a membrane), in terms
of number of operations. The time 7'(n) required to assemble a
protocell with all the necessary functionalities is reduced from
an exponential number of attempts to a low-order polynomial
by the merging process. Critically, our observations remain
relevant even if the protocells undergo division or death during
this process or if they are infected with parasites. This is
the key result of this analysis. If merging is possible, the
idea of cells with many co-occurring functions is no longer a
probabilistic miracle, but sometimes even inevitable.
There are conjectures that in the primordial world, genomes
may have been segmented and a lot of mixing and reassert-
ment may have taken place [1, II, 61, 62]. Our results take the
benefits of sex to even before evolution (self-replication of com-
ponents or protocells) started. In other words, the population
4 I wonv.pnas.ctestglidS/10.1073.epnasJ0O30CXXXXX
Sins el al
EFTA01183815
on
>. 40
°”
g of
42
014
E
0
1
V 001 oPP
045
04
on
)
on
el
o n
0
00S
S
SO.,
e
SJSOSOS
O
SI
S
„I* Je O .0
Death 4
Death .4
Number of steps
=SEPOPE.M
.
≤n
n n
n
n
/I
Fig. 3. Target protocells are found within polynomial number of
steps through merging with death. The bur panels provide a general
oveSew of the Interplay between the probability of component per sample p prob.
abildy of death 6. and multiple examples of n (number of functions). The colors
represent the number of steps 7 (n) as a function of ri.
structure, and operations proposed here, improve the efficiency
of finding an evolvable cell, without the need for any selection
or explicit replication. The process only requires protocells, or
a similar compartmentalization agent that is capable of fusion.
Of course, the same results could equally apply to protocell
interactions after the emergence of replication.
In that case, these results may suggest that among other
benefits that sex could provide for protocells, like allowing
good combinations to form, select for good "mixers" [63], and
repairing lost functionalities [42], it could have also played a
role in finding them quickly. Under such scenario, primordial
sex through merging and content sharing preceded primor-
dial replication (or "prelife" [64]) and lasted throughout early
evolution. Obviously, these results are applicable if mem-
branes, or similar compartments, appear early and in sufficient
abundance, and the number of functions is not trivially small.
A well-recognized pitfall of early life dynamics is the prob-
lem of parasitism [2, 4]. In particular, sex increases the possi-
bility of exposure to parasitic elements [I, 43]. However, our
results show that while parasites do harm the efficiency of
the process significantly, even in their presence the merging
process remains tractable and reasonably efficient for a large
set of parameters. Notably, here we assume that merging with
a single parasite is sufficient to kill the cell, which is a strictest
possible bound. In other words, barring relatively high prob-
ability of encountering a parasite at each merging, in most
regimes, it is beneficial for the protocell to fuse with others (to
gather functions). Hence, while the issue of parasites cannot
be ignored, we address the issue at its heart by establishing
an analytical framework to quantify the trade-off between
abundance of parasites and performance gain provided by
merging.
The merging model developed here also has similarities to
well-known biological phenomena in modern viruses. The first,
Multiplicity Reactivation (MR) [10, 65], is captured by our
model. It is a process to generate an infectious particle by
combining multiple non-functional mutant viruses of the same
strain. In experiments, the viral particles would be subject to
intense radiation such that they accumulate too many dele-
terious mutations and would not be able to replicate in their
host. However, if several of these mutants were introduced
into the same host cell, the mutant particles would "cover"
each other's loss-of-function mutations, and ultimately result
in a functioning virus. Our calculations complement the early
models proposed by Luria and Baricelli [10, 65]. They can
also be used to calculate the expected multiplicity of infection
required for sexual repair in viruses, given any level of genetic
damage (or mutation). Second, in multi-compartment viruses
multiple distinct components need to co-infect the same host
in order to produce a new virion. In many plant viruses, such
as the genus 1Vrnovirus, the infection occurs when two or
more functionally distinct virions infect the same host [66, 67].
Similarly, some viral satellites and virophages need to co-infect
a host in the presence of their target organism in order to
reproduce [68]. These satellites are thought to transfer genetic
and functional material between their hosts. These processes
could serve as modern examples of similar mechanisms in early
life. The fact that this type of combinatorial reproduction is
present in many RNA viruses, which are thought to be ancient
[62], is consistent with the suggestion that such mechanisms
could have been present for a long time. If one assumes a
virus-early point of view [61, 69, 70], we can readily see how
this process could have contributed to the increase in complex-
ity of cellular life. There are in fact several suggestions that
RNA viruses with segmented genomes may be very ancient,
and in fact may have undergone some form of mating [61, 62].
Excitingly, recent experiments have invoked fusion success-
fully in vitro in order to produce "self-sustaining" protocells
for three generations [21]. Kurihara et al. used "conveyer
protocells" (which correspond to our samples) to restore the
chemical composition of their "giant vesicle" (accumulator),
and thereby produced a recursive mechanism by which pro-
tocells can grow and divide for multiple generations. Our
results indicate that not only can such an approach be used
to construct the basic accumulator from scratch, and further
provide it with metabolic nutrients, but also it can be used
to efficiently increase the genetic and functional information
content of a complex vesicle.
The insights we gain through this analysis could prove
useful ill progress towards synthetic genomes. A "minimal
bacterial cell" may require a few hundred genes in order to
self-sustain [71-73]. Current attempts at making such cells use
a reductionist approach, where non-essential genes are pruned
by trial and error to the point that all remaining genes are
required for a cell to grow in a stress-free environment at a
reasonable rate. Recall that in our model the ratio between
frequency of fatal outcomes 45 and proportion of components
p (in this case genes) that carry essential functions in a given
context is the key factor that determines the efficiency of the
process. If this ratio is small (8/p A. I) in this case, it means
Sinai ef .
PNAS 1 Agree, 2016 I vol. XXX I no. XX I S
EFTA01183816
that it is possible to construct genomes by random samples of
genes from pools of simple genosnes within a feasible number
of trials.
We hope that in light of our results, the role of protocell
fusion in pre-life and early life is revisited and further con-
sidered both by theoreticians and experimentalists. In this
study, we have shown that merging significantly improves the
plausibility of producing protocells with a high number of com-
ponents through a random process. Our results are applicable
to assemblies of molecules before and after the onset of evolu-
tion. We have also provided a clear quantitative model that
captures the effects of parasites (or other fatal causes) on the
efficiency of the merging process. Finally, we hope that these
results can be useful in analyzing viral mechanisms such as
multiplicity reactivation, reassortment, and their evolutionary
backgrounds.
Materials and Methods
Models were vet flied using Monte Carlo simulations written in
Mathematica and Python. Each simulated mean is generated from
2000 independent trials, and confidence intervals are calculated
using the t-Procedure. Simulation code can be provided by request
from the authors.
ACKNOWLEDGMENTS. We thank Krishnendu Chatterjee for
technical comments on the manuscript. We also thank Leslie Valiant
and Scott Linderman for helpful comments in the initial phases of
this project. We thank Robert Israel for pointing us to related liter-
ature. We thank Jeffrey M. Gerold, Carl Veller, Michael Nicholson,
Ben Adlam, Nicolas Freiman, and Tibor Antal for helpful discus-
sions. This research was conducted using the resources provided
by the Program for Evolutionary Dynamics at Harvard University.
PED is supported by the John Templeton Foundation and in part
by a grant from B Wu and Eric Larson.
I. SUMMary E. SONGS M. FerreMO (2005) EvOlutkinary poeenlial and feekileMentS ke *n-
and protocols n Preadt Chemistry. ISisnngen. pp 167-211.
2. SUMMary E. DOOM& I. (19071 Oroup election d easy replied:OM and the origin oe lire.
J04.anel d theoretics, NOV), 12814463-48B
3. barters G. 21.00 K. Chen (A. Nook MA(20131 Solecism for replicas:isin protocols. PLoS
compursvois(5i e100X61.
4. Markvoat AJ. Sinai S. Nowak MA '2014) Compiler simulations of mauler group SO:dian
reveal mechanism fa sustaining oxperaton. Journal ol theoretical busboy 357:123-133.
5. Hogewog P. Takeuchi N 003) lAnikontil 6010CIOn In m00(46 Cl prebOOC OvOlulion: coin
partnem and spatial solorgannalkon. Ctrs of [do and Evolunen of The Biosphere 33(4.
5):375-4CO.
8. IAMB N. menowF (20121 The Origin or menses otoeneeseocs. CoV151(7):1406-1416.
7. (iS PL. Wade P. Otorhober T 11999) Lied nestles as possible intermedmme in ono man
d Ire. Current (*Wen in MOWS IstlYttos science 4I1)'.33-39.
B. Soria Ben-Eli D. Deana? DW.Lancel D t2001)The Ipdettld. Orrisof [dead EMU=
deb Biosphere 31(I-2):119-14S
9. NINA C (20151 A WHAM step 100..4 MO 0 lan Of Me ROA* M INcherstal Sciences
40(9):487-498.
10. Lode SE. Dulbecce R 11969) OMMIC reCOMCM80009 WON TO PrOrluCtiOn 01 800.4 beat
Hoehn. from LAWS( inactivated bacterOphage mead*. Generia34(2):93
II. SagoQ Ben.Ek D. Lancet 1)(2803i Compose:anal gemmed: prebobc taxmetoneaefrm
rrutussr cusysenorcoysem assembles. ProceecexuamoraneoaPacsopmr novices
9718):4112-4117.
12. Fontanan JF. Santos M. Szathmary E (20061 Coexetence and error propagation in probialic
Wade MIME a SIMIP OONCItin SPOssach Mona et elleowleal bli*DY 239(2I 247-258.
It Crick Aka at (196$) The ongn of Ma recite code. Joumarcilmoiecularbaabgy39(3)367-
379.
14, Creel LE 09881 &dollen01 the sentoo ripow awe . Jewel Ol mato* blOr09y38(3)38I-
15. INC.O9 C (19671 The generic Cede.
18, Eigen M (1971) Sedarganizadm 01 Meer and the OvOlullOn d tidOgail Mea0MdeCul06
Nafamissenschahan 55(10)A65-623.
17. Nowa.* 12006) Esttliattlery (/nNente tHarvard UnfierMY Pre661.
IL Atwater J. Wachner A. Holger P12013) Mica coolulyan of ma polymeraso nbozyme acovrty.
Nature chennstry5(12):1011-10la
19. liman TA. Joyce OF (2009) 301.6tllained replICM01 01 an ma ewers
Science
323(5918)1 229-I 232.
20. Higgs P0. Lehman N (2014) The ma world MOIOCular cooptaikm at Me °lions owe
Wive Reviews Geneses
21. KUlhala K M S. (2015)A reCistne vOSIC10-0440(1M0(1016.0t00(41*(th a 901144.4 model Cell
cycle. Nature cornmuncastos 6.
22. Ph:GOICM TM. Meer 0. Simonet BR (19991 UOI0 'lanese under hydroneemd COndliOnil by
fischertropschtypa reactcna Ofirs ot Laaand Evoiboon of Moticeohare2912)153-166.
23. Yamamoto S. Maruyarna Y. Hyodo Sa (20021 Despalin pW1d0 dynamics study d span
lanfrOura veSide *MEMO d amsepasc MC10Culeit
The JOternd 01 dental pathos
I16113):510/2-5849.
24. DearDer DW (19861 Role 01 amPlfM110 Compounds in the evolution d membrane tauttuie
en me °any earth. Ofigies or Ut. rod EvOtbliOn Mme Saphen? 1711)'.3-25.
25. Mated F. Lust Pl. 11996) AliapalabC set.roproduong vesicles: a simples] isnotc model.
The Journal o f Physta lCrierterlry 103(41)1 6603-16607.
26. Wen 0.13blit N.Des Magni CM. Chang (1984)CarbOn Osage compositkon el lar molecular
weight hydrocarbonsand monocarbonlic apish= murtheon metconto.
27. Lanless JO. Wen GIJ (1979) OuarridaillOn d MOMairbOxyliC acids In me niobium OM-
tonaceol.c meteorite.
29. Deemer DV/ 11996) Boundary atlases are tamed by organic co/monads of ono murchson
CarbOnASOuti 011000110.
29. Russel IN al.(2014) The two to lb on as andicyworlds. Psychology 14141:309-343.
30. Halel8110J13341929)Tteorltan 0 Be. ROMAN Annual 148:3-10.
31. Operin Al. et al. 11957rfhe oritin Of 110 on V* earth The onoln Oa* On Int EWA (3r0 Ed).
32. Mansy SS cial.(201:01Ternplatodredeci symhesisol a gore polymer in a model protocell.
Mtun745417200):122-125.
33. btu TF. Stanek .1W (2009) Coupled growth and dvbion of model protocol membranes. Joum
nal of the Amencan Chemical Soo* 131(1515705-5713.
34. Ekdn I. Detnain A 520510 JW 12012) COMM*80M-Olven woe* °IMO& MOSOcell Men,.
banes. Journalof the Arnercan Chemical Society 134151120912-20919.
35. Adamala K. SZCI4104N 12013) Nearan2yMOC larmialetkecied ma °reheats moo, model
laotocens Scianee3e2(6162):10S6-1100.
36. Hanczyc MM. Fulliawa SPA. Szoszak AV 12003) ENtormontal models 01 FdrelaVO cellular
COIrObilMeaS: eowsolaltn. yore,. and dMslan. 36010•302(5645)1318-622.
37. Segni O. Lancet D. Kadin O. Pipet Y(19911, Graded amacalalysis roplcalion domain (paid':
kinetic anaheis of sad rephcation in mutual., cameo sets. Onpins oft* and Eldtgal OF
ale aiospnene 28(44)501-514.
39. Madmen A. Scrams A. Peters* K van Santo R. Fibers P (200/1 Lipid-based mocha.
Name fOr veljele liKJOn. The JOurnef d Pertoia Cheeesfry13 111(20)5719-5725.
39. SUMO B. ROCMIKJA.0044:41 M 12013) TO.WRIS a general theory Ol group 99100/01. &Ob.
ion 67(61:1561-1572.
40. tuween A. Seocem N. NOMA MA (2008) Asasoso MEOW la iniviOutl and crag seieclion
d any intensity. Bataan or Inachomatica boxy 70(5):1410-1424.
41. Traisen A. Now* MA (20051 Evolution of cooperation by imblevel a:exam Proceochnos
of me Nadenal Academy 0So:iron 103(29)10952-10955.
42. Bernston H. B)ttly HC. Hopi FA. Mithod RE (19641 00pn of sex. Journal of Pworctical
MAW 110/3)323-351.
43. Se/405kt Zemzems E. Statinary E(2003)Origin Of Sex feeltded. Or1pVs Of LW and EsOkr.
ton aline Biosphere 33(441%05-612.
44. Russel W. HMI A (1997) The °magenta Mlle from Iron M0009.4)440 Outdate at 8 ilutena.
Me hOrotherrnal redo( and ph tort Journal oldie Geefopcal Society 15443)277-402.
45. Elamite P el at 12007) Enron. accurn.Mm of nucleotides in simulated hydrothermal core
SWIMS. Procayangs Of MeNefienalACKIenly M Sciences I04(22)9346-9351.
46. FeNcis PA (2007) Steps toads tie famaton of a protect the possible rote of anon pop
SOK GPM of EXe endEaWtgn
8.01.phenfl 37(6):537-553.
47. Chan IA. Now* MA 12012) ROM mile to Me: Ivn chemical uneta become OvOLOMarY
*names. Accounts of chancel research 450 2i2088-2096.
48. Slace.Na el al (20131 Nuclecbatiee bra to anti etatiqe acriewee ol a pieticec anifmtaite.
prOVdtg a ruble mechanism Ica the emergence of prolccells. Proc000Ngs of MO Nassnai
Academy of Sciences 10113), 13272-13276-
49. Oars T (1975) Organ/UM Of ChOniCal reacsors m0 drelding and MdatOlihng Laid the
chemolons. acSystims 7(1)15-21.
50. Kaultman SA (1 9861 haccalatic seta Of txoteine Jennifer neenificaretcoopy I IS(I):1-24.
51. Rut.MIWO K. WYNN F 12008) On MO way towards
PACOOMWS spew. StOdIfiStiC
simulations of anal Ipd-poplkdo coals. Ehosysrems9H2):374-387.
52. Vaido N et a. (20121 SPOnlaneOuti Alamo& Annellan smolt' osseous ma ;comm.
Naive 49117422):72-77.
53. Walker Si. Davos PC (2013) The algorithmic origins of lib. Journal of the Royal Society
infttfaCe 10(79)20120869.
54. Valera LG (2009) EVrohlkday. J anal ol onoACM lJACAh56.(1):3.
55. Cnaltafee K. PlivlOgianns A. Adam B. Novak MA (20141 The PM Wale 01 OvOk.C.MMY
innOvI6On PLOSCOOKkal 8601019):01003818.
56. Pugh W (1990) Slop lets: a probabilslic Simeon to balanced trees. CommumeaDons of
tie ACA(33631668-676.
57. Kirschanoler P. Martinez C. Prodngor H (1995) Analysis of an waded search algorinm
for deplete Thooreocal Computer Sums 144(11:199.220.
58. Baum LE. BlIngsley P (19651 Aminipoors clairtulbne tor the coupon CONCIOr'S pretleM.
The Anna's of Mathemancal SL3kat'S pp. D526-1139.
59. Mentor $S. SZOSUA AV (2008) TherMOSUblity of model potato membranes. Ptoceeftos
of Me Ratios.% Academy olSelerxes 106436).13351-13355
60. Noms V. Rana 0 (199IN A lissianeusicn origin for Ile. Oars of Ltio and &Witco of the
Btepher.928144)523-537.
61. Kocan EV. Senkench TO. Dolp VV 12006) Tho anoint nu: world and enduton of cots.
avalrect 1(1119.
62. Nan A. CeetanO-An0901 0 (2015) A phybagenttnt Olta.01.0n eiplaradon or viral orig-ne
and nekton. Science area cos 1(9):e1500527.
63. gnat A. PapadMitiou C. Oushod J. Fddman MW (20081 A mixabilty theory tar tho role of
sex 01 *MAIM. FYOCOOMOOS Oft* Mahone, academy M Sciences 105(50)198,03-19808.
6 I wysv.pnas.ororcgssovto.toizzoas.xx)ocootxxx
Sinai el at
EFTA01183817
64. Nose MA. ON444,1 H('2006) Preven.46"ary dynamics and t0e oil!» ol so1.4kon. PrO:OPel.
trios of the NaaouVAr.l6amyol Samos
4924-14927.
65. Berti:elli NA 11960) An analyli(Al epc'oacb to Me pole« 01 Vega reCpnWylbn anti fe-
~axe
I. muitiscliy Coact/eon and the nature of radabon damage 14negy I
105.
66. Smati J1.1. et at 11909) Evoiøeouryget~ (0,clord Univerelly Pieta).
67. Rao A (2006) Gonna packagesg by spherical plant re wee Amu Rev. ~tot
4461-67.
66. la See B IN et (2006) 7ne vIreNte be a tees Wet. Of Ihe eel MIMIANS. Nee
46517209)300-104.
49. FonefteP12010)Delininalfe: tbe «we :iewpoini. 060<ris of LA* an t EvoN6'On of &wares
4012)151-160.
70. Palm) P (20031 The pl0in 01 viruses cut Met possible NIBS in mot er0lu00nby Irene
hens. Vna ~awn% 11711):5-16.
71. lAttesten AR. Øn EV 11996) A minima gene set 10' celluter Ille ileMed by cows-
'son of corrpte tetanal gonores. Prop:alms of /he Nanna' Acadarry of Seances
93119)1026S-102M
72. OAR. Siva FJ. Pere05.1 Maya A (2004) Decermaelbn 01100 COre 01 a minima bacterial gene
set. Morbokgr and Mabcolar Otology Renews 6901:511-537.
73. Huicnison CA et el. (2016) Design anti synthesis ol a minimal bacteral genome SCien0e
351(6260)aa06253
74. Elsoner.13 12001) On the oxectatos of en maxtmurn S Id geometric random vendee.
Sfehee 6 Pr009OØ Len 78(21135-143.
Sinai Nal.
PNAS I
April 4, 2016 I
vol. )0(X I na XX I 7
EFTA01183818
Technical Artifacts (17)
View in Artifacts BrowserEmail addresses, URLs, phone numbers, and other technical indicators extracted from this document.
Phone
12814463Phone
13341929Phone
1406-1416Phone
1410-1424Phone
1515705-5713Phone
1561-1572Phone
346-9351Phone
3631668Phone
526-1139Phone
5417200Phone
645)1318Phone
6517209Phone
719-5725Phone
8601019Phone
9117422Wire Ref
referenceWire Ref
transfer geneticRelated Documents (6)
DOJ Data Set 10CorrespondenceUnknown
EFTA Document EFTA01471588
0p
DOJ Data Set 10CorrespondenceUnknown
EFTA Document EFTA01473047
0p
DOJ Data Set 10OtherUnknown
EFTA01466564
2p
DOJ Data Set 10CorrespondenceUnknown
EFTA Document EFTA01713478
0p
DOJ Data Set 10OtherUnknown
EFTA01473853
3p
Dept. of JusticeOtherUnknown
EFTA Document EFTA01467642
Trade Type,Trade ID,DealGroupID,MTM,Ccy,Secondary MTM,Secondary CCY,Counterparty,Trade Date,Eff. Date,Settlement Date,Maturity Date,Delivery Date,Not.Amt 1,Not.Ccyl,Not.Amt 2,Not.Ccy2,Quantity,Ref. Entity,Long/ Short,Put/ Call,Strike Price,DBPays DBReceives,Next Reset,Spread At Maturity,Pmt Rate Ref.,Rate,Price Per Unit,BuySell,Pmt Ccy,Implied Volatility,Swapswire ID, Fair Price,Spot Price,Option Type,Option Style,Party,Delta,Product Type,Underlying Ticker,Unit,Vega,Gamma "FxEuroOpt","366
2p
Forum Discussions
This document was digitized, indexed, and cross-referenced with 1,400+ persons in the Epstein files. 100% free, ad-free, and independent.
Annotations powered by Hypothesis. Select any text on this page to annotate or highlight it.