DNA strand displacement architecture - Programming Chemical Kinetics: Engineering Dynamic React

following “inverse” question: given a finite set of formal chemical reaction equations (with specified rate constants) between formal speciesX1, X2, ..., Xn, can we design a set of “real” molecules M1, M2, ..., Mpthat interact in a well-mixed solution to approximate the mass-action kinetics specified by the formal system?

Of course, this question regarding a general strategy for implementing arbitrary formal CRNs is interesting only if CRNs are capable of exhibiting a wide range of dynamical behaviors. This is indeed the case: given a system of polynomial ODEs with nonnegative integer powers, one may explicitly construct a formal CRN some of whose species will approximate the solution to the system of ODEs on the positive orthant, up to arbitrary accuracy over any time interval [87,88].

In fact, the CRN constructed has particularly nice properties: (i) all reactions follow conservation of mass, (ii) have at most two reactants and two products, and (iii) no reactions are autocatalytic.

Soloveichik et al. [110] show that, given a formal CRN, it is indeed possible to engineer a molecular implementation that will, assuming certain “fuel” species are in large excess, approximate the mass action kinetics specified by the formal CRN (up to scaling rate constants). Indeed, they provide a construction which “compiles” any set of formal chemical reactions into a set of DNA strand displacement reactions, which approximate the prescribed dynamics up to arbitrary accuracy. The “DNA implementation” of the formal CRN has a larger set of interacting molecules, some of which represent the formal species of the formal CRN and approximate their dynamics, while the others are auxiliary species that mediate the desired reactions. Following this theoret- ical advance, other such CRN-to-DNA compilation schemes have been proposed [111,224–226].

We describe our DNA strand displacement architecture, adapted from Soloveichik et al. [110], in Section3.2.

b c

Concentration

Time

Desired dynamics

B+A−→2B C+B−→2C A+C−→2A

k k k

Molecular program DNA dynamics

Concentration

Time

Concentrations of free A, B, C

DNA architecture

signal species A B B B C

fuel species

GGTATAG

5’

AAATGGG GGTGGTTAGTTAGAGTTTACCC CCACCAATCAATCTC 3’

3’

5’

u b

t* b*

u b

t* b*

b t

b u*

t* b*

u b

⇔ ⇔

...

⇔

Figure 3.2: a. Overview of our CRN-to-DNA efforts. We start with a desired dynamical behavior (oscillation, in this case) and a CRN program that captures the desired dynamics. We then use the CRN-to-DNA scheme described in this chapter to translate the formal CRN into a DNA strand displacement implementation, where the formal species are represented by single strands of DNA called “signal” species. Desired reactions between signal species are mediated by “fuel” species which provide both logic and free-energy for the reaction. Some of the fuel species are mutli- stranded complexes which are pre-prepared and purified. In the regime where the fuel species are at high concentration, the signal species approximate the dynamics of the formal species in the original CRN. Our reactions are performed in “batch reactor” mode, which means that fuel species are not replenished. Therefore, the test tube dynamics is expected to deviate from idealized formal CRN dynamics. b. Domain notation. A “domain” comprises contiguously located bases whose binding and unbinding occurs as one logical unit. * indicates Watson-Crick complementarity.

Arrows indicate 3’ ends. c. Toehold exchange. “Short” (5-7 nucleotide) domains which bind fleetingly to their components at room temperature and reversibly co-localize distinct molecules are called “toeholds”. Here toeholdtreversibly co-localizes the molecules to form a three stranded intermediate, where the two bdomains can exchange base pairs by a process called three-way branch migration. Eventually, either toeholdudissociates (leading to the products) or toeholdt dissociates (leading to the reactants). Notice that the entire process is reversible and toeholducan also carry out toehold exchange.

a

Br

^h^Br ^f^B ^m^B ^s^B ^hÂp ^fÂ ^mÂ ^sÂ

Ap

f_X m_X s_X

Xi

h_Xi

f_Y m_Y s_Y

Yj

h_Yj

Bs

^h^Bs ^f^B ^m^B ^s^B

}

logical unit

}

versioning

Figure 3.3: The formal species are represented by single-stranded DNA molecules (signal strands).

Each signal strand comprises a history domain in black (versioning unit, e.g.hBr) and a logical unit. The logical unit comprises three domains: the first toehold (e.g. fB), a branch migration region (e.g.mB), and the second toehold (e.g.sB). Signal strands are designed to not interact with each other. Signal strands with the same logical unit (e.g. Br and Bs) represent the same formal species (B) and are designed to behave identically in solution.

behavior that CRNs are capable of (such as oscillations, chaos, etc.), rather than just the steady state end point, remains elusive.

We now describe our attempt to exploit a modified version of Soloveichik et al. [110]’s CRN-to- DNA scheme to engineer prescribed dynamical behaviors in chemical systems. Figure3.2provides a pictorial overview of our efforts.

Figure 3.3illustrates the single-stranded representation of formal species employed by our scheme. Each formal species (e.g. B) is represented by single strands that contain a history domain (in black, e.g.hBr) followed by 3 logical domains: a first toehold (e.g. fB), a branch migration domain (e.g.mB) and a second toehold (e.g.sB). Strands that have identical logical domains (e.g.

BrandBs) are designed to behave identically in solution, as they both represent formal species B, regardless of their history domain. The reason for this will become clear once the mechanism for implementing reactions is illustrated.

Strands representing formal species (“signal strands”) are designed to have orthogonal domains — they are not supposed to interact with each other directly. Desired reactions between signal strands are mediated by auxiliary species. Some of those auxiliary species are fuel species, which are present in large excess at the beginning of the reaction and perform the dual functions of both encoding the logical flow of the desired reactions and providing the required free energy to drive the intended reactions. This design principle ensures that (i) signal strands do not have any sequence inter-dependence and (ii) if a formal CRN, say CRN1, is extended to CRN2, then the DNA implementation of CRN1 may also be extended to a DNA implementation of CRN2 merely

by adding to the test tube fuel species necessary for the additional reactions in CRN2.

Figure3.4illustrates how a general bimolecular reaction of the formB + A→X + Ywould be implemented. Logically, the DNA implementation is performed in two steps. First, the “react” step consumes reactants B and A and releasesFluxABi — if and only if both reactants are present. If one or both of the reactants are absent, no irreversible reactions occur. Next,FluxABi

gets consumed in the “produce” step and releases both outputs X and Y. Therefore, taking both react and produce steps together, the reactants B and A have been consumed and the products X and Y have been released. For completeness, we also illustrate how the general unimolecular reaction (B→X), degradation reaction (B→φ), and production reaction (φ→X) are implemented (Figures3.5,3.6and3.7).

The naming scheme we use for the species involved in the reaction pathways in our CRN-to- DNA scheme is both precise and general. By this we mean that, given an arbitrary formal CRN, the naming scheme allows the user to write down the names and molecular specifications for all the species involved in the DNA strand displacement reactions needed to implement the given formal CRN. Moreover, given just the name, the associated molecule can be immediately constructed at the domain level. Essentially, our naming scheme is consistent with a compiler that could be used to generate the DNA implementation for any specified formal CRN.

+ +

fB mB sB hBr

fA mA sA hAp

fX mX sX hXi

fY mY sY hYj

React step: Br + Ap Flux_ABi

Waste_BrAp

sB mB

s*_B

sA mA f*_A m*_A s*_A m*_B

f*_B fB

hBr fA

hAp

FluxABi sA mA

hXi

React_BAXi

sB mB fA

s*_B

sA mA f*_A m*_A s*_A m*_B

f*_B

hXi fB mB sB

hBr

BackBA sB

mB fA mB sB

s*_B

sA mA f*_A m*_A s*_A m*_B

f*_B fB hBr

hXi

fA mA sA hAp

Ap +

Produce step: Flux_ABi Xi + Yj

fX mX sX hXi

Produce_AXiYj

s*A

fY mY

f*Y hXi

hXi* hYj sX mX fX f*X hYj*

s*_A

fY mY

sY mA sA hXi

hXi* f*_Y

hYj hYj* f*_X

HelperXYj hYj

fX fY

+ +

fY mY sY hYj

s*_A mA sA

WasteAXiYj hXi hXi*

hYj hYj*

fY f*_Y fX f*_X

Figure 3.4: a. CRN-to-DNA scheme illustrated with the general bimolecular reactionB + A → X + Y. Note that the reactants (Br, Ap) and the products (Xi, Yj) have completely independent sequences. The same mechanism can occur with different versions of the formal species B and A. b. Names of fuel species are enclosed in a dashed box. The reaction is implemented in two steps: React and Produce. The React step is mediated by theReactBAXiandBackBAfuel species. B reacts withReactBAXi to reversibly displaceBackBAby toehold exchange and produces an intermediate species. Note that this process exposes the previously sequestered toeholdf_A^∗. In case Ap is present, it can react with the intermediate by strand displacement using the toeholdf_Ato irreversibly displaceFlux_ABiand produceWaste_BrAp.Waste_BrAphas no free toeholds and is therefore inert.Flux_ABireleases the outputs through the produce step, which is mediated by fuel molecules Produce_AXiYjandHelper_XYj. Flux_ABireacts reversibly withProduce_AXiYjto release the first output Xi and an intermediate species.Helper_XYjreacts irreversibly using the newly exposed toehold f_X^∗ by strand displacement to release the second output Yj andWasteAXiYj.

a

b

Br

f_B m_B s_B h_Br

Xi

f_X m_X s_X h_Xi

React step: Br Flux

_BXi

Waste_Br

React_BXi

s_B m_B

s*_B m*_B f*_B

h_Xi fB mB sB

hBr

Br

sB mB

s*_B m*_B f*_B fB hBr

Produce step: Flux

_BXi

Xi

Produce_BXi

s*_B hXi hXi*

sX m_X fX f*_X fX mX sX

hXi

Xi

Waste_BXi

s*_B mB sB hXi

h_Xi* fX f*_X

sB mB

hXi

Flux_BXi

f_X fX

Figure 3.5: Implementation for the reactionB→X. The same mechanism can occur with different versions of B. Names of fuel species are enclosed in a dashed box.

a

b

Br

fB mB sB

hBr

React step: Br

Waste

_Br

React

_B_ø

s*_B m*_B f*_B f_B m_B s_B

h_Br

Br

s*_B m*_B f*_B fB

hBr m_B s_B

Figure 3.6: Molecular implementation forB→φ. The same mechanism can occur with different versions of B. Names of fuel species are enclosed in a dashed box.

a

b React step: GX Flux

_GXi

Waste_GX

React_GXi

s_GX m_GX

s*_GX m*_GX f*_GX

h_Xi fGX mGX sGX

sGX mGX

s*_GX m*_GX f*_GX fGX

Produce step: Flux

_GXi

Xi

Produce_GXi

s*_GX hXi hXi*

sX mX f_X f*_X fX mX sX

hXi

Xi

Waste_GXi

s*_GX mGX sGX hXi

h_Xi* fX f*_X

sGX mGX

hXi

Flux_GXi

f_X fX

fX mX sX hXi

Xi

GX

Figure 3.7: Molecular implementation forφ→X. Names of fuel species are enclosed in a dashed box.

Dalam dokumen Programming Chemical Kinetics: Engineering Dynamic Reaction Networks with DNA Strand Displacement (Halaman 74-82)