Team:Freiburg/Project/modeling

From 2013.igem.org

(Difference between revisions)
 
(113 intermediate revisions not shown)
Line 7: Line 7:
<style>
<style>
-
 
+
#imgtxt{
-
td {
+
border:none
-
border:none;
+
-
color:white;
+
}
}
#bild {
#bild {
width:100%;
width:100%;
 +
}
 +
 +
#bild1{
 +
width:80%;
}
}
Line 25: Line 27:
   }
   }
-
.gelpic {
+
 
-
background-color: transparent;
+
-
border:none;
+
-
}
+
#lhs{
#lhs{
Line 37: Line 36:
}
}
-
th{
 
-
color:white;
 
-
border:none;
 
-
border:bottom;
 
-
}
 
-
 
-
td{
 
-
padding-right:50px;
 
-
}
 
</style>
</style>
Line 68: Line 58:
</p>
</p>
-
<p class="first_order"><a href="https://2013.igem.org/Team:Freiburg/Project/1"> Introduction </a></p>
+
<p class="first_order"><a href="https://2013.igem.org/Team:Freiburg/Project/1"> Abstract & Intro </a></p>
-
<p class="first_order"><a href="https://2013.igem.org/Team:Freiburg/Project/effector" class="active"> Effector </a></p>
+
<p class="first_order"><a href="https://2013.igem.org/Team:Freiburg/Project/crrna"> Targeting </a></p>
 +
<p class="first_order"><a href="https://2013.igem.org/Team:Freiburg/Project/effector"> Effectors </a></p>
 +
<p class="first_order"><a href="https://2013.igem.org/Team:Freiburg/Project/induction"> Effector Control </a> </p>
 +
<p class="first_order"><a href="https://2013.igem.org/Team:Freiburg/Project/modeling" class="active"> Modeling </a></p>
-
<p class="second_order"> <a href="#activation">Activation </a> </p>
+
<p class="second_order"> <a href="#Networks"> Networks </a> </p>
-
<p class="second_order"> <a href="#repression">Repression </a> </p>
+
<p class="second_order"> <a href="#Parameters"> Parameters </a> </p>
-
<p class="second_order"> <a href="#epigenetics">Epigenetics </a> </p>
+
<p class="second_order"> <a href="#Data"> Data Generation </a> </p>
 +
<p class="second_order"> <a href="#Fitting"> Fitting & Results </a> </p>
-
<p class="first_order"><a href="https://2013.igem.org/Team:Freiburg/Project/induction"> Induction </a> </p>
+
<p class="first_order"><a href="https://2013.igem.org/Team:Freiburg/Project/truncation"> Truncation </a></p>
-
<p class="first_order"><a href="https://2013.igem.org/Team:Freiburg/Project/crrna"> Targeting </a></p>
+
<p class="first_order"><a href="https://2013.igem.org/Team:Freiburg/Project/method"> uniBAss </a></p>
<p class="first_order"><a href="https://2013.igem.org/Team:Freiburg/Project/method"> uniBAss </a></p>
-
<p class="first_order"><a href="https://2013.igem.org/Team:Freiburg/Project/toolkit"> Toolkit </a></p>
+
<p class="first_order"><a href="https://2013.igem.org/Team:Freiburg/Project/unibox"> uniBOX </a></p>
-
<p class="first_order"><a href="https://2013.igem.org/Team:Freiburg/Project/modeling"> Modeling </a></p>
+
<p class="first_order"><a href="https://2013.igem.org/Team:Freiburg/Project/toolkit"> Manual </a></p>
 +
<p class="first_order"><a href="https://2013.igem.org/Team:Freiburg/Project/application" > Application </a></p>
 +
 
</div>
</div>
Line 86: Line 81:
<div id="main_contant">
<div id="main_contant">
-
<p id="h1"> Modeling uniCas
+
<p id="h1"> Modeling our dCAS</p>
-
 
+
 +
<div id="introduction">
<div id="introduction">
<p id="h3"> Introduction </p>
<p id="h3"> Introduction </p>
<div align="justify">
<div align="justify">
-
  <p> We used a thermodynamic approach to model and characterize our system. It is based on various ordinary differential equations that describe the behaviour of our network. Due to the limited measurment possibilities and the unwritten law, that you should at least measure half of the number of components of your network we started by using a small network with a limited amount of different components.<br>
+
  <p> We used a kinetc approach to model and characterize our system. It is based on various ordinary differential equations (ODE) which describe the behaviour of our network. Due to the limited measurement possibilities and the unwritten law, that you should at least measure half of the number of components of your network we started by using a small network with a limited amount of different components.<br>
-
 
+
-
The Components are: the protein of Cas9, the target protein (SEAP), and two RNAs (tracrRNA and crRNA).<br>
+
-
The two RNAs (tracrRNA and crRNA) are modeled as one complex, therefore there is no possibility to distinguish between them. There might be some differences in their expressions, especially because of the different promoters (crRNA expressed under U6-promoter; tracrRNA expressed under h1-promoter), however the model won't show them.
+
</p>
</p>
</div>
</div>
   
   
 +
<div id="Networks">
 +
<p id="h2"> The Networks </p>
 +
</div>
-
<p id="h3"> The Network </p>
+
<p id="h3" > 1. dCAS-VP16 </p>
-
 
+
<p> Our network includes four different components dCas9-VP16, a RNA complex (tracr/cr RNA), a RNA-dCas9-VP16 complex and the secreted alkaline phosphatase (SEAP). The RNA-complex is bound to dCas9-VP16 and the resulting complex binds the DNA, which leads to the production of SEAP. </p>
-
<p id="h4" > uniCAS-VP16 </p>
+
-
<p> Our network includes four different components uniCas-VP16, a RNA complex (cr/tracr RNA), a RNA-uniCas-VP16 complex and the Secreted alkaline phosphatase (SEAP). UniCas-VP16 binds the RNA-complex and the whole complex binds the DNA, which leads to the production of SEAP. </p>
+
<div>
<div>
-
<table class="imgtxt">
+
<table id="Fig1" class="imgtxt">
<tbody><tr>  
<tbody><tr>  
-
<td> <img id="bild" src="https://static.igem.org/mediawiki/igem.org/b/bc/Freiburg2013_AktivierungGraphik_fertig.png"> </td>
+
<td> <img id="bild1" src="https://static.igem.org/mediawiki/2013/b/bc/Freiburg2013_AktivierungGraphik_fertig.png"> </td>
</tr>
</tr>
<tr>
<tr>
-
<td> <b>Fig. 1: Transcriptional Activation via uniCAS-VP16:</b><br> The uniCAS-VP16 fusion protein is guided to the desired DNA  
+
<td> <b >Figure 1: Transcriptional Activation via dCas9-VP16:</b><br> The dCas9-VP16 fusion protein is guided to the desired DNA  
-
sequence by a co-expressed crRNA, which binds a trRNA. The binding leads to an expression of SEAP. </td>
+
sequence by a co-expressed crRNA and tracrRNA. The binding of the gene recognition complex leads to an expression of SEAP. </td>
</tr>
</tr>
</tbody></table>
</tbody></table>
Line 121: Line 115:
-
<p id="h3"> Setting up the ODE </p>
+
<p id="h4"> Setting up the ODE </p>
  <p> According to the graphical reaction network the ODE can be set up. </p>
  <p> According to the graphical reaction network the ODE can be set up. </p>
-
<p> Cas9 is constitutively expressed by the CBh promoter and degraded proportional to the current concentration. It is used to build the DNA recognition complex and produced during complex decay. </p>
+
<p> dCas9 is constitutively expressed by the CAG promoter and degraded proportionately to the current concentration. It is used to build the DNA recognition complex and is produced during complex decay. </p>
-
<p id="formel" > d[Cas]/dt = k<sub>1</sub>
 
-
- k<sub>2</sub> &lowast; [Cas] - k<sub>5</sub> &lowast; [Cas] &lowast; [tracr,crRNA] + [tracr,crRNA,Cas]</p>
 
-
<p>The RNA-complex is build linearly. The production constant can be seen as production constant of the lower expressed RNA, because this expression regulates the complex building. It is assumed that the RNA is degraded after DNA recognition complex decay and therefore the complex decay does not lead to more RNA. </p>
+
<p id="formel" > <img src="https://static.igem.org/mediawiki/2013/a/a0/Freiburg2013_Modelling_Act1.png"> </p>
-
<p id="formel"> d[tracr,crRNA]/dt = k<sub>3</sub> - k<sub>4</sub> &lowast; [tracr,crRNA] - k<sub>2</sub> &lowast; [tracr,crRNA] &lowast; [Cas]</p>
+
<p>The RNA-complex is built linearly. The production constant can be regarded as equivalent to the production constant of the lower expressed RNA (either crRNA or tracrRNA), since their expression limits the complex building. It is assumed that the RNA is degraded after DNA recognition complex decay and therefore the complex decay does not lead to an increase in the amount of RNA. </p>
-
<p>The complex is build, when Cas9 and RNA meets and degraded proportional to the current DNA recognition complex-concentration.</p>
+
<p id="formel" > <img src="https://static.igem.org/mediawiki/2013/e/e4/Freiburg2013_Modeling_Act2.png"> </p>
-
<p id="formel">d[tracr,crRNA,Cas]/dt = k<sub>5</sub> &lowast; [Cas] &lowast; [tracr,crRNA] - k<sub>6</sub> &lowast; [tracr,crRNA,Cas]</p>
+
<p>The DNA recognition complex is built when dCas9 and RNA meet and is degraded proportionately to the current DNA recognition complex concentration.</p>
-
                       
 
-
<p>There is a leaky SEAP production and one that depends on the current concentration of the Cas9/RNA Complex. This dependency is assumed to
+
<p id="formel" > <img src="https://static.igem.org/mediawiki/2013/0/0d/Freiburg2013_Modeling_Act3.png"> </p>
-
follow the Michaelis-Menten-Rules. Because of the long half time (T<sub>2</sub> > 500 h) of SEAP we can neglect the SEAP decay.(Müller et al., 2013 )</p>
+
<p>There is a leaky SEAP production and one that depends on the current concentration of the dCas9/RNA complex. This dependency is assumed to follow the Monod-kinetic. Because of the long half-life period (T<sub>2</sub> > 500 h) of SEAP we can neglect the SEAP decay<span id="refer"> <a href="#(2)"> [2, 3]</a></span>.</p>
-
<p id="formel">d[SEAP]/dt = k<sub>7</sub> + k<sub>8</sub >&lowast; ([tracr,crRNA,Cas]/(k<sub>9</sub>+[tracr,crRNA,Cas])))</p>
 
 +
<p id="formel" > <img src="https://static.igem.org/mediawiki/2013/9/99/Freiburg2013_Modeling_Act4.png"> </p>
-
<table class="gelpic">
 
-
<th>The parameters are: </th>
+
<table style="border:none">
-
<tbody><tr>  
+
 
-
<td> k<sub>1</sub>: linear production rate of Cas9 <br>       
+
<th style="background-color:transparent; border:none">The parameters are: </th>
 +
<tbody style="border:none"><tr>  
 +
<td style="border:none"> k<sub>1</sub>: linear production rate of Cas9 <br>       
k<sub>2</sub>: Cas9 degradation rate        <br>         
k<sub>2</sub>: Cas9 degradation rate        <br>         
-
k<sub>3</sub>: cr/trRNA production rate    <br>         
+
k<sub>3</sub>: tracr/crRNA production rate    <br>         
-
k<sub>4</sub>: cr/trRNA degradation rate    <br>       
+
k<sub>4</sub>: tracr/crRNA degradation rate    <br>       
-
k<sub>5</sub>: cr/trRNA /Cas9 complex building rate <br></td>
+
k<sub>5</sub>: gene recognition complex building rate <br></td>
-
<td>  k<sub>6</sub>: cr/trRNA /Cas9 degradation rate <br>
+
<td style="border:none">  k<sub>6</sub>: cr/trRNA /Cas9 degradation rate <br>
k<sub>7</sub>: SEAPs leaky production rate<br>
k<sub>7</sub>: SEAPs leaky production rate<br>
k<sub>8</sub>: Complex dependent SEAP production rate<br>
k<sub>8</sub>: Complex dependent SEAP production rate<br>
-
k<sub>9</sub>: Michaelis-Menten constant <br></td>
+
k<sub>9</sub>: Cas9-VP16 specific constant<br> </td>
</tr>
</tr>
</tbody></table>
</tbody></table>
 +
<p id="h3" > 2. dCAS-KRAB </p>
 +
<p> The dCas-KRAB sytem was modeled as a three component system. tetR as activator of gene production, dCas-KRAB as repressor and SEAP as final reporter protein. </p>
 +
<div>
 +
<table id="Fig2" class="imgtxt">
 +
<tbody><tr>
 +
<td> <img id="bild1" src="https://static.igem.org/mediawiki/2013/f/fd/Freiburg2013_Repression_Modeling.png"> </td>
 +
</tr>
 +
<tr>
 +
<td> <b  > Figure 2: Transcriptional respression via dCAS-KRAB:</b><br> The dCAS-KRAB fusion protein binds to the desired target sequence at a different locus than the tetR. tetR in turn binds to tetO and is assumed to repress SEAP production. </td>
 +
</tr>
 +
</tbody></table>
 +
</div> 
 +
<p id="h4"> Setting up the ODE </p>
-
<p id="h3"> Finding the parameters </p>
+
<p> According to the graphical reaction network the ODE can be set up. </p>
-
<p>Up to this point, it is only a theoretical description, which fits to every four component system connected in the given way. The reason is, that the parameter
+
<p> dCas9 is constitutively expressed by the CAG promoter and degraded proportionately to the current concentration, similar to the activation model. It binds to the DNA and is released afterwards. Since the amount of transfected dCas9-plasmid is four times higher than that of TetR-VP16 the basal expression rate is equally assumed to be four times higher than that of TetR. </p>
-
values are not yet defined and therefore there is a n-dimensional hypothesis space which contains the hypothesis which fits to the Cas9-system (n is the number of
+
<p id="formel" > <img src="https://static.igem.org/mediawiki/2013/4/43/Freiburg2013_Modeling_Rep1.png"> </p>
-
parameters). At least you assume this. Finding this hypothesis means finding the right parameters.</p>
+
<p>tetR is assumed to have a similiar kinetic as dCas9. </p>
-
<p>To find these parameters we use a method called  maximum likelihood. The maximum likelihood hypothesis is the hypothesis which has the highest probability to
+
<p id="formel" > <img src="https://static.igem.org/mediawiki/2013/e/e1/Freiburg2013_Modeling_Rep2.png"> </p>
 +
                       
-
generate the measured data. It is shown (Müller et al., 2013 ), that using the maximum likelihood approach and assuming gaussion noise in the data leads to a  
+
<p>There is a leaky SEAP production and one that depends on the current concentration of dCas9 and tetR. This dependency is assumed to follow an allosteric inhibition process. Similar to the activation process the SEAP decay can be neglected  because of the long half-life period (T<sub>2</sub> > 500 h) of SEAP <span id="refer"> <a href="#(2)"> [2, 3]</a></span>.</p>
-
least-square error minimization problem. </p>
+
<p id="formel" > <img src="https://static.igem.org/mediawiki/2013/3/39/Freiburg2013_Modeling_Rep3.png"> </p>
-
<p>A minimization problem is an optimization problem. You search for parameters (p0) for which holds, that the value of the function (f) at the point of the
+
<table style="border:none">
-
parameters is smaller than all other values. (f(p0)<=f(p)). In three dimensions the function can be thought as a landscape and minimization is finding the deepest
+
<th style="background-color:transparent; border:none" >The parameters are: </th>
 +
<tbody style="border:none"><tr>
 +
<td style="border:none"> k<sub>1</sub>: linear production rate of Cas9 <br>     
 +
k<sub>2</sub>: Cas9 degradation rate        <br>       
 +
k<sub>3</sub>: tetR production rate    <br>       
 +
k<sub>4</sub>: tetR degradation rate    <br>     
 +
</td>
 +
 +
<td style="border:none">
 +
k<sub>5</sub>: SEAPs leaky production rate <br>
 +
k<sub>6</sub>: Cas9/tetR specific production rate<br>
 +
k<sub>7</sub>: Cas9-KRAB specific constant<br>
 +
k<sub>8</sub>: tetR specific constant <br>
 +
</td>
 +
</tr>
 +
</tbody></table>
-
valley. Depending on the method you use different problems arise. The most common problem is finding only a local minimum and not the global one. </p>
+
<div id="Parameters">
 +
<p id="h2"> Determining the parameters </p>
 +
</div>
 +
 
 +
<p>By setting up the ODE a (n+1)-dimensional hypothesis space (n is the number of
 +
 
 +
parameters) is generated. Determining the right parameter combination means finding a point in the space which fits the data best.<br>
 +
To find these parameters we used the maximum likelihood approach. The maximum likelihood hypothesis is the procedure which has the highest probability to
 +
 
 +
generate the measured data. It was shown, using the maximum likelihood approach and assuming gaussion noise to the data (an assumption that holds in our case) leads to a least-square error minimization problem <span id="refer"><a href="#(3)">[3]</a></span>.</p>
 +
 
 +
<p>A minimization problem is an optimization problem. One searches for parameters (p0) for which the assumption holds, that the value of the function (f) at the point of the
 +
 
 +
parameters is smaller than all other values (f(p0)<=f(p)). In three dimensions the function can be regarded as a landscape and minimization as finding the deepest
 +
 
 +
valley. Depending on the method one uses different problems arise. The most common problem is finding an only local minimum instead of the global one <span id="refer"> <a href="#Fig3">(Fig. 3)</a></span>. </p>
<div>
<div>
-
<table class="gelpic">
+
<table id="Fig3" class="imgtxt">
<tbody><tr>  
<tbody><tr>  
<td> <img  id="bild" style="width:400px text-align:center" src="https://static.igem.org/mediawiki/2013/2/25/Freiburg2013_3D-example.png"> </td>
<td> <img  id="bild" style="width:400px text-align:center" src="https://static.igem.org/mediawiki/2013/2/25/Freiburg2013_3D-example.png"> </td>
</tr>
</tr>
<tr>
<tr>
-
<td> <b>Fig. 2: Example of a minimization problem.</b><br> Shown is a 3D landscape. Depending on the start position (the initial parameters), the found minimum is either a local or the global one.</td>
+
<td> <b >Figure 3: Example of a minimization problem</b><br>A 3D landscape is shown. Depending on the start position (the initial parameters), the minimum which is found can either be a local minimum or the global one.</td>
</tr>
</tr>
</tbody></table>
</tbody></table>
Line 203: Line 239:
-
<p> To avoid this and  
+
<p> To avoid this issue and to make sure a global minimum was found we started our minimization procedure using different start values for our parameters. To sample these parameters we used the latin hypercube sampling on a logarithmic scale <span id="refer"> <a href="#Fig4">(Fig. 4)</a></span>. </p>
-
 
+
-
to be sure to have found a global minimum we started our minimization procedure using different start values for our parameters. To sample these parameters we used  
+
-
 
+
-
the latin hypercube sampling on a logarithmic scale. </p>
+
<div>
<div>
-
<table class="gelpic">
+
<table id="Fig4" class="imgtxt">
<tbody><tr>  
<tbody><tr>  
<td> <img id="bild" src="https://static.igem.org/mediawiki/2013/f/fa/Freiburg2013_Lhs_design_erklaerung.png"> </td>
<td> <img id="bild" src="https://static.igem.org/mediawiki/2013/f/fa/Freiburg2013_Lhs_design_erklaerung.png"> </td>
-
<td> <b>Fig. 3: Illustration of the latin hypercube sampling in a two dimensional parameter space.</b><br> The number of initial parameter vectors is 5. Therefore the parameter space is divided in 25 subspaces. Shown is one possible parameter combination.</td>
+
<td> <b >Figure 4: Illustration of the latin hypercube sampling in a two dimensional parameter space.</b><br> The number of initial parameter vectors is 5. Therefore the parameter space is divided in 25 subspaces. One possible parameter combination is shown here.</td>
</tr>
</tr>
</tbody></table>
</tbody></table>
</div>   
</div>   
-
<p>N is set as the number of different initial parameter settings and the parameter range is divided by N. For the initial parameter the values are chosen so that there is only one parameter in each row and column. </p>
+
<p>The number of different initial parameter settings is set to N and thus the parameter space is divided in N*N subspaces. For the initial parameter the values are chosen in a way that there is only one parameter in each row and column. </p>
-
<p>The resulting errors we plotted in an increasing order to be sure to have found a global minimum.</p>
+
<p>The resulting errors were plotted in an increasing order to be sure the global minimum could be found.</p>
 +
<div id="Data">
 +
<p id="h2"> Data generation </p>
 +
</div>
 +
 +
<p> Fusion proteins with effectors (dCas9-VP16, dCas9-KRAB) were quantified, as well as SEAP reporter protein (secreted alkaline phosphatase).</p>
 +
 +
<p id="h3"> 1. dCas9-Effector Quantification</p>
 +
 +
<p> dCas9-VP16 and dCas9-KRAB were quantified by using <a id="link" href="https://2013.igem.org/Team:Freiburg/protocols#Western Blot">Western Blots</a>.</p>
 +
 +
<div>
 +
<table id="Fig5" class="imgtxt">
 +
<tbody><tr>
 +
<td> <img id="bild" style="width:400px" src="https://static.igem.org/mediawiki/2013/e/e6/Freiburg_modellingBlots.png"> </td>
 +
</tr>
 +
<tr>
 +
<td> <b  >Figure 5:</b> Western Blot quantification of protein concentrations. </td>
 +
</tr>
 +
</tbody></table>
 +
</div> 
 +
 +
<p id="h3"> 2. SEAP Reporter Quantification</p>
 +
 +
<p> SEAP reporter protein was quantified by using a <a id="link" href="https://2013.igem.org/Team:Freiburg/protocols#SEAP%20Assay">SEAP assay</a>.</p>
-
<p id="h3"> Data generation </p>
 
-
<p> Cas is quantified by using Western blot and we used SEAP as target protein that can be quantified by a SEAP assay.
 
-
For more detailed information refer our <a id="link" href="https://2013.igem.org/Team:Freiburg/Notebook/modeling">modeling notebook</a>. </p>
 
-
<p id="h3"> Results </p>
 
<div>
<div>
-
<table class="gelpic">
+
<table id="Fig6" class="imgtxt">
<tbody><tr>  
<tbody><tr>  
-
<td> <img id="bild" src=https://static.igem.org/mediawiki/2013/1/1b/Freiburg2013_Akt500_fehler1_1.png> </td>
+
<td> <img id="bild" style="width:700px" src="https://static.igem.org/mediawiki/2013/b/b1/Freiburg2013_modelling_seap.png"> </td>
</tr>
</tr>
<tr>
<tr>
-
<td> <b>Fig. 4: Different error values plotted in increasing order.</b><br> </td>
+
<td> <b >Figure 6:</b> SEAP reporter protein quantification by using the SEAP assay. </td>
</tr>
</tr>
</tbody></table>
</tbody></table>
</div>   
</div>   
-
<p id="h3"> The Code Files </p>
+
 
 +
 
 +
 
 +
 
 +
<p>For more detailed information refer to our <a id="link" href="https://2013.igem.org/Team:Freiburg/Notebook/modeling">modeling notebook</a>. </p>
 +
 
 +
<div id="Fitting">
 +
<p id="h2"> Fitting Procedure and Results </p>
 +
</div>
 +
<p id="h3"> 1. dCas9-VP16 </p>
 +
 
 +
<p>Assuming the given ODE and using the fminsearch-function of matlab for various initial parameter vectors the fitting process resulted in one optimal parameter composition. Our  measurement time started with a transfection. The first data were obtained after 6 h. As the initial concentrations were unknown all 14 parameters (including the initial concentrations) were assumed to obtain changing values during the fit. This procedure results in unlikely high initial concentrations. Although the transfection end point could not clearly be defined only a small amount of the components should be present within the cells after transfection . A second process in which the initial concentrations were assumed to be zero followed the first one. The change between fixed and variable parameters was easy to perform, because of an additional vector (qfit). This vector contained different boolean values depending on whether the parameter was fixed (false) or flexible (true) during the fitting process. Moreover another parameter was required for adjusting the absolute values of the duplicates. The results of this second process are shown in <span id="refer"> <a href="#Tab1">table 1</a></span>. </p>
 +
<p>
 +
The model reflects the general construction of the network <span id="refer"> <a href="#Fig5">(Fig. 5)</a></span>.  As assumed dCas9 converges asymptotically to a steady state and an exponential increase in the SEAP concentration can be observed. Both processes are reflected in the data.
 +
The model also shows a potential behavior of the non measured components, the free tracr/crRNA-complex and the gene recognition complex. The tracr/crRNA-complex reaches its steady state quickly.  However there is no possibility to distinguish between the two different RNAs. There might be some differences in their expressions, especially because of the different promoters (crRNA expressed under U6-promoter; tracrRNA expressed under h1-promoter), however, the model does not show them. <br> The free gene recognition complex has not reached its steady state yet due to the small degradation rate.
 +
 
 +
</p>
 +
 
 +
 
 +
<div>
 +
<table id="Fig7" class="imgtxt">
 +
<tbody><tr>
 +
<td> <img id="bild" src="https://static.igem.org/mediawiki/2013/f/fc/Freiburg2013-Act500_fertig.png"> </td>
 +
</tr>
 +
<tr>
 +
<td> <b  >Figure 7: Modeling Result:</b><br>  This figure shows the comparison of the experimental results (purple square) to the model prediction values (cyan cross) for SEAP and dCas9, as well as the model prediction for non measured components. </td>
 +
</tr>
 +
</tbody></table>
 +
</div> 
 +
 
 +
<p> Since the fminsearch algorithm is not proved to converge to a minimum <span id="refer"> <a href="#(1)"> [1]</a></span>, different points within the parameter space were chosen to ensure a high probability for finding the global minimum <span id="refer"> <a href="#Fig8"> (Fig. 8)</a></span>.</p>
 +
 
 +
<div>
 +
<table id="Fig8" class="imgtxt">
 +
<tbody><tr>
 +
<td> <img id="bild" src="https://static.igem.org/mediawiki/2013/1/1b/Freiburg2013_Akt500_fehler1_1.png"> </td>
 +
</tr>
 +
<tr>
 +
<td> <b  >Figure 8: Different error values plotted in increasing order.</b><br> </td>
 +
</tr>
 +
</tbody></table>
 +
</div> 
 +
 
 +
<table  id="Tab1">
 +
<thead> <b>Table 1: Overview of the parameter results of the dCas9-VP16 fitting process </b> </thead>
 +
<tbody>
 +
<tr> <th> parameter </th><th> value </th> </tr>
 +
<tr> <td> linear production rate of dCas9 </td><td> 8.78E+02 [M/h] </td> </tr>
 +
<tr> <td> dCas9 degradation rate </td><td> 0.54 [1/h] </td> </tr>
 +
<tr> <td> tracr/crRNA production rate </td><td> 0.73 </td> </tr>
 +
<tr> <td> tracr/crRNA degradation rate </td><td> 1.21E+08 </td> </tr>
 +
<tr> <td> Gene recognition complex building rate </td><td> 1.61 [1/M] </td> </tr>
 +
<tr> <td> cr/trRNA /dCas9 degradation rate </td><td> 8.59E-26 </td> </tr>
 +
<tr> <td> SEAPs leaky production rate </td><td> 4.05E-05 [M/h] </td> </tr>
 +
<tr> <td> Complex dependent SEAP production rate </td><td> 1.86E+08 [M/h] </td> </tr>
 +
<tr> <td> Cas9-VP16 specific constant </td><td> 1.06 </td> </tr>
 +
<tr> <td> initial concentration: dCas9 </td><td> 0.00 [M] </td> </tr>
 +
<tr> <td> initial concentration: tracr/crRNA -Komplex </td><td> 0.00 </td> </tr>
 +
<tr> <td>initial concentration: gene recognition complex </td><td> 0.00 </td> </tr>
 +
<tr> <td>initial concentration: SEAP </td><td> 2.73E-09 [M] </td> </tr>
 +
<tr> <td> least square error </td><td> 159.3423 </td> </tr>
 +
</tbody>
 +
</table>
 +
 
 +
<p id="h4"> Using the Model </p>
 +
 
 +
<p>
 +
The parameters of our model were identified by using MATLAB and a least square error minimization procedure. Afterwards the model, written in SBML by using COPASI, was used to test how the amounts of reporter protein can be increased by changing other parameters. As assumed the reporter protein level depends on the concentration of RNA and dCas9-VP16. If one paramter is increased, reporter level rises.
 +
</p>
 +
<p>
 +
We decided to try to increase the dCas9-VP16 expression rate, because there are no possibilites to increase the RNA amount. The promoters coding for the RNAs are classified as strong and it's not possible to shorten the sequences. dCas9 in contrast is a large protein - over 160 kDa - and has various functions. Basically we need only the helicase and DNA binding domain. Therefore it should be possible to remove other domains by truncating the protein and so  reducing the protein's size. When the protein is smaller, we expect that the dCas9 expression will increase. This will lead - according to the model - to a higher output of the system.
 +
</p>
 +
<p>
 +
There are several other possibilities of increasing the expression rate practically: <br>
 +
One is to change the promoter, another one means transfecting more plasmid DNA and - as already mentioned - there is the possibility to truncate the protein itself. Although a change of the promoter seems to be the most simple method, we couldn't perform this, because we already used the strong CAG promoter. By transfecting more plasmid DNA the toxic effects on the cells will increase.
 +
This is why we decided to truncate the dCas9-VP16. We assumed that by reduction of size of dCas9-VP16 with the same amount of cellular resources more protein can be built. A positive side effect might therefore also be that the off target effect on the regular protein translation goes down, as well.<br>
 +
To read more about the truncation click <a id="link" href="https://2013.igem.org/Team:Freiburg/Project/truncation">here</a>.
 +
</p>
 +
 
 +
<center>
 +
<table class="imgtxt" width="500px">
 +
<tbody><tr>
 +
<td> <img class="imgtxt" src="https://static.igem.org/mediawiki/2013/c/c0/Plot_verschiedenReaktionsraten.png" width="500px"> </td>
 +
</tr>
 +
<tr>
 +
<td> <b>Figure 9: Parameter Scan </b><br>
 +
The theoretical behaviour of the dCas9-VP16 concentration and the SEAP concentration by various dCas9-VP16 production rates.
 +
</td>
 +
</tr>
 +
</tbody></table>
 +
</center>
 +
 
 +
 
 +
 
 +
<p id="h3"> 2. dCas9-KRAB </p>
 +
 
 +
<p> The same procedure was performed to find the parameters for dCas9-KRAB. All parameters were set to changing values and the initial concentrations to zero. Fitting applied to initial concentrations of zero led to a lower least square error. As assumed before the dCas9-KRAB reaches its steady state within the measured time period, approximately at the same time point as Cas9-VP16. However the expected stop of the increase in SEAP concentration could not be observed. The effect of repression is unobservable due to the high ratio of tetR to dCas9.</p> 
 +
 
 +
<div>
 +
<table id="Fig9" class="imgtxt">
 +
<tbody><tr>
 +
<td> <img id="bild" src="https://static.igem.org/mediawiki/2013/4/47/Freiburg2013_Rep100_1.png"> </td>
 +
</tr>
 +
<tr>
 +
<td> <b  >Figure 10: Modeling Result:</b><br>  Experimental results (purple square) are shown in comparison to the model prediction values (cyan cross) for SEAP and dCas9, as well as the model prediction for the non measured component tetR. </td>
 +
</tr>
 +
</tbody></table>
 +
</div> 
 +
 
 +
<p> This time we also fitted the different error values <span id="refer"> <a href="#Fig11"> (Fig.11)</a></span>.</p>
 +
 
 +
<div>
 +
<table id="Fig11" class="imgtxt">
 +
<tbody><tr>
 +
<td> <img id="bild" src="https://static.igem.org/mediawiki/2013/d/d9/Rep100_Fehler1.png"> </td>
 +
</tr>
 +
<tr>
 +
<td> <b >Figure 11: Different error values plotted in increasing order.</b><br> </td>
 +
</tr>
 +
</tbody></table>
 +
</div> 
 +
 
 +
<table id='Tab2'>
 +
<thead><b> Table 2: Overview of the parameter results of the dCas9-KRAB fitting process </b> </thead>
 +
<tbody>
 +
<tr> <th> parameter </th><th> value </th> </tr>
 +
<tr> <td> linear dCas9 production rate </td><td> 7.04E-04 [M/h] </td> </tr>
 +
<tr> <td> dCas9 degradation rate </td><td> 1.55E+02 [1/h] </td> </tr>
 +
<tr> <td> tetR production rate </td><td> 5.59E+09 </td> </tr>
 +
<tr> <td> tetrR degradation rate </td><td> 1.29E-04 </td> </tr>
 +
<tr> <td> SEAPs leaky production rate </td><td> 2.70E-15 [M] </td> </tr>
 +
<tr> <td> Cas9/tetR specific production rate </td><td> 0.18 [M] </td> </tr>
 +
<tr> <td> Cas9-KRAB specific constant </td><td> 1.27E+00 [1/M] </td> </tr>
 +
<tr> <td> tetR specific constant </td><td> 4.30E+13 </td> </tr>
 +
 
 +
<tr> <td> initial concentration: dCas9 </td><td> 0.00 [M] </td> </tr>
 +
<tr> <td> initial concentration: tetR </td><td> 0.00 </td> </tr>
 +
<tr> <td> initial concentration: SEAP </td><td> 8.90E-19 [M] </td> </tr>
 +
<tr> <td> least square error </td><td> 119.6221 </td> </tr>
 +
</tbody>
 +
</table>
 +
 
 +
 
 +
 
 +
 
 +
 
 +
<p id="h2"> The Code Files </p>
<div>
<div>
<ul>
<ul>
-
<li><a id="link" href="https://static.igem.org/mediawiki/2013/9/94/Freiburg2013_ParamFit.m"> ParamFit </a></li>
+
<li><a id="link" href="https://static.igem.org/mediawiki/2013/9/94/Freiburg2013_ParamFit.m"> Script </a></li>
-
<li><a id="link" href="https://static.igem.org/mediawiki/2013/9/9f/Freiburg2013_ActFit.m"> ActFit</a></li>
+
<li><a id="link" href="https://static.igem.org/mediawiki/2013/e/e7/Freiburg2013_ActFit1.m"> Errorfunction </a></li>
-
<li><a id="link" href="https://static.igem.org/mediawiki/2013/b/ba/Freiburg2013_PlotFit.m"> PlotFit</a></li>
+
<li><a id="link" href="https://static.igem.org/mediawiki/2013/b/ba/Freiburg2013_PlotFit.m"> Plotting Function</a></li>
-
<li><a id="link" href="https://static.igem.org/mediawiki/2013/6/6f/Freiburg2013_PlotError.m"> PlotError </a></li>
+
<li><a id="link" href="https://static.igem.org/mediawiki/2013/6/6f/Freiburg2013_PlotError.m"> Plotting Errorvalues </a></li>
-
<!-- <li><a id="link" href=""> </a></li>
+
<li><a id="link" href="https://static.igem.org/mediawiki/2013/2/22/Freiburg2013_Lhs_setup.m"> lhs_setup </a></li>
-
<li><a id="link" href=""> </a></li>
+
<li><a id="link" href="https://static.igem.org/mediawiki/2013/1/1d/Freiburg_2013_Divide_k.m"> divide </a></li>
-
<li><a id="link" href=""> </a></li> -->
+
<li><a id="link" href="https://static.igem.org/mediawiki/2013/2/2c/Freiburg2013_Connect_k.m"> connect </a></li>  
</ul>
</ul>
</div>
</div>
 +
 +
 +
<p id="h3" >References:</p>
 +
<div id="(1)">
 +
    <small>
 +
        (1) Lagarias, J. <i>et al.</i> (1998). Convergence Properties of the Nelder-Mead Simplex Method in Low Dimensions. SIAM Journal of Optimization 9,
 +
        <i>
 +
            112–147</i>.
 +
      </small>
 +
</div>
 +
 +
<div id="(2)">
 +
    <small>
 +
        (2) Müller, K., <i>et al.</i> (2013). A red/far-red light-responsive bi-stable toggle switch to control gene expression in mammalian cells. Nucleic acids research 41,
 +
        <i>
 +
e77</i>.
 +
      </small>
 +
</div>
 +
 +
 +
<div id="(3)">
 +
    <small>
 +
        (3) Müller, K., <i>et al.</i> (2013). A red/far-red light-responsive bi-stable toggle switch to control gene expression in mammalian cells. Supplementary Information: Design and parameterization of the mathematical model.
 +
      </small>
 +
</div>
 +
 +
</body>
</body>
</html>
</html>

Latest revision as of 03:22, 29 October 2013


Modeling our dCAS

Introduction

We used a kinetc approach to model and characterize our system. It is based on various ordinary differential equations (ODE) which describe the behaviour of our network. Due to the limited measurement possibilities and the unwritten law, that you should at least measure half of the number of components of your network we started by using a small network with a limited amount of different components.

The Networks

1. dCAS-VP16

Our network includes four different components dCas9-VP16, a RNA complex (tracr/cr RNA), a RNA-dCas9-VP16 complex and the secreted alkaline phosphatase (SEAP). The RNA-complex is bound to dCas9-VP16 and the resulting complex binds the DNA, which leads to the production of SEAP.

Figure 1: Transcriptional Activation via dCas9-VP16:
The dCas9-VP16 fusion protein is guided to the desired DNA sequence by a co-expressed crRNA and tracrRNA. The binding of the gene recognition complex leads to an expression of SEAP.

Setting up the ODE

According to the graphical reaction network the ODE can be set up.

dCas9 is constitutively expressed by the CAG promoter and degraded proportionately to the current concentration. It is used to build the DNA recognition complex and is produced during complex decay.

The RNA-complex is built linearly. The production constant can be regarded as equivalent to the production constant of the lower expressed RNA (either crRNA or tracrRNA), since their expression limits the complex building. It is assumed that the RNA is degraded after DNA recognition complex decay and therefore the complex decay does not lead to an increase in the amount of RNA.

The DNA recognition complex is built when dCas9 and RNA meet and is degraded proportionately to the current DNA recognition complex concentration.

There is a leaky SEAP production and one that depends on the current concentration of the dCas9/RNA complex. This dependency is assumed to follow the Monod-kinetic. Because of the long half-life period (T2 > 500 h) of SEAP we can neglect the SEAP decay [2, 3].

The parameters are:
k1: linear production rate of Cas9
k2: Cas9 degradation rate
k3: tracr/crRNA production rate
k4: tracr/crRNA degradation rate
k5: gene recognition complex building rate
k6: cr/trRNA /Cas9 degradation rate
k7: SEAPs leaky production rate
k8: Complex dependent SEAP production rate
k9: Cas9-VP16 specific constant

2. dCAS-KRAB

The dCas-KRAB sytem was modeled as a three component system. tetR as activator of gene production, dCas-KRAB as repressor and SEAP as final reporter protein.

Figure 2: Transcriptional respression via dCAS-KRAB:
The dCAS-KRAB fusion protein binds to the desired target sequence at a different locus than the tetR. tetR in turn binds to tetO and is assumed to repress SEAP production.

Setting up the ODE

According to the graphical reaction network the ODE can be set up.

dCas9 is constitutively expressed by the CAG promoter and degraded proportionately to the current concentration, similar to the activation model. It binds to the DNA and is released afterwards. Since the amount of transfected dCas9-plasmid is four times higher than that of TetR-VP16 the basal expression rate is equally assumed to be four times higher than that of TetR.

tetR is assumed to have a similiar kinetic as dCas9.

There is a leaky SEAP production and one that depends on the current concentration of dCas9 and tetR. This dependency is assumed to follow an allosteric inhibition process. Similar to the activation process the SEAP decay can be neglected because of the long half-life period (T2 > 500 h) of SEAP [2, 3].

The parameters are:
k1: linear production rate of Cas9
k2: Cas9 degradation rate
k3: tetR production rate
k4: tetR degradation rate
k5: SEAPs leaky production rate
k6: Cas9/tetR specific production rate
k7: Cas9-KRAB specific constant
k8: tetR specific constant

Determining the parameters

By setting up the ODE a (n+1)-dimensional hypothesis space (n is the number of parameters) is generated. Determining the right parameter combination means finding a point in the space which fits the data best.
To find these parameters we used the maximum likelihood approach. The maximum likelihood hypothesis is the procedure which has the highest probability to generate the measured data. It was shown, using the maximum likelihood approach and assuming gaussion noise to the data (an assumption that holds in our case) leads to a least-square error minimization problem [3].

A minimization problem is an optimization problem. One searches for parameters (p0) for which the assumption holds, that the value of the function (f) at the point of the parameters is smaller than all other values (f(p0)<=f(p)). In three dimensions the function can be regarded as a landscape and minimization as finding the deepest valley. Depending on the method one uses different problems arise. The most common problem is finding an only local minimum instead of the global one (Fig. 3).

Figure 3: Example of a minimization problem
A 3D landscape is shown. Depending on the start position (the initial parameters), the minimum which is found can either be a local minimum or the global one.

To avoid this issue and to make sure a global minimum was found we started our minimization procedure using different start values for our parameters. To sample these parameters we used the latin hypercube sampling on a logarithmic scale (Fig. 4).

Figure 4: Illustration of the latin hypercube sampling in a two dimensional parameter space.
The number of initial parameter vectors is 5. Therefore the parameter space is divided in 25 subspaces. One possible parameter combination is shown here.

The number of different initial parameter settings is set to N and thus the parameter space is divided in N*N subspaces. For the initial parameter the values are chosen in a way that there is only one parameter in each row and column.

The resulting errors were plotted in an increasing order to be sure the global minimum could be found.

Data generation

Fusion proteins with effectors (dCas9-VP16, dCas9-KRAB) were quantified, as well as SEAP reporter protein (secreted alkaline phosphatase).

1. dCas9-Effector Quantification

dCas9-VP16 and dCas9-KRAB were quantified by using Western Blots.

Figure 5: Western Blot quantification of protein concentrations.

2. SEAP Reporter Quantification

SEAP reporter protein was quantified by using a SEAP assay.

Figure 6: SEAP reporter protein quantification by using the SEAP assay.

For more detailed information refer to our modeling notebook.

Fitting Procedure and Results

1. dCas9-VP16

Assuming the given ODE and using the fminsearch-function of matlab for various initial parameter vectors the fitting process resulted in one optimal parameter composition. Our measurement time started with a transfection. The first data were obtained after 6 h. As the initial concentrations were unknown all 14 parameters (including the initial concentrations) were assumed to obtain changing values during the fit. This procedure results in unlikely high initial concentrations. Although the transfection end point could not clearly be defined only a small amount of the components should be present within the cells after transfection . A second process in which the initial concentrations were assumed to be zero followed the first one. The change between fixed and variable parameters was easy to perform, because of an additional vector (qfit). This vector contained different boolean values depending on whether the parameter was fixed (false) or flexible (true) during the fitting process. Moreover another parameter was required for adjusting the absolute values of the duplicates. The results of this second process are shown in table 1.

The model reflects the general construction of the network (Fig. 5). As assumed dCas9 converges asymptotically to a steady state and an exponential increase in the SEAP concentration can be observed. Both processes are reflected in the data. The model also shows a potential behavior of the non measured components, the free tracr/crRNA-complex and the gene recognition complex. The tracr/crRNA-complex reaches its steady state quickly. However there is no possibility to distinguish between the two different RNAs. There might be some differences in their expressions, especially because of the different promoters (crRNA expressed under U6-promoter; tracrRNA expressed under h1-promoter), however, the model does not show them.
The free gene recognition complex has not reached its steady state yet due to the small degradation rate.

Figure 7: Modeling Result:
This figure shows the comparison of the experimental results (purple square) to the model prediction values (cyan cross) for SEAP and dCas9, as well as the model prediction for non measured components.

Since the fminsearch algorithm is not proved to converge to a minimum [1], different points within the parameter space were chosen to ensure a high probability for finding the global minimum (Fig. 8).

Figure 8: Different error values plotted in increasing order.
Table 1: Overview of the parameter results of the dCas9-VP16 fitting process
parameter value
linear production rate of dCas9 8.78E+02 [M/h]
dCas9 degradation rate 0.54 [1/h]
tracr/crRNA production rate 0.73
tracr/crRNA degradation rate 1.21E+08
Gene recognition complex building rate 1.61 [1/M]
cr/trRNA /dCas9 degradation rate 8.59E-26
SEAPs leaky production rate 4.05E-05 [M/h]
Complex dependent SEAP production rate 1.86E+08 [M/h]
Cas9-VP16 specific constant 1.06
initial concentration: dCas9 0.00 [M]
initial concentration: tracr/crRNA -Komplex 0.00
initial concentration: gene recognition complex 0.00
initial concentration: SEAP 2.73E-09 [M]
least square error 159.3423

Using the Model

The parameters of our model were identified by using MATLAB and a least square error minimization procedure. Afterwards the model, written in SBML by using COPASI, was used to test how the amounts of reporter protein can be increased by changing other parameters. As assumed the reporter protein level depends on the concentration of RNA and dCas9-VP16. If one paramter is increased, reporter level rises.

We decided to try to increase the dCas9-VP16 expression rate, because there are no possibilites to increase the RNA amount. The promoters coding for the RNAs are classified as strong and it's not possible to shorten the sequences. dCas9 in contrast is a large protein - over 160 kDa - and has various functions. Basically we need only the helicase and DNA binding domain. Therefore it should be possible to remove other domains by truncating the protein and so reducing the protein's size. When the protein is smaller, we expect that the dCas9 expression will increase. This will lead - according to the model - to a higher output of the system.

There are several other possibilities of increasing the expression rate practically:
One is to change the promoter, another one means transfecting more plasmid DNA and - as already mentioned - there is the possibility to truncate the protein itself. Although a change of the promoter seems to be the most simple method, we couldn't perform this, because we already used the strong CAG promoter. By transfecting more plasmid DNA the toxic effects on the cells will increase. This is why we decided to truncate the dCas9-VP16. We assumed that by reduction of size of dCas9-VP16 with the same amount of cellular resources more protein can be built. A positive side effect might therefore also be that the off target effect on the regular protein translation goes down, as well.
To read more about the truncation click here.

Figure 9: Parameter Scan
The theoretical behaviour of the dCas9-VP16 concentration and the SEAP concentration by various dCas9-VP16 production rates.

2. dCas9-KRAB

The same procedure was performed to find the parameters for dCas9-KRAB. All parameters were set to changing values and the initial concentrations to zero. Fitting applied to initial concentrations of zero led to a lower least square error. As assumed before the dCas9-KRAB reaches its steady state within the measured time period, approximately at the same time point as Cas9-VP16. However the expected stop of the increase in SEAP concentration could not be observed. The effect of repression is unobservable due to the high ratio of tetR to dCas9.

Figure 10: Modeling Result:
Experimental results (purple square) are shown in comparison to the model prediction values (cyan cross) for SEAP and dCas9, as well as the model prediction for the non measured component tetR.

This time we also fitted the different error values (Fig.11).

Figure 11: Different error values plotted in increasing order.
Table 2: Overview of the parameter results of the dCas9-KRAB fitting process
parameter value
linear dCas9 production rate 7.04E-04 [M/h]
dCas9 degradation rate 1.55E+02 [1/h]
tetR production rate 5.59E+09
tetrR degradation rate 1.29E-04
SEAPs leaky production rate 2.70E-15 [M]
Cas9/tetR specific production rate 0.18 [M]
Cas9-KRAB specific constant 1.27E+00 [1/M]
tetR specific constant 4.30E+13
initial concentration: dCas9 0.00 [M]
initial concentration: tetR 0.00
initial concentration: SEAP 8.90E-19 [M]
least square error 119.6221

The Code Files

References:

(1) Lagarias, J. et al. (1998). Convergence Properties of the Nelder-Mead Simplex Method in Low Dimensions. SIAM Journal of Optimization 9, 112–147.
(2) Müller, K., et al. (2013). A red/far-red light-responsive bi-stable toggle switch to control gene expression in mammalian cells. Nucleic acids research 41, e77.
(3) Müller, K., et al. (2013). A red/far-red light-responsive bi-stable toggle switch to control gene expression in mammalian cells. Supplementary Information: Design and parameterization of the mathematical model.