Team:WHU-China/templates/standardpage modeling
From 2013.igem.org
(Difference between revisions)
IgnatzZeng (Talk | contribs) |
IgnatzZeng (Talk | contribs) |
(59 intermediate revisions not shown) | |||
Line 5: | Line 5: | ||
</div> | </div> | ||
- | <div class="mac" style="height:auto;width:66%;float:right;margin-right:7%;line-height:1.2em;"> | + | <div class="mac" style="height:auto;width:66%;float:right;margin-right:7%;line-height:1.2em;*width:98%;*border: 3px solid green;"> |
<a name="overview"></a> | <a name="overview"></a> | ||
<h1 style="font-size:20px;"><b> | <h1 style="font-size:20px;"><b> | ||
1. Overview</b></h1></br> | 1. Overview</b></h1></br> | ||
- | This model aims at predicting the final output of a tandem promoter system, which | + | <div style="float:right;height:auto;">For a <span style="color:red" >pdf version</span> of the tandem promoter modeling part,click <a href="https://static.igem.org/mediawiki/2013/4/41/WHUTP_web_v1.1.pdf">here <img src="https://static.igem.org/mediawiki/2013/4/4e/WHUPdf_icon.jpg" width=30px height=30px /></a></div> |
+ | </br></br></br> | ||
+ | |||
+ | This model aims at predicting the final output of a tandem-repeat promoter system, which constitutes of repeated identical sub-promoter. The key idea of the model is that the strength of a promoter system is proportional to the probability of at least one RNA Polymerase (mentioned as RNAP latter) binding on the promoter.</br></br> | ||
<a name="Symbol"></a> | <a name="Symbol"></a> | ||
<h1 style="font-size:20px;"><b> | <h1 style="font-size:20px;"><b> | ||
2. Symbol table, Assumption and reasons.</b></h1></br> | 2. Symbol table, Assumption and reasons.</b></h1></br> | ||
- | < | + | <div style="text-align:center;width:100%;"> |
- | + | <table style="text-align:left;"> | |
+ | <tr><td class="topstike" style="border-right:0;">Definition</td><td class="topstike" style="border-left:0;"> </td></tr> | ||
+ | <tr><td class="topstike">Relative Strength</td><td class="topstike">The relative strength of certain promoter is defined by let the strength of Anderson promoter BBa_J23100 equals to one (in E.coli), and adjust the strength of other promoters accordingly. | ||
+ | (<a href="http://parts.igem.org/Promoters/Catalog/Anderson">http://parts.igem.org/Promoters/Catalog/Anderson</a>) | ||
+ | </td></tr> | ||
+ | <tr><td class="topstike">Normalized Strength</td><td class="topstike">The normalized strength of certain promoter is calculated by dividing the strength of the promoter by the highest promoter strength in the host. The highest promoter strength can be reached by creating artificial tandem promoter constitutes of the strongest known promoter.</td></tr> | ||
+ | <tr><td class="topstike">Symbol</td><td class="topstike"> </td></tr> | ||
+ | <tr><td class="topstike">[ ]</td><td class="topstike">The symbol of concentration, i.e. [Protein] means the concentration of the protein</td></tr> | ||
+ | <tr><td class="topstike">ptot / y</td><td class="topstike">The probability of at least one RNAP(with all of its subunit) binding on the tandem promoter. It also means the normalized strength of the promoter.</td></tr> | ||
+ | <tr><td class="topstike">n / x</td><td class="topstike">The number of sub-promoters in the tandem promoter system.</td></tr> | ||
+ | <tr><td class="topstike">u</td><td class="topstike">Number of copies of a tandem promoter in a cell </td></tr> | ||
+ | <tr><td class="topstike">ξ</td><td class="topstike">Strength constant, equals to the strongest expression level possible (units in fluorenes normalized by a internal reference).</td></tr> | ||
+ | <tr><td class="topstike">V</td><td class="topstike">The volume of a cell</td></tr> | ||
+ | <tr><td class="topstike">pi</td><td class="topstike">The probability of a RNAP(with all of its subunit) form a RNAP-with complex with the ith sub-promoter in the tandem promoter system.</td></tr> | ||
+ | <tr><td class="topstike">qi</td><td class="topstike">qi=1-pi, the probability of a RNAP not binding to the ith sub-promoter</td></tr> | ||
+ | <tr><td class="topstike">j</td><td class="topstike">Cooperative factor</td></tr> | ||
+ | <tr><td class="topstike">α</td><td class="topstike">Transcription rate constant</td></tr> | ||
+ | <tr><td class="topstike">λ</td><td class="topstike">mRNA degradation constant</td></tr> | ||
+ | <tr><td class="topstike">v</td><td class="topstike">Translation rate constant</td></tr> | ||
+ | <tr><td class="topstike">k</td><td class="topstike">Protein degradation constant</td></tr> | ||
+ | <tr><td class="topstike">RNAP</td><td class="topstike">RNA Polymerase</td></tr> | ||
+ | <tr><td class="topstike">ODE</td><td class="topstike">Ordinary Differential Equation</td></tr> | ||
+ | <tr><td class="topstike">RP / RPc </td><td class="topstike">RNAP-Promoter complex, inactive complex</td></tr> | ||
+ | <tr><td class="topstike">RPi</td><td class="topstike">Intermediate complex</td></tr> | ||
+ | <tr><td class="topstike">RPo</td><td class="topstike">Open complex</td></tr> | ||
+ | </table> | ||
+ | <b><em>Table 1. Symbol table of TP Model</em></b></br></br> | ||
+ | </div> | ||
<ul><li> | <ul><li> | ||
- | 1.It’s assumed that the promoter strength is measured in the same species, with identical environment and growing stage. This | + | 1.It’s assumed that the promoter strength is measured in the same species, with identical environment and growing stage. This ensures that the concentration of all subunits of RNAP, all subunits of ribosome, all RNA degradation enzymes, all kind of proteases and all transportation protein are almost the same.</li><li> |
- | 2.In all measurement, the contexts of the | + | 2.In all measurement, the contexts of the promoters remain the same. i.e. same RBS, terminator, protein sequence, up stream element, down stream element and DNA supercoiling. </li><li> |
- | 3.All transcriptional factors are not considered in this version of the model, but can be included in the model with some modification to the equations. </li><li> | + | 3.All transcriptional factors are not considered in this version of the model, but can be included in the model with some modification to the equations.</li><li> |
- | 4.The promoter region is accessible for RNAP(and all kinds of its subunits), which means it’s not in heterochromatin region or any other condition that hamper a normal RNAP-DNA interaction. </li><li> | + | 4.The promoter region is accessible for RNAP(and all kinds of its subunits), which means it’s not in heterochromatin region or any other condition that hamper a normal RNAP-DNA interaction.</li><li> |
5.The probability of RNAP binding on the region between two sub-promoter within the tandem promoter system is neglected. As it contributes too little to final ptot. </li> | 5.The probability of RNAP binding on the region between two sub-promoter within the tandem promoter system is neglected. As it contributes too little to final ptot. </li> | ||
- | |||
<li> | <li> | ||
- | 6.The RNAP-DNA binding is assumed to stay on equilibrium in the model. This is reasonable because the open complex formation is a slow rate limiting step of transcription. So in the time scale of open complex formation, RNAP-DNA binding can always reach its equilibrium in neglectable time[1][2]. It’s also observed that the inactive RNAP-DNA complex can be detected on the DNA[3]. </ | + | 6.The RNAP-DNA binding is assumed to stay on equilibrium in the model. This is reasonable because the open complex formation is a slow rate limiting step of transcription. So in the time scale of open complex formation, RNAP-DNA binding can always reach its equilibrium in neglectable time[1][2]. It’s also observed that the inactive RNAP-DNA complex can be detected on the DNA[3].</li> |
- | + | (*The following assumption is adopted by the commonly used thermodynamic based model [1], but it’s challenged in the later part of the model. We will first keep this assumption to derive the model, and modified the model for conditions that this assumption do not work. The weakness of this assumption is discussed in detail in <a href="#here1">here</a> and <a href="#discussion">here</a>) | |
+ | <li> | ||
+ | 7.The probability (the speed) of RPc transforming to RPo is identical to all promoter, i.e. The strength of a promoter is merely related with the probability of RNAP binding to it. it enable us to calculate the promoter strength from the probability of RNAP binding to the promoter. </li> | ||
</ul> | </ul> | ||
</br></br></br> | </br></br></br> | ||
Line 36: | Line 67: | ||
We found that the strength of a tandem promoter system can be interpreted by a simple equation:</br> | We found that the strength of a tandem promoter system can be interpreted by a simple equation:</br> | ||
<div style="text-align:center"> | <div style="text-align:center"> | ||
- | <img src="https://static.igem.org/mediawiki/2013/ | + | <img src="https://static.igem.org/mediawiki/2013/8/80/WHU2013Refinetp1.png" align=center /></div> |
</br>Where qi is the probability of a RNAP(with all of its subunit) not forming a RNAP-with complex with the ith sub-promoter, n the number of sub-promoters, j the coordinative factor, and ξ the strength constant.</br> | </br>Where qi is the probability of a RNAP(with all of its subunit) not forming a RNAP-with complex with the ith sub-promoter, n the number of sub-promoters, j the coordinative factor, and ξ the strength constant.</br> | ||
</br> | </br> | ||
If we define the highest possible expression level of a promoter in certain species is 1. Then the equation 1 become normalized. </br> | If we define the highest possible expression level of a promoter in certain species is 1. Then the equation 1 become normalized. </br> | ||
- | <div style="text-align:center"> | + | <div style="text-align:center;"> |
- | <img src="https://static.igem.org/mediawiki/2013/ | + | <img src="https://static.igem.org/mediawiki/2013/f/fb/Refinetp2.png" align=center /></br></div> |
- | <div id="figcontainer" style="margin: | + | |
- | <em> | + | |
- | <b>Figure 1. | + | This model explains 99% of the tandem promoter strength variation caused by number of sub-promoters.</br> |
- | Y-axis | + | </br> |
- | The blue dot is data extracted from ref.[4] fig.2, the red line is the prediction made by | + | |
+ | |||
+ | <div id="figcontainer" style="text-align:center;float:right;margin:2.5%;width:95%;height:auto;float:right;border: 1px solid gray;"><img src="https://static.igem.org/mediawiki/2013/7/79/WHU2013Refine3.png" width=600px /></br> | ||
+ | <em style="width:50%;"> | ||
+ | <b>Figure 1.Prediction vs. Data plot and residual plot</b></br> | ||
+ | Y-axis shows the normalized promoter strength, X-axis the number of sub-promoters | ||
+ | The blue dot is data extracted from of ref.[4] fig.2 at14h and 25h, the red line is the prediction made by the model, the red dotted line is the 95% confidence bound. | ||
+ | </br> | ||
</em> | </em> | ||
</div> | </div> | ||
- | + | The model also successfully predict the strength of J23102- 23102 (BBa_K1081002) and J23106-23106 (BBa_K1081005) tandem promoters, with error less than 10%.</br> | |
- | + | ||
- | + | ||
- | + | ||
- | + | ||
- | + | ||
+ | <div id="figcontainer" style="text-align:center;float:right;margin:2.5%;width:95%;height:auto;float:right;border: 1px solid gray;"><img src="https://static.igem.org/mediawiki/2013/8/89/WHU2013Refinetp4.png" width=600px /></br> | ||
+ | <em style="width:50%;"> | ||
+ | <b>Figure 2. Experiment result versus Model prediction</b></br> | ||
+ | </em> | ||
+ | </div> | ||
+ | </br></br></br> | ||
+ | </br></br></br> | ||
- | |||
- | |||
<a name="derivation"></a> | <a name="derivation"></a> | ||
Line 70: | Line 108: | ||
<b> | <b> | ||
4.1 Expression level Measurement</b></br> | 4.1 Expression level Measurement</b></br> | ||
- | We use the fluorescence strength to indicate the strength of the promoter | + | We use the fluorescence strength to indicate the strength of the promoter. Because when the exciting light is fixed, the fluorescence is proportional to the concentration of FP. And FP can be lighted up in a short time after they are synthesis.</br> |
</br> | </br> | ||
<a name="trans"></a> | <a name="trans"></a> | ||
Line 94: | Line 132: | ||
We can consider [protein]eq as the indicator of the promoter strength, and let vα/ λk=ξ</br> | We can consider [protein]eq as the indicator of the promoter strength, and let vα/ λk=ξ</br> | ||
<div style="text-align:center"> | <div style="text-align:center"> | ||
- | <img src="https://static.igem.org/mediawiki/2013/ | + | <img src="https://static.igem.org/mediawiki/2013/5/5b/WHU2013Refinetp5.png" /></br></div> |
So the strength of the promoter is directly related to the concentration of the RNAP-DNA complex of this promoter.</br></br></br> | So the strength of the promoter is directly related to the concentration of the RNAP-DNA complex of this promoter.</br></br></br> | ||
<a name="RNAP"></a> | <a name="RNAP"></a> | ||
Line 104: | Line 142: | ||
The reaction can be combined with Central Dogma to be:</br> | The reaction can be combined with Central Dogma to be:</br> | ||
<div style="text-align:center"> | <div style="text-align:center"> | ||
- | <img src="https://static.igem.org/mediawiki/2013/ | + | <img src="https://static.igem.org/mediawiki/2013/3/34/WHU2013Refinetp6.png" /></br></div> |
Because K1 happens in a much smaller time scale. The probability of finding the polymerase | Because K1 happens in a much smaller time scale. The probability of finding the polymerase | ||
on the promoter will be given by its equilibrium constant K1.[1]</br></br> | on the promoter will be given by its equilibrium constant K1.[1]</br></br> | ||
To evaluate the probability of polymerase binding (pi) we must sum the Boltzmann weights over all possible states of P polymerase molecules on DNA. </br> | To evaluate the probability of polymerase binding (pi) we must sum the Boltzmann weights over all possible states of P polymerase molecules on DNA. </br> | ||
+ | |||
<div style="text-align:center"> | <div style="text-align:center"> | ||
- | <img src="https://static.igem.org/mediawiki/2013/ | + | <img src="https://static.igem.org/mediawiki/2013/0/00/WHU2013Refinetp7.png" /></br></div> |
This equation calculate the total Boltzmann weight of no RNAP binding to the target promoter, with N represent the number of non-specific sites on the DNA, P the effective RNAP number, ε^NS the non-specific binding energy, kb the Boltzmann constant and T the temperature.</br> | This equation calculate the total Boltzmann weight of no RNAP binding to the target promoter, with N represent the number of non-specific sites on the DNA, P the effective RNAP number, ε^NS the non-specific binding energy, kb the Boltzmann constant and T the temperature.</br> | ||
<div style="text-align:center"> | <div style="text-align:center"> | ||
- | <img src="https://static.igem.org/mediawiki/2013/ | + | <img src="https://static.igem.org/mediawiki/2013/b/b3/WHU2013Refinetp8.png" /></br></div> |
This equation calculate the total Boltzmann weight of one RNAP binding to promoter i, with ε^Si means the specific binding energy of promoter i.</br> | This equation calculate the total Boltzmann weight of one RNAP binding to promoter i, with ε^Si means the specific binding energy of promoter i.</br> | ||
So the probability of a RNAP binding to promoter i is,</br> | So the probability of a RNAP binding to promoter i is,</br> | ||
+ | |||
<div style="text-align:center"> | <div style="text-align:center"> | ||
<img src="https://static.igem.org/mediawiki/2013/4/4b/WHUPromoterProbability.png" /></br></div> | <img src="https://static.igem.org/mediawiki/2013/4/4b/WHUPromoterProbability.png" /></br></div> | ||
Line 131: | Line 171: | ||
<img src="https://static.igem.org/mediawiki/2013/b/b2/WHU2013pdp.png" /></br></div> | <img src="https://static.igem.org/mediawiki/2013/b/b2/WHU2013pdp.png" /></br></div> | ||
- | So | + | So the probability of RNAP binding to two promoter at the same time equals to the product of the probabilities of RNAP binding to the two promoter respectively. i.e. |
+ | </br><div style="text-align:center"> | ||
+ | <img src="https://static.igem.org/mediawiki/2013/d/d9/WHU2013Refinetp9.png" /></br></div></br></br> | ||
- | |||
- | |||
- | |||
- | |||
- | |||
As only one RNAP is needed to initiate the transcription in a tandem promoter system (the other RNAP will be blocked by the RNAP closest to the transcription initiation point). So the probability of at least one RNAP binding to the promoter is </br> | As only one RNAP is needed to initiate the transcription in a tandem promoter system (the other RNAP will be blocked by the RNAP closest to the transcription initiation point). So the probability of at least one RNAP binding to the promoter is </br> | ||
<div style="text-align:center"> | <div style="text-align:center"> | ||
<img src="https://static.igem.org/mediawiki/2013/f/fb/WHU2013Equation6.png" /></br></div> | <img src="https://static.igem.org/mediawiki/2013/f/fb/WHU2013Equation6.png" /></br></div> | ||
- | For a kind of promoter with u copies in a cell (all separated and function independently) | + | For a kind of promoter with u copies in a cell (all separated and function independently) |
+ | <div style="text-align:center"> | ||
+ | <img src="https://static.igem.org/mediawiki/2013/3/38/Refinetp10.png" /></br></div> | ||
+ | </br> | ||
+ | |||
+ | The strength of a promoter is, according to equation 5.</br> | ||
<div style="text-align:center"> | <div style="text-align:center"> | ||
<img src="https://static.igem.org/mediawiki/2013/6/60/WHU2013Strength.png" /></br></div> | <img src="https://static.igem.org/mediawiki/2013/6/60/WHU2013Strength.png" /></br></div> | ||
Line 150: | Line 192: | ||
<div style="text-align:center"> | <div style="text-align:center"> | ||
- | <img src="https://static.igem.org/mediawiki/2013/ | + | <img src="https://static.igem.org/mediawiki/2013/d/db/Refinetp11.png" /></br></div> |
- | However, | + | However, the prediction fail to explain the data. </br> </br> |
- | <div id="figcontainer" style="margin: | + | |
+ | |||
+ | <div id="figcontainer" style="text-align:center;float:right;margin:2.5%;width:95%;height:auto;float:right;border: 1px solid gray;"><img src="https://static.igem.org/mediawiki/2013/3/3a/Refinetp12.png" width=600px /></br> | ||
<em> | <em> | ||
- | <b>Figure 3. | + | <b>Figure 3. Prediction vs. Data and residual plot of the simpler model</b></br> |
- | + | Y-axis shows the normalized promoter strength, X-axis the number of sub-promoters | |
- | + | The blue dot is data extracted from of ref.[4] fig.2 at14h and 25h, the red line is the prediction made by the model | |
- | + | </br> | |
+ | </em></div> | ||
- | < | + | <a name="here1"></a> |
- | < | + | The data increase in y much quicker than our prediction, which indicate there will be some kind of cooperation among sub-promoters. This results in pij>pipj. The cooperation can be explained by the fact that when one RPo formed, it will “melt” the DNA duplex into two single strain. This DNA untwisting, unwinding and melting make the RNAP-DNA complex in the vicinity easier to transform from RPc to RPo. Therefore variation in α can no longer be ignored.</br></br> |
+ | So we should add a adjust term(the cooperation factor) into equation 8. Therefore equation 2 comes out, with nj as the cooperative factor.</br> | ||
- | |||
+ | <div style="text-align:center"> | ||
+ | <img src="https://static.igem.org/mediawiki/2013/a/a3/Refinetp13.png" /></br></div> | ||
- | <div id="figcontainer" style="margin: | + | |
+ | As we’ve showed in figure 1. This model successfully captures the essence of tandem promoter system.</br> | ||
+ | </br></br></br> | ||
+ | |||
+ | <a name="discussion" style="width:100%;float:left;"></a> | ||
+ | <div style="float:left;"> | ||
+ | <h1 style="font-size:20px;"><b> | ||
+ | 5.Discussion | ||
+ | </b></h1></br> | ||
+ | Because it failed to capture the interaction between sub-promoters, the flawed (but widely adopted) assumption 7 was proved inapplicable in tandem-repeat promoter strength prediction. Our data further showed that the it can not be employed to general tandem promoter condition.</br></br> | ||
+ | <div id="figcontainer" style="text-align:center;float:right;margin:2.5%;width:95%;height:auto;float:right;border: 1px solid gray;"><img src="https://static.igem.org/mediawiki/2013/a/a5/WHU2013Refinetp14.png" width=600px /></br> | ||
<em> | <em> | ||
- | <b>Figure 4. | + | <b>Figure 4. The relative strength of four tandem promoter</b></br> |
- | </em> | + | </em></div> |
- | </div> | + | Under assumption 7, the order of sub-promoters has nothing to do with the final output of the promoter. But obviously, though the strength of promoter J23116-106 and J23106-116 have no much difference, the strength of promoter J23102-106 differs greatly from the strength of promoter J23106-102.</br></br> |
+ | All these data reveal that there are various significant interaction between sub-promoters. And the α of different promoters varies a lot (Thus results in the giant difference between the strength of promoter J23102-106 and the strength of promoter J23106-102). </br></br> | ||
+ | The reason why the model works well in tandem-repeat promoter are: </br> | ||
+ | 1. The α is identical for all sub-promoters.</br> | ||
+ | 2. The cooperative factor successfully captures the interaction between sub-promoters.</br></br> | ||
+ | |||
+ | So, it’s understandable why the model cannot be easily modified to predict the strength of any randem tandem promoter. Because,</br> | ||
+ | 1. The α of different sub-promoters may vary.</br> | ||
+ | 2. The interaction between different promoter may vary a lot. (Thus results in the difference between J23102-106/J23106-102 and J23116-106/J23106-116)</br></br> | ||
+ | |||
+ | There is another two minor problem of the model.</br> | ||
+ | 1. The cooperative factor has no solid biological ground (it’s even a boundless function when x approach infinite). The more prudent way will be choosing a sigmoid function rather than nj as the cooperative factor. But that will make the model more complex and hard to employ when people just have scarce data about their promoter (easy over-fitting). So we decide to keep it in this simpler and efficient form.</br> | ||
+ | 2. The difference of translation efficiency caused by the length variation of mRNA 5’-UTR is ignored in the model. This will not undermine the accuracy of the model, because the influence of the length of 5’-UTR before RBS is trivia when the length is short, and the tandem promoter is often shorter than 100bp. It’s reported that changing the operon order of GGPP synthase and taxadiene synthase affect taxadiene synthase expression by 20% (GGPP synthase plus its RBS is ~1kb)[7] </br></br></br></br> | ||
+ | </div> | ||
- | |||
- | <a name="guideline"></a> | + | <a name="guideline" style="width:100%;float:left;"></a> |
<div style="float:left;"> | <div style="float:left;"> | ||
<h1 style="font-size:20px;"><b> | <h1 style="font-size:20px;"><b> | ||
- | + | 6.User Guideline | |
</b></h1></br> | </b></h1></br> | ||
To employ the model, the user need to assign the pi for each kind of promoter that will be used to construct the tandem promoter. | To employ the model, the user need to assign the pi for each kind of promoter that will be used to construct the tandem promoter. | ||
Line 195: | Line 264: | ||
4)using equation 2 to predict the ptot of the designed tandem promoter, with an empirical cooperative factor j=0.4.</br> | 4)using equation 2 to predict the ptot of the designed tandem promoter, with an empirical cooperative factor j=0.4.</br> | ||
</br> | </br> | ||
+ | <div style="text-align:center"> | ||
+ | <img src="https://static.igem.org/mediawiki/2013/5/57/WHU2013Refinetp15.png" /></br></div> | ||
+ | |||
In this way, the error of the prediction should be less than 4% of the maximum expression rate, as our data showed before.</br> | In this way, the error of the prediction should be less than 4% of the maximum expression rate, as our data showed before.</br> | ||
If the data allow, the user can carry out fit with a variable j, which may varies in different species and cell condition. | If the data allow, the user can carry out fit with a variable j, which may varies in different species and cell condition. | ||
- | |||
Line 209: | Line 280: | ||
3.DeHaseth, Pieter L., and John D. Helmann. "Open complex formation by Escherichia coli RNA polymerase: the mechanism of polymerase‐induced strand separation of double helical DNA." Molecular microbiology 16.5 (1995): 817-824.</br> | 3.DeHaseth, Pieter L., and John D. Helmann. "Open complex formation by Escherichia coli RNA polymerase: the mechanism of polymerase‐induced strand separation of double helical DNA." Molecular microbiology 16.5 (1995): 817-824.</br> | ||
4.Li, Mingji, et al. "A strategy of gene overexpression based on tandem repetitive promoters in Escherichia coli." Microb Cell Fact 11 (2012): 19.</br> | 4.Li, Mingji, et al. "A strategy of gene overexpression based on tandem repetitive promoters in Escherichia coli." Microb Cell Fact 11 (2012): 19.</br> | ||
- | 5.Buchler, Nicolas E., Ulrich Gerland, and Terence Hwa. "Nonlinear protein degradation and the function of genetic circuits." Proceedings of the National Academy of Sciences of the United States of America 102.27 (2005): 9559-9564.</br></em> | + | 5.Buchler, Nicolas E., Ulrich Gerland, and Terence Hwa. "Nonlinear protein degradation and the function of genetic circuits." Proceedings of the National Academy of Sciences of the United States of America 102.27 (2005): 9559-9564.</br> |
+ | 6.Alon, Uri. Introduction to Systems Biology: And the Design Principles of Biological Networks. Vol. 10. CRC press, 2007. Page 6.</br> | ||
+ | 7.Nishizaki, Tomoko, et al. "Metabolic engineering of carotenoid biosynthesis in Escherichia coli by ordered gene assembly in Bacillus subtilis." Applied and environmental microbiology 73.4 (2007): 1355-1361. | ||
+ | </br> | ||
+ | </em> | ||
+ | |||
</div> <!-- end of part 5 --> | </div> <!-- end of part 5 --> | ||
</br></br></br> | </br></br></br> | ||
- | |||
+ | </div> | ||
+ | <div class="link"> | ||
</html> | </html> |
Latest revision as of 14:31, 28 October 2013
1. Overview
A PDF version of the tandem promoter modeling part is available at https://static.igem.org/mediawiki/2013/4/41/WHUTP_web_v1.1.pdf
This model aims to predict the final output of a tandem-repeat promoter system, which consists of repeated identical sub-promoters. The key idea of the model is that the strength of a promoter system is proportional to the probability that at least one RNA polymerase (hereafter RNAP) is bound to the promoter.
2. Symbol table, assumptions and reasons
Definition | |
Relative Strength | The relative strength of a promoter is defined by setting the strength of the Anderson promoter BBa_J23100 to one (in E. coli) and scaling the strength of other promoters accordingly. (http://parts.igem.org/Promoters/Catalog/Anderson) |
Normalized Strength | The normalized strength of a promoter is its strength divided by the highest promoter strength in the host. The highest promoter strength can be reached by building an artificial tandem promoter from the strongest known promoter. |
Symbol | |
[ ] | Concentration, e.g. [Protein] is the concentration of the protein |
ptot / y | The probability that at least one RNAP (with all of its subunits) is bound to the tandem promoter. It is also the normalized strength of the promoter. |
n / x | The number of sub-promoters in the tandem promoter system. |
u | Number of copies of a tandem promoter in a cell |
ξ | Strength constant, equal to the strongest possible expression level (in fluorescence units normalized by an internal reference). |
V | The volume of a cell |
pi | The probability that an RNAP (with all of its subunits) forms an RNAP-promoter complex with the ith sub-promoter in the tandem promoter system. |
qi | qi = 1 - pi, the probability that an RNAP does not bind to the ith sub-promoter |
j | Cooperative factor |
α | Transcription rate constant |
λ | mRNA degradation constant |
v | Translation rate constant |
k | Protein degradation constant |
RNAP | RNA polymerase |
ODE | Ordinary differential equation |
RP / RPc | RNAP-promoter complex, inactive (closed) complex |
RPi | Intermediate complex |
RPo | Open complex |
- 1. The promoter strength is assumed to be measured in the same species, under an identical environment and growth stage. This ensures that the concentrations of all RNAP subunits, all ribosome subunits, all RNA degradation enzymes, all kinds of proteases and all transport proteins are almost the same.
- 2. In all measurements, the context of the promoters remains the same, i.e. the same RBS, terminator, protein sequence, upstream element, downstream element and DNA supercoiling.
- 3. Transcription factors are not considered in this version of the model, but they can be included with some modification to the equations.
- 4. The promoter region is accessible to RNAP (and all of its subunits), which means it is not in a heterochromatin region or in any other condition that hampers normal RNAP-DNA interaction.
- 5. The probability of RNAP binding to the region between two sub-promoters within the tandem promoter system is neglected, as it contributes too little to the final ptot.
- 6. RNAP-DNA binding is assumed to stay at equilibrium in the model. This is reasonable because open complex formation is a slow, rate-limiting step of transcription, so on the time scale of open complex formation RNAP-DNA binding reaches equilibrium in negligible time [1][2]. The inactive RNAP-DNA complex has also been detected on DNA [3].
(*The following assumption is adopted by the commonly used thermodynamic model [1], but it is challenged in the later part of the model. We first keep this assumption to derive the model, then modify the model for conditions where the assumption does not hold. The weakness of this assumption is discussed in detail in the paragraph following Figure 3 and in the Discussion, Section 5.)
- 7. The probability (the speed) of RPc transforming to RPo is identical for all promoters, i.e. the strength of a promoter depends only on the probability of RNAP binding to it. This enables us to calculate the promoter strength from the probability of RNAP binding to the promoter.
3. Modeling result
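We found that the strength of a tandem promoter system can be described by a simple equation (equation 1, an image on the original page), where qi is the probability that an RNAP (with all of its subunits) does not form an RNAP-promoter complex with the ith sub-promoter, n is the number of sub-promoters, j is the cooperative factor and ξ is the strength constant.
If the highest possible expression level of a promoter in a given species is defined as 1, equation 1 becomes normalized (equation 2).
This model explains 99% of the tandem promoter strength variation caused by the number of sub-promoters.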
Figure 1. Prediction vs. data plot and residual plot
The Y-axis shows the normalized promoter strength and the X-axis the number of sub-promoters.
The blue dots are data extracted from ref. [4], fig. 2, at 14 h and 25 h; the red line is the prediction made by the model, and the red dotted lines are the 95% confidence bounds.
The model also successfully predicts the strength of the J23102-23102 (BBa_K1081002) and J23106-23106 (BBa_K1081005) tandem promoters, with an error of less than 10%.
Figure 2. Experimental result versus model prediction
4. Model derivation
The promoter strength may be influenced by various factors. We need to simplify the system into a reasonable toy model by eliminating all relatively trivial factors.
4.1 Expression level measurement
We use fluorescence intensity to indicate the strength of the promoter, because when the excitation light is fixed, the fluorescence is proportional to the concentration of fluorescent protein (FP), and FP becomes fluorescent shortly after it is synthesized.
4.2 Translation and transcription
According to the Central Dogma
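The Central Dogma step can be sketched as two first-order rate equations. The block below is an illustrative reconstruction, not the page's own formulas (those were images). It assumes that transcription is driven by the concentration of the RNAP-promoter complex, written [RP], and it uses α, λ, v and k as defined in Table 1. Under these assumptions the steady state reproduces the substitution vα/λk = ξ used in the derivation.

```latex
\begin{align}
\frac{d[\mathrm{mRNA}]}{dt} &= \alpha\,[\mathrm{RP}] - \lambda\,[\mathrm{mRNA}], \\
\frac{d[\mathrm{Protein}]}{dt} &= v\,[\mathrm{mRNA}] - k\,[\mathrm{Protein}].
\end{align}
% Setting both derivatives to zero gives the equilibrium concentrations:
\begin{align}
[\mathrm{mRNA}]_{eq} &= \frac{\alpha}{\lambda}\,[\mathrm{RP}], \\
[\mathrm{Protein}]_{eq} &= \frac{v}{k}\,[\mathrm{mRNA}]_{eq}
  = \frac{v\alpha}{\lambda k}\,[\mathrm{RP}] = \xi\,[\mathrm{RP}].
\end{align}
```

In this form the equilibrium protein level scales linearly with [RP], which is why the promoter strength can be read off from the RNAP binding probability when α is the same for every promoter (assumption 7).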
Figure 3. Prediction vs. data and residual plot of the simpler model
The Y-axis shows the normalized promoter strength and the X-axis the number of sub-promoters.
The blue dots are data extracted from ref. [4], fig. 2, at 14 h and 25 h; the red line is the prediction made by the model.
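The simpler model treats the sub-promoters as binding RNAP independently. A minimal Python sketch of that idea follows. It uses only the definition qi = 1 - pi from Table 1 and the independence assumption, so ptot = 1 - q1·q2·...·qn. The function name and the value p_i = 0.2 are hypothetical, and the copy number u and the strength constant ξ are left out.

```python
from math import prod


def p_tot_independent(p):
    """Probability that at least one sub-promoter is occupied,
    assuming each sub-promoter binds RNAP independently."""
    for p_i in p:
        if not 0.0 <= p_i <= 1.0:
            raise ValueError("each p_i must lie in [0, 1]")
    # q_i = 1 - p_i is the probability that sub-promoter i is unoccupied;
    # the product of all q_i is the probability that none is occupied.
    return 1.0 - prod(1.0 - p_i for p_i in p)


# Hypothetical binding probability p_i = 0.2 for every repeat, n = 1..5
for n in range(1, 6):
    print(n, round(p_tot_independent([0.2] * n), 4))
```

Under independence each added repeat contributes less than the previous one, so the predicted curve flattens early as n grows.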
The data increase in y much faster than our prediction, which indicates some kind of cooperation among sub-promoters. This results in pij > pi pj. The cooperation can be explained by the fact that when one RPo forms, it melts the DNA duplex into two single strands. This DNA untwisting, unwinding and melting makes it easier for RNAP-DNA complexes in the vicinity to transform from RPc to RPo. Therefore variation in α can no longer be ignored.
So we add an adjustment term (the cooperative factor) to equation 8, which yields equation 2, with n^j as the cooperative factor.
5. Discussion
Because it fails to capture the interaction between sub-promoters, the flawed (but widely adopted) assumption 7 proved inapplicable to tandem-repeat promoter strength prediction. Our data further show that it cannot be applied to general tandem promoters.
Figure 4. The relative strength of four tandem promoters
Under assumption 7, the order of sub-promoters has no effect on the final output of the promoter. However, although the strengths of promoters J23116-106 and J23106-116 differ little, the strength of promoter J23102-106 differs greatly from that of promoter J23106-102.
These data reveal significant and varied interactions between sub-promoters, and show that the α of different promoters varies considerably (hence the large difference between the strengths of J23102-106 and J23106-102).
The model works well for tandem-repeat promoters for two reasons:
1. The α is identical for all sub-promoters.
2. The cooperative factor successfully captures the interaction between sub-promoters.
It is therefore understandable that the model cannot be easily modified to predict the strength of an arbitrary tandem promoter, because:
1. The α of different sub-promoters may vary.
2. The interaction between different promoters may vary a lot (hence the difference between J23102-106/J23106-102 and J23116-106/J23106-116).
There are two other minor problems with the model.
1. The cooperative factor has no solid biological basis (it is even an unbounded function as x approaches infinity). A more prudent choice would be a sigmoid function rather than n^j as the cooperative factor, but that would make the model more complex and hard to use when only scarce data about a promoter are available (it over-fits easily). So we decided to keep the simpler and more efficient form.
2. The difference in translation efficiency caused by variation in the length of the mRNA 5’-UTR is ignored in the model. This should not undermine the accuracy of the model, because the influence of the 5’-UTR length upstream of the RBS is trivial when the length is short, and the tandem promoter is often shorter than 100 bp. It has been reported that changing the operon order of GGPP synthase and taxadiene synthase affects taxadiene synthase expression by 20% (GGPP synthase plus its RBS is ~1 kb) [7].
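The first point, that n^j grows without bound while a sigmoid would saturate, can be illustrated numerically. The sketch below uses the empirical j = 0.4 quoted in the user guideline. The logistic alternative and its parameters (ceiling, midpoint, steepness) are hypothetical choices for illustration only and are not fitted to any data.

```python
from math import exp

J = 0.4  # empirical cooperative factor quoted in the user guideline


def power_factor(n, j=J):
    """Cooperative factor n**j as used in the model; unbounded in n."""
    return n ** j


def logistic_factor(n, ceiling=3.0, midpoint=4.0, steepness=1.0):
    """Hypothetical bounded alternative: a logistic curve that
    saturates at `ceiling` for large n."""
    return ceiling / (1.0 + exp(-steepness * (n - midpoint)))


for n in (1, 2, 5, 10, 100, 1000):
    print(n, round(power_factor(n), 3), round(logistic_factor(n), 3))
```

The logistic form needs three parameters where n^j needs one, which is the over-fitting concern raised above when only a few measurements are available.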