Team:NCTU Formosa/modeling

From 2013.igem.org

(Difference between revisions)

Latest revision as of 03:55, 29 October 2013

Modeling

Modeling was our first step forward. When validated with our experimental data, modeling is also a verification of the accuracy of our experiments.

Change the font size right here

MATLAB (matrix laboratory) is a numerical computing environment and fourth-generation programming language. It is developed by MathWorks, a company in United States. MATLAB allows matrix manipulations, plotting of functions and data, implementation of algorithms, creation of user interfaces, and interfacing with programs written in other languages, including C, C++, Java, and Fortran. Although MATLAB is intended primarily for numerical computing, an optional toolbox uses the MuPAD symbolic engine, allowing access to symbolic computing capabilities. An additional package, Simulink, adds graphical multi-domain simulation and Model-Based Design for dynamic and embedded systems.

ANFIS Introduction

Adaptive-Network-Based Fuzzy Inference System, in short ANFIS, is a power tool for constructing a set of fuzzy if-then rules to generate stipulated output and input pairs. Unlike system modeling using mathematical rules that lacks the ability to deal with ill-defined and uncertain system, ANFIS can transform human knowledge into rule base, and therefore, ANFIS can effectively tune membership functions, minimizing the output error.

Single Unit

Red Promoter

As shown in Figure 2, the red promoter is consisted of P_ompc and P_lac. By multiplying the experimental data of P_ompc + RBS + mGFP and P_lac + RBS + mGFP, we would be able to build a model for the red promoter. This model, however, wouldn't be so accurate. To solve this problem, we used this model to train the actual experimental data of the red promoter by using ANFIS. Figure 2 is the result of such a training. We obtained a curve between our model and the actual experimental data. This curve is the representation of the new model that has been trained and supported by the actually experimental data of P_red. This new model definitely contains a high degree of accuracy.

Figure 2. using ANFIS to modified the simulated data by experimental data to obtain a more accurate result.

Lux Promoter

We did the following modeling based on the data obtained from Imperial 2007 iGEM team. The data notes the strength of P_lux under different concentrations of AHL and different time frames.

Using ANFIS to train 76 sets of data and to test 20 sets of data, we ontained Figure 4. It shows that our training data exhibits a similar trend as the testing data, even though the computer has no based knowledge of the trend. This simply means that our modeling has successfully simulated the actually data.

Figure 4. The training and testing data using ANFIS system

Figure 5 is the resultant graph from input 1 (time) and input 2 (AHL concentration). According to this graph, we can observe the output (fluorescence) has two peaks about AHL concentration(at concentration of 4 nM and 40 nM). That means we could achieve our regulation goal with little AHL. Also, pleas note that there is more output as time passes.

Figure 5. Input 1 is time (min), input 2 is AHL concentration, and output is fluorescence.

37 °C RBS

We used Figure 6 biobrick to model our 37^oC RBS's function at different temperatures.

Figure 6. The biobrick used to test the temperature-regulated RBS function

First, we did a experiment that test the fluroscence at different temperature and different time. Choosing 100 sets of data to do machine learning, then we tested 20 sets of data. As Figure 7 shown, the curve can classify 4 groups fit in 27^oC,32^oC,37^oC and 42^oC.

Figure 7.The figure shows 100 training data composed by 4 different temperture (blue dot), and simulated result (red star).

To test the accuracy of our model, we randomly chose 20 pairs of data which is not include in our training data to do the independent test, and the test result is shown in Figure 8. The blue dot in the figure represents the real experimental data that we randomly choose from our whole dataset, and the red star represents simulated result of our model. It is obviously showed that our model can accurately predict the biobrick function in any condition by using ANFIS.

Figure 8. The blue dots represents the real experimental data. The red star represents simulated result of our model.

Figure 9. Input 1= Time (hr), Input 2= Temperature (degree Celsius), Output = Normalized expression (AU).

From Figure 9, the maximum output is obtained at 37 ^oC. Under the same time frame, the output (the normalized expression of the reporter gene) is maximized at 37 ^oC while minimized at 25 ^oC. There is a dramatic decrease in the output below 30 ^oC and the outputs around 37^oC are much higher. This modeling demonstrates that using 37 ^oC RBS is a plausible approach for achieving gene expression through temperature.

Reference

iGEM 2007 Imperial https://2007.igem.org/Imperial

E.colightuner Simulation

We have all the modeling of each single unit . Now, we want to combine each unit to make a stimulation to the E.colightuner.

Figure 10.The overall constituent of E.colightuner

Figure 10 shows the essential constituent of our E. colightuner. To save our efforts experimenting with this essential engine of E. colightuner, we built a model beforehand to help us evaluate its practicability. Our model is consisted of four components A, B, C, and D

Figure 11 is the overall picture of our component A. Notice that component A is built from similar biobricks as our E.colightuner. The only difference between the two is the promoters used. Both P_red of E.colightuner are substituted with P_cons in component A. With that said, by taking the difference in the strength between P_red and P_cons, into calculations, we would be able to model out E. colightuner with component A. Before that, however, we would have to first build a model for component A.

To build a model for component A, we combined component B and C. As you can see in component B shown in Figure 12, it does not include P_lux and luxR like component A. By assuming that when luxR is expressed, P_lux would immediately reach its full strength, however, we can consider luxR and P_lux pair as simply a Pcons,that is constitutively activated. From this perspective, component A and B are the same, except that component A is also effected by the translation efficiency of 37^oC RBS. Component C is the model for 37^oC RBS which was shown in the Single Unit part above. So by multiplying component C and component B, we would be able to obtain a model for component A.

In order to increase the accuracy of our component A model, we used ANFIS to fit our modeling result with the actual experimental data of our component A. By doing this, we obtained a new modeling curve for component that is more precise and accurate, shown in Figure 13.

Figure 13. using ANFIS to modified the simulated data by experimental data to obtain a more accurate result .

As mentioned above, we needed to take the effect of P_red into account before we could a model of E.colightuner based on component A. Component D has the model of P_red built in a similar way as component A model which was shown before, too. We first built a model for P_red and fitted it into the actual experimental data.

Having both precise model for component A and component D, we simply had to multiple them to obtain the final model for our E.Colightuner in Figure 14.

Figure 14. the final model for E.colightuner . A = Figure2. B = Figure3. C = 37 degree celsius RBS , D = red promoter

Using the single unit to predict the project we simplify like Figure 10, we got Figure 15. The predictive curve is under red light and at 37^oC. Due to the sRNA repression, the expression rate of mGFP is really low. This phenomenon is reasonable and interpretable. Thus, this modeling method can accurately predict the expression trend of a new biobrick. Applying this modeling system maturely, we can use computers to control our E.colightuner in the future.

Figure 15. The Figure 14 predicted tendency.

@@ Line 45: / Line 45: @@
 Choosing 100 sets of data to do machine learning, then we tested 20 sets of data. '''As Figure 7 shown, the curve can classify 4 groups fit in 27<sup>o</sup>C,32<sup>o</sup>C,37<sup>o</sup>C and 42<sup>o</sup>C.'''
 </p>
-[[File:Nctu_37_random_test.jpg|745px|center|Figure 7.]]
+[[File:Nctu_37_random_test.jpg|745px|center|Figure 7.The figure shows 100 training data composed by 4 different temperture (blue dot), and simulated result (red star).]]
-[[File:37rbs_model.jpg|center|500px|Figure 8. Input 1= Time (hr), Input 2= Temperature (degree Celsius), Output = Normalized expression (AU).]]
+<p>To test the accuracy of our model, we randomly chose 20 pairs of data which is not include in our training data to do the independent test, and the test result is shown in Figure 8. The blue dot in the figure represents the real experimental data that we randomly choose from our whole dataset, and the red star represents simulated result of our model. '''It is obviously showed that our model can accurately predict the biobrick function in any condition by using ANFIS.'''</p>
+[[File:37rbs_test.png|745px|center|Figure 8. The blue dots represents the real experimental data. The red star represents simulated result of our model.]]
+[[File:37rbs_model.jpg|center|500px|Figure 9. Input 1= Time (hr), Input 2= Temperature (degree Celsius), Output = Normalized expression (AU).]]
-<p>From Figure 8, the maximum output is obtained at 37 <sup>o</sup>C. Under the same time frame, the output (the normalized expression of the reporter gene) is maximized at 37 <sup>o</sup>C while minimized at 25 <sup>o</sup>C. There is a dramatic decrease in the output below 30 <sup>o</sup>C and the outputs around 37<sup>o</sup>C are much higher. '''This modeling demonstrates that using 37 <sup>o</sup>C RBS is a plausible approach for achieving gene expression through temperature.''' </p>
+<p>From Figure 9, the maximum output is obtained at 37 <sup>o</sup>C. Under the same time frame, the output (the normalized expression of the reporter gene) is maximized at 37 <sup>o</sup>C while minimized at 25 <sup>o</sup>C. There is a dramatic decrease in the output below 30 <sup>o</sup>C and the outputs around 37<sup>o</sup>C are much higher. '''This modeling demonstrates that using 37 <sup>o</sup>C RBS is a plausible approach for achieving gene expression through temperature.''' </p>
 ======Reference======
@@ Line 62: / Line 64: @@
 ===E.colightuner Simulation===
 <p>We have all the modeling of each single unit . Now, we want to combine each unit to make a stimulation to the E.colightuner. </p>
-[[File:Figure1_NCTU_Formosa.png|300px|center|Figure 9.The overall constituent of E.colightuner ]]
+[[File:Figure1_NCTU_Formosa.png|300px|center|Figure 10.The overall constituent of E.colightuner ]]
-<p>Figure 9 shows the essential constituent of our E. colightuner. To save our efforts experimenting with this essential engine of E. colightuner, we built '''a model beforehand to help us evaluate its practicability.''' Our model is consisted of four components A, B, C, and D</p>
+<p>Figure 10 shows the essential constituent of our E. colightuner. To save our efforts experimenting with this essential engine of E. colightuner, we built '''a model beforehand to help us evaluate its practicability.''' Our model is consisted of four components A, B, C, and D</p>
-[[File:Figure2_NCTU_Formosa.png|300px|center|Figure 10. component A ]]
+[[File:Figure2_NCTU_Formosa.png|300px|center|Figure 11. component A ]]
-<p>Figure 10 is the overall picture of our component A. Notice that component A is built from similar biobricks as our E.colightuner. The only difference between the two is the promoters used. '''Both P<sub>red</sub> of E.colightuner are substituted with P<sub>cons</sub> in component A.''' With that said, by taking the difference in the strength between P<sub>red</sub> and P<sub>cons</sub>, into calculations, we would be able to model out E. colightuner with component A. Before that, however, we would have to first build a model for component A. </p>
+<p>Figure 11 is the overall picture of our component A. Notice that component A is built from similar biobricks as our E.colightuner. The only difference between the two is the promoters used. '''Both P<sub>red</sub> of E.colightuner are substituted with P<sub>cons</sub> in component A.''' With that said, by taking the difference in the strength between P<sub>red</sub> and P<sub>cons</sub>, into calculations, we would be able to model out E. colightuner with component A. Before that, however, we would have to first build a model for component A. </p>
-[[File:Figure3_NCTU_Formosa.png|300px|center|Figure 11. component B ]]
+[[File:Figure3_NCTU_Formosa.png|300px|center|Figure 12. component B ]]
-<P>To build a model for component A, we combined component B and C. As you can see in component B shown in Figure 11, it does not include P<sub>lux</sub> and luxR like component A. By assuming that when luxR is expressed, P<sub>lux</sub> would immediately reach its full strength, however, we can consider luxR and P<sub>lux</sub> pair as simply a Pcons,that is constitutively activated. From this perspective, component A and B are the same, except that component A is also effected by the translation efficiency of 37<sup>o</sup>C RBS. Component C is the model for 37<sup>o</sup>C RBS which was shown in The Single Unit part above. So by multiplying component C and component B, we would be able to obtain a model for component A.</p>
+<P>To build a model for component A, we combined component B and C. As you can see in component B shown in Figure 12, it does not include P<sub>lux</sub> and luxR like component A. By assuming that when luxR is expressed, P<sub>lux</sub> would immediately reach its full strength, however, we can consider luxR and P<sub>lux</sub> pair as simply a Pcons,that is constitutively activated. From this perspective, component A and B are the same, except that component A is also effected by the translation efficiency of 37<sup>o</sup>C RBS. Component C is the model for 37<sup>o</sup>C RBS which was shown in the Single Unit part above. So '''by multiplying component C and component B, we would be able to obtain a model for component A.'''</p>
-<p>In order to increase the accuracy of our component A model, we used ANFIS to fit our modeling result with the actual experimental data of our component A. By doing this, we obtained a new modeling curve for component that is more precise and accurate, shown in Figure 12. </p>
+<p>In order to increase the accuracy of our component A model,''' we used ANFIS to fit our modeling result with the actual experimental data of our component A.''' By doing this, we obtained a new modeling curve for component that is more precise and accurate, shown in Figure 13. </p>
-[[File:NCTU_A-value.jpg|center|Figure 12. using ANFIS to modified the simulated data by experimental data to obtain a more accurate result .]]
+[[File:NCTU_A-value.jpg|center|Figure 13. using ANFIS to modified the simulated data by experimental data to obtain a more accurate result .]]
 <p>As mentioned above, we needed to take the effect of P<sub>red</sub> into account before we could a model of E.colightuner based on component A. Component D has the model of P<sub>red</sub> built in a similar way as component A model which was shown before, too. We first built a model for P<sub>red</sub> and fitted it into the actual experimental data. </p>
-<p>Having both precise model for component A and component D, we simply had to multiple them to obtain the final model for our E.Colightuner in Figure13. </p>
+<p>Having both precise model for component A and component D, we simply had to multiple them to obtain the final model for our E.Colightuner in Figure 14. </p>
-[[File:Figure7_NCTU_Formosa.png|300px|center|Figure 13. the final model for E.colightuner . A = Figure2. B = Figure3. C = 37 degree celsius RBS , D = red promoter]]
+[[File:Figure7_NCTU_Formosa.png|300px|center|Figure 14. the final model for E.colightuner . A = Figure2. B = Figure3. C = 37 degree celsius RBS , D = red promoter]]
-Using the single unit to predict the project we simplify like Figure 9, we got Figure 14. The predictive curve is under red light and at 37<sup>o</sup>C. Due to the sRNA repression, the expression rate of mGFP is really low. This phenomenon is reasonable and interpretable. Thus, this modeling method can accurately predict the expression trend of a new biobrick. Applying this modeling system maturely, we can use computers to control our E.colightuner in the future.
+Using the single unit to predict the project we simplify like Figure 10, we got Figure 15. The predictive curve is under red light and at 37<sup>o</sup>C. Due to the sRNA repression, the expression rate of mGFP is really low. This phenomenon is reasonable and interpretable. Thus, '''this modeling method can accurately predict the expression trend of a new biobrick.''' Applying this modeling system maturely, we can use computers to control our E.colightuner in the future.
-[[File:Nctu_model_whole_project.jpg|800px|center|Figure 14. The Figure 13 predicted tendency.]]
+[[File:Nctu_model_whole_project.jpg|800px|center|Figure 15. The Figure 14 predicted tendency.]]
 </div></div>