From 2013.igem.org

(Difference between revisions)

Revision as of 10:30, 27 September 2013

Slide

Take a gNAP before wearing your gloves! Genetic Network Analyze and Predict

The sketch and final GUI of gNAP!

We compare the results of our software with gene expression profiles from the literature.

We are USTC-Software!

Methodologies

To simulate how a gene regulatory network (GRN) works and to analyze how it changes after an exogenous gene is imported, the software employs several advanced algorithms and classical methods: the Binary Tree method, the Needleman-Wunsch algorithm, the Decision Tree method, the Hill equation and the Particle Swarm Optimization (PSO) algorithm.
The methodology has five parts: Fetch Database, Alignment Analyze, New Network Construction, Network Model and Predict.

Fetch Database

Fetch Database Abstract

Fetch Regulation

Fetch Gene Info

Fetch Promoter Info

Integration

Our software integrates all the information we picked out about genes and generates a file named “all_info” (all information about genes) for the output graphical interface to read. Meanwhile, the array of objects containing all this information is kept in computer memory, which greatly improves the computing speed of our software.

The format of all_info database:
No. promoter_sequence gene_sequence gene_name ID left_position right_position promoter_name description
The fetching module generates three files: old_GRN, all_info and uncertain_database.
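
As a rough illustration of the record layout listed above, the following sketch reads one all_info line into a named record. It assumes tab-separated fields in the column order shown; the actual delimiter is not stated here, and the names GeneRecord and parse_all_info_line are hypothetical.

```python
from typing import NamedTuple


class GeneRecord(NamedTuple):
    """One row of all_info, in the column order listed above (hypothetical type)."""
    no: int
    promoter_sequence: str
    gene_sequence: str
    gene_name: str
    gene_id: str
    left_position: int
    right_position: int
    promoter_name: str
    description: str


def parse_all_info_line(line: str, sep: str = "\t") -> GeneRecord:
    """Parse one line; the tab separator is an assumption, not the documented format."""
    # Split at most 8 times so a description containing the separator stays whole.
    fields = line.rstrip("\n").split(sep, 8)
    if len(fields) != 9:
        raise ValueError(f"expected 9 fields, got {len(fields)}")
    no, prom_seq, gene_seq, name, gene_id, left, right, prom_name, desc = fields
    return GeneRecord(int(no), prom_seq, gene_seq, name, gene_id,
                      int(left), int(right), prom_name, desc)
```

Keeping the positions as integers and the sequences as strings matches the way the later steps would use them: positions for locating a gene, sequences for alignment.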

Operon Theory and Regulatory Model

Operon Theory

In genetics, an operon is a functioning unit of genomic DNA containing a cluster of genes under the control of a single regulatory signal or promoter.
The genes contained in the operon are either expressed together or not at all.
Several genes must be both cotranscribed and co-regulated to define an operon.

The term “operon” was first proposed in 1960, in a paper in the Proceedings of the French Academy of Sciences. The lac operon of the model bacterium E. coli provides a typical example of operon function. It consists of a promoter, an operator, three structural genes and a terminator. The operon is regulated by several factors, including the availability of glucose and lactose.

From this paper, the so-called general theory of the operon was developed. According to the theory, all genes are controlled by means of operons through a single feedback regulatory mechanism: repression. The first operon to be described was the lac operon in E. coli. The 1965 Nobel Prize in Physiology or Medicine was awarded to François Jacob, André Michel Lwoff and Jacques Lucien Monod for their discoveries concerning the operon and virus synthesis.

Figure 1. Structure of Operon

An operon is made up of several structural genes arranged under a common promoter and regulated by a common operator. It is defined as a set of adjacent structural genes, plus the adjacent regulatory signals that affect transcription of the structural genes. The regulators of a given operon, including repressors, corepressors and activators, are not necessarily coded for by that operon.

An operon is a unit of transcription. Upstream of the structural genes lies a promoter sequence, which provides a site for RNA polymerase to bind and initiate transcription. Close to the promoter lies a section of DNA called an operator.

Operon regulation can be either negative or positive, by induction or repression. Negative control involves the binding of a repressor to the operator to prevent transcription. In positive control, an activator protein binds to DNA, usually at a site other than the operator, to stimulate transcription.

Figure 2. Regulation of Operon. 1: RNA Polymerase, 2: Repressor, 3: Promoter, 4: Operator, 5: Lactose, 6: lacZ, 7: lacY, 8: lacA. Top: The gene is essentially turned off. There is no lactose to inhibit the repressor, so the repressor binds to the operator, which obstructs the RNA polymerase from binding to the promoter and making lactase. Bottom: The gene is turned on. Lactose inhibits the repressor, allowing the RNA polymerase to bind to the promoter and express the genes, which synthesize lactase. Eventually, the lactase digests all of the lactose, until there is none left to bind to the repressor. The repressor then binds to the operator, stopping the manufacture of lactase.

Regulatory Model

Similarity and homology

New Network Construction

Random Noise

Filter

Construct new GRN

Consider a three-unit network whose units interact with each other as shown in the figure. The regulation is described by the GRN matrix.

Figure 5. Example network and its GRN matrix.

If D is the exogenous unit, we can obtain three similarity data sets of D with the units in the original GRN:

Promoter sequence similarity

Gene sequence similarity

Amino acid sequence similarity.
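
The Needleman-Wunsch algorithm named in the methodology overview is a standard way to score a global alignment between two sequences, and a score of this kind could underlie each of the three similarities. The sketch below is a minimal version under assumed scoring parameters (match +1, mismatch -1, gap -1); the scoring scheme and any normalization actually used by the software are not given here, and the function name is hypothetical.

```python
def needleman_wunsch_score(a: str, b: str,
                           match: int = 1, mismatch: int = -1, gap: int = -1) -> int:
    """Return the optimal global alignment score of sequences a and b.

    The scoring values are illustrative assumptions, not the software's settings.
    """
    # prev[j] holds the best score for aligning the processed prefix of a with b[:j].
    prev = [j * gap for j in range(len(b) + 1)]
    for i in range(1, len(a) + 1):
        curr = [i * gap]  # aligning a[:i] against an empty prefix costs i gaps
        for j in range(1, len(b) + 1):
            diagonal = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            up = prev[j] + gap        # gap in b
            left = curr[j - 1] + gap  # gap in a
            curr.append(max(diagonal, up, left))
        prev = curr
    return prev[-1]
```

Only two rows of the dynamic programming table are kept, which is enough when the score is needed but the alignment itself is not.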

The construction is equivalent to adding a new column and a new row to the original matrix.

Figure 6. Mathematical Equivalence
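
The matrix extension can be sketched in a few lines. This is an illustration only: it assumes the matrix is stored as a list of rows in which entry [i][j] is the regulation of unit i by unit j, so that the new column holds the effect of D on each existing unit and the new row holds the effect of every unit (D included) on D. The function name extend_grn is hypothetical.

```python
from typing import List

Matrix = List[List[float]]


def extend_grn(matrix: Matrix, new_column: List[float], new_row: List[float]) -> Matrix:
    """Return a copy of an n x n GRN matrix grown to (n+1) x (n+1).

    new_column: n values, one per existing unit (assumed: regulation by D).
    new_row:    n + 1 values (assumed: regulation of D, last entry is D on itself).
    """
    n = len(matrix)
    if len(new_column) != n or len(new_row) != n + 1:
        raise ValueError("new_column needs n entries and new_row needs n + 1")
    # Append one entry to every existing row, then add the row for the new unit.
    extended = [row + [new_column[i]] for i, row in enumerate(matrix)]
    extended.append(list(new_row))
    return extended
```

The original matrix is left untouched, so the old and new networks can be compared afterwards.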

When filling the column, D is compared with the regulators of the unit in each row. The regulations in the row are considered separately as a “positive group” and a “negative group”. The average similarity between D and each group represents how close the exogenous unit is to that group. D is assumed to regulate in the direction (positive or negative) of the group with the larger average similarity. The regulatory intensity is the weighted average of the regulations in the chosen group, where the weight is the amino acid sequence similarity.
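
The rule for one entry of the new column can be sketched as follows. This is a minimal interpretation, not the software's actual code: it assumes that one list of similarities serves both for the group averages and for the weights, that zero entries mean "no regulation", and that a tie goes to the positive group. The function name infer_entry is hypothetical.

```python
from typing import List


def infer_entry(row: List[float], similarity: List[float]) -> float:
    """Predict how D regulates one unit.

    row[j]        : existing regulation of this unit by regulator j (sign = direction).
    similarity[j] : similarity between D and regulator j (assumed non-negative).
    """
    positive = [(r, s) for r, s in zip(row, similarity) if r > 0]
    negative = [(r, s) for r, s in zip(row, similarity) if r < 0]

    def mean_similarity(group):
        return sum(s for _, s in group) / len(group) if group else 0.0

    # D takes the direction of the group it resembles more (tie: positive, an assumption).
    chosen = positive if mean_similarity(positive) >= mean_similarity(negative) else negative
    total_weight = sum(s for _, s in chosen)
    if total_weight == 0:
        return 0.0  # no regulators, or no similarity to any of them
    # Intensity: similarity-weighted average of the chosen group's regulations.
    return sum(r * s for r, s in chosen) / total_weight
```

For a unit with regulations [0.8, -0.5, 0.4] and similarities [0.9, 0.2, 0.3], the positive group has the higher average similarity (0.6 against 0.2), so the predicted entry is (0.8·0.9 + 0.4·0.3) / 1.2 = 0.7.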

There are two conditions when filling the new row:
1. There are units that have the same promoter as the exogenous unit.
2. There are no units that have the same promoter as the exogenous unit.

In condition 1, the units that share a promoter with the new member are picked out, and the remaining steps are the same as for the column, except that the similarity used here is the gene sequence similarity. As explained in the regulatory model part, the promoter is the main regulatory region, but the sequence that follows it is also considered. Since the promoters are identical, we focus on the gene sequences.

In condition 2, the process is almost the same as constructing the new column. Promoter similarity is used because the promoter is the main regulatory region.

Figure 7. Construct New GRN
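
The choice between the two conditions for the new row can be sketched as a small selection step that runs before the group comparison. The record fields (name, promoter, promoter_sim, gene_sim) and the function name select_row_inputs are hypothetical, and treating "the same promoter" as equality of promoter names is an assumption.

```python
from typing import Dict, List, Tuple


def select_row_inputs(new_promoter: str, units: List[Dict]) -> List[Tuple[str, float]]:
    """Pick the reference units and the similarity to use when filling the new row.

    Each unit is a dict with keys 'name', 'promoter', 'promoter_sim' and 'gene_sim',
    where the two similarities are measured against the exogenous unit.
    """
    same_promoter = [u for u in units if u["promoter"] == new_promoter]
    if same_promoter:
        # Condition 1: promoters are identical, so compare gene sequences instead.
        return [(u["name"], u["gene_sim"]) for u in same_promoter]
    # Condition 2: no shared promoter, so fall back to promoter similarity over all units.
    return [(u["name"], u["promoter_sim"]) for u in units]
```

The returned pairs would then feed the same positive-group and negative-group comparison that is used for the column.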

Network Model

Network Model Abstract

Network analysis consists of finding the stable condition of the network, adding a new gene, finding the new stable condition, and computing the changes from the original condition to the new one. We use the densities of materials to describe the network condition. If all material densities are time-invariant, the network condition is stable.
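
The stability criterion above (all densities time-invariant) can be illustrated with a small numerical sketch. It is not the software's model: the Hill-type rate law, the forward Euler integration, the tolerance and the two-gene toy network are all assumptions chosen for illustration, and every name is hypothetical.

```python
from typing import Callable, List


def hill_activation(x: float, k: float, n: float) -> float:
    """Hill function for an activator at density x (k: half-saturation, n: Hill coefficient)."""
    return x ** n / (k ** n + x ** n)


def find_steady_state(deriv: Callable[[List[float]], List[float]],
                      state: List[float],
                      dt: float = 0.01,
                      tol: float = 1e-8,
                      max_steps: int = 1_000_000) -> List[float]:
    """Integrate with forward Euler until every rate of change is below tol."""
    for _ in range(max_steps):
        rates = deriv(state)
        if max(abs(r) for r in rates) < tol:
            return state  # all densities are (numerically) time-invariant
        state = [x + dt * r for x, r in zip(state, rates)]
    raise RuntimeError("no steady state reached within max_steps")


def toy_network(state: List[float]) -> List[float]:
    """Toy example: x is produced at a constant rate, y is activated by x."""
    x, y = state
    dx = 1.0 - x                                   # production 1, degradation x
    dy = 2.0 * hill_activation(x, 1.0, 2.0) - y    # Hill-activated production, degradation y
    return [dx, dy]
```

Calling find_steady_state(toy_network, [0.0, 0.0]) settles near x = 1 and y = 1. Running the same search before and after a gene is added, then subtracting the two results, gives the change between the original and the new stable condition.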

Hill Equations

Find Stable Network Condition

Find Changes From Original Stable Condition to New Condition

Predict

Predict Abstract

In some cases, an exogenous gene is imported to enhance or suppress the expression of specific genes in the engineered bacterium itself, but choosing an appropriate regulatory gene is hard. Our software analyzes the GRN forward and also searches backward with an optimization algorithm, giving users a reference for that choice. It considers not only direct regulation but also the global GRN. Global prediction also makes it possible to control the expression of multiple genes in the network at the same time. The backward search relies on the Particle Swarm Optimization (PSO) algorithm.
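
To give a feel for the backward search, here is a generic PSO minimizer. It is a textbook-style sketch, not the software's implementation: the inertia and acceleration coefficients, the particle count and the fixed random seed are assumptions, and the function name pso_minimize is hypothetical. In the prediction setting, the objective would presumably measure the mismatch between the target expression levels and the simulated stable condition for a candidate regulation vector.

```python
import random
from typing import Callable, List, Sequence, Tuple


def pso_minimize(objective: Callable[[List[float]], float],
                 bounds: Sequence[Tuple[float, float]],
                 n_particles: int = 20, n_iters: int = 100,
                 w: float = 0.7, c1: float = 1.5, c2: float = 1.5,
                 seed: int = 0) -> Tuple[List[float], float]:
    """Minimize objective inside box bounds with a basic global-best PSO."""
    rng = random.Random(seed)  # seeded so that runs are repeatable
    dim = len(bounds)
    pos = [[rng.uniform(lo, hi) for lo, hi in bounds] for _ in range(n_particles)]
    vel = [[0.0] * dim for _ in range(n_particles)]
    best_pos = [p[:] for p in pos]               # each particle's best position so far
    best_val = [objective(p) for p in pos]
    g = min(range(n_particles), key=best_val.__getitem__)
    g_pos, g_val = best_pos[g][:], best_val[g]   # swarm-wide best so far
    for _ in range(n_iters):
        for i in range(n_particles):
            for d in range(dim):
                r1, r2 = rng.random(), rng.random()
                # Inertia + pull toward own best + pull toward swarm best.
                vel[i][d] = (w * vel[i][d]
                             + c1 * r1 * (best_pos[i][d] - pos[i][d])
                             + c2 * r2 * (g_pos[d] - pos[i][d]))
                lo, hi = bounds[d]
                pos[i][d] = min(max(pos[i][d] + vel[i][d], lo), hi)  # clamp to bounds
            val = objective(pos[i])
            if val < best_val[i]:
                best_val[i], best_pos[i] = val, pos[i][:]
                if val < g_val:
                    g_val, g_pos = val, pos[i][:]
    return g_pos, g_val
```

As a toy check, minimizing the squared distance to a target point inside the box [-1, 1] × [-1, 1] should drive the best value close to zero.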

Input Target

Particle Swarm Optimization

Filter

Database

TF-TF

This file contains the regulatory relationships between transcription factors.

TF-Gene

Gene Info

Promoter Info

TU Info

@@ Line 157: / Line 157: @@
                 </p>
-<div align="center"><img src="../../method/Figure 1.png" />
+<div align="center"><img src="https://static.igem.org/mediawiki/igem.org/7/7d/USTC_Software_Figure_1.png" />
 <p align="center"><strong>Figure 1.</strong> Structure of Operon</p></div>
 <p align="justify">An operon is made up of several structural genes arranged under a common promoter and
@@ Line 172: / Line 172: @@
 site other than the operator, to stimulate transcription.
 </p>
-<div align="center"><img style="width:600px;" src="../../method/Figure 2.png"/>
+<div align="center"><img style="width:600px;" src="https://static.igem.org/mediawiki/igem.org/2/25/USTC_Software_Figure_2.png"/>
 <p align="justify"><strong>Figure 2.</strong> Regulation of Operon
 : RNA Polymerase, 2: Repressor, 3: Promoter, 4: Operator, 5: Lactose, 6: lacZ, 7:
@@ Line 188: / Line 188: @@
 				<div class="jobs_trigger"><strong>Regulatory Model</strong></div>
 				<div class="jobs_item" style="display: none;"><p align="justify">Regulation of gene expression includes four levels. We choose the transcriptional level to simulate the regulation both for its significance and model simplification.</p>
-                 <div align="center"><img style="width:600px; height:auto;"src="../../method/Figure 3.png" />
+                 <div align="center"><img style="width:600px; height:auto;"src="https://static.igem.org/mediawiki/igem.org/8/87/USTC_Software_Figure_3.png" />
                  <p><strong>Figure 3.</strong>Regulation of gene expression.<br />Our regulation model is built based on the operon theory.<br /> The promoter region is regarded as the main regulatory region.</p></div>
        </div>
@@ Line 241: / Line 241: @@
 that some similarities are out of statistic significance.</p>
 <div align="center">
-<img src="../../method/Figure 4.png" />
+<img src="https://static.igem.org/mediawiki/igem.org/8/89/USTC_Software_Figure_4.png" />
 <p><strong>Figure 4.</strong> Random similarity distribution</p></div>
@@ Line 269: / Line 269: @@
 				<div class="jobs_item" style="display: block;"><p align="justify">If there is a three-unit network and they interact with each other as it is shown in the figure.
 The regulation is described by the GRN matrix.</p>
-<div align="center"><img src="../../method/3.png" />
+<div align="center"><img src="https://static.igem.org/mediawiki/igem.org/8/8a/USTC_Software_Figure_5.png" />
 <p style="font-size:18px;"><strong>Figure 5.</strong> Example network and its GRN matrix.</p></div>
@@ Line 280: / Line 280: @@
 <p>
 The construction is equivalent to add a new column and a row into the original matrix.</p>
-<div align="center"><img src="../../method/4.png" />
+<div align="center"><img src="https://static.igem.org/mediawiki/igem.org/9/97/USTC_Software_Figure_6.png" />
 <p><strong>Figure 6.</strong> Mathematical Equivalence</p></div>
 <p>When filling the column, D is compared with the regulators of the unit in each row. The
@@ Line 299: / Line 299: @@
 similarity is used because it is the main region.</p>
 <div align="center">
-<img src="../../method/5.png" />
+<img src="https://static.igem.org/mediawiki/igem.org/c/c5/USTC_Software_Figure_7.png" />
 <p><strong>Figure 7.</strong> Construct New GRN</p></div>

Team:USTC-Software/Project/Method