Team:Alberta/Overview

From 2013.igem.org

(Difference between revisions)
 
(13 intermediate revisions not shown)
Line 55: Line 55:
     }
     }
     .sidebar {
     .sidebar {
-
       position:fixed;
+
       position: fixed;
-
       top:190px;
+
       top: 157px;
-
       float:left;
+
       float: left;
-
       z-index:3;
+
       z-index: 3;
-
       padding: 0 0px 0px 40px;
+
       padding: 10px 10px 10px 10px;
 +
      width: 230px;
 +
    }
 +
    .sidebar_block {
 +
      background-color:white;
 +
      margin:1px;
 +
      border:1px grey solid;
 +
      padding:5px 10px 5px 10px;
 +
      border-radius:5px;
 +
      box-shadow: 0px 0px 20px #444444;
     }
     }
     .main {
     .main {
Line 399: Line 408:
             -o-box-shadow: 2px 2px 6px rgba(0, 0, 0, 0.28);
             -o-box-shadow: 2px 2px 6px rgba(0, 0, 0, 0.28);
               box-shadow: 2px 2px 6px rgba(0, 0, 0, 0.28);
               box-shadow: 2px 2px 6px rgba(0, 0, 0, 0.28);
 +
    }
 +
    a.anchor{
 +
      display: block;
 +
      position: relative;
 +
      top: -180px;
 +
      visibility: hidden;
     }
     }
   </style>
   </style>
Line 405: Line 420:
     <div class="titlebar">
     <div class="titlebar">
<br>
<br>
-
      <h3><font color="#A80000"><font size="6">The Littlest Mapmaker</font></font></h3>
+
<img src="/wiki/images/4/41/Ab_mapmen_title.png" width="600" height="95"></img><img src="/wiki/images/a/a5/2013-igem-logo.png" width="198" height="95"></img>
-
      <h4><i><font size="2">"Exploration into the world of <font color="#A80000">DNA Computing</font>"<br/>
+
-
        Team Alberta: University of Alberta</font></i></h4>
+
-
      <a href="http://www.ualberta.ca" class="ualberta-logo"><img src="/wiki/images/b/b3/Ualberta-logo.png" alt="University of Alberta"></img></a>
+
-
      <a href="https://2013.igem.org/Team:Alberta" class="igem-logo"><img src="/wiki/images/a/a5/2013-igem-logo.png" alt="iGem Main Page"></img></a>
+
     </div>
     </div>
     <div class="bin">
     <div class="bin">
Line 422: Line 433:
             <li><a href="/Team:Alberta/Parts">Submitted Parts</a></li>
             <li><a href="/Team:Alberta/Parts">Submitted Parts</a></li>
             <li><a href="/Team:Alberta/Accomplishments">Accomplishments</a></li>
             <li><a href="/Team:Alberta/Accomplishments">Accomplishments</a></li>
-
            <li><a href="/Team:Alberta/Attributions">Attributions</a></li>
 
           </ul>
           </ul>
         </li>
         </li>
Line 430: Line 440:
             <li><a href="https://igem.org/Team.cgi?year=2013&team_name=Alberta">Official Profile</a></li>
             <li><a href="https://igem.org/Team.cgi?year=2013&team_name=Alberta">Official Profile</a></li>
             <li><a href="/Team:Alberta/Sponsors">Sponsors</a></li>
             <li><a href="/Team:Alberta/Sponsors">Sponsors</a></li>
 +
            <li><a href="/Team:Alberta/Attributions">Attributions</a></li>
           </ul>
           </ul>
         </li>
         </li>
Line 448: Line 459:
       </ul>
       </ul>
       <div class="sidebar">
       <div class="sidebar">
-
         <div href class="tooltip"><img src="/wiki/images/7/7d/SideChar.png"></img>  
+
         <div class="sidebar_block">
-
           <!--span class="saying">Welcome to the Team Alberta Wiki!
+
          <h5>Places</h5>
-
            <div id="box" style="width: 450px; padding: 5px; border: 3px solid #000; background-color: #000000;">
+
          <a href="#Top"><p>Top</p></a>
-
             <div id="template" style="text-align: center; font-weight: bold; font-size: large; color: #f6f6f6; padding: 5px;">
+
          <a href="#Travelling"><p>Travelling Salesman Problem</p></a>
-
              For visitors: this site is currently under construction. Please contact our Student Liason, Dawson at                  daocun@ualberta.ca, for more information on our current project and how to support us!
+
          <a href="#Biological-Terms"><p>In Biological Terms</p></a>
-
             </div>
+
           <a href="#Finding-the-Path"><p>Finding the Path</p></a>
-
             </div>
+
          <a href="#Mapmakers"><p>MapMakers at Work</p></a>
-
          </span-->
+
          <a href="#Weight"><p>Calculating Gene Concentration</p></a>
 +
          <h5>Project Sections</h5>
 +
             <a href="/Team:Alberta/Background"><p>Background</p></a>
 +
            <a href="/Team:Alberta/Overview" class="active"><p>Overview</p></a>
 +
            <a href="/Team:Alberta/Results"><p>Results</p></a>
 +
             <a href="/Team:Alberta/Protocols"><p>Protocols</p></a>
 +
             <a href="/Team:Alberta/Parts"><p>Submitted Parts</p></a>
 +
            <a href="/Team:Alberta/Accomplishments"><p>Accomplishments</p></a>
         </div>
         </div>
       </div>
       </div>
       <div class="main">
       <div class="main">
 +
      <a class="anchor" id="Top"></a>
       <div class="block">
       <div class="block">
         <p class="content-title">Overview</p>
         <p class="content-title">Overview</p>
       </div>
       </div>
 +
      <a class="anchor" id="Travelling"></a>
       <div class="block">
       <div class="block">
         <h2>The Travelling Salesman Problem</h2>
         <h2>The Travelling Salesman Problem</h2>
-
         <p>The Littlest Mapmaker is Team Alberta's endeavour to create a biological computer capable of solving  
+
         <p>MapMen is Team Alberta's endeavour to create a biological computer capable of solving  
           Travelling Salesman Problems (TSP), a general form of problem that asks:</p>
           Travelling Salesman Problems (TSP), a general form of problem that asks:</p>
         <p class="centered-well">Given a set of cities, and a list of the distances between each pair of those cities,  
         <p class="centered-well">Given a set of cities, and a list of the distances between each pair of those cities,  
           what is the shortest possible route that travels to every city exactly once and then returns to the origin  
           what is the shortest possible route that travels to every city exactly once and then returns to the origin  
           city?</p>
           city?</p>
-
         <img src="/wiki/images/6/6c/TSPexample.png" style="width:100%;"></img>
+
         <p><center><img src="/wiki/images/6/6c/TSPexample.png" align="middle" style="width:33%;"></center></p>
       </div>
       </div>
 +
      <a class="anchor" id="Biological-Terms"></a>
       <div class="block">
       <div class="block">
         <h2>In Biological Terms</h2>
         <h2>In Biological Terms</h2>
-
         <p>To solve a TSP biologically, we must create biochemical analogues for all of the components of the original  
+
         <p>To take advantage of the considerable information processing power of biology, we must create biochemical analogues for all of the components of the original problem. We begin by assigning a symbolic reporter gene or selectable marker gene to each of the six paths in our example map. The assignment is arbitrary; which gene is used for which path does not matter as long as it creates some kind of observable phenotype. For example, the path connecting the northeast city and the northwest city is symbolized by a gene coding for ampicillin resistance. The path from the northeast to city to the southeast meanwhile, is symbolized by a gene coding for kanamycin resistance.</p>
-
          problem. For the Littlest MapMaker, we do this by assigning a symbolic bacterial reporter gene or selectable
+
        <img src="/wiki/images/4/4c/Bio_terms.png" style="width:100%;"></img>
-
          marker gene to each of the paths that connect a pair of cities. For instance, for the example map featured 
+
<p>Now, just as the travelling salesman might describe taking a particular route to his customers – leaving from the northeast origin city and taking the road to the southwest, then turning up the north road from there, then back down the southeast, before returning northward home from, we can also represent that series of paths as a sequence of genes: ChlorR, RFP, GFP, and KanR.</p>
-
          here, the path connecting cities A and B is symbolized by a gene coding for ampicillin antibiotic 
+
<img src="/wiki/images/b/bd/Path_to_gene.png" style="width:100%;"></img>
-
          resistance. This allows us to describe routes through the network as a sequence of genes, standing for a
+
-
          sequence of paths. Again for example, the route travelling from city A to B, then to C, and then to D before
+
-
          returning to A is described by the sequence ampicillin resistance gene, followed by green fluorescent 
+
-
          protein gene, then blue amyl chromoprotein gene, and finally chloramphenicol resistance gene. </p>
+
-
        <img src="/wiki/images/f/fa/2013Alberta-bio-color-map.png" style="width:100%;"></img>
+
       </div>
       </div>
 +
      <a class="anchor" id="Finding-the-Path"></a>
       <div class="block">
       <div class="block">
         <h2>Finding the Path</h2>
         <h2>Finding the Path</h2>
-
         <p>Our biocomputer tests the different routes by assembling plasmids with the corresponding gene sequences.       
+
         <p>Just as a computer using a brute force approach to the travelling salesman problem would test every route and tally the sum distance of the paths travelled, our biocomputer assembles a multitude of plasmids, representing every possible combination of paths, through a set of ligations that represent the legs of the salesman’s journey. In the first leg, we depart from the origin city along any of the available roads: AmpR, ChlorR, or KanR. In the plasmid production, we begin with a combinatorial ligation – one where all three of the genes are being ligated with origins of replication, but only one of the three is able to ligate onto each Ori, resulting in a collection of incomplete plasmids that each includes just one of the three genes. </p>
-
          All of the various possible plasmids are assembled simultaneously, through a series of biased ligation 
+
<center><img src="/wiki/images/9/95/AB_Genemap.png" style="width:35%;"></img></center>
-
          reactions that prefer assembly of plasmids that use short paths — that is, those genes that symbolize 
+
<p>Following the first gene ligation, a short linker piece is added, signifying the city that has been reached – all of the linkers are identical, there is no differentiation of routes based on this component. In the second leg of the journey, we perform a ligation with three more genes, representing all of the possible paths the salesman might take next – three colour genes. Again, each growing plasmid can only incorporate one of the new genes, so we obtain a random assortment of incomplete plasmids that each include one antibiotic resistance from the first ligation and one colour gene from the third ligation. </p>
-
          shorter paths are more likely to be incorporated into a plasmid than those which symbolize longer paths (see 
+
<img src="/wiki/images/7/7e/AB_Assemblyfull.png" style="width:100%;"></img>
-
          “Building the Routes” below for more detail on how we create this preference). Billions of plasmids are 
+
<p>The third leg of the journey is identical to the second, incorporating an additional colour gene, and the final leg uses the three antibiotic genes once again. At the end of this process, we will have obtained a randomized distribution of 76 different kinds of plasmids representing all of the different possible permutations of the genes, capable of producing 18 unique phenotypes, and representing the six bi-directional solutions to the four-city travelling salesman problem.<p/>
-
          produced in this way, and because of the bias in favour of short path genes, the predominating products are 
+
-
          gene sequences that correspond to very short routes. Identifying the predominant product of the assembly 
+
-
          would, in turn, identify the shortest path and therefore the solution to the problem. </p>
+
       </div>
       </div>
 +
      <a class="anchor" id="Mapmakers"></a>
       <div class="block">
       <div class="block">
         <h2>MapMakers at Work</h2>
         <h2>MapMakers at Work</h2>
-
         <p>Unfortunately, as a byproduct of our assembly method, many of the plasmids generated this way do not
+
         <p>Once the plasmids have been produced, how can we use them to identify the correct solution? The answer is in the frequency with which each plasmid occurs: We create bias in the ligations during assembly so that genes that represent shorter paths are more likely to be incorporated into the plasmids, meaning in turn that the final product plasmids are more likely to be composed of several short-path genes. As a result, although we still expect to see plasmids representing every possible route, those plasmids that represent the shortest routes through the map will be much more common than plasmids representing lengthy routes. By identifying the most abundant plasmid, we identify the shortest route and thus the optimal solution. </p>
-
          correspond to any valid route through the assigned set of cities, and these need to be filtered out so that
+
         <p>The bias is introduced through the concentration of the reactants in each ligation. If a gene is meant to represent a short path, its concentration in the ligation will be higher than the others, and conversely, genes that are meant to represent lengthy paths will be included in lower concentrations than the others. As a result, at any given ligation step, the gene symbolizing the shortest available path is more likely to be incorporated. Among the product plasmids, genes are incorporated at a rate inversely proportional to their length. The concentration used for each gene in the ligation is described by the expression: –ln[Gene]∝(path distance)</p>
-
          the frequency of production of each of the valid routes can be compared. To compute the validity of the  
+
        <img src="/wiki/images/9/95/Proportions.png" style="width:100%;"></img>
-
          plasmids, we transform them into a culture of E. coli, which are then grown on antibiotic-treated plates and
+
<p>To identify which of the plasmids is most frequent, the products are transformed into a bacterial culture. Because each of the genes produces an easily interpreted phenotype, this allows us to infer the most frequently transformed plasmid from the frequency of the colonies produced on agar plates. The plasmid carried by any given colony on the plates below can be readily identified by the combination of the colony colour (purple colonies have RFP and aCP, orange colonies have RFP and GFP, and green colonies have GFP and aCP), and the antibiotic resistances it exhibits, which are inferred from the plate on which it is growing. As an example, a purple colony growing on the kan/amp plate must have the Amp-Red-Blue-Kan-plasmid shown in the first column below.<p/>
-
          allowed to express their reporter genes, making it simple to unambiguously identify the plasmid employed by
+
<img src="/wiki/images/6/6d/Ab_Solution_identification.png" style="width:100%;"></img>
-
          a given colony based on its colouration and resistances. </p>
+
<p>Many of the colonies do not correspond to any valid route. For instance, an orange colony on the kan/amp plate has the plasmid Amp-Red-Green-Kan, which does not describe a valid circuit visiting each city exactly once – but because we can tell that this is an invalid solution, we would not count this colony when assessing the frequency of the valid plasmids. Any four-city travelling salesman problem has just three unique solutions, and so there are only three phenotypes that must be counted: purple kan/amp, orange kan/chlor, and green amp/chlor. The colonies of these three phenotypes are counted and compared, and the most frequently occurring plasmid is the optimal solution! </p>
-
         <p>For example, consider the three possible routes for our sample four-city problem. The solution shown at the  
+
-
          right of the diagram below produces colonies that are coloured purple (a combination of the blue
+
-
          chromoprotein and red fluorescent protein reporter genes) and is able to survive on ampicillin/kanamycin
+
-
          plates. This is the only combination of genes that will result in this combination of colour and
+
-
          resistances. To determine which of the three routes is the solution to this travelling salesman problem, we
+
-
          plate the bacterial culture across these antibiotic plates and count the colonies that match with the three
+
-
          valid routes. The most commonly occurring bacterial genotype will correspond to the optimal route. </p>
+
-
        <img src="/wiki/images/8/8d/2013Alberta-mapmaker-states.png" style="width:100%;"></img>
+
       </div>
       </div>
 +
      <a class="anchor" id="Weight"></a>
       <div class="block">
       <div class="block">
-
        <h2>Building the Routes</h2>
+
      <h2>Calculating Gene Concentration</h2>
-
        <p>Our plasmid assembly system relies upon the achievements of previous Team Alberta iGEM entries, 2009’s BioBytes and 2010’s Genomikon assembly methods. The method relies on the aforementioned symbolic representation of the paths in the salesman's route with genes and pieces of DNA. A route starts at the origin of replication (the salesman's home city), travels along some gene (the first path of the planned journey), meets a linker (the city at the end of that path), then travels along a second gene (the second path) to another linker (the second city), on to a third gene (the third path) and a third linker (the third city), and finally a fourth gene (the final path) which must return to the origin (the addition of a final tail-linker that will close the plasmid).
+
      <img src="/wiki/images/5/57/2013Alberta_New_algorithm.png" style="width:100%;"></img>
-
</p>
+
-
 
+
-
        <p>The actual assembly begins with origins of replication anchored at one end to magnetic beads in a suspension within a reaction microfuge tube. The anchored strands have a single, free-floating sticky end. In the first ligation, the three genes representing the possible first paths on the salesman's route are added to the bead suspension, along with ligase and buffer. The genes are able to ligate on to the free sticky end of the origin of replication; the ligating of a given gene represents the salesman having planned to travel along that path. Once the new gene has been ligated on, we use a magnet to hold the beads (along with the anchored DNA) inside the reaction tube, while washing away the rest of the reaction, including the enzyme, buffers, and any non-ligated DNA that remains. The beads are then resuspended in a new reaction mixture, containing the next ligation step (the linker). The first and second ligation steps are shown in the schematics below.</p>
+
-
 
+
-
        <p><b>Applying Magnetic Bead Assembly to the TSP Problem</b></p>
+
-
        <div class="expandable"><img src="/wiki/images/4/48/2013Alberta-Building-the-Routes-1.png"></img></div>
+
-
 
+
-
        <p>The second ligation adds a short, 25-base-pair linker strand to the free end, which replaces the free                             
+
-
          sticky end with one that is able to receive another gene, allowing (after another wash) for the next gene to
+
-
          be ligated onto the growing strand. </p>
+
-
 
+
-
        <div class="expandable"><img src="/wiki/images/4/45/2013Alberta-Building-the-Routes-2.png"></img></div><br/>
+
-
 
+
-
        <img src="/wiki/images/c/c6/Poster2_678x448.jpg" style="width:100%;"></img>
+
-
 
+
-
        <p>Once all of the genes have been ligated (alternating with linkers), a tail-piece that complements the 
+
-
        original bead-anchor DNA sequence is added, so that the finished product can be unbound from the beads and 
+
-
        will close upon itself to form the circular plasmid. In this fashion, a four-gene, roughly 5000-base-pair 
+
-
        plasmid is assembled in as little as an afternoon, cheaply and easily. </p>
+
-
 
+
-
        <div class="expandable"><img src="/wiki/images/f/f2/2013Alberta-Building-the-Routes-3.png"></img></div>
+
-
 
+
-
        <p>When we ligate a new gene to the growing strand in this process, we provide the reaction with several 
+
-
        genes, representing every path that the travelling salesman problem might take at that step. We bias the 
+
-
        system to favour short paths by setting the concentration of the added genes based on the reciprocal of the 
+
-
        corresponding path’s length, as adjusted by a calibration coefficient (C in the formula below). As a result, 
+
-
        a gene that symbolizes a short path will be included in the reaction at higher concentration than a gene that 
+
-
        symbolizes a long path, and the shorter path genes will occur more frequently among the product plasmids.</p>
+
-
 
+
-
        <div class="centered"><img src="/wiki/images/e/e9/2013Alberta-Gene-Expression-Formula.png" style="width:60%;">
+
-
        </img></div>
+
-
 
+
-
        <p>For example, if the path corresponding to the ampicillin resistance gene in a particular TSP takes only one 
+
-
        unit of distance, while the chloramphenicol resistance gene path in the same problem takes four units of 
+
-
        distance, then we might use 0.4 picomoles of the AmpR gene, and 0.1 picomoles of the ChlorR gene when adding 
+
-
        those two genes to the ligation. We would then expect the ratio of products to be roughly 4:1 AmpR to ChlorR, 
+
-
        favouring the shortest path more in proportion to its distance. </p>
+
       </div>
       </div>
       </div>
       </div>

Latest revision as of 03:43, 29 October 2013


Overview

The Travelling Salesman Problem

MapMen is Team Alberta's endeavour to create a biological computer capable of solving Travelling Salesman Problems (TSP), a general form of problem that asks:

Given a set of cities, and a list of the distances between each pair of those cities, what is the shortest possible route that travels to every city exactly once and then returns to the origin city?

In Biological Terms

To take advantage of the considerable information processing power of biology, we must create biochemical analogues for all of the components of the original problem. We begin by assigning a symbolic reporter gene or selectable marker gene to each of the six paths in our example map. The assignment is arbitrary; which gene is used for which path does not matter as long as it creates some kind of observable phenotype. For example, the path connecting the northeast city and the northwest city is symbolized by a gene coding for ampicillin resistance. The path from the northeast to city to the southeast meanwhile, is symbolized by a gene coding for kanamycin resistance.

Now, just as the travelling salesman might describe taking a particular route to his customers – leaving from the northeast origin city and taking the road to the southwest, then turning up the north road from there, then back down the southeast, before returning northward home from, we can also represent that series of paths as a sequence of genes: ChlorR, RFP, GFP, and KanR.

Finding the Path

Just as a computer using a brute force approach to the travelling salesman problem would test every route and tally the sum distance of the paths travelled, our biocomputer assembles a multitude of plasmids, representing every possible combination of paths, through a set of ligations that represent the legs of the salesman’s journey. In the first leg, we depart from the origin city along any of the available roads: AmpR, ChlorR, or KanR. In the plasmid production, we begin with a combinatorial ligation – one where all three of the genes are being ligated with origins of replication, but only one of the three is able to ligate onto each Ori, resulting in a collection of incomplete plasmids that each includes just one of the three genes.

Following the first gene ligation, a short linker piece is added, signifying the city that has been reached – all of the linkers are identical, there is no differentiation of routes based on this component. In the second leg of the journey, we perform a ligation with three more genes, representing all of the possible paths the salesman might take next – three colour genes. Again, each growing plasmid can only incorporate one of the new genes, so we obtain a random assortment of incomplete plasmids that each include one antibiotic resistance from the first ligation and one colour gene from the third ligation.

The third leg of the journey is identical to the second, incorporating an additional colour gene, and the final leg uses the three antibiotic genes once again. At the end of this process, we will have obtained a randomized distribution of 76 different kinds of plasmids representing all of the different possible permutations of the genes, capable of producing 18 unique phenotypes, and representing the six bi-directional solutions to the four-city travelling salesman problem.

MapMakers at Work

Once the plasmids have been produced, how can we use them to identify the correct solution? The answer is in the frequency with which each plasmid occurs: We create bias in the ligations during assembly so that genes that represent shorter paths are more likely to be incorporated into the plasmids, meaning in turn that the final product plasmids are more likely to be composed of several short-path genes. As a result, although we still expect to see plasmids representing every possible route, those plasmids that represent the shortest routes through the map will be much more common than plasmids representing lengthy routes. By identifying the most abundant plasmid, we identify the shortest route and thus the optimal solution.

The bias is introduced through the concentration of the reactants in each ligation. If a gene is meant to represent a short path, its concentration in the ligation will be higher than the others, and conversely, genes that are meant to represent lengthy paths will be included in lower concentrations than the others. As a result, at any given ligation step, the gene symbolizing the shortest available path is more likely to be incorporated. Among the product plasmids, genes are incorporated at a rate inversely proportional to their length. The concentration used for each gene in the ligation is described by the expression: –ln[Gene]∝(path distance)

To identify which of the plasmids is most frequent, the products are transformed into a bacterial culture. Because each of the genes produces an easily interpreted phenotype, this allows us to infer the most frequently transformed plasmid from the frequency of the colonies produced on agar plates. The plasmid carried by any given colony on the plates below can be readily identified by the combination of the colony colour (purple colonies have RFP and aCP, orange colonies have RFP and GFP, and green colonies have GFP and aCP), and the antibiotic resistances it exhibits, which are inferred from the plate on which it is growing. As an example, a purple colony growing on the kan/amp plate must have the Amp-Red-Blue-Kan-plasmid shown in the first column below.

Many of the colonies do not correspond to any valid route. For instance, an orange colony on the kan/amp plate has the plasmid Amp-Red-Green-Kan, which does not describe a valid circuit visiting each city exactly once – but because we can tell that this is an invalid solution, we would not count this colony when assessing the frequency of the valid plasmids. Any four-city travelling salesman problem has just three unique solutions, and so there are only three phenotypes that must be counted: purple kan/amp, orange kan/chlor, and green amp/chlor. The colonies of these three phenotypes are counted and compared, and the most frequently occurring plasmid is the optimal solution!

Calculating Gene Concentration