Team:Penn/Software

From 2013.igem.org

(Difference between revisions)
Line 26: Line 26:
<div class = "textwrap">
<div class = "textwrap">
-
<b><center><h1>
+
<header><h1><b><center>The MaGellin Software Package</center></b></h1></header>
-
MaGellin
+
</br>
-
</b></center></h1>
+
</br>
-
<br>
+
For a detailed, graphical explanation of the MaGellin work flow, please download the <a href="https://dl.dropboxusercontent.com/u/11828463/MaGellin%20Spec%20Sheet.pdf">MaGellin Workflow Specifications Sheet</a>, which includes all of the steps in the MaGellin workflow.
-
<b><center>
+
</br>
-
 
+
</br>
-
Open Source Automated, Quantifiable DNA Analysis for our Assay
+
</br>
-
</b></center>
+
<b>Overview</b> It was clear that not all gels would be simple or fast to read by eye, and we wanted to be able to quantify relative intensities of bands between samples. We also wanted to solve our unique problem of calculating how band lengths will change when there’s full methylation, site-specific methylation, or no methylation. The solution can change whenever you clone in a new site-specific methylase. We took this problem to the world’s largest college hackathon, and had a functional software package after a sleepless 48 hours. It has since been further refined, and certain elements will be made available to the DIYBio community alongside standardized hardware through collaboration with the biotech start-up GeneFoo.
-
<br>
+
</br>
-
MaGellin is a MATLAB script that uses computer vision algorithms to calculate the location and intensity of DNA bands. It also has a bioinformatics module to compare the lengths of bands with the expected lengths, based on the methylation sensitivity of the enzymes and the plasmid of the sequence. So, MaGellin is more than another gel quantifier - it interprets the biological meaning of the band lengths and returns experimentally relevant analyses. MaGellin is crucial for our workflow because it allows for clear input/output. Since our restriction digests always yield bands in predictable locations, MaGellin eliminates user bias and standardizes data sets across trials and labs, accelerating the pace of discovery.  
+
</br>
-
<br>
+
</br>
-
<br>
+
The MaGellin Software Package is a MATLAB script that uses computer vision algorithms to calculate the location and intensity of DNA bands. It also has a bioinformatics module to compare the lengths of bands with the expected lengths, based on the methylation sensitivity of the enzymes and the sequence of the plasmid. So, this program is more than another gel quantifier - it interprets the biological meaning of the band lengths and returns experimentally relevant analyses. The MaGellin Software Package is crucial for our workflow because it allows for clear input/output. Since our restriction digests always yield bands in predictable locations, user bias is eliminated and data sets are standardized across trials and labs, accelerating the pace of discovery.  
-
MaGellin converts gel images into standardized data sets, making data analysis quick, easy and unbiased.
+
</br>
-
<br>
+
</br>
-
<img src="http://s21.postimg.org/z98hqbrdj/gellin.png">
+
</br>
-
<br>
+
<b>Protocol</b>
-
<br>
+
<ol>
-
A User-friendly GUI that is easy to navigate makes it ideal for first time users.
+
<li>Upload gel picture from restriction enzyme digest. Fill out all relevant information on the graphical user interface. Enter plasmid and target sequences and select restriction enzymes used. Enter descriptive names for gel and lanes if desired. </li>
-
<br>
+
<li>Press the Analyze button.</li>  
-
<img src="http://s8.postimg.org/5cd9z4b79/gellingui.png">
+
<li>The Magellin Software Package will calculate the intensity and position of each band and produce a graph. Save your results. </li>
-
<br>
+
<li>Collaborate with your fellow scientists by sharing your results on SkyDrive.</li>
-
<br>
+
<ol>
-
The bioinformatics module simulates DNA digestion, so you know what bands to expect before even touching a pipette.
+
</br>
-
<br>
+
</br>
-
<img src="http://s14.postimg.org/6dwg73o5t/gellinbioinfo.png">
+
</br>
-
<br>
+
<b>Details on the Software's Inner Workings</b>Before the analysis is performed, the image is subject to various procedures intended to remove ambient noise in the data. These methods include determining the average amount of noise bias per row, and convolving the image with several standard filters. Next, the MaGellin software finds the lanes in the gel. This is accomplished by running an edge detection algorithm that uses the location of edges of individual bands to infer the location of the center of each lane. The edge detection algorithm functions by treating edges as solutions to variational equations. Once the center of each lane is determined, MaGellin Software is ready to perform its analysis. The user inputs the plasmid sequence in question, and MaGellin Software converts these values into vertical coordinates on the gel by performing a logarithmic regression on the length of the user input sequences, relying on the factory-included values for the lengths of the bands of the NEB 2-Log ladder. Now MaGellin knows where to look, and it knows what it is looking for. All that’s left to do is run the analysis. MaGellin searches in the locations in question, and if a band of suitably high intensity is found, Magellin records the intensity at this location. If no such band is found, Magellin records the intensity as 0 to indicate that no band was found in the area. The data is then extrapolated into a graph with the relevant biological meaning. Usually, we perform our experiments in triplicate, so we can run a 2-way ANOVA.  
-
<br>
+
</br>
-
Computer vision algorithm calculates location and intensity of DNA bands.
+
</br>
-
<br>
+
</br>
-
<img src="http://s16.postimg.org/6smb69xt1/gellinvision.png">
+
-
<br>
+
-
<br>
+
-
SkyDrive provides open access to data and encourages collaborations. MaGellin is the interface to our assay, and it acts to abstract our plasmid system the same way a programming language abstracts computer hardware.
+
-
<br>
+
-
<img src="http://s18.postimg.org/fusjsrq1l/gellindrive.png">
+
         </div>
         </div>

Revision as of 08:33, 27 October 2013

Penn iGEM

The MaGellin Software Package



For a detailed, graphical explanation of the MaGellin work flow, please download the MaGellin Workflow Specifications Sheet, which includes all of the steps in the MaGellin workflow.


Overview It was clear that not all gels would be simple or fast to read by eye, and we wanted to be able to quantify relative intensities of bands between samples. We also wanted to solve our unique problem of calculating how band lengths will change when there’s full methylation, site-specific methylation, or no methylation. The solution can change whenever you clone in a new site-specific methylase. We took this problem to the world’s largest college hackathon, and had a functional software package after a sleepless 48 hours. It has since been further refined, and certain elements will be made available to the DIYBio community alongside standardized hardware through collaboration with the biotech start-up GeneFoo.


The MaGellin Software Package is a MATLAB script that uses computer vision algorithms to calculate the location and intensity of DNA bands. It also has a bioinformatics module to compare the lengths of bands with the expected lengths, based on the methylation sensitivity of the enzymes and the sequence of the plasmid. So, this program is more than another gel quantifier - it interprets the biological meaning of the band lengths and returns experimentally relevant analyses. The MaGellin Software Package is crucial for our workflow because it allows for clear input/output. Since our restriction digests always yield bands in predictable locations, user bias is eliminated and data sets are standardized across trials and labs, accelerating the pace of discovery.


Protocol
  1. Upload gel picture from restriction enzyme digest. Fill out all relevant information on the graphical user interface. Enter plasmid and target sequences and select restriction enzymes used. Enter descriptive names for gel and lanes if desired.
  2. Press the Analyze button.
  3. The Magellin Software Package will calculate the intensity and position of each band and produce a graph. Save your results.
  4. Collaborate with your fellow scientists by sharing your results on SkyDrive.



    1. Details on the Software's Inner WorkingsBefore the analysis is performed, the image is subject to various procedures intended to remove ambient noise in the data. These methods include determining the average amount of noise bias per row, and convolving the image with several standard filters. Next, the MaGellin software finds the lanes in the gel. This is accomplished by running an edge detection algorithm that uses the location of edges of individual bands to infer the location of the center of each lane. The edge detection algorithm functions by treating edges as solutions to variational equations. Once the center of each lane is determined, MaGellin Software is ready to perform its analysis. The user inputs the plasmid sequence in question, and MaGellin Software converts these values into vertical coordinates on the gel by performing a logarithmic regression on the length of the user input sequences, relying on the factory-included values for the lengths of the bands of the NEB 2-Log ladder. Now MaGellin knows where to look, and it knows what it is looking for. All that’s left to do is run the analysis. MaGellin searches in the locations in question, and if a band of suitably high intensity is found, Magellin records the intensity at this location. If no such band is found, Magellin records the intensity as 0 to indicate that no band was found in the area. The data is then extrapolated into a graph with the relevant biological meaning. Usually, we perform our experiments in triplicate, so we can run a 2-way ANOVA.