Team:Penn/Software

From 2013.igem.org

Revision as of 03:50, 29 October 2013 by Tycko (Talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)

Penn iGEM

The MaGellin Software Package



For a detailed, graphical explanation of the MaGellin work flow, please download the MaGellin Workflow Specifications Sheet, which includes all of the steps in the MaGellin workflow.


Overview

It was clear that not all gels would be simple or fast to read by eye, and we wanted to be able to quantify relative intensities of bands between samples. We also wanted to solve our unique problem of calculating how band lengths will change when there’s full methylation, site-specific methylation, or no methylation. The solution can change whenever you clone in a new site-specific methylase. We took this problem to the world’s largest college hackathon (PennApps), and had a functional software package after a sleepless 48 hours. It has since been further refined, and certain elements will be made available to the DIYBio community alongside standardized hardware through collaboration with the biotech start-up GeneFoo.


MaGellinWorkflow1
The MaGellin Software Package is part of the MaGellin Assay workflow, automating complex data analysis.



The MaGellin Software Package is a MATLAB script that uses computer vision algorithms to calculate the location and intensity of DNA bands. It also has a bioinformatics module to compare the lengths of bands with the expected lengths, based on the methylation sensitivity of the enzymes and the sequence of the plasmid. So, this program is more than another gel quantifier - it interprets the biological meaning of the band lengths and returns experimentally relevant analyses. The MaGellin Software Package is crucial for our workflow because it allows for clear input/output. Since our restriction digests always yield bands in predictable locations, user bias is eliminated and data sets are standardized across trials and labs, accelerating the pace of discovery.


Protocol
  1. Upload gel picture from restriction enzyme digest. Fill out all relevant information on the graphical user interface. Enter plasmid and target sequences and select restriction enzymes used. Enter descriptive names for gel and lanes if desired.
  2. Press the Analyze button.
  3. The Magellin Software Package will calculate the intensity and position of each band and produce a graph. Save your results.
  4. Collaborate with your fellow scientists by sharing your results on SkyDrive.



MaGellinWorkflow2
The Magellin Software Package will calculate the intensity and position of each band and produce a graph.



Details on the Software's Inner Workings.

Before the analysis is performed, the image is subject to various procedures intended to remove ambient noise in the data. These methods include determining the average amount of noise bias per row, and convolving the image with several standard filters. Next, the MaGellin software finds the lanes in the gel. This is accomplished by running an edge detection algorithm that uses the location of edges of individual bands to infer the location of the center of each lane. The edge detection algorithm functions by treating edges as solutions to variational equations. Once the center of each lane is determined, MaGellin Software is ready to perform its analysis. The user inputs the plasmid sequence in question, and MaGellin Software converts these values into vertical coordinates on the gel by performing a logarithmic regression on the length of the user input sequences, relying on the factory-included values for the lengths of the bands of the NEB 2-Log ladder. Now MaGellin knows where to look, and it knows what it is looking for. All that’s left to do is run the analysis. MaGellin searches in the locations in question, and if a band of suitably high intensity is found, Magellin records the intensity at this location. If no such band is found, Magellin records the intensity as 0 to indicate that no band was found in the area. The data is then extrapolated into a graph with the relevant biological meaning. Usually, we perform our experiments in triplicate, so we can run a 2-way ANOVA.


MaGellinWorkflow3
MaGellin uses a complex edge detection and visualization algorithm to compute band intensities and quanify gel data.


←Previous Next→