Team:Heidelberg/Tempaltes/iGEM42-W-10b

From 2013.igem.org

Scraping of BioBrick count

There is an API provided for accessing the registry, but unfortunately it only allows for search of a single specified BioBrick id. The fastest, but still very slow way to connect a team with it's BioBricks is doing an API request for every id in the teams parts range. The only assumption we made in order to speed things up, is that the BioBricks were submitted in a continuous parts range. Thus the matching is aborted when the first BioBrick in the parts rang can't be found.

Implementation of Scoring

In order to solve the very sensitive problem of rating a team's success we started out with a subjective scoring by every one of our team members for the different awards and the medals. This native scoring turned out to be pretty consistent an thus we just calculated mean values and put the on an exponential scale in order to achieve a harsh separation of the highest top scoring teams and those who didn't get that far. For every team the score of the single awards was added up. As the awards rewarded differend every year, we normalised the summarized score of the teams in one year to a scale from 0 to 100%.
The whole analysis was added to the R-script converting the JSON file to the RData file.

Further text analysis and data conversion

In order to do the text analysis for the methods extraction we collected a raw list of methods, which is displayed in table 10.1. They were clustered in Preprocessing, Processing and Analysis. The script doing the analysis in python again stems both the abstract and the methods and matches them.

Table 10.1: Clustered methods for text analysis
Preprocessing
Fusion Proteins Primer Design cloning
preparation of DNA Restriction Digestion Insert preparation
cell fractionation cell counting
Processing
DNA sequencing PCR DNA Microarray
arrays interaction chromatography purification
Gel extraction Ligation Transformation
FRET DNA extraction patch clamp
Analysis
Northern Blot Southern Blot Western blotting
Bioinformatics ELISA Chromatography
flow cytometry X-Ray-crystallography NMR
Electron microscopy Molecular dynamics coimmunoprecipitation
Electrophoretic mobility shift assay southwestern blotting size determination
gel electrophoresis macromolecule blotting and probing immuno assays
phenotypic analysis imaging spectroscopy
spectrometry

The RData file was extended and now also includes the meshterms within the data-list as well as the information content and the track in the data-frame.