Team:TU Darmstadt/modelling/Structure

From 2013.igem.org

(Difference between revisions)
Line 1: Line 1:
<html>
<html>
 +
<style type="text/css">
<style type="text/css">
-
body
+
body  
{
{
margin:0;  
margin:0;  
Line 35: Line 36:
     background-repeat: no-repeat;
     background-repeat: no-repeat;
     background-size: 100% 100% ;
     background-size: 100% 100% ;
-
     border-style: none;
+
     border-style: none;;}
-
}
+
#p-logo { display:none;}
#p-logo { display:none;}
Line 46: Line 46:
{
{
position: absolute;
position: absolute;
-
top: 240px;  
+
top: 150px;  
-
left: 200px;
+
left: 30px;
}
}
#mm_icon2
#mm_icon2
{
{
-
position: relative;
+
position: absolute;
-
top: 170px;  
+
top: 150px;  
left: 350px;
left: 350px;
}
}
Line 67: Line 67:
{
{
position: absolute;
position: absolute;
-
top: 260px;  
+
top: 150px;  
-
left: 230px;
+
left: 30px;
background:white;
background:white;
filter:alpha(opacity=83); opacity:0.83;
filter:alpha(opacity=83); opacity:0.83;
Line 80: Line 80:
{
{
position: absolute;
position: absolute;
-
top: 670px;  
+
top: 150px;  
left: 350px;
left: 350px;
background:white;
background:white;
Line 89: Line 89:
border-radius:15px;
border-radius:15px;
}
}
 +
 +
#abstracticon3
#abstracticon3
Line 94: Line 96:
position: absolute;
position: absolute;
top: 150px;  
top: 150px;  
-
left: 700px;
+
left: 670px;
background:white;
background:white;
filter:alpha(opacity=83); opacity:0.83;
filter:alpha(opacity=83); opacity:0.83;
Line 102: Line 104:
border-radius:15px;
border-radius:15px;
}
}
-
 
#taskbar
#taskbar
{
{
position:absolute;
position:absolute;
-
top:10px;
+
top:0px;
-
left:400px;
+
left:350px;
-
z-index: 5;
+
z-index: 2;
}
}
 +
 +
ul {
 +
    list-style: none;
 +
   
 +
}
 +
 +
li:before {
 +
    content: "• ";
 +
    color: white;
 +
}
 +
 +
 +
 +
dl.igemTUD2013gelpicture
 +
{
 +
border: 1px solid #000;
 +
background-color: #109f71;
 +
width: 110px;
 +
text-align: center;
 +
padding: 5px 5px 5px 5px;
 +
float: right;
 +
margin: 0 0 0 0;
 +
margin-left:15px
 +
}
 +
 +
.igemTUD2013gelpicture dt
 +
{
 +
font-weight: bold;
 +
background-color: #131210;
 +
color: #959289;
 +
padding: 0 0;
 +
margin-bottom: 10px;
 +
}
 +
 +
.igemTUD2013gelpicture dd img
 +
{
 +
border: 1px solid #000;
 +
width: 100px;
 +
height: 200px;
 +
}
 +
 +
.igemTUD2013gelpicture dd
 +
{
 +
margin: 0;
 +
padding: 5px 5px 5px 5px;
 +
font-size: 100%;
 +
text-align: left;
 +
}
 +
 +
 +
 +
 +
dl.igemTUD2013gelpicture2
 +
{
 +
border: 1px solid #000;
 +
background-color: #109f71;
 +
width: 210px;
 +
text-align: center;
 +
padding: 5px 5px 5px 5px;
 +
float: right;
 +
margin: 0 0 0 0;
 +
margin-left:15px
 +
}
 +
 +
.igemTUD2013gelpicture2 dt
 +
{
 +
font-weight: bold;
 +
background-color: #131210;
 +
color: #959289;
 +
padding: 0 0;
 +
margin-bottom: 10px;
 +
}
 +
 +
.igemTUD2013gelpicture2 dd img
 +
{
 +
border: 1px solid #000;
 +
width: 200px;
 +
height: 200px;
 +
}
 +
 +
.igemTUD2013gelpicture2 dd
 +
{
 +
margin: 0;
 +
padding: 5px 5px 5px 5px;
 +
font-size: 100%;
 +
}
 +
</style>
</style>
 +
 +
 +
<center>
 +
<!-- central main menu -->
 +
 +
<br>
 +
<br>
<!-- Taskbar -->
<!-- Taskbar -->
Line 133: Line 228:
<a href="https://2013.igem.org/Team:TU_Darmstadt/team">
<a href="https://2013.igem.org/Team:TU_Darmstadt/team">
<img alt="team" src="/wiki/images/a/a4/Darmstadt_green_Team.jpg" width="70" height="30"></a>
<img alt="team" src="/wiki/images/a/a4/Darmstadt_green_Team.jpg" width="70" height="30"></a>
 +
<br>
<br>
Line 206: Line 302:
-
<ol>
+
<ol align="left">
-
<li>Sequence is PSI-BLASTed against Uniprot [2]⁠</li>
+
<li>Sequence is PSI-BLASTed against Uniprot [2]
-
<li>Calculation of a position-specific scoring matrix (PSSM) from related sequences</li>
+
<li>Calculation of a position-specific scoring matrix (PSSM) from related sequences
-
<li>Using the PSSM to search the PDB for potential modeling templates</li>
+
<li>Using the PSSM to search the PDB for potential modeling templates
<li>The Templates are ranked based on the alignment score and the structural quality[3]⁠</li>
<li>The Templates are ranked based on the alignment score and the structural quality[3]⁠</li>
<li>Deriving additional information’s  for template and target (prediction of secondary structure, structure-based alignment correction by using SSALN scoring matrices [4])⁠.</li>
<li>Deriving additional information’s  for template and target (prediction of secondary structure, structure-based alignment correction by using SSALN scoring matrices [4])⁠.</li>
Line 246: Line 342:
We used the Yasara script hm_build.mcr for the model creation with the following parameters:
We used the Yasara script hm_build.mcr for the model creation with the following parameters:
 +
<ul style="margin-left:50px; margin-right:50px; text-align:justify; ">
-
    Modeling speed (slow = best): Slow
 
-
    Number of PSI-BLAST iterations in template search (PsiBLASTs): 3
 
-
    Maximum allowed PSI-BLAST E-value to consider template (EValue Max): 0.5
 
-
    Maximum number of templates to be used (Templates Total): 20
 
-
    Maximum number of templates with same sequence (Templates SameSeq): 1
 
-
    Maximum oligomerization state (OligoState): 4 (tetrameric)
 
-
    Maximum number of alignment variations per template: (Alignments): 5
 
-
    Maximum number of conformations tried per loop (LoopSamples): 50
 
-
    Maximum number of residues added to the termini (TermExtension): 10
 
 +
  <li> Modeling speed (slow = best): Slow</li>
 +
  <li> Number of PSI-BLAST iterations in template search (PsiBLASTs): 3</li>
 +
  <li> Maximum allowed PSI-BLAST E-value to consider template (EValue Max): 0.5</li>
 +
  <li>  Maximum number of templates to be used (Templates Total): 20</li>
 +
  <li> Maximum number of templates with same sequence (Templates SameSeq): 1</li>
 +
  <li> Maximum oligomerization state (OligoState): 4 (tetrameric)</li>
 +
  <li>  Maximum number of alignment variations per template: (Alignments): 5</li>
 +
  <li> Maximum number of conformations tried per loop (LoopSamples): 50</li>
 +
  <li>  Maximum number of residues added to the termini (TermExtension): 10 </li>
 +
</ul>
</body>
</body>

Revision as of 22:29, 3 October 2013











Homology Modelling

Although our proteins are functionally described in the literature and during the iGEM competition, only some of their structures are available in the Protein Data Bank. For further work and visualization, protein structures are indispensable. We used Yasara Structure [1] to calculate three-dimensional structures of all our proteins for iGEM.





Workflow

How our Yasara script calculates a homology model [7]:

  1. Sequence is PSI-BLASTed against Uniprot [2]
  2. Calculation of a position-specific scoring matrix (PSSM) from related sequences
  3. Using the PSSM to search the PDB for potential modeling templates
  4. The templates are ranked based on the alignment score and the structural quality [3]
  5. Additional information is derived for template and target (prediction of secondary structure, structure-based alignment correction using SSALN scoring matrices [4])
  6. A graph of the side-chain rotamer network is built, and dead-end elimination is used to find an initial rotamer solution in the context of a simple repulsive energy function [5]
  7. The loop network is optimized by trying a large number of different conformations
  8. Side-chain rotamers are fine-tuned considering electrostatic and knowledge-based packing interactions as well as solvation effects
  9. An unrestrained high-resolution refinement with explicit solvent molecules is run, using the latest knowledge-based force fields [6]
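
Step 2 of the workflow, the position-specific scoring matrix, can be illustrated with a small Python sketch. This is not the code that PSI-BLAST or Yasara runs: the uniform background frequency, the pseudocount of 1 and the names `build_pssm` and `score_sequence` are assumptions made for illustration, and real PSI-BLAST additionally uses sequence weighting and substitution-matrix-based pseudocounts.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"


def build_pssm(aligned_sequences, pseudocount=1.0):
    """Build a log-odds PSSM from equal-length, gap-free aligned sequences.

    Assumption: a uniform background frequency (1/20) and a flat
    pseudocount; both are simplifications chosen for this sketch.
    """
    length = len(aligned_sequences[0])
    if any(len(seq) != length for seq in aligned_sequences):
        raise ValueError("all aligned sequences must have the same length")

    background = 1.0 / len(AMINO_ACIDS)
    pssm = []
    for col in range(length):
        # Count how often each residue occurs in this alignment column.
        counts = Counter(seq[col] for seq in aligned_sequences
                         if seq[col] in AMINO_ACIDS)
        total = sum(counts.values()) + pseudocount * len(AMINO_ACIDS)
        column = {}
        for aa in AMINO_ACIDS:
            freq = (counts[aa] + pseudocount) / total
            # Log-odds: positive if the residue is enriched at this position.
            column[aa] = math.log2(freq / background)
        pssm.append(column)
    return pssm


def score_sequence(pssm, sequence):
    """Sum the per-position scores of a sequence against the PSSM."""
    return sum(column[aa] for column, aa in zip(pssm, sequence))


# Hypothetical three-sequence alignment, used only for demonstration.
example_pssm = build_pssm(["ACD", "ACE", "ACD"])
```

A sequence resembling the alignment (for example `"ACD"`) scores higher against `example_pssm` than an unrelated one such as `"WWW"`, which is the property that makes a PSSM useful for template searches (step 3).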




Application

All of these steps are performed for every template used in the modeling approach. For our project we set the maximum number of templates to 20. Every derived structure is evaluated using average per-residue quality Z-scores. Finally, a hybrid model is built containing the best regions of all predictions. This procedure makes the prediction more accurate and thus more realistic. For the evaluation we used the Yasara Z-scores. A Z-score describes how many standard deviations the model quality is away from that of the average high-resolution X-ray structure. Negative values indicate that the homology model looks worse than a high-resolution X-ray structure. The overall Z-scores for all models were calculated as the weighted averages of the individual Z-scores using the formula Overall = 0.145*Dihedrals + 0.390*Packing1D + 0.465*Packing3D [7].
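
The Z-score definition and the weighted-average formula above can be written out as a short Python sketch. The weights are taken from the formula in the text; the function names and the example input values are hypothetical and are not actual results of our models.

```python
# Weights from the formula Overall = 0.145*Dihedrals + 0.390*Packing1D + 0.465*Packing3D
WEIGHTS = {"dihedrals": 0.145, "packing1d": 0.390, "packing3d": 0.465}


def z_score(value, reference_mean, reference_std):
    """Number of standard deviations a value lies from a reference mean.

    Here the reference would be the distribution observed for
    high-resolution X-ray structures.
    """
    return (value - reference_mean) / reference_std


def overall_z_score(dihedrals, packing1d, packing3d):
    """Weighted average of the three individual quality Z-scores."""
    return (WEIGHTS["dihedrals"] * dihedrals
            + WEIGHTS["packing1d"] * packing1d
            + WEIGHTS["packing3d"] * packing3d)


# Hypothetical individual Z-scores for one model, for illustration only.
example_overall = overall_z_score(dihedrals=-0.5, packing1d=-1.2, packing3d=-0.8)
```

Because the three weights sum to 1, a model whose individual Z-scores are all equal receives that same value as its overall score.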

Parameters
We used the Yasara script hm_build.mcr for the model creation with the following parameters:

  • Modeling speed (slow = best): Slow
  • Number of PSI-BLAST iterations in template search (PsiBLASTs): 3
  • Maximum allowed PSI-BLAST E-value to consider template (EValue Max): 0.5
  • Maximum number of templates to be used (Templates Total): 20
  • Maximum number of templates with same sequence (Templates SameSeq): 1
  • Maximum oligomerization state (OligoState): 4 (tetrameric)
  • Maximum number of alignment variations per template (Alignments): 5
  • Maximum number of conformations tried per loop (LoopSamples): 50
  • Maximum number of residues added to the termini (TermExtension): 10
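
To make the effect of the template-related parameters concrete, the following Python sketch stores the settings listed above in a dictionary and applies three of them (EValue Max, Templates Total, Templates SameSeq) to a list of template hits. It is an illustration under stated assumptions, not the logic of hm_build.mcr: the `TemplateHit` class, the `select_templates` function, the `"Speed"` key and the ranking by a single score are hypothetical.

```python
from dataclasses import dataclass

# Parameter names in parentheses above are used as keys; "Speed" is our own label.
HM_PARAMETERS = {
    "Speed": "Slow",
    "PsiBLASTs": 3,
    "EValue Max": 0.5,
    "Templates Total": 20,
    "Templates SameSeq": 1,
    "OligoState": 4,
    "Alignments": 5,
    "LoopSamples": 50,
    "TermExtension": 10,
}


@dataclass(frozen=True)
class TemplateHit:
    """Hypothetical record for one template found in the search."""
    template_id: str
    sequence: str
    evalue: float
    score: float


def select_templates(hits, params=HM_PARAMETERS):
    """Keep the best-scoring hits that satisfy the template limits."""
    selected = []
    per_sequence = {}
    for hit in sorted(hits, key=lambda h: h.score, reverse=True):
        if hit.evalue > params["EValue Max"]:
            continue  # hit is not significant enough
        if per_sequence.get(hit.sequence, 0) >= params["Templates SameSeq"]:
            continue  # enough templates with this exact sequence already
        selected.append(hit)
        per_sequence[hit.sequence] = per_sequence.get(hit.sequence, 0) + 1
        if len(selected) == params["Templates Total"]:
            break  # template limit reached
    return selected
```

With these settings, a second hit with an identical sequence or a hit with an E-value above 0.5 is discarded, and at most 20 templates are passed on to model building.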





References

[1] E. Krieger, G. Koraimann, and G. Vriend, “Increasing the precision of comparative models with YASARA NOVA--a self-parameterizing force field,” Proteins, vol. 47, no. 3, pp. 393–402, 2002.
[2] S. F. Altschul, T. L. Madden, A. A. Schäffer, J. Zhang, Z. Zhang, W. Miller, and D. J. Lipman, “Gapped BLAST and PSI-BLAST: a new generation of protein database search programs,” Nucleic Acids Res, vol. 25, no. 17, pp. 3389–3402, 1997.
[3] R. W. Hooft, G. Vriend, C. Sander, and E. E. Abola, “Errors in protein structures,” Nature, vol. 381, no. 6580, p. 272, 1996.
[4] D. T. Jones, “Protein secondary structure prediction based on position-specific scoring matrices,” Journal of Molecular Biology, vol. 292, no. 2, pp. 195–202, 1999.
[5] A. A. Canutescu, A. A. Shelenkov, and R. L. Dunbrack, “A graph-theory algorithm for rapid protein side-chain prediction,” Protein Science, vol. 12, no. 9, pp. 2001–2014, 2003.
[6] E. Krieger, K. Joo, J. Lee, J. Lee, S. Raman, J. Thompson, M. Tyka, D. Baker, and K. Karplus, “Improving physical realism, stereochemistry, and side-chain accuracy in homology modeling: Four approaches that performed well in CASP8,” Proteins, vol. 77, suppl. 9, pp. 114–122, 2009.
[7] http://www.yasara.org/homologymodeling.htm