Team:Wageningen UR/Flux balance analysis
From 2013.igem.org
- Safety introduction
- General safety
- Fungi-related safety
- Biosafety Regulation
- Safety Improvement Suggestions
- Safety of the Application
Modeling
“When I came out of school I didn't even think that modeling was a job.”
Introduction
To develop and investigate mathematical models of metabolic processes is one of the primary challenges in systems biology. As a proof of concept of our modular domain approach lovastatin has been chosen and its production in several Aspergilli will be modeled. To investigate the potential of lovastatin production in A. niger will be compared to that in A. nidulans , A. oryzae and A. terreus.
Rationale
Producing a compound in a novel host at first requires investigation of the possibility to do so. Since the compounds required for biosynthesis of lovastatin are occur naturally in metabolic routes such as the citric acid cycles and fatty acid synthesis pathways, all of the Aspergilli that are modeled have the potential ability to produce lovastatin when the required genes are introduced. Analysis and comparison of the different models allows for a broad insight in efficient biosynthesis strategies.
Aim
• Model and balance the lovastatin pathway
• Expand the metabolic model of A. niger, A nidulans, A. oryzae with the lovastatin biosynthesis pathway
• Perform flux balance analysis to analyze the flux of lovastatin and compare this with the model of A. terreus
• Flux variability analysis to determine the ranges of fluxes that correspond to an optimal solution determined through flux balance analysis
• Change media composition in the model to investigate its effect on lovastatin production
• Use OptKnock to determine gene deletion strategies leading to increased production of lovastatin
Approach
First we need to make the models consistent, meaning that we need to make sure that similar compounds and reactions have similar names in the different models. Since the origin of the models is not the same, and even in those that originate from the same research group, there are differences that complicate a comparative analysis. After having generated a generic namespace for both reactions and metabolites we will analyze the metabolic flux towards lovastatin and the corresponding state space. Changing medium conditions will allows us to obtain insight in effect of its compositions to deduce efficient production media. Last of all we will use a computational intensive script to determine what gene deletion strategies are most favorable.
Research methods
First of all we extract the models from their respective sources. Since the A. terreus model is not in xml format we need to create this ourselves. In order to do so and make the models consistent we make use of MetanetX , which is an initiative in trying to standardise metabolic models. The models that we investigate are those of A. terreus, A. niger, A. nidulans and A. oryzae. After we have obtained all the models in xml format we make use of the COBRA toolbox within MATLAB. The COBRA toolbox facilitates easy input of the metabolic model in the Systems Biology Markup Language (SBML) to perform these calculations in MATLAB. Once the model has been expanded flux balance analysis allows for a genome-scale approach. OptKnock can be used to determine which gene knockouts should increase the metabolic flux towards lovastatin.
Lovastatin pathway
Lovastatin starts with the synthesis of dihydromonacolin L by a large iterative polyketide synthase (lovB) and an enoyl reductase (lovC). The iterative polyketide synthase starts with amalgamation of acetyl-CoA and malonyl-CoA in the first step, after which another malonyl-group is added at each subsequent step. At one point a methyl group is added, which is derived from S-adenosyl-methionine. Together LovB and LovC they catalyze 18 reactions to form this intermediate of lovastatin. Also a Diels Alder cyclization occurs during the process, though this reaction occurs spontaneous. In order to model the biosynthesis of this intermediate several steps have been lumped into a total of 8 reactions, simply because exact details of intermediates formed are unknown and this results in the highest level of detail possible.
In parallel with this process there is another, very similar polyketide synthase, LovF, that synthesizes the intermediate 2-metylbutyryl-CoA from the same starting substrates, acetyl-CoA and malonyl-CoA.
In the final step of the process, LovD amalgamates monacolin J and 2-metylbutyryl-CoA into the product lovastatin.
However, in order to balance the pathway more detail is required. It turns out that the co-factor NADPH is required by LovB, a non-trivial detail, which was found via Uniprot.Results
In order to analyze the differences and compare the different models it is best to use a standardized namespace. Although MetanetX attempts to do so, it is not perfect and therefore one must carefully address the changed imposed by the system when one tries to upload a model. Another thing that needs to be taken into account is that the MATLAB script written is generic, such that when the models are modified the script should function properly as before.
Since the files are in SBML format (.xml), converting them to MATLAB (.mat) files allows for much faster loading and saving of the models (link to script).
In order not to overwrite the previous model when saving it a prefix will be added to the file name (link to script).
Converting and improving the A. terreus model
The A. terreus model was obtained in excel format. The first step taken is to convert the Excel file (.xls) into .xml format such that the model can be accessed via MATLAB: (link to script). For A. terreus the biomass reaction contained a great number of metabolites. By adding an additional compound to the reactions called 'BIOMASS' with a stoichiometric coefficient of 1, we can add an exchange reaction for biomass. This allows for optimization of this exchange reaction in FBA, a feature that should be generic for all models.
The A terreus model does already contain a pathway for lovastatin production, however as it turns out this pathway is not functional. As it turns out the pathway is not balanced; the first step in which they lump the 35 reactions performed by lovB and lovC is missing the co-factor NADPH on one side, while NADP+ is present on the other.
J. Lui et al. (2013). Genome-scale reconstruction and in silico analysis of Aspergillus terreus metabolism. Molecular BioSystems, Vol. 9, p. 1939-1948
Standardising the models
Before proper conversion to the MNXM namespace from MetanetX can be finalized, uploading to model to MetanetX yields a mapping summary. This mapping summary includes information on the metabolites, compartments and reactions and their success of mapping. Since the system does not recognize all of them properly, the report has to be checked manually.
Metabolites
Mapping of metabolites is done on recognition of IDs and description. To check whether all metabolites were mapped correctly, mapping of those that do not have similar descriptions, IDs, or both, have to be manually curated from errors. Wrongly mapped metabolites had to be either set to their right respective MNXM id or else were given the prefix ‘my_’ to ensure proper mapping.
Compartments
To ensure mapping to the right compartment, it was found useful to put all the compartment names as a suffix behind square brackets (link to script). In a few cases mapping to the wrong compartment still occurred with several compounds that were not yet in the MNXM namespace, therefore these compounds were manually modified (link to script).
Reactions
Similar to the A. terreus model, both A. niger and A. nidulans don’t have a exchange reaction for biomass, which was added to allow for optimization of this parameter with the same script for all models.
The last non-triviality that had to be taken into account was the balancing of the reactions. To do so the model adds protons to the side of the reaction where it deems they are missing. A serious downside of this feature is that when metabolites are not recognized are involved, it is not able to take into account the protons absorbed or lost by these compounds. Still however, it adds protons to balance the reaction, but these are not rightly added.
Adding the lovastatin pathway
Since the lovastatin pathway is only present in the A. terreus model, but is not well balanced here, a function was written to add the pathway for lovastatin biosynthesis. First all the metabolites that are not yet in the model need to be added. Then the pathway upto the production of the intermediate dihydromonacolin L acid was added. This pathway consists of 35 reactions that have been lumped into 10 reactions. Together with the remaining 4 reactions required the total lovastatin biosynthesis pathway has been modeled in 14 steps (link to function). Finally also a exchange reaction for lovastatin was added to be able to optimise for its production.
The prefix 'Ex_' was added to all exchange reactions for easy recognition and an additional function was implemented that allows to easily change the optimisation criterion.
Medium constraints
In order to find medium constraints of the models all lower and upper bounds are first set to the value of -1000 and 1000 arbitrary units, respectively. Then one by one the lower and upper limit of the reaction bounds are set to 0 for one reaction at a time. If the model is not able to optimise the reaction of interest this indicates that exchange of this component with the medium is a prerequisite. If a lower bound larger then 0 is required this indicates that the component needs to be present in the in silico medium. When an upper bound larger then one required this means the opposite, namely that detrimental accumulation of this metabolite occurs detrimental accumulation
Media
In order to define a proper in silico medium need to know which exchange reactions the models have in common and which of these are of essential importance.
Flux Balance Analysis
Gene knockout strategies
Michiel Karrenbelt
Here comes the code