Team:Heidelberg/NRPS

From 2013.igem.org

Revision as of 17:21, 27 October 2013 by JuliaS1992 (Talk | contribs)

NRPS. Get to know the theory.

Abstract

Everybody knows the central dogma of molecular biology - DNA makes RNA makes protein - but it does not cover all of nature's capabilities. One example is the non-ribosomal peptide synthetases, or NRPSs for short. This alternative pathway for peptide formation is mainly found in bacteria and fungi, but can also be functionally transferred to mammalian cells. Natural non-ribosomal peptides (NRPs) have various functions, ranging from simple dyes to metal chelators and antibiotics. NRPSs are large protein complexes that add amino or aryl acids, taken from a pool of more than 500 monomers, to a growing peptide chain without the need for a template. The molecular structure of an NRPS is based on modules, each incorporating exactly one monomer and comprising several domains with different functions. The most important domains are the adenylation domain (specific substrate activation), the thiolation domain (peptide / monomer carrier) and the condensation domain (peptide bond formation), which together are needed to form basic peptide chains. Besides these, various other domains are known to introduce secondary modifications. Thanks to their remarkable modularity and wide range of substrates, NRPSs have the biosynthetic potential to create novel non-ribosomal peptides with various functions in vivo or in vitro. The range of possible applications beyond the natural functions remains to be explored.


Modular Structure

Figure 1: The three levels of modularity in NRPS are the gene products, the modules, each incorporating one amino acid, and the domains within every module.

The biggest benefit of NRPS for synthetic biology is its highly modular structure. This modularity starts at the gene level and goes as deep as the domain or even the single-residue level. It can be exploited by rearranging the modules in order to obtain different products.

Biosynthetic gene cluster

The genes necessary for the biosynthesis are normally organised in a gene cluster. This cluster contains the genes coding for the non-ribosomal peptide synthetases, which can reach a size of up to 1.5 MDa (cyclosporin synthetase; 1), the genes necessary for the biosynthesis of the monomers, and tailoring enzymes that introduce further modifications into the peptide. There is normally more than one peptide synthetase. The proteins are often connected by communication domains in order to maintain the structure of the assembly line. If one wants to transfer a whole pathway to a different host organism, one crucial aspect, besides the successful cloning of the synthetases, is the monomer supply. Depending on the host's endogenous machinery, one can leave out certain genes or has to include other pathways in order to keep up the supply.

One module - one amino acid

Each NRP synthetase is organised in so-called modules, where every single module is responsible for incorporating one amino acid into the growing peptide chain. Since all modules share similar minimal structural components, they can in principle be reordered to achieve a different product. These minimal structural components are NRPS-specific protein domains, each providing the assembly line with a different function. The three levels of modularity - proteins, modules and domains - are shown in figure 1 (1).
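The three levels of modularity can be sketched as a small data model. This is only an illustration: the class names, the synthetase names (SynA, SynB) and the monomers are hypothetical and do not describe a real pathway.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Module:
    """One NRPS module: an ordered set of domains that incorporates one monomer."""
    domains: Tuple[str, ...]  # e.g. ("C", "A", "T")
    monomer: str              # monomer selected by the module's A-domain


@dataclass(frozen=True)
class Synthetase:
    """One gene product (protein) carrying one or more modules."""
    name: str
    modules: Tuple[Module, ...]


def predicted_peptide(assembly_line: List[Synthetase]) -> List[str]:
    """Read the monomers in assembly-line order: one module, one monomer."""
    return [m.monomer for s in assembly_line for m in s.modules]


# Hypothetical two-protein assembly line (illustrative names and monomers only).
syn_a = Synthetase("SynA", (Module(("A", "T"), "Val"),
                            Module(("C", "A", "T"), "Leu")))
syn_b = Synthetase("SynB", (Module(("C", "A", "T", "TE"), "Gln"),))

peptide = predicted_peptide([syn_a, syn_b])
print("-".join(peptide))  # Val-Leu-Gln

# Exchanging the second module for one with a different A-domain
# specificity changes the predicted product.
syn_a_swapped = Synthetase("SynA_swapped", (syn_a.modules[0],
                                            Module(("C", "A", "T"), "Phe")))
swapped = predicted_peptide([syn_a_swapped, syn_b])
print("-".join(swapped))  # Val-Phe-Gln
```

The model ignores everything that makes module swaps difficult in practice, such as communication domains and C-domain selectivity; it only captures the rule that the module order determines the monomer order.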

Chain Elongation

The mechanism of peptide bond formation in NRPS differs in part from that in the ribosome. The biggest differences are the attachment of the growing peptide chain and the number of catalytic domains. In ribosomal synthesis one ribosome can add many amino acids, whereas in non-ribosomal synthesis the number of catalytic domains rises linearly with the number of amino acids incorporated. Thus the latter is only suitable for oligopeptide synthesis.
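The linear scaling can be made concrete with a rough count. The sketch below assumes a minimal assembly line: one A- and one T-domain per module, a C-domain in every module except the first, a single terminal thioesterase, and no modifying domains. The function name is hypothetical.

```python
from typing import Dict


def minimal_domain_count(n_monomers: int) -> Dict[str, int]:
    """Count the domains of a minimal assembly line for an n-residue peptide.

    Assumptions: the first (initiation) module has no C-domain, there is
    one terminal TE domain, and no modifying domains are present.
    """
    if n_monomers < 1:
        raise ValueError("an assembly line needs at least one module")
    return {
        "A": n_monomers,      # one adenylation domain per monomer
        "T": n_monomers,      # one thiolation domain per monomer
        "C": n_monomers - 1,  # one condensation domain per peptide bond
        "TE": 1,              # single terminal thioesterase
    }


for n in (3, 10):
    counts = minimal_domain_count(n)
    print(n, counts, "total:", sum(counts.values()))
```

Under these assumptions the total is 3n domains for a peptide of n monomers, so a tripeptide already needs 9 domains and a decapeptide 30.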

Thiolation domain and PPTases

Figure 2: Phosphopantetheinyltransferases are essential for non-ribosomal peptide synthesis, as they are needed for posttranslational modification of the thiolation domains. (modified from 1)

In the ribosome the growing peptide chain and the monomers to be incorporated are bound to tRNA, but never to the ribosome itself. NRP synthetases, in contrast, contain a thiolation domain (T-domain) in every module, to which the corresponding amino acid is covalently bound via a thioester bond. The essential amino acid residue of the T-domain is a serine, which is posttranslationally modified to carry the sulfhydryl group required for the thioester bond. This modification is carried out by separate proteins, the phosphopantetheinyltransferases (PPTases), which use coenzyme A as the donor of the phosphopantetheinyl group. The reaction is shown in figure 2. As functional thiolation domains are essential for peptide synthesis, one should consider transferring a suitable PPTase to the host organism together with the synthetases.

Adenylation domain

Figure 3: The adenylation (A) domains of a pathway selectively activate a monomer and form the thioester bond with the thiolation domain. (modified from 1)

The actual attachment of the amino acid to the already modified T-domain is carried out by the adenylation domain (A-domain). It is highly specific for a single monomer, apart from rare cases in which it can also bind a second, very similar monomer. In the first step of the reaction the monomer is activated with ATP; in the second step the thioester bond between the phosphopantetheinyl residue and the monomer's carboxyl group is formed. The reaction is depicted in figure 3.
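Written as two half-reactions, the A-domain chemistry looks roughly as follows. The labels are shorthand chosen for this sketch: "HS-Ppant-T" stands for the phosphopantetheinylated T-domain and "monomer-COOH" for the amino or aryl acid.

```latex
\begin{align*}
\text{monomer-COOH} + \text{ATP}
  &\longrightarrow \text{monomer-CO-AMP} + \text{PP}_i
  && \text{(activation)} \\
\text{monomer-CO-AMP} + \text{HS-Ppant-T}
  &\longrightarrow \text{monomer-CO-S-Ppant-T} + \text{AMP}
  && \text{(thioester formation)}
\end{align*}
```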

Condensation domain

Figure 4: The condensation (C) domain is responsible for forming the peptide bonds between the monomer (acceptor) and the growing peptide chain (donor). After the reaction the whole peptide chain is attached to the acceptor's T-domain. (modified from 1)

After two neighbouring monomers have been activated, the condensation domain (C-domain) forms the peptide bond, as shown in figure 4. The C-domain is selective for the acceptor amino acid, so the C- and A-domain of a module always share the same substrate specificity. The reaction catalysed by the C-domain is a nucleophilic attack of the acceptor amino acid on the donor peptide chain.
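The interplay of PPTase, A-, T- and C-domains described so far can be sketched as a toy simulation. All class and function names here are hypothetical, and the model tracks only which T-domain carries which chain, not any chemistry or kinetics.

```python
from typing import List


class TDomain:
    """Thiolation domain: carries a monomer or peptide as a thioester."""

    def __init__(self) -> None:
        self.holo = False           # apo form until a PPTase modifies the serine
        self.chain: List[str] = []  # bound monomers, N- to C-terminus


def pptase_prime(t: TDomain) -> None:
    """PPTase step: attach the phosphopantetheine arm (apo -> holo)."""
    t.holo = True


def adenylate_and_load(t: TDomain, monomer: str) -> None:
    """A-domain step: activate the monomer and load it onto the T-domain."""
    if not t.holo:
        raise RuntimeError("apo T-domain cannot carry a thioester")
    t.chain = [monomer]


def condense(donor: TDomain, acceptor: TDomain) -> None:
    """C-domain step: the acceptor monomer attacks the donor thioester.

    Afterwards the whole chain sits on the acceptor's T-domain.
    """
    acceptor.chain = donor.chain + acceptor.chain
    donor.chain = []


def run_assembly_line(monomers: List[str]) -> List[str]:
    """Prime, load and condense one module per monomer; return the final chain."""
    if not monomers:
        raise ValueError("need at least one monomer")
    t_domains = [TDomain() for _ in monomers]
    for t in t_domains:
        pptase_prime(t)
    for t, monomer in zip(t_domains, monomers):
        adenylate_and_load(t, monomer)
    for donor, acceptor in zip(t_domains, t_domains[1:]):
        condense(donor, acceptor)
    return t_domains[-1].chain  # still enzyme-bound; release is a separate step


product = run_assembly_line(["Val", "Leu", "Gln"])
print("-".join(product))  # Val-Leu-Gln
```

Skipping `pptase_prime` makes `adenylate_and_load` fail, which mirrors the advice to transfer a suitable PPTase together with the synthetases.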

TErmination - ThioEsterase

In order to terminate a pathway, a thioesterase (TE) domain is located at the end of most NRPSs. It can have one of several functions, which can roughly be classified as either simple product cleavage or macrocyclisation. Thioesterases of the first group cleave the thioester bond between the product and the enzyme complex by transferring the peptide to their own conserved serine residue and then release it into the cytoplasm. Those of the second group introduce a macrocycle into the product after cleaving it off the last T-domain. This macrocycle can either connect the two peptide termini or form any other peptide-bond-based cycle. In addition to these integral TE domains, separate thioesterases are known to act as rescue proteins for stalled NRPSs.
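The two release modes can be told apart in a minimal sketch. The function and mode names are hypothetical, and only the head-to-tail case of macrocyclisation is modelled; the output strings use common peptide shorthand (H-...-OH for free termini, cyclo(...) for a ring).

```python
from typing import List


def release(chain: List[str], mode: str) -> str:
    """Describe the product a TE domain would release from the last T-domain.

    mode "cleavage":         linear peptide with free termini
    mode "macrocyclisation": head-to-tail cycle joining the two termini
    """
    sequence = "-".join(chain)
    if mode == "cleavage":
        return "H-" + sequence + "-OH"
    if mode == "macrocyclisation":
        return "cyclo(" + sequence + ")"
    raise ValueError("unknown release mode: " + mode)


linear = release(["Val", "Leu", "Gln"], "cleavage")
cyclic = release(["Val", "Leu", "Gln"], "macrocyclisation")
print(linear)
print(cyclic)
```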

Monomer Modifications

As already mentioned in the overview, NRPSs can select from more than 500 different monomers. These are either amino acids or aryl acids; the latter normally lack the amino group and are thus only suitable for chain initiation. Most of the monomers are derived from more basic ones.

Epimerisation and stereoselectivity

https://static.igem.org/mediawiki/2013/b/b1/Heidelberg_NRPS_Epimerisation.png

Figure 5: Epimerisation (E) domains racemise the donor amino acid (left), and from the resulting equilibrium the following C-domain selects the correct configuration, in this case the D-form. With the other stereoisomer no peptide bond can be formed.

The most basic modification is the introduction of D-amino acids. This is achieved by epimerisation (E) domains located between the T-domain of the donor module and the C-domain of the acceptor module (see figure 5). The E-domain racemises the donor amino acid, and the correct configuration is selected by the acceptor C-domain. Thus C-domains can be classified as either D-selective (CD) or L-selective (CL).
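The division of labour between the E- and C-domain can be sketched as two small functions. The residue notation ("L-Phe", "D-Phe") and the function names are assumptions made for this illustration.

```python
from typing import Optional, Set


def epimerise(residue: str) -> Set[str]:
    """E-domain: turn one stereoisomer into the equilibrium mixture of both."""
    name = residue.split("-", 1)[1]  # "L-Phe" -> "Phe"
    return {"L-" + name, "D-" + name}


def c_domain_select(mixture: Set[str], donor_selectivity: str) -> Optional[str]:
    """C-domain: accept only the donor isomer matching its selectivity ("D" or "L")."""
    matches = [r for r in mixture if r.startswith(donor_selectivity + "-")]
    return matches[0] if matches else None


# With an E-domain, a D-selective C-domain finds its substrate.
with_e_domain = c_domain_select(epimerise("L-Phe"), "D")
# Without an E-domain only the L-form is present, so no bond is formed.
without_e_domain = c_domain_select({"L-Phe"}, "D")
print(with_e_domain, without_e_domain)
```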

N-Methylation

The next basic modification of the peptides is N-methylation, carried out by N-methyltransferase (NM) domains. They are located between the A- and the T-domain of a module. They transfer a methyl group (CH3) from S-adenosylmethionine to the amino group of the module's monomer after it has been activated by the A-domain.

Heterocyclisation

Besides the macrocyclisation performed by TE-domains, cyclisation (Cy) domains can introduce additional amide bonds between neighbouring amino acids in the middle of the assembly line. They need serine, threonine or cysteine as the acceptor amino acid in order to form a heterocycle.

All of these modifying domains have in common that their borders to the C- or A-domains are often blurry. The Cy-domains and E-domains are thus often displayed as combined C/Cy or C/E-domains. Besides these most common modifications, many others can be found within the pathway or as tailoring enzymes, for example O-methylations, halogenations, glycosylations or oxidations - there is a lot more to explore!

1. Fischbach M, Walsh C (2006) Assembly-Line Enzymology for Polyketide and Nonribosomal Peptide Antibiotics: Logic, Machinery, and Mechanisms. Chemical Reviews 106: 3468-3496.
