We developed the MaGellin assay to optimize the development process for site-specific methylases. Having validated the assay, we determined to design and test three site-specific methylases, two of which had never been constructed before.
The process further validated our MaGellin assay:
1. We recapitulated published results with a zinc finger-methylase and shed light on the significant magnitude of its off target effects. MaGellin is an excellent assay for this purpose, because of the noiseless chassis and because it's simpler to detect off target effects on a plasmid than a genome.
2. We further characterized our promising novel TALE-methylase and were able to de-noise this noisy, complex system. MaGellin was in agreement with bisulfite sequencing that the TALE exhibited targeted inhibition of the methylase. This has serious implications for the multitude of TALE-effector systems that have recently been developed: the TALE can inhibit the effector if the linker length and distance between the TALE binding site and target site are not optimized. The MaGellin workflow is well suited to solve this optimization problem.
Zinc Finger-M.SssI Fusion.
The zinc finger is a small DNA binding domain, with limited sequence specificity. Previous studies showed it was prone to off-target methylation, which we verified. This was also validation that the MaGellin assay accurately reports the site-specificity of methylation, effectively demonstrating our assay does everything we need it to do.
SHOW ZINC FINGER DATA
Figure 1: The ZF-M.SssI was cloned into MaGellin with and without its binding site present. We ran the standard MaGellin assay on both plasmids, using methylation sensitive restriction enzymes to report the methylase activity.
To be sure of the targeting specificity, we cloned the MaGellin plasmid with and without the zinc finger’s binding site present at the target cut site. This demonstrated how the presence of a zinc finger binding site shifts the methylation pattern (Figure 1).
TALE-M.SssI Fusion. TALEs have a greater sequence specificity than zinc fingers, and are easier to customize and less expensive to construct. They have already been validated for use in genome engineering and are quickly replacing zinc fingers. We performed a similar experiment with our TALE-M.SssI fusion, with and without the binding site present at the target cut site. We ran the gel and saw a significant effect on the methylation pattern but it was not in agreement with our software’s predicted experimental outcome.
TALE-M.SssI actively methylates DNA
Our TALE-M.SssI was actively methylating DNA at both the target site and off-target site. We expected a certain degree of off target methylation simply because the TALEs could occupy all the target sites on our low copy plasmid; the molar ratio is one of the problems in developing site-specific methylases that the inducible MaGellin system is designed to address. However, it was unexpected to see methylation skewed in favor of the off target site so we carried out more characterization experiments.
COBRA confirms on and off target methylation
We bisulfite converted the plasmid and used our validated bisulfite sequencing primers on both the target and off target site, then used the COBRA assay. The controls recapitulated that our primers are biased for only bisulfite converted DNA, as desired. Unconverted DNA would have been digested by the control enzyme. TaqαI digested both the on and off target sites, confirming that the TALE was partially methylating both sites (Figure 3). This validated our assay further, as it reports the same biological outcome as the published COBRA method, but at a fraction of the cost, time, and technical difficulty.
Varied Induction Conditions Clarify TALE Mechanism of Action
P = TALE Binding Site
M = No TALE Binding Site
N = On Target Methylation Band
F = Off Target Methylation Band
The software package calculated for us that the largest band we were seeing on the TALE gel was the result of simultaneous target site and off target site methylation while the second largest band was only off target site methylation. We used this information to formulate the Targeting Score to reflect increased site-specificity. We varied induction conditions, expecting one might be more optimal than our previous inductions. As desired, the negative control produced a baseline Targeting Score of almost exactly 1 (1.06). However, no induction condition increased Targeting Score, rather there was a steady decline (Figure 4). This indicated the TALE could be giving negative feedback to the site-specific methylation.
Bisulfite Sequencing Confirms TALE Binding
The next step for characterizing functional constructs with MaGellin is to use the standard bisulfite sequencing primers for a higher resolution look at individual CpG methylation. We bisulfite sequenced the target site of the induced TALE-M.SssI with the TALE binding site present 4 nucleotides upstream of the target AvaI cut site. The data further validated the enzymatic activity of our TALE-M.SssI in the region of the TALE binding site. It was also the first demonstration that the MaGellin assay and standard primers can easily be used for bisulfite sequencing with a TOPO cloning kit with ~7 day turnaround. Interestingly, no clones were methylated at the CpG site that is within the AvaI site itself (Figure 5).
This data, in combination with the varied induction conditions and COBRA results, led us to hypothesize a new model for the TALE’s mechanism that successfully explains each result. Although 4 nucleotides between the zinc finger binding site and the target cut site was optimal for a published zinc finger fusion with a short linker. That distance is too short for use with a TALE fusion with our linker length. The TALE is at least three times the size of the zinc finger and so TALE binding occludes that CpG from interacting with the methylase.
We have now shown that the novel TALE-M.SssI binds to its binding site strongly, as it almost fully protected that site from the methylase for 24 hours (Figure 4). We have shown that it exhibits methylating activity (Figure 2). Perhaps most interestingly, we have demonstrated that the TALE is large enough to physically occlude neighboring nucleotides from access to its linked effector, which has significant consequences for the recently published slew of TALE fusions – including the TALE-histone methylases, TALE-histone demethylases, and TALE-DNA demethylases for epigenetic engineering. We expect the same result will hold for Cas9-effector fusions, and are in the process of validating that hypothesis. We have already constructed the first dCas9-methylase fusion and demonstrated its enzymatic methylase activity in vivo (Figure 6).
Novel dCas9-M.SssI Methylation Activity Reported by MaGellin
Summary
MaGellin was developed to optimize the development of robust tools for site-specific methylation. To those ends, we successfully cloned and expressed three fusion methylases, two of which are novel constructs with advantages over the previously published zinc finger. Our constructs have shown methylase activity and DNA binding activity, which we could measure with our new assay. They are ready to be further optimized, using our workflow.
To gain our new insight into a fundamental shortcoming of recently developed genome engineering tools, we used MaGellin to its full extent: swapping out DNA binding domains and binding sites, varying induction conditions, applying COBRA, bisulfite sequencing, and depending on our original algorithm to properly predict methylation-sensitive digestion patterns. Importantly, we could not have reached this result without MaGellin, because the one-plasmid system in a noiseless chassis makes it simple, even unavoidable, to detect off target methylation. Conversely, for the previously published work in mammalian systems, it was not feasible to detect off target effects across a long genome with background signal. Based on our data, future improvements on genome engineering tools should include the construction of two targeted fusions with subunits of effectors that only dimerize and show activity at the binding sites, along the lines of how TALE-Nucleases cleave DNA. That could be the best way to construct epigenetic engineering tools with the specificity necessary for clinical applications.
Moreover, we have demonstrated the importance of studying the distance between the binding site and the target site, and shown the ideal distance will be very different between different DNA binding domains. This boils down to an optimization problem between choosing binding sites and linker lengths; this is exactly the sort of problem that the MaGellin system is designed to solve in a fast and affordable manner.