Crystal Structure of the Vaccinia Virus DNA Polymerase Holoenzyme Subunit D4 in Complex with the A20 N-Terminal Domain
Vaccinia virus is the prototype of the orthopoxvirus genus which includes other pathogens infecting humans and variola virus which was eradicated in the late 70's. Vaccinia virus DNA synthesis relies on three proteins: these are E9, the DNA polymerase bound to its heterodimeric cofactor D4/A20. To date, the molecular mechanism involved in poxvirus DNA replication remains poorly understood. Here, we present the high-resolution crystal structure of a complex formed by D4 and the first 50 residues of A20 (A201–50) that are necessary and sufficient for binding. The structure of D4/A201–50 reveals the contact surface engaged in the D4/A20 interaction in great detail. Interestingly, we could show that known small molecule inhibitors of vaccinia virus DNA synthesis selected for their ability to interfere with the D4/A20 interface could be docked onto the D4 surface where they mimic several aspects of the interacting A20 molecule. Finally, we present a model of D4/A20 in complex with DNA that allows us to discuss the role of mutations affecting the D4/A20 cofactor. Altogether, our structure gives new insights into the assembly of the vaccinia virus DNA polymerase cofactor and will be useful for the design of new antiviral compounds targeting the D4/A20 interaction.
Published in the journal:
. PLoS Pathog 10(3): e32767. doi:10.1371/journal.ppat.1003978
Category:
Research Article
doi:
https://doi.org/10.1371/journal.ppat.1003978
Summary
Vaccinia virus is the prototype of the orthopoxvirus genus which includes other pathogens infecting humans and variola virus which was eradicated in the late 70's. Vaccinia virus DNA synthesis relies on three proteins: these are E9, the DNA polymerase bound to its heterodimeric cofactor D4/A20. To date, the molecular mechanism involved in poxvirus DNA replication remains poorly understood. Here, we present the high-resolution crystal structure of a complex formed by D4 and the first 50 residues of A20 (A201–50) that are necessary and sufficient for binding. The structure of D4/A201–50 reveals the contact surface engaged in the D4/A20 interaction in great detail. Interestingly, we could show that known small molecule inhibitors of vaccinia virus DNA synthesis selected for their ability to interfere with the D4/A20 interface could be docked onto the D4 surface where they mimic several aspects of the interacting A20 molecule. Finally, we present a model of D4/A20 in complex with DNA that allows us to discuss the role of mutations affecting the D4/A20 cofactor. Altogether, our structure gives new insights into the assembly of the vaccinia virus DNA polymerase cofactor and will be useful for the design of new antiviral compounds targeting the D4/A20 interaction.
Introduction
The well-studied vaccinia virus (VACV) belongs to the orthopoxvirus genus of the family poxviridae. The Orthopoxvirus genus also comprises well-known pathogens such as monkeypox virus and cowpox virus (which can be transmitted to humans) as well as the most virulent member variola virus. Unlike other DNA viruses, orthopoxviruses replicate entirely in the cytoplasm of the infected host-cell. Viral genome synthesis takes place in perinuclear foci called viral factories and is thought to depend almost exclusively on virally encoded-proteins. Four of these proteins, presumably positioned at the replication fork, were shown to be essential for DNA synthesis [1]. For VACV these are: E9, the catalytic subunit of the DNA polymerase; D5, a DNA-independent nucleoside triphosphatase which contains a putative helicase domain [2] and primase activity [3]; D4, a uracil-DNA glycosylase (UDG) [4] and A20, a central component linking E9 and D4 [5], [6] and interacting with D5 [7], [8].
The catalytic DNA polymerase E9 alone is distributive under physiological conditions [9]. However, it becomes highly processive when bound to its heterodimeric co-factor D4/A20, forming the processive DNA polymerase holoenzyme [10]. The presence of a DNA repair protein (D4) as necessary component of the VACV replication machinery is intriguing and unusual for DNA viruses. UDGs encoded by several herpes viruses have been characterized so far and in contrast to poxviruses these were shown to behave as accessory proteins rather than essential factors during DNA synthesis. Indeed, when deletion mutants lacking UDG were built, the recombinant herpes viruses were replication-competent and viable [11]–[14]. The function of UDG is to prevent accumulations of uracil bases in DNA molecules due to misincorporation of dUTP or spontaneous cytosine deamination by excision of the uracil moiety and initiation of the base-excision repair pathway [15]. Another striking feature of D4 as component of the processivity factor is that while its presence is crucial for DNA replication, i.e knock-out mutants lacking D4 are not viable [16], [17], its glycosylase activity is dispensable [18]. These observations raised the question how D4/A20 confers processivity to E9. Data obtained from the Traktman group provided evidence in favor of a model in which the intrinsic DNA scanning activity of D4 stimulates long-chain DNA synthesis by E9 [5]. Indeed, the process of uracil search by UDG implies random diffusion on the DNA molecule in a sequence independent manner (DNA hopping and sliding) [19]. Then, the enzyme kinks and compresses the duplex DNA backbone in order to extrude each base to examine its identity (so called “pinch-push-pull” mechanism) [20]–[22]. Only uracil bases enter the uracil binding pocket allowing the hydrolysis of the glycosyl bond. Since D4 does not interact directly with E9 it is proposed that A20 forms the link between the UDG and the DNA polymerase catalytic subunit E9 [10]. In agreement with this model, our recent low-resolution structure of VACV D4/A20/E9 complex established a 150 Å separation between the polymerase active site of E9 and the DNA-binding site of D4 while the elongated A20 protein connects E9 and D4 [6].
Various mutations affecting D4 and A20 proteins have been described in the literature. These mutations were obtained from temperature-sensitive viruses isolated after mutagenesis [5], [10], [23], [24] or were engineered into the D4R and A20R genes by site-directed mutagenesis [25]–[28]. These mutants are of great importance for mechanistic insights into the VACV DNA synthesis. However, when the mutant proteins are used in a reconstituted in vitro replication system, it is often difficult to clearly explain the observed phenotype [28]. This is partially due to the lack of a high-resolution structure of the VACV replication complex allowing positioning of the mutations. So far, our knowledge about the assembly of the D4/A20/E9 complex remains very imprecise. While the N-terminal 25 amino acids of A20 are important for D4 binding [7], the region of D4 involved in A20 interaction is still unknown and no data are yet available regarding the A20/E9 interaction.
Soluble VACV D4 has been successfully expressed in bacteria. The concentrated recombinant protein forms dimers in solution and the dimeric assembly is maintained when the protein is crystallized [29]. Whether or not an oligomeric form of D4 is necessary for its activity within the VACV polymerase holoenzyme is still a matter of debate as another study suggests that virally expressed D4 would rather behave as a monomer [5]. Our low-resolution structure of D4/A20/E9 complex showed a 1∶1∶1 stoichiometry, consistent with D4 being in a monomeric state in the complex [6].
In order to further investigate the molecular structure of the VACV DNA replication machinery, we decided to study specifically the D4/A20 dimer interface. As mentioned previously, the first 25 residues of A20 have been shown to be the minimal binding region required for interaction with D4, however optimal binding was observed with the first 50 amino acids [7]. In this report, we present the co-expression and purification of VACV D4 bound to the first 50 residues of A20 (D4/A201–50). We showed that whereas D4 alone most likely forms homodimers in solution, D4/A201–50 clearly behaves as a heterodimer. Determination of the high-resolution structure of D4/A201–50 reveals that the A201–50 interaction on the D4 surface partially overlaps the D4/D4 dimer interface. A structure-based site-directed mutagenesis study allowed the identification of new residues that modulate the D4/A20 interaction. These residues could potentially be the target of poxvirus inhibitors known to interfere with the D4/A20 interaction [30]. Finally, we have proposed a model of D4/A201–50 in complex with DNA that allows us to discuss the observed phenotype of several mutants described in the literature [24], [28].
Results
Recombinant vaccinia virus His-D4/A201–50 forms a heterodimeric complex
The N-terminal domain of A20 has been shown to be necessary to interact with D4 [7], which has a tendency to dimerize when over-expressed as a recombinant protein in bacteria [29]. In order to determine the oligomeric state of a complex formed by D4 and A201–50, we co-expressed in bacteria N-terminal His-tagged D4 together with A201–50 fused to the C-terminus of the maltose binding protein (MBP), downstream of a TEV protease cleavage site. In parallel, we also expressed a recombinant form of D4 containing a TEV-cleavable vector-derived N-terminal hexa-histidine tag. Lysates containing either His-D4 alone or His-D4 and MBP-A201–50 were loaded onto a HisTrap HP column and eluted with imidazole. After TEV protease treatment, proteins were passed again over the nickel column and further purified by size exclusion chromatography. Purified recombinant D4 without His-tag migrates in SDS-PAGE according to its calculated molecular mass of 25.4 kDa (Figure 1A, lane 2). A201–50 (5.7 kDa) and His-D4 (26.7 kDa) co-eluted through the various purification steps (Figure 1A, lane 3), demonstrating complex formation between the two proteins.
In order to obtain the absolute molecular weight of recombinant D4 and His-D4/A201–50, SEC-MALLS experiments were performed (Figure 1B). D4 elutes from the gel filtration column at 11.5 mL as a broad and tailed peak. At the maximum peak height, a molecular mass of 43±2 kDa was determined. However, the measured molecular mass decreases throughout the tail of the chromatogram down to about 28 kDa. Thus, our results indicated that D4 is not monodisperse in solution (Mw/Mn = 1.011) and suggested that at high concentration (10 mg.mL−1) a fast monomer-dimer equilibrium is observed (expected molecular mass: 25.4 kDa and 50.8 kDa, respectively). In contrast, using the same experimental conditions, His-D4/A201–50 elutes as a sharp and symmetrical peak at 12.2 mL. The molecular mass of 32±2 kDa determined by MALLS is consistent with the formation of a 1∶1 heterodimeric complex (Figure 1B, theoretical molecular mass: 32.4 kDa). In addition, the ratio Mw/Mn = 1.000 obtained from the MALLS experiment indicated that the isolated complex is monodisperse in solution. Taken together, our results are consistent with the ability of D4 to form dimers in solution [29], however, when co-expressed with A201–50, His-D4/A201–50 clearly forms a tight heterodimeric complex.
Crystal structure of the His-D4/A201–50 complex
His-D4/A201–50-WT as well as His-D4/A201–50-T2A (carrying a Thr to Ala point mutation at position 2) crystallized at pH 8.7 in space group P3121 with unit-cell parameters a = b = 92.6, c = 145.9 Å (Table 1). Crystals measuring about 200×200×200 µm were obtained. Diffraction data were collected up to 1.85 Å for His-D4/A201–50-T2A and to 2.2 Å for the wild type complex (Table 1).
The structure solved by molecular replacement using a D4 monomer from Schormann et al. (pdb: 2OWQ, [29]) is shown in Figure 2A. Two His-D4/A201–50 complexes are present per asymmetric unit. All residues of A201–50 could be modelled and refined (Figure 2B) including an additional N-terminal Ala residue (Ala0) that remains after the TEV cleavage step. The structure of D4 in complex with the A20 N-terminus is identical to the revised model of free D4 (pdb entry 4DOF, chain A, rms 0.30 Å) when residues 164 to 174 are excluded from the superposition. The conformation of residues 164 to 174 shows a large variability in the different available homodimer structures but these residues are well ordered in our complex structure.
The structure of A201–50-WT is almost identical to the one of A201–50-T2A: the effect of the mutation is strictly local and affects only the mutated residue. A201–50 forms two α-helices that are packed against each other. The remaining residues connecting the two helices and the ones located at the extremities of the peptide do not show any secondary structure (Figures 2A and 2E). The loop connecting the two helices (residues 20 to 27), which is not involved in the D4/A201–50 interface shows some variability when the two complexes of the asymmetric unit are compared (data not shown). When this connector region is excluded from a superposition, the Cα atoms of the two complexes present in the asymmetric unit can be superimposed with 0.29 Å rms deviation.
Analysis of the D4/A201–50 interface
Both proteins form an extensive contact surface (1890 Å2 of buried surface, Figures 2C and 2D). The contact is strikingly flat with the exception of the prominent residue Trp43 of A20 sandwiched between Pro173 and Arg167 of D4 (Figure 2B). The contact surface is formed essentially by hydrophobic residues, but 6 hydrogen bonds confer the required specificity to the interaction (Figures 2C, 2D and 2E). In A20, two stretches of residues form the contact surface: residues 1 to 14 are located in the loop structure at the extremity of the fragment and within the 1st helix; residues 40 to 47 are located at the end of the 2nd helix and in the following loop structure. Likewise, in D4 two stretches of residues form the contact surface: residues 167–180 and 191–206 that are present within loop structures, a β-strand and a α-helix (Figures 2D and 2E).
On the A20 side of the interface the main contributing residues are Met1, Thr2, Leu7, Leu10, Leu14, Tyr17, Tyr42, Trp43, Lys44, Ile45, Gly46 and Val47, each contributing for more than 0.2 kcal.mol−1 to the binding as estimated with PISA [31] (Figures 2C and 2E). Additional residues are involved in hydrogen bonds: Ser4 and Ser40. A few more residues are located in the contact surface but contribute only marginally to binding. On the D4 side of the interface, Arg167 and Pro173 contribute to the interface whereas Ile197, Val200, Leu201 and Leu204 are the main hydrophobic contributors and Thr175, Thr176, Arg193 and Ser194 are the partners involved in hydrogen bond formation (Figures 2D and 2E).
Point mutations at the D4/A20 interface affect complex formation and stability
The crystal structure of His-D4/A201–50 highlights the importance of several Leu residues (Leu7, Leu10 and Leu14) of A20 for interaction with D4 (Figures 2C and 2E). In accordance with our findings it was previously shown that Leu to Ala mutation affecting these specific amino acids interfered with D4/A20 complex formation [5]. We then wanted to determine the contribution of the previously unknown contact generated by Trp43 of A20 that is involved in stacking interactions with Arg167 and Pro173 of D4 (Figure 2B). For this purpose, point mutations were introduced into the pETDuet-D4R-A20R1–50WT construct in order to produce three mutants: His-D4/A201–50W43A, His-D4-R167A/A201–50 and His-D4-P173G/A20R1–50. Mutant proteins were expressed and purified as their wild-type counterparts. As a control, D4 expressed alone was also purified. Figure 3 shows the elution profiles of D4, His-D4/A201–50WT and His-D4/A201–50 mutants after size exclusion chromatography: His-D4/A201–50WT elutes (as previously observed, Figure 1B) as a single peak (Figure 3, top left panel, peak 1). A similar elution profile is observed for the mutant His-D4-R167A/A201–50 (Figure 3, bottom left panel). However, chromatograms of His-D4-P173G/A20R1–50 and His-D4/A201–50W43A mutants (bottom middle and right panels) showed the presence of two distinct peaks. Proteins from peak 1 present a similar elution volume (11.7 mL) than the one observed for His-D4/A201–50WT and His-D4-R167A/A201–50, while proteins from the second peak have a lower elution volume of 10.7 mL which is identical to the one of D4 homodimer (Figure 3, top right panel, peak 2). 15% SDS-PAGE fraction analysis of the peaks is also presented in Figure 3. Fractions from peak 1 show the typical protein pattern obtained previously when His-D4 and A201–50 form the heterodimeric complex (Figure 1A). However, fractions from peak 2 clearly show an excess of D4 and little or no A201–50 peptide (Figure 3, see fractions 12 and 13). The shorter retention time of D4 from peak 2 (compared to His-D4/A201–50 from peak 1) is consistent with the presence of D4 homodimers in these fractions. Thus, our results demonstrated that A20 Trp43 and D4 Pro173 are critical amino acids at the D4/A201–50 interface and that mutations of these residues interfere with D4/A201–50 complex formation resulting in D4 homodimer assembly. In contrast, D4 Arg167 does not seem to be essential for D4/A201–50 complex formation.
To study further the behaviour of His-D4/A201–50 mutants and to compare them with the WT complex, fractions of peak 1 were pooled and resubmitted to size exclusion chromatography (Figure 4A). WT and mutant complexes elute as a single and sharp peak during this second chromatographic step. It is noteworthy that whatever the mutant, no peak with shorter retention time (see peak 2 in Figure 3) was observed during this second gel-filtration chromatography and SDS-PAGE analysis of the peak fractions showed co-elution of His-D4 and A201–50 (Figure 4A). As no D4 homodimer is formed dynamically, this indicates that once assembled, the His-D4/A201–50 complex is stable even in presence of mutations at the interface.
Thermal shift assays were performed with these re-purified complexes to monitor the thermal stability of the different mutants compared to the WT complex (Figure 4B). A Tm of 47.3±0.1°C was determined for the His-D4/A201–50WT complex. All three mutant complexes have about 4 to 5°C lower Tm, indicating that they are less stable than the WT. Thermal stability of His-D4 was also assayed in this experiment. A Tm of 38.0±0.2°C was determined and is in agreement with the result previously obtained by Nuth et al. (Tm of 38.4±1.6°C) [32]. The key role of A20 Trp43, D4 Arg167 and D4 Pro173 in the D4/A201–50 interaction is reinforced by the thermal stability data.
Discussion
It has been demonstrated that the complex formed by the VACV D4 and A20 proteins is essential to convert the distributive DNA polymerase E9 into a processive mode [5], [10]. Yet, much work remains to be done to understand the molecular mechanisms driving D4/A20 assembly and how it stimulates long-chain DNA synthesis. A first glimpse into the complex structure was obtained from the study of Schormann, et al. which has shown that bacterially expressed VACV D4 was found to be dimeric in solution and crystallized as a dimer [29]. The dimerization of D4 was intriguing since UDGs from different organisms are structurally well conserved and known to be small monomeric enzymes that do not require co-factors or even divalent cations for activity [33]. Additional biochemical and structural studies of the D4/A20 complex did not favor the model in which D4 functions as a dimer but rather suggested that within the DNA polymerase holoenzyme D4 is in a monomeric state [5], [6]. The data presented in this report strengthen this last model and explain at the molecular level how A20 prevents D4/D4 dimerization by binding to D4. The molecular mass ranging from 43 to 28 kDa obtained for D4 in the SEC-MALLS experiment is consistent with the protein existing as a mixture of monomer/dimer in solution, with a relatively large dissociation constant and fast kinetics. In contrast, when co-expressed with its partner A201–50, a D4/A201–50 complex is formed with a 1∶1 stoichiometry (molecular mass of 32 kDa, Figure 1).
Comparison of both D4/D4 and His-D4/A201–50 crystal structures indicates that the contact surfaces of D4 or A201–50 on D4 are overlapping (Figures 5A to 5D) and illustrates the difference between a specific interaction (His-D4/A201–50) and a non-specific one (D4/D4) (Figures 5C to 5E). In both interactions, most of the change in the Gibbs free energy (ΔG) upon binding is contributed by hydrophobic contacts. As the D4/A201–50 interaction surface is strikingly flat, additional interactions have to be provided in order to define the relative orientation of the two partners and to ensure specificity. This is achieved by the 6 hydrogen bonds within the interface (Figure 2C, 2D and 2E). In addition to the flat interaction surface, the D4/A20 interaction uses steric complementarity forming a tongue and groove connection involving residues Arg167 and Pro173 on D4 (the “groove”) and Trp43 on A20 (the “tongue”) (Figure 2B).
In contrast, the less than perfect match of the two binding surfaces in the D4 dimer is obvious, first from the reduction of the buried surface area (for example 1030 Å2 for the A/B dimer of pdb entry 4DOF vs. 1890 Å2 for the D4/A201–50 interface), which matches the visually much less pronounced contacts and a lesser surface complementarity (Figure 5C to 5E). Strikingly, no hydrogen bonds are involved in the contact (Figure 2F). The poor definition of the relative orientation of the two molecules in the D4/D4 dimer becomes obvious when the 9 available different crystallographic dimer structures are compared. The relative orientation of the subunits varies by 21° (Figure 5F). The dimers from pdb entry 4DOG, 2OWQ and 3NT7 cluster in one group, those from 4DOF together with 3 of the dimers from 2OWR in a second group. Finally, one dimer from pdb entry 2OWR (chains A and B) adopts an intermediate structure showing a rotation of 14° compared to the model from 4DOG (Figure 5F). Last but not least, the homodimer structures show variable conformations and a poor definition of two stretches of residues, 164–174 and 182–195, which contribute to the D4/D4 dimer contact. Overall, the results presented herein indicate that D4 is able to dimerize when over-expressed alone but in the presence of A201–50, D4/D4 interactions are prevented and the formation of a stable heterodimeric D4/A201–50 complex is favored. Thus during the course of the VACV infection the D4/A20 heterodimer forms the processivity factor of the DNA polymerase E9 and the D4 dimerization observed in vitro is likely to be an artifact.
Previous results from the Moss group showed that the minimal binding region necessary to bind to D4 resided within the N-terminal 25 amino acids of A20, although full binding was only observed when D4 was expressed together with the first 50 residues of A20 [7]. This suggested that additional residues located between amino acids 25 and 50 might be important for the D4/A20 interaction, a result fully confirmed by the structure of the D4/A20 interface. More recently, the highly conserved leucine residues within the first 25 amino acids of A20 were shown to be critical for the interaction [5]. Indeed, our crystal structure confirmed that Leu7, Leu10 and Leu14 played a key role in the D4/A20 interaction, whereas Leu13 and Leu16 rather form the hydrophobic core of the A20 fragment and are part of the contact between the two helices. Most importantly, the structure identifies a second contact in the D4/A20 complex, involving A20 Trp43 stacked between D4 Arg167 and Pro173. To determine if these residues are important for D4/A20 binding, we have mutated them individually and showed that each of these mutations had a significant negative effect on complex formation and stability. Mutations of the residues forming the tongue and groove interaction all lead to a reduced complex stability in a thermal shift assay and to formation of D4 homodimers for the mutants D4 Pro173Gly and A20 Trp43Ala. The effect of these mutations underlines the importance of this second binding site involving residues outside the initially identified residues 1–25 of A20. Thus, the results from Ishii et al. together with the data presented here strongly indicated that all the determinants for D4 binding are located within the first 50 residues of A20 [7].
VACV D4 shows about 20% sequence identity at the protein level with the human UDG and the overall structures are very similar [29]. In order to obtain a model of the D4/A201–50 complex interacting with DNA, we superimposed D4 onto the human UDG bound to DNA [20] (Figures 6A and 6B). The DNA binding site on D4 is clearly distinct from the D4/A20 interface as the DNA binds on a different side of D4. This is consistent with the fact that D4 exists as a catalytically active enzyme within the DNA polymerase holoenzyme [5]. Sequence alignment of VACV D4 with the related UDGs from herpes simplex virus type 1 (HSV-1) and human identified conserved active-site residues [18], [34]. These are Tyr70, Phe79 and Asn120 which are predicted to form the uracil recognition pocket in addition to Asp68 and His181 that are needed for glycosyl bond cleavage (Figure 6C).
So far, two vaccinia viruses (Dts30 and Dts27) with temperature sensitive defect in DNA synthesis caused by mutation in the D4R gene have been isolated [24]. The mutation in D4 Dts30 leads to a Gly179Arg substitution [34] while the D4 ORF of Dts27 contains a Leu110Phe substitution [10]. The Gly179Arg mutation was characterized and presented a default in D4/A20 assembly [10]. The mutated residue is located at the D4/A20 interface where the replacement of Gly179 by a bulky Arg residue right in the center of the interface will certainly weaken the interaction (Figure 2D, arrow). In contrast, the Leu110Phe substitution observed in Dts27 did not disturb D4/A20 interaction [10]. The Leu residue is located in the hydrophobic core of the protein, away from the D4/A20 interface and the DNA binding domain of D4 (not shown). Molecular modeling shows that the bulkyer aromatic side chain of Phe cannot be accommodated and will weaken considerably the packing of the hydrophobic core of D4 leading to the temperature-sensitive phenotype.
In a recent study, point mutants were generated in the D4 coding region and tested for their ability to function in processivity and to maintain UDG catalytic activity [28]. Three of these mutants (Lys126Val, Lys160Val and Arg187Val) did not function in processive DNA synthesis but retained binding to A20 and to DNA as well as glycosylase activity. Lys126, Lys160 and Arg187 are shown on a close up view of the DNA/D4/A201–50 model together with structural homolog from the human UDG (Figure 6B). Interestingly, even though the three VACV residues are not directly in contact with DNA, all are located in the vicinity of the double helix and may be involved in DNA binding. It is known that the UDG region close to the DNA shows structural rearrangements upon DNA binding [20]. Thus, it is reasonable to postulate that binding of D4 to DNA may induce some local conformational changes which may bring Lys126, Lys160 and Arg187 in contact with the VACV DNA genome. The loss of these contacts might explain the observed phenotype [28]. In the same study, several residues of the interface have been mutated to Ala: these are residues interacting through hydrogen bonds with A20 (Thr175, Thr176 and Ser194, Figures 2D and 2E). The mutants did not show any phenotype, certainly due to the minor importance of losing a single hydrogen bond to the binding affinity.
Inhibitors of protein-protein interaction have emerged as a new tool to modulate protein functions within various classes of targets [35]. We and others believe that molecules interfering with D4/A20 interaction could be attractive new anti-poxvirus compounds [30], [32], [36]–[38]. In their study, Schormann et al. have performed an in vitro screen allowing the identification of several molecules interfering with His-D4/MBP-A201–100 [30]. They further showed that the selected compounds exhibited both antiviral activity and binding to D4. In an attempt to determine if some molecules presented in this study could interact with the D4/A201–50 interface, we generated three-dimensional models of D4 in complex with some of these small molecules inhibitors using a molecular docking approach based on the Surflex algorithm [39]. Among the 5 compounds from the study of Schormann et al [30] (molecules #1, #6, #9, #12 and #15) that were evaluated by the docking algorithm, several Surflex models generated for compounds #1, #9 and #15 reproduced key contact features identified in the D4/A201–50 complex (Figure S1). Interestingly, all these inhibitors contain a hydrophobic phenyl ring derivative (methylphenyl for compound #1, bromophenyl for #9 and fluorophenyl for #15) that is predicted in these models to mimic the steric complementarity tongue/groove interaction between the A20 Trp43 and D4 Arg167 and Pro173. Further experiments with the above mentioned compounds will be necessary to verify the validity of the proposed models. Future work will allow lead optimization and/or de novo design of small molecule inhibitors from the D4/A201–50 structure.
Materials and Methods
Construction of plasmids expressing His-D4 and His-D4/MBP-A201–50
Full length D4R gene from VACV (Copenhagen strain) was amplified from viral genome using primers 1 and 2 or primers 3 and 4 (Table 2). The PCR fragments were digested and ligated either into the cleaved pPROEX HTb vector (Life Technologies) or into the cleaved pETDuet-1 plasmid (Novagen), for single or co-expression purpose, respectively (Table 2).
DNA encoding the first 50 amino acids of VACV A20 (Copenhagen strain) was PCR-amplified from viral genome with primers 5 and 6 (Table 2) and introduced into the pETM-40 vector (EMBL), downstream of the maltose binding protein (MBP) gene. The sequence encoding the MBP/A201–50 fusion protein was then amplified with primers 7 and 8 and ligated into the digested pETDuet-1 plasmid carrying the VACV D4R gene. Due to the NcoI restriction site used to clone the A20 DNA fragment into the pETM-40 plasmid, a Thr to Ala mutation was introduced in A201–50 at position 2 (T2A). In order to express the WT A201–50 peptide, the pETDuet-D4R-A20R1–50T2A construct was PCR-amplified using the Phusion Site-Directed Mutagenesis protocol (Thermo Scientific) and phosphorylated primers 9 and 10 (Table 2). The D4/A201–50 mutants described in this study were all engineered using the same protocol starting with the construct pETDuet-D4R/A20R1–50WT and the phosphorylated primers 11–16 shown in Table 2. The DNA sequence of each construct was verified by automated DNA sequencing.
Expression and purification of His-D4/MBP-A201–50 and His-D4
The construct pETDuet-D4R/A20R1–50 allows the expression of a non-cleavable N-terminal His-tagged D4 together with A201–50 fused to the C-terminus of the maltose binding protein (MBP), downstream of a TEV protease cleavage site. The recombinant pETDuet-D4R/A20R1–50 was transformed into Escherichia coli BL21(DE3) strain. An isolated colony was inoculated into LB medium containing carbenicillin (50 µg.mL−1), overnight at 37°C. The culture was diluted to 1/1000th into LB medium supplemented with carbenicillin and bacteria were grown until OD600 reached 0.4–0.6. The culture was then transferred to 18°C for 30 min before induction of protein expression with 0.1 mM of isopropyl β-D-1-thiogalactopyranoside. Bacterial growth was pursued for an additional 16-hour period at 18°C. The culture was harvested by centrifugation and the bacterial pellet was suspended in the following buffer: 50 mM Tris-HCl pH 7.5, 100 mM NaCl, 10 mM Imidazole, 5 mM β-mercaptoethanol and cOmplete, EDTA-free protease inhibitor cocktail (Roche). Bacteria were lysed by sonication (500 ms pulse at 300 W during 5 min at 4°C) and the supernatant was recovered after centrifugation at ∼40,000 g for 30 min at 4°C. Proteins were then loaded onto a 5 mL HisTrap HP column (GE Healthcare) equilibrated with 50 mM Tris-HCl pH 7.5, 100 mM NaCl, 10 mM imidazole. The column was washed with equilibration buffer and proteins were eluted with the same buffer containing 200 mM imidazole. Fractions containing the His-D4/MBP-A201–50 complex were pooled, desalted on a PD10 column (GE Healthcare) in buffer containing 50 mM Tris-HCl pH 7.5, 100 mM NaCl and treated with Tobacco Etch Virus (TEV) protease at a ratio of 1/100 (w/w), during 16 h at 20°C. The His-D4/A201–50 complex was loaded again onto a 5 mL HisTrap HP column (GE Healthcare) and eluted as described above. Proteins were further purified on a size exclusion chromatography (Superdex 75 10/300 GL, GE Healthcare) equilibrated in 50 mM Tris-HCl pH 7.5, 100 mM NaCl. His-D4/A201–50 was concentrated to 8 mg.mL−1 prior to crystallization trials. His-D4/MBP-A201–50 mutants described in this report were all purified as the wild type complex.
To express His-D4, E. coli Rosetta (DE3)pLysS strain (Novagen) was transformed with the pPROEX-D4R vector. This vector allows the expression of a TEV-cleavable N-terminal hexa-histidine tagged version of D4. Protein expression was essentially performed as for His-D4/MBP-A201–50 except that the culture was grown in the presence of carbenicillin (50 µg.mL−1) and chloramphenicol (34 µg.mL−1). Bacteria were suspended in a buffer: 25 mM Tris-HCl pH 7.5, 300 mM NaCl, 20 mM imidazole, 5 mM β-mercaptoethanol and cOmplete, EDTA-free protease inhibitor cocktail (Roche) and were lysed by three cycles of freezing and thawing followed by sonication (500 ms pulse at 300 W during 5 min at 4°C). Cleared cell lysate (obtained after centrifugation at ∼40,000 g for 30 min at 4°C) was loaded onto a 5 mL HisTrap HP column (GE Healthcare) and His-D4 was eluted using a 20–200 mM imidazole gradient. The purified protein was desalted on a PD10 column (GE Healthcare) in buffer containing 25 mM Tris-HCl pH 7.5, 300 mM NaCl. The His-tag was subsequently cleaved by the TEV protease as described above. Recombinant D4 was recovered from the flow through fraction of a HisTrap HP column (GE Healthcare) and further purified by size exclusion chromatography (Superdex 75 10/300 GL, GE Healthcare) equilibrated in 25 mM Tris-HCl pH 7.5, 100 mM NaCl. Purified proteins were analyzed on SDS-PAGE and stained with InstantBlue (Expedeon).
SEC (Size exclusion chromatography)-MALLS (multi-angle laser light scattering) experiments
SEC was performed with a Superdex 75 10/300 GL (GE Healthcare) equilibrated in 50 mM Tris-HCl pH 7.5, 100 mM NaCl. Separations were performed at 20°C with a flow rate of 0.5 mL.min−1. 50 µL of a protein solution at a concentration of 10 mg.mL−1 were injected. On-line MALLS detection was performed with a DAWN-EOS detector (Wyatt Technology Corp., Santa Barbara, CA) using a laser emitting at 690 nm. Protein concentration was measured on-line by refractive index measurements using a RI2000 detector (Schambeck SFD) and a refractive index increment dn/dc = 0.185 mL.g−1. Data were analyzed and weight-averaged molecular masses (Mw) were calculated using the software ASTRA V (Wyatt Technology Corp., Santa Barbara, CA) as described previously [40].
Thermal shift assay
Experiments were performed in 96-well non-skirted PCR plates (Thermo Scientific). Each 20 µL reactions were carried out in 50 mM Tris-HCl pH 7.5, 100 mM NaCl containing 1 µM D4 or His-D4/A201–50 and 5× Sypro Orange (Molecular Probes). Plates were closed with Microseal B Adhesive Seal (BioRad) and placed into a Mx3005P qPCR system (Stratagene). A temperature increment of 1°C.min−1 was applied from 25 to 75°C. Temperature-induced protein unfolding was monitored by measuring the fluorescence signal at 570 nm (with excitation at 472 nm) and data were processed using the MxPro software. Denaturation curves were normalized and the inflection points were used to determine the melting temperature (Tm). Experiments were performed in triplicate and repeated at least twice.
Crystallization and data collection
His-D4/A201–50 crystals were initially grown at the High-throughput crystallisation laboratory (HTX Lab, EMBL, Grenoble) using the sitting-drop vapour-diffusion technique as previously described [41]. The complex crystallized at 20°C in Grid Screen Ammonium Sulfate (Hampton Research) condition B6: 0.1 M Bicine pH 9, 1.6 M ammonium sulfate. Conditions were manually optimized and the best crystals were observed in 0.1 M Bicine pH 8.7, 1.5 M ammonium sulfate with a 1.5 µL∶1.5 µL protein∶reservoir drop ratio.
Prior to data-collection, crystals were successively transferred in 10% (v/v) and 20% (v/v) glycerol/reservoir cryo-solution before flash-cooling in liquid nitrogen. Data sets were collected from crystals on beamlines ID23-1 and ID14-4 at the European Synchrotron Radiation Facility (ESRF, Grenoble, France). Data were processed with iMOSFLM [42], and scaled with the program SCALA from the CCP4 suite [43].
Phase determination, refinement and structure analysis
The structure was solved by molecular replacement using a D4 monomer from pdb file 2OWQ [29] as a search model in PHASER [44]. Clear extra density corresponding to A201–50 was readily visible. The model was manually modified using COOT [45] and refined using REFMAC5 [46]. Refinement statistics and model composition are shown in Table 1. Structure superimposition was performed using CCP4MG [47], protein-protein interactions were analysed with PISA [31] and visually checked using PYMOL (The PyMOL Molecular Graphics System, Version 1.4.1 Schrödinger, LLC). Detailed analysis of monomer orientation within the dimers was performed using LSQKAB [48]. All structure-related figures were generated with PYMOL.
Molecular docking
Compounds #1, #6, #9, #12 and #15 from Schormann et al [30] were drawn and cleaned up using the SYBYL 2.0 sketch module (Tripos, Inc.). Partial charges were computed using the Gasteiger-Marsili algorithm as implemented in Sybyl and final 3D compound structures suitable for further molecular modeling studies were generated by applying a final quick energy minimization step (MMF94 force field, 500 iterations and using Sybyl default parameters). Compounds #1, #6 and #15 contain an asymmetric carbon and the resulting stereoisomers were also drawn and treated using the same protocol. Protein/compound complexes models were generated using Surflex 2.6 [39] as implemented in SYBYL-X 2.0. D4 target protein structure was extracted from the His-D4/A201–50 complex X-ray structure and prepared using Sybyl structure preparation tool (default parameters). The docking area entered as Surflex input parameter to generate the protomol was defined by visual investigation of D4/A20 interface and by selecting D4 residues 61, 63, 99, 148, 149, 151–157, 159, 160, 161, 163, 164, 166–168, 170, 172–181, 184, 188–207 and 209. The protomol was automatically generated after setting the threshold and bloat values to 0.5 and 0. For every compound, 20 docking poses were generated and clustered by families according to the docking zone selected at the surface of D4. Docking poses were visually inspected to identify similarities with D4/A201–50 interaction.
Supporting Information
Zdroje
1. Boyle K, Traktman P (2009) Poxviruses. In Cameron, C.E., Gotte, M., Raney, K.D., editors. Viral Genome Replication. New York: Springer. pp. 225–246.
2. BoyleKA, ArpsL, TraktmanP (2007) Biochemical and genetic analysis of the vaccinia virus d5 protein: Multimerization-dependent ATPase activity is required to support viral DNA replication. J Virol 81: 844–859.
3. De SilvaFS, LewisW, BerglundP, KooninEV, MossB (2007) Poxvirus DNA primase. Proc Natl Acad Sci U S A 104: 18724–18729.
4. UptonC, StuartDT, McFaddenG (1993) Identification of a poxvirus gene encoding a uracil DNA glycosylase. Proc Natl Acad Sci U S A 90: 4518–4522.
5. BoyleKA, StanitsaES, GresethMD, LindgrenJK, TraktmanP (2011) Evaluation of the role of the vaccinia virus uracil DNA glycosylase and A20 proteins as intrinsic components of the DNA polymerase holoenzyme. J Biol Chem 286: 24702–24713.
6. SeleC, GabelF, GutscheI, IvanovI, BurmeisterWP, et al. (2013) Low-resolution structure of vaccinia virus DNA replication machinery. J Virol 87: 1679–1689.
7. IshiiK, MossB (2002) Mapping interaction sites of the A20R protein component of the vaccinia virus DNA replication complex. Virology 303: 232–239.
8. McCraithS, HoltzmanT, MossB, FieldsS (2000) Genome-wide analysis of vaccinia virus protein-protein interactions. Proc Natl Acad Sci U S A 97: 4879–4884.
9. McDonaldWF, TraktmanP (1994) Vaccinia virus DNA polymerase. In vitro analysis of parameters affecting processivity. J Biol Chem 269: 31190–31197.
10. StanitsaES, ArpsL, TraktmanP (2006) Vaccinia virus uracil DNA glycosylase interacts with the A20 protein to form a heterodimeric processivity factor for the viral DNA polymerase. J Biol Chem 281: 3439–3451.
11. WardTM, WilliamsMV, Traina-DorgeV, GrayWL (2009) The simian varicella virus uracil DNA glycosylase and dUTPase genes are expressed in vivo, but are non-essential for replication in cell culture. Virus Res 142: 78–84.
12. MullaneyJ, MossHW, McGeochDJ (1989) Gene UL2 of herpes simplex virus type 1 encodes a uracil-DNA glycosylase. J Gen Virol 70 (Pt 2) 449–454.
13. ReddySM, WilliamsM, CohenJI (1998) Expression of a uracil DNA glycosylase (UNG) inhibitor in mammalian cells: varicella-zoster virus can replicate in vitro in the absence of detectable UNG activity. Virology 251: 393–401.
14. PrichardMN, DukeGM, MocarskiES (1996) Human cytomegalovirus uracil DNA glycosylase is required for the normal temporal regulation of both DNA synthesis and viral replication. J Virol 70: 3018–3025.
15. ChenR, WangH, ManskyLM (2002) Roles of uracil-DNA glycosylase and dUTPase in virus replication. J Gen Virol 83: 2339–2345.
16. MillnsAK, CarpenterMS, DeLangeAM (1994) The vaccinia virus-encoded uracil DNA glycosylase has an essential role in viral DNA replication. Virology 198: 504–513.
17. HolzerGW, FalknerFG (1997) Construction of a vaccinia virus deficient in the essential DNA repair enzyme uracil DNA glycosylase by a complementing cell line. J Virol 71: 4997–5002.
18. De SilvaFS, MossB (2003) Vaccinia virus uracil DNA glycosylase has an essential role in DNA synthesis that is independent of its glycosylase activity: catalytic site mutations reduce virulence but not virus replication in cultured cells. J Virol 77: 159–166.
19. SchonhoftJD, KosowiczJG, StiversJT (2013) DNA translocation by human uracil DNA glycosylase: role of DNA phosphate charge. Biochemistry 52: 2526–2535.
20. ParikhSS, MolCD, SlupphaugG, BharatiS, KrokanHE, et al. (1998) Base excision repair initiation revealed by crystal structures and binding kinetics of human uracil-DNA glycosylase with DNA. EMBO J 17: 5214–5226.
21. ParikhSS, PutnamCD, TainerJA (2000) Lessons learned from structural results on uracil-DNA glycosylase. Mutat Res 460: 183–199.
22. WongI, LundquistAJ, BernardsAS, MosbaughDW (2002) Presteady-state analysis of a single catalytic turnover by Escherichia coli uracil-DNA glycosylase reveals a “pinch-pull-push” mechanism. J Biol Chem 277: 19424–19432.
23. StuartDT, UptonC, HigmanMA, NilesEG, McFaddenG (1993) A poxvirus-encoded uracil DNA glycosylase is essential for virus viability. J Virol 67: 2503–2512.
24. DalesS, MilovanovitchV, PogoBG, WeintraubSB, HuimaT, et al. (1978) Biogenesis of vaccinia: isolation of conditional lethal mutants and electron microscopic characterization of their phenotypically expressed defects. Virology 84: 403–428.
25. IshiiK, MossB (2001) Role of vaccinia virus A20R protein in DNA replication: construction and characterization of temperature-sensitive mutants. J Virol 75: 1656–1663.
26. KlempererN, McDonaldW, BoyleK, UngerB, TraktmanP (2001) The A20R protein is a stoichiometric component of the processive form of vaccinia virus DNA polymerase. J Virol 75: 12298–12307.
27. PunjabiA, BoyleK, DeMasiJ, GrubishaO, UngerB, et al. (2001) Clustered charge-to-alanine mutagenesis of the vaccinia virus A20 gene: temperature-sensitive mutants have a DNA-minus phenotype and are defective in the production of processive DNA polymerase activity. J Virol 75: 12308–12318.
28. Druck ShudofskyAM, SilvermanJE, ChattopadhyayD, RicciardiRP (2010) Vaccinia virus D4 mutants defective in processive DNA synthesis retain binding to A20 and DNA. J Virol 84: 12325–12335.
29. SchormannN, GrigorianA, SamalA, KrishnanR, DeLucasL, et al. (2007) Crystal structure of vaccinia virus uracil-DNA glycosylase reveals dimeric assembly. BMC Struct Biol 7: 45.
30. SchormannN, SommersCI, PrichardMN, KeithKA, NoahJW, et al. (2011) Identification of protein-protein interaction inhibitors targeting vaccinia virus processivity factor for development of antiviral agents. Antimicrob Agents Chemother 55: 5054–5062.
31. KrissinelE, HenrickK (2007) Inference of macromolecular assemblies from crystalline state. J Mol Biol 372: 774–797.
32. NuthM, HuangL, SawYL, SchormannN, ChattopadhyayD, et al. (2011) Identification of inhibitors that block vaccinia virus infection by targeting the DNA synthesis processivity factor D4. J Med Chem 54: 3260–3267.
33. PearlLH (2000) Structure and function in the uracil-DNA glycosylase superfamily. Mutat Res 460: 165–181.
34. EllisonKS, PengW, McFaddenG (1996) Mutations in active-site residues of the uracil-DNA glycosylase encoded by vaccinia virus are incompatible with virus viability. J Virol 70: 7965–7973.
35. MullardA (2012) Protein-protein interaction inhibitors get into the groove. Nat Rev Drug Discov 11: 173–175.
36. FlusinO, SaccucciL, Contesto-RichefeuC, HamdiA, BardouC, et al. (2012) A small molecule screen in yeast identifies inhibitors targeting protein-protein interactions within the vaccinia virus replication complex. Antiviral Res 96: 187–195.
37. SaccucciL, CranceJM, ColasP, BickleM, GarinD, et al. (2009) Inhibition of vaccinia virus replication by peptide aptamers. Antiviral Res 82: 134–140.
38. SilvermanJE, CiusteaM, ShudofskyAM, BenderF, ShoemakerRH, et al. (2008) Identification of polymerase and processivity inhibitors of vaccinia DNA synthesis using a stepwise screening approach. Antiviral Res 80: 114–123.
39. JainAN (2003) Surflex: fully automatic flexible molecular docking using a molecular similarity-based search engine. J Med Chem 46: 499–511.
40. GerardFC, Ribeiro EdeAJr, AlbertiniAA, GutscheI, ZaccaiG, et al. (2007) Unphosphorylated rhabdoviridae phosphoproteins form elongated dimers in solution. Biochemistry 46: 10328–10338.
41. DimasiN, FlotD, DupeuxF, MarquezJA (2007) Expression, crystallization and X-ray data collection from microcrystals of the extracellular domain of the human inhibitory receptor expressed on myeloid cells IREM-1. Acta Crystallogr Sect F Struct Biol Cryst Commun 63: 204–208.
42. BattyeTG, KontogiannisL, JohnsonO, PowellHR, LeslieAG (2011) iMOSFLM: a new graphical interface for diffraction-image processing with MOSFLM. Acta Crystallogr D Biol Crystallogr 67: 271–281.
43. WinnMD, BallardCC, CowtanKD, DodsonEJ, EmsleyP, et al. (2011) Overview of the CCP4 suite and current developments. Acta Crystallogr D Biol Crystallogr 67: 235–242.
44. McCoyAJ, Grosse-KunstleveRW, AdamsPD, WinnMD, StoroniLC, et al. (2007) Phaser crystallographic software. J Appl Crystallogr 40: 658–674.
45. EmsleyP, LohkampB, ScottWG, CowtanK (2010) Features and development of Coot. Acta Crystallogr D Biol Crystallogr 66: 486–501.
46. MurshudovGN, SkubakP, LebedevAA, PannuNS, SteinerRA, et al. (2011) REFMAC5 for the refinement of macromolecular crystal structures. Acta Crystallogr D Biol Crystallogr 67: 355–367.
47. McNicholasS, PottertonE, WilsonKS, NobleME (2011) Presenting your structures: the CCP4mg molecular-graphics software. Acta Crystallogr D Biol Crystallogr 67: 386–394.
48. KabschW (1976) Solution for Best Rotation to Relate 2 Sets of Vectors. Acta Crystallographica Section A 32: 922–923.
Štítky
Hygiena a epidemiologie Infekční lékařství LaboratořČlánek vyšel v časopise
PLOS Pathogens
2014 Číslo 3
- Perorální antivirotika jako vysoce efektivní nástroj prevence hospitalizací kvůli COVID-19 − otázky a odpovědi pro praxi
- Diagnostický algoritmus při podezření na syndrom periodické horečky
- Stillova choroba: vzácné a závažné systémové onemocnění
- Jak souvisí postcovidový syndrom s poškozením mozku?
- Diagnostika virových hepatitid v kostce – zorientujte se (nejen) v sérologii
Nejčtenější v tomto čísle
- Cytomegalovirus m154 Hinders CD48 Cell-Surface Expression and Promotes Viral Escape from Host Natural Killer Cell Control
- Human African Trypanosomiasis and Immunological Memory: Effect on Phenotypic Lymphocyte Profiles and Humoral Immunity
- DHX36 Enhances RIG-I Signaling by Facilitating PKR-Mediated Antiviral Stress Granule Formation
- Conflicting Interests in the Pathogen–Host Tug of War: Fungal Micronutrient Scavenging Versus Mammalian Nutritional Immunity