Cyclic AMP is a major trigger of the differentiation process of Trypanosoma brucei, a bloodstream parasite causing sleeping sickness. Its generation in trypanosomes is accomplished by a unique battery of membrane‐bound adenylate cyclases (ACs). We have determined the high‐resolution X‐ray structures of the catalytic domains of two trypanosomal ACs (tACs), GRESAG4.1 and GRESAG4.3. The tAC domains are structurally highly related to the AC domains of higher eukaryotes, but also comprise a highly conserved structural element near the active site, the Δ‐subdomain. A cavity below the Δ‐subdomain might correspond to an allosteric regulator site as indicated by the stereospecific binding of a single (2S,3S)‐1,4‐ dimercapto‐2,3‐butanediol molecule. In three different crystal forms, the tAC domains are exclusively observed in a monomeric, catalytically inactive state. Biochemical analysis and the mutagenesis profile of GRESAG4.1 confirmed a common catalytic mechanism of tACs that involves transient dimerization of the AC domain. A low dimerization tendency might play a regulatory role in T.brucei if the activation of tACs is similarly driven by ligand‐induced dimerization as in membrane‐bound guanylate cyclases.
Adenylate cyclases (ACs) play a crucial role in the signal transduction pathways of many eukaryotic and eubacterial organisms (Barzu and Danchin, 1994; Sunahara et al., 1996) by converting ATP to the second messenger adenosine 3′,5′‐monophosphate (cAMP). cAMP mediates cellular responses to numerous extracellular signals ranging from hormone or neurotransmitter stimuli to changes in nutritional conditions, and even constitutes itself an extracellular signal for the life cycle control of some eubacteria and protozoa. Based on their topologies, ACs can be subdivided into four classes (Tang and Gilman, 1992). Class I cyclases are predominantly found in metazoans and comprise a 12‐transmembrane helix motif with two cytosolic regions that contain the catalytically active portion. In mammals, these rather complex cyclases occur in nine different isozymes (AC1–AC9) and represent the key effectors of many hormones that bind to G‐protein‐coupled receptors. Class II cyclases occur exclusively in protozoa and consist of an extracellular domain with putative receptor function, a single transmembrane span and a cytosolic catalytic region. Finally, class III cyclases are large, solely membrane‐associated proteins in fungi while the class IV cyclases of eubacteria and mammalian testis are soluble proteins.
The enzymatic activities of ACs and of the related guanylate cyclases (GCs) originate from homologous catalytic domains that are 220–250 amino acids in length, respectively. Until now, most biochemical and biophysical studies focused on mammalian class I enzymes (Tang and Gilman, 1992; Sunahara et al., 1996; Hurley, 1998), which greatly benefitted from the construction of soluble fragments of these cyclases retaining most of their catalytic and regulatory properties (Tang and Gilman, 1995). In class I enzymes, the intramolecular association of two homologous AC/GC domains, C1A and C2A, to a pseudo‐heterodimer is required for the formation of a catalytic site (Dessauer and Gilman, 1996; Whisnant et al., 1996). The binding of the diterpene forskolin or regulatory G‐protein subunits modulates the activity of class I enzymes, presumably by affecting their C1/C2 association. Low‐ to medium‐resolution X‐ray crystallographic studies on a forskolin‐linked, but mostly inactive C2A homodimer (Zhang et al., 1997b), and a catalytically competent, quarternary complex made of GSα, C1A, C2A and forskolin (Tesmer et al., 1997), demonstrated that the active site of class I ACs resides along the C1–C2 interface. Subsequently, it was concluded from biochemical data, sequence comparison and modelling studies that other nucleotidyl cyclases might function as hetero‐ or homodimeric entities as well (Liu et al., 1997).
Class II ACs occur as multiple isozymes in Trypanosoma brucei, a bloodstream parasite and the causative agent of sleeping sickness in humans and Nagana in cattle (Alexandre et al., 1990). Trypanosomal ACs (tACs) are either transiently expressed in the mammalian bloodstream like the ESAG4 isozymes (expression site associated gene 4) or constitutively throughout the life cycle like the isozymes GRESAG4.1, GRESAG4.2 and GRESAG4.3 (genes related to ESAG). In the bloodstream, T.brucei undergoes a transition from a proliferating long slender form to the non‐dividing short stumpy form that is triggered by the activation of tACs and subsequent increases of cytosolic cAMP levels (Vassella et al., 1997). This bloodstream transition, a prerequisite for the transfer of the parasite from the mammalian host to the insect vector, is accompanied by further metabolic changes: long slender forms metabolize glucose only to pyruvate, which is then secreted. In contrast, the mitochondria of short stumpy forms acquire the capability to degrade pyruvate in the Krebs cycle, which becomes fully operational in the procyclic insect stage (Tielens and Van Hellemond, 1998; Matthews, 1999). Currently, it is not known whether cAMP signaling and the metabolic changes are directly linked to each other in vivo (Tielens and Van Hellemond, 1998), and what kind of activating ligands play a role therein. Apart from a 2‐ to 8‐fold calcium‐mediated stimulation of the tAC activity of bloodstream forms, tACs are known to be insensitive to the stimulation by G‐proteins or other agents that commonly affect the catalytic activity of mammalian class I ACs (Rolin et al., 1996).
Here, we report the structures of the catalytically active AC domains of the trypanosomal isozymes GRESAG4.1 and GRESAG4.3 at 1.46 and 1.9 Å resolution, respectively. Unlike previous structures from class I ACs the catalytic domains of tACs crystallize only in their monomeric state. Furthermore, an internal cavity, occupied by (2S,3S)‐1,4‐dimercapto‐2,3‐butanediol (d‐DTT) was located in the protein structure. The presence of this binding site and of a unique trypanosomal insertion, the Δ‐subdomain, in a regulatory important region of the AC domain suggests that tACs might be allosterically controlled by small metabolites like some eubacterial ACs.
Results and discussion
Biochemical characterization of tACs
The gene fragments coding for the cytosolic region of GRESAG4.3 (residues A867–V1229) and ESAG4 (residues P897–R1269) were amplified from genomic DNA of the T.brucei strain 927, cloned and expressed in Escherichia coli, as previously described for GRESAG4.1 (Bieger and Essen, 2000). Unlike the recombinant C2A domains from mammalian ACs, which exhibit only marginal catalytic activities when assayed without accompanying C1A domains (Zhang et al., 1997a), the cytosolic regions of these tACs (mol. wt 40.2–41.9 kDa) are catalytically competent with specific activities of 4–5 nmol/min/mg. The tAC fragments show classical Michaelis–Menten kinetics, with KMATP values of 17–25 μM and Vmax values of 3–10 nmol/min/mg (Table I), which are similar to non‐stimulated, mammalian class I enzymes (Tang and Gilman, 1995; Dessauer and Gilman, 1996). Like the latter, the tAC fragments also exhibit catalytic activity only in the presence of divalent cations, with manganese being more effective as a cofactor than magnesium. For example, the maximal catalytic activity of GRESAG4.1‐(A884–Y1241) is 9‐fold higher at saturating manganese concentrations (≈6 mM) than in the presence of 40 mM magnesium (data not shown).
Only the first 230–240 residues of the cytosolic tAC fragments are homologous to other AC domains including the C1A and C2A domains of mammalian ACs. The remaining C‐terminal region of ∼110 residues is only found in the ACs from trypanosomes and Leishmania donovanii and is not homologous to any other protein. Compared with the tAC domains, which are highly conserved in GRESAG4.1–4.3 and ESAG4 (83–87% sequence identity), the C‐terminal regions are rather variable (54–78% sequence identity). The function of these variable C‐terminal regions is still unknown in tACs. However, proteolytical removal of the variable C‐terminal region from the isozyme GRESAG4.1 has only minor effects on the catalytic activity (Table I): the KMATP of the digestion product GRESAG4.1‐(A884–T1131) is slightly increased from 16.6 to 29.5 μM and the protein shows the same dependency on manganese and magnesium as GRESAG4.1‐(A884–Y1241). Consequently, a regulatory role for the C‐terminal region on tAC activity is only feasible if this region recruits additional regulatory cofactors from the trypanosomal cytosol or membrane. In contrast to previous in vivo studies (Rolin et al., 1996), a stimulation of tAC activity by trypanosomal calmodulin and/or calcium was not detected, either for the AC domain of ESAG4, the bloodstream‐specific AC or for the AC domains of GRESAG4.1 or GRESAG4.3 (data not shown).
Overall structure of tACs
After removal of the variable C‐terminal region, high‐quality crystals of the AC domains of the isozymes GRESAG4.1 and GRESAG4.3 were readily obtained under a variety of conditions (see Materials and methods; Bieger and Essen, 2000). Orthorhombic crystals of the catalytic domain of GRESAG4.1 (residues A884–T1131) diffracted to 1.4 Å resolution under cryogenic conditions, while tetragonal crystals of GRESAG4.3‐(A867–T1118) similarly attained 1.9 Å resolution. The structure of GRESAG4.1 was solved by multiple isomorphous replacement (MIR) using three heavy‐atom derivatives (Table II). The final model of GRESAG4.1‐(A884–T1131) refined with data at 1.45 Å resolution includes the residues N888–A1122, one molecule of d‐DTT, one sulfate ion and 276 water molecules. Seven, mostly solvent‐exposed, residues were observed in multiple conformers. The high quality of the final electron density map can be assessed from Figure 1C. Subsequently, the structure of the AC domain of GRESAG4.3 was determined by molecular replacement using the GRESAG4.1 AC domain as a search model (Table II).
GRESAG4.1 and GRESAG4.3 show high structural similarity to each other with a root‐mean‐square deviation (r.m.s.d.) of only 0.67 Å for 227 equivalent Cα positions. Structural differences are only found at the tip of the β‐hairpin motif β5–β6 (residues E1036–K1039), the turn between α3A and α3B (residues C998–D1008) and at the very C‐terminus. Both tACs share the overall structure of a seven‐stranded β‐sheet that is joined at its back by two parallel helices, α2 and α3, and at its front by helix α4 crossing β‐strands β1–β4 (Figure 1A and B). Together with the strands β1–β4, the helices α1–α3 form a doubly split βαββαβ sandwich, which is additionally intermitted between α3 and β4 by helices α3A and α3B. This βαββαβ motif, previously coined a palm domain (Artymiuk et al., 1997), forms most of the catalytic surface.
Comparison of tACs with related proteins
Although molecular replacement with the mammalian C1A and C2A domains failed for the structure determination of tACs, the tACs are structurally highly related to the AC domains of class I ACs. A superposition of GRESAG4.1 on the C1A domain of canine ACV gives an r.m.s.d. of 1.45 Å for 127 equivalent Cα positions (Figure 4A), likewise GRESAG4.1 and the C2A domain from rat ACII deviate by 1.64 Å for 134 Cα positions. Hence, the overall structural differences between GRESAG4.1 and the C1A and C2A domains are similar in size to those between the latter two, which superimpose with an r.m.s.d. of 1.3 Å for 153 Cα positions. However, there are two major features that are not present in the structures of mammalian AC domains: first, a 36 amino acid large insertion that forms an α‐helical subdomain between α3 and β4; and secondly, an ordered C‐terminus (GRESAG4.1, T1105–F1122; GRESAG4.3, A1093–R1106) that extends from the end of β9 along a cleft made between the α4β7 loop and the helical subdomain.
The secondary structure elements with most of the conserved residues, β1–β4 and α2, are all located at the front side of the palm domain (Figure 1A). Further conservation is observed in β5 and β6, which form the β‐hairpin motif in both AC structures. The region from β7 to the C‐terminus is located beneath the tAC‐specific α‐helical subdomain on the opposite site of the catalytic core. In this segment, the sequence similarity between tACs and the C1A domain is as low as that for helices α2 and α3 on the back of the AC domain, as these regions are not involved in catalysis.
Although class II ACs are topologically related to membrane‐bound GCs, structure‐based sequence comparisons and calculation of their evolutionary relationships indicate a divergence between GC and AC families that preceded the speciation into different AC classes (Figure 1D). Furthermore, these data suggest a close relationship between tACs and bacterial ACs such as those from mycobacteria or Stigmatella aurantiaca. All of them presumably act as homodimers. These results suggest that the topology of membrane association is not indicative of the evolutionary relationships between cyclase families, as membrane association occurs in phylogenetically distant cyclase families, like the bacterial and mammalian ACs.
Monomeric versus dimeric state of tACs
The catalytic activity of nucleotidyl cyclases was generally found to depend on the dimerization of the AC/GC domains. In both crystal structures of the C2A domain (Zhang et al., 1997b) and the chimeric complex between bovine Gsα, C1A from canine ACV and C2A from rat ACII (Tesmer et al., 1997), these AC domains were only observed as homo‐ and heterodimers. Furthermore, various studies on the guanylate cyclases and class I ACs demonstrated an association of the AC/GC domains that preceded stimulation of these cyclases by effectors (Garbers, 1989; Harteneck et al., 1990; Buechler et al., 1991; Yan et al., 1996). In contrast, all three crystal forms obtained for the tACs GRESAG4.1 and GRESAG4.3 comprise the AC domains in a monomeric instead of a dimeric state. Likewise, none of the crystal contacts that are made by the tAC domains is conserved among the crystal forms.
While the monomeric state is apparently catalytically inactive, biochemical data indicate that in solution the monomeric state of the tAC domains equilibrates dynamically with a dimeric, catalytically active species. Analytical gel filtration experiments with GRESAG4.1‐(A884–T1131) demonstrated the predominance of a dimeric species at low ionic strength (Figure 2A). However, at higher ionic strength, the elution profile was progressively shifted towards the monomeric species, which indicates the occurrence of monomeric and dimeric states at varying ratios in solution. This sensitivity towards salt suggests that the dimerization of tACs significantly depends on polar interactions in the dimer interface. Furthermore, in tAC homodimers, two catalytic sites are postulated along the 2‐fold symmetric dimer interface (Liu et al., 1997), while in class I ACs only one site is formed in the interface due to the asymmetry of the C1C2 heterodimer. The existence of two catalytic sites per tAC homodimer was proven by enforcing heterodimerization between the wild‐type enzyme of GRESAG4.1 and several catalytically inactive mutants that should selectively destroy one or both active sites upon association with a wild‐type AC domain. If added in stoichiometric excess, the addition of the single site mutants D949A or E943A to 2.0 nM GRESAG4.1‐(A884–Y1241) causes a 60% loss of activity, which is consistent with the loss of one active site per wt–mutant heterodimer (Figure 2B). Consequently, when the double mutant D949A/R1053A is added, the activity of GRESAG4.1 drops to <10%, indicating that both active sites of the dimer are functionally impaired (Figure 2B). Finally, if complementary pairs of single site mutants are mixed, a restoration of one active site can be observed per mutant–mutant heterodimer. For example, an equimolar mixture of the D949A and R1053A mutants exhibits a 4‐fold increased specific activity above the expected background level (Figure 2C).
Dimerization is fundamental to AC activity and might be of regulatory importance if the association of intact tACs in the membrane is promoted by effector binding either to the putative extracellular receptor domains or to the cytosolic regions. Such a cytosolic mode of activation was previously shown for the retinal guanylyl cyclase‐1 (retGC‐1) (Olshevskaya et al., 1999; Yu et al., 1999). In this case, activation of retGC‐1 coincides with a transition from a monomeric to a dimeric retGC‐1 species in the membrane when retGC‐1 binds to GC activating proteins (GCAP) at calcium concentrations that are sufficiently low to drive dimerization of GCAP. The weak dimerization tendency of isolated tAC domains as compared with the pronounced pairing of the mammalian C1A and C2A domains makes a similar activation mechanism also attractive for tACs in the membrane context, although suitable activating ligands have yet to be identified in T.brucei. The absence of steric overlaps in a model of the tAC dimer that was assembled on the basis of the C1C2 heterodimer (Figure 3A) indicates that the dimerization of tAC domains proceeds without a requirement for major conformational changes. However, most of the twenty polar interactions that were observed in the C1C2 heterodimer are missing in the putative tAC dimer due to sequence differences. Despite the lack of a conserved dimer interface, the structural element that contributes together with helix α2 most to the dimer interface in class I enzymes, the β5–β6 hairpin (residues D1031–Y1043), is structurally well preserved between tACs and the C1A and C2A domains (Figure 3B). In both tAC structures, this hairpin, also coined an arm region, is well ordered, as indicated by its low B‐factors. The orthogonal kink between the arm region and the adjacent β4 strand and α4 helix is made by two highly conserved glycines, G1027 and G1045, at the N‐ and C‐terminal ends of the arm region. The backbone conformations of the glycines occupy regions in torsional space (G1027, Φ = −176°/Ψ = 170°; G1045, Φ = 106°/Ψ = −163°), which are unfavorable for other amino acids. A further interaction that is conserved between evolutionary distant ACs is the salt bridge between K945 on β2 and D1042 residing on β6 of the arm region (2.8 Å). In class I ACs, this salt bridge becomes occluded in the ventral cavity upon dimerization of the C1A and C2A domains. Here, the corresponding Lys/Glu pair on the C2 domain (K938, D1018) forms hydrogen bonds to the N1 and N6 atoms of the adenine ring of bound substrate (Tesmer et al., 1997). Mutations at these positions relax the substrate specificity of class I ACs (Beuve, 1999), although it is not clear to what extent the precise backbone conformation of the arm domain contributes to substrate specificity. Accordingly, a disruption of the K945–D1042 salt bridge in the K945A mutant of GRESAG4.1 leads to a dramatic loss of activity (Table III). In the absence of other dominant interactions, it might be the binding of the arm region towards the base of the substrate that additionally stabilizes a transient tAC dimerization.
The active site surfaces of tACs
The surface regions that were identified in class I ACs to constitute the catalytic site cover most of the front side of the palm region. In the tAC crystals, these rather polar and flat surfaces are differently engaged in crystal contacts. In crystals of GRESAG4.1‐(A884–T1131), the C‐terminus of the tAC domain (residues T1105–F1122) runs along the active site surface by making several polar interactions with catalytic residues. The binding of the C‐terminal stretch towards the active site is obviously dictated by crystal packing and not due to a putative autoinhibitory function, as the shortened tAC domain of GRESAG4.1‐(A884–D1117) exhibits similar specific activities to that of GRESAG4.1‐(A884–T1131) (data not shown).
Attempts to cocrystallize tACs with nucleotide analogs were unsuccessful due to the strong propensity of tACs to crystallize in inactive, monomeric states. However, a previous model of the substrate ATP in the catalytic site of the C1C2 dimer (Tesmer et al., 1997, 1999) fits nicely into the two catalytic sites of the modeled GRESAG4.1 dimer. The binding of two ATP molecules per tAC dimer occurs without steric interference; the closest approach is 10 Å between the adenine rings while the α‐phosphoryls at which the cyclization reaction occurs are 15 Å distant from each other (Figure 4B). Likewise, the Hill coefficients that correspond only to a slight anticooperativity in GRESAG4.1, GRESAG4.3 and ESAG4 (Table I) indicate almost independent substrate binding by the two catalytic sites.
In the model of the tAC–ATP complex, the terminal γ‐phosphoryl of ATP is stabilized as part of the leaving group by salt bridges to R1022 and R1115. Mutagenesis of R1022, which resides on strand β4, moderately affects the catalytic efficiency of GRESAG4.1; the KM value of the R1022A mutant increases ∼3‐fold (45 μM) while Vmax drops by 59% (1.2 nmol/min/mg) when compared with the Michaelis–Menten parameters of the wild‐type enzyme. A similar contribution to catalysis was shown for the homologous residue R484 in the mammalian C1A domain (Tesmer et al., 1997, 1999). In contrast, the second arginine, R1115, contributes to the stabilization of the γ‐phosphoryl only in tACs. Here, the ordered C‐terminal stretch approaches the active site in such a way that the side‐chain of R1115 is closely positioned at the γ‐phosphoryl binding site while the same region is completely disordered in crystal structures of the mammalian ACs. Interestingly, in the GRESAG4.1 structure, the catalytic site surface harbors a sulfate ion from the crystallization buffer that forms ionic interactions with R1022 (NH2, 3.0 Å; NH1, 3.0 Å) and R1115 (NH1, 3.0 Å; NH1, 3.1 Å) as well (Figure 4C). This sulfate on the surface of monomeric GRESAG4.1 is only 2.0 Å away from the predicted binding site of the γ‐phosphoryl of ATP. Magnesium‐independent binding of sulfate to the active site surface correlates with the observed sensitivities of tAC activities towards dianions like phosphate and sulfate (IC50 for GRESAG4.1‐(A884–T1131): 21 mM for Pi, 53 mM for sulfate). The apparently preformed anion binding site also recruits the dipole moment of helix α1, whose N‐terminus, only 6.3 Å distant, is directed towards the sulfate or γ‐phosphoryl, respectively. The precise alignment of helix α1 is controlled by a highly conserved β‐bulge I907–S910, which connects the C‐terminus of β1 with α1. Interestingly, in DNA polymerases the corresponding helix is likewise orientated by a preceding β‐bulge. Therefore, it is likely that ACs and DNA polymerases share the way by which the leaving group is stabilized during phosphoryl transfer.
Another hallmark in class I ACs and DNA polymerases is two‐metal ion catalysis. According to recent structural and enzymological data, two magnesium ions coordinate both to the active site and the triphosphate moiety of the substrate molecule ATP (Park et al., 1997; Tesmer et al., 1999). In the structure of monomeric GRESAG4.3, a single magnesium ion is bound to two acidic, surface‐exposed residues of the catalytic site, D894 (OD1, 2.5 Å; OD2, 2.9 Å) and D937 (OD1, 2.3 Å); four additional water ligands complete the slightly disturbed octahedral coordination shell around the magnesium ion in the absence of bound substrate (Figure 4D). These two aspartates are highly conserved among ACs and were suggested to support the stabilization of the pentavalent transition state at the α‐phosphate. Mutagenesis of each of these metal ligands in GRESAG4.1 (D906 and D949; see Table III) and class I ACs (Tang et al., 1995) leads to a profound loss of catalytic activity. Accordingly, when superimposing the GRESAG4.3 structure on the C1C2 heterodimer, this magnesium ion is found at the same position as the magnesium ions in the structure of the C1C2 dimer liganded to Mg2+‐pyrophosphate and the product analog 2′‐d‐3′‐AMP.
Taken together, the similar structural features of the active site surfaces (Figure 4A) and the common mutagenesis profile (Table III) indicate that tACs follow an almost identical catalytic mechanism to the related class I ACs. However, despite the appealing similarities in the nucleotide binding sites, tACs and class I ACs strongly differ in their response to so‐called P‐site inhibitors. Mammalian ACs are non‐competitively inhibited by 2′‐deoxy‐adenosine and its 3′‐mono‐ or polyphosphate derivatives. These P‐site inhibitors (Desaubry et al., 1996) synergistically inactivate mammalian ACs by dead‐end inhibition only in the presence of pyrophosphate. Consequently, the apparent affinities of mammalian class I ACs towards P‐site inhibitors are exceptionally high with IC50 values being in the nanomolar range. When GRESAG4.1‐(A884–T1131) was assayed with 2′‐deoxyadenosine and its 3′‐monophosphate analog at concentrations up to 100 μM no significant inhibition was observed, either in the presence or in the absence of PPi. However, PPi alone inhibited GRESAG4.1‐(A884–T1131) with an apparent IC50 value of <1 mM. This observation corroborates an earlier study on intact tACs in T.brucei membrane preparations (Martin et al., 1978), which reported an IC50 value of 135 μM. Insensitivities towards P‐site inhibitors were also described for the bacterial AC from Bordella pertussis and the soluble ACs from bovine sperm. According to structural and biochemical studies on mammalian ACs, P‐site inhibitors occupy the active site together with Mg2+‐pyrophospate (Tesmer et al., 1997) and impair substrate release from the C1C2 heterodimer (Dessauer and Gilman, 1997). Interestingly, the residues bordering the adenine binding site of the tAC homodimer are well conserved. Therefore, the different behavior towards this class of substrate‐like inhibitors is presumably not linked to structural differences in the active site itself, but due to the looser association of tACs.
During the activation of class I ACs, most interactions between the C1C2 heterodimer and Gsα occur in a groove formed by α2 and the α3–β4 loop of the C2 domain. Similarly, in the GCAP‐mediated activation of membrane‐bound GCs, the α3–β4 loop was shown to interact with GCAP (Sokal et al., 1999). In tACs, this region, or Δ‐subdomain due to its deltoid shape, bears the most striking topological difference to other ACs and GCs: 36 amino acids are inserted in the α3–β4 loop (Figures 1A and 5A) where they form two additional helical segments, the 26 Å long helix α3A (N980–E997) and the shorter segment α3B (P1009–L1015). The multiple sequence alignment in Figure 5A shows that this Δ‐subdomain is uniquely conserved in the class II ACs from the parasites T.brucei and L.donovanii, but missing in other class II ACs like the ACG from Dictyostelium discoideum. Parts of the α3A helix are tightly associated with the α1 helix and the preceding β‐bulge, e.g. the side chains of L982 and F989 form a hydrophobic cluster with A912 and L913. Thereby, R1020 on the N‐terminus of strand β4 organizes major parts of the Δ‐subdomain on top of the globular tAC domain by making H‐bonds to Y986 (OH, 3.1 Å), E990 (OE2, 3.0 Å), T1004 (OG1, 3.0 Å) and Y1012 (OH, 2.8 Å). Consequently, when comparing the structures of GRESAG4.1 and GRESAG4.3, the Δ‐subdomain is structurally almost invariant due to its packing on the globular core, several stabilizing interactions inside the Δ‐subdomain and overall low mobility as reflected by low B‐factors. Furthermore, the proximity of the transmembrane segment of GRESAG4.1 (residues F863–P883) and the N‐terminus of the tAC domain sets topological constraints on how membrane‐bound and dimerized tACs position their Δ‐subdomains relative to the membrane context. According to such a model (Figure 5E), the Δ‐subdomain points towards the cytosolic bulk, ∼45 Å away from the membrane surface, where it is easily accessible for interactions with other, regulatory proteins.
d‐DTT binding and allosteric regulation
Together with the α4–β7 loop and parts of β4 and β8, the two helices α3A and α3B enclose an internal cavity that has an overall volume of 95 Å3 as calculated by GRASP (Nicholls, 1992). This cavity (Figure 5B) is mostly hydrophilically walled by backbone atoms; only the side chains of R1109, located on β8, and Q969 on α3 point directly into the center of the cavity. During the refinement of the 1.45 Å structure of GRESAG4.1, electron density came up in the cavity that could be assigned as a single d‐DTT molecule (Figure 5C). The interaction between the tAC domain and DTT is stereospecific, as racemic dl‐DTT was used as reductant during protein purification and crystallization. Most interactions between the hydroxyl and sulfhydryl groups of d‐DTT and the cavity involve main chain atoms: Q969 (O–O1, 2.6 Å; OE1–S1, 2.7 Å), N1017 (O–O2, 2.6 Å; N–S2, 2.8 Å), L1019 (O–O2, 2.7 Å) and V1021 (N–S1, 2.8 Å). Only two H‐bonds are made between d‐DTT and side chains, O1–R1109 NH2 (2.9 Å) and S1–Q969 OE1 (2.7 Å). Furthermore, two water molecules H‐bond to O2 (2.3 Å) and S2 (2.7 Å). In both tAC structures the cavity is completely occluded and without access to bulk solvent. However, in the GRESAG4.3 structure four water molecules are located instead of d‐DTT inside the cavity and H‐bond to R1109, Q969, V1021 and N1017 (Figure 5D). Similarly to structurally well characterized carbohydrate binding sites (Quiocho, 1986, 1993), these waters occupy in this d‐DTT‐free structure almost the same positions as the hydroxyls and sulfhydryls of d‐DTT in the GRESAG4.1–d‐DTT complex.
Functional studies on the precise roles and regulation of membrane‐bound ACs in trypanosomatids have been severely hampered by the presence of multiple, highly homologous class II ACs in these parasitic organisms. For example, at least six orthologous gene copies per cell were found for ESAG4, which is specifically expressed in the bloodstream form by its association with the variable surface glycoprotein (VSG) expression sites. Likewise, the tAC GRESAG4.1 is encoded by nine orthologs. These numbers represent only lower limits until the complete genome sequence of T.brucei has been revealed. Together with the mutagenesis data, the structural analysis of two isozymes, GRESAG4.1 and GRESAG4.3, confirmed that tACs structurally and mechanistically resemble class I enzymes but comprise an additional, unique structural motif, the Δ‐subdomain. The topological location of this subdomain in a region where other nucleotidyl cyclases like the mammalian class I ACs or the membrane‐bound GCs bind their regulatory cofactors suggests a regulatory role for the Δ‐subdomain. Furthermore, the observed structural invariance of the Δ‐subdomain and the strict conservation of residues that line the binding pocket for d‐DTT indicate that this domain serves equivalent functions in the different isozymes of trypanosomatids and leishmanial species. As the observed binding of d‐DTT might be a case of molecular mimicry, one might speculate that the binding pocket recognizes other, small molecule compounds that exert a unique allosteric control on tAC activity. For example, the cAMP‐triggered transition between the proliferative, long slender form and the non‐proliferative short stumpy form was found to be induced by a soluble activity released by bloodstream trypanosomes, which was termed SIF (stumpy induction factor) (Vassella et al., 1997). Alternatively, the Δ‐subdomain could bind in trypanosomes a yet unidentified proteinaceous cofactor that affects dimerization of the tAC domains similarly to the retGC‐1/GCAP system.
Interestingly, product inhibition of tAC activity by PPi itself might be a primary regulatory pathway for these ACs. Recent 31P‐NMR studies showed that PPi is more abundant than ATP in T.cruzi (Urbina et al., 1999) where it apparently partitions both to the cytosol and the acidocalcisomes, a specialized vacuolous compartment of trypanosomatids. At least some trypanosomal enzymes use PPi instead of ATP as an energy source (Bringaud et al., 1998; Rodrigues et al., 1999), and it might be falling cytosolic PPi concentrations that cause transient tAC activation during the different metabolic states of the parasite. In this scenario, the high propensity of tACs to adopt a monomeric state might make them especially sensitive to PPi‐mediated inhibition. AC activity that is solely under metabolic control would not be unique to tACs and has been described for several other ACs: soluble ACs from mammalian testis and cyanobacteria are activated by bicarbonate (Chen et al., 2000), while a bacterial AC from Brevibacterium liquefaciens was found to be stimulated by the glycolysis product pyruvate (Peters et al., 1991).
Materials and methods
Plasmid construction, site‐directed mutagenesis and protein expression
Gene fragments that code for the cytosolic regions of GRESAG4.3‐(L867–V1229) and ESAG4‐(P897–R1269) were amplified from genomic DNA of the T.brucei strain 927 using Takara Taq polymerase (BioWhittaker Europe) and 30 cycles at 30s/94°C, 2 min/58°C and 5 min/72°C. For amplification, the primer pairs 5′‐CTCGCTCGAGCA CAAATAAAGGGG and 5′‐GCGGATCCGCTAGCGGACTTGCGTGC TTCACTCTC were used for the GRESAG4.3 gene and 5′‐CTC GCTCGAGCTATTATCACGTAGTGGCGCTGTCATTTTCCAAGTC and 5′‐GCGGATCCCATATGTGACACCGTCAATGACTTTCAG for ESAG4 (restriction sites underlined). The DNA fragment of GRESAG4.3 (1.09 kbp) was digested with NheI and XhoI, and the 1.11 kbp fragment of ESAG4 with NdeI and XhoI and ligated into the expression vector pET‐28a (Invitrogen) using standard cloning techniques. In comparison with the SWISSPROT entries CY43_TRYBB and CYA4_TRYBB from T.brucei strain AnTat1.3, several sequence differences were found for the coding sequences that were confirmed by repeated PCR amplifications and sequencing of both strands. The GRESAG4.3 sequence (SWISSPROT entry CY43_TRYBB) shows the sequence exchanges R959C, H994S, M995L, N1098S, C1164W and G1165R, while the cloned ESAG4 gene (CYA4_TRYBB) has the exchanges K937M, C969R, Q990R, S1047P and R1101Q. The exchanges probably reflect sequence heterogeneity for these multigenic isozymes in T.brucei; none of the exchanges is observed at positions that are crucial for catalytic activity.
Site‐directed mutagenesis of the GRESAG4.1‐(A884–Y1241) gene fragment was carried out by fusion PCR (Ho et al., 1989) using Takara Taq polymerase (BioWhittaker Europe) and 30 cycles at 30s/94°C, 2 min/58°C and 5 min/72°C with pET‐gres4.1cyto (Bieger and Essen, 2000) as template. Accordingly, the double mutant D949A/R1053A was generated with the plasmid pET‐gres4.1cyto(D949A) as template. The mutated gene fragments of GRESAG4.1 were digested with NheI and XhoI and cloned into pET‐28a.
The plasmids pET‐gres4.1cyto, pET‐gres4.3cyto and pET‐es4cyto were transformed into E.coli BL21(DE3) cells. Protein expression, renaturation of the tAC fragments from inclusion bodies by dialysis and further purification were carried out using the previously described protocol for GRESAG4.1‐(A884–Y1241) (Bieger and Essen, 2000).
Limited proteolysis and mass spectrometry
To digest the cytosolic fragments of GRESAG4.1, GRESAG4.3 and ESAG4 down to the AC domains the recombinant isozymes were incubated for 30 min at 4°C at a concentration of 0.5 mg/ml with 0.05 mg/ml subtilisin [GRESAG4.1‐(A884–Y1241)] and 0.05 mg/ml proteinase K [GRESAG4.3‐(A867–V1229) and ESAG4‐(P897–R1269)]. The cleavage reaction was carried out in 20 mM Tris–HCl pH 8.0 as reaction buffer and stopped by adding 10 mM phenylmethylsulfonyl fluoride. The digestion reaction yielded for all three tACs a predominant product of ∼30 kDa mass. The digestion products were loaded onto a pre‐equilibrated Resource Q column (Pharmacia). The column was washed with 5 vols of the reaction buffer and the AC domains of the three tAC isozymes were eluted with 60 ml of a linear NaCl gradient (0–600 mM). The eluate was concentrated and loaded onto a Superdex 75 gel filtration column (Pharmacia) in 20 mM Tris–HCl pH 8.0, 0.6 M NaCl.
The purified AC domain of GRESAG4.1 was N‐terminally sequenced by Edman degradation and analyzed on an electrospray mass spectrometer (ABI) to determine its molecular mass and the location of its N‐ and C‐termini.
Crystallization, data collection and structure determination
The tAC GRESAG4.1‐(A884–T1131) was crystallized in two forms, A and B, as previously described (Bieger and Essen, 2000). Crystals of GRESAG4.3‐(A867–T1118) were grown in hanging drops using vapor diffusion at 18°C by mixing 1 μl of protein solution (6 mg/ml) in 5 mM Tris–HCl pH 8.0 with 1 μl of 25% (w/v) PEG4000, 0.2 M MgCl2 pH 7.0 as crystallization buffer. After 2 days, the tetragonal crystals were grown to final size and the PEG4000 concentration was increased to 35% (w/v) in the reservoir.
X‐ray data were collected at 100 K from GRESAG4.1‐(A884–T1131) frozen in 2.0 M ammonium sulfate pH 7.0, 20% (v/v) glycerol. For the GRESAG4.3‐(A867–T1118) crystals, the mother liquor was used as cryo‐protection buffer. Heavy metal derivatives of crystal form A of GRESAG4.1‐(A884–T1131) were prepared by incubating the crystals in the cryo buffer with 10 mM of the heavy metal salt for 4–8 h. Diffraction data were collected on a Rigaku RU‐200 CuKα rotating anode with focusing mirrors (Charles Supper) and a 30 cm MAR research imaging plate. Synchrotron data were collected from frozen crystals at beamlines BW6 and BW7B, HASYLAB, Hamburg, using a MAR CCD detector and wavelengths of 1.07 and 1.00 Å, respectively.
Data for crystal form A of GRESAG4.1‐(A884–T1131) were indexed and integrated with DENZO (HKL research) in space group P212121 (a = 49.9 Å, b = 60.1 Å, c = 79.7 Å) up to 1.45 Å resolution. The crystals have one monomer per asymmetric unit with 30% solvent content. Likewise, crystallographic data of GRESAG4.3‐(A867–T1118) were collected at 1.9 Å resolution and indexed and integrated with MOSFLM in space group I41 (a = 89.7 Å, c = 67.1 Å, solvent content 35%, with one monomer per asymmetric unit).
The structure of GRESAG4.1‐(A884–T1131) was solved by MIR using an initial native 1.9 Å dataset collected at beamline BW6. After phase refinement with SHARP (La Fortelle and Bricogne, 1997) and SOLOMON (Abrahams and Leslie, 1996), a 1.9 Å MIR map was obtained that allowed semi‐automatic tracing and model building by WARP (Lamzin and Wilson, 1997). Further refinement was carried out using CNS 0.9 (Brunger et al., 1997) with a native 1.46 Å dataset that was recollected at beamline BW7B. The refinement converged at an R‐factor/R‐free of 0.178/0.205 for data between 10.0 and 1.45 Å (Table I). The structure of the GRESAG4.3 AC domain was solved by molecular replacement (MR) using AMoRe (CCP4, 1994) and the structure of GRESAG4.1‐(A884–T1131) as search model. Refinement with the native data set proceeded to a final R‐factor/R‐free of 0.200/0.229 for data between 10 and 1.9 Å.
All tAC models exhibited good stereochemistry as analyzed by PROCHECK (Laskowski et al., 1993) while the following side chains were missing or only partially defined in electron density maps: E1036, V1037, R1092 (CD‐NH2), D1116 (CB‐OD2) and E1119 (CB‐OE2) for GRESAG4.1, and E985 (CG‐OE2), E987 (CB‐OE2), K1027 (CG‐NZ) and R1080 (CG‐NH2) for GRESAG4.3. Coordinates and structure factors of GRESAG4.1 and GRESAG4.3 have been deposited in the RCSB Protein Data Bank under accession codes 1FX2 and 1FX4.
The authors are very grateful for the encouragement and enduring support of D.Oesterhelt, the valuable comments of D.Griffith, the mutagenesis work of U.Heider, and the assistance of H.Bartunik at synchrotron beamline BW6, MPG‐ASMB, Hamburg, and W.Burmeister at ID14‐3, ESRF, Grenoble. This work is supported by a predoctoral fellowship (B.B.) of the Fonds der Chemischen Industrie.
- Copyright © 2001 European Molecular Biology Organization