Background

While cGMP is increasingly accepted as an important signaling component in many plant responses e.g. [13], it is perhaps astonishing that the discovery and functional characterization of GCs in higher plants is only just beginning, particularly so since in single celled green alga Chlamydomonas reinhardtii there are > 90 annotated nucleotide cyclases (NCs) that come in > 20 different domain combinations with 13 different domain partners [4]. The structural diversity and complexity of molecules with NC activity [46] are one likely reason why BLAST searches with known NCs from lower and higher eukaryotes did not yield candidate molecules in higher plants.

Search strategies based on conserved and functionally assigned amino acid (AA) residues in the catalytic center of known NCs [7] have now opened the way to a systematic search of NCs in higher plants and has led to the discovery of a number of Arabidopsis thaliana candidate molecules with catalytic activity in vitro and in vivo. These molecules include a wall-associated kinase like protein (AtWAKL10) with a role in defense [8], the brassinosteroid receptor (AtBRI1) [9], the Pep1 receptor (AtPepR1) [10] and the phytosulfokine receptor (AtPSKR) [11] as well as a nitric oxide-binding GC (AtNOGC1) [12]. PSKR belongs to a family of NCs that contains the GC catalytic center embedded within the intracellular kinase domain of leucine rich repeat receptor-like molecules and in in vitro experiments we have demonstrated that both the kinase and the GC domain have catalytic activity. Importantly, the natural ligands for both the PSKR and BRI1 receptors increase intracellular cGMP levels in isolated mesophyll protoplast assays suggesting that the GC activity is functionally relevant in planta[6, 11].

Presentation of the hypothesis

We propose that in addition to the six characterized GCs in Arabidopsis thaliana to-date (AtGC1, AtNOGC1, AtPSKR1, AtPEPR1, AtBRI1 and AtWAKL10), higher plants harbor a substantially larger number of GCs that remain to be discovered. This hypothesis is based on the fact that the tested GCs share a distinct AA signature in the catalytic center (Figure 1A) where the AA at position 1 forms the hydrogen bond with the guanine, the residue in position 3 confers substrate specificity for GTP while the AA in position 14 stabilizes the transition state from GTP to cGMP and two or three AAs away from the C-terminal end of the motif is the residue that interacts with the Mg2+/Mn2+ ions (Figure 1B). This motif based on tested GCs (Figure 1B) identifies 41 novel Arabidopsis candidate GCs (see Additional file 1) (http://www.arabidopsis.org/cgi-bin/patmatch/nph-patmatch.pl) [13] and no proteins in kingdoms other than the Viridiplantae. However, we do not predict that all retrieved candidate GCs will have activity in vitro and/or in vivo, nor do we exclude that molecules that do not contain the motif function as GCs.

Figure 1
figure 1

Construction of a GC search motif based on the catalytic center of characterized GCs in Arabidopsis thaliana . The GC catalytic centers in the alignment of all previously characterized GCs in Arabidopsis (A), is represented in the red box while the blue box implicates binding with Mg2+ or Mn2+ ions. A 14 amino acid long GC search motif (B) was built by including all the residues present in each position of the catalytic center. The amino acid substitutions are in square brackets ([ ]), “X” stands for any amino acid and the gap size is marked in curly brackets ({ }). Underlined amino acids have been added to the motif because of their chemical similarity to the amino acid at this position. Amino acids highlighted in red are functionally assigned residues; amino acid in position 1 forms the hydrogen bond with the guanine, amino acid in position 3 confers substrate specificity for GTP while amino acid in position 14 stabilizes the transition state from GTP to cGMP. The identification and assignment of functions to key residues of the GC motif are detailed in [7].

Testing the hypothesis

Firstly, the hypothesis can be tested by using computational approaches to predict the structural properties of candidate GCs and testing should include automated substrate (GTP) docking protocols. To obtain insights on the structural features of the GC catalytic centers, 3D models can be constructed using readily available crystal structures as templates. A high sequence similarity between subject and template will generate accurate 3D models. While sequence similarity of 25% is sufficient for structural predictions, we recommend selecting templates with a BLAST alignment score of at least 50–80 (color key: green). For example, a 3D structure of the GC region of AtPSKR1 built using homology modeling techniques revealed that the GC catalytic center is embedded within a cavity, presumably providing ideal steric interactions for substrate docking. While selecting template structures with the highest sequence similarity to the candidate GCs is the common practice, we would recommend to also evaluate the candidate proteins against a known GC template (e.g. crystal structure of a bacterial or human GC) since many plant candidate GCs are embedded within kinases, and modeling against a kinase template may not reflect the configuration of an activated candidate GC. The reason is that dual-activity enzymes such as AtPSKR1 do not assume concurrent activation states for both the kinase and GC catalytic domains, and it is likely that a molecular switch (e.g. Ca2+ and/or dimerization with another molecule) is required to shift from kinase to GC activation. In the example here, when the AtPSKR1 molecule is in the kinase state the GC catalytic center is partially buried. In turn, when the molecule is in the GC configuration, the GC catalytic center is completely exposed and GTP can successfully dock (as predicted by AutoDock Vina) (Figure 2). GTP docking at the PSKR1 GC center in the correct orientation favorable for interactions with the key residues within the cavity is represented in Additional file 2: Figure 3A. Functionally assigned residues of the GC motif can be replaced with another amino acid to estimate the importance or relevance of these residues in maintaining a functional configuration of the catalytic center. For example, when one or more key residues in the AtPSKR1 GC domain is replaced with leucine, GTP is more likely to fail to dock or dock in the wrong orientation, implying compromised or abolished GC activity (Figure 3B-D). In previously characterized Arabidopsis GCs, GTP docking is also disrupted when key residues are replaced (Table 1). We note that these computational methods alone are not diagnostic of GC activity and do not distinguish GCs from other enzymes that also catalyze GTP (e.g. GTPases). They however lend good support to the experimental data and can be used as an initial screen to assist in the selection of candidate molecules from a potentially large pool of proteins for subsequent in vitro and/or in vivo enzymatic functional assays.

Figure 2
figure 2

Computational assessment of the predicted activation states of AtPSKR1 kinase and GC catalytic centers. AtPSKR1-kinase domain (Phe734 – Val1008) was modeled against the AvrPtoB-BAK1 complex (PDB entry: 3TL8) (A) and against the bacteria GC Cya2 (PDB entry: 2W01) (B) using the Modeller (ver. 9.10) software [14], and GTP docking experiments performed using AutoDock Vina (ver. 1.1.2) [15]. When AtPSKR1 assumes the kinase activation state (A), the GC catalytic center is partially covered by a nine amino acid long ‘latch’ that may be loosened by a molecular switch (e.g. Ca2+) or by forming homo- or hetero-dimers. When AtPSKR1 assumes a GC configuration (B), the GC catalytic center is completely exposed and assessable to GTP. In addition, automated docking experiments suggest feasible GTP docking. The GC catalytic center is in yellow, functionally assigned residues in the GC motif are in cyan, amino acids implicated in metal binding are in green, the ATP-binding site is in orange and the nine amino acid long ‘latch’ partially covering the GC catalytic center is in magenta.

Figure 3
figure 3

Assessment of the AtPSKR1 GC catalytic center by molecular docking of GTP. According to the 3D model of AtPSKR1, the GC domain is embedded within a cavity with the key residues for GTP interaction and the metal binding residue highlighted in yellow and cyan respectively. The GTP substrate docks successfully with the guanine portion at the inner-most of the cavity for interaction with the serine residue (position 1) and the phosphate end pointing outwards of the cavity towards the arginine residue (position 14) - an orientation deemed suitable for GC activity (A). When one or more key amino acid residues at position 1, 3 and/or 14 was replaced with leucine, docking experiments indicate disruptions in the docking orientations of GTP in all except for the leucine replacements at positions 1 and 3 of the GC domain. This suggests an aberrant GC catalytic activity when these key residues are replaced. GTP docking results for AtPSKR1 GC domain with leucine replacements at position 14 (B), positions 1 and 3 (C), and positions 1, 3 and 14 (D) are as illustrated. AtPSKR1-GCD (Asn871 – Glu980) was modeled against the AvrPtoB-BAK1 complex (PDB entry: 3TL8) using the Modeller (ver. 9.10) software [14] while GTP docking experiments were performed using AutoDock Vina (ver. 1.1.2) [15]. The full-length AtPSKR1 protein and the domains organization are as shown in Additional file 2.

Table 1 Molecular docking of GTP with the GC catalytic center

Secondly, the candidate GCs need to be tested in vitro by incubating the recombinant protein harboring the GC domain with GTP and the appropriate metal ions (Mg2+ and/or Mn2+). This recombinant protein can be made by molecular cloning methods with the DNA construct expressed in an E. coli host system and affinity purified for the following in vitro GC activity testing. GC enzymatic reaction is initiated by incubating the purified recombinant protein in buffer containing the aforementioned ingredients. cGMP generation can then be measured using commercially available cGMP immunoassay kits which will provide an indication of GC activity. Cyclic GMP production should be further verified using mass spectrometry based techniques that consistently record higher cGMP amounts in independent in vitro experiments than those obtained with ELISA-based assays [9]. The substantially lower in vitro activities of plant GCs compared to animal GCs [5] have raised concerns regarding (1) the folding and structural integrity of the recombinant plant GCs and (2) the reliability of the in vitro recombinant GC activity assays to detect such low amounts of cGMP [16, 17]. We therefore recommend that candidate GCs be evaluated with both the biochemical assays and the more sensitive mass spectrometric methods.

Thirdly, candidate GC candidates should also be studied in vivo[18] with fluorescent cGMP-reporter assays, since this is a direct way to link ligand–binding to receptor-coupled GCs, and to the generation of cGMP and cGMP-dependent downstream effects.

Implications of the hypothesis

Currently, we know that a growing number of fundamental physiological processes, including gating of ion channels [19], specific phosphorylation events [18], post-translational modifications [20], stomatal guard cell movements and responses to hormones [18, 21] all depending, at least in part, on cGMP. Consequently, a major outstanding question is, where are the enzymes that catalyze the reaction from GTP to cGMP, how many are there in e.g. Arabidopsis, and how are they regulated? If our hypothesis proves right, and novel multi-domain enzymes with stimulus- and/or ligand-specific GC activity will be discovered, we will be able to finally unravel the complex signal transduction networks that link environmental stimuli to cGMP-dependent responses in plants such as chloroplast development and anthocyanin synthesis [22, 23]. Detailed analysis of cGMP-dependent responses will have to include a genetic and molecular analysis of transgenic plants with overexpressing or knocked-down of candidate GCs as well as transcriptomics studies that reveal further aspects and the extent to which cGMP modulates down-stream effects. In addition, and given the importance of cGMP-dependent phosphorylation, we would argue that comparative phospho-proteomics of wild type and GC mutants will provide a systems view of specific phosphorylation cascades that are induced by the activation of target GCs. Taken together, we predict, that if our hypothesis is true and the candidate GCs will be analyzed in considerable depth, it will establish cGMP, much like cytosolic free Ca2+, as a key second messenger in plant responses.