Abstract
Leucine rich repeats (LRRs) are present in over 100,000 proteins from viruses to eukaryotes. The LRRs are 20–30 residues long and occur in tandem. LRRs form parallel stacks of short β-strands and then assume a super helical arrangement called a solenoid structure. Individual LRRs are separated into highly conserved segment (HCS) with the consensus of LxxLxLxxNxL and variable segment (VS). Eight classes have been recognized. Bacterial LRRs are short and characterized by two prolines in the VS; the consensus is xxLPxLPxx with Nine residues (N-subtype) and xxLPxxLPxx with Ten residues (T-subtype). Bacterial LRRs are contained in type III secretion system effectors such as YopM, IpaH3/9.8, SspH1/2, and SlrP from bacteria. Some LRRs in decorin, fribromodulin, TLR8/9, and FLRT2/3 from vertebrate also contain the motifs. In order to understand structural features of bacterial LRRs, we performed both secondary structures assignments using four programs—DSSP-PPII, PROSS, SEGNO, and XTLSSTR—and HELFIT analyses (calculating helix axis, pitch, radius, residues per turn, and handedness), based on the atomic coordinates of their crystal structures. The N-subtype VS adopts a left handed polyproline II helix (PPII) with four, five or six residues and a type I β-turn at the C-terminal side. Thus, the N-subtype is characterized by a super secondary structure consisting of a PPII and a β-turn. In contrast, the T-subtype VS prefers two separate PPIIs with two or three and two residues. The HELFIT analysis indicates that the type I β-turn is a right handed helix. The HELFIT analysis determines three unit vectors of the helix axes of PPII (P), β-turn (B), and LRR domain (A). Three structural parameters using these three helix axes are suggested to characterize the super secondary structure and the LRR domain.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Leucine rich repeats (LRRs) are unusually rich in leucine [1,2,3,4,5,6]. The LRRs are composed of 20–30 residues stretches and repeat in tandem. The published repeat numbers range from 2 to 97. LRRs have been reported in over 100,000 proteins from viruses to eukaryotes.
LRR units are divided into a highly conserved segment (HCS) and a variable segment (VS) [1]. Eight classes of LRRs have been recognized [3]. Matsushima and Kretsinger recently proposed twenty-three types of LRRs [1]. Their grouping is based mainly on the difference of the VS parts. The eight classes are RI-like, cysteine containing (CC), SDS22-like, IRREKO, bacterial, plant specific, typical, and TpLRR.
The HCS part consists of an 11 or 12 residue stretch, LxxLxLxx(N/C)(x/-)L, in which “L” is Leu, Ile, Val, or Phe, “N” is Asn, Thr, Ser, or Cys, “C” is Cys, Ser or Asn, “x” is any amino acid, and “−” is a deletion. Three residues at positions 3–5 in the underlined residues form a short β-strand [4, 6]. These β-strands stack parallel; they have the pattern of H-bonding (N–H → O=C), and then these tandem repeats of LRRs assume their super helical arrangements. The LRRs fold into a horse shoe, a right handed or left handed helix, or a prism shape [7]. Conserved hydrophobic residues such as leucine, valine, isoleucine, or phenylalanine in the consensus sequences of LRRs contribute to the hydrophobic cores. Capping structures that shield the hydrophobic core of the first LRR unit at the N-terminus and/or the last unit at the C-terminus are observed in most of the known LRR structures [1, 5].
Characteristic of each LRR class, the VS parts adopt a variety of secondary structures including the α-helix, 310-helix, and an extended conformation or a tandem arrangement of β-turns [1, 6]. Their secondary structures on the convex side are connected to the strands forming the β-sheet on its concave side by two loops [5]. One of the loops is an “ascending loop” which links the C-terminus of the HCS to the N-terminus of the VS. The other is a “descending loop” which links the C-terminus of the VS to the N-terminus of the HCS of the following unit. Each LRR domain contains a concave surface, a convex surface, an ascending surface, and a descending surface on the opposite side. LRR domains are involved in direct interaction with proteins (including hormones) or ligands (including nucleic acid, lipid, lipo-polysaccharide, and plant steroid hormones) [1]. LRR domains can engage structurally various proteins or ligands using different surfaces of the LRR domains [1, 5].
LRR proteins participate in the plant immune response and in the mammalian innate immune response [1,2,3,4,5,6]. They are also involved in a broad range of functions including apoptosis, autophagy, ubiquitin related processes, nuclear mRNA transport, and neuronal development [1, 8]. Plant LRR proteins, many of which involve kinases and other receptor like proteins, act as signal amplifiers in tissue damage, in symbiotic relationships, and in developmental processes [1, 9].
Furthermore, LRR proteins are contained in the type III secretion system of many gram-negative bacterial pathogens. The LRR proteins called effectors are delivered into the cytosol of animal or plant cells [10]. Consequently, these effectors enable the bacteria to avoid the immune response of the infected organism by modulating cell functions of the host. The effector proteins include YopM from the bubonic plague bacterium, Yersinia pestis, and SspH1, SspH2, and SlrP from Salmonella enterica, and IpaH3 and IpaH9.8 from Shigella flexneri. These effectors are bacterial LRR proteins [1]. The LRR domain of SspH1 directly interacts with PKN1 [11].
Bacterial LRRs are characterized by two Leu Pro sequences in the VS; the consensus is xxLPxLPxx with Nine residues (N-subtype) and xxLPxxLPxx with Ten residues (T-subtype) where “L” is Leu, Val, or Ile and “x” is predominantly occupied by small residues such as Thr, Ser, or Gly. T-subtype is seen in Salmonella SlrP. Moreover, LRRs in the subfamily of toll-like receptors (TLR7, TLR8, and TLR9), the small leucine rich repeat proteoglycan (SLRP) family including fibromodulin, decorin and biglycan, and the fibronectin leucine rich repeat transmembrane family (FLRT) contains N- or T-subtype [1]. The LRRs consist of tandem repeats of a super motif of STT or ST in which “S” is Bacterial and “T” is Typical [12,13,14]. We called this the STT class [1].
PPIIs are known to be observed frequently in proline rich regions [15,16,17,18,19]. The PPIIs are characterized by the backbone dihedral angles (Φ, Ψ) of (− 75°, 145°) [20,21,22,23,24,25,26,27,28,29]. The PPIIs have helical parameters: 2.9 residues per turn, a pitch value of 8.7 Å per turn, and a helix radius of 1.33 Å. The assignment of PPII is not done in the widely used programs such as DSSP [28] and STRIDE [29]. Consequently, PPIIs in newly solved protein structures are not registered in protein data bank (PDB) [30]. Now there are some tools for assigning PPII number—DSSP-PPII [31], PROSS [32], SEGNO [33], XTLSSTR [34], and ASSP [35].
Super secondary structures with several adjacent elements of a secondary structure are also observed in protein structures [36, 37]. Examples include β-hairpins, α-helix hairpins, and β–α–β motifs. Adzhubei and Sternburg [38] identified super secondary structures consisting of PPII and α-helix and of 310-helix and PPII. Kumar and Bansal [27] also identified those consisting of β-strand and PPII, of β-strand, PPII, and α-helix, and of β-strand, PPII, and β-strand.
Evdokimov et al. [39] noted that the VS parts in the YopM LRRs adopt 310-helices. Matsushima et al. [6, 7], Bella et al. [5], and Park et al. [40] proposed that the VS parts in the bacterial LRR adopt left handed polyproline II helices (PPII). A review article by Adzhubei et al. [41] noted PPIIs in LRRs. However, it appears that PPII in LRR structures has not yet been well characterized based on the consensus sequence. Structural data of proteins containing bacterial LRRs have increased. The crystal structures of YopM, SspH1, SspH2, SlrP, IpaH3, and IpaH9.8 have been determined [39, 42,43,44,45,46]. The structures of TLR8, TLR9, fibromodulin, decorin, biglycan, FLRT2, and FLRT3 are also available [47,48,49,50,51,52,53,54,55,56].
The purpose of this study is to understand structural features of bacterial LRRs. We performed both the secondary structures analyses using secondary structures prediction programs of DSSP-PPII, PROSS, SEGNO, and XTLSSTR and the HELFIT analyses that calculate helix axis, helix pitch, helix radius, repeat/residue number per turn, based on the atomic coordinates of the crystal structures [57].
This present analysis demonstrates that the N-subtype VS adopts PPII consisting of 4–6 residues and type I β-turn at the C-terminal side. Thus, the VS part is characterized by super secondary structure consisting of PPII and a β-turn. In contrast, the T-subtype VS frequently prefers two separate PPIIs consisting of two or three and of two residues. The HELFIT analysis indicates that the type I β-turn is a right handed helix and consequently determines three unit vectors of the helix axes of PPII (P), β-turn (B), and LRR domain (A). We propose three structural parameters which are two angles between the two helix axes of PPII and β-turn, between the two helix axes of PPII and LRR domain, and between the helix axis of LRR domain and the vector product of P × B. These three angles are suggested to characterize the super secondary structure and the LRR domain.
2 Methods
2.1 Structure Data
We collected the structure data of proteins containing bacterial LRRs from the PDB. We performed sequence alignments in LRR proteins from the PDB by LRRpred that recognizes and aligns LRR motifs that predict the repeat number and “phasing” of LRRs with greater reliability [58] and identified bacterial LRR based on the consensus sequence. Bacteria LRR proteins are YopM, SspH1, SspH2, SlrP, IpaH3, IpaH9.8, TLR8, TLR9, fibromodulin, decorin, biglycan, FLRT2, and FLRT3 (Table 1) [39, 42,43,44,45,46,47,48,49,50,51,52,53,54,55,56]. LRRs in YopM, SspH1, SspH2, SlrP, IpaH3, and IpaH9.8 belong to bacterial LRR class, while TLR8, TLR9, fibromodulin, decorin, biglycan, FLRT2, and FLRT3 belong to the STT class (Table 1). Eighteen PDB files solved at resolution ≤ 3.4 Å were used; the sequence identity of the 18 different chains shows that the maximum is 48% and the average is 7% (Supplementary Table S1). The structure data of mouse FLRT2 at resolution 4.0 and 6.0 Å were not used for analyses.
2.2 Secondary Structures Analysis
Secondary structures assignments were made from the atomic coordinates of the LRR structures using four programs—DSSP-PPII [31], PROSS [32], SEGNO [33], and XTLSSTR [34]. The assignment of the DSSP program is based on the identification of precise hydrogen bond patterns corresponding to regular secondary structures [28]. In DSSP-PPII based on DSSP, PPII are assigned solely in the coil region for at least two consecutive residues in coil with Φ = − 75° ± 29° and Ψ = +145° ± 29°. The PROSS program assigns secondary structures, based mainly on Φ and Ψ dihedral angles. SEGNO utilizes the Φ and Ψ dihedral angles coupled with other angles. XTLSSTR uses two angles and three distances. DSSP-PPII, PROSS, and XTLSSTR assign β-turns; while SEGNO does not. The secondary assignments were performed using the PolyprOnline web interface [59]. Types of β-turn were also identified by the programs of PROMOTIF [60] and STRIDE [61]. Furthermore, the root-mean-square deviation (RMSD) of the VS part between within the N-subtype and within the T-subtype and between these two subtypes using the coordinates of the backbone atoms of each residue were evaluated by the CHIMERA program [62].
2.3 HELFIT Analysis
We have developed a total least squares program for fitting a helix to data points—HELFIT [57]. A helix consisting of n repeat units may be characterized by helix axis, helix pitch (P), helix radius (R), and number of repeat units/residue per turn (N). HELFIT computes these parameters in which the helix axis is represented by the unit vector. These parameters also yield the rise per repeat unit/residue (Δz = P/N) and the rotation per repeat unit/residue in the helix (ΔΦ = 360°/N). Moreover, HELFIT gives rmsd: where di is the closest distance from the data point to the trace of the helix.
Here p = rmsd / (n − 1)1/2 gives the regularity of helical structures independent of the number of data points or helix length. The criterion for regular PPII helices is p ≤ 0.10 Å. This same test is used for α-helices, ω-helices, and 310-helices in proteins [63, 64]. The HELFIT analysis requires only four data points: the coordinates of α-carbon (Cα) of each residue. LRRs form a β-strand of three residues at positions 3–5 in the HCS part. Thus, in LRRs, the Cα coordinates of the consensus leucine residue at position four in individual LRR repeat units are used. The repeat number of individual LRR domains was defined as the number that participates in the parallel β-sheet. This definition means that the first LRR is sometimes contained in the capping structures. β-Turns consist of four amino acid residues (labelled i, i + 1, i + 2, and i + 3). We also estimate the helical parameters of β-turns using the Cα coordinates of each residue.
The HELFIT analysis indicates that the β-turn is regarded as a right handed helix, as noted later. Consequently, HELFIT determines three unit vectors of the helix axes of LRR domain (A), PPII (P), and type I β-turn (B). We estimate three structural parameters. One is the angle between the two helix axes of PPII and β-turn (Ω1). The second is the angle between the two helix axes of PPII and LRR domain (Ω2). The third is the angle between the helix axis of LRR domain and the vector product of P × B. (Ω3). The three angles of Ω1, Ω2, and Ω3 are represented by the following equations.
3 Results
3.1 Two Subtypes of Consensus Sequences
Bacterial LRR is 20 or 21 residues long and is classified into two subtypes. The N-subtype has the VS consensus of xxLPxLPxx with nine residues and the T-subtype has xxLPxxLPxx with ten residues where “L” is Leu, Val, or Ile and “x” is predominantly occupied by small residues such as Thr, Ser, or Gly (Fig. 1). Thus bacterial LRR is characterized by two Leu Pro sequences in the VS parts; although variable VS that lack one of the two conserved prolines is observed. The characteristics are not seen in other LRR classes. For examples, ribonuclease inhibitors and Nod-like receptors contain RI-like LRRs which of the consensus is LxxLxLxx(N/C)xLxxxgoxxLxxoLxxzxxx with typically 28 or 29 residues [65].
The N-subtype appears sixty-three times in the known structures (Table 1). The VS consensus is xxLPxLPxx, as expected, in which “x” positions at the N- and C-terminal sides are frequently occupied by relatively small residues such as Thr or Ser; while the central “x” position is rich in Glu. Fifty-two of the sixty-three VS are completely consistent with the consensus (Fig. 1a). The remaining VSs are xxLxxLPxx in YopM; the conserved Pro at position four is replaced by Cys or Ser. The T-subtype appears forty-three times (Table 1). The conserved Leu at position three in the VS consensus is frequently occupied by other hydrophobic residues such as Val or Ile (Fig. 1b).
3.2 Secondary Structures
The assignment of PPII patterns differs among the four programs. Bacterial LRR proteins form not only monomers but also homo-dimers, -tetramers, and -hexamers in crystals (Table 1). The PPII patterns assigned also differ among their individual molecules. We therefore analyzed all chains of the known structures.
The four programs for secondary structures assignment indicate that the HCS parts adopt short β-strands in three underlined residues of LxxLxLxxNxL. In addition, the assignment indicates that the VS parts are rich in PPII conformations. However, the secondary structures show a difference between the two subtypes.
At least one of the four programs of secondary structures assignment indicates that the N-subtype VS adopts PPIIs consisting of four, five or six residues (Fig. 2a; Supplementary Table S2). Four, five or six residue PPIIs are observed in the underlined residues of xxLPxLPxx, xxLPxLPxx, and xxLPxLPxx, respectively. For example, all four program assign PPII in underlined residues of KKLPDLPLS (LRR7) in Y. pestis YopM (n = 12) [3G06_A] (Fig. 2a). The variable VS of xxLxxxLPxx frequently adopt four or five PPIIs. The HELFIT analysis demonstrated that all of the PPIIs assigned are definitely left handed polyproline helices, as noted later.
In addition, the programs identified β-turns at the C-terminal side in the VS parts. The sequences in the underlined residues of xxLPxLPxxLxxLxLxxNxL adopt β-turns; the second conserved Pro corresponds to the residue, i, of β-turns. The types are distinguished by the Φ, Ψ, angles of residues i + 1 and i + 2. The average Cα (i)−Cα (i + 3) is 5.48 (0.22 Å); the numbers in parenthesis are standard deviations; they are reasonable [66]. The Φ, Ψ angles of residues i + 1 and i + 2 of the β-turns have the average angles of Φi+1 = − 60.7 (8.8°), Ψi+1 = − 22.6 (8.4°), Φi+2 = − 94.0 (13.8°), Ψi+2 = 3.4 (14.0°); the numbers in parentheses are standard deviations. These values are close to − 60°, − 30° and − 90°, 0° which define the type I β-turns. The β-turns assignments by PROMOTIF [60] and STRIDE [61] give the same results and indicate that most of the β-turns are type I (Table 2). Types IV and VIII rarely appear. Also very rarely β-turns are not assigned.
In the T-subtype VS PPIIs assigned may be divided into three patterns. Many VSs adopt two separate PPIIs with two or three and two residues, which are observed in the underlined residues of xxLPxxLPxx and xxLPxxLPxx (Fig. 2b; Supplementary Table S2). The second is one PPIIs with three or four residues in the underlined residues of xxLPxxLPxx, xxLPxxLPxx or xxLPxxLPxx. The third pattern is six residue PPII in the underlined residues of xxLPxxLPxx, which are seen in decorin, TLR9, and TLR8 by the XLTSSTR program. Moreover, the C-terminal two residues of the T-subtype VS are assigned to adopt mostly type I β-turn as does that of the N-subtype VS.
The average RMSD of the VS parts of the N-subtypes and of the T-subtypes is 0.589 (0.315 Å) and 0.993 (0.337 Å), respectively; all Bacterial VSs show the RMSD of 1.087 (0.549 Å).
In conclusion the N-subtype VS is characterized by a super secondary structure consisting of PPII with four, five, or six residues and a type I β-turn (Fig. 3), while the T-subtype VS strongly prefers one or two separate PPIIs and adopts a type I β-turn at the C-terminal side as does the N-subtype VS. It appears that the structure of the T-subtype is more variable.
3.3 Helical Parameters of PPIIs, Type I β-turns, and LRR Domains
The number of assigned PPII of which the helix length is longer than three residues increase in order of DSSP-PPII < PROSS < XLTSSTR < SEGNO (Table 3). SEGNO assigns longer PPIIs. Four residue PPIIs are regular and thus are a near ideal form (Table 2). All of five and six residue PPIIs are irregular. The helix regularity decreases with increasing helix length. The deviation of helix parameters from ideal values increases with increasing helix length. The helix irregularities in the five and six residue PPIIs are mostly due to larger deviations of the Φ, Ψ angles from ideal: − 75°, 145° in the first and/or second residues of N-terminal side in the sequence of xLPxLP. The sequence of VP(A/R)LP in TLR9 (LRR4), which corresponds to the underlined residues of xxLPxLPxx, adopt five residue PPIIs with large helix irregularity; p = 0.41–0.47 Å. In this case, the irregularity comes from large deviation of the Φ, Ψ angles of the conserved Leu at the C-terminal side. Taking account of the helical parameters, the five residue PPIIs are regarded as a highly deformed form. Consequently, the HELFIT analysis demonstrated that all four, five, and six residue PPIIs assigned by the secondary structures analyses are regular or irregular left handed polyproline helices.
In the T-subtype only the XLTSSTR program identifies long PPIIs with six residues in LRR7 of horse/bovine TLR9, LRR4 of human TLR8, and LRR4 of decorin, as noted. The p values are very large; p = 0.38–0.50 Å (Table 3). The high irregularity comes from the Φ, Ψ angle of any residue at position six in the T-subtype VS consensus of xxLPxxLPxx; the Φ, Ψ angles are in regions of a left handed α-helix. In addition, the helical parameters deviate highly from those of ideal form (Table 3). The six residue PPIIs assigned are not recognized as a PPII. Alternatively, the SEGNO program identifies regular, four residue PPIIs in the underlined residues of xxLPxxLPxx (Supplementary Table S2). Thus, the six residue PPIIs may be divided into two separate PPIIs with three and two residues. This supports the conclusion by the secondary structure assignment that the T-subtype VS contains two separate PPIIs.
The average helix parameters of the type I β-turns are; P = 6.11→ 6.14 Å, N = 3.65 →3.68 residues/turn, R = 2.26 →2.28 Å, and Δz = 1.68 → 1.69 Å (Table 4). It appears that the helical parameters are close to those of α-helix as it has P = 5.4 Å, N = 3.6 residues/turn, R = 2.4 Å, and Δz = 1.5 Å. The average p value is 0.02→ 0.03 Å. The HELFIT analysis indicates that these type I β-turns form a regular, right handed helix.
The helix parameters of LRR domains were determined for IpaH9.8 (n = 8), SspH1, (n = 10), SspH2 (n = 13), Y. pestis YopM (n = 16), Y. enterocolitica YopM (n = 21), and SlrP (n = 12). The bacterial LRR domains are represented by a right handed helix (Table 5). The helix parameters range over: P = 47.3 → 115 Å, N = 28.4 → 41.3 units/turn, R = 18.9 → 24.6 Å, Δz = 1.67 → 3.46 Å, and ΔΦ = 8.7° → 12.7°; p = 0.03 → 0.19 Å.
Figure 4 shows a plot of 2·R·sin (ΔΦ/2) versus Δz. The values fall on a circle with radius D (circle plot) [7]. D is a function of Δz, ΔΦ, and R [7].
D is the average Cα(i) – Cα(i + 1) distance between adjacent repeats—i and i + 1; D corresponds to the inter-strand distance. Equation 5 gives D = 4.97 ± 0.10 Å; this allows the formation of hydrogen bonds between parallel strands. This circle plot shows that the helix pitch, P, and rise per turn, Δz, of bacterial LRR is comparable to those of SDS22-like and Plant specific LRRs; while it is larger than those of RI-like and CC LRRs [7].
3.4 Geometrical Analysis
Figure 5 shows the frequency distributions of three angles of Ω1, Ω2, and Ω3. The Ω1 angle shows an asymmetrical distribution (Fig. 5b). The Ω1 angle ranges from 70° to 120°; the average value is ∼ 103° (Table 4). The Ω2 and Ω3 have the average values of ∼ 33° and ∼ 99°, respectively.
4 Discussion
4.1 Structural Role of PPIIs in Bacterial LRRs
The backbone dihedral angles (Φ, Ψ) of ideal PPII is (− 75°, 145°) [23]. Other dihedral angles has been also proposed; (Φ, Ψ) = (− 75°, 140°), (− 65°, 145°), and (− 60°, 140°) (Table 3) [26, 67, 68]. It appears that four residue PPIIs in proteins are a near ideal form with (Φ, Ψ) = (− 75°, 145°) or (− 75°, 140°).
The secondary structure assignment and the HELFIT analysis indicate that the N-subtype VS adopts one stretch of PPII of four, five, or six residues. In contrast, the T-subtype prefers two separate PPIIs consisting two or three and of two residues. Consequently, there is a clear difference in the PPII patterns between the two subtypes. The two hydrophobic residues in the VS part of the two subtypes are concentrated on the side that is oriented toward the hydrophobic core as well as other conserved hydrophobic residues in the HCS part (Fig. 6). This structural restriction makes the difference. The assignments by the four programs show different PPII patterns in most cases. This observation mainly comes from high flexibility of PPII due to no intrachain hydrogen bond.
The variable N-type VSs are seen in YopM—RSLCDLPPS and SGLSELPPN (Supplementary Table S2). The first repeat of twelve LRRs in chondroadherin is also a variable N-subtype VS of QKIPKVSEK; the structure of human chondroadherin which forms tetramers in crystal has been determined at 2.1 Å resolution [69]. Their secondary structure assignment sometimes shows no PPII or only short PPII of two or three residues. These observations indicate that two conserved prolines in the N-subtype VS parts are strongly required for the super secondary structure consisting of PPII and β-turn.
It may be significant that Src tyrosine kinases SH3 domain binds to short proline rich sequence of xPxxP that forms PPII [70]. This sequence is very similar to LPxLP in the N-subtype.
4.2 Super Secondary Structure
The present analyses demonstrate that the N-subtype adopts a super secondary structure consisting of a PPII and a β-turn. Ananthanarayanan et al. [71] first described this super secondary structure. Tandem repeats of the super secondary structure form a novel helical structure called the polyproline, β-turn helix [72]. This structure is observed in tandem repeats of the hepta-peptide, YSPSPSPS, in the C-terminal repeat domain (CTD) of the large subunit of RNA polymerase II (POL II) [73,74,75]. Many factors involved in RNA processing bind the CTD [76].
The VSs of RI-like and CC LRRs adopt an α-helical conformation (β–α structural units). Typical LRR VSs prefer tandem β-turns. The SDS22-like LRR VSs strongly prefer 310-helix (β-310). The Plant specific LRR structural unit is β-β-310. The occurrence of β-turns at the C-terminal sides of the VS parts are also observed in Plant specific, SDS22-like, and Typical LRRs as well as bacterial LRR [7]. Consequently, a super secondary structure consisting of 310-helix and β-turns is present in Plant specific and SDS22-like LRRs. The unique super secondary structures consisting of β-turns and PPII, and of β-turns and 310-helix should be recognized as structural elements in proteins.
Here we propose a structural parameter that characterizes the super secondary structure; the parameter is the angle between the two helix axes of PPII and type I β-turn (Ω1). Its average value is ∼ 103°. The Ω1 angle also helps to characterize other super secondary structures including those consisting of PPII and an α-helix, and of PPII and a 310-helix.
4.3 Solenoid Structure of Bacterial LRR Domains
The circle plot of bacterial LRR is comparable to those of SDS22-like and Plant specific LRR; while it differs from those of RI-like and CC LRRs (Fig. 4). We recently calculated helical parameters of 642 LRRs of known structures of 114 proteins by the HELFIT program [7]. The results indicate that the helical parameters are influenced by the structures of the ascending loops rather than of the descending loops, helical elements on the convex face, and the uniformity of parallel strand stacking on the concave face [1, 7]. The helix radius of PPII is the smallest between α-helix, 310-helix, π-helix, ω-helix, and PPII. This partly contributes to a relatively large helix pitch for the bacterial LRR domains.
We determined the Ω2 angle between the helix axes of PPII and of the bacterial LRR domain. The VSs of SDS22-like, Plant-specific, CC, and RI-like adopt 310-helix or α-helix instead of PPII. The comparison of the Ω2 angle with the angles between their helices and LRR domains may identify fundamental features of individual LRR classes.
4.4 The PPII Assignment by the Four Programs
In many methods for assignments of secondary structures from atomic coordinates, the termini of the segments are frequently ill-defined and it is difficult to decide unambiguously which residues at the edge of the segments have to be included [77]. In this study the PPII pattern assigned also differs between the four programs in most cases. The number of four, five, or six residue PPIIs assigned is larger in SEGNO and XLTSSTR than in DSSP-PPII and PROSS. However, the HELFIT analyses indicate that all PPIIs assigned are unambiguously regular or irregular left handed polyproline helices with only a few exceptions. The combination of the secondary structure assignment programs (SEGNO and XLTSSTR) and the HELFIT analysis is useful for PPII assignment in proteins.
5 Conclusions
The present study shows that the N-subtype bacterial LRRs are characterized by a unique super secondary structures consisting of PPII helices and a β-turn. In contrast, the T-subtype VS prefers two separate PPIIs with two or three or with only two residues. The type I β-turns can be regarded as regular, right handed helices. We propose three important structural parameters: the three angles between the two helix axes of PPII and β-turn, between two helix axes of PPII and LRR domain, and between the helix axis of LRR domain and the vector product of P × B. These three angles characterize the super secondary structure and the LRR domain.
Funding
This study was funded by National University of Mongolia (FELLOWSHIP GRANT-P2016-1173) (to P. E.)
Abbreviations
- LRR:
-
Leucine rich repeat
- PPII:
-
Polyproline II helix
- HCS:
-
Highly conserved segment
- VS:
-
Variable segment
- Cα :
-
α-Carbon
- CC:
-
Cysteine containing
- P :
-
Helix pitch
- R :
-
Helix radius
- N :
-
Number of repeat unit/residue per turn in helix
- δz :
-
Rise per repeat unit/residue in helix
- ΔΦ:
-
Rotation angle per repeat unit/residue in helix
- n :
-
Repeat number of leucine rich repeat
- PDB:
-
Protein data bank
- RMSD:
-
Root mean square deviation
- TLR:
-
Toll like receptor
- SLRP:
-
Small leucine rich repeat proteoglycan protein
- FLRT:
-
Fibronectin leucine rich repeat transmembrane protein
- CTD:
-
C-terminal repeat domain of the large subunit of RNA polymerase II
- POL II:
-
RNA polymerase II
References
Matsushima N, Kretsinger RH (2016) Leucine rich repeats: sequences, structures, ligand—interactions, and evolution. LAMBERT Academic Publishing, Saarbrücken
Kobe B, Deisenhofer J (1994) The leucine-rich repeat: a versatile binding motif. Trends Biochem Sci 19:415–421
Kobe B, Kajava AV (2001) The leucine-rich repeat as a protein recognition motif. Curr Opin Struct Biol 1:725–732
Matsushima N, Tachi N, Kuroki Y, Enkhbayar P, Osaki M, Kamiya M, Kretsinger RH (2005) Structural analysis of leucine-rich-repeat variants in proteins associated with human diseases. Cell Mol Life Sci 62:2771–2791
Bella J, Hindle KL, McEwan PA, Lovell SC (2008) The leucine-rich repeat structure. Cell Mol Life Sci 65:2307–2333
Matsushima N, Enkhbayar P, Kamiya M, Osaki M, Kretsinger RH (2005) Leucine-rich repeats (LRRs): structure, function, evolution and interaction with ligands. Drug Design Rev 2:305–322
Enkhbayar P, Miyashita H, Kretsinger RH, Matsushima N (2014) Helical parameters and correlations of Tandem Leucine rich repeats in proteins. J Proteom Bioinform 7:139–150
Ng AC, Eisenberg JM, Heath RJ, Huett A, Robinson CM, Nau GJ, Xavier RJ (2011) Human leucine-rich repeat proteins: a genome-wide bioinformatic categorization and functional analysis in innate immunity. Proc Natl Acad Sci USA 108(Suppl 1):4631–4638
Tör M, Lotze MT, Holton N (2009) Receptor-mediated signalling in plants: molecular patterns and programmes. J Exp Bot 60:3645–3654
Coburn B, Sekirov I, Finlay BB (2007) Type III secretion systems and disease. Clin Microbiol Rev 20:535–549
Haraga A, Miller SI (2006) A Salmonella type III secretion effector interacts with the mammalian serine/threonine protein kinase PKN1. Cell Microbiol 8:837–846
Matsushima N, Ohyanagi T, Tanaka T, Kretsinger RH (2000) Super-motifs and evolution of tandem leucine-rich repeats within the small proteoglycans-biglycan, decorin, lumican, fibromodulin, PRELP, keratocan, osteoadherin, epiphycan, and osteoglycin. Proteins 38:210–225
Matsushima N, Kamiya M, Suzuki N, Tanaka T (2000) Super-motifs of leucine-rich repeats (LRRs) proteins. Genome Inform 11:343–345
Matsushima N, Tanaka T, Enkhbayar P, Mikami T, Taga M, Yamada K, Kuroki Y (2007) Comparative sequence analysis of leucine-rich repeats (LRRs) within vertebrate toll-like receptors. BMC Genom 8:24–143
Siligardi G, Drak AF (1995) The importance of extended conformations and, in particular, the PII conformation for the molecular recognition of peptides. Biopolymers 37:281–292
MacArthur MW, Thornton JM (1991) Influence of proline residues on protein conformation. J Mol Biol 218:397–412
Rath A, Davidson AR, Deber CM (2005) The structure of “unstructured” regions in peptides and proteins: role of the polyproline II helix in protein folding and recognition. Biopolymers 80:179‒185
Vitagliano L, Berisio R, Mastrangelo A, Mazzarella L, Zagari A (2001) Preferred proline puckerings in cis and trans peptide groups: implications for collagen stability. Protein Sci 10:2627–2632
Adzhubei AA, Sternberg MJ (1993) Left-handed polyproline II helices commonly occur in globular proteins. J Mol Biol 229:472–493
Stapley BJ, Creamer TP (1993) A survey of left-handed polyproline II helices. Protein Sci 8:587‒595
Cubellis MV, Caillez F, Blundell TL, Lovell SC (2005) Properties of polyproline II, a secondary structure element implicated in protein–protein interactions. Proteins 58:880‒892
Berisio R, Loguercio S, De Simone A, Zagari A, Vitagliano L (2006) Polyproline helices in protein structures: a statistical survey. Protein Pept Lett 13:847‒854
Barlow DJ, Thornton JM (1988) Helix geometry in proteins. J Mol Biol 201:601–619
Creamer TP, Campbell MN (2012) Determinants of the polyproline II helix from modeling studies. Adv Protein Chem 62:263–282
Hopfinger A (2012) Conformational properties of macromolecules. Elsevier, Amsterdam
Schulz GE, Schirmer RH (1979) Principles of protein structure. Springer, Berlin
Kumar P, Bansal M (2016) Structural and functional analyses of PolyProline-II helices in globular proteins. J Struct Biol 196:414‒425
Kabsch W, Sander C (1983) Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers 22:2577–2637
Frishman D, Argos P (1995) Knowledge-based protein secondary structure assignment. Proteins 23:566–579
Berman HM, Kleywegt GJ, Nakamura H, Markley JL (2014) The protein data bank archive as an open data resource. J Comput Aided Mol Des 28:1009‒1014
Mansiaux Y, Joseph AP, Gelly JC, de Brevern AG (2011) Assignment of PolyProline II conformation and analysis of sequence-structure relationship. PLoS ONE 6:e18401
Srinivasan R, Rose GD (1999) A physical basis for protein secondary structure. Proc Natl Acad Sci USA 96:14258‒14263
Cubellis MV, Cailliez F, Lovell SC (2005) Secondary structure assignment that accurately reflects physical and evolutionary characteristics. BMC Bioinform 6(Suppl 4):S8
King SM, Johnson WC (1999) Assigning secondary structure from protein coordinate data. Proteins 35:313–320
Kumar P, Bansal M (2005) Identification of local variations within secondary structures of proteins. Acta Crystallogr 71:1077–1086
Richards FM, Kundrot CE (1988) Identification of structural motifs from protein coordinate data: secondary structure and first-level supersecondary structure. Proteins 3:71–84
Chiang YS, Gelfand TI, Kister AE, Gelfand IM (2007) New classification of supersecondary structures of sandwich-like proteins uncovers strict patterns of strand assemblage. Proteins 68:915–921
Adzhubei AA, Sternberg MJ (1994) Conservation of polyproline II helices in homologous proteins: implications for structure prediction by model building. Protein Sci 3:2395–2410
Evdokimov AG, Anderson DE, Routzahn KM, Waugh DS (2001) Unusual molecular architecture of the Yersinia pestis cytotoxin YopM: a leucine-rich repeat protein with the shortest repeating unit. J Mol Biol 312:807–821
Park H, Huxley-Jones J, Boot-Handford RP, Bishop PN, Attwood TK, Bella J (2008) LRRCE: a leucine-rich repeat cysteine capping motif unique to the chordate lineage. BMC Genom 9:599
Adzhubei AA, Sternberg MJ, Makarov AA (2013) Polyproline-II helix in proteins: structure and function. J Mol Biol 425:2100–2132
Keszei AF, Tang X, McCormick C, Zeqiraj E, Rohde JR, Tyers M, Sicheri F (2014) Structure of an SspH1-PKN1 complex reveals the basis for host substrate recognition and mechanism of activation for a bacterial E3 ubiquitin ligase. Mol Cell Biol 34:362‒373
Quezada CM, Hicks SW, Galan JE, Stebbins CE (2009) A family of Salmonella virulence factors functions as a distinct class of autoregulated E3 ubiquitin ligases. Proc Natl Acad Sci USA 106:4864‒4869
Zhu Y, Li H, Hu L, Wang J, Zhou Y, Pang Z, Liu L, Shao F (2008) Structure of a Shigella effector reveals a new class of ubiquitin ligases. Nat Struct Mol Biol 15:1302‒1308
Takagi K, Kim M, Sasakawa C, Mizushima T (2016) Crystal structure of the substrate-recognition domain of the Shigella E3 ligase IpaH9.8. Acta Crystallogr 72:269‒275
Zouhir S, Bernal-Bayard J, Cordero-Alba M, Cardenal-Munoz E, Guimaraes B, Lazar N, Ramos-Morales F, Nessler S (2014) The structure of the Slrp-Trx1 complex sheds light on the autoinhibition a of the type III secretion system effectors of the NEL family. Biochem J 464:135‒144
Kokatla HP, Sil D, Tanji H, Ohto U, Malladi SS, Fox LM, Shimizu T, David SA (2014) Structure-based design of novel human Toll-like receptor 8 agonists. ChemMedChem 9:719‒723
Ohto U, Shibata T, Tanji H, Ishida H, Krayukhina E, Uchiyama S, Miyake K, Shimizu T (2015) Structural basis of CpG and inhibitory DNA recognition by Toll-like receptor 9. Nature 520:702‒705
Tanji H, Ohto U, Shibata T, Miyake K, Shimizu T (2013) Structural reorganization of the toll-like receptor 8 dimer induced by agonistic ligands. Science 339:1426–1429
Paracuellos P, Kalamajski S, Bonna A, Bihan D, Farndale RW, Hohenester E (2017) Structural and functional analysis of two small leucine-rich repeat proteoglycans, fibromodulin and chondroadherin. Matrix Biol 63:106–116
Scott PG, McEwan PA, Dodd CM, Bergmann EM, Bishop PN, Bella J (2004) Crystal structure of the dimeric protein core of decorin, the archetypal small leucine-rich repeat proteoglycan. Proc Natl Acad Sci USA 101:15633‒15638
Scott PG, Dodd CM, Bergmann EM, Sheehan JK, Bishop PN (2006) Crystal structure of the biglycan dimer and evidence that dimerization is essential for folding and stability of class I small leucine-rich repeat proteoglycans. J Biol Chem 281:13324–13332
Seiradake E, del Toro D, Nagel D, Cop F, Härtl R, Ruff T, Seyit-Bremer G, Harlos K, Border EC, Acker-Palmer A, Jones EY, Klein R (2014) FLRT structure: balancing repulsion and cell adhesion in cortical and vascular development. Neuron 84:370–385
Lu YC, Nazarko OV, Sando R 3rd, Salzman GS, Südhof TC, Araç D (2015) Structural basis of latrophilin-FLRT-UNC5 interaction in cell adhesion. Structure 23:1678–1691
Ranaivoson FM, Liu Q, Martini F, Bergami F, von Daake S, Li S, Lee D, Demeler B, Hendrickson WA, Comoletti D (2015) Structural and mechanistic insights into the latrophilin3-FLRT3 complex that mediates glutamatergic synapse development. Structure 23:1665–1677
Jackson VA, Mehmood S, Chavent M, Roversi P, Carrasquero M, Del Toro D, Seyit-Bremer G, Ranaivoson FM, Comoletti D, Sansom MS, Robinson CV, Klein R, Seiradake E (2016) Super-complexes of adhesion GPCRs and neural guidance receptors. Nat Commun 17:11184
Enkhbayar P, Damdinsuren S, Osaki M, Matsushima N (2008) HELFIT: helix fitting by a total least squares method. Comput Biol Chem 32:307‒310
Matsushima N, Miyashita H, Mikami T, Yamada K (2011) A new method for the identification of leucine-rich repeats by incorporating protein secondary structure prediction. Tuteja R (ed) Bioinformatics: genome bioinformatics and computational biology. NOVA Sience Pulishers, Hauppauge, pp 61–88
Chebrek R, Leonard S, de Brevern AG, Gelly JC (2014) PolyprOnline: polyproline helix II and secondary structure assignment database. Database 2014:bau102
Hutchinson EG, Thornton JM (1996) PROMOTIF—a program to identify and analyze structural motifs in proteins. Protein Sci 5:212–220
Heinig M, Frishman D (2004) STRIDE: a web server for secondary structure assignment from known atomic coordinates of proteins. Nucleic Acids Res 32(WebServer issue):W500–W5022
Pettersen EF, Goddard TD, Huang CC, Couch GS, Greenblatt DM, Meng EC, Ferrin TE (2004) UCSF Chimera—a visualization system for exploratory research and analysis. J Comput Chem 25:1605–1612
Enkhbayar P, Hikichi K, Osaki M, Kretsinger RH, Matsushima N (2006) 3(10)-helices in proteins are parahelices. Proteins 64:691‒699
Enkhbayar P, Boldgiv B, Matsushima N (2010) Omega-Helices in proteins. Protein J 29:242‒249
Matsushima N, Miyashita H, Enkhbayar P, Kretsinger RH (2015) Comparative geometrical analysis of leucine-rich repeat structures in the nod-like and toll-like receptors in vertebrate innate immunity. Biomolecules 5:1955–1978
Wilmot CM, Thornton JM (1988) Analysis and prediction of the different types of beta-turn in proteins. J Mol Biol 203:221‒232
Jha AK, Colubri A, Zaman MH, Koide S, Sosnick TR, Freed KF (2005) Helix, sheet, and polyproline II frequencies and strong nearest neighbor effects in a restricted coil library. Biochemistry 44:9691–9702
Adzhubei AA, Eisenmenger F, Tumanyan VG, Zinke M, Brodzinski S, Esipova NG (1987) Approaching a complete classification of protein secondary structure. J Biomol Struct Dyn 5:689–704
Rämisch S, Pramhed A, Tillgren V, Aspberg A, Logan DT (2017) Crystal structure of human chondroadherin: solving a difficult molecular-replacement problem using de novo models. Acta Crystallogr 73:53–63
Kay BK, Williamson MP, Sudol M (2000) The importance of being proline: the interaction of proline-rich motifs in signaling proteins with their cognate domains. FASEB J 14:231–241
Ananthanarayanan VS, Soman KV, Ramakrishnan C (1987) A novel supersecondary structure in globular proteins comprising the collagen-like helix and beta-turn. J Mol Biol 198:705‒709
Matsushima N, Creutz CE, Kretsinger RH (1990) Polyproline, beta-turn helices. Novel secondary structures proposed for the tandem repeats within rhodopsin, synaptophysin, synexin, gliadin, RNA polymerase II, hordein, and gluten. Proteins 7:125‒155
Meredith GD, Chang WH, Li Y, Bushnell DA, Darst SA, Kornberg RD (1996) The C-terminal domain revealed in the structure of RNA polymerase II. J Mol Biol 258:413–419
Kumaki Y, Matsushima N, Yoshida H, Nitta K, Hikichi K (2001) Structure of the YSPTSPS repeat containing two SPXX motifs in the CTD of RNA polymerase II: NMR studies of cyclic model peptides reveal that the SPTS turn is more stable than SPSY in water. Biochim Biophys Acta 1548:81–93
Cramer P, Bushnell DA, Kornberg RD (2001) Structural basis of transcription: RNA polymerase II at 2.8 angstrom resolution. Science 292:1863–1876
Hsin JP, Manley JL (2012) The RNA polymerase II CTD coordinates transcription and RNA processing. Genes Dev 26:2119–2137
Martin J, Letellier G, Marin A, Taly JF, de Brevern AG, Gibrat JF (2005) Protein secondary structure assignment revisited: a detailed analysis of different assignment methods. BMC Struct Biol 5:17
Crooks GE, Hon G, Chandonia JM, Brenner SE (2004) WebLogo: a sequence logo generator. Genome Res 14:1188–1190
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
All authors declare that they have no conflicts of interest.
Electronic supplementary material
Below is the link to the electronic supplementary material.
10930_2018_9767_MOESM1_ESM.pdf
Supplementary material 1. Table S1 Percentage identity of 18 different protein chains containing bacterial LRRs. (PDF 116 KB)
10930_2018_9767_MOESM2_ESM.pdf
Supplementary material 2. Table S2 Secondary structure assignments of the N-subtype VS and the T-subtype VS of bacterial LRR (PDF 313 KB)
Rights and permissions
Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
About this article
Cite this article
Batkhishig, D., Bilguun, K., Enkhbayar, P. et al. Super Secondary Structure Consisting of a Polyproline II Helix and a β-Turn in Leucine Rich Repeats in Bacterial Type III Secretion System Effectors. Protein J 37, 223–236 (2018). https://doi.org/10.1007/s10930-018-9767-9
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10930-018-9767-9