Abstract
Psidium guajava L., a fruit crop belonging to the Myrtaceae family, is highly valued for its nutritional and medicinal properties. The family exhibits a diverse chemical profile of essential oils and serves as a valuable resource due to its ecological interactions, adaptability, and dispersal capacity. The Myrtaceae family has been extensively studied for its terpenoids. Genetic studies have focused on foliar terpene yield in species from the Eucalypteae and Melaleucaceae tribes. To understand the evolutionary trends in guava breeding, this study predicted terpene synthase genes (TPS) from different cultivars. Through this analysis, 43 full-length TPS genes were identified, and approximately 77% of them exhibited relative expression in at least one of the five investigated plant tissues (root, leaf, bud, flower, and fruit) of two guava cultivars. We identified intra-species variation in the terpene profile and single nucleotide polymorphisms (SNPs) in twelve TPS genes, resulting in the clustering of 62 genotypes according to their essential oil chemotypes. The high concentration of sesquiterpenes is supported by the higher number of TPS-a genes and their expression. The expansion for TPS sub-families in P. guajava occurred after the expansion of other rosids species. Providing insight into the origin of structural diversification and expansion in each clade of the TPS gene family within Myrtaceae. This study can provide insights into the diversity of genes for specialized metabolites such as terpenes, and their regulation, which can lead to a diverse chemotype of essential oil in different tissues and genotypes. This suggests a mode of enzymatic evolution that could lead to high sesquiterpene production, act as a chemical defense and contribute to the adaptive capacity of this species to different habitats.
Similar content being viewed by others
Introduction
Guava (Psidium guajava L.) is a fleshy fruit native to the tropical and subtropical regions of Mexico, Central, and South America1, and is part of the pantropical tribe Myrteae, which includes 51 genera and around 2500 species. The genus Psidium, with approximately 50 recognized species, shows remarkable diversity in Brazil's Atlantic Coastal Forest, Cerrado, and Caatinga biomes, which are considered the center of diversity for this genus2,3. Due to its remarkable adaptability and dispersion capacity, guava is grown successfully in many countries, even in neglected soils4,5.
The fruit is a rich source of fiber, potassium, manganese, copper, folic acid, and vitamins A and C6. Besides the nutritional value of guava, the plant also contains a diverse array of terpenes in its foliar essential oils7,8. These terpenes possess a wide range of beneficial properties, including antioxidant9, anti-stress10, anticancer, analgesic, anti-hyperglycemic, anti-inflammatory11,12, antidiabetic13, anti-aging14, antimicrobial12, antihypertensive, cardioprotective15 and antibacterial effects16. These terpenes also play roles mediating interactions with abiotic stressors17,18 and the plant’s biotic environment, contributing to its defense against herbivores and pathogens19, as well as attracting pollinators, particularly in neotropical species with fleshy berries that serve as a food source20,21.
The content and composition of essential oils may vary significantly among different species, which can be influenced by genetic and environmental factors22,23,24,25. In the case of P. guajava, the essential oil exhibits variation in its chemotypes, which is influenced by the genotype7,26,27. Understanding the genes associated with essential oil biosynthesis pathways, as part of genetic breeding programs, enables the identification and selection of specific genotypes. This knowledge, when combined with conventional techniques, facilitates the exploitation of essential oil production for industrial, pharmacological, and agronomic purposes28.
Terpenoids derive from a basic five-carbon unit, isopentenyl diphosphate (IPP), and its isomer dimethylallyl diphosphate (DMAPP)29. Prenyl transferases combine these building blocks into isoprene diphosphates of varying lengths, including geranyl diphosphate (GPP), farnesyl diphosphate (FPP), and geranyl diphosphate (GGPP), which are the main precursors of cyclic and acyclic monoterpenes (C10), sesquiterpenes (C15), and diterpenes (C20), respectively30. In the final steps of terpenoid biosynthesis, these precursors are then modified modified through carbocationic reactions which are controlled by a large number of enzymes called terpene synthases (TPS). These enzymes catalyze the cyclization, hydride shifts, or other rearrangement of precursor molecules to generate a multitude of terpenoids from a few common substrates29,30.
In plants, within genomic sequences, the enzymes are coded by members of the TPS gene family, divided into seven subfamilies. The subfamilies related to secondary metabolism, TPS-a (sesquiterpene), TPS-b (monoterpene), and TPS-g (acyclic mono-, sesqui-, and diterpenes), which are three angiosperm-specific subfamilies. The subfamily TPS-d is specific to gymnosperms (mono-, sesqui-, diterpenes) and the subfamily TPS-h is specific to the spikemoss Selaginella moellendorffii (putative bifunctional diterpenes). Also, the members associated with the primary metabolism, subfamilies TPS-e and -f proposed to be combined into the group of TPS-e/f (kaurene, mono-, sesqui-, and diterpenes), conserved among vascular plants, and TPS-c subfamily (diterpenes) of angiosperm copalyl diphosphate synthases29.
The action of terpenoids on the plant-environment interaction is much reported in the Myrtaceae family due to its remarkable amount of compounds and related genes. The genetic basis of foliar terpene yield has been extensively exploited, mainly in the subfamily recognized by dried fruits, such as species of Eucalypteae tribe, including Eucalyptus grandis, E. globulus, and Corymbia citriodora, which contain the most significant number of complete TPS genes reported in eudicotyledons (70, 69, and 89 complete genes, respectively). Terpene synthase genes have also been identified in Melaleuca alternifolia and Leptospermum scoparium, with 37 and 49 putative TPS genes, respectively31,32. The oil profile pattern in foliar terpenes in these species are the monoterpenes α-pinene and 1,8-cineole23,29,33.
Mono- and sesquiterpenes are pivotal constituents in the essential oil composition of Psidium neotropical species34. Only one study have been identified TPS genes in fleshy-fruited Myrtaceae species24, with monoterpenes as the major compounds. Psidium guajava essential oils were characterized by sesquiterpenes as their most abundant compounds, with the main constituents caryophyllene, aromadendrene skeletons acyclic, among others27,35,36. Thus, in this study, we structurally characterize predicted terpene synthase genes (TPS) from Psidium guajava to elucidate evolutionary trends for guava breeding. We also investigate the gene expression patterns among the cultivars with different agronomic traits and essential oil chemotypes. The genomic resource will elucidate the molecular mechanisms governing the formation and variation in the essential oils content.
Materials and methods
Plant materials and sequencing
The transcriptomic data for leaves, flowers, and fruit from the Indian cultivar Allahabad Safeda were retrieved from a public database (accession PRJNA472130), as detailed in Supplementary Table S237.
Genomic and transcriptomic data were obtained from sequencing two cultivars (Paluma and Cortibel RM) registered in the Brazilian Ministry of Agriculture (Ministério da Agricultura, Pecuária e Abastecimento-MAPA; http://sistemas.agricultura.gov.br/; accession PRJNA1020439).
Transcriptomic data derived from samples from five distinct tissues (flower buds, immature, young, and mature leaves, roots) were collected and grouped to obtain pools in Cortibel RM. For Paluma, samples from four tissues (immature, young, and mature leaves, roots), were utilized. Each pool represents one biological replicate consisting of samples from four independent seedlings. The seedlings used were four months old and were grown in a greenhouse. All tissues were immediately frozen in liquid nitrogen and stored at − 80 °C until RNA extraction (Table S2).
RNA was isolated from 100 to 300 mg of tissues using the CTAB-based method38. The isolated RNA was treated with DNase enzyme for removal of contaminant DNA and cleaned up using RNeasy Plant Mini kit (QIAGEN). The quality and quantity of total RNA were calculated through TapeStation System (Agilent) and Qubit (Thermo Fisher Scientific), respectively. Good quality RNA was further subjected to rRNA removal using RiboMinus Plant kit (Thermo Fisher Scientific). The transcriptome of the two Brazilian cultivars was sequenced using the Illumina PE NextSeq 500 platform.
Terpene synthase genes discovery
The resulting peptides of prediction of P. guajava genotypes were searched against the Pfam-A database locally, using HMMER 3.0 (Hidden Markov Model) with the previously identified C terminal (PF03936) and N terminal (PF01397) domains39. We also aligned the sequences from a curated database of plant sesquiterpene synthases using BLAST programs (e-value < 1e10−5)40.
The presence of the target domains was verified by the Pfam database (http://pfam.xfam.org/search), the Simple Modular Architecture Research Tool database (SMART; http://smart.embl-heidelberg.de/)41. The intron/exon structures, organization, and motif representation of putative TPS were determined using the Gene Structure Display Server (GSDS) program42. The conserved motifs, sequence logo and graphical representations were generated utilizing WebLogo 2.8.2 with default parameters (http://weblogo.berkeley.edu/)43.
Phylogenetic analysis
Considering the amino acid sequences of terpene synthases in P. guajava, we employed a phylogenetic reconstruction methodology including sequences from Eucalyptus grandis, Arabidopsis thaliana, Vitis vinifera (https://phytozome.jgi.doe.gov/)44, Eucalyptus globulus23, Melaleuca alternifolia45, Psidium cattelyanum24 and Corymbia citriodora46. The alignment was performed using the MAFFT v. 7.4 software with the G-INS-1 algorithm47 and optimized using trimAI v. 1.448.
After the alignment, the next step was the search for the best amino acid substitution model using the IQ-TREE 249 with the selection procedure ProtTest50. The best-fit model was chosen based on the Bayesian Information Criterion (BIC)51. The model created a maximum-likelihood phylogenetic tree file (.nwk) employing 10,000 bootstrapped replicates. The resulting file was subsequently imported into iTOL version 5.5.1 for visualization and editing52.
The generated tree was divided into TPS genes associated with the primary metabolism process (subfamilies -c, -e, and -f) and those involved in secondary metabolic pathways (subfamilies a, b, g). Additionally, the functionally characterized terpene synthases were also included in the phylogenetic analysis (Supp. File 1).
Transcriptomic analysis and differential expression profiling
Three replicates from Psidium guajava cv. Allahabad Safeda, two replicates from cv. Paluma and two replicates from cv. Cortibel RM were used to obtain the gene expression patterns. The data were submitted to quality control with the TRIMMOMATIC 0.22 software53 with parameters -phred33 LEADING:3 TRAILING:3 SLIDINGWINDOW: 4:30 MINLEN:85 and their quality certified with the FQC Dashboard software 1.5.854. The de novo RNA-Seq approach assembled the transcriptome by TRINITY 2.8.4 assembler55. The aligner HiSat256 mapped the filtered reads to the to the Psidium guajava draft genome assembly. The R/Bioconductor package Rsubread, using the function of featureCounts performed the counting table of the mapped reads for the following statistical analysis57. The R/Bioconductor package DESeq2 conducts the differential gene expression (DGE) test58. It is also applied to the R/Bioconductor package apeglm to shrink log-fold change59.
DArTseq-based SNP analysis
Purified DNA samples (1 μg for each sample) from 62 Brazilian guava genotypes (cultivated and naturalized) were prepared according to recommendations (https://www.diversityarrays.com/faq/) and sent to Diversity Arrays Technology Pty. Ltd company (Canberra, Australia) to identify regions with polymorphism and produce the library using a high-throughput genotyping-by-sequencing system, DArTseq™, and variant calling analysis. Sequencing was made using the Illumina HiSeq2500 sequencing platform, and sequences were processed using proprietary DArT analytical pipelines60,61,62. Barcode/sample sequences were identified and used in label calling. Low-quality sequences were filtered out, and the identical ones gathered into fastqcall files. These files were processed by DArT PL SNP calling pipelines (DArTsoft-seq), as described by Sansaloni and colleagues62.
The generated sequences with SNP content (total of 30,761 SNPs markers > 0.9 call rate) from each of 62 genotypes were aligned against the TPS genes with the blastn software 2.8.0 + (evalue 1e−5) to identify highly polymorphic regions.
The heatmap with markers associated with TPS genes was performed using the R software package63 and the packages ‘pheatmap’ and ‘RColorBrewer.’ The analysis used the Euclidean distance, and the UPGMA (Unweighted Pair Group Method using Arithmetic averages) clustering method was employed.
Results
Identification and expansion of terpene synthase genes
We identified 120 TPS loci in the P. guajava genome from Brazil assembly (cultivars Paluma and Cortibel RM assembly). Among these loci, the analysis detected 12 pseudogenes (less than three or more than 16 exons), 22 alternative transcripts, and 43 partial genes with only the C-terminal or N-terminal domain (Table S1). Supplementary Fig. 1A, B displays the partial genes and alternative transcripts’ structure and the phylogenetic tree.
The phylogenetic analysis used the Maximum-likelihood estimation of forty-three full-length genes of the P. guajava genome assembly. We identified five well-supported and very distinct clades (bootstrap = 100) corresponding to the TPS subfamily (i.e., subfamilies a, b, c, e/f, and g), including the JTT + R6 and JTT + F + I + G4 amino acid substitution model, respectively64. Specifically, we focused on subfamilies TPS-a, TPS-b, and TPS-g, which are associated with the biosynthesis of secondary metabolites. Our phylogenetic tree incorporated 468 sequences for this category, as depicted in Fig. 1A. In contrast, subfamilies TPS-c, -e, and -f are primarily responsible for producing fundamental metabolites such as gibberellin and abscisic acid. This analysis encompassed a comprehensive set of 44 sequences, featuring representatives from other Myrtaceae species, as illustrated in Supplementary Fig. 1C.
Most genes (21) clustered in a clade corresponding to TPS-a subfamily, recognized for their role in generating sesquiterpenes (C15). Additionally, the TPS-b subfamily was observed, with a predominance of fifteen members classified as TPS-b1, which produce cyclic monoterpenes. Notably, the research identified no TPS-b2 subfamily members responsible for encoding isoprenes and ocimenes (C5, C10). Five subfamily TPS-g members were identified, producing acyclic mono-, sesqui-, and diterpenes (Fig. 1A). One member of the class TPS-c (diterpenes producer) and one member of TPS-e/f (mono-, sesqui-, and diterpenes producer) were identified and participated in the primary metabolic process (Fig. S1C).
Multiple sequence alignment demonstrated that proteins had highly conserved aspartate-rich motifs (DDxxD) and less conserved NSE/DTE motifs at the C-terminal, and an RRx8W domain at the N-terminal (Fig. 1B). The aspartate-rich motifs harbor a sequence of 35 amino acids located downstream of the RXR/RDR motif, which serves a crucial function in the chelation of the diphosphate group after substrate ionization. The TPS-c subfamily is present in land plants and is characterized by the “DXDD” motif but not the “DDXXD” motif in their proteins, which was detected in only one guava TPS (Fig. 1B).
About the structure of the twenty-one (49%) TPS-a, most contain 5 to 8 exons, except for Pg55158, Pg54483, Pg26070 and Pg25656, with 3 and 4 exons (Fig. 1C). The TPS-b gene subfamily was the second largest, containing 6 to 9 exons and included 15 genes, about 35% of the total TPS genes, except Pg44627-b1, Pg13454-b2, Pg34032-b1 that contain less than five exons. Also, five genes represented the TPS-g subfamily (which predominantly produces acyclic mono-, sesqui-, and diterpene) and include 6 to 8 exons. For the remaining TPS subgroups, one gene encoding copalyl diphosphate synthase represents the TPS-c subgroup, and one represents the TPS-e/f subgroup. Genes that belong to both TPS-c and TPS-e/f contain 12 and 13 exons, respectively (Fig. 1C).
We also observed predominance of the TPS-a genes transcripts as a proportion of the total TPS genes: 49% compared with E. grandis, the next highest at 54%, followed by V. vinifera (44%), C. citriodora, (45%), E. globulus (43%), M. alternifolia (43%), and P. cattleyanum (40–43%) (Fig. 1D).
In addition to a comparative study of TPS, the study assessed the overall similarity of protein sequences among guava samples compared to other members of the Myrtaceae family. Guava shared only nine complete TPS genes in orthologous pairs (a single gene in one species more closely related to a single gene in a different species than a gene within its genome) with red and yellow araça, then the other rosids. However, it’s worth noting that none of the TPS genes in guava exhibited orthologous relationships with pairs from other eucalyptus species (Table S6). Conversely, in contrast to this observation, eight genes in the yellow morphotype of P. cattleyanum were found to occur in orthologous pairs with guava. In comparison, nine genes exhibited orthologous relationships with the red morphotype of P. cattleyanum and guava. Interestingly, only nine genes were orthologous with both red and yellow morphotypes of P. cattleyanum.
Global expression profiling of terpene synthase genes from Psidium guajava
To explore the plasticity characteristics of gene transcription, available RNA-seq data sets derived from cv. Allahabad Safeda, Cortibel RM, and Paluma were evaluated (Supp. Table S2). The number of fragments per kilobase of exon per million fragments mapped (FPKM) was used to estimate the relative expression levels of annotated genes (Supp. Tables S3, S4). Figure 2A. displays the heatmap illustrating the relative transcript abundance of 21 TPS genes in leaves, flowers, and fruit tissues in three biological replicates from P. guajava, cultivar Allahabad Safeda. The hierarchical cluster analysis shows that the TPS genes were more abundant in the leaf tissue and less in the fruit tissue. Three TPS genes (Pg54488, Pg46825, and Pg03078) were highly expressed in fruit. Two of these genes are members of the TPS-a subfamily and one of the TPS-e subfamily. The more expressed genes in flower tissue were Pg16954, Pg10950, and Pg54488, from subfamilies TPS-g, TPS-b1, and TPS-a, respectively. Although the expression of many TPS genes in leaf tissues, the relative abundance is highlighted only for the Pg23050 gene (putative betacaryophyllene synthase) from TPS-a subfamily.
The relative transcript abundance analysis of tissues from cultivars Paluma and Cortibel RM, also showed a coherent grouping of biological replicates for each tissue (Fig. 2B). The highest rates of relative abundance were observed in the mature leaf tissues, both in Paluma and Cortibel RM. Besides, it is possible to identify clusters of TPS genes abundant in the root tissues. In contrast, younger floral buds, immature leaves, and young leaf tissues have a greater variety of expressed TPS genes.
Cluster I in Fig. 2B was characterized by genes expressed mainly in immature leaf tissue from Cortibel RM genotype, as observed for the Pg56994, Pg23050, Pg56991 genes, all from the TPS-a subfamily. In contrast, this cluster includes less abundant genes, mainly mature leaf and root tissue, both for the Paluma and Cortibel RM genotypes. Cluster II (Pg25655, Pg25656, and Pg13455) has a significant abundance in young leaves. The Pg25655, Pg25656 genes are from the subfamily TPS-a. The genes Pg13455 and Pg13454 cluster close to RtTPS1 (Rhodomyrtus tomentosa), in the subfamily TPS-b1 and code for putative pinene synthase.
The genes that form cluster III have a more dispersed expression profile in the tissues, but there is an evident abundance in the green tissues from the Paluma genotype. The genes in this cluster are formed from TPS-a, TPS-g, and TPS-b1 subfamilies. Among these genes are a TPS-g subfamily (Pg16967), a putative (3S, 6E)-nerolidol synthase/(3S)-linalool synthase, and the TPS-a subfamily (Pg55462, Pg00544) which code for putative beta-caryophyllene synthase. The major cluster (IV) is expressed primarily on root tissue from both genotypes, and it is richly formed of genes from different families such as TPS-a, TPS-b2, TPS-c, and TPS-g, but predominantly from the subfamily TPS-b1. Interestingly, the research identified genes in the branch of cineol synthases, emphasizing the genes Pg44627, and Pg55752 (TPS-b1).
Clusters V and VI predominantly consist of genes from the TPS-a subfamily highlighted by the high abundance of transcripts Pg06308, Pg55458, Pg58130, and Pg15233, all putative betacaryophynele synthases. The genes belonging to group VI are predominantly expressed in mature leaf tissue of Cortibel RM, whereas those from group V exhibit higher expression levels in young chlorophyll-containing tissues. Clusters IV and V also have the only members of the expressed TPS-e/f subfamily, Pg03078, a putative linalool synthase, and the TPS-c subfamily Pg41789 encodes a copalyl diphosphate synthase.
Of the total of 86 TPS genes annotated, the expression of 50 (58%) TPS genes was confirmed (Fig. 2C). Among these, 33 (38%) expressed TPS genes containing both domains (PF01397 and PF03936). The RNA-seq experiment of cv. Allahabad Safeda, confirmed the expression of 21 TPS genes. Among these genes, 14 (16%) are complete, and two were expressed exclusively in this genotype (Pg54796-b1 and Pg54481-a). In general, for all cultivars analyzed, there is a greater abundance of genes expressed from the TPS-a subfamily, corresponding to 52% of the total TPS genes expressed in the cv. Allahabad Safeda, and 56% in the Paluma and Cortibel cultivars (Fig. 2C).
Variations in TPS genes of Brazilian guava germplasm and relationship with leaf terpenic composition
The study analyzed genetic variation within the TPS genes by aligning single nucleotide polymorphisms (SNPs), obtained from genotyping by sequencing using DArTseq, against the guava genome. The study mainly aimed to identify genetic markers for assessing genetic diversity within a comprehensive set of 62 guava genotypes, including Paluma and Cortibel RM varieties, all of which are maintained within a germplasm collection (Fig. 3A). Additionally, this analysis investigated the alternative splicing variants of the TPS genes.
Twelve TPS genes showed SNPs, of which ten were in full-length (five TPSa—Pg00544, Pg00545, Pg23613, Pg25655, and Pg55458; three TPSg—Pg16967, Pg16969, Pg16979; two TPSb Pg54796, Pg10950), and two were partial genes (Pg15480 and Pg42505). Nine of these genes had their expressions detected by RNAseq analysis. The SNPs occurred in the genes’ exons, introns, 3’UTR, and 5’UTR boundaries. Most of them are located in exons (Table S5).
Exploring the SNP positions within the exons, where the triplet of base pairs was responsible for encoding amino acids, data pointed out that the polymorphism occurred in the triplet of Pg15480.1, where the change corresponded to the same amino acid (leucine), indicating a synonymous mutation of the transversion type, as seen in Pg16967.1_2 and Pg16979.1_2 as well. However, the alternative transcripts Pg16967.1_3 and Pg16979.1_3 displayed a non-synonymous transversion mutation, substituting alanine for aspartic acid. Pg16969.1_2 displayed this type of mutation involving the substitution of threonine with asparagine (Table S5). The most variable SNP occurred in the intron portion of the gene Pg10950-b1.
The clustering of the 62 genotypes (Fig. 3A) based on the data of SNPs in the TPS genes reveals two most significant groups, representing the two chemotypes detected in Brazilian cultivars, as exemplified by Paluma and Cortibel RM genotypes7. The sesquiterpene prevalence in the essential oils of the mature leaf was also observed in Psidium guajava genotypes. Notably, the major compounds identified in the Paluma cultivar included caryophyllene, selinene, cineol, and aromadendrene types. In contrast, the essential oil profile of Cortibel RM was characterized by significant quantities of humulene, trans-nerolidol, D-limonene, bisabolol, α-humulene, and bisabolene (Fig. 3B).
Discussion
Psidium guajava, a native species of America, exhibits remarkable adaptability5, broad distribution, and economic, nutritional, and medicinal importance65. The essential oils in mature leaves of these plant species show a higher diversity of terpenic compounds, with profiles based on the chemical skeleton34.
In the Myrtaceae family, species with dry fruits have the largest numbers of TPS in plants23,24,45,46, evidenced by the expansion of the TPS gene and an expansive evolutionary divergence, resulting in often lineage-specific pathways and products29,66,67. The contraction of the TPS family in guava, to a certain extent, reflects the loss of redundant genes after whole-genome duplication since the numbers of TPS in species of Melaleuca alternifolia and Psidium cattleyanum also displayed a slight contraction compared to those found in the eucalyptus lineages24,45.
The analyses indicate broad conservation in gene numbers and subfamily representation in the TPS gene family in Psidium spp. The red and yellow morphotypes of P. cattleyanum share 28–30% orthologous pairs (9 out of 30 and 32; Table S6) and 39% of orthologous pairs with guava (17 out of 43 genes). A comparison of the two eucalypt species (31 out of 52 genes are found in orthologous pairs for E. grandis and 31 out of 45 for E. globulus) with 59%-68% of genes found in orthologous pairs23 shows more conserved genes as the eucalyptus species evolved over a relatively shorter time, approximately 12 million years23 than Psidium species, which separated about 26 million years of evolution68,69. The observed similarities in TPS genes between E. grandis and E. globulus may be attributed to significant gene family evolution prior to their divergence, and similar mechanisms could potentially operate in Psidium species.
The phylogenetic analysis revealed in TPS-a subfamily, five guava genes (Pg23050, Pg46818, Pg54496, Pg15233, Pg57413) closely related to RtTPS3 (AXY92168) from Rhodomyrtus tomentosa and EgranTPS038 (Euc_Eucgr_J01451) from E. grandis, both belong to a branch of the betacaryophylene synthase, a sesquiterpene70. It has been observed that the genes Pg23050 and Pg15233 (full-length) have demonstrated expression within the transcriptome, mainly in juvenile leaf tissues. Consequently, these genes have emerged as potential candidates for functional studies. Two guava genes (Pg00545 and Pg00544) were found in the same branch as RtTPS4 (AXY92169), also putative synthesizing a beta-caryophyllene (Fig. 1A). However, only the gene Pg00544 exhibited expression in the transcriptome, in both mature and immature leaves of the Paluma variety. Although Paluma exhibits higher levels of beta-caryophyllene compared to Cortibel RM and the Allahabad safeda cultivar, the final oil profile may result from the combined expression of diverse genes, contributing in varying amounts to the overall concentration found in the oil.
The prevalence of sesquiterpenes in the essential oils of mature guava leaves was observed in different genotypes. The Paluma cultivar exhibited compounds such as caryophyllene, selinene, cineol, and aromadendrene as major constituents. At the same time, humulene, trans-nerolidol, d-limonene, bisabolol, α-humulene, and bisabolane were the predominant compounds in Cortibel RM7,26. The terpenes (E)-b-ocimene, caryophyllene oxide, (E)-caryophyllene, epi‑a-Muurolol, and epiglobulol were the dominant constituents among the 54 recorded in Allahabad safeda cultivar72.
The gene expression across all investigated cultivars revealed a predominant representation of the TPS-a subfamily, constituting 52% of the total expressed TPS genes in the Allahabad Safeda cultivar and 56% in both the Paluma and Cortibel cultivars. This observed expression pattern significantly contributes to the overall composition of essential oil products. Additionally, distinct transcripts were identified in the examined tissues, underscoring the functional adaptability of TPS-a genes. The functional significance of these findings is underscored by the role of sesquiterpenes, identified as crucial signaling molecules in various plant–insect and plant-pathogen interactions71. These sesquiterpenes play a pivotal role in attracting pollinators and defending against insect herbivores. Notably, the prevalence of putative TPS-a suggests that sesquiterpenes may have played a key role in driving adaptive traits in P. guajava. This is particularly pertinent to the plant's survival and proliferation in the challenging environments of wet forests across the neotropics.
The TPS-b subfamily grouped into two clades. The TPS-b1 clade contains putative cyclic monoterpene synthases. In general, the most observed monoterpenes are cineol, limonene, pinene, and linalool. This study identified 12 genes associated with the expression of monoterpenes in Paluma and Cortibel. Among these genes, some are in the same branch of functionally characterized genes associated with pinene synthase production (Pg13454 and Pg13455), and cineole synthase production (Pg52150, Pg44627, Pg55752), demonstrating their significance as potential candidates for functional studies related to this chemical marker production within the Myrtaceae family73.
In maize, the production of β-caryophyllene in the root interacts with the attraction of the root-knot nematode74,75. In the context of guava cultivation in Brazil, the root-knot nematode, Meloidogyne enterolobbi stands out as a major pest76, causing galls and rot, thereby compromising the root system, restricting fruit production and quality, and ultimately leading to plant mortality77. Therefore, the study of the TPS-b1 subfamily genes in guava root tissues, especially Pg55752, Pg07990, Pg56076, by their potential involvement in caryophyllene production in root, is fundamental for a deeper understanding of the genetic mechanisms underlying defense responses against herbivores, such as the root-knot nematode, to develop targeted strategies to mitigate the damage caused by this pest in guava cultivation, preserving fruit yield and plant health.
The TPS-b2 subfamily contains putative isoprene/ocimene (C5, C10) synthases (Fig. 1A). Despite ample evidence of the isoprene synthase acyclic genes’ emergence before the emergence of Myrtaceae, no sequences were found in Psidium species. This observation suggests that some Myrtaceae could have lost the isoprene/ocimene type of Tps-b2 gene, which arose relatively recently, likely through either whole-genome or localized duplication. A similar loss followed by radiation is apparent in Arabidopsis thaliana genes. Isoprene, the smallest terpenoid compound, significantly influences Earth’s atmosphere, enhancing aerosol and ozone formation. In plants, it provides vital functions: heat protection, ozone tolerance, and defense against reactive oxygen species78. However, the high biosynthetic cost (at least 20 ATP and 14 NADPH79), highlights the ecological significance of isoprene emissions, particularly in high-emitting species like Eucalyptus. Psidium may not have experienced the same biotic and abiotic adaptive pressures to expand its TPS-b2 subfamily as the eucalypts since they diverged from their most recent common ancestor more than 70 million years ago80,81.
Hierarchical cluster analysis showed that TPS genes are more expressed in leaf tissues and less abundant in fruit tissues. Gene expression varied between cultivars, with specific genes from the TPS-a subfamily showing great expression in immature leaf tissues of the Cortibel RM cultivar and an evident abundance in the green tissues from the Paluma genotype with TPS-b1 and TPS-g subfamily. These results demonstrate the functional plasticity of TPS genes in different genotypes and developmental stages, and suggest their significant contribution to the chemical diversity of essential oil compounds in P. guajava.
The analyses identified only one copy of the full-length diterpene synthase genes (Pg41789 TPS-c and Pg03078-e/f) in P. guajava, originating as a synthase-producing gibberellin precursor (regulatory plant hormone)29. The other species from Myrtacaea present more than one copy gene to the TPSe/f subfamily (Supplementary Table S5).
Alternative splicing in the TPS genes was one mechanism of terpenic diversification shown in this work. The mechanism included 24 forms for 17 full-length genes, mostly from TPS-a followed by TPS-b1 subfamilies. The alternative transcripts indicated that the variation in emission might be regulated at the post-transcriptional level, as previously suggested for TPS from the Myrtaceae family82. The loss of function in specific terpene synthases due to altered splicing may be one of the causes of intra-specific variability observed in Myrtaceae. The functional significance of this alternative splicing still needs to be directly tested.
The study also focused on investigating variations in TPS genes within Brazilian guava germplasm and their relationship with the terpenic composition of leaves. To analyze the genetic variations, single nucleotide polymorphisms (SNPs) in TPS genes were examined through genotyping by sequencing, including Paluma and Cortibel RM. Twelve TPS genes exhibited SNPs in the population of 62 individuals, with nine showing expressions detected by RNAseq analysis. The hierarchical clustering of genotypes based on these SNPs in TPS genes showed clusters with genotypes representatively of different essential oil chemotypes identified by7 as Paluma, Cortibel RM, Cortibel XIII, and Roxa genotypes.
The genomic location of a Single Nucleotide Polymorphism (SNP) can hold functional significance for an individual. An SNP within the coding region can impact protein formation, representing a non-synonymous mutation wherein the base substitution alters an amino acid in the polypeptide chain83. Such a mutation was identified in the genes Pg16967, Pg16969, and Pg16979 (Table 1), responsible for (3S,6E)-nerolidol synthase. This mutation may exert influence by suppressing or favoring enzymatic activity84,85.
The high number of genes, the diversification of subfamily TPS-a, and the more abundance expression in the leaf tissues is consistent with the amount and variability of the terpenic compounds related in the leaf essential oils of P. guajava, predominantly of sesquiterpenes7,26. As examples of the Paluma e Cortibel RM cultivars studied in this work were related to 42 terpenic compounds, with more than 80% sesquiterpenes, with 17 exclusives of Cortibel RM and three exclusives of Paluma (Fig. 3B), showing genotyping specific profiles of the two chemotypes.
The most common and abundant foliar terpene pattern across Myrtaceae family is α-pinene and 1,8-cineole. We detected its putative genes in guava Pg16970, Pg16987, Pg13454, Pg13455 and Pg55752, Pg44627, Pg52150, Pg44441, respectively. In other species of the Myrtaceae family, such as those belonging to the tribes Myrteae, Kanieae, Syzygieae, Xanthostemoneae, Syncarpieae, and Lindsayomyrteae, these monoterpene are primarily replaced by β-caryophyllene (a sesquiterpene) as the most abundant terpene24,73. Caryophyllene has been identified as one of the major components in guava leaf essential oil across different countries8,34,86,87,88, with only a few monoterpenes present in the essential oil composition34,89. However, within the Psidium genus, for native Psidium cattleyanum species from Brazil, 1,8-cineole was the major component found, and for cultivated varieties across the globe, the β-caryophyllene and caryophyllene predominates34,73,90. In our study we also detected Pg55458, Pg06308 Pg55462, Pg58130, Pg15233, Pg54496, Pg46818, Pg23050, Pg57413 putative genes to the biosynthesis of β-caryophyllene.
Conclusions
This study constitutes an extensive examination of TPS genes in guava, involving the thorough identification and analysis of their underlying structural elements. A particular focus is directed towards elucidating the variations within these genes across key cultivars of significance in Brazil. Furthermore, we delve into the evolutionary relationships by examining their connections with other Myrtaceae species. In a culmination of our efforts, we present the expression profiles of TPS across diverse tissues and cultivars, thereby contributing valuable insights into the functional dynamics of these genes in the context of guava.
In summary, the relatively low number of TPS genes between P. guajava and P. cattleyanum reflects their close phylogenetic relationship. Additional investigations focused on the functional characterization of TPS genes and their regulatory mechanisms shall contribute to a deeper understanding of terpene biosynthesis in guava. During the continuous evolution of guava, copies of some genes were retained, while some losses occurred in other TPS genes.
Analyzing single nucleotide variations in TPS genes in the Brazilian guava cultivars, including alternative splicing forms, reveals significant diversity. This suggests potential implications for gene functionality and the terpenes biosynthesis. The SNPs in functional genes, particularly exonic non-synonymous mutations, could serve as valuable molecular markers for functional gene mapping and genetic improvement of guava for terpene biosynthesis.
The availability of TPS annotation can be valuable for future breeding programs and the selection of guava cultivars with desired terpenic compositions for specific applications in the food, fragrance, and pharmaceutical industries. This research will further enhance our comprehension of the genetic foundations that underlie the observed diversity and adaptability in this economically significant plant species.
Data availability
The sequences used for alignment are presented in the Supplementary Material, available at Scientific Reports’s website. The genome sequence data are available in the NCBI Short Read Archive under BioProject No. PRJNA631442. The transcriptomic data can be obtained through the specified accessions numbers, SRR7186630 and PRJNA1020439. The authors declare that the data supporting the findings are available within the paper.
References
Arévalo-Marín, E. et al. The taming of Psidium guajava: Natural and cultural history of a neotropical fruit. Front. Plant Sci. 12, 1–15 (2021).
Landrum, L. R. The genus Psidium (Myrtaceae) in the state of Bahia, Brazil. Canotia 13, 1–101 (2017).
Landrum, L. R. Psidium guajava L.: Taxonomy, relatives and possible origin. In Guava: Botany, Production and Uses. 1–21 (2021).
Urquía, D. et al. Psidium guajava in the Galapagos Islands: Population genetics and history of an invasive species. PLoS ONE 14, 1–21 (2019).
Otuoma, J., Nyongesah, J. M., Owino, J., Onyango, A. A. & Okello, V. S. Ecological manipulation of Psidium guajava to facilitate secondary forest succession in tropical forests. J. Ecol. Eng. 21, 210–221 (2020).
Gavhane, A., Chopade, S., Dighe, P. & Kour, A. The nutritional and bioactive potential of guava and possibilities for its commercial application in value-added products. Pharma Innov. J. 11, 2643–2647 (2022).
De Souza, T. D. S. et al. Chemotype diversity of Psidium guajava L.. Phytochemistry 153, 129–137 (2018).
Khadhri, A., El Mokni, R., Almeida, C., Nogueira, J. M. F. & Araújo, M. E. M. Chemical composition of essential oil of Psidium guajava L. growing in Tunisia. Ind. Crops Prod. 52, 29–31 (2014).
Flores, G., Wu, S. B., Negrin, A. & Kennelly, E. J. Chemical composition and antioxidant activity of seven cultivars of guava (Psidium guajava) fruits. Food Chem. 170, 327–335 (2015).
Biswas, S. et al. Evaluation of neurobehavioral activities of ethanolic extract of Psidium guajava Linn leaves in mice model. Future J. Pharm. Sci. 7, 133 (2021).
Vasconcelos, A. G., das Amorim, A., Dos Santos, R. C., Souza, J. M. T., de Souza, L. K. M., de Araújo, T., & de Leite, J. R. Lycopene rich extract from red guava (Psidium guajava L.) displays anti-inflammatory and antioxidant profile by reducing suggestive hallmarks of acute inflammatory response in mice. Food Res. Int. 99, 959–968 (2017).
Silva, E. A. J. et al. Antibacterial and antiproliferative activities of the fresh leaf essential oil of Psidium guajava L. (Myrtaceae). Braz. J. Biol. 79, 697–702 (2019).
Zhu, X. et al. Anti-hyperglycemic and liver protective effects of flavonoids from Psidium guajava L. (guava) leaf in diabetic mice. Food Biosci. 35, 100574 (2020).
Taha, T. F., Elakkad, H. A., Gendy, A. S. H., Abdelkader, M. A. I. & El Hussein, S. S. In vitro bio-medical studies on Psidium guajava leaves. Plant Arch. 19, 199–207 (2019).
Vijaya Anand, A., Velayuthaprabhu, S., Rengarajan, R. L., Sampathkumar, P. & Radhakrishnan, R. Bioactive compounds of guava (Psidium guajava L.). In BT-Bioactive Compounds in Underutilized Fruits and Nuts (eds. Murthy, H. N. & Bapat, V. A.). 503–527 (Springer, 2020).
Oncho, D. A., Ejigu, M. C. & Urgessa, O. E. Phytochemical constituent and antimicrobial properties of guava extracts of east Hararghe of Oromia, Ethiopia. Clin. Phytosci. 7, 37 (2021).
Suni, T. et al. Formation and characteristics of ions and charged aerosol particles in a native Australian eucalypt forest. Atmos. Chem. Phys. 8, 129–139 (2008).
Vickers, C. E., Gershenzon, J., Lerdau, M. T. & Loreto, F. A unified mechanism of action for volatile isoprenoids in plant abiotic stress. Nat. Chem. Biol. 5, 283–291 (2009).
Unsicker, S. B., Kunert, G. & Gershenzon, J. Protective perfumes: The role of vegetative volatiles in plant defense against herbivores. Curr. Opin. Plant Biol. 12, 479–485 (2009).
Cseke, L. J., Kaufman, P. B. & Kirakosyan, A. The biology of essential oils in the pollination of flowers. Nat. Prod. Commun. 2, 1317–1336 (2021).
Cordeiro, G. D. et al. Nocturnal floral scent profiles of Myrtaceae fruit crops. Phytochemistry 162, 193–198 (2019).
Padovan, A., Keszei, A., Wallis, I. R. & Foley, W. J. Mosaic eucalypt trees suggest genetic control at a point that influences several metabolic pathways. J. Chem. Ecol. 38, 914–923 (2012).
Külheim, C. et al. The Eucalyptus terpene synthase gene family. BMC Genomics 16, 450 (2015).
Canal, D., Escudero, F. L. G., Mendes, L. A., da Silva Ferreira, M. F. & Turchetto-Zolet, A. C. Genome-wide identification, expression profile and evolutionary relationships of TPS genes in the neotropical fruit tree species Psidium cattleyanum. Sci. Rep. 13, 1–16 (2023).
Silva, M. A. et al. Genomic and epigenomic variation in Psidium species and their outcome under the yield and composition of essential oils. Sci. Rep. 13, 1–10 (2023).
Mendes, L. A. et al. Spring alterations in the chromatographic profile of leaf essential oils of improved guava genotypes in Brazil. Sci. Hortic. (Amsterdam) 238, 295–302 (2018).
Hassan, E. M. et al. Comparative chemical profiles of the essential oils from different varieties of Psidium guajava L.. Molecules 26, 1–11 (2020).
de Souza, T. D. S. et al. Essential oil of Psidium guajava: Influence of genotypes and environment. Sci. Hortic. (Amsterdam) 216, 38–44 (2017).
Chen, F., Tholl, D., Bohlmann, J. & Pichersky, E. The family of terpene synthases in plants: A mid-size family of genes for specialized metabolism that is highly diversified throughout the kingdom. Plant J. 66, 212–229 (2011).
Han, X. et al. Crystal structures of ligand-bound octaprenyl pyrophosphate synthase from Escherichia coli reveal the catalytic and chain-length determining mechanisms. Proteins Struct. Funct. Bioinform. 83, 37–45 (2015).
Calvert, J., Baten, A., Butler, J., Barkla, B. & Shepherd, M. Terpene synthase genes in Melaleuca alternifolia: Comparative analysis of lineage-specific subfamily variation within Myrtaceae. Plant Syst. Evol. 304, 111–121 (2018).
Thrimawithana, A. H. et al. A whole genome assembly of Leptospermum scoparium (Myrtaceae) for mānuka research. N. Z. J. Crop Hortic. Sci. 47, 233–260 (2019).
Myburg, A. A. et al. The genome of Eucalyptus grandis. Nature 510, 356–362 (2014).
Silva, R. C. E. et al. Monoterpenes and sesquiterpenes of essential oils from Psidium species and their biological properties. Molecules 26, 4–5 (2021).
Vasconcelos, L. C., de Santos, E. S., Mendes, L. A., da Ferreira, M. F. S. & Praça-Fontes, M. M. Chemical composition, phytotoxicity and cytogenotoxicity of essential oil from leaves of Psidium guajava L. cultivars. Res. Soc. Dev. 10, e6110917710 (2021).
Kokilananthan, S., Bulugahapitiya, V. P., Manawadu, H. & Gangabadage, C. S. Sesquiterpenes and monoterpenes from different varieties of guava leaf essential oils and their antioxidant potential. Heliyon 8, e12104 (2022).
Mittal, A. et al. RNA-sequencing based gene expression landscape of guava cv. Allahabad Safeda and comparative analysis to colored cultivars. BMC Genomics 21, 1–19 (2020).
Carpinetti, P. D. A. et al. Efficient method for isolation of high-quality RNA from Psidium guajava L. tissues. PLoS One 16, 1–19 (2021).
Johnson, L. S., Eddy, S. R. & Portugaly, E. Hidden Markov model speed heuristic and iterative HMM search procedure. BMC Bioinform. 11, 431 (2010).
Durairaj, J. et al. An analysis of characterized plant sesquiterpene synthases. Phytochemistry 158, 157–165 (2019).
Letunic, I., Khedkar, S. & Bork, P. SMART: Recent updates, new developments and status in 2020. Nucleic Acids Res. 49, D458–D460. https://doi.org/10.1093/nar/gkaa937 (2021).
Hu, B. et al. GSDS 2.0: An upgraded gene feature visualization server. Bioinformatics 31, 1296–1297 (2015).
Crooks, G. E., Hon, G., Chandonia, J. M. & Brenner, S. E. WebLogo: A sequence logo generator. Genome Res. 14, 1188–1190 (2004).
Goodstein, D. M. et al. Phytozome: A comparative platform for green plant genomics. Nucleic Acids Res. 40, 1178–1186 (2012).
Voelker, J., Mauleon, R. & Shepherd, M. The terpene synthase genes of Melaleuca alternifolia (tea tree) and comparative gene family analysis among Myrtaceae essential oil crops. Plant Syst. Evol. 309, 1–19 (2023).
Butler, J. B. et al. Annotation of the Corymbia terpene synthase gene family shows broad conservation but dynamic evolution of physical clusters relative to Eucalyptus. Heredity (Edinb). 121, 87–104 (2018).
Katoh, K., Rozewicki, J. & Yamada, K. D. MAFFT online service: Multiple sequence alignment, interactive sequence choice and visualization. Brief Bioinform. 20, 1160–1166 (2019).
Capella-Gutiérrez, S., Silla-Martínez, J. M. & Gabaldón, T. trimAl: A tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics 25, 1972–1973 (2009).
Minh, B. Q. et al. IQ-TREE 2: New models and efficient methods for phylogenetic inference in the genomic era. Mol. Biol. Evol. 37, 1530–1534 (2020).
Darriba, D., Taboada, G. L., Doallo, R. & Posada, D. ProtTest 3: Fast selection of best-fit models of protein evolution. Bioinformatics 27, 1164–1165 (2011).
Schwarz, G. Estimating the dimension of a model. Ann. Stat. 6, 461–464 (1978).
Letunic, I. & Bork, P. Interactive tree of life (iTOL) v5: An online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49, W293–W296 (2021).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: A flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Brown, J., Pirrung, M. & McCue, L. A. FQC Dashboard: Integrates FastQC results into a web-based, interactive, and extensible FASTQ quality control tool. Bioinformatics 33, 3137–3139 (2017).
Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nat. Biotechnol. 29, 644–652 (2011).
Kim, D., Langmead, B. & Salzberg, S. L. HISAT: A fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015).
Liao, Y., Smyth, G. K. & Shi, W. FeatureCounts: An efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014).
Anders, S. et al. Count-based differential expression analysis of RNA sequencing data using R and Bioconductor. Nat. Protoc. 8, 1765–1786 (2013).
Zhu, A., Ibrahim, J. G. & Love, M. I. Heavy-tailed prior distributions for sequence count data: Removing the noise and preserving large differences. Bioinformatics 35, 2084–2092 (2019).
Kilian, A. et al. Diversity arrays technology: A generic genome profiling technology on open platforms. Methods Mol. Biol. 888, 3–12 (2012).
Sansaloni, C. et al. Diversity arrays technology (DArT) and next-generation sequencing combined: genome-wide, high throughput, highly informative genotyping for molecular breeding of Eucalyptus. BMC Proc. 5, 127 (2011).
Sansaloni, C. et al. Diversity analysis of 80,000 wheat accessions reveals consequences and opportunities of selection footprints. Nat. Commun. 11, 1–12 (2020).
R Core Team. R: A Language and Environment for Statistical Computing. https://www.R-project.org/ (R Foundation for Statistical Computing, 2023).
Minh, B. Q., Dang, C. C., Vinh, L. S. & Lanfear, R. QMaker: Fast and accurate method to estimate empirical models of protein evolution. Syst. Biol. 70, 1046–1060 (2021).
Irshad, Z., Hanif, M. A., Ayub, M. A., Jilani, M. I. & Tavallali, V. Chapter 26 - Guava in Medicinal Plants of South Asia. In Medicinal Plants of South Asia: Novel Sources for Drug Discovery (eds Hanif, M. A. et al.) 341–354 (Elsevier, 2020).
Tholl, D. Biosynthesis and biological functions of terpenoids in plants. Adv. Biochem. Eng. Biotechnol. 148, 63–106 (2015).
Zerbe, P. & Bohlmann, J. Plant diterpene synthases: Exploring modularity and metabolic diversity for bioengineering. Trends Biotechnol. 33, 419–428 (2015).
Murillo-A, J. C., Stuessy, T. F. & Ruiz, E. Explaining disjunct distributions in the flora of southern South America: Evolutionary history and biogeography of Myrceugenia (Myrtaceae). J. Biogeogr. 43, 979–990 (2016).
Dupuis, J. R. et al. Targeted amplicon sequencing of 40 nuclear genes supports a single introduction and rapid radiation of Hawaiian Metrosideros (Myrtaceae). Plant Syst. Evol. 305, 961–974 (2019).
He, S. M. et al. novo transcriptome characterization of Rhodomyrtus tomentosa leaves and identification of genes involved in α/β-pinene and β-caryophyllene biosynthesis. Front. Plant Sci. 9, 1231 (2018).
Bosman, R. N. & Lashbrooke, J. G. Grapevine mono- and sesquiterpenes: Genetics, metabolism, and ecophysiology. Front. Plant Sci. 14, 113 (2023).
Raj, M. S. A. et al. A comparative analysis of leaf essential oil profile, in vitro biological properties and in silico studies of four Indian guava (Psidium guajava L.) cultivars, a promising source of functional food. S. Afr. J. Bot. 153, 357–369 (2023).
Padovan, A., Keszei, A., Külheim, C. & Foley, W. J. The evolution of foliar terpene diversity in Myrtaceae. Phytochem. Rev. 13, 695–716 (2014).
Vieira, P. M. H., Arêdes, F. A. S., Ferreira, A. & Ferreira, M. F. S. Comparative analysis of soybean genotype resistance to Heterodera glycines and Meloidogyne species via resistance gene analogs. Genet. Mol. Res 15, 1–13 (2016).
Gfeller, V. et al. Root volatiles in plant–plant interactions I: High root sesquiterpene release is associated with increased germination and growth of plant neighbours. Plant. Cell Environ. 42, 1950–1963 (2019).
Gomes, V. M., Souza, R. M., Mussi-Dias, V., da Silveira, S. F. & Dolinski, C. Guava decline: A complex disease involving Meloidogyne mayaguensis and Fusarium solani. J. Phytopathol. 159, 45–50 (2011).
Khan, M. R., Poornima, K., Somvanshi, V. S. & Walia, R. K. Meloidogyne enterolobii: A threat to guava (Psidium guajava) cultivation. Arch. Phytopathol. Plant Prot. 55, 1961–1997 (2022).
Lantz, A. T., Allman, J., Weraduwage, S. M. & Sharkey, T. D. Isoprene: New insights into the control of emission and mediation of stress tolerance by gene expression. Plant. Cell Environ. 42, 2808–2826 (2019).
Sharkey, T. D. & Yeh, S. Isoprene emission from plants. Annu. Rev. Plant Biol. 52, 407–436 (2001).
Biffin, E. et al. Evolution of exceptional species richness among lineages of fleshy-fruited Myrtaceae. Ann. Bot. 106, 79–93 (2010).
Thornhill, A. H., Ho, S. Y. W., Külheim, C. & Crisp, M. D. Interpreting the modern distribution of Myrtaceae using a dated molecular phylogeny. Mol. Phylogenet. Evol. 93, 29–43 (2015).
Keszei, A. et al. Functional and evolutionary relationships between terpene synthases from Australian Myrtaceae. Phytochemistry 71, 844–852 (2010).
Rodrigues Caetano, A. Marcadores SNP: conceitos básicos, aplicações no manejo e no melhoramento animal e perspectivas para o futuro. Rev. Bras. Zootec. 3598, 64–71 (2009).
Du, F. et al. Volatile composition and classification of Lilium flower aroma types and identification, polymorphisms, and alternative splicing of their monoterpene synthase genes. Hortic. Res. 6, 22 (2019).
Ueki, M. et al. Evaluation of the functional effects of genetic variants-missense and nonsense SNPs, indels and copy number variations-in the gene encoding human deoxyribonuclease I potentially implicated in autoimmunity. Sci. Rep. 9, 1–11 (2019).
de Souza, W. F. C. et al. Exploiting the chemical composition of essential oils from Psidium cattleianum and Psidium guajava and its antimicrobial and antioxidant properties. J. Food Sci. 86, 4637–4649 (2021).
Soliman, F. M., Fathy, M. M., Salama, M. M. & Saber, F. R. Comparative study of the volatile oil content and antimicrobial activity of Psidium guajava L. and Psidium cattleianum Sabine leaves. Bull. Fac. Pharm. Cairo Univ. 54, 219–225 (2016).
El-Ahmady, S. H., Ashour, M. L. & Wink, M. Chemical composition and anti-inflammatory activity of the essential oils of Psidium guajava fruits and leaves. J. Essent. Oil Res. 25, 475–481 (2013).
da Silva, J. D. et al. Essential oils of the leaves and stems of four Psidium spp.. Flavour Fragr. J. 18, 240–243 (2003).
Macedo, J. G. F. et al. Therapeutic indications, chemical composition and biological activity of native Brazilian species from Psidium genus (Myrtaceae): A review. J. Ethnopharmacol. 278, 22 (2021).
Funding
This work was supported by Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq, Brasília-DF, Brazil; grants 443801/2014-2 and 308828/2015-1), Fundação de Amparo à Pesquisa do Espírito Santo (FAPES/VALE, Vitória-ES, Brazil; grant 75516586/16) and VALE. This study was financed in part by the Coordenação de Aperfeiçoamento de Pessoal de Nível Superior-Brazil (CAPES)-Finance Code 001.
Author information
Authors and Affiliations
Contributions
D.C.: Investigation, writing-original draft, Writing-review & editing. P.H.D.S.: Investigation, Writing-review & editing and Statistical Analysis. P.A.C.O.: Investigation. M.A.S.: Investigation and Writing-review & editing. M.F.: Statistical analysis. O.J.B.B.: Investigation. A.F.: Supervision and statistical analysis. M.F.S.F.: Supervision, conceptualization, writing-original draft, writing-review & editing. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Canal, D., dos Santos, P.H.D., de Avelar Carpinetti, P. et al. Exploring the versatility of sesquiterpene biosynthesis in guava plants: a comparative genome-wide analysis of two cultivars. Sci Rep 14, 574 (2024). https://doi.org/10.1038/s41598-023-51007-1
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-51007-1
- Springer Nature Limited