Introduction

Tulip (Tulipa L.) is widely known throughout the world market and has great commercial significance. It has been cultivated for cut flowers as a garden, potted, and landscaping plant. Until now, more than 8,000 tulip cultivars have been bred with a variety of traits regarding flower shapes, colors, lifespan, plant architecture, and resistance to abiotic stress (Van Scheepen 1996; Marasek-Ciolakowska et al. 2012). A majority of the cultivars, which number 1100 and are distributed in the commercial market, belong to Tulipa gesneriana L. and the Darwin hybrid group (Van Tuyl and Van Creil 2007). The latter is a large family of tulips that was bred through interspecific crosses between T. generiana and T. fosteriana and has remarkable resistance.

Flower longevity is an essential horticultural trait in tulips, which is determined based on petal senescence or abscission. A repetitive diurnal petal movement was observed during the flower opening period, which is considered an indicator of the duration of flower bloom in tulips (van Doorn and van Meeteren 2003; Azad et al. 2008). Phytohormones, transcription factors (TFs), and environmental stimuli are critical regulators of flower opening and senescence (Rogers 2013; Ma et al. 2018; Baharudin and Osman 2023). Abscisic acid (ABA) treatment sped up flower senescence in daylilies (Hemerocallis fulva) (Panavas et al. 1998) and daffodils (Narcissus pseudonarcissus) (Hunter et al. 2004). In Lily (Lilium spp.) and gladiolus (Gladiolus palustris), the ABA content increased following flower senescence (Arrom and Munné-Bosch 2012; Kumar et al. 2014). Ethylene plays a dual role in the regulation of flower senescence in tulips (Sexton et al. 2000; Wang et al. 2020). Transcription factors (TFs), in general, are critical regulators of senescence initiation in flowers in hormone-dependent or independent manners. Rosa hybrida MYB transcription factor 108 (RhMYB108) participated in petal senescence in roses induced by ethylene and Jasmonic acid (JA) (Zhang et al. 2019). FOREVER YOUNG FLOWER (FYF) encodes a MADS-box TF that acts as a negative regulator of ethylene responses to repress flower senescence and abscission (Chen et al. 2015). Silencing of Petunia hybrida homeodomain–leucine zipper (PhHD-Zip), which encodes a homeodomain–leucine zipper (HD-Zip) protein, remarkably extended flower longevity in petunias (Chang et al. 2014). Another HD-Zip TF, RhHB1, positively induces petal senescence in roses in an ABA- and ethylene-dependent manner (Lü et al. 2014). PhFBH4,a basic helix–loop–helix (bHLH) TF, regulated ethylene biosynthesis to modulate petal senescence (Yin et al. 2015). Besides, ANTHOCYANIN1 (AN1), another bHLH TF, regulated petal senescence and anthocyanin accumulation in petunias (Prinsi et al. 2016). Phytohormones also play essential roles in flower longevity regulation in tulips. In tulips, ethylene production was enhanced during petal senescence (Wang et al. 2020). An NAM/ATAF/CUC (NAC) TF, TgNAP, elevated salicylic acid (SA) and H2O2 accumulation to promote petal senescence in tulips (Meng et al. 2022). Recently, it was found that tulip TgWRKY75 activates salicylic acid and abscisic acid biosynthesis to synergistically promote petal senescence (Meng et al. 2024). All of these researches indicated that hormones and TFs usually performed their roles synergistically to regulate the lifespan of flowers.

Over the past decade, transcriptomics has been widely used to explore insights into the genetic information of plant species. All Tulipa spp. possess a large-sized genome, and the diploid 2C DNA values varied from 32 to 69 pg in T. clusiana (Clusius tulip) and T. gesneriana, respectively (Zonneveld 2009). In 2012, using 454 pyro-sequencing technology, the first set of transcriptome contigs was reported in tulip spp., which had an average length of 514 bp, and partial transcriptomic data was obtained (Shahin et al. 2012). Molecular markers such as simple sequence repeats (SSRs) and single nucleotide polymorphisms (SNPs) have been developed for genotyping identification in tulips with potential application in molecular breeding (Shahin et al. 2012; Tang et al. 2013). At present, little genomic and transcriptomic information is available in tulips, which limits the progress of molecular breeding research.

Full-length transcriptome sequencing is of great significance as it can overcome the limitations of short-read RNA sequencing without further assembly. This study employed PacBio Isoform-Seq and Illumina RNA-seq technologies to reveal the complexity of the full-length transcriptome of T. gesneriana used as a reference. Here, we report the inaugural release of a full-length reference transcriptome sequence in tulips. Additionally, the transcriptome and phytohormone profiles during flower opening and petal senescence have been comprehensively analyzed. In total, 51 and 49 TFs were identified to be significantly altered during flower opening and petal senescence, respectively. The most enriched pathways in these processes were those related to ABA and auxin, followed by ethylene, cytokinins (CKs), SA, and JA. Furthermore, a HB-Zip TF, TgHB12-like, was functionally characterized as a positive regulator of petal senescence in tulips. This research may contribute to unraveling the characteristics of the T. gesneriana transcriptome and provide a valuable genetic resource for further study in tulips.

Materials and methods

Plant material and growth conditions

A total of 37 tulip cultivars were used in this study, and the detailed information is listed in Table S1. T. gesneriana cultivar ‘Red Power’ was used for PacBio-based Isoform (Iso)-Seq, Illumina RNA-Seq, gene expression, and virus-induced gene silencing (VIGS) assay. The perimeters of all bulbs used in this study were 12–13 cm, which were ordered from the Beijing Sangu Co., Beijing, China. Tulip plants were planted in a greenhouse located on the campus of the Huazhong Agricultural University, Wuhan, China (114°21′E, 30°28′N; altitude 30 m). The growth conditions were maintained at 20°C day/15°C night under a 16 h light/8 h dark photoperiod. The relative humidity was kept at 60%–70%.

Samples collection and RNA preparation

For PacBio Iso-Seq, six tissues, including leaves, flowers, stems, roots, floral buds, and bulb scales of tulip plants, were collected. For Illumina RNA-Seq, flowers at five different developmental stages in triplicates (15 flowers for each stage) were gathered. The flower stages were selected as defined previously (Wang et al. 2020). The total RNA of each tissue was extracted using TRIzol™ reagent (Invitrogen, MA, USA). The RNA integrity number (RIN) of all RNAs is observed to be approximately 7–9. All sequencing experiments were performed at the Biomarker Information Co. Ltd., Beijing, China.

PacBio Iso-Seq, data processing, and correction

For single-molecule real-time (SMRT) sequencing, the RNA samples from all six tissues were pooled in equal amounts for cDNA synthesis using the SMRTer™ PCR cDNA Synthesis Kit (Clontech, CA, USA). Three SMRT libraries of lengths 1–2, 2–3, and 3–6 kb were generated and sequenced using eight SMRT cells on the RS II system (Pacific Biosciences, CA, USA) (Sharon et al. 2013). Full-length (FL) and full-length non-chimeric (FLNC) transcripts were screened with a polyA tail signal and the 5’ and 3’ primers of the cDNA in Regions of Interests (ROIs). The FLNC reads were clustered using the RS_IsoSeq module of SMRT analysis (V2.3.0). The Illumina RNA-Seq data was employed for hybrid error correction implemented with Proovread (v2.13.13) (Hackl et al. 2014). A post-correction accuracy of > 99% of the FL transcripts was obtained and used for further analysis. High-quality Iso-Seq FL transcripts were obtained after removing redundancy (identity > 0.99), filtering the adapter sequences, and low-quality reads (read-length < 50 bp and read-score < 0.75) (Gao et al. 2019). Bowtie2 was used to blast the SMRT transcript sequence reads obtained from each sample (Langmead and Salzberg 2012). Finally, the novel high-quality FL transcript data set for T. gesneriana was acquired. The workflow for SMRT sequencing is shown in Fig. S1.

Identification of SSRs, long non-coding RNAs, and prediction of alternative splice sites

Of the total sequence, 171,242,964 bp long, 76,291 transcripts > 500 bp were employed for SSR identification using MISA (http://pgrc.ipk-gatersleben.de/misa/). Those with more than two perfect SSRs and an interval less than 100 bp were indicated as compound SSRs. The candidate, long no-coding RNA (lncRNA) transcripts, were selected based on length more than 200 nts and exons more than 2, and further screened using the CPC/CNCI/CPAT/Pfam databases (Sun et al. 2013). The Iso-Seq™ data was utilized to run an all-vs-all Basic Local Alignment Search Tool (BLAST). The de novo alternative splice (AS) sites were detected as was previously done in T. gesneriana without the reference genome sequences (Liu et al. 2017).

Coding-domain sequence (CDS) detection and gene functional annotation

TransDecoder (v3.0.0) (https://github.com/TransDecoder/TransDecoder/releases) identified the candidate coding regions within the full-length transcripts (Altschul et al. 1997). Annotation was performed using four databases: Swiss-Prot; Protein family (Pfam); KOG/COG/eggNOG, KEGG, and Gene Ontology (GO); and NCBI nonredundant (NR) database. The TFs were annotated using the Plant TFDB database (http://planttfdb.gao-lab.org) (Jin et al. 2017).

Illumina RNA-Seq, weighted correlation network analysis (WGCNA), and differential expression analysis

Individual tulip cDNA libraries were constructed by the NEBNext® Ultra II™ RNA Library Prep Kit (New England Biolabs, MA, USA) and used for Illumina RNA-Seq. The cDNA libraries were sequenced with the paired-end sequencing method on a HiSeq 4000 sequencing system (Illumina, CA, USA). The raw Illumina-Seq reads were filtered to remove the low-quality reads, adaptor sequences, and ambiguous reads with N > 10%. The clean data were then mapped based on the assembled tulip transcriptome. Gene expression levels in each sample were estimated by RSEM software (http://deweylab.biostat.wisc.edu/rsem/) (Li and Dewey 2011) and quantified as fragments per kilobase of transcript per million (FPKM). WGCNA was used to construct the gene coexpression networks (https://horvath.genetics.ucla.edu/html/CoexpressionNetwork) (Langfelder and Horvath 2008). The differential expression analysis among the flower stages was performed utilizing the DESeq R package (V1.10.1) (Anders and Huber 2010). An false discovery rate (FDR) < 0.05 and log2 |(foldchange)| ≥ 1 was set as the threshold for identifying the significant differential expression. MFUZZ (https://mfuzz.sysbiolab.edu.eu) was used for the clustering of gene expression (Kumar and Futschik 2007).

GO-term and KEGG pathway enrichment analysis

The GO classification was executed using the GOseq R packages (Young et al. 2010). KEGG analysis was performed using differentially expressed genes (DEGs) obtained from each comparison (http://www.genome.jp/kegg/) (Kanehisa et al. 2016). The statistical enrichment of the DEGs identified in the KEGG pathways was accomplished by employing the KOBAS software (http://kobas.cbi.pku.edu.cn/genelist/) (Bu et al. 2021).

Hormone quantification

The phytohormones were comprehensively profiled as previously described (Wang et al. 2017). Briefly, five intact flowers were collected for each flower stage in triplicates totaling 15. Each sample, weighing 200 mg, was ground into a powder, divided into two equal parts, and then transferred into 10 mL centrifuge tubes. One part was prepared for the examination of CKs and acidic phytohormones, while the other one was for that of brassinosteroids (BRs). An LC-30AD HPLC system (Shimadzu, Tokyo, Japan) coupled with an AB SCIEX 4500 Triple Quad™ liquid chromatography-mass spectrometer (MS)/MS (Sciex, Foster City, CA, USA) was used for the quantification of CKs and acidic phytohormones. BRs were estimated according to a previous report (Yu et al. 2016). All quantifications were performed in three biological replicates, and the final concentration of each analog was indicated as the average value ± SE.

VIGS assay

The full-length CDS of F01.PB503 (TgHB12-like) was cloned using the specific primers listed in Table S2. Based on sequence alignment, a fragment with low similarity with other homologs was isolated and sub-cloned into the tobacco rattle virus RNA2 (TRV2) vector. It was then introduced into the Agrobacterium tumefaciens strain GV3101 cells and used for petal disc infiltration. Post-transformation, the expression levels of TgHB12-like and Tulipa gesneriana senescence-related gene6 (TgSAG6) were quantified by RT-qPCR, and electrolyte leakage (EL) was determined. The total anthocyanin content was ascertained as described previously (Yan et al. 2022) calculated by the formula: QAnthocyanins = (A530 – 0.25 × A657)/M; where A530 and A657 were the absorptions at these wavelengths, and M is the weight of the plant material used for extraction.

Phylogenetic analysis and sequence alignment

The proteins encoded by TgHB12-like and its closest orthologs were aligned, and phylogenetic trees were constructed using the MEGA 7.0 software (https://www.megasoftware.net/) with the Maximum Likelihood method based on the bootstrap analysis with 1000 replications.

Transcriptional activation ability assay

The entire length of TgHB12-like was constructed into a pGBKT7 vector. The resulting positive and empty vectors (as negative controls) were employed to co-transform the cells of the yeast strain AH109. The transcriptional activation of these transformed yeast cells was assessed by spotting the transformants on SD/-Trp/-Leu and SD/-Trp/-Leu/-His/-Ade media at 30℃ for three days.

RT-qPCR analysis

Total RNA was extracted using the EASYspin Plus Complex Plant RNA Kit (Vazyme, Nanjing, China). The reverse transcription was performed using 1.0 µg of the total RNA with HiScript® II 1st Strand cDNA Synthesis Kit (Vazyme, Nanjing, China). The primers for RT-qPCR were designed using the web tool GenScript (https://www.genscript.com/ssl-bin/app/primer). The relative transcription levels were calculated using the 2−ΔΔCT method (Livak and Schmittgen 2001). Tulipa gesneriana beta-actin (TgACTIN) was employed as the reference gene. The primer sequences are listed in Table S2.

Statistical analysis

All experiments were repeated at least thrice, and the means ± standard errors (SEs) of the results of these independent experiments are shown. Bars with asterisks indicate statistically significant differences at P < 0.01 calculated using Student’s t-test. The variations among the stages were assessed using Duncan’s multiple range test (DMRT) and indicated by different letters at *P < 0.05.

Data mining and deposition

The raw PacBio Iso-Seq data were deposited to the NCBI Sequence Read Archive (SRA) database with the BioProject number PRJNA703083. The RNA-Seq data of the five tulip flower stages were uploaded with the accession number GSE136183 (https://www.ncbi.nlm.nih.gov/).

Results

Variations of flower lifespan and physiological changes during the flowering period

As a commercially important cut flower, the flowering lifespan is of significance to tulips. The flowering phases among 37 commercial cultivars were compared to find that they varied between 9.7–21.4 days from bud emergence to petal senescence (Fig. 1a), and the flowering lifespan diverged in tulips with diverse genetic backgrounds (Table S1). The morphological changes were obvious during flower opening and senescence of an intact tulip flower. The first stage (S1) was defined with green floral buds, followed by the second stage (S2) with colored and expanded petals (Fig. 1b). The opening of flowers with rhythmic petal movement was defined as the third stage (S3). The early senescence stage (S4) was designated as the one when a flower lost petal movements and the cup shape. The flowers with obvious wilting and color fading were described to belong to the late senescence stage (S5) (Fig. 1b). The flower diameters increased from 1.03 (S1) to 8.34 cm (S5), while the relative water content (RWC) of petals decreased from 90.75% to 81.5% at the senescent stages (Fig. 1c, d). Besides an enhancement of cell membrane permeability at the last stages, the accumulation of anthocyanin contents was enhanced initially from S1 to S3, but then gradually declined in senescent petals (Fig. 1e, f). All these results indicated that flower opening and senescence in tulips are accompanied by morphological and physiological changes.

Fig. 1
figure 1

Variations in flower lifespan and physiological changes during flower opening and senescence in tulips. a The variations in flower longevity among tulip cultivars (n = 30). b The morphological changes in intact flowers from the stages of floral bud to senescence. cf Modifications in flower diameter. Relative water content (RWC) (c), electrolyte leakage (d), and anthocyanin contents (e) during flower opening and petal senescence (f). Letters above the columns indicate statistically significant differences at p < 0.05, analyzed by Tukey’s test. Bar = 1.0 cm. S1, first stage; S2, second stage; S3, third stage; S4, The early senescence stage; S5, the late senescence stage

Full-length transcriptome sequencing and assembly

A comprehensive full-length transcriptomic analysis was performed to explore the mechanisms underlying flower opening and senescence, generating a total of 1,202,336 reads (21.58 Gigabit, Gb) across all three cDNA libraries, of which 9,054,775 subreads were obtained. The mean lengths of the three cDNA libraries were 1,781 bp (1–2 kb); 2,868 (2–3 kb); and 3,450 bp (3–6 kb), respectively (Table S3). Finally, 587,619 reads of insert (ROIs) with an average length of 2,861 bp were obtained (Table S4). More than 41.5% (244,357) of the ROIs were full-length non-chimeric (FLNC) reads (Table S5). A total of 91,890 consensus isoforms with a mean length of 2,219 bp were obtained based on the clustering algorithm of iterative clustering for error correction (ICE). Using the Illumina reads, 71,517 high-quality consensus transcripts with an accuracy > 99% were recognized, along with 20,373 low-quality isoforms, which need to be further corrected (Table S6). Ultimately, 76,388 non-redundant transcripts were screened from all the T. gesneriana high-quality transcripts, with 33,505 SSRs identified from 21,904 transcripts and more than one SSR among the 4,480 (20.45%) unique transcripts. The most abundant SSR type was mono-nucleotide repeats (P1, 42.31%) (Fig. S2a). Furthermore, the density of SSRs in each 1.0 Mb of the sequence varied from 0.6 to 77.4 (Fig. S2b). All SSRs and the corresponding primers are listed in Table S7. Due to a lack of a reference genome, only 871 AS events were recognized (Table S8). Long non-coding RNAs (LncRNAs), i.e., those with length > 200 bases, were further predicted to identify 3,114 unique transcripts designated as candidate lncRNAs matching all four databases (Fig. S2c; Table S9).

Functional annotation of transcripts

Annotation was performed using 76,388 non-redundant transcripts, of which 17,513 belonged to KEGG; 29,157 to COG; 33,703 to GO; 44,813 to KOG; 53,217 to Swiss-Prot; 54,680 to Pfam; 65,943 to eggNOG; and 66,864 to the NR databases, respectively. Finally, 67,350 transcripts were annotated to the eight databases. Among all the 76,388 SMRT unique transcripts, 66,635 were identified to possess significant matches to the non-redundant (Nr) protein database. Among the transcripts of all plants, 11 species with matches > 1.0% were sorted, of which Elaeis gunieensis, Phoenix dactylifera, and Asparagus officinalis obtained 27.37, 21.66, and 8.33% of the matched hits, respectively (Fig. 2a). Besides, 7.29% of the hits were from other plant species (Fig. 2a). The number of open reading frames (ORFs) was 70,644 including 39,632 complete ORFs. The distribution of the lengths of the ORF-encoded amino-acid chains is shown in Fig. 2b. The functional annotation of all unique transcripts is illustrated in Fig. 2c and listed in Dataset S1.

Fig. 2
figure 2

National Center for Biotechnology Information (NCBI) non-redundant (Nr)-based distribution of homologous species and annotation of the candidate protein-coding RNAs. a The percentage of the total homologous gene hits distributed among the species. All of the top five species are monocots. b The distribution of the lengths of the open reading frame (ORF)-encoded amino acid sequences. c The gene ontology (GO)-term-based pathway annotation of all transcripts

WGCNA performance during flower opening and senescence in tulip

Illumina RNA-Seq was performed to explore the transcriptome variations among the five developmental stages during flower opening and petal senescence in tulips. A total of 119.46 GB of clean data was obtained from the 15 samples (three replications for each phase) with Q30 > 88.69% (Table S10). The clean data were mapped based on the assembled tulip transcriptome (Table S11). The FPKM values of each sample are shown in Fig. S3a. The correlation between the flower stages and biological replicate samples was evaluated using the Pearson coefficient values (Fig. S3b). WGCNA was performed to investigate the co-expression networks of genes that were associated with the flower developmental stage (Fig. 3a, b). All the genes were partitioned into 16 co-expression modules (Table S12; Fig. 3c), with sizes ranging remarkably from 58 genes in the light-cyan module to 10,003 in the turquoise module (Fig. 3d). All modules were further normalized into five stages to investigate the alterations with respect to the flower developmental stages. Along with flower opening and senescence, the 6,208 unigenes of the brown module were upregulated, whereas 4,907 unigenes of the yellow module were downregulated (Fig. 3d). Other modules included genes enriched in at least one or two specific developmental stages, such as the red (2987) and black modules which represented evidently up- and down-regulated genes at stage S4, respectively (Fig. 3d).

Fig. 3
figure 3

Co-expression network analyses of the differentially expressed genes (DEGs). a Selection of the soft-thresholding powers for scale-free co-expression network (n = 22). b Weighted correlation network analysis (WGCNA) for cluster dendrograms and module assignments. Genes were clustered based on the dissimilarity measure. Colors in the horizontal bar represent the modules, which correspond to the branch modules with highly interconnected gene groups. The gray modules indicate genes that did not fall into any module and were excluded from the subsequent study. Finally, 16 gene modules were identified by WGCNA. c Eigengene adjacency heatmap indicating the 16 modules of co-expressed genes. d The normalized average co-expression patterns of the 16 gene modules in five stages. Module colors are available online in Table S12

Identification of the DEGs related to flower opening and petal senescence

The number of DEGs identified between any two stages is shown in Fig. 4a (Dataset S2). Venn analysis indicated that 1,633 and 1,702 transcripts changed specifically during flower opening (S1 vs. S2 and S1 vs. S3) and in petal senescence (S3 vs. S4 and S3 vs. S5), respectively, while the alteration of 272 transcripts was common during the whole process of flower opening and petal senescence (Fig. 4b; Dataset S2). Heatmaps indicated the expression patterns of the specifically regulated 1,633 and 1,702 DEGs during flower opening (S1 vs. S3) and petal senescence (S3 vs. S5), respectively (Fig. 4c, d). These were subjected to GO-term and KEGG enrichment to understand further the biological and metabolic pathways associated with the above developmental processes. As shown in Fig. S4, the most enriched GO-terms of the 1,633 transcripts were “photosynthesis” and “fatty acid biosynthesis”, but in the 1,702 transcripts, “hydrolysis” and “DNA catabolic pathways” were dominantly enriched (Fig. S4a, b). Moreover, KEGG analysis indicated that most of the 1,633 DEGs annotated into photosynthesis pathways (cluter_frequency = 9.3%, p < 0.001) and amino acid metabolism pathway (cluter_frequency = 7.97%, p < 0.001) (Fig. 4e). Starch and sucrose metabolism (cluter_frequency = 8.3%, p < 0.001), as well as Valine (Val), Leucine (Leu), and Isoleucine (Ile) degradation (cluter_frequency = 8.3%, p < 0.05), were the most enriched pathways among the 1,702 senescence-related DEGs (Fig. 4f).

Fig. 4
figure 4

Identification of the differentially expressed genes (DEGs) among the five developmental stages of tulip flowers. a The numbers of DEGs detected between any two stages. b Venn map indicates the number of co-regulated and/or specifically regulated DEGs during flower opening and petal senescence. c, d Heat map analysis shows the relative expression levels in fragments per kilobase of transcript per million (FPKM) of the 1633 DEGs identified during the flower opening stages (c) and the 1702 co-regulated genes detected during the senescent stages (d). e Kyoto Encyclopedia of Genes and Genome (KEGG) pathway enrichment of the 1633 DEGs recognized among the first three stages (p < 0.05). f KEGG pathway enrichment of the 1702 DEGs identified among the last three stages (p < 0.05)

Phytohormone profiling and identification of hormone-related DEGs involved in flowering

As phytohormones are vital regulators of flower lifespan, the identification of DEGs related to the hormone pathways was then focused on. In the S1 vs. S2 or S1 vs. S3 comparisons, 114 and 112 DEGs were annotated to the hormone pathways, respectively (Fig. 5a). In the S3 vs. S4 or S3 vs. S5 comparisons, 105 and 188 hormone-related DEGs were characterized, respectively (Fig. 5a; Dataset S3). The most enriched pathways were ABA- and auxin-, followed by ethylene-, CKs-, SA-, and JA-related pathways; the lowest enrichment was in the gibberellins (GAs) and BRs associated pathways (Fig. 5b; Table S13). MapMan pathway analysis revealed the comparative expression patterns of unigenes involved in hormones signalling among the various stages (Fig. 5c). The ABA content gradually enhanced from S1 (11.2 ng/g) to S4 (41.4 ng/g) and then declined at S5 (27.7 ng/g) (Fig. 5d). The highest Indole-3-acetic acid (IAA) content was detected at S1 (8.6 ng/g), which reduced markedly at S2 and S3 that showed similar levels, and gradually decreased from S3 to the lowest level at S5 (2.5 ng/g) (Fig. 5e). The content of JA was drastically down-regulated from 27.7 ng/g (S1) to 0.6 ng/g (S2), but then gradually elevated from S2 to S5 (Fig. 5f). The contents of CKs, GAs, and BRs are shown in Table 1. Among all the analogs, the most accumulated types of CKs and GAs were cis-zeatin riboside (cZR) and GA4, respectively. Moreover, typhasterol (TY) was dominant at the first three stages but was hard to detect at the petal senescent stages (Table 1). These results collectively demonstrated that the accumulation of phytohormones and hormone-related transcripts was significantly modified during flower opening and senescence in tulips.

Fig. 5
figure 5

Identification of the hormone-related differentially expressed genes (DEGs) and quantification of the endogenous hormones. a Venn maps show the co-regulated and/or specifically regulated DEGs during flower opening and petal senescence. b The enrichment and distribution of hormones associated with the DEGs detected among comparisons S1 vs. S2, S1 vs. S3, S3 vs. S4, and S3 vs. S5 from (a). c MapMan pathway analysis of the DEGs associated with hormones. d Abscisic acid (ABA) contents. e Indole-3-acetic acid (IAA) contents. f Jasmonic acid (JA) contents. The data represent the mean ± SE of three independent replicates (n = 10). S1, first stage; S2, second stage; S3, third stage; S4, The early senescence stage; S5, the late senescence stage. SA, salicylic acid; BA, benzylaminopurine; GA, gibberellin acid; BR, brassinosteroids

Table 1 Quantification of cytokinins (CKs), gibberellins (GAs), and brassinolides (BRs) among the five development stages of tulip flowers

Identification and clustering analysis of TFs involved in flowering

In total, 2113 unigenes were annotated as TFs and assigned to 35 families of TFs (Dataset S4). MapMan pathway analysis indicated that the unigenes encoding TFs and those involved in protein modification and degradation pathways exhibited distinct expression patterns (Fig. S5). Among them, 2038 TFs (FKPM > 0 in at least one stage) were classified into six distinct expression clusters among the five developmental stages (Fig. 6a–f). Clusters 1 and 4 represented TFs, which were markedly elevated at the early and late senescent stages (S4 and S5), respectively (Fig. 6a–d). Those of cluster 2 were mainly diminished at S2 to S4 (Fig. 6b). The enriched genes in clusters 3 and 6 represented those that altered at S1 and S2, respectively (Fig. 6c–f). The TF-encoding unigenes in cluster 5 displayed a continuous up-regulation from S3 to S5 (Fig. 6e). The most identified TF family was MYB, followed by Zinc finger-CCCH, Homeobox, and bHLH (Fig. 6g). The numbers of up- and down-regulated TF-related genes varied from 113 in S4 vs. S3 to 223 in S5 vs. S3 (Fig. 6h). Among the 51 TF-related genes which specifically changed from S1 to S3 during flower opening, the most enriched were of bHLH with ten unigenes, accounting for 19.6%. Additionally, 16 NAC domain- and ten homeobox-domain-containing proteins were dominantly changed, which occupied 32.6% and 20.4% of the 49 TF-related DEGs in the petal senescence process of S3 to S5, respectively (Dataset S4).

Fig. 6
figure 6

Identification of the transcription factor (TF)-related differentially expressed genes (DEGs) during flower opening and senescence. af Cluster analysis indicating the relative expression patterns of 2038 TFs. Green or yellow lines correspond to genes with low membership values; purple and blue lines correspond to genes with high membership values. g The numbers of all the identified TFs and families to which they were annotated. h The numbers of the up- and down-regulated differentially expressed TFs in the S1 vs. S2/3 and S3 vs. S4/5 comparisons, respectively. S1, first stage; S2, second stage; S3, third stage; S4, The early senescence stage; S5, the late senescence stage

Identification of the functional role of TgHB12-like in the regulation of petal senescence

Venn map analysis was performed to characterize further the common DEGs identified by hormone-related, TF-related, and WGCNA modules, which were explicitly involved in the regulation of petal senescence (Fig. S6). A unigene, F01.PB503 was specifically identified among the TF- and hormone-related DEGs (Fig. 7a), which was also the only unigene recognized from the brown WGCNA module (Fig. S6b). RT-qPCR revealed that F01.PB503 was up-regulated in S4 and S5 (Fig. 7b). The full-length CDS of F01.PB503 was cloned. The encoded protein contains a conserved homeodomain–leucine zipper (HD-Zipper) domain and had the highest sequence similarities of 54.63% with PdHOX6-like (Phoenix dactylifera) and 55.88% with EgATHB12-like (Elaeis guineensis), respectively (Fig. S7a). On alignment with Arabidopsis homeobox-domain containing proteins, F01.PB503 showed the most similarity with AtHB12 (AT3G61890), which was hence named TgHB12-like (Fig. S7a, b). Moreover, the transcription activation ability of TgHB12-like protein was examined in the yeast strain AH109 (Fig. S7c, d). VIGS was used to characterize the role of TgHB12-like during petal senescence regulation in tulips. These results indicated that the transient suppression of TgHB12-like expression significantly delayed the progress of petal disc senescence (Fig. 7c, d). The values of EL and anthocyanin content, which are physiological indicators of flower senescence, were consistent with the petal disc phenotypes (Fig. 7e, f). Overall, the above results indicated that transcriptomic analysis facilitates the identification of hormones and TFs related to flower opening and senescence in T. gesneriana.

Fig. 7
figure 7

Functional identification of the Tulipa gesneriana homebox (HB) (Homeodomain-leucine zipper) 12-like. a Venn map shows the common and specific differentially expressed genes (DEGs) identified from hormone- and transcription factor (TF)-related transcripts in the senescence stages. b Quantitative real-time PCR (qRT-PCR) confirms the expression of Tulipa gesneriana Homeobox12-like (TgHB12-like). Tulipa gesneriana beta-actin (TgACTIN) was used as an internal control. c Petal disc phenotypes on the 5th day after virus-induced gene silencing (VIGS) assay. The left and right photographs indicate the infiltration with the tobacco rattle virus vector controls (TRV1 and TRV2) and TgHB12-like vector (TRV1 and TRV2-TgHB12-like), respectively. d VIGS assay indicated the relative expression levels of TgHB12-like and Tulipa gesneriana senescence-related gene6 (TgSAG6) genes. e The anthocyanin contents and f ion leakage of petal discs infiltrated with the empty and TgHB12-like vectors on the 5th day of treatment, respectively. The data represent the means ± SEs of three independent replicates (n = 15). S1, first stage; S2, second stage; S3, third stage; S4, The early senescence stage; S5, the late senescence stage

Discussion

The existing Tulipa species built up a vast genome of more than 20 Gb over hundreds of years of cultivation. To date, the complete tulip genome has not been released due to the difficulty in assembling a genome containing a large number of repeats. Nanopore-sequencing technology has been applied to overcome these challenges. A pioneering substantial chromosome-level genome assembly in garlic, ~ 16.24 Gb in total, has been constructed recently (Sun et al. 2020), which provided new clues and possibilities for assembling large genomes. Genome maps offer fundamental information for unraveling the molecular mechanisms involved in trait formation in crop species. Single-molecule long-read sequencing technology facilitates research without the need for native genome information. In the present study, PacBio Iso-Seq of six tissues was performed to obtain a comprehensive profile of the tulip transcriptome (Fig. S1). It consisted of 244,357 full-length transcripts, of which 67,350 were annotated and are of great significance for further research on tulips (Fig. 2; Fig. S2). Molecular marker-assisted selection/breeding is of great importance for improving the breeding efficiency, which depends on the development of molecular markers. In tulips, a genetic map was constructed using SNP and expressed sequence tags-simple sequence repeat (EST-SSR) markers, which identified six putative quantitative trait loci (QTLs) associated with resistance to Fusarium oxysporum (Tang et al. 2015). The genetic diversity and relationships among 280 individuals of 36 wild and cultivated tulip accessions were assessed using 15 highly polymorphic and reproducible genetic microsatellite markers (Pourkhaloee et al. 2018). The prediction of SSRs in this study provides more genetic tools for breeding, identification of germplasm diversity, and the conservation of threatened or rare wild tulip species.

Prolonging the lifespan of flowers is vital for tulip breeding; therefore, it is essential to explore the mechanism underlying flower senescence regulation. This study identified 16 gene co-expression modules that displayed distinct expression patterns during flower opening and senescence (Fig. 3). Of these, turquoise, yellow, and light-cyan modules showed a consistent downward expression pattern, but the blue and brown modules demonstrated an opposite trend from S1 to S5 (Fig. 3d). The 11 other modules were enriched in only one or two specific flower stages, which implied that these candidate genes were involved in the onset of one particular developmental process, i.e., petal expansion or color change. Additionally, 22,551 DEGs were identified between any two stages during flower opening and senescence (Fig. 4). GO analysis indicated that 1,633 DEGs were specifically enriched during flower opening and were mainly annotated into the photosynthesis and fatty acid biosynthesis pathways (Fig. 5; Fig. S4). These results corresponded to the primary morphological changes, including petal color change from green to red and petal size enlargement, which was consistent with previous reports (Müller et al. 2010). In contrast, during the flower senescence stages, the 1,702 specifically enriched DEGs were mainly annotated into hydrolysis and DNA catabolic pathways, which indicated the involvement of a programmed cell death (PCD) process (Fig. 4; Fig. S4). The flowering period typically terminates during petal senescence and PCD; moreover, petal deterioration is accompanied by varying degrees of nutrient remobilization (Verlinden 2003; Rogers 2006; Grosskinsky et al. 2018), which were consistent with the results of this report.

In this study, the phytohormone profile showed dynamic changes in hormone accumulation during flower development. Most analogs of CKs, GAs, and BRs gradually decreased from S1 to S5 (Table 1), which was consistent with various studies indicating that these three hormones may be typical negative regulators of petal senescence (Ma et al. 2018). CKs delayed flower senescence in petunias through rapid repression of ethylene biosynthesis and signal transduction (Trivellini et al. 2015). In the present study, most of the DEGs were enriched in the ABA and auxin pathways (Fig. 4a–c). The ABA contents were the highest in early senescent petals (Fig. 5d), which implied a role of ABA at the point of onset of petal senescence in tulips. However, in lilies, the ABA levels increased in the later stages of flower opening, which implied a species-specific variation in ABA function (Arrom and Munné-Bosch 2012; Cubría-Radío et al. 2017). Interestingly, the trends of change were similar between IAA and JA in the first three stages of tulip flowers, with both showing a remarkably high level at S1 and then declining significantly at S2 and S3 (Fig. 5e, f). The auxin levels substantially increased in wilting petals prior to their abscission in Lilium longiflorum (Lombardi et al. 2015). Nevertheless, the roles of auxin and JA in regulating petal senescence need more defining.

Besides, an integrative analysis was achieved through the identification of WGCNA co-expression modules, TF clusters, and those involved in hormone pathways during flower opening and senescence. Several modules were potentially linked to a specific developmental stage, such as Cluster 1, which is related to flower opening, and Cluster 6, which is related to petal senescence regulation (Fig. 6a–f). Among the six unigenes commonly characterized based on TF- and hormone-related DEGs, two HD-Zipper proteins, F01.PB22197 and F01.PB503 were identified (Fig. 7a; Dataset S4). TgHB12-like was selected not only for its dominantly high expression levels in the senescent stages but also for its common identification by TF-, hormone-, and WGCNA brown-module enrichment. Arabidopsis thaliana Homeobox12 (AtHB12) functioned in ABA sensitivity, root elongation, and leaf expansion (Valdés et al. 2012; Hur et al. 2015). In petunias, silencing of an ABA-induced gene PhHD-Zip reduced the expression of ABA biosynthesis-related genes and delayed flower senescence (Chang et al. 2014). Using the VIGS assay, the positive role of TgHB12-like in petal senescence was identified in tulips (Fig. 7). Hence, the functional identification of TgHB12-like implies the reliability of the network analysis of the transcriptome obtained in this report. Further experimental validation is needed to confirm the functions of other identified candidate genes and gene-network prediction during flower opening and senescence in tulips.

Conclusions

In summary, the first full-length transcriptome specific to T. gesneriana was characterized by 51 TF- and 40 hormone-related unigenes commonly altered during flower opening (S1 to S3). The accumulation of ABA evidently enhanced, unlike those of other hormones, including GAs, JA, IAA, CKs, and BRs. The most enriched KEGG pathways were photosynthesis, fatty acid biosynthesis, and amino acid metabolic pathways in flower opening. However, in reference to petal senescence, most of the common 49 TFs and 36 hormone-related unigenes identified were remarkably up-regulated; the levels of ABA, JA, and CKs were enhanced, unlike those of GAs, BRs, and IAA. The hydrolysis, DNA catabolic, starch, sucrose, and carbon metabolism were the most enriched pathways (Fig. 8). All these results provide an important and valuable genetic basis for further research on the potential mechanism associated with development and adaptation in T. gesneriana. In future studies, the functional analysis of the commonly identified TFs and/or hormone-related DEGs using gene editing needs to be performed to further document the processes underlying flower longevity in tulips.

Fig. 8
figure 8

A summary of the potential regulation of transcription factors, phytohormones, and metabolic pathways during flower opening and senescence in tulips. The dashed lines indicate that the inferences need further investigation. The red and green arrows show an up- or down-regulation. DEGs, differential expressed genes; bHLH, basic helix-loop-helix; MYB, v-Myb avian myeloblastosis viral oncogene homolog; ERF, ethylene responsive factor; GATA, GATA-binding transcription factor; NAC, NAM, ATAF1/2, CUC1/2; BT2, BTB-TAZ DOMAIN PROTEIN; SRM1, Serine/arginine repetitive matrix protein 1; WRKY, W-box binding transcription factor; ABA, abscisic acid; JA, jasmonic acid; Cks, cytokinins; GAs, gibberellins; BRs, brassinosteroids; IAA, indole-3-acetic acid. S1, first stage; S2, second stage; S3, third stage; S4, The early senescence stage; S5, the late senescence stage