Abstract
Long non-coding RNAs (lncRNAs) play critical roles in plant development. However, lncRNAs in Jatropha curcas remain largely unexplored. Here, we identified 1,850 lncRNAs in J. curcas by deep sequencing of developing seeds at three typical stages. About ten percent of these lncRNAs (196) were differentially expressed during seed development. Together with reverse transcription quantitative real-time PCR, the lncRNA expression analyses revealed the stage-specific expression patterns of some novel lncRNAs in J. curcas. The target genes of lncRNAs were annotated for their roles in various biological processes such as gene expression, metabolism, and cell growth. In addition, 10 lncRNAs were identified as the precursors of microRNAs and 26 lncRNAs were predicted to be the targets of Jatropha miRNAs. A total of 31 key lncRNAs were implicated in seed development in the context of cell growth and development, lipid metabolism, and seed maturation. Our work provides the first systematic study of lncRNAs in the developing seeds of J. curcas and facilitates functional research on plant lncRNAs and the regulation of seed development.
Introduction
Jatropha curcas is a perennial tree belonging to the Euphorbiaceae family, and its seed has a high content of oil which can be used as biodiesel1,2. Gene expression profiles have been analyzed in several studies of the developing seeds of Jatropha in order to understand the molecular processes of oil metabolism and seed development3,4,5,6. The whole genome of J. curcas has been sequenced and assembled independently by groups in Japan, China and South Korea, and these efforts lay a solid foundation for further exploration of the non-coding RNAs of J. curcas7,8,9.
Non-coding RNAs, including microRNAs (miRNAs) and long non-coding RNAs (lncRNAs), are functional transcripts that are not translated into proteins but usually function to regulate the expression of other genes. Studies demonstrate that lncRNAs regulate gene expression in plants by DNA methylation and chromatin remodeling, and in some cases, they act as miRNA sponges to enhance the expression of mRNAs targeted by miRNAs10. A large body of evidence indicates that lncRNAs function in both nuclear and cytoplasmic compartments and play essential regulatory roles in plant developmental processes (the development of pollen, fiber and lateral roots; photomorphogenesis), plant reproductive processes (vernalization, flowering time and male sterility), and plant stress responses11,12,13,14,15. The role of lncRNAs in plant seed development has been acknowledged in recent years, as much effort has been made to identify lncRNAs from the seeds of many plants, including maize16,17,18, Brassica napus19, tree peony20, castor bean21, pigeonpea22, Ginkgo biloba23, and rice24. In maize, lncRNAs might play a part in the complex regulation of genetic imprinting during endosperm development17, and lncRNAs probably function in the regulation of lipid metabolism in the developing seeds of Brassica napus and tree peony19,20. Previous work also indicates that lncRNAs affect developmental and metabolic processes by acting as endogenous target mimics that sequester miRNAs18,22. Taken together, these reports support the view that lncRNAs play critical roles in regulating seed development, including that of the endosperm, embryo, and fruits.
Although much effort has recently been made to identify the functions of protein-coding genes and miRNAs in J. curcas25,26,27, lncRNAs have not yet been identified or functionally characterized in this biodiesel tree. Furthermore, lncRNAs are not conserved between plant species, and the potential functions of lncRNAs in plant seeds remain largely unclear, especially in the endosperm of oil seeds. Therefore, it is necessary to discover novel lncRNAs and analyze their functions in the developing seeds of J. curcas.
The present study aims to identify lncRNAs and compare their expression in the developing seeds of Jatropha. Deep sequencing was carried out on seeds at three typical developmental stages, and functional annotation was then performed on the targets of differentially expressed lncRNAs to examine their possible roles in seed development.
Results and discussion
Quality assessment of sequence data and transcripts assembly
To investigate lncRNAs and their expression profiles in the developing seeds of J. curcas, we sequenced nine lncRNA libraries from three stages of seed development (small, middle and large seeds, representing young, intermediate and mature stages, respectively) with three biological replicates each. After trimming the adaptors and filtering out poly-N regions and low-quality reads, read quality was checked based on Q-score, and the percentage of Q30 bases was 96.53% or more (Table 1). A total of 107.39 Gb of clean reads was obtained from the nine samples, with at least 10.28 Gb for each sample. After mapping the clean reads to the Jatropha genome JatCur_1.0 (https://www.ncbi.nlm.nih.gov/assembly/GCF_000696525.1/), the transcripts, including protein-coding RNAs and non-coding RNAs, were assembled.
Identification of J. curcas lncRNAs and the characteristic features
The prediction of new lncRNAs includes two steps: basic screening and coding-potential screening. Transcript length, exon number, and FPKM were considered in the basic screening step. Transcripts with FPKM above 0.5 are usually considered reliably expressed in RNA sequencing studies. In this report, transcripts with lengths > 200 nt, exons ≥ 2, and FPKM ≥ 0.5 were selected as lncRNA candidates. After basic screening, the sequences of the lncRNA candidates were passed on to coding-potential screening.
Next, all transcripts with protein-coding potential were removed. The protein-coding potential of transcripts was predicted jointly by four analyses: CPC analysis (Coding Potential Calculator)28, CNCI analysis (Coding-Non-Coding Index)29, CPAT analysis (Coding Potential Assessment Tool)30 and Pfam protein domain analysis31. CPC is a protein-coding potential calculation tool based on sequence alignment with known protein databases and the biological sequence characteristics of transcripts; a transcript is classified as noncoding when its score is < 0. CNCI distinguishes non-coding from coding transcripts by the traits of adjacent nucleotide triplets. It does not depend on known annotation files and can effectively predict incomplete transcripts and antisense transcripts; a transcript is classified as noncoding when its score is < 0. CPAT judges the coding ability of a transcript by constructing a logistic regression model and calculating the coding probability based on ORF length and ORF coverage. A transcript is classified as noncoding when its coding probability is < 0.38. The Pfam database is a comprehensive classification system for protein domain annotations. Transcripts with high similarity to a known protein domain (E-value < 0.001) were defined as transcripts with coding ability. Thus, four computational approaches (CPC/CNCI/Pfam/CPAT) were combined to separate lncRNA candidates from putative protein-coding RNAs in the group of unknown transcripts. The lncRNA candidates identified by each of the four methods were counted and plotted in a Venn diagram, and the intersection of the four sets was accepted as the set of putative lncRNAs (Fig. 1). After this screening, candidates with potential coding ability had been removed, leaving a total of 1,850 newly predicted lncRNAs. Four types of lncRNAs were obtained: long intergenic lncRNAs (lincRNAs), antisense lncRNAs, intronic lncRNAs and sense lncRNAs.
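The basic screening step can be illustrated with a minimal Python sketch. It is not the pipeline used in this study; the `Transcript` record, its field names, and the example values are hypothetical, and taking the highest FPKM across samples is an assumption based on the "at least one of the samples" criterion stated in the Methods.

```python
from dataclasses import dataclass


@dataclass
class Transcript:
    # Hypothetical record; field names are illustrative only.
    transcript_id: str
    length_nt: int
    exon_count: int
    max_fpkm: float  # assumed: highest FPKM observed in any sample


def passes_basic_screen(t, min_length=200, min_exons=2, min_fpkm=0.5):
    """Apply the three thresholds from the text: length > 200 nt,
    exons >= 2, and FPKM >= 0.5."""
    return (
        t.length_nt > min_length
        and t.exon_count >= min_exons
        and t.max_fpkm >= min_fpkm
    )


# Made-up transcripts, one failing each criterion in turn.
transcripts = [
    Transcript("tx_a", 1238, 2, 3.1),  # passes all three filters
    Transcript("tx_b", 180, 2, 5.0),   # too short
    Transcript("tx_c", 950, 1, 2.2),   # single exon
    Transcript("tx_d", 640, 3, 0.2),   # expression too low
]

candidates = [t.transcript_id for t in transcripts if passes_basic_screen(t)]
print(candidates)
```

Only transcripts that pass all three thresholds would move on to coding-potential screening.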
The results indicated that there were 893 lincRNAs (48.3%), 553 antisense-lncRNAs (29.9%), 50 intronic-lncRNAs (2.7%), and 354 sense-lncRNAs (19.1%) (Fig. 2).
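The four-way coding-potential filter described above amounts to intersecting the "noncoding" calls of the four tools. The sketch below assumes the per-transcript scores have already been parsed into a dictionary; the record layout, key names, and example scores are hypothetical, and only the cutoffs (score < 0, coding probability < 0.38, E-value < 0.001) come from the text.

```python
CPAT_CUTOFF = 0.38         # coding probability below this counts as noncoding
PFAM_EVALUE_CUTOFF = 1e-3  # a domain hit below this E-value counts as coding

# Made-up scores; pfam_evalue is None when no domain hit was reported.
records = {
    "tx_a": {"cpc": -1.2, "cnci": -0.05, "cpat": 0.10, "pfam_evalue": None},
    "tx_b": {"cpc": 0.8, "cnci": -0.02, "cpat": 0.12, "pfam_evalue": None},
    "tx_c": {"cpc": -0.9, "cnci": -0.10, "cpat": 0.55, "pfam_evalue": None},
    "tx_d": {"cpc": -1.0, "cnci": -0.20, "cpat": 0.20, "pfam_evalue": 1e-8},
    "tx_e": {"cpc": -0.4, "cnci": -0.01, "cpat": 0.37, "pfam_evalue": 0.5},
}

# One set of noncoding calls per tool, as in a four-set Venn diagram.
noncoding_by_tool = {
    "CPC": {t for t, r in records.items() if r["cpc"] < 0},
    "CNCI": {t for t, r in records.items() if r["cnci"] < 0},
    "CPAT": {t for t, r in records.items() if r["cpat"] < CPAT_CUTOFF},
    "Pfam": {
        t
        for t, r in records.items()
        if r["pfam_evalue"] is None or r["pfam_evalue"] >= PFAM_EVALUE_CUTOFF
    },
}

# Only transcripts called noncoding by all four tools are kept.
putative_lncrnas = set.intersection(*noncoding_by_tool.values())
print(sorted(putative_lncrnas))
```

Requiring agreement among all four tools is conservative: a transcript flagged as coding by any single tool is discarded.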
The lengths of lncRNAs ranged from 202 to 10,587 bp, with the vast majority (84%) shorter than 2000 bp (Supplementary Table S1), which is similar to that reported for potato (90%)32. The average length of Jatropha lncRNAs was 1,238 bp, which is considerably greater than that reported for potato (895 bp), rice (800 bp) and chickpea (614 bp)32,33,34. More than half (51%) of mRNAs were longer than 2000 bp, indicating that lncRNAs were on average shorter than mRNAs (Supplementary Figure S1). In this study, 1,386 lncRNAs had two exons; 322, three exons; 88, four exons; and 54, between five and eleven exons. On average, Jatropha lncRNAs had fewer exons than mRNAs (Supplementary Figure S1). These features of lncRNAs have also been found in other plants32.
Quantitative analysis of lncRNAs
Plant lncRNAs play important functional roles in the regulation of plant growth and development. To gain insight into the roles of lncRNAs in Jatropha, the expression levels and patterns of lncRNAs in seeds were determined at three different developmental stages. The lncRNA expression levels were presented as FPKM values, which are comparable between samples. Among the 1,850 lncRNAs identified in this work, about ten percent (196 lncRNAs) were differentially expressed (fold change > 2 and p < 0.01) (Supplementary Table S2). To compare the differential expression of lncRNAs between developmental stages, differential expression was presented as the base-2 logarithm of the fold change in expression level between the small, middle and large stages (Fig. 3). Of the 196 differentially expressed lncRNAs, 125 were up-regulated (Fig. 3a,b) whereas 69 were down-regulated (Fig. 3c,d), and the change was monotonic from the small through the large stage, without fluctuation (Fig. 3). Only two lncRNAs did not follow this pattern: MSTRG.7207.40 could be detected only in the middle stage of seed development, whereas MSTRG.20400.1 could not be detected in the middle stage. Among the 125 up-regulated lncRNAs, about half (61) were up-regulated less than five-fold (Fig. 3a) and about half (64) were up-regulated more than five-fold (Fig. 3b); by contrast, about 26% (18) of the 69 down-regulated lncRNAs were down-regulated less than five-fold (Fig. 3c) while about 74% (51) were down-regulated more than five-fold (Fig. 3d).
Interestingly, most differentially expressed lncRNAs (135 lncRNAs) were observed during seed development from the middle to the large stage, and only 16 lncRNAs (Supplementary Table S2) were differentially expressed when small seeds developed into middle seeds. Reverse transcription (RT) quantitative real-time PCR (qPCR) of 22 differentially expressed lncRNAs was then performed to validate the sequencing results. The RT-qPCR results for 13 down-regulated lncRNAs, 5 up-regulated lncRNAs, and 4 lncRNAs detected only in large seeds (MSTRG.25532.1, MSTRG.18228.2, MSTRG.17363.1 and MSTRG.4651.4) agreed well with the lncRNA expression profiles obtained from the high-throughput sequencing data (Supplementary Figure S2). The differentially expressed lncRNAs were summarized in a heatmap (Supplementary Figure S3). The profiles of the 196 differentially expressed lncRNAs were similar among the three biological replicates, and the samples clustered into three clades corresponding to the three developmental stages, suggesting that the biological replicates were reliable and that the three developmental stages could be used for differential expression analysis of lncRNAs (Supplementary Figure S3). A similar profile was observed between small and middle seeds, whereas large seeds differed markedly from both. This might be because metabolism is similar in small and middle seeds, where cell division, tissue differentiation and rapid growth are in progress, whereas drastic changes take place during maturation, when dry matter accumulation starts27,35. These results suggest that gene regulation by lncRNAs is more active at the late stages of seed development, which closely resembles the miRNA profile in the developing seeds of Jatropha27.
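The grouping of up- and down-regulated lncRNAs into "less than" and "more than" five-fold bins can be sketched from the base-2 logarithm of the fold change. This is an illustration only: the pseudocount of 0.01 (added so that lncRNAs with zero FPKM in one stage do not cause division by zero) is an assumption not stated in the text, and the lncRNA names and FPKM values are made up.

```python
import math

FIVE_FOLD = math.log2(5)  # a five-fold change on the log2 scale


def log2_fold_change(fpkm_from, fpkm_to, pseudocount=0.01):
    """log2 ratio of expression between two stages (pseudocount is assumed)."""
    return math.log2((fpkm_to + pseudocount) / (fpkm_from + pseudocount))


def describe_change(fpkm_from, fpkm_to):
    """Return the direction and size bin of an expression change."""
    lfc = log2_fold_change(fpkm_from, fpkm_to)
    direction = "up" if lfc > 0 else "down"
    size = "more than five-fold" if abs(lfc) > FIVE_FOLD else "less than five-fold"
    return direction, size


# Hypothetical (small-stage FPKM, large-stage FPKM) pairs.
examples = {"lnc_x": (2.0, 6.0), "lnc_y": (12.0, 1.5), "lnc_z": (0.0, 4.0)}
for name, (small, large) in examples.items():
    print(name, describe_change(small, large))
```

Working on the log2 scale makes up- and down-regulation symmetric, so a five-fold increase and a five-fold decrease lie at the same distance from zero.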
Prediction and annotation of lncRNA targets
Based on the mode of interaction between an lncRNA and its target gene, we adopted two prediction methods. First, because an lncRNA may regulate the expression of its adjacent genes, genes located within 10 kb of an lncRNA were predicted to be its target genes according to their positional relationship. Second, because an lncRNA may act on other RNAs through complementary base pairing, LncTar was used to predict lncRNA target genes36. Target genes were found for a total of 1,563 lncRNAs using the two prediction methods (Supplementary Table S3). The targets of differentially expressed lncRNAs were annotated on the basis of KOG/COG. According to their physiological and biochemical functions, and setting aside genes of unknown function, the target genes are mainly involved in three aspects of plant physiology: gene expression regulation, metabolism, and cell growth and development. Interestingly, among the 211 target genes with an explicit KOG function, nearly half (45%) are related to gene expression regulation, including signal transduction mechanisms (T, 24), transcription (K, 24), translation (J, 13), and posttranslational modification (O, 34). The second largest group of targets is involved in metabolism (35%), including carbohydrate transport and metabolism (G, 24), lipid transport and metabolism (I, 8), amino acid transport and metabolism (E, 7), inorganic ion transport and metabolism (P, 10), secondary metabolites biosynthesis, transport and catabolism (Q, 14), and energy production and conversion (C, 11). This is consistent with previous observations in other oil crops, such as castor bean21 and Brassica napus19, in which lncRNAs are important regulators of carbon metabolism and lipid metabolism.
The third group of target genes functions in cell growth and development (11%), such as intracellular trafficking, secretion, and vesicular transport (U, 11), cytoskeleton (Z, 6), cell wall biogenesis (M, 2), and cell cycle control (D, 5) (Fig. 4). These results indicate that the differentially expressed lncRNAs might play an important role in seed development and substance accumulation by regulating many target genes involved in gene expression regulation, metabolism, and cell growth.
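The percentages quoted for the three functional groups follow directly from the per-category counts. The short sketch below recomputes them from the numbers given in the text (KOG category letter and count, out of 211 annotated targets); the dictionary layout and variable names are illustrative.

```python
TOTAL_ANNOTATED_TARGETS = 211  # targets with an explicit KOG function

# Counts per KOG category, as listed in the text.
kog_counts = {
    "gene expression regulation": {"T": 24, "K": 24, "J": 13, "O": 34},
    "metabolism": {"G": 24, "I": 8, "E": 7, "P": 10, "Q": 14, "C": 11},
    "cell growth and development": {"U": 11, "Z": 6, "M": 2, "D": 5},
}

group_totals = {group: sum(c.values()) for group, c in kog_counts.items()}
group_shares = {
    group: round(100 * total / TOTAL_ANNOTATED_TARGETS)
    for group, total in group_totals.items()
}

for group in kog_counts:
    print(f"{group}: {group_totals[group]} targets ({group_shares[group]}%)")
```

The listed categories account for 193 of the 211 annotated targets; the remainder belong to KOG categories that are not enumerated in the text.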
The relationship of lncRNAs and miRNAs in Jatropha
In plants, a group of endogenous small non-coding RNAs named microRNAs (or miRNAs) target mRNAs for cleavage or translational repression. miRNAs are initially transcribed as long polyadenylated transcripts called pri-miRNAs, which are processed into shorter miRNA precursors (pre-miRNAs), and these pre-miRNAs are further processed into mature miRNAs of 18–24 nucleotides (nt)37. A total of 10 lncRNAs were found to be miRNA precursors by mapping miRNAs identified from our previously sequenced small RNA libraries27 to the 1,850 lncRNAs determined in this work, suggesting that some lncRNAs encode miRNAs in Jatropha (Supplementary Table S4). These miRNAs might be important regulators of seed development. For example, a target of miR168 (encoded by lncRNA MSTRG.30828.3) was AGO1, which was essential for miRNA maturation38, and the interaction between miR168 and AGO1 maintained proper embryo development39. In addition, miR168 was shown to be involved directly in lipid biosynthesis in the developing seeds of another woody oil plant (sea buckthorn)40. miR396a (from lncRNA MSTRG.24217.2) targets growth-regulating factor (GRF), which has been shown to control plant seed development41.
Furthermore, 26 lncRNAs were predicted to be the targets of Jatropha miRNAs (Supplementary Table S5). These lncRNAs, which show complementarity to miRNAs, might act as decoys that compete with mRNAs for binding to miRNAs and thereby regulate genes involved in seed development. Target mimicry of miRNAs is one of the most important mechanisms by which lncRNAs regulate plant development11,42. In other plant species, such as chickpea, Arabidopsis, citrus, rice, canola and maize, lncRNAs have also been shown to act as miRNA targets, miRNA precursors or endogenous target mimics34,42,43,44,45,46. In summary, the interaction between lncRNAs and miRNAs could be an important posttranscriptional mechanism of gene regulation in the developing seeds of Jatropha.
The key lncRNAs involved in seed development
To be more specific about the roles of differentially expressed lncRNAs in seeds, lncRNAs were refined in the context of possible functions in the three aspects of oil seed development, i.e., cell growth and development, lipid metabolism, and seed maturation, and eventually 31 key lncRNAs were obtained (Fig. 5 and Supplementary Table S7).
Seven lncRNAs (MSTRG.8261.1, MSTRG.8877.1, MSTRG.10126.2, MSTRG.18228.1, MSTRG.18228.2, MSTRG.25202.3, and MSTRG.30256.1) were implicated in cell division according to the annotated functions of their targets in the UniProt database. Some targets participate in several stages of the cell cycle, such as G2/mitotic-specific cyclin S13-7, anaphase-promoting complex subunit, and sister chromatid cohesion protein pds5. Likewise, some targets are required for cell division, including AUGMIN subunit 5, implicated in spindle assembly; SCD2, in cytokinesis; and DUF724 domain-containing protein 3, in the polar growth of plant cells47. AP2/ERF transcription factors and squamosa promoter-binding protein have both been shown to control plant seed development48,49,50. UVR8 (target of lncRNA MSTRG.25202.3) is required for normal progression of the endocycle, which is endoreduplication or DNA replication without mitosis, leading to the formation of nuclear-type endosperm. This agrees well with the fact that the endosperm of Jatropha is of the nuclear type in the early stage of seed development51. At least 8 lncRNAs (MSTRG.1315.1, MSTRG.9561.1, MSTRG.12857.1, MSTRG.12857.2, MSTRG.13522.1, MSTRG.23348.8, MSTRG.26115.1, MSTRG.29592.1) were associated with cell wall formation and seed coat development, as most of their targets are related to the synthesis of cellulose and pectate, which are required for cell growth. In addition, previous work showed that beta-D-xylosidase (target of MSTRG.12857.1) plays important roles in seed coat development52.
Seed oil is the main focus of research on J. curcas, a promising biodiesel plant. Interestingly, several important proteins involved in lipid metabolism were found to be targets of 9 differentially expressed lncRNAs, including lipase, lipid-transfer protein, O-fatty acyltransferase, oleosin, and non-specific lipid-transfer protein (Fig. 5). In oil seeds, oil is stored in oil bodies surrounded by a half-unit phospholipid membrane containing oleosin, and oleosin is the major protein constituent of the Jatropha oil body53. This agrees with previous reports that oleosin is a target of lncRNAs in the developing seeds of Brassica napus and pigeonpea19,22. The non-specific lipid-transfer proteins facilitate the transfer of fatty acids and phospholipids between membranes. Meanwhile, phospholipase D zeta, regulated by MSTRG.27126.3 through miR827, enhances diacylglycerol flux into triacylglycerol54. These results suggest that lncRNAs play a pivotal part in oil accumulation in Jatropha seeds.
Many target genes regulated by lncRNAs have critical roles in the seed maturation process, during which seeds suffer dehydration stress. For example, heat shock proteins (including DnaJ), AP2/ERF, and heat stress transcription factor may be involved in the regulation of gene expression under stress. Peroxidase and superoxide dismutase are known antioxidant proteins, which may protect seeds from free radical injury. NAC domain-containing protein 67 and late embryogenesis abundant protein (LEA) may play an essential role in seed survival during dehydration stress by maintaining cellular stability55. This is in agreement with a previous report that LEA protein was a lncRNA target in the developing seeds of pigeonpea22. DELLA protein may contribute to the regulation of seed dormancy, which is also an important stage of seed maturation. Collectively, these observations suggest that lncRNAs play a pivotal part in the regulation of desiccation tolerance and the antioxidant system during Jatropha seed maturation.
Conclusions
In summary, we screened and identified 1,850 lncRNAs and examined their expression patterns at three different developmental stages of J. curcas seeds. The possible roles of lncRNAs were investigated, including those of target gene regulator, miRNA precursor and miRNA target. Functional analysis of the target genes of differentially expressed lncRNAs showed that lncRNAs play important parts in multiple biological processes, such as posttranslational modification, carbohydrate metabolism and signal transduction. Further analysis showed that 31 key lncRNAs could function in seed development in the context of cell growth and development, lipid metabolism, and seed maturation. This indicates that the up- or down-regulation of lncRNAs is an important mode of regulating seed development. This study reveals the expression profiles of lncRNAs during seed development, providing important data for further investigation of the molecular mechanisms that regulate seed development.
Materials and methods
Seed collection and RNA isolation
Seed collection and RNA isolation were performed as reported previously27. Fruits of J. curcas were collected randomly from 6 plants at 5–10 (small seeds), 12–20 (middle seeds) and 25–35 (large seeds) days after flower opening (DAF). Small, middle and large seeds represent three typical stages of seed development, i.e., young, intermediate and mature. Seeds at each stage were collected, frozen immediately in liquid nitrogen and stored at −80 °C. Total RNA was isolated from a pool of seeds from each stage with Trizol (Invitrogen, CA, USA) according to the manufacturer’s protocol. RNAs were prepared from three independent biological replicates.
Preparation for lncRNA library
RNA libraries were constructed following the methods described by Zuo et al.56. In brief, a total of 1.5 μg RNA per sample was used as input material for rRNA removal using the Ribo-Zero rRNA Removal Kit (Epicentre, Madison, WI, USA). Sequencing libraries were prepared using the NEBNext® Ultra™ Directional RNA Library Prep Kit for Illumina® (NEB, USA) following the manufacturer’s protocol. Fragmentation was carried out in NEBNext First Strand Synthesis Reaction Buffer (5×) under elevated temperature. First-strand cDNA was synthesized using reverse transcriptase and random hexamer primers. Second-strand cDNA synthesis was subsequently performed using RNase H and DNA Polymerase I. After adenylation of the 3′ ends of the DNA fragments, NEBNext Adaptors with a hairpin loop structure were ligated to prepare for hybridization. The library fragments were purified with AMPure XP Beads (Beckman Coulter, Beverly, USA) to select insert fragments preferentially 150–200 bp in length. PCR was then performed with universal PCR primers and Phusion High-Fidelity DNA polymerase. Finally, PCR products were purified (AMPure XP system), and library quality was assessed on the Agilent Bioanalyzer 2100. Paired-end reads were generated by sequencing on the Illumina HiSeq 2500 platform (San Diego, CA, USA). Three independent RNA libraries were constructed from each of the three stages of the developing seeds, resulting in nine lncRNA libraries.
Sequence data analysis
Sequence data analysis was performed according to procedures previously reported56. In brief, raw reads in fastq format were filtered to remove reads containing adapters, reads containing poly-N and low-quality reads, yielding high-quality clean reads. Clean reads of each sample were mapped to the J. curcas genome (JatCur_1.0, https://www.ncbi.nlm.nih.gov/assembly/GCF_000696525.1/) using HISAT2 (ref. 57), and the mapped reads were subjected to further de novo assembly and quantification by StringTie58. The gffcompare program was used to annotate the assembled transcripts. Putative lncRNAs were screened from the unknown transcripts. To overcome transcriptional noise, transcripts with two or more exons, a length greater than 200 bp, and an abundance greater than 0.5 FPKM in at least one of the samples were selected as lncRNA candidates. These candidates were further screened using four computational approaches (CPC/CNCI/Pfam/CPAT), which can distinguish non-coding genes from protein-coding genes. The different types of lncRNAs, including lincRNAs, intronic lncRNAs, antisense lncRNAs, and sense lncRNAs, were classified using cuffcompare.
Quantification of gene expression levels and differential expression analysis
StringTie (1.3.1) was used to calculate the FPKM of lncRNAs in each sample. FPKM stands for fragments per kilobase of transcript per million fragments mapped, and is calculated from the number of fragments mapped to a transcript, the transcript length (in kilobases) and the total number of mapped fragments (in millions). Differential expression analysis was performed using the DESeq R package (1.10.1) following the procedures described previously27. Benjamini and Hochberg’s approach was adopted to adjust the P values to control the false discovery rate. Genes with an absolute value of log2(fold change) > 1 and an adjusted P value < 0.01 were considered differentially expressed.
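The quantities named in this section can be made concrete with a small Python sketch of the FPKM definition, the Benjamini and Hochberg adjustment, and the final cutoff. It is a simplified illustration under stated assumptions, not a reimplementation of StringTie or DESeq (which additionally model dispersion and normalize library sizes); the function names and example numbers are hypothetical.

```python
def fpkm(fragments, transcript_length_bp, total_mapped_fragments):
    """Fragments per kilobase of transcript per million mapped fragments."""
    kilobases = transcript_length_bp / 1e3
    millions = total_mapped_fragments / 1e6
    return fragments / (kilobases * millions)


def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    n = len(p_values)
    order = sorted(range(n), key=lambda i: p_values[i])
    adjusted = [0.0] * n
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(n, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, p_values[i] * n / rank)
        adjusted[i] = running_min
    return adjusted


def is_differentially_expressed(log2_fc, adjusted_p):
    """Cutoff from the text: |log2(fold change)| > 1 and adjusted P < 0.01."""
    return abs(log2_fc) > 1 and adjusted_p < 0.01


# Made-up example: 100 fragments on a 2 kb transcript, 10 million mapped fragments.
print(fpkm(100, 2000, 10_000_000))

raw_p = [0.001, 0.008, 0.039, 0.041]
adj_p = benjamini_hochberg(raw_p)
print(adj_p)
print(is_differentially_expressed(2.3, adj_p[0]))
```

Note that |log2(fold change)| > 1 is the same criterion as a fold change greater than 2 in either direction.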
Reverse transcription quantitative real-time PCR (RT-qPCR)
RT-qPCR was performed following previously established procedures27. In brief, total RNA was treated with DNase I followed by reverse transcription using random primers and SuperScript™ III reverse transcriptase (Invitrogen) according to the manufacturer’s instructions. Two-step real-time PCR was performed on an ABI StepOne (USA) following a standard SYBR Premix Ex Taq II (TaKaRa) protocol: 95 °C for 5 min, and 40 cycles of 95 °C for 10 s and 60 °C for 30 s. The differences in gene expression were calculated by the 2^(−ΔΔCt) method59, with actin (GenBank: HM044307.1) as the internal reference gene and young seeds as the control for normalization. Each reaction was performed in triplicate. All primers used in RT-qPCR experiments are listed in Supplementary Table S6.
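For readers unfamiliar with the 2^(−ΔΔCt) calculation, the following minimal sketch shows the arithmetic, with actin as the reference gene and young (small) seeds as the control, as described above. The function name and the Ct values are hypothetical; in practice the Ct values would be the means of the technical triplicates.

```python
def relative_expression(ct_target_sample, ct_ref_sample,
                        ct_target_control, ct_ref_control):
    """Relative expression by the 2^(-ddCt) method.

    dCt  = Ct(target) - Ct(reference gene), computed per condition
    ddCt = dCt(sample) - dCt(control)
    """
    delta_ct_sample = ct_target_sample - ct_ref_sample
    delta_ct_control = ct_target_control - ct_ref_control
    delta_delta_ct = delta_ct_sample - delta_ct_control
    return 2 ** (-delta_delta_ct)


# Made-up Ct values: an lncRNA in large seeds (sample) versus
# small seeds (control), each normalized to actin.
fold = relative_expression(
    ct_target_sample=24.0, ct_ref_sample=18.0,
    ct_target_control=26.0, ct_ref_control=18.0,
)
print(fold)
```

A value above 1 indicates higher expression than in the control seeds, and a value below 1 indicates lower expression. The method assumes roughly equal amplification efficiency for the target and reference primers.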
Prediction and annotation of lncRNA targets
Genes located within 10 kb of an lncRNA were predicted to be its target genes according to the positional relationship between lncRNAs and genes. LncTar was used to predict lncRNA target genes based on complementary base pairing36. LncTar uses complementary sequences between an lncRNA and an RNA to calculate the free energy and the standardized free energy. When the free energy of the pairing sites between the RNA and the lncRNA was below the threshold of standardized free energy (< −0.1), the RNA was considered to be a target of the lncRNA. Gene function was annotated based on the KOG/COG database (Clusters of Orthologous Groups of proteins; https://www.ncbi.nlm.nih.gov/KOG).
Prediction of lncRNAs targeted by miRNAs
The miRNA target prediction was performed by aligning the mature miRNA sequences against J. curcas lncRNA sequences using psRNATarget with default parameters, except for a strict Expectation value of 3 (ref. 60).
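The positional rule for target prediction (genes within 10 kb of an lncRNA) can be sketched as an interval-distance check. This is an illustration under assumptions: the `Feature` record, coordinates, and gene names are hypothetical, strand is ignored, and overlapping genes are counted as being at distance zero, none of which is specified in the text.

```python
from dataclasses import dataclass


@dataclass
class Feature:
    # Hypothetical genomic feature with 1-based inclusive coordinates.
    name: str
    chrom: str
    start: int
    end: int


def gap_bp(a, b):
    """Distance in bp between two features on the same chromosome (0 if they overlap)."""
    return max(0, max(a.start, b.start) - min(a.end, b.end))


def neighbouring_targets(lncrna, genes, window_bp=10_000):
    """Names of genes on the same chromosome within window_bp of the lncRNA."""
    return [
        g.name
        for g in genes
        if g.chrom == lncrna.chrom and gap_bp(lncrna, g) <= window_bp
    ]


lnc = Feature("lnc_example", "scaffold1", 50_000, 51_200)
genes = [
    Feature("geneA", "scaffold1", 40_500, 45_000),  # 5 kb upstream
    Feature("geneB", "scaffold1", 70_000, 72_000),  # 18.8 kb downstream
    Feature("geneC", "scaffold2", 50_000, 51_000),  # different scaffold
    Feature("geneD", "scaffold1", 50_500, 53_000),  # overlaps the lncRNA
]

print(neighbouring_targets(lnc, genes))
```

The complementary-pairing predictions from LncTar are a separate calculation and are not reproduced here.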
Data availability
All sequencing data were deposited in the NCBI Sequence Read Archive under accession numbers SRR7640476 (small seeds), SRR7640477 (middle seeds) and SRR7640478 (large seeds). All data generated or analyzed during this study are included in this published article and its Supplementary Information files.
References
Abdulla, R., Chan, E. S. & Ravindra, P. Biodiesel production from Jatropha curcas: a critical review. Crit. Rev. Biotechnol. 31, 53–64 (2011).
Kandpal, J. B. & Madan, M. J. Jatropha curcus: a renewable source of energy for meeting future energy needs. Renew. Energ. 6, 159–160 (1995).
Natarajan, P. et al. Gene discovery from Jatropha curcas by sequencing of ESTs from normalized and full-length enriched cDNA library from developing seeds. BMC Genom. 11, 606 (2010).
Costa, G. G. L. et al. Transcriptome analysis of the oil-rich seed of the bioenergy crop Jatropha curcas L. BMC Genom. 11, 462 (2010).
Xu, R. H., Wang, R. L. & Liu, A. Z. Expression profiles of genes involved in fatty acid and triacylglycerol synthesis in developing seeds of Jatropha (Jatropha curcas L.). Biomass Bioenergy 35, 1683–1692 (2011).
Jiang, H. W. et al. Global analysis of gene expression profiles in developing physic nut (Jatropha curcas L.) seeds. PLoS ONE 7, e36522 (2012).
Sato, S. et al. Sequence analysis of the genome of an oil-bearing tree Jatropha curcas L. DNA Res. 18, 65–76 (2011).
Ha, J. et al. Genome sequence of Jatropha curcas L., a non-edible biodiesel plant, provides a resource to improve seed-related traits. Plant Biotechnol. J. 17, 517–530 (2019).
Wu, P. et al. Integrated genome sequence and linkage map of physic nut (Jatropha curcas L.), a biodiesel plant. Plant J. 81, 810–821 (2015).
Tay, Y., Rinn, J. & Pandolfi, P. P. The multilayered complexity of ceRNA crosstalk and competition. Nature 505, 344–352 (2014).
Datta, R. & Paul, S. Long non-coding RNAs: fine-tuning the developmental responses in plants. J. Biosci. 44, 77 (2019).
Hou, J. et al. Non-coding RNAs and transposable elements in plant genomes: emergence, regulatory mechanisms and roles in plant development and stress responses. Planta 250, 23–40 (2019).
Zhang, X. et al. Mechanisms and functions of long non-coding RNAs at multiple regulatory levels. Int. J. Mol. Sci. 20, 5573 (2019).
Yu, Y., Zhang, Y., Chen, X. & Chen, Y. Plant noncoding RNAs: hidden players in development and stress responses. Annu. Rev. Cell Dev. Biol. 35, 407–431 (2019).
Wang, H. V. & Chekanova, J. A. Long noncoding RNAs in plants. Adv. Exp. Med. Biol. 1008, 133–154 (2017).
Kim, E. D. et al. Spatio-temporal analysis of coding and long noncoding transcripts during maize endosperm development. Sci. Rep. 7, 3838 (2017).
Zhang, M. et al. Extensive, clustered parental imprinting of protein-coding and noncoding RNAs in developing maize endosperm. Proc. Natl. Acad. Sci. USA 108, 20042–20047 (2011).
Zhu, M. et al. Transcriptomic analysis of long non-coding RNAs and coding genes uncovers a complex regulatory network that is involved in maize seed development. Genes 8, 274 (2017).
Shen, E. et al. Genome-wide identification of oil biosynthesis-related long non-coding RNAs in allopolyploid Brassica napus. BMC Genom. 19, 745 (2018).
Yin, D. D. et al. Identification of microRNAs and long non-coding RNAs involved in fatty acid biosynthesis in tree peony seeds. Gene 666, 72–82 (2018).
Xu, W. et al. Differential expression networks and inheritance patterns of long non-coding RNAs in castor bean seeds. Plant J. 95, 324–340 (2018).
Das, A. et al. Expressivity of the key genes associated with seed and pod development is highly regulated via lncRNAs and miRNAs in pigeonpea. Sci. Rep. 9, 18191 (2019).
Jiang, H. et al. Identification and characterization of long non-coding RNAs involved in embryo development of Ginkgo biloba. Plant Signal. Behav. 14, 1674606 (2019).
Zhao, J. et al. Genome-wide identification of lncRNAs during rice seed development. Genes 11, 243 (2020).
Galli, V. et al. Identifying microRNAs and transcript targets in Jatropha seeds. PLoS ONE 9, e83727 (2014).
Wang, C. M. et al. Isolation and identification of miRNAs in Jatropha curcas. Int. J. Biol. Sci. 8, 418–429 (2012).
Yang, M. F., Lu, H. S., Xue, F. Y. & Ma, L. Q. Identifying high confidence microRNAs in the developing seeds of Jatropha curcas. Sci. Rep. 9, 1–11 (2019).
Kong, L. et al. CPC: assess the protein-coding potential of transcripts using sequence features and support vector machine. Nucl. Acids Res. 35, W345–W349 (2007).
Sun, L. et al. Utilizing sequence intrinsic composition to classify protein-coding and long non-coding transcripts. Nucl. Acids Res. 41, e166 (2013).
Wang, L. et al. CPAT: coding-potential assessment tool using an alignment-free logistic regression model. Nucl. Acids Res. 41, e74 (2013).
Finn, R. D. et al. Pfam: the protein families database. Nucl. Acids Res. 42, D222–D230 (2014).
Hou, X. et al. Genome-wide analysis of long non-coding RNAs in potato and their potential role in tuber sprouting process. Int. J. Mol. Sci. 19, 101 (2017).
Zhang, Y. C. et al. Genome-wide screening and functional analysis identify a large number of long noncoding RNAs involved in the sexual reproduction of rice. Genome Biol. 15, 512 (2014).
Khemka, N., Singh, V. K., Garg, R. & Jain, M. Genome-wide analysis of long intergenic non-coding RNAs in chickpea and their potential role in flower development. Sci. Rep. 6, 33297 (2016).
Liu, H. et al. Proteomic analysis of the seed development in Jatropha curcas: From carbon flux to the lipid accumulation. J. Proteom. 91, 23–40 (2013).
Li, J. et al. LncTar: a tool for predicting the RNA targets of long noncoding RNAs. Brief. Bioinform. 16, 806–812 (2015).
Voinnet, O. Origin, biogenesis, and activity of plant microRNAs. Cell 136, 669–687 (2009).
Vaucheret, H. AGO1 homeostasis involves differential production of 21-nt and 22-nt miR168 species by MIR168a and MIR168b. PLoS ONE 4, e6442 (2009).
Vaucheret, H., Vazquez, F., Crete, P. & Bartel, D. P. The action of ARGONAUTE1 in the miRNA pathway and its regulation by the miRNA pathway are crucial for plant development. Genes Dev. 18, 1187–1197 (2004).
Ding, J., Ruan, C., Guan, Y. & Krishna, P. Identification of microRNAs involved in lipid biosynthesis and seed size in developing sea buckthorn seeds using high-throughput sequencing. Sci. Rep. 8, 4022 (2018).
Sun, P. et al. OsGRF4 controls grain shape, panicle length and seed shattering in rice. J. Integr. Plant Biol. 58, 836–847 (2016).
Wu, H. J., Wang, Z. M., Wang, M. & Wang, X. J. Widespread long noncoding RNAs as endogenous target mimics for microRNAs in plants. Plant Physiol. 161, 1875–1884 (2013).
Ke, L. et al. Evolutionary dynamics of lincRNA transcription in nine citrus species. Plant J. 98, 912–927 (2019).
Liu, H., Wang, R., Mao, B., Zhao, B. & Wang, J. Identification of lncRNAs involved in rice ovule development and female gametophyte abortion by genome-wide screening and functional analysis. BMC Genom. 20, 90 (2019).
Joshi, R. K., Megha, S., Basu, U., Rahman, M. H. & Kav, N. N. Genome wide identification and functional prediction of long non-coding RNAs responsive to Sclerotinia sclerotiorum infection in Brassica napus. PLoS ONE 11, e0158784 (2016).
Fan, C., Hao, Z., Yan, J. & Li, G. Genome-wide identification and functional analysis of lincRNAs acting as miRNA targets or decoys in maize. BMC Genom. 16, 793 (2015).
Cao, X. et al. Characterization of DUF724 gene family in Arabidopsis thaliana. Plant Mol. Biol. 72, 61–73 (2010).
El Ouakfaoui, S. et al. Control of somatic embryogenesis and embryo development by AP2 transcription factors. Plant Mol. Biol. 74, 313–326 (2010).
Maes, T. et al. Petunia Ap2-like genes and their role in flower and seed development. Plant Cell 13, 229–244 (2001).
Liu, Q., Harberd, N. P. & Fu, X. SQUAMOSA promoter binding protein-like transcription factors: targets for improving cereal grain yield. Mol. Plant 9, 765–767 (2016).
Krishnamurthy, K. V. Embryology of Jatropha: a review. In Jatropha, Challenges for a New Energy Crop (eds Bahadur, B. et al.) 75–86 (Springer, Berlin, 2013).
Arsovski, A. A. et al. AtBXL1 encodes a bifunctional beta-D-xylosidase/alpha-L-arabinofuranosidase required for pectic arabinan modification in Arabidopsis mucilage secretory cells. Plant Physiol. 150, 1219–1234 (2009).
Yang, M. F. et al. Proteomic analysis of oil mobilization in seed germination and postgermination development of Jatropha curcas. J. Proteome Res. 8, 1441–1451 (2009).
Yang, W. et al. Phospholipase D zeta enhances diacylglycerol flux into triacylglycerol. Plant Physiol. 174, 110–123 (2017).
Manfre, A. J., Lanni, L. M. & Marcotte, W. R. Jr. The Arabidopsis group 1 LATE EMBRYOGENESIS ABUNDANT protein ATEM6 is required for normal seed development. Plant Physiol. 140, 140–149 (2006).
Zuo, J. et al. Analysis of the coding and non-coding RNA transcriptomes in response to bell pepper chilling. Int. J. Mol. Sci. 19, 2011 (2018).
Kim, D., Langmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015).
Pertea, M., Kim, D., Pertea, G. M., Leek, J. T. & Salzberg, S. L. Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat. Protoc. 11, 1650–1667 (2016).
Livak, K. J. & Schmittgen, T. D. Analysis of relative gene expression data using real-time quantitative PCR and the 2^(−ΔΔCT) method. Methods 25, 402–408 (2001).
Dai, X., Zhuang, Z. & Zhao, P. X. psRNATarget: a plant small RNA target analysis server (2017 release). Nucl. Acids Res. 46, W49–W54 (2018).
Acknowledgements
This work was supported by the National Natural Science Foundation of China (31370674).
Author information
Contributions
M.F.Y. and L.Q.M. designed the experiments, analyzed the data and wrote the main manuscript text; X.H.Y. analyzed the data and prepared all figures and tables. All authors read and approved the final version of the manuscript.
Ethics declarations
Competing interests
The authors declare no competing interests.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Yan, X., Ma, L. & Yang, M. Identification and characterization of long non-coding RNA (lncRNA) in the developing seeds of Jatropha curcas. Sci Rep 10, 10395 (2020). https://doi.org/10.1038/s41598-020-67410-x
DOI: https://doi.org/10.1038/s41598-020-67410-x