Abstract
Long non-coding RNAs (lncRNAs) play important roles in genomic imprinting, cancer, differentiation and regulation of gene expression. Here, we identified 3844 long intergenic ncRNAs (lincRNA) in Plutella xylostella, which is a notorious pest of cruciferous plants that has developed field resistance to all classes of insecticides, including Bacillus thuringiensis (Bt) endotoxins. Further, we found that some of those lincRNAs may potentially serve as precursors for the production of small ncRNAs. We found 280 and 350 lincRNAs that are differentially expressed in Chlorpyrifos and Fipronil resistant larvae. A survey on P. xylostella midgut transcriptome data from Bt-resistant populations revealed 59 altered lincRNA in two resistant strains compared with the susceptible population. We validated the transcript levels of a number of putative lincRNAs in deltamethrin-resistant larvae that were exposed to deltamethrin, which indicated that this group of lincRNAs might be involved in the response to xenobiotics in this insect. To functionally characterize DBM lincRNAs, gene ontology (GO) enrichment of their associated protein-coding genes was extracted and showed over representation of protein, DNA and RNA binding GO terms. The data presented here will facilitate future studies to unravel the function of lincRNAs in insecticide resistance or the response to xenobiotics of eukaryotic cells.
Similar content being viewed by others
Introduction
Metazoan genomes encode a large number of regulatory and housekeeping noncoding RNAs (ncRNAs). Regulatory ncRNAs include small RNAs, such as microRNAs (miRNAs), small interfering RNAs (siRNAs) and long noncoding RNAs (lncRNAs), which are mainly involved in regulation of gene expression. RNA transcripts longer than 200 nucleotides, which do not contain an open reading frame of longer than 100 amino acids, are simply defined as lncRNA1,2. Although several lncRNAs have been identified and characterized in the last decade, genome-wide identification of this class of ncRNAs has only recently become possible with the advent of whole transcriptome sequencing technologies.
Generally, they are classified by their location relative to their neighboring protein-coding genes and include the long intergenic ncRNA (lincRNA), intronic lncRNA, antisense lncRNA and enhancer RNA1. However, some other nomenclature has been suggested in the light of the rising amount of deep sequencing data, which provides more information on lncRNA function and mechanism. Bonasio and Shiekhattar (2014) classified lncRNAs to promoter-associated RNA (pRNA), enhancer-associated RNA (eRNA), intervening RNA (iRNA) and gene body associated RNAs which included sense (gsRNA) and antisense overlapping non coding RNA (gaRNA)3.
RNA polymerase II is responsible for the transcription of lncRNA and usually some of these transcripts are precursors for small regulatory RNAs, but a large number of them have no distinguishable purpose4. However, understanding the biology of lncRNAs is still at an early stage and there are disagreements regarding their functionality and classification3. Although it has been suggested that lncRNAs are merely products of inaccuracy in transcription machinery or transcriptional noise5, their involvement in several biological pathways has been demonstrated6,7,8. Considering all possible scenarios, even if only 10% of identified lncRNAs have a biological role, more than a thousand human loci would generate functional lncRNAs4.
Recent findings have demonstrated that lncRNAs take part in genomic imprinting, cancer, cell differentiation, regulation of gene expression and development9,10,11,12,13,14. Databases have also been developed that facilitate exploring interactions of ncRNAs, including lncRNAs, with proteins, RNAs and viruses, but mainly in humans15,16. While studies on insect lncRNAs are limited, they suggest that lncRNAs could be involved in various functions. For example, it has been shown that Apis mellifera lncRNAs, which are highly expressed in ovaries, are probably involved in the fine tuning of developmental processes underlying phenotypic plasticity related to social life histories in honey bees9. It has also been reported that heterochromatin formation in insects is correlated with lncRNA expression12. Overexpression of lncRNA in certain developmental stages of Spodoptera frugiperda produced a variety of lncRNA associated small RNAs, such as repeat associated small interfering RNAs (rasiRNAs), which appeared to correlate with subsequent accumulation of a heterochromatic histone mark12. Recently, it was shown that lncRNAs also coordinate sex determination in Drosophila17.
Previous work has indicated linRNAs’ quick response to different stimuli and stress factors8,18,19. Up-regulation of some lncRNAs by genotoxic agents such as mitomycin C or doxorubicin has been reported in a few mammalian cell lines18. Some lincRNAs act as suppressors of the p53 transcriptional response and they were induced by DNA damage caused by doxorubicin20. Stress can also change the lincRNA expression profile in plants and in Arabidopsis thaliana, the expression level of a lincRNA (At5NC056820) increased by 22-fold when plants were treated with elf18 (EF-Tu), which triggers pathogen-associated molecular pattern responses21. The involvement of lncRNAs in pathways associated with responses to cellular stress makes them interesting candidates to investigate when organisms are exposed to insecticides and other toxicants22.
lncRNAs demonstrate low evolutionary sequence conservation even among closely related species. However, significant improvement in RNAseq technologies and reproducibility-based bioinformatics strategies for genome-wide screening have uncovered thousands of lncRNA sequences in the genomes of both lower living organisms, such as yeasts, as well as higher eukaryotes. For instance, more than 14,000 lncRNA genes in human4,23, 6480 in A. thaliana21, 4000 in bovine muscle24 and 1133 multi-exonic lncRNA transcripts from zebrafish (Danio rerio)25 have been reported in the last few years. Among these, only a few genes have been annotated as lncRNA in insects, which are mostly in model species such as Drosophila melanogaster and A. mellifera. It has been estimated that more than 5000 loci potentially encode non-coding transcripts in D. melanogaster, however, only seven loci (bxd, Hsrω, pgc, roX1, rox2, sphinx and yar) have been experimentally annotated as functional regulatory lncRNAs in the insect26. A recent study conducted by Padron et al. (2014) demonstrated that 43% of total midgut transcripts of Anopheles gambiae are lncRNAs and 32% of them showed some level of homology to other species27.
Diamondback moth (DBM), Plutella xylostella L (Lepidoptera: Plutellidae) is a notorious pest of cruciferous plants that has developed resistance to all groups of insecticides, including Bacillus thuringiensis endotoxins28. This has contributed towards P. xylostella becoming one of the world’s most destructive insect pests and its estimated cost to global economy has been estimated to be US$ 4–5 billion annually28,29. This emphasizes the necessity for the continued development of innovative alternative control measures and resistance management strategies.
The current study provides a glimpse of the lincRNA profile of this important agricultural insect pest. This catalog, as the first list of P. xylostella lincRNAs, is a complement to the list of other ncRNAs that have already been discovered in this species30,31 and therefore will help to better annotate the genome of DBM32. There is an ever-growing number of reports, which illustrate the role that lncRNAs play in cellular defence mechanisms against a wide range of toxic agents. In this study, we hypothesized that insect lncRNAs may regulate detoxification genes and act as important mediators in the development of insecticide resistance in P. xylostella. To support our hypothesis, we produced the expression profile of putative lincRNAs in four insecticide-resistant populations and found that a number of P. xylostella lincRNAs were considerably altered in resistant populations. Speculation regarding the biological functions of this class of ncRNAs is increasing and these data will facilitate future studies to unravel the function of lincRNAs in insecticide resistance and other detoxification responses in eukaryotic cells.
Materials and Methods
RNA-seq Data preparation
Previously sequenced RNA-seq raw data for P. xylostella were downloaded from NCBI Sequences Read Archive with accession number SRA034927 and SRP05946333,34. Raw data were stripped of adapters using CLC Genomic Workbench version 7.5.1 and reads with quality score of above 0.05 and maximum 2 ambiguous sequences were retained for further analysis.
Large gap mapping and transcript discovery
The CLC Genomic workbench’s Transcript Discovery plugin was used for long intergenic non-coding RNAs discovery in the P. xylostella genome. New transcripts were identified by large gap mapping of 330,410,738 reads of seven RNA-Seq libraries to genomic reference (Px genome assembly version2 available at http://iae.fafu.edu.cn/DBM). We implemented strict mapping criteria (mismatch, insertion and deletion costs: 2: 3: 3 respectively). The minimum similarity and length fraction of 0.9 between a mapped segment and the reference were allowed in mapping criteria. The large gap mapper algorithm also requires that each mapped segment must comprise at least 10% of the read and must be of minimum length 17 bp. We considered a gap with maximum 50 Kbp distance between mapped read segments to span the introns from RNA-seq data. The annotations are generated by examining the read mapping and identifying likely regions of genes, their exons and splice sites. The algorithm scanned each gap in the read mapping to see if the gap is assigned to a valid splice site or can be moved to a valid splice site without cost.
lincRNA identification pipeline
A stringent filtering pipeline was developed to discard transcripts with evidence for protein coding potential. The pipeline for P. xylostella lincRNAs discovery is summarized in Fig. 1. We identified 55,793 potential genes using the CLC Genomic Workbench transcript discovery algorithm. The genes that were annotated as known P. xylostella genes were discarded and 35,425 potential genes were also checked for any exon or intron overlap with other known P. xylostella genes. We selected 14,663 sequences, which were located more than 1 kb away from any other known transcripts, for finding putative open reading frames (ORF). All possible six frames were produced for all selected sequences and then the translated sequences were subjected to a domain search to identify any putative conserved protein domains through Pfam v27.0 database35. We discarded 4,746 sequences with potential ORF above 100aa or conserved protein domains. Any possible similarity with other known proteins was found by using BLASTx algorithm against nr and Swiss port database (E-value cut off 10−5) for 9,917 of sequences. We also implemented an expression threshold on our data to strengthen the identification pipeline. Sequences with more than 10 mappable reads in at least three out of eight RNA-seq libraries were considered as valid sequences and were kept for the next step. 4,522 sequences were subjected to Coding Potential Calculator (CPC) tool, which is publically available on http://cpc.cbi.pku.edu.cn to check for any other potential coding regions36. CPC is a Support Vector Machine-based classifier, which is able to assess the protein-coding potential of a transcript based on six biologically meaningful sequence features. Sequences with the score of above −1 were determined as putative protein coding genes and removed from the list. The data were also submitted to another coding potential assessment tool (CPAT), which uses a logistic regression model built with four sequence features: ORF size, ORF coverage, Fickett TESTCODE statistic and hexamer usage bias37. We applied the coding probability threshold of 0.3, which led to discarding 27 sequences as putative coding RNAs. Finally, identical and overlapped sequences were removed from the Px lincRNAs’ profile and 3844 potential lincRNAs were used for further study.
To identify P. xylostella putative lincRNAs that are regarded as small RNA associated lincRNAs, we used the Blast algorithm to search for DBM pre-miRNA sequences in the predicted DBM lincRNA dataset.
Px lincRNAs expression analysis
The P. xylostella genome was annotated with the final list of lincRNAs and used as reference for RNA-seq analysis in CLC Genomic Workbench. To measure the normalized value for RNA expression and remove technical biases, which is natural in the sequencing approach, RPKM (Reads Per Kilobase per Million reads) was assigned as expression value in each library38. To find the differential expression pattern in the insecticide resistant population, data from Fipronil (FR) and Chlorpyrifos (CR) resistant larvae were compared with the insecticide-susceptible strain larvae33. We also downloaded the P. xylostella midgut transcriptome data from Bacillus thuringiensis (Bt) resistant populations through SRA074406 and compared the expression values of the putative lincRNAs from two Bt-resistant strains (MK and GK) with those of a Bt-susceptible (MM) population39. Baggerley’s test, a count based statistical analysis was done on the data. The samples were given different weights depending on their sizes (total counts). The weights were obtained by assuming a Beta distribution on the proportions in a group and estimating these, along with the proportion of a binomial distribution.
The nearest neighbouring genes to the set of 3844 putative lincRNAs were extracted using Px genome assembly version 2 coordinates and their expression values were calculated by CLC Genomic Workbench in a similar way. The gene ontology (GO) enrichment was extracted for all the neighbouring protein-coding genes and more abundant terms were computed for each category of molecular function, biological process and cellular components.
Resistance selection and validation of Px lincRNA expression
To generate an insecticide-resistant P. xylostella population, a population of P. xylostella collected from a cabbage field in Gatton, south-east Queensland in early 2014 was exposed to deltamethrin (Decis®, 25 g/l EC, Bayer) in the laboratory for 15 generations. Based on comparison of LC50 values for this selected population (LC50: 1291.59 ppm; 95% CI: 1145.09–1496.93 ppm and slope ± SE: 9.189 ± 1.934) and the Waite insecticide-susceptible population (LC50: 0.519 ppm; 95% CI: 0.244–1.413 and slope ± SE: 0.98 ± 0.20), that has been maintained in the laboratory without exposure to any insecticides for more than 25 years40, this population demonstrated 2500-fold resistance to deltamethrin.
To determine the relative transcript levels of a selected number of lincRNAs in insecticide-resistant and insecticide-treated P. xylostella, thirty 3rd instar larvae from the deltamethrin-resistant population (Delta-R) were exposed to 1000 ppm deltamethrin (~LC50 of the previous generation) and the surviving larvae were collected 24 h post-treatment. In the control groups, we included insects from the Delta-R and Waite populations that had not been exposed to insecticide.
Total RNA was extracted from all larval samples using TRIzol® reagent (Life Technologies). Concentrations of the RNA samples were measured using a spectrophotometer and integrity was ensured through analysis on a 1% (w/v) agarose gel. First strand cDNA was synthesized from 2 μg of RNA using a poly-dT primer and Superscript III reverse transcriptase (Life Technologies). The follow up qPCR reaction consisted of 2 μL of diluted cDNA (10 ng), 5 μL of QuantiFast SYBR Green PCR Master Mix with ROX (Qiagen) and 1 μM of each lincRNA specific (forward and reverse) primers (Supplementary Table S1) in 10 μL total volume. Reactions were performed in triplicates in a Rotor-Gene thermal cycler (Qiagen) under the following conditions: 95 °C for 5 min; and 40 cycles of 95 °C for 10 s and 60 °C for 30 s, followed by the melting curve (68 °C to 95 °C). Melting curves for each sample were analysed after each run to check the specificity of amplification. Gene copy numbers were calculated using the Rotor-Gene software and an endogenous reference actin gene was used for normalization. One-way ANOVA with Tukey’s PostHoc test was used to identify statistically significant differences.
Results and Discussion
Identification and characterisation of P. xylostella lincRNAs
The P. xylostella genome is not completely annotated and its chromosome arrangement is not available, however, larger scaffolds with more predicted genes were found to have more lincRNA than shorter scaffolds. A positive correlation (R2 = 0.834, P < 0.0001) was found between the number of genes on scaffolds and putative lincRNAs (Fig. 2A). In total, 3,844 putative lincRNAs in 830 P. xylostella genome scaffolds were identified (Supplementary File 1). The majority of these scaffolds contain only one lincRNA locus (34%), however, 18 scaffolds (2%) were enriched with more than 20 lincRNAs (Fig. 2B). The detailed information of these scaffolds, which contain the highest number of lincRNAs are summarized in Table 1.
The DBM lincRNA genes displayed a slightly lower GC content (average 36.9%) in comparison to 42.0% in their protein coding gene sequences (Fig. 2C). The lower GC content or AT enrichment is a typical characteristic of lincRNAs and our findings are congruent with other predicted lincRNAs in other species41,42. However, it has been shown that lncRNAs with higher GC content are marginally more stable and they displayed an increased half-life in mouse Neuro-2a cell line42. Global measurement of half-lives of lncRNA in mouse neural cells exhibited a universal instability for lncRNAs as compared with mRNAs42, however, among different classes of lncRNAs, intergenic transcripts are more stable than intronic lncRNAs43.
The short transcript and gene lengths, low exon number and lower expression, compared to protein-encoding genes are typical characteristics of mammalian lncRNAs2,24. The majority of P. xylostella predicted lincRNAs are smaller than 4000 bp and their length distribution is represented in Fig. 2D. Plutella xylostella lincRNA candidates are notably shorter in length than protein-coding genes, demonstrating another well known characteristic of lncRNA transcripts (Fig. 3A). We measured the gene expression level (RPKM) of all the identified lincRNAs and their proximity to protein-coding transcripts. The data showed that overall the neighbouring protein coding genes are expressed much less than the P. xylostella predicted lincRNAs (Fig. 3B), this is in contrast to the current literature in mammalian species. However, Cabili et al. (2011) found that only 28% of their identified lincRNAs had a significant correlation with their neighbouring protein coding genes2 and it has also been shown that they have strong tissue-specific expression patterns. In addition, it has been demonstrated that in MALAT1 knockout mice, in which one of the most abundant and conserved lncRNAs (MALAT1) is absent, only a small number of its neighbouring genes were deregulated43,44.
We found some level of similarity among P. xylostella lincRNA sequences with other closely related insect genomes such as Bombyx mori, S. frugiperda and one distantly related mosquito, Aedes aegypti (Fig. 4). The E-value cut off 10−10 was applied to our screening with the BLAST algorithm to identify the conserved sequences among P. xylostella lincRNAs and the three other insect genomes. Although the DBM P. xylostella lincRNAs shared many sequences with genomes of the two closely related insects (864 and 1243 sequences with conserved areas in B. mori and S. frugiperda genomes, respectively), only 14 had detectable sequence similarity among all three species and were usually limited to a single short region of high conservation (Fig. 4). Ulitsky et al. (2011) identified 550 distinct lincRNAs in zebrafish, but only 29 showed similar sequence conservation with their putative mammalian orthologues45. Previous studies reported thousands of lincRNAs in human and other model organisms, while just a few have detectable sequence homology in other species. Lack of conservation among identified lincRNAs is one of the challenges in their comparative sequence analysis and functional studies among non-model organisms45.
In a recent annotation of DBM genome in NCBI, 707 loci were predicted as lncRNAs with Gnomon, a NCBI eukaryotic gene prediction tool46. In the current study, we recalled 250 of those sequences in the P. xylostella lincRNA repertoire with BLAST E-value cutoff of 10−10 and minimum bit score of 100, which share some conserved or repetitive regions. Nevertheless, only 78 newly predicted lincRNAs overlapped with the recently annotated P. xylostella lncRNAs and were located in the similar locus. Several P. xylostella lincRNAs also share repetitive sequences or previously identified microsatellite sequences47. It has been shown that many repetitive elements such as telomeric RNAs, satellite RNAs and even some retrotransposons are among lncRNAs. There is significant evidence of similar repetitive non-coding sequences in mammals, whose expressions are altered by external stimuli48. For example, transcripts of satellite RNA repeats within the chromatin could act as cis regulators49. lincRNAs also play a role in the regulation of the neighbouring genes and act as cis-regulatory elements, however, they also have the ability to act in trans and regulate distantly located genes.
In general, lincRNAs are co-expressed with their neighbouring genes2. Typically they are transcribed from the chromosome that they are regulating6. To functionally characterize P. xylostella lincRNAs, gene ontology (GO) enrichment of their associated protein-coding genes was extracted. The GO terms enrichment in molecular function category indicate that P. xylostella lincRNAs are likely to play an important role in binding associated activity because most of the overrepresented GO terms are linked to protein binding, zinc ion binding and even DNA and RNA binding functions (Fig. 5). Proteolysis and metabolic process were the most abundant GO terms among Biological Process. The GO term of “Regulation of transcription” was also overrepresented among the Biological Process category, which is a common and the most abundant term among other species of lncRNA GO analysis11,25,50,51.
Our results suggest that a small number of P. xylostella lincRNAs are involved in the production of small RNAs or serve as primary miRNAs. Previous studies also reported that the majority of lincRNAs are processed by small RNA-independent machinery4. We identified 10 miRNA precursors in our lincRNA profile (Supplementary Table S2). The pre-miRNA sequences of recently annotated P. xylostella miRNAs miR-8497, miR-8517 and miR-854631 were identified in more than 20 potential lincRNAs, which shows that these miRNAs originate from repetitive elements in the genome.
The different proposed scenarios on the interaction of small RNAs, such as miRNAs, with lncRNAs, have been reviewed by Yoon et al. 201452. Long non-coding RNAs can be regulated by some miRNAs. It has been revealed that miRNA let-7b contributed to lowering lincRNA-p21 stability in human cervical carcinoma cells and overexpression of this miRNA may facilitate degradation of the lincRNA53. In a recent study, miR-CLIP technology identified lincRNA H19 as a target of miR-106a. Surprisingly, the miRNA overexpression through ectopic delivery of miR-106a into cells caused an approximately 6-fold increase in human lincRNA H1954. In other cases, lncRNAs can act as miRNA decoys and also compete with them to bind to mRNAs. Those lncRNAs, which are involved in regulation of gene expression, may compete with miRNAs to control their target genes. As we have shown here, some miRNAs may originate from lncRNAs. It has been shown that human lincRNA MD1 generates miR-206 and miR-133b from an intron and an exon, respectively55. To date, most studies have been done in human cells and mainly on miRNA and lncRNA interactions. More studies are required to fully understand the complexity of interaction between small and long ncRNA and their link to various biological conditions.
DBM lincRNAs differentially expressed in insecticide resistance larvae
The RNA-seq data of P. xylostella insecticide-resistant populations were available on public databases33,39. We reanalyzed those data to determine the expression profiles of P. xylostella lincRNAs in insecticide-resistant larvae and compare them with the corresponding control groups to explore their differential expression patterns (Supplementary Files 2 and 3). The expression profiles of P. xylostella lincRNAs in insecticide-resistant larvae to two classes of insecticides, Fipronil (FR, from the phenylpyrazole family) and Chlorpyrfos (CR, an organophosphate), showed noticeable alteration when compared with those of susceptible larvae (Fig. 6). We also reanalyzed the P. xylostella larval midgut transcriptome data in two resistant strains (MK and GK) to Bacillus thuringiensis (Bt) endotoxin Cry1Ac to explore alterations in the expression profiles of P. xylostella lincRNAs in comparison with those of susceptible larvae (MM)39 (Fig. 6). Using strict criteria (Fold change above 4, P-value < 0.05 and presence of at least 10 reads in resistance or susceptible strains’ libraries), we found 358, 280, 169 and 191 lincRNA genes differentially expressed (DE) in CR, FR, GK and MK groups, respectively (Fig. 7A). The majority of DE lincRNAs were overexpressed (~70%) in insecticide-resistant populations; however, in Bt resistant strains almost 50% of DE lincRNAs were up-regulated. There were only three lincRNAs (lincRNAs 538, 3727 and 1382), which were noticeably altered in all the four resistant strains (Fig. 7B). We found that distinct sets of lincRNAs were commonly altered between the two insecticide and Bt endotoxin resistant strains, which suggests that this class of RNAs may act differently in cellular defense mechanisms against individual xenobiotic agents. In a previous study, it was shown that two genotoxic drugs, mitomycin C and doxorubicin, altered lncRNA expression profiles in HeLa cells but there were no commonly altered lncRNAs between these two genotoxic components due to their different mode of action18.
There is a growing body of evidence indicating that ncRNAs play significant roles in toxicogenomics56,57,58,59,60. However, there are still only few studies about the impact of xenobiotics on the expression of lncRNAs61. The gene expression profile of three well-characterized Chironomus riparius lncRNAs, telomeric repeats, Cla repetitive elements and the SINE CTRT1, were examined in response to various types of aquatic contaminants61. Transcription levels of telomeric repeats and Cla were increased after bisphenol A (BPA) and heavy metal cadmium (Cd) treatments61. A recent study in a mammalian cell line also introduced lncRNA as a new class of targets in neurotoxicity62. Their role in a few toxic-mediated neurological diseases has been studied but little is known regarding the potential function that lncRNAs may play in detoxification or other toxic related metabolisms.
Experimental validation of Px lincRNAs expression
To validate putative P. xylostella lincRNAs and their differential expression in insecticide-resistant and insecticide–susceptible larvae, we used qRT-PCR to measure the relative transcript levels of a number of randomly selected lincRNAs (Fig. 8). The majority of putative lincRNAs (lincRNA 93, 1046, 3128, 3380, 3727) showed significant overexpression in deltamethrin-resistant (Delta-R) larvae in comparison with the insecticide susceptible (Waite) larvae. All these up-regulated lincRNAs, except lincRNA 3128, showed considerable overexpression in other insecticide resistant populations based on transcriptome analyses (Supplementary Files 2 and 3). The overexpression of lincRNA 3727 was not only observed in all insecticide-resistant populations (ie. those resistant to deltamethrin, fipronil and chlorpyrifos), but also considerable enhancement was detected in both populations resistant to Bt endotoxins (Fig. 7B). Deep sequencing data analysis showed that lincRNA 623 was dramatically down-regulated in resistant larvae and we also found significant suppression of the lincRNA in Delta-R larvae (Fig. 8).
Response to insecticide exposure was also detected in some lincRNAs. When the deltamethrin-resistant (Delta-R) population was examined the transcript levels of lincRNA 1046 drastically dropped in deltamethrin treated larvae in comparison with larvae that were not exposed to the insecticide. Also significant overexpression response to direct treatment was observed in lincRNA 2998 (Fig. 8). Although the roles of the identified lincRNAs remain unknown, these genes have the potential to be involved in insecticide resistance development or detoxification pathways. Insecticide exposure may produce heritable modifications in gene expression that occur without a change in the DNA sequence, which are defined as epigenetic effects. Several epigenetic mechanisms, such as DNA methylation and histone modifications, can be caused by pesticides through lncRNA-mediated pathways63,64. A few studies have demonstrated the association of long and intermediate ncRNAs in epigenetic modification in distinct insect species65,66,67. For instance in B. mori, some intermediate ncRNAs are involved in the repression of transcription in silk gland by epigenetic modifications of histones67. Epigenetic effects of dichlorvos, fipronil and triazophos, members of different classes of insecticides, on miRNA profiles and mRNA gene expression of mammalian cells and other model animals such as zebrafish have also been studied56,68. Based on our knowledge of the current literature, our work is the first study to comprehensively investigate the larval lincRNA profiles in various types of insecticide resistant strains of P. xylostella.
Conclusions
In the current study, we provide a comprehensive list of lincRNAs from P. xylostella and show that their expression profile is changed in larvae resistant to three different classes of insecticides, an organophosphate, phenylpyrazole and Bt endotoxins, which have widely different modes of action (inactivate acetylcholinesterase, target GABA-gated chloride channel and lyse midgut cells, respectively). In addition, we observed significant alteration in expression of lincRNAs in insecticide-treated P. xylostella larvae, which we speculate may have direct or indirect links to detoxification pathways or stress responses. The general knowledge of the biological functions of lincRNAs in insecticide resistance and detoxification pathways is still limited, but when taken together these results provide further evidence to support the hypothesis that lincRNAs may play a role in insecticide resistance development. We believe, it is important to analyze the lincRNA’s responses in different species to a wider range of insecticides or other xenobiotics, which may help to explain some unknown role(s) of this class of ncRNAs in detoxification or other toxin related metabolisms.
Additional Information
How to cite this article: Etebari, K. et al. Genome wide discovery of long intergenic non-coding RNAs in Diamondback moth (Plutella xylostella) and their expression in insecticide resistant strains. Sci. Rep. 5, 14642; doi: 10.1038/srep14642 (2015).
References
Clark, M. B. & Mattick, J. S. Long noncoding RNAs in cell biology. Semin. Cell Dev. Biol. 22, 366–376 (2011).
Cabili, M. N. et al. Integrative annotation of human large intergenic noncoding RNAs reveals global properties and specific subclasses. Genes Dev. 25, 1915–1927 (2011).
Bonasio, R. & Shiekhattar, R. Regulation of transcription by long noncoding RNAs. Annu. Rev. Genet. 48, 433–455 (2014).
Ulitsky, I. & Bartel, D. P. lincRNAs: Genomics, evolution and mechanisms. Cell 154, 26–46 (2013).
Struhl, K. Transcriptional noise and the fidelity of initiation by RNA polymerase II. Nat. Struct. Mol. Biol. 14, 103–105 (2007).
Maenner, S., Mueller, M. & Becker, P. B. Roles of long, non-coding RNA in chromosome-wide transcription regulation: Lessons from two dosage compensation systems. Biochimie 94, 1490–1498 (2012).
Fitzgerald, K. A. & Caffrey, D. R. Long noncoding RNAs in innate and adaptive immunity. Curr. Opin. Immunol. 26, 140–146 (2014).
Lakhotia, S. C. Long non-coding RNAs coordinate cellular responses to stress. RNA 3, 779–796 (2012).
Humann, F. C., Tiberio, G. J. & Hartfelder, K. Sequence and expression characteristics of long noncoding RNAs in honey bee caste development- potential novel regulators for transgressive ovary size. PLoS One 8, e78915 (2013).
Soshnev, A. A. et al. Conserved long noncoding RNA affects sleep behavior in Drosophila. Genetics 189, 455–497 (2011).
Garmire, L. X. et al. A Global clustering algorithm to identify long intergenic non-coding RNA- with applications in mouse macrophages. PLoS One 6, e24051 (2011).
Stanojcic, S. et al. Correlation of LNCR rasiRNAs expression with heterochromatin formation during development of the holocentric insect Spodoptera frugiperda. PLoS One 6, e24746 (2011).
Zhang, S., Han, J., Zhong, D., Liu, R. & Zheng, J. Genome-wide identification and predictive modeling of lincRNAs polyadenylation in cancer genome. Comput. Biol. Chem. 52, 1–8, (2014).
Mercer, T. R., Dinger, M. E. & Mattick, J. S. Long non-coding RNAs: insights into functions. Nat. Rev. Genet. 10, 155–159 (2009).
Li, Y. et al. Connect the dots A systems level approach for analyzing the miRNA-mediated cell death network. Autophagy 9, 436–439 (2013).
Zhang, X. et al. RAID: a comprehensive resource for human RNA-associated (RNA-RNA/RNA-protein) interaction. RNA 20, 989–993 (2014).
Mulvey, B. B., Olcese, U., Cabrera, J. R. & Horabin, J. I. An interactive network of long non-coding RNAs facilitates the Drosophila sex determination decision. Biochim. Biophys. Acta 1839, 773–784 (2014).
Mizutani, R. et al. Identification and characterization of novel genotoxic stress-inducible nuclear long noncoding RNAs in mammalian cells. PLoS One 7, e34949 (2012).
Tani, H., Onuma, Y., Ito, Y. & Torimura, M. Long non-coding RNAs as surrogate indicators for chemical stress responses in human-induced pluripotent stem cells. PLoS One 9, e106282 (2014).
Huarte, M. et al. A Large intergenic noncoding RNA induced by p53 mediates global gene repression in the p53 response. Cell 142, 409–419 (2010).
Liu, J. et al. Genome-wide analysis uncovers regulation of long intergenic noncoding RNAs in Arabidopsis. Plant Cell 24, 4333–4345 (2012).
Chen, L. L. & Carmichael, G. G. Decoding the function of nuclear long non-coding RNAs. Curr. Opin. Cell Biol. 22, 357–364 (2010).
Kutter, C. et al. Rapid turnover of long noncoding RNAs and the evolution of gene expression. PLoS Genetics 8, e1002841 (2012).
Billerey, C. et al. Identification of large intergenic non-coding RNAs in bovine muscle using next-generation transcriptomic sequencing. BMC Genomics 15, 499 (2014).
Pauli, A. et al. Systematic identification of long noncoding RNAs expressed during zebrafish embryogenesis. Genome Res. 22, 577–591 (2012).
Marco, A. Regulatory RNAs in the light of Drosophila genomics. Brief Funct. Genomics 11, 356–365 (2012).
Padron, A. et al. In depth annotation of the Anopheles gambiae mosquito midgut transcriptome. BMC Genomics 15, 636 (2014).
Furlong, M. J., Wright, D. J. & Dosdall, L. M. Diamondback moth ecology and management: problems, progress and prospects. Annu. Rev. Entomol. 58, 517–541 (2013).
Zalucki, M. P. et al. Estimating the economic cost of one of the world’s major insect pests, Plutella xylostella (Lepidoptera: Plutellidae): Just how long is a piece of string? J. Econ. Entomol. 105, 1115–1129 (2012).
Etebari, K., Hussain, M. & Asgari, S. Identification of microRNAs from Plutella xylostella larvae associated with parasitization by Diadegma semiclausum. Insect Biochem. Mol. Biol. 43, 309–318 (2013).
Liang, P., Feng, B., Zhou, X. & Gao, X. Identification and developmental profiling of microRNAs in Diamondback moth, Plutella xylostella (L.). PLoS One 8, e78787 (2013).
You, M. et al. A heterozygous moth genome provides insights into herbivory and detoxification. Nat. Genet. 45, 220–225 (2013).
He, W. et al. Developmental and insecticide-resistant insights from the de novo assembled transcriptome of the diamondback moth, Plutella xylostella. Genomics 99, 169–177 (2012).
Etebari, K. et al. Deep sequencing-based transcriptome analysis of Plutella xylostella larvae parasitized by Diadegma semiclausum. BMC Genomics 12, 446 (2011).
Punta, M. et al. The Pfam protein families database. Nucleic Acids Res. 40, D290–D301 (2012).
Kong, L. et al. CPC: assess the protein-coding potential of transcripts using sequence features and support vector machine. Nucleic Acids Res. 35, W345–W349 (2007).
Wang, L. et al. CPAT: Coding-Potential Assessment Tool using an alignment-free logistic regression model. Nucleic Acids Res. 41, e74 (2013).
Mortazavi, A., Williams, B., McCue, K., Schaeffer, L. & Wold, B. Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat. Meth. 5, 621- 628 (2008).
Lei, Y. et al. Midgut transcriptome response to a Cry toxin in the diamondback moth, Plutella xylostella (Lepidoptera: Plutellidae). Gene 533, 180–187 (2014).
Baker, G. J. & Kovaliski, J. Detection of insecticide resistance in Plutella xylostella (L.) (Lepidoptera : Plutellidae) populations in South Australian crucifer crops. Aust. J. Entomol. 38, 132–134 (1999).
Broadbent, K. M. et al. A global transcriptional analysis of Plasmodium falciparum malaria reveals a novel family of telomere-associated lncRNAs. Genome Biol. 12, R56 (2011).
Clark, M. B. et al. Genome-wide analysis of long noncoding RNA stability. Genome Res. 22, 885–898 (2012).
Yoon, J. H., Abdelmohsen, K. & Gorospe, M. Posttranscriptional gene regulation by long noncoding RNA. J. Mol. Biol. 425, 3723–3730 (2013).
Nakagawa, S. et al. Malat1 is not an essential component of nuclear speckles in mice. RNA 18, 1487–1499 (2012).
Ulitsky, I., Shkumatava, A., Jan, C. H., Sive, H. & Bartel, D. P. Conserved function of lincRNAs in vertebrate embryonic development despite rapid sequence evolution. Cell 147, 1537–1550 (2011).
Souvorov, A. et al. Gnomon, NCBI eukaryotic gene prediction tool. http://www.ncbi.nlm.nih.gov/core/assets/genome/files/Gnomon-description.pdf (2010) accessed on 15.08.2015.
Esselink, G. D., Den Belder, E., Elderson, J. & Smulders, M. J. M. Isolation and characterization of trinucleotide repeat microsatellite markers for Plutella xylostella L. Mol. Ecol. Notes 6, 1246–1248 (2006).
Luke, B. & Lingner, J. TERRA: telomeric repeat-containing RNA. EMBO J. 28, 2503–2510 (2009).
Pezer, Z., Brajkovic, J., Feliciello, I. & Ugarkovic, D. Transcription of satellite DNAs in insects. In Progress in Molecular and Subcellular Biology (ed. Ugarkovic, D. ) 161–178 (Springer-Verlag Berlin 2011).
Qu, Z. & Adelson, D. L. Identification and comparative analysis of ncRNAs in human, mouse and zebrafish indicate a conserved role in regulation of genes expressed in brain. PLoS One 7, e52275 (2012).
Zhou, Z. Y. et al. Genome-wide identification of long intergenic noncoding RNA genes and their potential association with domestication in Pigs. Genome Biol. Evol. 6, 1387–1392 (2014).
Yoon, J. H., Abdelmohsen, K. & Gorospe, M. Functional interactions among microRNAs and long noncoding RNAs. Semin. Cell Dev. Biol. 34, 9–14 (2014).
Yoon, J. H. et al. LincRNA-p21 suppresses target mRNA translation. Mol. Cell 47, 648–655 (2012).
Imig, J. et al. miR-CLIP capture of a miRNA targetome uncovers a lincRNA H19-miR-106a interaction. Nat. Chem. Biol. 11, 107–143 (2015).
Cesana, M. et al. A long noncoding RNA controls muscle differentiation by functioning as a competing endogenous RNA. Cell 147, 358–369 (2011).
Wang, X., Zhou, S., Ding, X., Zhu, G. & Guo, J. Effect of triazophos, fipronil and their mixture on miRNA expression in adult zebrafish. J. Environ. Sci. Health B 45, 648–657 (2010).
Ross, J. A. et al. A potential microRNA signature for tumorigenic conazoles in mouse liver. Mol. Carcinog. 49, 320–323 (2010).
Izzotti, A. & Pulliero, A. The effects of environmental chemical carcinogens on the microRNA machinery. Int. J. Hyg. Environ. Health 217, 601–627 (2014).
Chanyshev, M. D., Kosorotikov, N. I., Titov, S. E., Kolesnikov, N. N. & Gulyaeva, L. F. Expression of microRNAs, CYP1A1 and CYP2B1 in the livers and ovaries of female rats treated with DDT and PAHs. Life Sci. 103, 95–100 (2014).
An, Y. R. et al. Analysis of microRNA and gene expression profiling in triazole fungicide-treated HepG2 cell line. Toxicology 303, 94–98 (2013).
Martinez-Guitarte, J. L., Planello, R. & Morcillo, G. Overexpression of long non-coding RNAs following exposure to xenobiotics in the aquatic midge Chironomus riparius. Aquat. Toxicol. 110, 84–90 (2012).
Tal, T. L. & Tanguay, R. L. Non-coding RNAs-Novel targets in neurotoxicity. Neurotoxicology 33, 530–544 (2012).
Collotta, M., Bertazzi, P. A. & Bollati, V. Epigenetics and pesticides. Toxicology 307, 35–41 (2013).
Joh, R. I., Palmieri, C. M., Hill, I. T. & Motamedi, M. Regulation of histone methylation by noncoding RNAs. Biochim. Biophys. Acta 1839, 1385–1394 (2014).
Deng, X. & Meller, V. H. roX RNAs are required for increased expression of X-linked genes in Drosophila melanogaster males. Genetics 174, 1859–1866 (2006).
Bonasio, R. & Nyas. Emerging topics in epigenetics: ants, brains and noncoding RNAs. Ann. N Y Acad. Sci. 1260, 14–23 (2012).
Li, D. D. et al. The expression analysis of silk gland-enriched intermediate-size non-coding RNAs in silkworm Bombyx mori. Insect Sci. 21, 429–438 (2014).
Li, S., Ran, X. Q., Xu, L. & Wang, J. F. microRNA and mRNA expression profiling analysis of Dichlorvos cytotoxicity in porcine kidney epithelial PK15 cells. DNA Cell Biol. 30, 1073–1083 (2011).
Acknowledgements
This project was funded by Horticulture Innovation Australia (VG13111) to S.A., M.J.F. and K.E.
Author information
Authors and Affiliations
Contributions
K.E. conceived and designed research, analysed the data and drafted the manuscript. M.J.F. drafted the manuscript. S.A. conceived the study and edited the manuscript. All authors read and approved the final manuscript.
Ethics declarations
Competing interests
The authors declare no competing financial interests.
Electronic supplementary material
Rights and permissions
This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
About this article
Cite this article
Etebari, K., Furlong, M. & Asgari, S. Genome wide discovery of long intergenic non-coding RNAs in Diamondback moth (Plutella xylostella) and their expression in insecticide resistant strains. Sci Rep 5, 14642 (2015). https://doi.org/10.1038/srep14642
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/srep14642
- Springer Nature Limited
This article is cited by
-
A de novo transcriptional atlas in Danaus plexippus reveals variability in dosage compensation across tissues
Communications Biology (2021)
-
Comprehensive Transcriptome Analyses in Sea Louse Reveal Novel Delousing Drug Responses Through MicroRNA regulation
Marine Biotechnology (2021)
-
Characterization and analysis of full-length transcriptomes from two grasshoppers, Gomphocerus licenti and Mongolotettix japonicus
Scientific Reports (2020)
-
Identification and Evaluation of Long Noncoding RNAs in Response to Handling Stress in Red Cusk-Eel (Genypterus chilensis) via RNA-seq
Marine Biotechnology (2020)
-
Comparative analysis of lincRNA in insect species
BMC Evolutionary Biology (2017)