Abstract
Alterations in the gene expression of organs in contact with the environment may signal exposure to toxins. To identify genes in lung tissue whose expression levels are altered by cigarette smoking, we compared the transcriptomes of lung tissue between 118 ever smokers and 58 never smokers. In all cases, the tissue studied was non-involved lung tissue obtained at lobectomy from patients with lung adenocarcinoma. Of the 17,097 genes analyzed, 357 were differentially expressed between ever smokers and never smokers (FDR < 0.05), including 290 genes that were up-regulated and 67 down-regulated in ever smokers. For 85 genes, the absolute value of the fold change was ≥2. The gene with the smallest FDR was MYO1A (FDR = 6.9 × 10−4) while the gene with the largest difference between groups was FGG (fold change = 31.60). Overall, 100 of the genes identified in this study (38.6%) had previously been found to associate with smoking in at least one of four previously reported datasets of non-involved lung tissue. Seven genes (KMO, CD1A, SPINK5, TREM2, CYBB, DNASE2B, FGG) were differentially expressed between ever and never smokers in all five datasets, with concordant higher expression in ever smokers. Smoking-induced up-regulation of six of these genes was also observed in a transcription dataset from lung tissue of non-cancer patients. Among the three most significant gene networks, two are involved in immunity and inflammation and one in cell death. Overall, this study shows that the lung parenchyma transcriptome of smokers has altered gene expression and that these alterations are reproducible in different series of smokers across countries. Moreover, this study identified a seven-gene panel that reflects lung tissue exposure to cigarette smoke.
Similar content being viewed by others
Introduction
Treatment of cell lines or laboratory animals with toxic chemicals alters their metabolism and induces a gene expression signature that reflects the effects of specific toxins or their mode of action. The analysis of alterations of gene expression of the whole transcriptome, following treatment with toxins, has originated a field of study called toxicogenomics1,2. The up- and down-regulated genes, following exposure to specific chemicals, are biomarkers of exposure but also provide evidence of the involved biochemical pathways and mechanisms of the toxic action3.
Tobacco combustion in cigarettes produces toxic compounds that have been shown to alter gene expression in directly exposed tissues, such as lung tissue4 and small airways5, and in indirectly exposed ones, such as blood6,7. In lung tissue, one study reported that cigarette smoke up-regulated cell cycle genes4. A more recent study8 identified 599 genes in lung tissue whose expression levels differed between never and current smokers, and while in former smokers these alterations had mostly normalized some genes remained up-regulated even 25 years after smoking cessation. Both these studies were carried out on non-involved lung tissue from patients with lung cancer, one of the most common diseases induced by tobacco smoking9. Smoking accounts for about 90% of all lung cancer cases10. Among the different lung cancer histotypes, lung adenocarcinoma is mainly observed in never smokers and former smokers11.
Understanding the molecular pathways in non-neoplastic lung tissue altered by smoking would provide clues to the pathogenicity of cigarette smoke. A way to reach this goal is identifying the signature of genes whose expression levels are modulated by cigarette smoking. We therefore performed a genome-wide transcriptome analysis on non-involved lung tissue from patients with lung adenocarcinoma, including 118 ever smokers and 58 never smokers, to investigate the differences in transcript levels and identify biochemical pathways altered by cigarette smoking.
Results
Lung tissue transcriptome distinguishes ever from never smokers
Microarray analysis of gene expression was done on 179 samples of non-involved lung tissue from 179 lung adenocarcinoma patients. For three samples (female ever smokers), microarray data were of poor quality, so these patients were excluded from analysis. Of the remaining 176 patients, 81 (46%) were female, 154 had pathological stage I cancer, and 58 were never smokers. The two groups of ever and never smokers had similar median ages at diagnosis: 66 years for ever smokers and 68 years for never smokers (Table 1). These two groups differed, instead, in the percentages of males and females, with relatively more females (71%) in the group of never smokers than in the ever smokers group (34%; P = 5.3 × 10−6, Fisher’s exact test). There was no significant association between group assignment and pathological stage (P = 0.57, Fisher’s exact test).
Microarray profiling provided data on 17,097 genes, including 15,525 protein-coding genes, 472 pseudo-genes, 742 non-coding RNAs, and 177 small nucleolar RNAs. Expression levels of these genes were compared between ever and never smokers to identify differentially expressed genes. Unsupervised clustering analysis of the samples according to their gene expression profiles did not distinguish the two groups (not shown). Instead, class comparison analysis revealed that 357 genes (2%) were differentially expressed with an FDR < 0.05 (Supplementary Table 1). Among these 357 genes, 290 were up-regulated and 67 were down-regulated in ever smokers. The absolute value of the fold change was ≥2 for 63 up-regulated and 22 down-regulated genes. The FGG gene had the widest difference in expression levels between groups (31.7-fold higher levels in ever than never smokers). The most down-regulated gene in ever smokers was CDHR3, whose levels were about 4-fold lower than those observed in never smokers.
At a statistical level of FDR < 0.01, there were 48 differentially expressed genes, and the top-ranking genes were MYO1A (FDR = 6.9 × 10−4, fold change = 1.42) and RRAGD (FDR = 6.9 × 10−4, fold change = 2.61), followed by CYP1B1 (FDR = 7.4 × 10−4, fold change = 3.84). The expression levels of these 48 genes are illustrated in Fig. 1, where they are sorted according to fold change from 31.69 (FGG) to 0.40 (EYA4).
To estimate the cellular composition of the analyzed non-involved lung tissue samples, based on their transcriptome profiles, we used the bioinformatic tool xCell. This analysis found that, among 64 possible cell types, there was significant enrichment for two stromal cell types (i.e. lymphatic endothelial cells, and microvascular endothelial cells) and four immune cell types (i.e. dendritic cells, neutrophils, regulatory T-cells, and immature dendritic cells) in at least 80% of patients (Supplementary Fig. 1). Comparison of never smokers with ever smokers showed that the immune cell enrichment scores were similar (Supplementary Fig. 2A; P = 0.09, Kruskal-Wallis non-parametric test). In contrast, the stromal cell enrichment score was significantly lower in ever smokers than in never smokers (Supplementary Fig. 2B; P = 0.0036). However, for none of the six enriched cell types was there a significant difference in frequency between ever and never smokers (not shown).
Cigarette smoke up-regulates inflammatory pathways
To identify molecular pathways in lung tissue affected by tobacco smoke, we analyzed the list of differentially expressed genes using the IPA online tool12. This analysis indicated that, among all the pathways defined in the IPA knowledge base, 52 were significantly associated with our gene dataset (Table 2). The top-ranking pathway was eicosanoid signaling (P = 1.7 × 10−4); seven of our differentially expressed genes encode proteins that participate in this pathway. Among the 52 identified pathways, 19 included at least five differentially expressed genes, with the sirtuin signaling pathway having the highest number of differentially expressed genes involved (n = 10). The majority of the identified pathways (n = 40) had differentially expressed genes that were all expressed at higher levels in ever smokers than in never smokers, suggesting that these pathways are up-regulated in people who smoke.
The analysis of gene interactions identified 14 networks with score >5 (Supplementary Table 2). The biological processes that recurred the most were inflammation, cell interactions, and cell death and survival. Different networks had only a few genes in common. The three networks that clustered the highest number of genes are represented in Fig. 2. Network 1 is composed of genes that have a role in the regulation of cell death and survival, cellular compromise, cell-to-cell signaling and interaction; 27 of our differentially expressed genes are included in this network. The genes in Network 2 are involved in immunological disease, dermatological diseases and conditions, and inflammatory disease; this network includes 24 of our differentially expressed genes. Network 3, instead, is composed of genes that have a role in infectious diseases, regulation of cellular movement, and immune cell trafficking; 23 differentially expressed genes are in this network.
Gene expression signatures overlap between studies
We next assessed the correspondence between our results and those from similar studies by Landi et al.4 and Bossé et al.8; this last study reported datasets from three independent series, hereafter called Laval, GRN and UBC. To this aim, we filtered our gene list (hereafter called IT) and the Laval, GRN and UBC lists according to the same criteria used by Landi et al. (P-value < 0.001 and absolute value of fold-change >1.5). As a result, we had 259 (IT), 591 (Laval), 113 (UBC), 129 (GRN), and 99 (Landi) genes for comparison.
The intersection of these five gene datasets is represented in the Venn diagram in Fig. 3. Overall, 100 of the 259 genes (38.6%) in the IT dataset were present in at least one of the other datasets, with 100% concordance in the direction of the effect of smoking on gene expression. In detail, 97, 23, 39 and 16 of the 259 IT genes were also differentially expressed in the Laval, UBC, GRN and Landi datasets, respectively. Interestingly, seven genes (KMO, CD1A, SPINK5, TREM2, CYBB, DNASE2B, FGG) were found to be differentially expressed between ever and never smokers in all five datasets, with concordant higher expression in ever smokers than in never smokers (Supplementary Table 3). Technical validation of the expression data of these seven genes was carried out by quantitative PCR in 54 patients from our series (27 ever and 27 never smokers). These data correlated positively with expression levels measured by microarray analysis (Supplementary Fig. 3). These seven genes were therefore considered to constitute a gene signature of smoking in non-involved lung tissue of patients with lung adenocarcinoma.
The gene signature is not unique to lung tissue from cancer patients
To determine if the seven-gene signature of smoking is specific to lung cancer patients or is also valid in non-cancer patients, we obtained the GSE47460 dataset of gene expression in lung tissue from 254 patients with interstitial lung disease, 220 patients with chronic obstructive lung disease (COPD), and 108 persons who had surgery for a suspicious lung nodule but were found to not have any lung disease. Of these 582 non-cancer cases, 23 were classified as current smokers, 400 as former smokers, and 136 as never smokers; for the remaining 23 cases information on smoking habit was not available.
Class comparison analysis was used to identify differentially expressed genes between: (i) ever smokers (current and former smokers, n = 423) and never smokers; and (ii) current and never smokers. These analyses identified nine and 1160 differentially expressed genes, respectively, at FDR < 0.05 (not shown). Examination of the first gene list (ever vs. never smokers) revealed that one of our seven signature genes, FGG, was also upregulated in ever smokers without cancer (Supplementary Table 4). Examination of the second gene list (current vs. never smokers) revealed that all genes of our signature of smoking, except SPINK5, were upregulated in current smokers without cancer. This analysis suggests that the seven-gene signature of smoking is not unique to lung cancer patients.
Discussion
This study compared the transcriptional landscape of non-involved lung tissue between ever and never smokers surgically treated for lung adenocarcinoma. We found that 357 genes significantly associated with smoking habit. Most of them were involved in biological processes such as inflammation, cell interactions, and cell death and survival. Almost 40% of our gene set overlapped with genes found to be differentially expressed in at least one of four published datasets4,8. Moreover, the effect of smoking on the transcription of these 100 genes was concordant between our study and the others, namely smoking always induced their up-regulation. Seven of these genes are up-regulated in five datasets, from this study and two others4,8.
Most of the genes that we found to be differentially expressed between ever and never smokers participate in pathways or networks that are associated with immunity, inflammation, or cell survival processes. This is congruent with an alteration of the protective mechanisms of the lung tissue induced by the toxic effect of cigarette smoking13. Of note, the most significantly enriched pathway was that of eicosanoid signaling, which produces lipids such as prostaglandins and leukotrienes that are important in initiating inflammation14. Up-regulation in lung tissue of genes in the eicosanoid pathway is concordant with the pro-inflammatory effects of smoke15 and is supported by experimental evidence. One study showed that human lung fibroblasts, when treated in vitro with cigarette smoke extract, up-regulated expression of COX2 (the gene that encodes cyclooxygenase 2), leading to high levels of prostaglandin E216. Two longitudinal studies of volunteers on a smoking-cessation program found that cigarette smoking increased systemic eicosanoid synthesis, resulting in abnormally high urine levels, and that these levels normalized upon smoking cessation (in those volunteers who did not use nicotine replacement therapy)17,18.
Among the seven genes found to be up-regulated by smoke in our study and in two previous reports4,8 is KMO, which codes for mitochondrial kynurenine 3-monooxygenase. This enzyme eliminates kynurenine, a byproduct of tryptophan metabolism, and thereby prevents its conversion to kynurenic acid19. This latter molecule was shown to be an antagonist of the α7 nicotinic acetylcholine receptor20, which is involved in the rewarding effects of nicotine21. It has recently been demonstrated that inhibition of kynurenine 3-monooxygenase decreased nicotine self-administration in animal models22. Our finding of a significant up-regulation of KMO gene in lung tissue of smokers is in agreement with these in vivo data, and suggests that the KMO enzyme might be a biomarker of smoking addiction in the lungs of smokers.
Another gene up-regulated in our study and the two previous studies is TREM2, which encodes TREM-2, a protein involved in tyrosine kinase-mediated membrane signaling. A recent preclinical study suggested that TREM-2 might be a negative immune regulator in lung cancer23.
The other five genes of our signature currently have only limited relevance to lung cancer. DNASE2B, which is part of the DNase II family of DNases, is expressed in the lung24; however, its role in lung cancer is not yet characterized. FGG, which encodes the gamma chain of fibrinogen, is down-regulated in the epithelial-to-mesenchymal transition, an important mechanism in cancer metastasis25. CYBB encodes the beta chain of cytochrome B-245; germline mutations in this gene are associated with chronic granulomatous disease, a hereditary immunodeficiency syndrome26. CD1A, which codes for a member of the CD1 transmembrane proteins, is abundantly expressed on Langerhans cells and is involved in inflammatory skin diseases27. SPINK5 encodes a serine peptidase inhibitor; germline missense mutations in this gene have been associated with allergic conditions and atopic manifestations28.
Besides these seven genes, one other gene was found to be up-regulated in ever smokers in our study and in the two previous studies4,8, but below statistical threshold in the GRN series. This gene is CYP1B1, encoding cytochrome P450 family 1 subfamily B member 1. CYP1B1, together with CYP1A1, CYP3A4 and CYP3A5, is involved in the metabolism of pro-carcinogens contained in cigarette smoke. The induction of CYP1B1 expression may be due to a feedback mechanism stimulated by exposure to cigarette smoke29.
Several studies have already documented the effects of cigarette smoking on lung gene transcription in otherwise healthy persons and in patients with various lung diseases. Analysis of one published dataset of gene expression in lung tissue from non-cancer patients (GSE47460) allowed us to validate six of the seven genes of our signature of smoking as being up-regulated in current smokers even in the absence of a diagnosis of cancer. This result indicates that this gene panel is not unique to patients with lung cancer. Because the number of current smokers in GSE47460 is small (n = 23), further studies of non-cancer patients are needed. Smoke-induced changes in gene expression have also been found in small airway epithelium29,30,31, but none of the smoking-responsive genes identified in those studies are part of our seven-gene signature. This discrepancy may be due to the known differences in transcriptome profiles between bronchial epithelial cells and lung parenchyma32, and suggests that our gene panel is particular to the effects of smoke on lung parenchyma.
In the comparison of our data with those of Bossé et al.8 and Landi et al.4, differences were noted in the microarray platforms, covariates in the statistical analyses, and tumor histotype of patients whose non-involved lung tissue was analyzed (we and Landi et al. included only adenocarcinoma patients whereas Bossé et al. included patients with other lung cancer histotypes). These other studies compared the transcriptomes between never, former, and current smokers. Unfortunately we could not distinguish between current and former smokers, and so classified them as ever smokers. This difference is an interesting topic for future study, considering that it has been postulated that the long-term risk of developing several smoke-related diseases after smoking cessation is due to an alteration of epigenetic mechanisms that in turn could modify gene expression33. However, two recent epigenome-wide studies of lung tissue did not find differences in methylated CpG islands between smokers and non-smokers at any of the seven genes whose expression in non-involved lung tissue we found to associate with smoking status34,35.
A possible limitation of our study is the unequal group sizes, which can be attributed to the fact that most lung cancer patients are smokers or former smokers. Although we studied fewer never smokers than ever smokers, our group of never smokers is relatively large compared to that of similar studies4,8.
Among the genes differentially expressed in the non-involved lung tissue between ever and never smokers with lung cancer, we observed an up-regulation of genes expressed in immune system cells. This finding is congruent with the inflammatory condition that characterizes the lung tissue of smokers, and suggests that anti-inflammatory agents may reduce smoking-induced damage to lung tissue and, consequently, the risk of smoking-induced diseases. However, ever smokers taking non-steroidal anti-inflammatory drugs experience only a modest reduction of lung cancer risk that is apparent only in men36,37.
Overall, this study builds on two previous studies to identify a seven-gene panel that should be a solid and reproducible biomarker of smoking effects on lung tissue. This tool should facilitate the development and monitoring (in animal or in vitro models) of novel therapies that, by contrasting smoking effects, may be able to reduce the risk of cancer and of other diseases associated with smoking habit.
Methods
Population series and biological material
The study analyzed data and tissue samples from 179 patients who had lobectomy for lung adenocarcinoma at the Fondazione IRCCS Istituto Nazionale dei Tumori, Fondazione IRCCS Ca’ Granda Ospedale Maggiore Policlinico, and Ospedale San Giuseppe (all in Milan, Italy), between 1992 and 2017. The Committees for Ethics of the of the institutes involved in recruitment (Fondazione IRCCS Istituto Nazionale dei Tumori, Fondazione IRCCS Ca’ Granda Ospedale Maggiore Policlinico, and Ospedale San Giuseppe) approved the protocol for collecting samples and clinical data. Written informed consent had been obtained from the patients at recruitment to use their biological material and data for research purposes. The research was conducted in accordance with the tenets of the Declaration of Helsinki. All methods were performed in accordance with the relevant guidelines and regulations.
From each patient, we obtained a sample of non-involved (apparently normal) lung tissue collected as far as possible from the tumor site, i.e., from the so-called left-over material of surgical treatment. Tissue samples were flash-frozen or immersed in RNAlater stabilization solution (Thermo Fisher Scientific) and stored at −80 °C or −20 °C, respectively. Clinical data for each patient regarded sex, age at diagnosis, pathological stage and self-reported habit regarding the smoking of tobacco-containing cigarettes. Smoking habit was recorded as either “ever” if they had ever smoked in their life (independently of whether they were currently smoking) or “never” if they had never smoked in their life.
Gene expression analysis
Total RNA was extracted from lung tissue and subjected to gene expression profiling on Illumina HumanHT-12 v4 Expression BeadChips as previously reported38. Microarray raw data were log2-transformed and normalized using the robust spline normalization method implemented in the lumi Bioconductor package39. Normalized data were additionally adjusted for batch effects using ComBat40. Corrected data were then collapsed from probe level to gene level by selecting, for each gene, the probe with the highest mean intensity across samples. Finally, the dataset was filtered by removing genes not expressed in any samples.
Unsupervised clustering analysis was performed using hierarchical clustering considering the top-ranking 1500, 1000, and 500 variable genes across samples according to inter-quartile range. Hierarchical clustering was applied using 1-Pearson’s correlation coefficient as distance metric and average linkage parameter. In a class comparison analysis, genes that were differentially expressed between ever smokers and never smokers were identified using the linear modeling approach with empirical Bayes moderation implemented in the limma package41. This analysis was performed considering sex as a covariate. Heatmaps of differentially expressed genes were made using the heatmap.2 function of the gplots package.
P-values were corrected for multiple testing using the Benjamini-Hochberg false discovery rate (FDR) method. Genes with an FDR < 0.05 were considered differentially expressed. Fold change in gene expression was calculated as the ratio of ever smokers to never smokers. Gene expression data were deposited in the Gene Expression Omnibus database (GEO, http://www.ncbi.nlm.nih.gov/geo/) with accession number GSE123352.
xCell analysis
The cellular composition of lung tissue samples was estimated from gene expression data using the online tool xCell tool using default parameters42 (http://xcell.ucsf.edu/). The tool produces, for each input sample (in this study, one transcriptome per patient), a score of enrichment for 64 different cell types. It also generates an immune cell enrichment score (considering B cells, CD4+ T-cells, CD8+ T cells, dendritic cells, eosinophils, macrophages, monocytes, mast cells, neutrophils, and NK cells) and a stromal cell enrichment score (considering adipocytes, endothelial cells, and fibroblasts). It also gives a P-value for the enrichment scores, which we set as <0.05 for the significance threshold. Normalized cell type enrichment scores (calculated, for each cell type, as the ratio between the enrichment score of each patient and the maximum value among all patients) were plotted using Heatmap3 in R environment. Enrichment scores were compared between never and ever smokers, for cell types having significant enrichment values in at least 80% of patients and for immune cells and stromal cells, using the Kruskal-Wallis test.
Pathway analysis
To identify biological pathways whose activity could be altered by the effects of smoking on gene transcription, we used the Ingenuity Pathway Analysis (IPA) web-based software (Qiagen)12 with default parameters and selecting Human HT-12v4.0 array as the platform. Pathways were considered significantly affected if a right-tailed Fisher’s exact test P-value was <0.05. Networks were considered relevant if they had a score >5.
Study comparison
To compare our transcriptome results with those of similar previous studies, we downloaded Supplementary Table 1 of Bossé et al.8 (http://cancerres.aacrjournals.org/content/72/15/3753) and Supplementary Table 3 of Landi et al.4 (http://dx.plos.org/10.1371/journal.pone.0001651). Both tables report fold changes in gene transcription between current and never smokers in non-involved lung tissue from lung cancer patients. Only those transcripts for which a gene symbol was available were considered here. Because Landi et al.4 reported data filtered on the basis of a P-value < 0.001 and an absolute value of fold change >1.5, we applied the same criteria to our results and to those of Bossé et al.8 in order to have comparable data for analysis. Venn diagrams were drawn using the InteractiVenn online tool43.
To assess the generalizability of our findings to lung tissue outside of the setting of lung cancer, we downloaded the normalized gene expression data from GSE47460 dataset. We carried out class comparison analyses as described above in the section “Gene expression analysis”, using sex and disease type (interstitial lung disease, COPD, or none) as covariates. This analysis was done between ever and never smokers, and between current and never smokers.
Quantitative PCR
To technically validate the expression levels of the seven genes found to be differentially expressed in this study and in those of Bossé et al. and Landi et al.4,8, we did quantitative PCR in a subset of 54 patients (27 ever smokers and 27 never smokers) from this study. Total RNA (400 ng) was reverse-transcribed to cDNA using Superscript IV Vilo Master Mix (Thermo Fisher Scientific) and analyzed using TaqMan Gene Expression Assays (KMO: Hs00175738_m1, CD1A: Hs00381751_m1, SPINK5: Hs00199260_m1, TREM2: Hs00219132_m1, CYBB: Hs00166163_m1, DNASE2B: Hs00998752_m1, FGG: Hs00241037_m1, Thermo Fisher Scientific) and TaqMan Fast Advanced Master Mix (Thermo Fisher Scientific). Each reaction used 20 ng cDNA as template in a final volume of 10 ul. The human hypoxanthine phosphoribosyltransferase 1 (HPRT1: Hs99999909_m1) gene was used to normalize expression data. Reactions were run in duplicate on an ABI 7900HT platform (Life Technologies). Relative quantities of mRNA levels were assessed using the comparative cycle threshold (Ct) method and calculated with respect to an RNA sample extracted from non-involved lung tissue of a single patient, used as calibrator.
Statistical analyses
Clinical characteristics between ever and never smokers were compared as follows: for age at diagnosis (a time-to-event variable), we used multivariable Cox proportional hazard regression, with sex as covariate, stratifying by decade of birth year44. For sex and pathological stage, Fisher’s exact test was used. Correlations between expression levels determined by microarray and by quantitative PCR were evaluated with Pearson’s test. xCell enrichment scores were compared using the non-parametric Kruskal-Wallis test. All statistical analyses were carried out using R software. A value of P < 0.05 was taken to indicate statistical significance.
References
Nuwaysir, E. F., Bittner, M., Trent, J., Barrett, J. C. & Afshari, C. A. Microarrays and toxicology: the advent of toxicogenomics. Mol. Carcinog. 24, 153–159 (1999).
Orphanides, G. Toxicogenomics: challenges and opportunities. Toxicol. Lett. 140-141, 145–148 (2003).
Davis, A. P. et al. The Comparative Toxicogenomics Database: update 2019. Nucleic Acids Res. (2018).
Landi, M. T. et al. Gene expression signature of cigarette smoking and its role in lung adenocarcinoma development and survival. PLoS One 3, e1651 (2008).
Harvey, B. G. et al. Modification of gene expression of the small airway epithelium in response to cigarette smoking. J. Mol. Med. 85, 39–53 (2007).
Vink, J. M. et al. Differential gene expression patterns between smokers and non-smokers: cause or consequence? Addict. Biol. 22, 550–560 (2017).
Huan, T. et al. A whole-blood transcriptome meta-analysis identifies gene expression signatures of cigarette smoking. Hum. Mol. Genet. 25, 4611–4623 (2016).
Bosse, Y. et al. Molecular signature of smoking in human lung tissues. Cancer Res. 72, 3753–3763 (2012).
Malhotra, J., Malvezzi, M., Negri, E., La Vecchia, C. & Boffetta, P. Risk factors for lung cancer worldwide. Eur. Respir. J. 48, 889–902 (2016).
Islami, F. et al. Proportion and number of cancer cases and deaths attributable to potentially modifiable risk factors in the United States. CA Cancer. J. Clin. 68, 31–54 (2018).
Wakelee, H. A. et al. Lung cancer incidence in never smokers. J. Clin. Oncol. 25, 472–478 (2007).
Kramer, A., Green, J., Pollard, J. Jr & Tugendreich, S. Causal analysis approaches in Ingenuity Pathway Analysis. Bioinformatics 30, 523–530 (2014).
Zhou, Z., Chen, P. & Peng, H. Are healthy smokers really healthy? Tob Induc Dis. 14, 35-016-0101-z. eCollection 2016 (2016).
von Moltke, J. et al. Rapid induction of inflammatory lipid mediators by the inflammasome in vivo. Nature 490, 107–111 (2012).
Rom, O., Avezov, K., Aizenbud, D. & Reznick, A. Z. Cigarette smoking and inflammation revisited. Respir. Physiol. Neurobiol. 187, 5–10 (2013).
Martey, C. A. et al. Cigarette smoke induces cyclooxygenase-2 and microsomal prostaglandin E2 synthase in human lung fibroblasts: implications for lung inflammation and cancer. Am. J. Physiol. Lung Cell. Mol. Physiol. 287, L981–91 (2004).
Goettel, M., Niessner, R., Scherer, M., Scherer, G. & Pluym, N. Analysis of Urinary Eicosanoids by LC-MS/MS Reveals Alterations in the Metabolic Profile after Smoking Cessation. Chem. Res. Toxicol. 31, 176–182 (2018).
Saareks, V., Ylitalo, P., Alanko, J., Mucha, I. & Riutta, A. Effects of smoking cessation and nicotine substitution on systemic eicosanoid production in man. Naunyn Schmiedebergs Arch. Pharmacol. 363, 556–561 (2001).
Smith, J. R., Jamie, J. F. & Guillemin, G. J. Kynurenine-3-monooxygenase: a review of structure, mechanism, and inhibitors. Drug Discov. Today 21, 315–324 (2016).
Hilmas, C. et al. The brain metabolite kynurenic acid inhibits alpha7 nicotinic receptor activity and increases non-alpha7 nicotinic receptor expression: physiopathological implications. J. Neurosci. 21, 7463–7473 (2001).
Levin, E. D. et al. Nicotinic alpha7- or beta2-containing receptor knockout: effects on radial-arm maze learning and long-term nicotine consumption in mice. Behav. Brain Res. 196, 207–213 (2009).
Secci, M. E. et al. Attenuating Nicotine Reinforcement and Relapse by Enhancing Endogenous Brain Levels of Kynurenic Acid in Rats and Squirrel Monkeys. Neuropsychopharmacology 42, 1619–1629 (2017).
Yao, Y. et al. TREM-2 serves as a negative immune regulator through Syk pathway in an IL-10 dependent manner in lung cancer. Oncotarget 7, 29620–29634 (2016).
Keyel, P. A. Dnases in health and disease. Dev. Biol. 429, 1–11 (2017).
Wang, H. et al. A systematic approach identifies FOXA1 as a key factor in the loss of epithelial traits during the epithelial-to-mesenchymal transition in lung cancer. BMC Genomics 14, 680–2164-14-680 (2013).
Eguchi, M. et al. Exon skipping in CYBB mRNA and skewed inactivation of X chromosome cause late-onset chronic granulomatous disease. Pediatr. Hematol. Oncol. 35, 341–349 (2018).
Kim, J. H. et al. CD1a on Langerhans cells controls inflammatory skin disease. Nat. Immunol. 17, 1159–1166 (2016).
Chavanas, S. et al. Mutations in SPINK5, encoding a serine protease inhibitor, cause Netherton syndrome. Nat. Genet. 25, 141–142 (2000).
Hackett, N. R. et al. RNA-Seq quantification of the human small airway epithelium transcriptome. BMC Genomics 13, 82-2164-13-82 (2012).
Beane, J. et al. Characterizing the impact of smoking and lung cancer on the airway transcriptome using RNA-Seq. Cancer. Prev. Res. (Phila) 4, 803–817 (2011).
Tilley, A. E. et al. Biologic phenotyping of the human small airway epithelial response to cigarette smoking. PLoS One 6, e22798 (2011).
Lonergan, K. M. et al. Identification of novel lung genes in bronchial epithelium by serial analysis of gene expression. Am. J. Respir. Cell Mol. Biol. 35, 651–661 (2006).
Joehanes, R. et al. Epigenetic Signatures of Cigarette Smoking. Circ. Cardiovasc. Genet. 9, 436–447 (2016).
Stueve, T. R. et al. Epigenome-wide analysis of DNA methylation in lung tissue shows concordance with blood studies and identifies tobacco smoke-inducible enhancers. Hum. Mol. Genet. 26, 3014–3027 (2017).
Sundar, I. K. et al. DNA methylation profiling in peripheral lung tissues of smokers and patients with COPD. Clin. Epigenetics 9, 38-017-0335-5. eCollection 2017 (2017).
Baik, C. S. et al. Nonsteroidal Anti-Inflammatory Drug and Aspirin Use in Relation to Lung Cancer Risk among Postmenopausal Women. Cancer Epidemiol. Biomarkers Prev. 24, 790–797 (2015).
McCormack, V. A. et al. Aspirin and NSAID use and lung cancer risk: a pooled analysis in the International Lung Cancer Consortium (ILCCO). Cancer Causes Control 22, 1709–1720 (2011).
Galvan, A. et al. Gene expression signature of non-involved lung tissue associated with survival in lung adenocarcinoma patients. Carcinogenesis 34, 2767–2773 (2013).
Du, P., Kibbe, W. A. & Lin, S. M. lumi: a pipeline for processing Illumina microarray. Bioinformatics 24, 1547–1548 (2008).
Johnson, W. E., Li, C. & Rabinovic, A. Adjusting batch effects in microarray expression data using empirical Bayes methods. Biostatistics 8, 118–127 (2007).
Phipson, B., Lee, S., Majewski, I. J., Alexander, W. S. & Smyth, G. K. Robust Hyperparameter Estimation Protects Against Hypervariable Genes and Improves Power to Detect Differential Expression. Ann. Appl. Stat. 10, 946–963 (2016).
Aran, D., Hu, Z. & Butte, A. J. xCell: digitally portraying the tissue cellular heterogeneity landscape. Genome Biol. 18, 220-017-1349-1 (2017).
Heberle, H., Meirelles, G. V., da Silva, F. R., Telles, G. P. & Minghim, R. InteractiVenn: a web-based tool for the analysis of sets through Venn diagrams. BMC Bioinformatics 16, 169-015-0611-3 (2015).
Dragani, T. A., Colombo, F., Pavlisko, E. N. & Roggli, V. L. Malignant mesothelioma diagnosed at a younger age is associated with heavier asbestos exposure. Carcinogenesis 39, 1151–1156 (2018).
Acknowledgements
Valerie Matarese provided scientific editing. Elisa Scalvini performed qPCR for technical validation of expression data. This work was supported in part by a grant from the Italian Association for Cancer Research (AIRC, grant no. IG 20226).
Author information
Authors and Affiliations
Contributions
T.A.D. and F.C. conceived the study. M.I., D.T. and L.S. provided biological samples from lung adenocarcinoma patients. S.N. and A.P. prepared RNA samples. L.D.C. performed transcriptome analysis. D.M. and M.D. were involved in data analysis. G.P., F.C. and T.A.D. were involved in experimental design and manuscript preparation. All authors participated in critical revision of the article and gave their final approval of the submitted version.
Corresponding author
Ethics declarations
Competing Interests
The authors declare no competing interests.
Additional information
Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Pintarelli, G., Noci, S., Maspero, D. et al. Cigarette smoke alters the transcriptome of non-involved lung tissue in lung adenocarcinoma patients. Sci Rep 9, 13039 (2019). https://doi.org/10.1038/s41598-019-49648-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-019-49648-2
- Springer Nature Limited
This article is cited by
-
Analysis of genomic and transcriptomic variations as prognostic signature for lung adenocarcinoma
BMC Bioinformatics (2020)