Abstract
Based on the severity of liver fibrosis, low or high-risk profile of developing end-stage liver disease was present in nonalcoholic fatty liver disease (NAFLD). However, the mechanisms inducing transition from mild to advanced NAFLD are still elusive. We performed a system-level study on fibrosing-NAFLD by weighted gene co-expression network analysis (WGCNA) to identify significant modules in the network, and followed by functional and pathway enrichment analyses. Moreover, hub genes in the module were analyzed by network feature selection. As a result, fourteen distinct gene modules were identified, and seven modules showed significant associations with the status of NAFLD. Module preservation analysis confirmed that these modules can also be found in diverse independent datasets. After network feature analysis, the magenta module demonstrated a remarkably correlation with NAFLD fibrosis. The top hub genes with high connectivity or gene significance in the module were ultimately determined, including LUM, THBS2, FBN1 and EFEMP1. These genes were further verified in clinical samples. Finally, the potential regulators of magenta module were characterized. These findings highlighted a module and affiliated genes as playing important roles in the regulation of fibrosis in NAFLD, which may point to potential targets for therapeutic interventions.
Similar content being viewed by others
Introduction
Nonalcoholic fatty liver disease (NAFLD) represents a wide spectrum of disorders ranging from simple steatosis (nonalcoholic fatty liver, NAFL), nonalcoholic steatohepatitis (NASH), to cirrhosis or hepatocellular carcinoma (HCC). To date, NAFLD is one of the most common types of liver disease in the world1.
The diagnosis of NAFLD rests on clinicopathological criteria, requiring both clinical and biopsy-based information. The histological findings are graded as fatty changeand necroinflammatory using NAFLD activity score (NAS) scoring system, while fibrosis staging is useful to assess the severity and underlying cause of liver disease2.Challenges still lies in a lacking consensus for the classification of fatty liver disease, and absence of a uniform histological definition of NAFLD3.
Correspondingly, the phenotype and outcome of NAFLD is quite heterogeneous. For instance, though most NAFLD patients do not develop clinically significant hepatic disease, some patients can progress to cirrhosis, leading ultimately to HCC4. In addition, not all individuals with NASH finally develop cirrhosis or liver cancer5. On the contrary, although the severity of NASH generally correlates with the stage of fibrosis, some individuals with advanced fibrosis have relatively little NASH6. Moreover, the stage of fibrosis on liver biopsy independently associates with liver-related mortality7. When advanced fibrosis is present, absence of NASH is no longer prognostic. For these reasons,recent clinical data focus increasing attention on determining fibrosis, as it is a strong indicator of the risk extent for NAFLD8. Recently, our group have shown that fibrosis score is a useful predictor of long-term outcome in NAFLD patients9. Mechanisms leading to strongly differing progression of NAFLD, in term of fibrosis, have to be elucidated.
The high-throughput technology such as gene expression profiling has been applied to NAFLD and provides insights into molecular aspects of NAFLD progression5. However, gene lists based on differential expression analysis methods are biased against genes with large changes in expression, while lacking the consideration of the relationship between changing genes as a whole. In this sense, biological networks represent valuable tools for understanding system-level properties10. One network approach, named weighted gene co-expression network analysis (WGCNA) allows for the grouping functionally related genes into modules11.It is believed that modules are stable units since the overall function of a module can remain the same while individual gene expression can be changed or replaced by other genes with similar redundant functions12. Therefore, functional modules can more effectively reveal consistent differences during NAFLD progression.
In the present study, we applied a WGCNA approach to quantitatively assess the traits of mild or advanced fibrosing-NAFLD. Genome-scale modules of co-expressed genes with clear functional annotations were identified.
Results
Weighted co-expression network construction
We used GSE49541 dataset in this study, since it contained the relatively largest NAFLD samples with clear fibrosis staging. After preprocessing the data, we applied the WGCNA package to compile the network. One outlier sample (GSM789152) was eliminated using hierarchical average linkage clustering (Supplementary File 1, Fig. S1). Keeping to the scale-free topology criterion, β = 5 was considered in this study (Fig. 1).
Following dynamic tree cut, the hierarchical clustering dendrogram identified 15 distinct gene modules, as shown in Fig. 1b. 348 genes failed to fit within a distinct group and were assigned to the grey module. The grey module was ignored in this study. The size of modules ranged from 38 (cyan module) to 2514 (turquoise module) genes. All attributes of genes and samples were shown in Supplementary File 2, Table S1–S2.
When compared with the mild samples, a total of 1134 differentially expressed genes (DEGs) were screened from the advanced NAFLD samples, including 762 upregulated and 371 downregulated genes. Consistent with the earlier research13, WGCNA modeling using DEGs could not match the scale-free feature of the network (Supplementary File 1, Fig. S2). DEG enrichment in each module was shown in Fig. 1c, in which DEG was mostly enriched in magenta module, and followed by black and blue module.
Identification of meta-modules associated with NAFLD severity
Next, we evaluated the relationship between each module and NAFLD status by correlating the eigengenes for each module with the fibrosis trait. Seven modules showed association evidence with p < 0.05(Fig. 2). Among them, five modules (tan, green, yellow, cyan, magenta) were positively correlated with fibrosis, thereafter named fibrosing-NAFLD modules. Two negatively correlated modules (blue, black) named NAFLD modules thereafter.
Module-module relationship, also called meta-module, is the groups of correlated eigengenes with correlation of eigengenes > 0.5. As shown in Fig. 2c,d, the dendrogram indicates that module mayanta is highly correlated with NAFLD fibrosis. While black and blue modules are highly related, their mutual correlations are stronger than their correlations with fibrosis trait.
Stability and preservation of co-expression modules
To test the stability of the identified modules, internal analysis by repeating network construction and module identification on expression data that consists of resampled sets of the original dataset was performed14. The result proved the robustness of module assignments (Supplementary File 1, Fig. S3).
To ask if the identified modules were common in different datasets, an independent validation was performed. We retrieved 8 datasets relevant to NAFLD. All samples were from human liver tissue. Fibrosing-NAFLD modules (tan, green, yellow, cyan, magenta) were stable across E-MEXP-3291, GSE48452, and GSE59045. To examine if these identified modules were also presented in fibrosing liver disease such as HBV, GSE84044 dataset containing 124 samples with different stages of fibrosis was used as a specific control (Fig. 3).
In addition, NAFLD modules (blue, black) were preserved in other 5 datasets (GSE17470, GSE24807, GSE37031, GSE46300, and GSE63067) according to the summary preservation statistics, while fibrosing-NAFLD modules showed weak to none evidence for module preservation (Supplementary File 1, Fig. S4).
Functional enrichment analysis of the gene modules of interest
Gene ontology annotation and enrichment analysis were accomplished using DAVID version 6.8 (https://david-d.ncifcrf.gov/)15. Top biological processes and KEGG pathway in each module was shown in Table 1.
Globally, top 5 biological processes were enriched in the modules of interest, including small molecule metabolic process (blue, FDR = 1.10E-35), extracellular matrix organization (magenta, FDR = 2.50E-10), gluconeogenesis (blue, FDR = 2.70E-06), extracellular matrix disassembly (magenta, FDR = 7.31E-06), respiratory electron transport chain (black, FDR = 1.30E-04). Top 5 enriched pathways were as follows: hsa01100: Metabolic pathways (FDR = 7.81E-15) in the blue module, hsa01200: Carbon metabolism (FDR = 8.11E-11) in the blue module, hsa04510: Focal adhesion (FDR = 2.91E-08) in the magenta module, hsa04512: ECM-receptor interaction (FDR = 1.91E-06) in the magenta module, and hsa04932: Non-alcoholic fatty liver disease (NAFLD)(FDR = 4.19E-03) in the black module. The complete annotation for each module was provided in Supplementary File 2, Table S3–S4.
Network analysis of the gene modules of interest
After viewing the global properties of the interesting networks, we next examined the gene constitution of particular modules based on network unique properties, such as gene significance (GS), module membership (MM) and intramodular connectivity (K.in).
Abstractly speaking, a gene is more meaningful with high GS, MM and K.in16. Thus, if MM, K.in or GS of the specific module were significantly connected and associated with the NAFLD fibrosis status, it implied that the module serves an more important biological role to NAFLD progression.
Of the seven interesting modules, significant correlations were observed between MM and GS in the yellow, blue, black, and magenta modules. We also found a markedly correlation between GS and K.in in the yellow, blue, black, and magenta modules (Fig. 4). Overall, module magenta shows the best as reflected by its strongly positive correlations(r = 0.84, p = 4.5E-56 in GS vs.MM; r = 0.78, p = 2.1E-43 in GS vs.K.in). These results indicated that magenta module is heavily involved in NAFLD fibrosis progression.
Characterization of the magenta module content and hub genes
A network view of magenta module, modeled by cytoscpae with TOM ≥ 0.1, is depicted in Fig. 5. The K.in count for each gene ranged from 0.1 to 14.5, with an average of 3.49 ± 2.90. The GS score for each gene ranged from −0.58 to 0.79, with an average of 0.34 ± 0.03. The MM for each gene ranged from −0.68 to 0.92, with an average of 0.43 ± 0.03. Using STRING17 or GeneMANIA18 database to model the network gave the similar results (Supplementary File1 Fig. S6).
Focusing on the magenta module, we explored those core genes that had a high significance for NAFLD status, as well as high K.in. Network top interesting genes of the magenta module based on the above two indexes are listed in Table 2. The three top network hub genes (LUM, FBN1, and THBS2) based on K.in and the three top genes (EFEMP1, THBS2, and LUM) ranked on GS were disclosed.
Specifically, several proteomic studies have identified lumican (LUM) is expressed differentially across the progressive stages of NAFLD19. Upregulated expression of LUM is in association to hepaticfibrosis20. Moreover, in animal studies, LUM is a prerequisite for hepatic fibrosis, which involves collagen fibrillogenesis, and matrix turnover21.
To the best of our knowledge, there was nothing directly implicating EFEMP1, FBN1 and THBS2 reported to be associated with severe NAFLD. However, EFEMP1 has been showed decreased expression in HCC tissue22. The following two genes, FBN1 and THBS2, were belong to the cellular adhesion and extracellular matrixconstituent. FBN1, has been shown mounting a hepatic progenitor cell response for tissue repair in rat liver23. THBS2 was found over-represented in patients with vascular liver lesions such as sinusoidal dilatations24.
All these four genes were significantly upregulated in advanced fibrosing-NAFLD (GSE49541). High expression of these genes was also confirmed in advanced NAFLD in other cohorts (E-MEXP-2191, GSE48452 and GSE84044, Fig. 5c). Conversely, these genes were not differentially expressed in the NAFLD dominant datasets (Supplementary File 1, Fig. S5), suggesting EFEMP1, FBN1 and THBS2 maybe the novel candidate biomarkers for fibrosing-NAFLD.
Hub genes were significantly up-regulated in the livers from NAFLD patients and mices
To investigate if hub genes were modified in fibrosising-NAFLD, the production of these genes were further examined in livers from NAFLD animals and patients.
Liver biopsy tissues from NAFLD patients were stained to assess disease severity. Compared with NAFLD fibrosis stage 0–1 patients (n = 4), LUM, EFEMP1, FBN1, and THBS2 were remarkablely up-regulated in the fibrosis stage 3–4 state patients (n = 4) (Fig. 6a,b).
ApoE−/− mice receiving a HFHC diet are the well-established animal models mimicking human NAFLD25. As expected, ApoE−/− mice fed a HFHC diet for 20 weeks evidenced hepatic steatosis, ballooning, hepatic inflammation and increased fibrosis. Similarly, Lum, Efemp1, Fbn1, and Thbs2 were significantly higher in livers from NAFLD mice with severe fibrosis compared to livers from mice with mild fibrosis (n = 3 in each group, Fig. 6c,d). The data in vivo above suggests a close relationship between hub genes and NAFLD fibrosis.
Functional organization of the magenta module
Next, function relevance of magenta module was reannotated using DAVID tool. With the cutoff set as FDR <0.1, focal adhesion, ECM-receptor interaction, and phosphatidylinositol 3-kinase(PI3K)-Akt signaling pathway constitue the main pathways in magenta module. In complete accord with the phenotype, GOterm extracellular matrix organization and cell adhesion were significantly enriched.
Since co-expressed genes may be co-regulated by the common transcription factors (TFs) and microRNAs, we performed gene-set enrichment analysis using ChEA, Encode, and Targetscan database26,27,28 for magenta module. Significant enrichments of transcription factors were observed for ESR1, SOX2, TP53, etc (Fig. 7; Supplementary File 2, Table S5–S7). Several TFs were reported to be functionally associated with NAFLD. For instance, CpG island methylation of ESR1 was found to be involved in lipid and glucose metabolism, and the progression of fibrosis in mouse feeding with methyl-deficient diets29. SOX2 expression may predict the prognosis of HCC patients30. As a metabolic modulator, growing evidences highlight TP53 a new player in NAFLD pathogenesis31. BACH1 gene ablation reduces steatohepatitis in mouse32. Specifically, RELA (p65), is well-known for its trigging inflammatory responses in NAFLD33. RUNX2 downregulation is involved in cirrhotic liver34. Most recently, TRIM28 was found to modulate the prevalence of obesity in the population35. The other two genes (ZNF217, TP63) enriched in magenta module have not been reported for their relationship to liver.
H3 lysine 27 trimethylation (H3K27me3) was strongly enriched for most of the genes in module magenta. Recent paper has shown that in human NAFLD-associated HCC, level of H3K27me3 was correlated positively in tumors compared with nontumor tissues36.
Finally, the most enriched miRNAs were observed for miR-200b, miR-200c and miR-429. In high fat diet feeding rats, miR-200a, miR-200b and miR-429 were significantly correlated with a severity of NAFLD-specific liver pathomorphological features37. Another study has showed, miR-200a, miR-200b, and miR-200c were up-regulated in NAFLD38.
Discussion
Here, we present a systematic WGCNA of NAFLD with either mild fibrosis or advanced fibrosis. Among the 14 modules discovered in this study, seven modules were significantly associated with disease progression.
As an alternative to traditional differential expression analyses which centered on individual genes distinguishing the status of NAFLD, in an unbiased manner, WGCNA groups co-expressed genes that are biologically integrated on a genome-wide scale. We found several features in term of the relationship between WGCNA and DEG analysis. First, using differential expression genes to model WGCNA network is not suggested, since it completely invalidates the scale-free topology assumption. Second, we found that, globally, DEG genes have less connectivity than non-DEG genes (9.57 ± 12.91 vs. 24.93 ± 37.59, respectively), and this association does not follow a simple monotonic trend in each module. Third, DEG genes tend not to be hub genes as determined by connectivity in most modules. However, DEG numbers show a significant enrichment in seven important modules (Fig. 1c). Thus, WGCNA and DEG showed efficient mutual complementation when for transcriptome analysis.
Results suggest that the modules identified here are biologically rational. First, fibrosing-NAFLD and NAFLD modules were clearly separated by WGCNA approach. Module preservation was extensively studied among different datasets. Second, most of the identified modules are enriched for specific GO terms and KEGG pathways. For instance, module black, together with module blue, are markedly inverse correlated with NAFLD fibrosis. Both the black and blue modules were enriched in respiratory electron transport chain and small molecule metabolic process, which have already been implicated in oxidative stress, and mitochondrial dysfunction for NAFLD39. KEGG analysis showed that these modules are directly related to NAFLD. Moreover, hub genes in these modules are implicated in NAFLD as reported by literature annotation. As the hub gene of blue module,selenium-binding protein 1(SELENBP1) has been shown to be downregulated in the liver tissue of HCC patients and the association of its gradual loss with an increased malignant grade40. Thus, it is temping to speculate that the NAFLD modules (black, and blue) dominate the period of NAFLD when fibronesis is not obvious.
Recently, the systems biology approach for NAFLD has been applied in studies by integrating genomic data and metabolic networks. One study showed that metabolism of amino acids, chondroitin and heparan sulphates seem to be involved in the appearance of NASH41. In our analysis, we observed regulation of cellular amino acid metabolic process in black or blue module, including branched-chain, sulfur, alpha-amino, serine family, aspartate family amino acid metabolic process. Simultaneously, cellular response to amino acid stimulus was also enriched in magenta module with relatively small propability, which suggests the continuous amino acid metabolic stress during NAFLD pathogenesis.
This study highlights the importance of magenta moduleas as a driver of fibrosis based on the meta-module, and further through the network feature (GS, MM and K.in) analysis. Enriched GO terms or pathway are highly concordant. In particular, extracellular matrix (ECM) and PI3K pathway were top core gene sets of the magenta module. The ECM is mainly composed of an intricate interlocking mesh of fibrillar and non-fibrillar collagens, elastic fibers and glycoproteins42, which is a highly dynamic structure undergoing controlled remodelling43. The deposition of increased and abnormal ECM is the hallmark of liver fibrosis44. Correspondingly, the PI3K signaling pathway has been shown to regulate procedures associated with hepatic stellate cell (HSC) activation such as collagen synthesis and cell proliferation45. Inhibition of PI3K signaling in HSCs suppresses ECM deposition, type I collagen synthesis, and reduce the expression of profibrogenic factors. In a feed-back manner, collagen crosslinking increases stiffness, β1 integrin clustering, PI3K signalling and focal adhesion formation to drive the disease progression43.
The progression of NAFLD from mild steatosis up to severe steatohepatitis and even cirrhosis or HCC, varies widely between individual patients. Recently, one study was to assess the histological severity of NAFLD in a cohort with serial biopsy data, and contrary to current dogma, this study suggests that steatosis can directly progress to NASH and clinically significant fibrosis46. Since causality between liver fibrosis and the prognosis of the liver disease is well recognized, the intervening measure may have to be adjusted in different subsets of NAFLD patients.
To demonstrate the usefulness of our modules in the development of efficient NAFLD interfere strategies, given magenta module as an example, small compounds derived from the Library of Integrated Network-based Cellular Signatures (LINCS) L1000 platform47 affecting the gene expression was provided in Supplementary File 2 Table S8–9. Next, novel potential biomarkers including EFEMP1, THBS1, FBN1 were disclosed in magenta module, after extensive cross-validation. Another noteworthy was the link between the regulators (transcription factors and epigenetic markers) and the co-expression mode of genes in magenta module, which may suggest the regulatory circuit during NAFLD progression. In the future, more experiments are needed to validate these discovery clues.
In summary, this study generated a comprehensive and unbiased snap-shot of the modules as well as genes in fibrosing-NAFLD. In particular, magenta module and genes regulating ECM remodelling during NAFLD progression deserve further attention. An identification of mechanistically linked key module and regulators will aid intervention development.
Methods
Gene expression dataset and processing
Transcription profile of NAFLD was downloaded from the Gene Expression Omnibus (GEO) with accession number GSE49541. The raw data were corrected and normalized using the RMA function of affy package of R 3.2.0 in Bioconductor. This datatset represents two clinically defined pathological groups at the extremes of NAFLD: mild NAFLD (n = 40, fibrosis stages 0–1), with little risk of developing severe liver disease; advanced NAFLD (n = 32, fibrosis stage 3–4), with significant likelihood of developing liver-related morbidity and mortality. As described earlier, the two groups were matched for gender, age and body mass index48. Differential expressed genes (DEG) were considered by using the criterion with Benjamini & Hochberg adjusted p value less than 0.05.
The microarray datasets referenced during the study (E-MEXP-3291, GSE45428, GSE50594 and GSE84044, etc) are available in a public repository from EBI (http://www.ebi.ac.uk/) or NCBI GEO. All the other data supporting the findings of this study are available within the article and its Supplementary File 1 Table S1. It is important to emphasize we include only the human NAFLD samples in our study. Full experimental methods and detailed descriptions of these public data sets can be found in the original references.
WGCNA network construction and module detection
After normalization, we removed lowly and nonexpressed genes by selecting probes with a mean expression in the top 50% of all probes. Next, genes with expression variance above average level were selected. Different probes targeting the same gene were collapsed. These steps finally resulted in 7012 genes to infer co-expression networks.
Networks were formed following the protocols of WGCNA49. A pairwise pearson correlation coefficient matrix was first computed, and an adjacency matrix and topological overlap matrix (TOM) were constructed11. TOM is a parameter referring to the interconnection between two genes, and a module is a cluster of genes with high topological overlap. Modules were identified on the dendrogram using the dynamic tree cut algorithm50. The module eigengene (ME) is defined as the first principal component of a given module, which can be considered a representative of the gene expression profiles in a module. Module Membership (MM), also known as eigengene-based connectivity (kME), is defined as the correlation between the module eigengene and gene expression values. Genes weakly correlated with all of the MEs (|kME| < 0.7) were assigned to none of the modules. Finally, the interesting module network was visualized by Cytoscape51.
Module preservation implemented in WGCNA was used to detect the conservation of gene pairs between two networks52. Briefly, three types of network based module preservation statistics, including density based preservation statistics, connectivity based preservation statistics, and network based statistics have been identified. Then, two composite measures have been defined. Median rank is defined as the mean of median ranks computed for connectivity and density measures of each module. Zsummary is used to assess the significance of observed statistics and is defined as the mean of Z scores computed for density and connectivity measures. We utilize median rank to identify module preservation and Zsummary to assess significance of module preservation via permutation testing 200 times.
Feature vectors in WGCNA network
Gene significance (GS) was defined as the the correlation between individual genes and NAFLD trait. The intramodular connectivity (K.in) was calculated as the summation of adjacency performed over all genes in a particular network. If GS and MM are highly correlated, it means that genes are the most important elements of modules and are highly significantly associated with the trait. Generally, the MM is high in relation to k.in, and a higher correlation indicates that a gene is more important to the given module11.
Hub genes tends to be located in the centre of a network, highly connected with other genes and hence of high functional significance. Therefore, a gene with high GS, high MM and high K.in in a module was considered to be a hub gene.
Functional annotation of the modules
Gene ontology (GO) and KEGG pathwayenrichment analysis for network modules were performed using the Database for Annotation, Visualization and Integrated Discovery (DAVID)15. In DAVID, an overrepresentation of a term is defined as a modified Fisher’s exact P value with an adjustment for multiple tests using Benjamini method.
In addition, we related interesting modules to biological curated gene sets on the basis of Enrichr (http://amp.pharm.mssm.edu/Enrichr), which currently contains a large collection of diverse gene set libraries available for analysis53. In Enrichr, apart from the Fisher exact test used in most gene list enrichment analyses, a z-score test statistics capture the deviation from the expected rank by the Fisher exact test. Then, a combined score evaluating the enrichment was computed as follows: C = log(p)⋅Z. Where C indicates the combined score, p is the p-value computed using the Fisher exact test, and Z is the z-score computed by assessing the deviation from the expected rank.
Ethical considerations
All the experiments protocols involving humans and animals were approved by the Human Ethics Committee of the affiliated hospital of Hangzhou Normal University. Methods were carried out in accordance with the approved guidelines and regulation. Written informed consent was obtained from all participants. Appropriate care was given to all animals included for experiments.
Patients
The criteria for NAFLD were based on those recommended by the Chinese Liver Disease Association. Liver biopsies from NAFLD patients with different stages of fibrosis, identified between Feb 2013 and Oct 2016 in the Department of Liver Diseases, affiliated hospital of Hangzhou Normal University, China was collected and stored at −80 °C before analysis. Paraffin embedded liver tissues were used for immunohistochemistry (IHC).
Animal studies
Four-week-old male ApoE−/− mices were purchased from Model Animal Research Center of Nanjing University (Nanjing, China). All mices were bred in a specific pathogen-free facility and maintained in a 12-hour light-dark-cycle at room temperature and fed ad libitum. The mices were divided into two groups and allocated into either a normal Chow-diet (Normal group) or high fat high cholesterol-diet (HFHC group). HFHC diet was from Research Diets, New Brunswick (D12079B; Research Diets New Brunswick, NJ). At the end of the experiments, a part of the liver tissue was fixed with 10% formaldehyde and the remaining liver was snap frozen.
Liver histology
Formalin-fixed liver tissue was processed into 4 μm thick paraffin sections and stained with hematoxylin and eosin (HE) and Masson’s staining. Degree of NAFLD activity score (NAS) were scored according to NASH clinical research network (CRN) scoring system.
Quantitative real-time PCR
Hepatic mRNA levels were analyzed by qRT-PCR using a 7900 Real Time PCR System (Applied Biosystems, USA). The RNA was isolated with TRIzol (Invitrogen, USA). cDNA was synthesized using 2 μg of total RNA with PrimeScript™ Reverse Transcriptase (Takara). Amplification reactions were performed using the SYBR® Premix Ex Taq kit (Takara) and 0.2 µM of gene specific primers (Supplementary File 1 Table S2) and PCR products were verified by melting curve analysis. The relative quantification expression was calculated using the delta-delta Ct method with each gene normalized to GAPDH.
References
Unalp-Arida, A. & Ruhl, C. E. Noninvasive fatty liver markers predict liver disease mortality in the U.S. population. Hepatology 63, 1170–1183, doi:10.1002/hep.28390 (2016).
Hubscher, S. G. Histological assessment of non-alcoholic fatty liver disease. Histopathology 49, 450–465, doi:10.1111/j.1365-2559.2006.02416.x (2006).
Hashimoto, E., Tokushige, K. & Ludwig, J. Diagnosis and classification of non-alcoholic fatty liver disease and non-alcoholic steatohepatitis: Current concepts and remaining challenges. Hepatology research: the official journal of the Japan Society of Hepatology 45, 20–28, doi:10.1111/hepr.12333 (2015).
Pagadala, M. R. & McCullough, A. J. The relevance of liver histology to predicting clinically meaningful outcomes in nonalcoholic steatohepatitis. Clinics in liver disease 16, 487–504, doi:10.1016/j.cld.2012.05.006 (2012).
Younossi, Z. M. et al. Pathologic criteria for nonalcoholic steatohepatitis: interprotocol agreement and ability to predict liver-related mortality. Hepatology 53, 1874–1882, doi:10.1002/hep.24268 (2011).
Caldwell, S. H. et al. NASH and cryptogenic cirrhosis: a histological analysis. Annals of hepatology 8, 346–352 (2009).
Mendes, F. D., Suzuki, A., Sanderson, S. O., Lindor, K. D. & Angulo, P. Prevalence and indicators of portal hypertension in patients with nonalcoholic fatty liver disease. Clinical gastroenterology and hepatology: the official clinical practice journal of the American Gastroenterological Association 10, 1028–1033 e1022, doi:10.1016/j.cgh.2012.05.008 (2012).
Haas, J. T., Francque, S. & Staels, B. Pathophysiology and Mechanisms of Nonalcoholic Fatty Liver Disease. Annual review of physiology 78, 181–205, doi:10.1146/annurev-physiol-021115-105331 (2016).
Xun, Y.-h. et al. Non-alcoholic fatty liver disease (NAFLD) fibrosis score predicts 6.6-year overall mortality of Chinese patients with NAFLD. Clinical and Experimental Pharmacology and Physiology 41, 643–649, doi:10.1111/1440-1681.12260 (2014).
Furlong, L. I. Human diseases through the lens of network biology. Trends in genetics: TIG 29, 150–159, doi:10.1016/j.tig.2012.11.004 (2013).
Zhang, B. & Horvath, S. A general framework for weighted gene co-expression network analysis. Statistical applications in genetics and molecular biology 4, Article17, doi:10.2202/1544-6115.1128 (2005).
Wang, A. et al. Functional modules distinguish human induced pluripotent stem cells from embryonic stem cells. Stem cells and development 20, 1937–1950, doi:10.1089/scd.2010.0574 (2011).
Yang, Y. et al. Gene co-expression network analysis reveals common system-level properties of prognostic genes across cancer types. Nature communications 5, 3231, doi:10.1038/ncomms4231 (2014).
Langfelder, P. & Horvath, S. Fast R Functions for Robust Correlations and Hierarchical Clustering. Journal of statistical software 46 (2012).
Huang da, W., Sherman, B. T. & Lempicki, R. A. Systematic and integrative analysis of large gene lists using DAVID bioinformatics resources. Nature protocols 4, 44–57, doi:10.1038/nprot.2008.211 (2009).
Langfelder, P., Mischel, P. S. & Horvath, S. When is hub gene selection better than standard meta-analysis? PloS one 8, e61505, doi:10.1371/journal.pone.0061505 (2013).
Szklarczyk, D. et al. STRING v10: protein-protein interaction networks, integrated over the tree of life. Nucleic acids research 43, D447–452, doi:10.1093/nar/gku1003 (2015).
Montojo, J., Zuberi, K., Rodriguez, H., Bader, G. D. & Morris, Q. GeneMANIA: Fast gene network construction and function prediction for Cytoscape. F1000Research 3, 153, doi:10.12688/f1000research.4572.1 (2014).
Nuno-Lambarri, N., Barbero-Becerra, V. J., Uribe, M. & Chavez-Tapia, N. C. Mitochondrial Molecular Pathophysiology of Nonalcoholic Fatty Liver Disease: A Proteomics Approach. International journal of molecular sciences 17, doi:10.3390/ijms17030281 (2016).
Bracht, T. et al. Analysis of disease-associated protein expression using quantitative proteomics-fibulin-5 is expressed in association with hepatic fibrosis. Journal of proteome research 14, 2278–2286, doi:10.1021/acs.jproteome.5b00053 (2015).
Krishnan, A. et al. Lumican, an extracellular matrix proteoglycan, is a novel requisite for hepatic fibrosis. Laboratory investigation; a journal of technical methods and pathology 92, 1712–1725, doi:10.1038/labinvest.2012.121 (2012).
Nomoto, S. et al. Epidermal growth factor-containing fibulin-like extracellular matrix protein 1, EFEMP1, a novel tumor-suppressor gene detected in hepatocellular carcinoma using double combination array analysis. Annals of surgical oncology 17, 923–932, doi:10.1245/s10434-009-0790-0 (2010).
Vestentoft, P. S. et al. Molecular constituents of the extracellular matrix in rat liver mounting a hepatic progenitor cell response for tissue repair. Fibrogenesis & tissue repair 6, 21, doi:10.1186/1755-1536-6-21 (2013).
Agostini, J. et al. Identification of molecular pathways involved in oxaliplatin-associated sinusoidal dilatation. Journal of hepatology 56, 869–876, doi:10.1016/j.jhep.2011.10.023 (2012).
Jeon, S., Park, Y. J. & Kwon, Y. H. Genistein alleviates the development of nonalcoholic steatohepatitis in ApoE(−/−) mice fed a high-fat diet. Molecular nutrition & food research 58, 830–841, doi:10.1002/mnfr.201300112 (2014).
Lachmann, A. et al. ChEA: transcription factor regulation inferred from integrating genome-wide ChIP-X experiments. Bioinformatics 26, 2438–2444, doi:10.1093/bioinformatics/btq466 (2010).
Mouse, E. C. et al. An encyclopedia of mouse DNA elements (Mouse ENCODE). Genome biology 13, 418, doi:10.1186/gb-2012-13-8-418 (2012).
Agarwal, V., Bell, G. W., Nam, J. W. & Bartel, D. P. Predicting effective microRNA target sites in mammalian mRNAs. eLife 4, doi:10.7554/eLife.05005 (2015).
Cordero, P., Campion, J., Milagro, F. I. & Martinez, J. A. Transcriptomic and epigenetic changes in early liver steatosis associated to obesity: effect of dietary methyl donor supplementation. Molecular genetics and metabolism 110, 388–395, doi:10.1016/j.ymgme.2013.08.022 (2013).
Huang, P. et al. Role of Sox2 and Oct4 in predicting survival of hepatocellular carcinoma patients after hepatectomy. Clinical biochemistry 44, 582–589, doi:10.1016/j.clinbiochem.2011.02.012 (2011).
Derdak, Z. et al. Inhibition of p53 attenuates steatosis and liver injury in a mouse model of non-alcoholic fatty liver disease. Journal of hepatology 58, 785–791, doi:10.1016/j.jhep.2012.11.042 (2013).
Inoue, M. et al. Bach1 gene ablation reduces steatohepatitis in mouse MCD diet model. Journal of clinical biochemistry and nutrition 48, 161–166, doi:10.3164/jcbn.09-122GFR (2011).
Willy, J. A., Young, S. K., Stevens, J. L., Masuoka, H. C. & Wek, R. C. CHOP links endoplasmic reticulum stress to NF-kappaB activation in the pathogenesis of nonalcoholic steatohepatitis. Molecular biology of the cell 26, 2190–2204, doi:10.1091/mbc.E15-01-0036 (2015).
Hattori, S. et al. FR-167653, a selective p38 MAPK inhibitor, exerts salutary effect on liver cirrhosis through downregulation of Runx2. Laboratory investigation; a journal of technical methods and pathology 87, 591–601, doi:10.1038/labinvest.3700539 (2007).
Dalgaard, K. et al. Trim28 Haploinsufficiency Triggers Bi-stable Epigenetic Obesity. Cell 164, 353–364, doi:10.1016/j.cell.2015.12.025 (2016).
Tian, Y. et al. Histone Deacetylase HDAC8 Promotes Insulin Resistance and beta-Catenin Activation in NAFLD-Associated Hepatocellular Carcinoma. Cancer research 75, 4803–4816, doi:10.1158/0008-5472.CAN-14-3786 (2015).
Alisi, A. et al. Mirnome analysis reveals novel molecular determinants in the pathogenesis of diet-induced nonalcoholic fatty liver disease. Laboratory investigation; a journal of technical methods and pathology 91, 283–293, doi:10.1038/labinvest.2010.166 (2011).
Feng, Y. Y. et al. Aberrant hepatic microRNA expression in nonalcoholic fatty liver disease. Cellular physiology and biochemistry: international journal of experimental cellular physiology, biochemistry, and pharmacology 34, 1983–1997, doi:10.1159/000366394 (2014).
Paradies, G., Paradies, V., Ruggiero, F. M. & Petrosillo, G. Oxidative stress, cardiolipin and mitochondrial dysfunction in nonalcoholic fatty liver disease. World journal of gastroenterology 20, 14205–14218, doi:10.3748/wjg.v20.i39.14205 (2014).
M, D. I. S. et al. A possible predictive marker of progression for hepatocellular carcinoma. Oncology letters 2, 1247–1251, doi:10.3892/ol.2011.378 (2011).
Mardinoglu, A. et al. Genome-scale metabolic modelling of hepatocytes reveals serine deficiency in patients with non-alcoholic fatty liver disease. Nature communications 5, 3083, doi:10.1038/ncomms4083 (2014).
Cox, T. R. & Erler, J. T. Remodeling and homeostasis of the extracellular matrix: implications for fibrotic diseases and cancer. Disease models & mechanisms 4, 165–178, doi:10.1242/dmm.004077 (2011).
Bonnans, C., Chou, J. & Werb, Z. Remodelling the extracellular matrix in development and disease. Nature reviews. Molecular cell biology 15, 786–801, doi:10.1038/nrm3904 (2014).
Wells, R. G. Cellular sources of extracellular matrix in hepatic fibrosis. Clinics in liver disease 12, 759–768, viii, doi:10.1016/j.cld.2008.07.008 (2008).
Son, M. K. et al. HS-173, a novel PI3K inhibitor, attenuates the activation of hepatic stellate cells in liver fibrosis. Scientific reports 3, 3470, doi:10.1038/srep03470 (2013).
McPherson, S. et al. Evidence of NAFLD progression from steatosis to fibrosing-steatohepatitis using paired biopsies: implications for prognosis and clinical management. Journal of hepatology 62, 1148–1155, doi:10.1016/j.jhep.2014.11.034 (2015).
Vempati, U. D. et al. Metadata Standard and Data Exchange Specifications to Describe, Model, and Integrate Complex and Diverse High-Throughput Screening Data from the Library of Integrated Network-based Cellular Signatures (LINCS). Journal of biomolecular screening 19, 803–816, doi:10.1177/1087057114522514 (2014).
Moylan, C. A. et al. Hepatic gene expression profiles differentiate presymptomatic patients with mild versus severe nonalcoholic fatty liver disease. Hepatology 59, 471–482, doi:10.1002/hep.26661 (2014).
Langfelder, P. & Horvath, S. WGCNA: an R package for weighted correlation network analysis. BMC bioinformatics 9, 559, doi:10.1186/1471-2105-9-559 (2008).
Langfelder, P., Zhang, B. & Horvath, S. Defining clusters from a hierarchical cluster tree: the Dynamic Tree Cut package for R. Bioinformatics 24, 719–720, doi:10.1093/bioinformatics/btm563 (2008).
Demchak, B. et al. Cytoscape: the network visualization tool for GenomeSpace workflows. F1000Research 3, 151, doi:10.12688/f1000research.4492.2 (2014).
Langfelder, P., Luo, R., Oldham, M. C. & Horvath, S. Is my network module preserved and reproducible? PLoS computational biology 7, e1001057, doi:10.1371/journal.pcbi.1001057 (2011).
Kuleshov, M. V. et al. Enrichr: a comprehensive gene set enrichment analysis web server 2016 update. Nucleic acids research, doi:10.1093/nar/gkw377 (2016).
Acknowledgements
This work was supported in part through National Natural Science Foundation of China (81570524), Zhejiang Provincial Natural Science Foundation (LY15H070004), Hangzhou Health Science and technology plan (201763676), and Major project of Hangzhou health and family planning science and technology project (2015ZD02).
Author information
Authors and Affiliations
Contributions
J.Y. and J.P.S. conceived, designed, and supervised the overall study. Y.L., G.Y.T., and Y.S. carried out data processing and computational analysis. Y.L.L. and Y.D.C. performed samples preparation and expression analysis. All of the authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing Interests
The authors declare that they have no competing interests.
Additional information
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Lou, Y., Tian, GY., Song, Y. et al. Characterization of transcriptional modules related to fibrosing-NAFLD progression. Sci Rep 7, 4748 (2017). https://doi.org/10.1038/s41598-017-05044-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-017-05044-2
- Springer Nature Limited
This article is cited by
-
Transcriptomics and co-expression network analysis revealing candidate genes for the laccase activity of Trametes gibbosa
BMC Microbiology (2023)
-
Identification and validation of fatty acid metabolism-related lncRNA signatures as a novel prognostic model for clear cell renal cell carcinoma
Scientific Reports (2023)
-
Identification of genes for seed isoflavones based on bulk segregant analysis sequencing in soybean natural population
Theoretical and Applied Genetics (2023)
-
mRNA–miRNA bipartite networks reconstruction in different tissues of bladder cancer based on gene co-expression network analysis
Scientific Reports (2022)
-
ATG101-related signature predicts prognosis and therapeutic option in hepatocellular carcinoma
Scientific Reports (2022)