Introduction

Human immunodeficiency virus type 1 (hereafter referred to as “HIV”) is the causative agent of acquired immune deficiency syndrome (AIDS). In the host cells, HIV infection causes to produce viral RNAs and proteins, results in profound functional and morphological changes, and alerts the progression of the host cell cycle. Meanwhile, HIV genomes can integrate to the host cellular DNA as a persistent viral reservoir and harboring replication-competent proviruses with a dormancy state1.

Through the development of specific inhibitors of HIV replication, significant progress has been made in the treatment of HIV infection. The success of anti-retroviral (ARV) therapy reduces HIV plasma viremia to an under-detectable level2. However, HIV is not eradicated and AIDS patients must remain on ARV drugs for life. A major obstacle to cure AIDS is the HIV latency that is a state of reversibly nonproductive infection in the resting CD4 T cells and possibly other cells3. The long-lived reservoir is resistant to ARV therapy and to clearance by the host immune system. Once treatment is stopped, HIV originating from the reservoir lead to re-seed the infection4.

The molecular mechanisms of HIV latency are complex and numerous studies have been performed to explain this phenomenon3,5,6. In general, a combination of trans-effects and cis-effects governs the activity of HIV promoter transcription making the reservoir the best-established barrier to HIV eradication. Mostly cis-effects reflect the variety of chromatin environments at different sites of integration genome and trans-effects reflect the trans-acting transcription factors within the host cells5. For example, heterochromatin of the host genome after HIV infection impairs the gene expression by impeding transcription factors access to the underlying DNA7. As the fundamental structural unit of chromatin, nucleosomes are regulated by posttranslational modifications of the histone tails which include phosphorylation, methylation, acetylation, ubiquitylation and poly ADP-ribosylation8. Specific combinations of these repressive histone marks have been proposed to govern the chromatin assembly and HIV gene expression9. Similarly, the strong repressive state of the HIV latency is correlated with methylation of CpG islands within the HIV promoter and DNA demethylation is associated with re-activation of latent HIV10. Another mechanism that can cause HIV transcriptional silence is the transcriptional interference of the HIV promoters and cellular promoters in the integration genome11. Depending on the orientation of the HIV provirus relative to the host cells genes, different forms of transcriptional interference occur12.

It has become clear that reactivation of HIV latency cells in AIDS patients on ARV therapy will be necessary to curing HIV infection. This is the strategy named “shock and kill” to facilitate a cure for HIV. This strategy administers latency reactivate agents to patients while on ARV therapy in order to induce pro-viral transcription in latently infected cells13. Several classes of compounds have been discovered to “Shock” HIV out of latency including; protein kinase C (PKC) agonists14, histone deacetylase inhibitors (HDACis)15, histone methyltransferase inhibitors (HMTIs)16, cytokines, bromodomain inhibitors17 and so on.

The PKC agonists is the only one latency-reversal agents (LRAs) class to demonstrate reproducible results across different HIV latency models18. Generally, the PKC agonists can be divided into three major chemical families: phorbol esters, including prostratin, 12-deoxyphorbol 13-phenylacetate (DPP) and phorbol 12-myristate 13-acetate (PMA); diterpenes, which include compounds of ingenol; and macrocyclic lactones including bryostatin and its analogs19. Among these PKC agonists, bryostatin-1 (hereafter referred to as “bryostatin”) has shown significant potency to revert HIV latency in ex vivo experiment using patient cells, compared with other LRAs20.

Although these compounds above have proven effective to reactivate the latency cell, the molecular mechanisms underlying the effects are not entirely clear. The activities of these compounds cannot be attributed wholly to a single target and may involve several molecular targets and pathways. Unfortunately, there is no systematic analysis of molecular targets of these LRAs and their related pathway. In this study, we performed an integrated analysis of the target profile of bryostatin and transcriptome of the reactivated CD4+ T cells after exposing to bryostatin. The result showed a distinct gene expression profile between the latency cells and bryostatin reactivated cells. Extensive changes of gene expression occurred in the CD4+ T cells treated with bryostatin. Furthermore, we found bryostatin can target multiple types of protein other than only PKC. Functional network analysis of the target profile and differential expressed genes (DEGs) suggested that bryostatin may activate a few novel pathways such as pyrimidine metabolism and purine metabolism and p53 signaling pathway, in addition to the commonly known pathways DNA replication, cell cycle, nucleotide excision repair and mismatch repair. These results provide mechanistic insights into the role of bryostatin and its affected pathways in controlling HIV latency and reactivation.

Results

Extensive changes of gene expression in CD4+ T cell exposed to bryostatin

Compared with the unstimulated CD4+ T cell, we identified 597 DEGs (P value< 0.01, Table S1) in the bryostatin stimulated CD4+ T cell. In the DEGs, there are 538 up-regulated and 59 down-regulated genes (Fig. 1). It indicates that extensive changes of gene expression in CD4+ T cells after exposing to bryostatin. We listed the top 30 most significantly deregulated genes, most of which were up-regulated (Table 1 and labeled in Fig. 1). The expression of genes CENPP, GGH, MEA1, CENPU, SGK1, GALNT4, and SLC38A5 was upregulated more than 10 folds. In contrast, the expression of genes KMT2B, IL7R was decreased by over four-fifths. The most obvious DEGs for up-regulation and down-regulation were CENPP (FC = 48.57) and IL7R (FC = 0.132), respectively.

Figure 1
figure 1

The differential expressed genes in CD4+ T cell treated with bryostatin. The x-axis is log2 ratio of gene expression levels between the bryostatin stimulated and unstimulated CD4+ T cells; the y-axis is P value based on −log10. The red and blue dots represent the up-regulated and down-regulated gene (P value < 0.01), respectively; the most significant DEGs (P value < 0.05, AUC >= 0.95 and |log2FC| > 2) were labeled with gene symbol.

Table 1 The information about the most significantly deregulated genes (|Log2 FC| > 2 and AUC > 0.95).

The bryostatin treated and unstimulated CD4+ T cells showed a distinct gene expression profile

To further overview the expression profile of CD4+ T cells under two conditions, hierarchical clustering analysis with a heatmap was performed (Fig. 2). The hierarchical clustering was generated using 464 DEGs (P value < 0.01 and |log2FC| > 1). In the hierarchical clustering result, 18 samples were divided two distinct categories clearly. The bryostatin treated samples showed a distinct gene expression profile from the that of the unstimulated CD4+ T cells, except the sample “TM3_ unstimulated”. Most of these DEGs are up-regulated in the bryostatin treated CD4+ T cells.

Figure 2
figure 2

A distinct gene expression profile between the bryostatin stimulated and unstimulated CD4+ T cells. The heatmap of 464 significant DEGs (P value < 0.01 and |log2FC| > 1) shows the hierarchical clustering of the 18 samples. (Bryostatin: bryostatin treated cell; Unstimulated: unstimulated CD4+ T cells). The color and shade change correspond to the expression value of gene after computing logarithms (log2) and normalization.

Significantly changed pathways in CD4+ cells from latency to reactivation state

As mentioned above, after treated with bryostatin, CD4+ cells were activated and extensive changes of gene expression occurred. A total of 538 and 59 genes were up-regulated and down-regulated, respectively. These significantly DEGs were enriched in 20 pathways revealed by KEGG pathway enrichment analysis and false discovery rate (FDR) adjustment (Fig. 3A). The top changed pathways were DNA replication, pyrimidine metabolism, drug metabolism - other enzymes, cell cycle, nucleotide excision repair, purine metabolism, mismatch repair, carbon metabolism, p53 signaling pathway, and biosynthesis of amino acids. In these pathways, 62 genes were significantly differential expressed (fold change > 2) (Fig. 3B). These genes can perform functions in multiple pathways. For example, POLD2 gene is involved in five pathways such as purine metabolism, pyrimidine metabolism, DNA replication, nucleotide excision repair and mismatch repair. Another example is PCNA gene, which is involved in four pathways as showed in the Fig. 3B. These pathways and involved DEGs provide clear clues for the molecular mechanism of HIV latency reactivation.

Figure 3
figure 3

KEGG pathways significantly enriched by DEGs and the network of the top 10 enriched pathways and DEGs. (A) The KEGG pathways enriched by DEGs. The line width indicates the enrichment percentage. The dotted line in the box indicates the significance threshold (FDR P value = 0.05). P values were measured by a hypergeometric test. (B) The selected top 10 pathways and DEGs for the explanations the treatment effect of bryostatin. The network showed the relationship between the affected pathways and DEGs (|log2FC| > 2).

The 17 putative targets of bryostatin predicted by two different methods

In order to further understand the action mechanism of bryostatin, we predicted the putative targets for bryostatin. Target prediction is still a challenging task so far and plays a critical role in the drug development21. It is useful to explore ligand-target interactions and their related bio-chemical mechanisms. We used the two methods CGBVS and 3NN to predict the putative targets of bryostatin. Chemical genomics-based virtual screening (CGBVS) is a proteome-wide method which predicts compound and protein interactions (CPIs) only based on the character description of protein sequence. In contrast, 3NN method is based on ligand similarity. A total of 100 and 28 targets of bryostatin were predicted by CGBVS and 3NN, respectively (Fig. 4A). Among these predicted targets, 17 overlap targets were considered as consistent targets of bryostatin. The 17 overlap targets were PRKCB, PRKCG, PRKCH, PRKCD, PRKCQ, PRKCE, PTAFR, IARS, METAP2, JUN, PTPN1, STAT3, HMGCR, PLA2G2A, PLA2G1B, SF3B3, PTPA (Fig. 4B). They mainly belong to the protein families of protein kinase, phospholipase A2, GPCR, peptidase M24A and so on. Among these targets, protein kinase C family is the primary known targets of bryostatin. The other protein families are defined as putative targets of bryostatin based on the result of the two methods of CGBVS and 3NN methods. Bryostatin may reactivate the HIV latent CD4+ cells through multiple targets and their affected pathways.

Figure 4
figure 4

Venn diagram of the predicted targets for bryostatin by the two different methods and the consistent targets. (A) Venn diagram of the predicted targets by CGBVS and 3NN. (B) Details for the 17 consistent targets predicted by the two methods.

Novel pathways of HIV latency reactivation revealed by functional network between bryostatin targets and DEGs

KEGG enrichment analysis revealed the top 10 changed pathways and 62 genes in these pathways significantly differential expressed (fold change >2) after exposing to bryostatin. Meanwhile, we predicted 17 consistent targets for bryostatin. To determine whether or how these targets of bryostatin affect the top 10 changed pathways and DEGs, we constructed a functional network between these targets and DEGs using GeneMANIA (Fig. 5). As Fig. 5 showed, the functional network revealed intense interaction between bryostatin targets and DEGs. Based on the KEGG database, the DEGs were labeled by different pathways. We can see clearly that bryostatin reactivate the HIV latent CD4+ cells through the effects of multiple targets. These targets affect several well-known pathways involved in the HIV latent reactivation such as DNA replication, cell cycle, nucleotide excision repair, and mismatch repair. Beside those well-known pathways, we also observed some novel pathways are closely linked to HIV latency reactivation (Fig. 5). These novel pathways are p53 signaling pathway and metabolic pathways including pyrimidine metabolism, purine metabolism, carbon metabolism, and biosynthesis of amino acids. They were not reported to be associated with latent HIV activation based on the past studies. The bryostatin targets may affect these metabolic pathways and p53 signaling, and thus play a role in the reactivation of HIV latent CD4+cells.

Figure 5
figure 5

Functional network analysis between the consistent targets of bryostatin and the most DEGs. The network shows the functional interaction between the consistent targets (green rectangle) and most DEGs that are circled and labeled by different pathways. The gray nodes represent the result genes involved in the functional interaction between targets (green rectangle) and DEGs. The black arrows represent the known pathways involved in HIV latent reactivation. The dotted arrows represent the novel pathway revealed by this study. Each colored line represents a different interaction, and the color line width indicates the confidence of the interactions.

There are more than 30 genes in pyrimidine metabolism pathway highly expressed in the CD4+ cells after exposing to bryostatin (Fig. 6). In contrast, only a few genes were down-regulated. From the upstream genes (UMPS, DHODH, DCK, NME1, RRM1, PNP, etc.) to downstream genes (POLA1, POLR2E, etc.) of this pathway, mostly of genes were highly active. Interestingly, we have found that the half parts of DNA synthesis and RNA transcription are highly active. In contrast, the other parts related to the catabolism and amino acid synthesis are under extreme inhibition and haven’t changed much in gene expression. Similarly, more than 50 genes in purine metabolism pathway were highly expressed in the CD4+ cells after exposing to bryostatin (Fig. 7). The pattern of gene expression was also similar to that of the pyrimidine metabolism pathway. This differential gene expression may drive the metabolic flux toward to the parts of DNA synthesis and RNA transcription, other than the parts related to the catabolism and amino acid synthesis.

Figure 6
figure 6

Gene expression profiles of pyrimidine metabolism pathway (hsa00240) in the bryostatin treated CD4+ T cells. The red and green colors represent the log2(FC) of the corresponding gene in pyrimidine metabolism pathway. The dotted line circles the inactive part of this pathway50,51,52.

Figure 7
figure 7

Gene expression profiles of purine metabolism pathway (hsa00230) in the bryostatin treated CD4+ T cells. The red and green colors represent the log2(FC) of the corresponding gene in purine metabolism pathway. The dotted line circles the inactive part of this pathway50,51,52.

The p53 signaling pathway is a key factor that helps to conserve the stability of the genome. It regulates a wide variety of cellular processes including apoptosis, senescence, cell cycle arrest, differentiation and so on. The p53 signaling pathway is significantly changed after exposing to bryostatin (Fig. 8). Genes CDK1, CDK2, CDK4, CCNE1, CCNB1 were up-regulated, which are contributes to the cell cycle arrest. Gene CASP3 is up-regulated, which will lead to apoptosis. Therefore, the cell cycle arrest and apoptosis may have a potential role in the reactivation of HIV latency.

Figure 8
figure 8

Gene expression profiles of p53 signaling pathway (hsa04115) in the bryostatin treated CD4+ T cells. The red and green colors represent the log2(FC) of the corresponding gene in p53 signaling pathway50,51,52.

Discussion

The latent reservoir is mostly found in resting memory CD4+ T cells22,23. The most worrisome reservoir consists of latently infected resting memory CD4+ T cells carrying integrated HIV-1 DNA. Bryostatin has shown significant potency to revert HIV latency in ex vivo experiment using patient cells, compared with other LRAs20. In this study, we focused to characterize the transcriptome in both the bryostatin treated and unstimulated HIV latent CD4+ T cells. Bryostatin stimulation will lead to reactivation of HIV-latent CD4+ T cells. To further confirm the change of HIV gene expression before and after reactivation, we checked the study that generated and provided the data24. The result showed that bryostatin significantly reactivated the latency cell compared with unstimulated condition (Fig. S1). And bryostatin has the highest efficiency at latency reversal among LRAs. Furthermore, as the importance of interleukins in the reactivation, we checked the gene expression for all interleukins and their receptors. We showed the genes that expressed in at least one sample (Fig. S2). The result indicates that the expression of interleukins and their receptors were heterogeneous. Only three genes IL7R, IL2RA and IL2RA were differential expressed between unstimulated and bryostatin treated samples. IL7R was down-regulated in the bryostatin treated samples. However, IL2RA and IL2RA were up-regulated in the bryostatin treated samples. Most interleukins and their receptors did not show differential expression, which suggests interleukins and their receptors (including IL7R) may not affect ART efficiency in this study.

Accompanying the reactivation of HIV latency, we found extensive changes of gene expression. We identified a total of 597 DEGs, and most of DEGs are up-regulated. Among the top 30 DEGs, the genes CXCR6, DNMT1 and JOSD1 had studies on the HIV infection25,26,27, but had no reports on the HIV latency-reversal. Moreover, genes CENPP, CENPU and CENPM, as the component of the centromere complex (nucleosome-associated), were all from centromere protein (CENP) family and upregulated in the HIV latent CD4+ T cells treated by bryostatin. Specifically, the gene CENPP was the component of CENPA (Histone H3-Like Centromeric Protein A)-CAD (nucleosome distal) complex, and the genes CENPU/M were the component of CENPA-NAC (nucleosome-associated) complex28. Cellular chromatin provides essential structure to the viral genome and is necessary for successful completion of the HIV life cycle. HIV can disrupt host cellular chromatin during heterochromatin integration and latent reactivation in unique centromere protein (CENPP, CENPU and CENPM) ways. These upregulated genes in the CD4+ T cells stimulated by bryostatin may play an important role in the HIV latency-reversal.

The biochemical mechanisms involved in the induction of pro-viral transcription in latent HIV-infected CD4+ T cells have not been fully defined. With the development of ARV therapy, it is import to understand how HIV infection changes the host cells pathways and metabolism. In our study, we found that pyrimidine metabolism, purine metabolism, and p53 signaling pathways are very closely related to HIV latency. Purines are key components of cellular energy systems (eg, ATP and NAD), signaling (eg, GTP, cAMP, and cGMP), and, along with pyrimidines, RNA and DNA production29,30. Therefore, pyrimidines and purines are crucial for nucleic acid biosynthesis, but also for lipid and carbohydrate. Previous studies revealed that pyrimidine and purine analogues were used as antiviral and immunosuppressive agents31,32. For example, arabinosylcytosine (ara-C), 5-fluorouracil (5-FU), and azidothymidine(AZT) have been used to target HIV reverse transcriptase32. In general, HIV infection will seriously impair the capability of T cells to synthesize the purine and pyrimidine ribonucleotide intermediates33. The shortage of these intermediates obstructs the HIV-infected cells to complete their cycle, thus turns the cell into latent state33. Therefore, bryostatin stimulates the latent cell and then results in up-regulation of pyrimidine and purine metabolism, which facilitates reactivation of the latent HIV infected cell.

P53 signaling pathway, a sensor for a broad range of cellular stresses, involved in several different aspects of cell cycle arrest, apoptosis, control of genome integrity, and DNA repair34. Cell cycle arrest and apoptosis are the most prominent outcomes of p53 activation35. In this study, p53 and its downstream genes (CCNB1, CDK1, CCNE1, CKK2, CDK4, and CASP3) were highly up-regulated (Fig. 8). The up-regulated genes CCNB1 and CDK1 will result in the sustained G2 arrest. Similarly, the up-regulated genes CCNE1, CKK2, and CDK4 will lead to the sustained G1 arrest. The up-regulated gene CASP3 will induce apoptosis. Therefore, we assume that bryostatin may lead to cell cycle arrest and apoptosis in the process of reactivation of latent HIV infected CD4+ cell. Previous studies indicate that latent HIV-1 can sense the apoptosis of its host cell and responds by completing its replication cycle36,37. Another study revealed that G2 arrest of cell cycle induced HIV-1 transcriptional activation38. Together with previous studies, our results suggest that the reactivation of HIV latent cell, responding to bryostatin, may be partly due to the cell-cycle arrest and apoptosis induced by up-regulation of p53 signaling pathway.

It is worth noting that bryostatin, a potent activator of protein kinase C, may be involved in mitochondrial dysfunction39. Protein kinase C activation can induce mitochondrial dysfunction40,41,42. Protein p53 also can influence mitochondrial function43,44. Pyrimidine biosynthesis links mitochondrial respiration to the p53 pathway45. Therefore, mitochondrial dysfunction may be involved in the process that bryostatin reactivates the HIV-latent cell through up-regulation of pyrimidine and purine metabolism, and p53 signaling pathway.

In summary, we firstly revealed the extensive changes of gene expression in the process of HIV latent cells, responding to bryostatin. Hundreds of DEGs were identified and enriched both in the well-known and novel pathways. Secondly, we predicted 17 putative targets for bryostatin by two different methods. The 17 targets belong to multiple protein families including protein kinase, phospholipase A2, GPCR, peptidase M24A and so on. Finally, we discovered three novel pathways of pyrimidine metabolism, purine metabolism and p53 signaling important for the reactivation of HIV latency through functional network analysis of the bryostatin targets and DEGs. We assume that bryostatin reactivates the HIV-latent cell through up-regulation of pyrimidine and purine metabolism or through starting the cell-cycle arrest and apoptosis induced by up-regulation of p53 signaling pathway. Our study provides some novel insights into the role of bryostatin and its affected pathway in controlling HIV latency and reactivation.

Methods

Expression data collection and preprocessing

The CD4+ T cells expression data (GSE94150) was downloaded from Gene Expression Omnibus (GEO). The dataset was generated from the Illumina HiSeq. 2500 (Homo sapiens) and included 36 samples. The information for all samples was shown in Table S2. We described the key information of the samples here. All details can be found from https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE94150 and reference24. The CD4+ T cells were isolated from a cohort of Florida HIV-infected subjects that have been on successful ART for >36 months with a viral load ≤50 copies/mL. The CD4+ T cells were divided into three subsets namely central memory (TCM), transitional memory (TTM) and effector memory (TEM) CD4+ T cells. The memory CD4+ T cell TCM, TTM, and TEM subsets were stimulated for 24 h using different classes of agents including PKC activators (50 nM Bryostatin), gamma-c cytokines (10 ng/ml IL-15) and PMA+Ionomycin (100 ng/ml PMA and 1 μg/ml ionomycin). RNA-Seq was performed on the unstimulated and 24h-stimulated subsets. Unstimulated cells from the same donors were included as pre-reactivation baseline controls. Of 36 samples, 27 samples were stimulated with Bryostatin-1, IL-15 and PMA+Ionomycin, respectively. The other 9 samples were unstimulated. Our aim is to study transcriptional responses of CD4+ T cells to the bryostatin-1 and 18 samples were retained (9 bryostatin-1 stimulated and 9 unstimulated CD4+ T cells). The raw high-throughput mRNA sequencing reads was downloaded and analyzed by HISAT46, StringTie47 and Ballgown48 protocol. Firstly, the raw RNA-seq reads were mapped to the reference human genome using HISTA. Then, the alignments were passed to StingTie, which assembles and merges together all the gene structures found in any of the samples. In order to re-estimate the transcript abundance using the merged structures, the merged transcripts were fed back to StringTie again. Finally, Ballgown groups the all transcripts and abundances from StringTie by stimulated condition and determines which genes and transcripts are differentially expressed between conditions. The R v3.5.0 and Bioconductor library were used to analyze the expression data.

KEGG pathways and biological processes analysis

We used the functions enrichKEGG and enrichGOR in clusterProfiler R package49 to perform KEGG pathways50,51,52 and biological processes enrichment analysis. A pathway or process with a P value ≤ 0.05 was considered to be significantly enriched. The enriched pathways and processes were visualized using custom written R code. We selected top 10 significantly pathways affected by the compound bryostatin for further study. Therefore, we used the pathview R package53 to display the expression profiles of the corresponding genes in these pathways. This package could provide the links between genes and pathways based on the KEGG pathways.

Prediction of the targets of latency-reversal agent bryostatin

In order to understand the action mechanism of LRA bryostatin, two methods of CGBVS54,55 and 3NN56 were used to predict the targets of bryostatin. The CGBVS is a chemical genomics-based virtual screening method for target prediction54. CGBVS predicts compound-protein interactions (CPIs) by using machine learn only based protein sequence rather than three-dimensional structures. The 3NN is a ligand-based similarity rankings method for target prediction56. 3NN method uses the 3 most similar scores (or their average value) between the query and the ligands sets in reference library to rank possible targets. We used venn diagram to show the number of target genes predicted by the two methods. Overlap targets were defined as the consistent targets of bryostatin.

Functional interaction network analysis

GeneMANIA57 plug-in of Cytoscape58 was applied to identify the functional interaction between the consistent targets of bryostatin and most DEGs. We used the consistent targets of bryostatin and most DEGs as input parameters. GeneMANIA (http://www.genemania.org)59 can find other genes that are related to the set of input genes and produce a functional association network based on their relationships, such as pathways, co-expression, co-localization, genetic interaction, physical interaction, shared protein domains and so on, based on the published literature. Next, we used Cytoscape v3.2.1 to visualize the network.