Abstract
Spermatogonial stem cells are the most primitive spermatogonia in testis, which can self-renew to maintain the stem cell pool or differentiate to give rise to germ cells including haploid spermatids. All-trans-retinoic acid (RA), a bioactive metabolite of vitamin A, plays a fundamental role in initiating spermatogonial differentiation. In this study, single-cell ATAC-seq (scATAC-seq) was used to obtain genome-wide chromatin maps of cultured germline stem cells (GSCs) that were in control and RA-induced differentiation states. We showed that different subsets of GSCs can be distinguished based on chromatin accessibility of self-renewal and differentiation signature genes. Importantly, both progenitors and a subset of stem cells are able to respond to RA and give rise to differentiating cell subsets with distinct chromatin accessibility profiles. In this study, we identified regulatory regions that undergo chromatin remodeling and are associated with the retinoic signaling pathway. Moreover, we reconstructed the differentiation trajectory and identified novel transcription factor candidates enriched in different spermatogonia subsets. Collectively, our work provides a valuable resource for understanding the heterogeneity associated with differentiation and RA response in GSCs.
Similar content being viewed by others
Introduction
Spermatogenesis is a complicated and precisely regulated process in which mature spermatozoa are produced inside the seminiferous tubules. Haploid spermatozoa are derived from undifferentiated spermatogonia. The undifferentiated spermatogonia consist of A single (As), A paired (Apr) and A aligned (Aal) spermatogonia, including spermatogonial stem cells (SSCs) and progenitor cells. SSCs (As) are the most primitive stem cell population, and can undergo active self-renewal or give rise to differentiation-primed progenitor cells by mitotic cell division (Apr and Aal)1. To initiate differentiation, Aal spermatogonia transit into A1 spermatogonia. Following a few more rounds of divisions, A2, A3, and A4 spermatogonia form, respectively. After that, A4 spermatogonia mature into intermediate and type B spermatogonia, which will divide into primary spermatocytes1.
In the undifferentiated spermatogonia population, spermatogonia markers can display a gradient of expression level and define different cellular states. Currently, the SSC population cannot be defined by a single marker but a panel of SSC self-renewal genes instead, such as Gfra12, Id43, Egr24, Lhx14, Lhx25 and Etv56. When it starts to transition to progenitor cells, genes such as Upp1, Sox3, Ngn3, Nanos3, Ddit4 and Piwil4 will be upregulated, and the self-renewal genes will be downregulated5. The gene expression of spermatogonia is therefore highly heterogeneous. During spermatogonial differentiation, genes associated with differentiation, such as Kit and Stra8, will be upregulated7.
All-trans-retinoic acid (RA) is a bioactive metabolite of vitamin A. Previous studies have found that RA signaling plays a fundamental role in initiating spermatogonial differentiation. Spermatogonia cannot reach the A1 stage from Aal stage in mice when RA signaling is blocked due to vitamin A-deficient (VAD) diet8. According to the "revised Asingle model," spermatogonia's tendency to respond to the RA signal is proportional to chain length, and Apr can also respond to RA9,10. The action of RA on target gene expression depends on two families of nuclear hormone receptors, the retinoic acid receptors (RARs) and the retinoid X receptors (RXRs). RARs usually function with RXRs to form heterodimers, which can bind to retinoic acid response elements (RAREs) of the genome to regulate RA-target gene expression11. In spermatogonia, RARG is the major functional RAR isotype and it can form heterodimer with RXRA12.
Recent single cell RNA-seq analysis has deepened our understanding of the cellular heterogeneity and transcriptome dynamics associated with spermatogonial differentiation. However, the dynamic remodeling of chromatin landscapes and differential transcription factors (TFs) motif usage during RA-induced spermatogonial differentiation have not been addressed. To initiate or regulate gene expression, TFs need to bind to their DNA binding sites at the promoter and enhancer DNA of the target gene first. The TF binding sequence is usually represented as binding site motifs, which indicate the preferentially bound sequences13.The chromatin in TF bound regions and gene loci of genes with active transcription is more open and accessible14. These regions demonstrate decreased nucleosomal density and are sometimes nucleosome-free15. ATAC-seq combined with TF binding motifs has proven increasingly effective for discovering dynamic changes in chromatin landscapes and predicting critical regulatory events that cause chromatin remodeling. While Maezawa et al. have provided much information on the changes in chromatin landscapes during spermatogenesis using bulk ATAC-seq, the THY1+ undifferentiated spermatogonia sample consisted of a mix of spermatogonial stem and progenitor cells16. Thus, the main aim of this pilot study was to assess the utility of single-cell ATAC-seq for uncovering cis-regulatory elements and TF regulators in spermatogonial differentiation, especially the transition from stem cells to progenitor cells, and then to differentiating spermatogonia.
In this study, we administered RA to GSCs culture and profiled the chromatin accessibility landscape by single-cell ATAC-seq (scATAC-seq). Our study shows that stem cell-like and progenitor-like cells display distinct chromatin accessibility profiles, with stem cell-like cells displaying more profound chromatin accessibility changes induced by RA. We also explored the regulatory regions in spermatogonial subpopulations after RA stimulation. We further reconstructed the pseudotime differentiation trajectory and revealed dynamics of TF activity associated with the transition. Collectively, our study provides novel insights on the epigenetic regulations of spermatogonial differentiation induced by RA.
Results
Differentiation of GSC in vitro
To facilitate the investigation of the differentiation of SSC, we established cell culture using a well established protocol17. The cultured SSCs, designated as germline stem cells, (GSCs) were derived from isolated Oct4-EGFP+/KIT− germ cells of P5.5 mice and stabilized on mouse embryonic fibroblasts (MEF) as feeder cells (Fig. 1a). In this protocol, the culture medium is supplemented with glial cell-derived neurotrophic factor (GDNF), basic fibroblast growth factor (bFGF/FGF2) and epidermal growth factor (EGF) to sustain long term culture. While GNDF and bFGF are the bona fide self-renewal factors for SSCs, the combination of these growth factors has been demonstrated to be essential for the self-renewal, proliferation and differentiation of SSCs18,19. Our culture can be maintained for more than 3 months, suggesting this culture system can support stem cell proliferation. Induction of Receptor Tyrosine Kinase (KIT) expression is a hallmark for spermatogonial differentiation20. Interestingly, we found a subset of GSC (~ 20%) in our culture which showed spontaneous differentiation without RA treatment as revealed by KIT expression, albeit at a lower level when compared to the RA treated sample (Fig. 1b,c). This is consistent with the precarious report that spermatogonia cultured with bFGF exhibit more differentiated characteristics such as lower SSC activity and higher KIT expression21. Moreover, the EGF in medium could stimulate spermatogonia differentiation22. We found that 24 h of RA treatment resulted in a drastic increase in the number of KIT+ cells as revealed by FACS analysis when compared with DMSO control (Fig. 1b,c). We also examined the OCT4 level of RA-treated and control cells, and found that the proportion of GFP negative, GFP low and GFP high cells are similar between control and RA-treated cells (Fig. 1d). Taken together, we demonstrated that RA successfully induced spermatogonia differentiation in our GSC culture.
Profiling the change in chromatin state after RA treatment in GSC
To investigate the cellular heterogeneity and dynamic TF regulation during in vitro spermatogonial differentiation induced by RA, we performed scATAC-seq on the following samples: (1) GSCs treated with vehicle (CTRL); (2) GSCs treated with RA (RA) (Fig. 2a). 48 h after treatment, the culture cells were harvested and the isolated nuclei were subjected to Tn5 tagmentation and single-cell capture with the Bio-Rad SureCell platform. After filtering out low-quality nuclei and doublets, we obtained 2256 and 1888 high-quality single nuclei from CTRL and RA group, respectively (Supplementary Table S1), with a read depth of 14,847 (CTRL) and 10,850 (RA) fragments per nuclei and median transcription start site (TSS) ratio (the percentage of fragments at TSS) of 11.8 (CTRL) and 12.5 (RA) (Fig. 2b,c). The information of individual nuclei is listed in Supplementary Table S2. Marker peak analysis identified that 100 peaks were differentially upregulated and 476 peaks were differentially downregulated after RA treatment (Supplementary Figure S1; Supplementary Table S3). To confirm that RA had effects on the chromatin accessibility of key regulator genes in GSCs, we generated synthetic pseudo-bulk datasets by merging data from cells in control and RA-treated groups to show re-configuration of chromatin (Supplementary Table S4). A gene score is used to predict gene expression based on the accessibility of regulatory elements in the vicinity of the gene, including the gene body and putative distal regulatory elements. Reassuring, we observed that gene score of self-renewal genes (e.g. Gfra1 and Plzf) were decreased after RA treatment while that RA-induced genes (e.g. Rarb, Stra8 and Nrg3) increased accessibility (Fig. 2d,e; Supplementary Figure S2)23.
scATAC captures the epigenetic landscape of GSC in vitro
We subjected cells to dimensionality reduction using ArchR and identified 6 major clusters of cells using the graph clustering approach implemented by Seurat (Fig. 3a). We examined the property of the six clusters based on gene scores of known markers (Fig. 3b, Supplementary Figure S3). Cluster 2 (SSC) showed the highest gene scores of SSC marker genes Gfra1, Eomes24 and Id43, indicating this cluster showed stem cell properties. SSC marker gene scores in transitional Cluster 3 (Trans.SSC.1) and 5 (Trans.SSC.2) are lower. Cluster 4 (Progenitor) exhibited higher gene scores of progenitor markers (Sox3 and Upp1). Previous studies showed that Sox3 is not expressed by the steady-state stem cell population (GFRα1-positive) but is rapidly upregulated during initial differentiation commitment steps of stem cells25. Cluster 6 (differentiating spermatogonia (Diff.SPG)) showed higher gene scores of differentiation markers (Kit and Stra8), indicating they are at a more differentiated state. Lastly, Cluster 1 (spermatocyte (SPC)-like) had higher gene scores for spermatocyte-expressed genes like Tesmin, Pgk2, and Asrgl1. It has been reported that RA is sufficient for the in vitro induction of mouse spermatocytes20. It is possible that Cluster 1 cells were the more differentiated or spermatocyte-like cells in the culture system. Cluster 1 cells also showed higher gene activity of Kit, which may partly contribute to the KIT+ cells as seen in flow cytometry analysis (Figs. 1c, 3c). This subpopulation warranted future investigation of both transcript and/or protein levels of spermatocyte markers in the culture system.
RA caused a significant increase in Diff.SPG cluster by proportion, which suggested that Diff.SPG cluster should be the differentiating cells resulting from RA stimulation. On the contrary, there are more cells in Trans.SSC.1/2 clusters in the control sample. This shift from Trans.SSC.1/2 to Diff.SPG after RA treatment indicates a profound chromatin accessibility change in a subset of GSCs (Fig. 3d). Moreover, although the Progenitor cluster in RA samples showed increased gene activity of Kit than CTRL samples, they were still clustered together. This suggested that stem-cell-like subsets (Cluster 3 and 5) have more profound change in global chromatin accessibility after RA treatment than progenitor clusters (Cluster 4) (Fig. 3c).
Taken together, single-cell chromatin accessibility profiling successfully deconvoluted cellular subsets associated with GSC differentiation (Fig. 3e).
Regulatory regions associated with RA response identified by cisTopic
Toward uncovering RA-associated chromatin dynamics, we used MACS to call peaks and identify 112,292 peaks in the merged dataset. We examined the distribution of consensus peaks across genomic features in the mouse reference genome. These peaks represent a mixture of cis-regulatory elements including intergenetic regions and promotors, which is consistent with the typical ATAC-seq pattern (Supplementary Figure S4). Comparing the pseudobulk RA sample with CTRL sample revealed that peaks at known RAR/RXR direct target genes Rarb, Hgf and Stra8 gain accessibility (Fig. 4a).
We then used cisTopic to investigate RA-induced dynamic changes in chromatin accessibility at single-cell level. cisTopic identified groups of regulatory regions with similar distribution patterns as a “regulatory topic”26. The model with 16 topics was selected (Supplementary Figure S5). 16 topics with different enrichment patterns across clusters and samples were identified (Fig. 4b,c). Topics 2, 5 and 6 are more enriched in the CTRL sample, while Topics 9, 10 and 11 are enriched in the RA-treated sample (Fig. 4d). Genomic Regions Enrichment of Annotations Tool (GREAT) analysis revealed that CTRL-enriched topics are associated with stem cell population maintenance, maintenance of cell number and regulation of stem cell differentiation, which is consistent with the function of GSCs (Fig. 4e). On the other hand, RA-enriched topics were associated with cellular response to steroid hormone stimulus and hormone-mediated signaling pathway, suggesting the RA-treated GSCs were more prepared for spermatogenesis (Fig. 4f). For example, Topic 11 includes regions associated with genes in the RA receptor signaling pathway, such as Cyp26b1, Rara, Rarb, Rarg and Rxra. RA-enriched topics were also related to the protein catabolic process.
Reconstruction of the stem-to-differentiating transition trajectory
We sought to reconstruct the differentiation trajectory using a semi-supervised pseudotime approach. Trans.SSC.2 (Cluster 5) showed a dramatic decrease in cell number in the RA treated group accompanied by the emergence of Diff.SPG (Cluster 6), while the proportion change of other clusters is less profound (Fig. 3d). Therefore, we reasoned that the SSC cluster (Cluster 2) reached the Diff.SPG cluster through Trans.SSC.2. Our data do not exclude the possibility that the differentiation process starts from SSC (Cluster 2) to progenitors (Cluster 4) and finally differentiated cells. However, we decided to focus on the stem-to-differentiating spermatogonia transition (Clusters 2/5 to Cluster 6), which reflects the major cellular dynamic in our RA treatment model. We then generated an ordering of single cells (referred to as ‘pseudotime’) along the trajectory (Fig. 5a). Following that, we identified the gene candidates associated with the transition upon RA induction and revealed a list of genes with varied gene scores across the trajectory (Fig. 5b). For example, Cdkn2d is upregulated at the end of the trajectory, and Cdkn2d expression has been reported to be significantly induced by RA to promote differentiation in leukemia27 (Fig. 5c). Nav2, which is RA-inducible in the nervous system to regulate cell migration, is also enriched at the late stage of the trajectory28 (Fig. 5c).
To identify critical TFs that are involved in the RA-induced differentiation process, we next correlated the gene score of a TF (infers gene expression/activity) across pseudotime to its corresponding chromVAR TF z-score (infers TF binding activity). In brief, a TF z-score, which is calculated by chromVAR, can infer the activity of a TF based on the abundance of its corresponding motif within open chromatin regions29. However, it has a limitation that it could not provide a one-to-one correspondence between binding motifs and particular TF proteins, because TFs of the same family often share a similar motif. We reasoned that a TF with high binding activity should also be more abundant in the cell, which could be accompanied with higher gene expression. Therefore, we identified the TFs with high correlation between gene activity and TF binding activity. This uncovered a list of TFs enriched at different stages of the trajectory (Fig. 5d). TFs from the FOX, MYF and TCF families were identified in the early stages of the trajectory, which is consistent with previous TFs identified in undifferentiated spermatogonia during mouse spermatogenesis16. In line with the role of EOMES in SSC maintenance, the gene activity and TF binding activity of EOMES decrease along the trajectory (Fig. 5e). In addition, it has been reported that LEF1 is specifically expressed in undifferentiated spermatogonia and decreased expression of LEF1 led to a down-regulation of PLZF30. In contrast, RFX2 showed higher gene activity and TF binding activity at the end of the trajectory. Rfx2 is crucial for fertility as knockout of Rfx2 leads to arrest of spermatogenesis in mice in the round spermatid phase of spermiogenesis31 (Fig. 5e). Several Krüppel-like factor (KLF) family members also exhibited higher activity at the late stage of pseudotime trajectory. We also identified some candidates that have been reported to respond to RA in other cell types but not in spermatogonia. For instance, Pbx2 can be induced by RA both transcriptionally and post-translationally in P19 cells32 (Fig. 5e). C/EBPα showed higher gene score after RA treatment in leukemia cells, which also induced differentiation33 (Fig. 5e). Notably, the gene activities of Pbx2 and Cebpa were upregulated in Diff.SPG, which further implicates its potential role in spermatogonial differentiation induced by RA (Fig. 5c). Figla and Lhx6 are enriched in the trajectory during the intermediate stage. In fact, Figla regulates genes related to postnatal spermatogenesis in male germ cells34 (Fig. 5d).
Discussion
Understanding the mechanisms underlying the RA-induced spermatogonial differentiation requires a comprehensive understanding of gene expression programs and the corresponding regulatory mechanisms at the chromatin level. However, to date the studies mainly focused on the changes caused by RA at the transcriptome level. Moreover, the heterogeneous nature of spermatogonia makes it more challenging to decipher this complex process. In this study, we present the first single-cell chromatin landscape capturing spermatogonial differentiation in cultured GSCs.
First, we demonstrated that the chromatin accessibility information is able to identify the cells at stem cell or more differentiating states in culture, and further separate cells that undergo stem-to-differentiating transitions. It is immediately obvious that culture GSCs without RA treatment can already be separated into two subsets. The minor population (Progenitor cluster) harbored a more differentiating signature such as higher gene scores of progenitor markers (Upp1 and Sox3) and showed similar proportions before and after RA treatment.
There are several possible reasons to explain the existence of the Progenitor cluster. First, this population might be traced back to the progenitors or differentiating spermatogonia in vivo isolated along with stem cells when the culture is derived. However, since the culture has been maintained for a long time and the progenitors or differentiating cells could be depleted during this period, this is least likely to happen. Second, it is possible that these two subsets of cells represent two modes of self-renewal. In fact, it has been reported that FGF2 supplementation enabled in vitro SSC expansion without GDNF and the FGF2-supported and GDNF-supported cells showed differences in the self-renewal and differentiation patterns19. However, we did not detect an increased gene activity of FGF receptors (Fgfr1, Fgfr2 and Fgfr3) in the minor subset. Therefore, we favor the interpretation that they originated from the differentiation of stem cell compartment in culture. The GSCs undergo spontaneous differentiation and the cellular heterogeneity in culture involves a dynamic equilibrium between stem cells and progenitors in the normal condition.
We showed that a subset of stem cells can respond to RA and transform to a more differentiated state, which is signified by a more profound change in global chromatin accessibility when compared to the response of progenitor cells. Using state-of-the-art tools, we identified differentially accessible regions, which may represent cis-regulatory elements of RA-responsive genes.
Regulation of gene expression requires the binding of a TF to the gene promoter and other regulatory elements. In this study, we used different approaches to investigate the TF dynamics during spermatogonial differentiation and revealed known and novel TF regulators. We found several KLFs with increased gene scores and increased TF binding activity in the Diff.SPG by the pseudotime analysis. KLFs are zinc finger-containing TFs that regulate development, proliferation and differentiation and are expressed in a wide range of cell types35. Some of the TFs that were identified in our study have been implicated in cell differentiation, such as Klf2 in osteoblasts and Klf11 in adipocytes36,37,38. Therefore, it is plausible that these KLFs also drive spermatogonial differentiation.
Our study focused on chromatin accessibility and did not include scRNA-seq analysis of GSCs. Recently, multimodal single cell profiling (“multi-omics”) has greatly improved our ability to detect unique cell types and states. Joint data of scATAC-seq and scRNA-seq from the same cell showed that they are comparable for determining cell identity, suggesting cells coordinate chromatin structure with transcription. Nevertheless, some cell states may not reflect equally in both profiles. Future studies would benefit from joint profiling to provide additional information that further elucidates spermatogonial differentiation. Despite the limitations, our results expand our recognition of the molecular mechanisms and epigenetic regulation underlying the action of RA as the key regulator of spermatogonial differentiation. Our work thus provides a comprehensive resource for understanding the heterogeneity associated with RA response and the TF dynamics related to spermatogonial differentiation in GSCs.
Methods
Animals
All animal procedures were performed according to protocols approved by the Animal Experimentation Ethics Committee (AEEC) of the Chinese University of Hong Kong and following the Animals (Control of Experiments) Ordinance (Cap. 340) licensed from the Hong Kong Government Department of Health. All the mice were housed under a cycle of 12-h light/dark and kept in ad libitum feeding and controlled the temperature of 22–24 °C. Oct4-EGFP transgenic mice (B6;CBA-Tg(Pou5f1-EGFP)2Mnn/J, Stock no.: 004654) were acquired from The Jackson Laboratory and maintained in CUHK Laboratory Animal Services Center39. All experiments were done in accordance with ARRIVE guidelines and complied with the ethical guidelines of the Animal Experimental Ethics Committee (AEEC) of the Chinese University of Hong Kong.
SSC isolation and derivation of long-term culture
Spermatogonial stem cells from testes of Oct4-EGFP transgenic mice at P5.5 were purified using the method described previously17. Isolated seminiferous tubules were digested with 1 mg/ml type 4 collagenase (Gibco), 1 mg/ml hyaluronidase (Sigma-Aldrich) and 5 µg/ml DNase I (Sigma-Aldrich) at 37 °C for 20 min with occasional shaking. The suspension was passed through a 40-µm strainer cap (BD Falcon) to yield a uniform single cell suspension. After incubation in staining buffer (PBS supplemented with 1% FBS, HEPES, glucose, pyruvate and penicillin–streptomycin) with APC anti mouse CD117 c-KIT antibody (553356, BD Biosciences) at 4 °C for 30 min, Oct4-EGFP+/KIT− cells were collected with a BD FACSAria Fusion Flow Cytometer (BD Biosciences). Primary SSCs were then cultured on the mouse embryonic fibroblast (MEF) layer (150,000 cells per well of a 24 well tissue culture plate). GSC medium (Supplementary Table S5) was used for GSC maintenance. GSCs were passaged by treating with 0.25% trypsin (Invitrogen) for 2 min and adding fresh medium to stop trypsinization. After centrifugation at 1000 rpm for 5 min, the cell pellet was resuspended in medium with a density of 100,000 cells per well of a 24 well tissue culture plate seeded with MEF.
In vitro retinoic acid treatment and flow cytometry
For retinoic acid (RA) treatment, GSCs were cultured in complete GSC medium containing 0.5 μM RA (R2625, Sigma) for 24 h before flow cytometry analysis. Medium containing DMSO was used as a control.
GSCs were subjected to flow cytometry analysis after trypsin digestion. For analysis, cells were incubated for 30 min at 4 °C in staining buffer with APC anti mouse CD117 c-KIT antibody. Flow cytometry was performed on a BD FACSAria Fusion Flow Cytometer (BD Biosciences). Data were analyzed by FlowJo software (FlowJo, LLC).
scATAC-seq analysis
Sample collection
Cells at passage 5 were used for experiments. For the RA treatment group, 5 wells of cells were seeded on MEF feeder cells and treated with 0.5 μM RA. On the day of the scATAC-seq experiment, the cells were trypsinized and collected. Cells were purified using FACS to remove MEF feeder cells, cell debris and cell aggregate, and pooled together in equal numbers (total 120,000 cells for library construction). For the control group, cells were cultured on the MEF feeder cells and were treated with vehicle (DMSO) for 48 h.
Cell lysis and tagmentation
Cell tagmentation was performed according to SureCell ATAC-Seq Library Prep Kit (17004620, Bio-Rad) User Guide (10000106678, Bio-Rad) and the protocol based on Omni-ATAC was followed40. In brief, washed and pelleted cells were lysed with the Omni-ATAC lysis buffer containing 0.1% NP-40, 0.1% Tween-20, 0.01% digitonin, 10 mM NaCl, 3 mM MgCl2 and 10 mM Tris–HCl pH 7.4 for 3 min on ice. The lysis buffer was diluted with ATAC-Tween buffer that contains 0.1% Tween-20 as a detergent. Nuclei were counted and examined under microscope to ensure successful isolation. Same number of nuclei were subjected to tagmentation with equal ratio of cells/Tn5 transposase to minimize potential batch effect. Nuclei were resuspended in tagmentation mix, buffered with 1 × PBS supplemented with 0.1% BSA and agitated on a ThermoMixer for 30 min at 37 °C. Tagmented nuclei were kept on ice before encapsulation.
scATAC-seq library preparation and sequencing
Tagmented nuclei were loaded onto a ddSEQ Single-Cell Isolator (Bio-Rad). scATAC-seq libraries were prepared using the SureCell ATAC-Seq Library Prep Kit (17004620, Bio-Rad) and SureCell ATAC-Seq Index Kit (12009360, Bio-Rad). Bead barcoding and sample indexing were performed with PCR amplification as follows: 37 °C for 30 min, 85 °C for 10 min, 72 °C for 5 min, 98 °C for 30 s, eight cycles of 98 °C for 10 s, 55 °C for 30 s and 72 °C for 60 s, and a single 72 °C extension for 5 min to finish. Emulsions were broken and products were cleaned up using AMpure XP beads. Barcoded amplicons were further amplified for 8 cycles. PCR products were purified using AMpure XP beads and quantified on an Agilent Bioanalyzer (G2939BA, Agilent) using the High-Sensitivity DNA kit (5067-4626, Agilent). Libraries were sequenced on HiSeq 2000 with 150 bp paired-end reads.
Sequencing reads preprocessing
Sequencing data were processed using the Bio-Rad ATAC-Seq Analysis Toolkit (v1.0.1). This toolkit is a streamlined computational pipeline, including tools for FASTQ debarcoding, read trimming, alignment, bead filtration, bead deconvolution, cell filtration and peak calling. The reference index was built upon the mouse genome mm10. For generation of the fragments file, which contain the start and end genomic coordinates of all aligned sequenced fragments, sorted bam files were further process with “bap-frag” module of BAP (https://github.com/caleblareau/bap, v0.6.0). Downstream analysis was performed with ArchR (ArchR_1.0.2)41. Fragment files were used to create the Arrow files in the ArchR package.
Clustering and gene score/transcription factor activity analysis
We filtered out low-quality nuclei with stringent selection criteria, including read depth per cell (> 2000) and TSS enrichment score (> 4). Potential doubles were further removed based on the ArchR method. Bin regions were cleaned by eliminating bins overlapping with ENCODE Blacklist regions, mitochondrial DNA as well as the top 5% of invariant features (house-keeping gene promoters). ArchR was used to estimate gene expression for genes and TF motif activity from single cell chromatin accessibility data. Gene scores were calculated using the addGeneScoreMatrix function with gene score models implemented in ArchR. addDeviationsMatrix function was used to compute enrichment of TF activity on a per-cell basis across all motif annotations based on chromVAR (chromVAR_0.3). Dimensionality reduction was performed using the addIterativeLSI function and the cell embedding was generated using the addUMAP in ArchR. The clustering was performed using Seurat (Seurat_4.2.0) in ArchR. To identify marker genes based on gene scores, we used the getMarkerFeatures function in ArchR with useMatrix = "GeneScoreMatrix". ArchR performed Wilcoxon testing for comparing the CTRL and RA groups after accounting for the biases (i.e., “TSSEnrichment" score and "log10(nFrags)". These P values are then adjusted for multiple hypothesis using the FDR method. The volcano plot was generated by EnhancedVolcano package (https://github.com/kevinblighe/EnhancedVolcano).
cisTopic analysis
Topic modeling was performed using 10–35 (1 by 1), 40 and 500 iterations with cisTopic (cisTopic_0.3.0)42. The model was selected based on the highest log-likelihood. The cell-topic UMAP representation was obtained by using UMAP on the normalized topic-cell matrix (by Probability). Cell clustering result (6 clusters) from ArchR was used for visualization. Heatmap was then generated based on the cell-cisTopic distributions to identify topics associated with each cluster. To select a representative set of regions of the topic, Region-topic distributions were binarized with a probability threshold of 0.99 to identify the top contributing regions in each topic. For identifying enriched GO terms per topic, the binarized topics (i.e. sets of top regions per topic) were analyzed over GREAT (rGREAT_1.22.0) or clusterProfiler (clusterProfiler_3.18.1)43,44.
Trajectory analysis
Trajectory analysis was performed in ArchR. addTrajectory function in ArchR was used to construct trajectory on UMAP embedding. To perform integrative analyses for identification of positive TF regulators by integration of gene scores with motif accessibility across pseudo-time, we used the correlateTrajectories function which takes two SummarizedExperiment objects retrieved from the getTrajectories function.
Github repository
The analytical codes used in this study are available on the GitHub repository (https://github.com/liaojinyue/SSC_culture_RA_scATAC).
Statistical analysis
Assessment of statistical significance was performed using two-tailed unpaired t-tests, one-way ANOVA with Tukey multiple comparisons tests or Chi-squared tests. Statistical analysis was performed using GraphPad Prism v8. Associated P values are indicated as follows: *P < 0.05; **P < 0.01; ***P < 0.001; ****P < 0.0001; not significant (ns) P > 0.05.
Data availability
The datasets generated and/or analyzed during the current study are available in the NCBI Gene Expression Omnibus (GEO) repository, under accession number GSE167531 (https://www.ncbi.nlm.nih.gov/geo/).
References
Oatley, J. M. & Brinster, R. L. Regulation of spermatogonial stem cell self-renewal in mammals. Annu. Rev. Cell Dev. Biol. 24, 263–286 (2008).
He, Z., Jiang, J., Hofmann, M.-C. & Dym, M. Gfra1 silencing in mouse spermatogonial stem cells results in their differentiation via the inactivation of RET tyrosine kinase. Biol. Reprod. 77, 723–733 (2007).
Sun, F., Xu, Q., Zhao, D. & Degui Chen, C. Id4 marks spermatogonial stem cells in the mouse testis. Sci. Rep. 5, 17594 (2015).
Oatley, J. M., Avarbock, M. R., Telaranta, A. I., Fearon, D. T. & Brinster, R. L. Identifying genes important for spermatogonial stem cell self-renewal and survival. Proc. Natl. Acad. Sci. USA 103, 9524–9529 (2006).
La, H. M. et al. Identification of dynamic undifferentiated cell states within the male germline. Nat. Commun. 9, 2819 (2018).
Wu, X., Goodyear, S. M., Tobias, J. W., Avarbock, M. R. & Brinster, R. L. Spermatogonial stem cell self-renewal requires ETV5-mediated downstream activation of Brachyury in mice. Biol. Reprod. 85, 1114–1123 (2011).
Zhang, L. et al. c-kit and its related genes in spermatogonial differentiation. Spermatogenesis 1, 186–194 (2011).
Hogarth, C. A. & Griswold, M. D. The key role of vitamin A in spermatogenesis. J. Clin. Invest. 120, 956–962 (2010).
de Rooij, D. G. The nature and dynamics of spermatogonial stem cells. Development 144, 3022–3030 (2017).
Lord, T. & Oatley, J. M. A revised Asingle model to explain stem cell dynamics in the mouse male germline. Reproduction 154, R55–R64 (2017).
Bastien, J. & Rochette-Egly, C. Nuclear retinoid receptors and the transcription of retinoid-target genes. Gene 328, 1–16 (2004).
Gely-Pernot, A. et al. Retinoic acid receptors control spermatogonia cell-fate and induce expression of the SALL4A transcription factor. PLoS Genet. 11, e1005501 (2015).
Siggers, T. & Gordân, R. Protein-DNA binding: Complexities and multi-protein codes. Nucleic Acids Res. 42, 2099–2111 (2014).
Zhao, Y., Zheng, D. & Cvekl, A. Profiling of chromatin accessibility and identification of general cis-regulatory mechanisms that control two ocular lens differentiation pathways. Epigenet. Chromatin 12, 27 (2019).
Long, H. K., Prescott, S. L. & Wysocka, J. Ever-changing landscapes: Transcriptional enhancers in development and evolution. Cell 167, 1170–1187 (2016).
Maezawa, S., Yukawa, M., Alavattam, K. G., Barski, A. & Namekawa, S. H. Dynamic reorganization of open chromatin underlies diverse transcriptomes during spermatogenesis. Nucleic Acids Res. 46, 593–608 (2018).
Garcia, T. & Hofmann, M.-C. Isolation of undifferentiated and early differentiating type A spermatogonia from Pou5f1-GFP reporter mice. Methods Mol. Biol. 825, 31–44 (2012).
Azizi, H., Skutella, T. & Shahverdi, A. Generation of mouse spermatogonial stem-cell-colonies in a non-adherent culture. Cell J. 19, 238–249 (2017).
Masaki, K. et al. FGF2 has distinct molecular functions from GDNF in the mouse germline Niche. Stem Cell Rep. 10, 1782–1792 (2018).
Wang, S. et al. Retinoic acid is sufficient for the in vitro induction of mouse spermatocytes. Stem Cell Rep. 7, 80–94 (2016).
Takashima, S. et al. Functional differences between GDNF-dependent and FGF2-dependent mouse spermatogonial stem cell self-renewal. Stem Cell Rep. 4, 489–502 (2015).
Haneji, T., Koide, S. S., Tajima, Y. & Nishimune, Y. Differential effects of epidermal growth factor on the differentiation of type A spermatogonia in adult mouse cryptorchid testes in vitro. J. Endocrinol. 128, 383–388 (1991).
Zhang, J. et al. Neuregulins are essential for spermatogonial proliferation and meiotic initiation in neonatal mouse testis. Development 138, 3159–3168 (2011).
Sharma, M. et al. Identification of EOMES-expressing spermatogonial stem cells and their regulation by PLZF. Elife 8, 25 (2019).
McAninch, D. et al. SOX3 promotes generation of committed spermatogonia in postnatal mouse testes. Sci. Rep. 10, 6751 (2020).
Chen, H. et al. Assessment of computational methods for the analysis of single-cell ATAC-seq data. Genome Biol. 20, 241 (2019).
Wang, Y. et al. Transcriptional repression of CDKN2D by PML/RARα contributes to the altered proliferation and differentiation block of acute promyelocytic leukemia cells. Cell Death Dis. 5, e1431 (2014).
Schmidt, K. L. et al. The cell migration molecule UNC-53/NAV2 is linked to the ARP2/3 complex by ABI-1. Development 136, 563–574 (2009).
Schep, A. N., Wu, B., Buenrostro, J. D. & Greenleaf, W. J. chromVAR: Inferring transcription-factor-associated accessibility from single-cell epigenomic data. Nat. Methods 14, 975–978 (2017).
Song, W. et al. E-cadherin maintains the undifferentiated state of mouse spermatogonial progenitor cells via β-catenin. Cell Biosci. 12, 141 (2022).
Wu, Y. et al. Transcription factor RFX2 is a key regulator of mouse spermiogenesis. Sci. Rep. 6, 20435 (2016).
Qin, P., Haberbusch, J. M., Soprano, K. J. & Soprano, D. R. Retinoic acid regulates the expression of PBX1, PBX2, and PBX3 in P19 cells both transcriptionally and post-translationally. J. Cell. Biochem. 92, 147–163 (2004).
Iriyama, N. et al. Enhancement of differentiation induction and upregulation of CCAAT/enhancer-binding proteins and PU.1 in NB4 cells treated with combination of ATRA and valproic acid. Int. J. Oncol. 44, 865–873 (2014).
Hu, W., Gauthier, L., Baibakov, B., Jimenez-Movilla, M. & Dean, J. FIGLA, a basic helix-loop-helix transcription factor, balances sexually dimorphic gene expression in postnatal oocytes. Mol. Cell. Biol. 30, 3661–3671 (2010).
McConnell, B. B. & Yang, V. W. Mammalian Krüppel-like factors in health and diseases. Physiol. Rev. 90, 1337–1381 (2010).
Hou, Z. et al. KLF2 regulates osteoblast differentiation by targeting of Runx2. Lab. Invest. 99, 271–280 (2019).
Li, D. et al. Krüppel-like factor-6 promotes preadipocyte differentiation through histone deacetylase 3-dependent repression of DLK1. J. Biol. Chem. 280, 26941–26952 (2005).
Yamamoto, K. et al. Transcriptional regulation of a brown adipocyte-specific gene, UCP1, by KLF11 and KLF15. Biochem. Biophys. Res. Commun. 400, 175–180 (2010).
Ohbo, K. et al. Identification and characterization of stem cells in prepubertal spermatogenesis in mice. Dev. Biol. 258, 209–225 (2003).
Corces, M. R. et al. An improved ATAC-seq protocol reduces background and enables interrogation of frozen tissues. Nat. Methods 14, 959–962 (2017).
Granja, J. M. et al. ArchR is a scalable software package for integrative single-cell chromatin accessibility analysis. Nat. Genet. 53, 403–411 (2021).
Bravo González-Blas, C. et al. cisTopic: Cis-regulatory topic modeling on single-cell ATAC-seq data. Nat. Methods 16, 397–400 (2019).
McLean, C. Y. et al. GREAT improves functional interpretation of cis-regulatory regions. Nat. Biotechnol. 28, 495–501 (2010).
Yu, G., Wang, L.-G., Han, Y. & He, Q.-Y. clusterProfiler: An R package for comparing biological themes among gene clusters. OMICS 16, 284–287 (2012).
Acknowledgements
We thank Tommy Lo in Bio-rad for scATAC-Seq experimental assistance. We also thank the SBS Core Laboratories in The Chinese University of Hong Kong for technical support. We would like to acknowledge Dr. Tin-Lap Lee for providing financial support for this research project.
Funding
JL was supported by the Chinese University of Hong Kong, Department of Chemical Pathology’s Faculty Startup Fund.
Author information
Authors and Affiliations
Contributions
H.C.S. and J.L. conceived and designed the study. H.C.S., A.C.S.L. and J.L. performed experiments. H.C.S. and J.L. analyzed and interpreted data. H.C.S. and J.L. wrote the paper. H.C.S., A.C.S.L. and J.L. reviewed and edited the manuscript. All authors read and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Suen, H.C., Luk, A.C.S. & Liao, J. scATAC-Seq reveals heterogeneity associated with spermatogonial differentiation in cultured male germline stem cells. Sci Rep 12, 21482 (2022). https://doi.org/10.1038/s41598-022-25729-7
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-022-25729-7
- Springer Nature Limited