Pentad: a tool for distance-dependent analysis of Hi-C interactions within and between chromatin compartments

Magnitov, Mikhail D.; Garaev, Azat K.; Tyakht, Alexander V.; Ulianov, Sergey V.; Razin, Sergey V.

doi:10.1186/s12859-022-04654-6

Pentad: a tool for distance-dependent analysis of Hi-C interactions within and between chromatin compartments

Software
Open access
Published: 02 April 2022

Volume 23, article number 116, (2022)
Cite this article

Download PDF

You have full access to this open access article

BMC Bioinformatics Aims and scope Submit manuscript

Pentad: a tool for distance-dependent analysis of Hi-C interactions within and between chromatin compartments

Download PDF

Mikhail D. Magnitov ORCID: orcid.org/0000-0002-3628-1663¹^na1,
Azat K. Garaev²^na1,
Alexander V. Tyakht¹,
Sergey V. Ulianov^1,2 &
…
Sergey V. Razin^1,2

2680 Accesses
3 Citations
3 Altmetric
Explore all metrics

Abstract

Background

Understanding the role of various factors in 3D genome organization is essential to determine their impact on shaping large-scale chromatin units such as euchromatin (A) and heterochromatin (B) compartments. At this level, chromatin compaction is extensively modulated when transcription and epigenetic profiles change upon cell differentiation and response to various external impacts. However, detailed analysis of chromatin contact patterns within and between compartments is complicated because of a lack of suitable computational methods.

Results

We developed a tool, Pentad, to perform calculation, visualisation and quantitative analysis of the average chromatin compartment from the Hi-C matrices in cis, trans, and specified genomic distances. As we demonstrated by applying Pentad to publicly available Hi-C datasets, it helps to reliably detect redistribution of contact frequency in the chromatin compartments and assess alterations in the compartment strength.

Conclusions

Pentad is a simple tool for the analysis of changes in chromatin compartmentalization in various biological conditions. Pentad is freely available at https://github.com/magnitov/pentad.

Evaluation of 3D Chromatin Interactions Using Hi-C

Comparison of computational methods for Hi-C data analysis

Article 12 June 2017

Methods for the Differential Analysis of Hi-C Data

Background

High-throughput chromosome conformation capture (Hi-C) studies of the 3D genome architecture have revealed several features of spatial genome organization in higher eukaryotes. Within the chromosome territories [1], transcriptionally active and repressed loci are spatially segregated into A and B compartments [2], that closely resemble eu- and heterochromatin, respectively. At the scale of megabases, chromatin is folded into topologically associated domains (TADs) [3, 4], commonly interpreted as relatively stable globules. In mammals, TAD boundaries are enriched in CTCF/cohesin binding [3] and demarcate areas of enhancer action [5]. Regulatory elements within TADs, such as promoters and enhancers, interact with each other and form chromatin loops, whose bases are frequently marked with binding of architectural proteins such as CTCF [6], YY1 [7], ZNF143 [8], and others [9, 10]. As revealed by the depletion of subunits of the cohesin complex [11] and CTCF [12], the overwhelming majority of TADs and loops in mammalian cells are established by cohesin-driven CTCF-restricted chromatin fiber extrusion. In contrast, mechanisms of compartment formation and maintenance are largely unknown. Compartment profile along the genome and contact patterns within A/B compartments are sensitive to changes in gene expression during cell differentiation [13] and cell senescence [14, 15], alter in response to osmotic stress [16] and depend on the activity of loop extrusion machinery [17, 18]. Despite the increasing number of observations on dynamics of compartment structure in different biological conditions, the determinants of genome compartmentalization remain elusive [19]. Thus, multiple ongoing studies aim to shed light on the aspects of compartment formation [20].

In contrast to TAD and loop annotation and visualization tools (Additional file 1: Table S1), only a limited number of methods for A/B compartments annotation and analysis are available. For instance, compartments were initially discovered using principal component analysis (PCA) [2] which became a method of choice for compartment annotation. Recently, CscoreTool [21] and POSSUMM [22] were reported as a PCA-based memory-efficient algorithms for compartment annotation, while SNIPER [23] and Calder [24] algorithms were developed for sub-compartment detection in moderately covered Hi-C data and at various map resolutions, respectively. However, averaged contact frequency between genomic bins belonging to different compartments is mostly analysed using the saddle plot diagram [25, 26]. Despite its utility, saddle plot representation is clearly lacking the separation of short- and long-range interactions, and is not convenient to analyze the average contact frequencies at a predefined scale. Thus, the available tools cannot systematically probe the dynamics and perturbations of chromatin contact patterns within compartments. To fill this gap, we developed a new tool, Pentad, which can calculate, visualize and quantify the average compartment structure within a predefined range of genomic distances. Using published Hi-C datasets, we demonstrate that Pentad accurately detects the redistribution of contacts between and within A and B compartments without requiring additional analyses.

Implementation

The average compartment visualisation provided by Pentad represents short- and long-range contacts within A and B compartments together with intercompartmental interactions. The visualisation comprises several types of areas from the Hi-C matrix that are determined based on the annotated A/B compartment signal, which is usually a first principal component (PC1) from PCA of the Hi-C matrix (Fig. 1A). The obtained visualisation is then used to estimate the average compartment strength.

To create an average compartment visualisation, compartment areas of different types are extracted from the observed-over-expected Hi-C matrix and subjected to filtering. First, areas are filtered based on their dimensions in genomic bins, because small areas are likely to represent noisy regions of the Hi-C matrix. Next, areas with a low number of contacts are removed because of their poor resolution. Finally, areas at a distance between the anchors larger than a specific cutoff value are removed. Areas that meet the criteria are then rescaled using bilinear interpolation into squares of a predefined size. Rescaled areas of the same type are averaged genome wide using median pixel values, and they are aggregated into one plot.

To calculate compartment strength, the mean value of contacts from areas representing interactions within A and B compartments are divided by the mean value of contacts between these compartments (Fig. 1B). To avoid bias towards low values of the compartment signal when estimating intercompartment interactions, the edges of the corresponding average compartment square are cropped to remove residual interactions occurring in the A and B compartments. Compartment strength is calculated for each chromosome from the Hi-C matrix, enabling a comparison of the results with statistical tests.

Current implementation of Pentad is provided as a set of Python scripts that can average cis and trans Hi-C interactions, to stratify the compartment areas by genomic distance, and calculate compartment strength directly from the average compartments (see Additional file 1: Methods and Additional file 1: Figure S1 for more details). The required input files are a Hi-C matrix in cooler format [27] and a compartment signal in the bedGraph format.

Results

To demonstrate the utility of the Pentad algorithm, it was first applied to the Hi-C datasets with a known impact on the compartment’s structure. Thus, we focused on conditional knock-outs of cohesin loading factor NIPBL [18] and cohesin release factor WAPL [17] in mammalian cells. As previously reported, removing NIPBL enhances chromatin compartmentalization, and knocking out WAPL compromises the segregation of A and B compartments. We confirmed the increase in compartment segregation in NIPBL-deficient cells (Fig. 2A, the central square of the average compartment), and we found that both A and B compartments gain interactions at long genomic distances. In addition, we showed that increased compaction of the A compartment is provided by a shift of the interactions from the main diagonal of the Hi-C matrix to longer distances because of the disruption of TADs. In WAPL-deficient cells (Fig. 2B), we observed decreased compartment segregation, with the B compartment losing interactions at all genomic distances and the A compartment losing interactions only on long-range distances. We also observed a gain of contacts at short genomic distances for the A compartment, potentially caused by an increased number of loops upon WAPL knock-out.

We next applied Pentad to a time-course datasets to assess its ability to capture the A/B compartment dynamics. First, we probed the compartmentalization that occurs when human cells transition from mitosis to G1 [28]. As expected, in the prometaphase and at the entry of G1, we did not see any compartment structure. It emerges 3 h after the release of the cells from prometaphase arrest (Fig. 3A). When applied to the compartments stratified by genomic distance, Pentad revealed that A and B compartments have different assembly kinetics at short and long distances (Figs. 3B, 3C). Second, we inspected changes in compartmentalization during the early development of mouse embryos [29]. Here, we observed a prolonged formation of chromatin compartments, which are reduced after fertilisation and re-established during preimplantation development (Fig. 4A). By analysing allele-specific Hi-C contact matrices, we detected that compartmentalization already occurs in zygotes for the paternal genome, but it is weakly pronounced until the later stages for the maternal genome for short-range A and long-range B compartments (Figs. 4B, 4C).

Conclusions

Pentad is a simple tool that allows one to analyse chromatin compartments based on a Hi-C matrix and compartment signal only. Our results demonstrate the tool’s utility for quantitative analysis of A/B compartments and tracing the changes of the average compartment structure at different genomic scales in various biological conditions. It is fast and easy to use, and it provides reliable results, and this makes Pentad a useful tool for analysing the impact of various factors on the 3D genome organization. We anticipate that Pentad could simplify data interpretation and stimulate formulating novel hypotheses to understand the mechanisms underlying chromatin compartments formation, and would be used for the analysis of A/B compartment structure in a wide range of biological conditions and model systems.

Availability and requirements

Project name: Pentad.

Project home page: https://github.com/magnitov/pentad.

Operating system(s): Platform independent.

Programming language: Python.

Other requirements: conda.

License: MIT License.

Any restrictions to use by non-academics: None.

Availability of data and materials

The datasets re-analysed during the current study are available in the NCBI GEO repository via accession numbers GSE63525, GSE93431, GSE95014, GSE133462, GSE82185. The developed tool and code used for the analysis are available at https://github.com/magnitov/pentad.

Abbreviations

3D:: Three-dimensional
CTCF:: CCCTC-binding factor
GEO:: Gene Expression Omnibus
Hi-C:: High-throughput chromosome conformation capture
ICM:: Inner cell mass
NIPBL:: Nipped-B-like protein
PCA:: Principal component analysis
TAD:: Topologically associated domains
WAPL:: Wings apart-like protein homolog
YY1:: Yin yang 1
ZNF143:: Zinc finger protein 143

References

Cremer T, Cremer M. Chromosome territories. Cold Spring Harb Perspect Biol. 2010;2(3):a003889.
Article Google Scholar
Lieberman-Aiden E, van Berkum NL, Williams L, Imakaev M, Ragoczy T, Telling A, et al. Comprehensive mapping of long-range interactions reveals folding principles of the human genome. Science. 2009;326(5950):289–93.
Article CAS Google Scholar
Dixon JR, Selvaraj S, Yue F, Kim A, Li Y, Shen Y, et al. Topological domains in mammalian genomes identified by analysis of chromatin interactions. Nature. 2012;485(7398):376–80.
Article CAS Google Scholar
Nora EP, Lajoie BR, Schulz EG, Giorgetti L, Okamoto I, Servant N, et al. Spatial partitioning of the regulatory landscape of the X-inactivation centre. Nature. 2012;485(7398):381–5.
Article CAS Google Scholar
Tena JJ, Santos-Pereira JM. Topologically associating domains and regulatory landscapes in development, evolution and disease. Front Cell Dev Biol. 2021;9:702787.
Article Google Scholar
Rao SSP, Huntley MH, Durand NC, Stamenova EK, Bochkov ID, Robinson JT, et al. A 3D map of the human genome at kilobase resolution reveals principles of chromatin looping. Cell. 2014;159(7):1665–80.
Article CAS Google Scholar
Weintraub AS, Li CH, Zamudio AV, Sigova AA, Hannett NM, Day DS, et al. YY1 is a structural regulator of enhancer-promoter loops. Cell. 2017;171(7):1573-88.e28.
Article CAS Google Scholar
Zhou Q, Yu M, Tirado-Magallanes R, Li B, Kong L, Guo M, et al. ZNF143 mediates CTCF-bound promoter-enhancer loops required for murine hematopoietic stem and progenitor cell function. Nat Commun. 2021;12(1):43.
Article CAS Google Scholar
Phillips-Cremins JE, Sauria MEG, Sanyal A, Gerasimova TI, Lajoie BR, Bell JSK, et al. Architectural protein subclasses shape 3D organization of genomes during lineage commitment. Cell. 2013;153(6):1281–95.
Article CAS Google Scholar
Hsieh T-HS, Cattoglio C, Slobodyanyuk E, Hansen AS, Rando OJ, Tjian R, et al. Resolving the 3D landscape of transcription-linked mammalian chromatin folding. Mol Cell. 2020;78(3):539-53.e8.
Article CAS Google Scholar
Rao SSP, Huang S-C, Glenn St Hilaire B, Engreitz JM, Perez EM, Kieffer-Kwon K-R, et al. Cohesin loss eliminates all loop domains. Cell. 2017;171(2):305-20.e24.
Article CAS Google Scholar
Nora EP, Goloborodko A, Valton A-L, Gibcus JH, Uebersohn A, Abdennur N, et al. Targeted degradation of CTCF decouples local insulation of chromosome domains from genomic compartmentalization. Cell. 2017;169(5):930-44.e22.
Article CAS Google Scholar
Dixon JR, Jung I, Selvaraj S, Shen Y, Antosiewicz-Bourget JE, Lee AY, et al. Chromatin architecture reorganization during stem cell differentiation. Nature. 2015;518(7539):331–6.
Article CAS Google Scholar
Criscione SW, De Cecco M, Siranosian B, Zhang Y, Kreiling JA, Sedivy JM, et al. Reorganization of chromosome architecture in replicative cellular senescence. Sci Adv. 2016;2(2):e1500882.
Article Google Scholar
Sati S, Bonev B, Szabo Q, Jost D, Bensadoun P, Serra F, et al. 4D genome rewiring during oncogene-induced and replicative senescence. Mol Cell. 2020;78(3):522-38.e9.
Article CAS Google Scholar
Amat R, Böttcher R, Le Dily F, Vidal E, Quilez J, Cuartero Y, et al. Rapid reversible changes in compartments and local chromatin organization revealed by hyperosmotic shock. Genome Res. 2019;29(1):18–28.
Article CAS Google Scholar
Haarhuis JHI, van der Weide RH, Blomen VA, Yáñez-Cuna JO, Amendola M, van Ruiten MS, et al. The Cohesin release factor WAPL restricts chromatin loop extension. Cell. 2017;169(4):693-707.e14.
Article CAS Google Scholar
Schwarzer W, Abdennur N, Goloborodko A, Pekowska A, Fudenberg G, Loe-Mie Y, et al. Two independent modes of chromatin organization revealed by cohesin removal. Nature. 2017;551(7678):51–6.
Article Google Scholar
Falk M, Feodorova Y, Naumova N, Imakaev M, Lajoie BR, Leonhardt H, et al. Heterochromatin drives compartmentalization of inverted and conventional nuclei. Nature. 2019;570(7761):395–9.
Article CAS Google Scholar
Mirny LA, Imakaev M, Abdennur N. Two major mechanisms of chromosome organization. Curr Opin Cell Biol. 2019;58:142–52.
Article CAS Google Scholar
Zheng X, Zheng Y. CscoreTool: fast Hi-C compartment analysis at high resolution. Bioinformatics. 2018;34(9):1568–70.
Article CAS Google Scholar
Gu H, Harris H, Olshansky M, Eliaz Y, Krishna A, Kalluchi A, et al. Fine-mapping of nuclear compartments using ultra-deep Hi-C shows that active promoter and enhancer elements localize in the active A compartment even when adjacent sequences do not. https://doi.org/10.1101/2021.10.03.462599v2 (2021).
Xiong K, Ma J. Revealing Hi-C subcompartments by imputing inter-chromosomal chromatin interactions. Nat Commun. 2019;10(1):5069.
Article Google Scholar
Liu Y, Nanni L, Sungalee S, Zufferey M, Tavernari D, Mina M, et al. Systematic inference and comparison of multi-scale chromatin sub-compartments connects spatial organization to cell phenotypes. Nat Commun. 2021;12(1):2439.
Article CAS Google Scholar
Flyamer IM, Gassler J, Imakaev M, Brandão HB, Ulianov SV, Abdennur N, et al. Single-nucleus Hi-C reveals unique chromatin reorganization at oocyte-to-zygote transition. Nature. 2017;544(7648):110–4.
Article CAS Google Scholar
Imakaev M, Fudenberg G, McCord RP, Naumova N, Goloborodko A, Lajoie BR, et al. Iterative correction of Hi-C data reveals hallmarks of chromosome organization. Nat Methods. 2012;9(10):999–1003.
Article CAS Google Scholar
Abdennur N, Mirny LA. Cooler: scalable storage for Hi-C data and other genomically labeled arrays. Bioinformatics. 2020;36(1):311–6.
Article CAS Google Scholar
Abramo K, Valton A-L, Venev SV, Ozadam H, Fox AN, Dekker J. A chromosome folding intermediate at the condensin-to-cohesin transition during telophase. Nat Cell Biol. 2019;21(11):1393–402.
Article CAS Google Scholar
Du Z, Zheng H, Huang B, Ma R, Wu J, Zhang X, et al. Allelic reprogramming of 3D chromatin architecture during early mammalian development. Nature. 2017;547(7662):232–5.
Article CAS Google Scholar

Download references

Acknowledgements

Not applicable.

Funding

This work was supported by the Russian Science Foundation (RSF) grant #19-74-10009. Analysis of publicly available datasets was supported by the RSF grant #21-64-00001. S.V.U. was supported by the Russian Foundation for Basic Research #20-54-12022. S.V.R. was supported by the Russian Foundation for Basic Research #18-29-13013 and #17-00-00179. S.V.U. and S.V.R. were supported by the Interdisciplinary Scientific and Educational School of Moscow University “Molecular Technologies of the Living Systems and Synthetic Biology”. The funding body did not play any role in the design and implementation of the tool, in collection of the data and in writing the manuscript.

Author information

Mikhail D. Magnitov and Azat K. Garaev have contributed equally to this work.

Authors and Affiliations

Institute of Gene Biology, Russian Academy of Sciences, Moscow, Russia, 119334
Mikhail D. Magnitov, Alexander V. Tyakht, Sergey V. Ulianov & Sergey V. Razin
Faculty of Biology, Lomonosov Moscow State University, Moscow, Russia, 119234
Azat K. Garaev, Sergey V. Ulianov & Sergey V. Razin

Authors

Mikhail D. Magnitov
View author publications
You can also search for this author in PubMed Google Scholar
Azat K. Garaev
View author publications
You can also search for this author in PubMed Google Scholar
Alexander V. Tyakht
View author publications
You can also search for this author in PubMed Google Scholar
Sergey V. Ulianov
View author publications
You can also search for this author in PubMed Google Scholar
Sergey V. Razin
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.D.M. designed the study, performed computational analysis, and wrote the manuscript. A.K.G. designed the study and performed computational analysis. A.V.T. designed the study and provided feedback. S.V.U. and S.V.R. supervised the study, provided feedback, and wrote the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Mikhail D. Magnitov.

Ethics declarations

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1.

Supplementary Information: Supplementary Methods, Figure S1 with Pentad technical performance assessment, and Table S1 with a list of available tools for the pile-up analysis of Hi-C features.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Magnitov, M.D., Garaev, A.K., Tyakht, A.V. et al. Pentad: a tool for distance-dependent analysis of Hi-C interactions within and between chromatin compartments. BMC Bioinformatics 23, 116 (2022). https://doi.org/10.1186/s12859-022-04654-6

Download citation

Received: 16 December 2021
Accepted: 25 March 2022
Published: 02 April 2022
DOI: https://doi.org/10.1186/s12859-022-04654-6

Pentad: a tool for distance-dependent analysis of Hi-C interactions within and between chromatin compartments