Abstract
The phylum Cnidaria consists of several morphologically diverse classes including Anthozoa, Cubozoa, Hydrozoa, Polypodiozoa, Scyphozoa, Staurozoa, and Myxozoa. Myxozoa comprises two subclasses of obligate parasites—Myxosporea and Malacosporea, which demonstrate various degrees of simplification. Myxosporea were previously reported to lack the majority of core protein domains of apoptotic proteins including caspases, Bcl-2, and APAF-1 homologs. Other sequenced Cnidaria, including the parasite Polypodium hydriforme from Polypodiozoa do not share this genetic feature. Whether this loss of core apoptotic proteins is unique to Myxosporea or also present in its sister subclass Malacosporea was not previously investigated. We show that the presence of core apoptotic proteins gradually diminishes from free-living Cnidaria to Polypodium to Malacosporea to Myxosporea. This observation does not favor the hypothesis of catastrophic simplification of Myxosporea at the genetic level, but rather supports a stepwise adaptation to parasitism that likely started from early parasitic ancestors that gave rise to Myxozoa.
Similar content being viewed by others
Introduction
Multicellular organisms can get rid of old, damaged, unuseful or precancerous cells by apoptosis, a form of programmed cell death or cell suicide. Key apoptotic proteins include proteolytic enzymes called caspases, which trigger cell death by cleaving specific cellular proteins1. Inactive precursors, or procaspases, are activated through cleavage by other caspases2. Extracellular or intracellular death signals can initiate this proteolytic cascade under tight regulation of adaptor and regulatory proteins such as Fas-associated death domain protein (FADD) and Bcl-2 family proteins3,4.
Variation exists between the apoptotic pathways of invertebrates. For example, the Pacific oyster has mammalian-like apoptotic pathways5, whereas, Caenorhabditis elegans does not use cytochrome C for the apoptotic caspase cascade activation6. Apoptosis has been described and well-studied in model free-living Cnidaria, such as Hydra7 which has numerous caspases, Bcl-2 protein family members, an APAF-1 homolog, components of a putative death receptor pathway and inhibitors of apoptotic proteases. Like other animals, Hydra is capable of developing tumors8 and the transcriptomes of these tumors reveal misregulation of genes related to mammalian apoptosis genes.
Let us review the general features of the better-understood mammalian apoptosis, keeping in mind that similar actors can be found in free-living Cnidaria7,9. There are two principal apoptotic pathways: the extrinsic pathway and intrinsic pathway1. In its simplified version, the extrinsic apoptotic pathway starts with external signals that activate Death domain harboring transmembrane receptors (TNFR/Fas)1. This signal is transduced via adaptor proteins such as TRADD/FADD with Death and Death-effector domains that transform initiator Death-effector domain containing procaspases into active initiator caspases which induce the apoptosis caspase cascade1.
The intrinsic apoptotic pathway is activated in response to various oncogenes and intracellular stressors, including DNA damage, reactive oxygen species, endoplasmic reticulum stress, and hypoxia1. The activation is mediated via transcription factors such as p5310. In response BH3-only Bcl-2 family proteins are expressed, leading to the inhibition of anti-apoptotic Bcl-2. This allows liberated Bak proteins to form channels within the mitochondrial membranes causing cytochrome C release. Apoptotic protease activating factor 1 (APAF-1) proteins bind cytochrome C with their WD40 repeats and then bind with each other via their CARD domains and form the apoptosome1. The apoptosome activates the CARD-domain containing initiator procaspases of the intrinsic apoptotic pathway causing the apoptotic caspase cascade11. Inhibitors of apoptosis proteins (IAPs) inhibit caspases12. More detailed descriptions of apoptosis are reviewed elsewhere1.
Surprisingly, Myxosporea (see Fig. 1B for simplified cladogram) species Thelohanellus kitauei, Kudoa iwatai, Myxobolus pronini, Sphaeromyxa zaharoni, Enteromyxum leei were reported to completely lack a number of Pfam-domains belonging to key apoptotic proteins including caspases, Bcl-2, APAF-1, and p53 homologs13. This finding was interpreted in favor of the SCANDALs (Speciated by CANcer Development AnimaLS) hypothesis that suggested that Myxosporea evolved from a transmissible cancer and have undergone catastrophic simplification. The limitations of that study were the consideration of Pfam-domains only, but not complete genes and that genetic data for the sister subclass Malacosporea was unavailable.
Unlike Myxosporea, Malacosporea are unlikely candidates for the SCANDAL hypothesis because of their more sophisticated morphology and development stages similar to gastrulation14. Malacosporean species, Tetracapsuloides bryosalmonae, can form ~ 350 µm spherical spore sacs15 while another malacosporean, Buddenbrockia plumatellae, forms 1 mm long worm-like sacs with muscle cells that are capable of active movement in their bryozoan hosts16. In the fish host these species form a pseudoplasmodial structure in renal tubules15,17. Tetracapsuloides bryosalmonae causes a proliferative kidney disease in salmonids which affects important wild and aquacultural populations18. Two other described, but less studied species of Malacosporea demonstrate similar morphology and structure14,19.
Recently transcriptomic data for members of Malacosporea Buddenbrockia plumatellae and Tetracapsuloides bryosalmonae became available20,21,22,23. Also, genomic data became available for additional Myxosporea species: Henneguya salminicola, Myxobolus squamalis24, and Myxobolus honghuensis25.
Notably, the Henneguya salminicola has been shown to completely lack a mitochondrial genome which is especially interesting in the context of the apparent loss of apoptosis in Myxosporea species24. We decided to complete this data with our own sequences of Polypodium hydriforme and Myxobolus pronini to comprehensively study the loss of apoptosis-related genes in parasitic Cnidaria.
In this article we applied a number of comparative genomic methods, such as BLAST26 for homology search, functional domain prediction using HMMER27, phylogenetic analysis, and filtering techniques to available genomic and transcriptomic data of Myxosporea, Malacosporea, Polypodiozoa, and free-living Cnidaria to establish the evolutionary gain and loss of their apoptosis-related genes. We show that the SCANDAL hypothesis for Myxosporea origin is weakened by the observation that Malacosporea also lost many genes involved in apoptosis regulation, although to a lesser extent. Our analysis reveals and describes in detail the gradual process of apoptosis-related gene loss in Myxozoa that appears to have started in the common ancestor of both Myxosporea and Malacosporea subclasses.
Results
Presence of genes coding key apoptotic proteins in parasitic Cnidaria
Comparison of cnidarian species reveals that the transition to parasitism is associated with gradual loss of key apoptotic factors in the parasitic classes Polypodiozoa and Malacosporea, up to a complete reduction of almost all pathway components in Myxosporea (Fig. 1 and Table 1). We chose a ‘standard’ mammalian-like apoptotic pathway as a reference, because comparative genomic studies find many similarities between it and the predicted apoptotic pathway of free-living Cnidaria such as Hydra9.
The free-living Cnidaria Hydra vulgaris was previously known to share all key components of both the intrinsic and extrinsic apoptotic pathways (Table 1). We found that the parasitic Cnidaria Polypodium hydriforme lost the main components of the extrinsic apoptotic pathway: it completely lacks the Death domain required for the extrinsic receptor and adaptor proteins as well as the Death-effector domain required for the extrinsic initiator caspase. However, it maintains most of the intrinsic apoptotic pathway components with the exception of BH3-only Bcl-2 family members. Also, we were able to detect only one predicted intrinsic pathway initiator (CARD domain containing) caspase and one executioner caspase compared to two and eleven found in Hydra. Malacosporea have undergone further loss of apoptosis-related proteins including p53, all Bcl-2 family members, APAF-1, CARD domain, and DEATH domain containing proteins (see Supplementary tables, Supplementary table S2, for full list of domains). One caspase without a CARD domain was found in Buddenbrockia. Myxosporea universally lack all main actors of apoptosis except cytochrome C, whose gene is however absent in Myxobolus squamalis and Henneguya salminicola, and is a pseudogene with multiple inner stop-codons in Kudoa iwatai, Sphaeromyxa zaharoni, and Enteromyxum leei. Henneguya salminicola apparently entirely lacks a mitochondrial genome24. Detailed findings and a comparison with other animals are presented in Table 1.
This pattern of apoptotic-protein loss among parasitic Cnidaria is consistent with gradual loss-of-function in contrast to previous findings suggesting catastrophic loss of apoptotic and cancer-related proteins in Myxosporea13. While this study focuses only on cnidarian apoptosis, we observe a gradual decline in the number of cancer-related PFAM domains in order from free-living Cnidaria to Polypodium to Malacosporea to Myxosporea (see Supplementary tables, Supplementary table S1).
Number of caspases gradually diminishes in parasitic Cnidaria
Caspases are proteases that play a crucial role in both intrinsic and extrinsic apoptotic pathways. Our results suggest the complete absence of caspases not only in Myxosporea, but also in one of two analyzed members of Malacosporea (Tetracapsuloides bryosalmonae). The other Malacosporea used in our analysis (Buddenbrockia plumatellae) has only one caspase (Fig. 2) in contrast to the free-living Hydra that has fifteen7. Based on domain composition the single Buddenbrockia caspase is an executioner caspase and not an initiator (CARD-domain containing) caspase, thus insufficient to trigger either of the apoptotic pathways. In contrast Polypodium has both an initiator and an executioner caspase, allowing for the classical intrinsic apoptotic pathway to work. Based on phylogenetic analysis (see Supplementary materials, Fig. S3 for phylogenetic tree) we can conclude that the CARD domain containing caspase from Polypodium is homologous to the CARD domain containing caspases of Hydra HyCARDCasp-1 and HyCARDCasp-2. And the only identified executioner caspase in Polypodium is unlikely to be orthologous to the one found in Buddenbrockia. The loss of extrinsic apoptotic pathways in Malacosporea, Myxosporea, and Polypodium can be predicted from the lack of DEATH receptors, adaptor proteins, and Death-effector domain containing initiator caspases.
In contrast to Polypodium, Myxozoa likely lack Bcl-2 family members
The Bcl-2 family plays an important role in regulating the intrinsic apoptotic pathway. Some Bcl-2 family members are proapoptotic (Bak and Bok proteins in humans) and others are anti-apoptotic (human Bcl-2)33. Our analysis suggests that Myxozoa lack any Bcl-2 protein members, consistent with the predicted lack of intrinsic apoptotic pathways. In Polypodium we predicted five Bcl-2 protein family members and attempted to infer their pro- or anti-apoptotic roles based on similarity with known human and Hydra Bcl-2 family protein members (Fig. 3, see Supplementary materials, Fig. S1 for phylogenetic tree). One Polypodium Bcl-2 protein was predicted as a pro-apoptotic Bok homologue by InterProScan34. The separation between Bcl-2 and Bak proteins is based on phylogenetic analysis (see Supplementary materials, Fig. S1). Overall, the results favor the existence of an intrinsic apoptotic pathway in Polypodium and its lack in Myxozoa.
Cytochrome C is likely lost in some myxosporeans
Predicted genes for cytochrome C were found in Polypodium, Malacosporea (both Tetracapsuloides and Buddenbrockia), and Myxosporea. However, in Myxosporea the situation with cytochrome C is rather complex. In Myxobolus pronini, M. honghuensis, and Thelohanellus kitauei the predicted cytochrome C proteins share high similarity with each other. Although, cytochrome C could not be found in a published Myxobolus cerebralis transcriptome (GBKL00000000.1). Also, we were unable to identify any potential genes of cytochrome C in the available genomes and transcriptomes of Henneguya salminicola and Myxobolus squamalis. Note that Henneguya salminicola was previously shown to lack a mitochondrial genome so this is a special case24.
While we were able to find cytochrome C homologs in Sphaeromyxa zaharoni, Enteromyxum leei, and Kudoa iwatai, we found numerous stop codons and insertions within their predicted structural Cytochrome C domains. Enteromyxum leei and Kudoa iwatai have stop codons only within the above-mentioned insertions, whereas Sphaeromyxa zaharoni also has stop codons outside of them, within a highly conserved 69 aa region (Fig. 4).
Annotated mitochondrial genomes are available for myxosporeans Enteromyxum leei35 and Kudoa iwatai36. Despite the presence of stop-codons in their cytochrome C genes, genes of core subunits of complex IV and cytochrome b (related to complex III) are present in their mitochondrial genomes. They also possess the Pfam domains of respiratory subunits of complex III: PF00355 (Rieske) and PF02167 (Cytochrome C1). The latter statement is also true for Sphaeromyxa zaharoni, the third myxosporean with stop-codons in the cytochrome C gene. This suggests that the truncated cytochromes C of myxosporeans, resulting from these nonsense mutations, might still maintain their function in the respiratory chain.
Not only Myxosporea, but also Malacosporea lack calpains, APAF-1, IAPs, and p53
Calpains are a family of proteases, some members of which are involved in caspase activation in response to calcium release from the endoplasmic reticulum37. The caspase activation is performed via the Peptidase_C2 and Calpain III peptidase domains, while the EF-hand domain is capable of binding calcium38. We were unable to find any calpain family members in Myxozoa, however, we found 4 putative calpains in the parasitic cnidarian Polypodium (Fig. 5). The domain structure of two of those calpains (presence of both peptidase domains and EF-hand domains) suggests their role in apoptosis. We refer to them as “classical” calpains as described in Croall and Ersfeld39, and in Luo et al.40. The EF-hand domain is important for calcium binding, whereas the Peptidase_C2 and Calpain III domains are homologous to peptidase domains capable of caspase activation.
APAF-1 or apoptotic protease activating factor 1 is essential for forming the apoptosome when bound to cytochrome C, released from the mitochondria. Among studied parasitic Cnidaria only Polypodium has an APAF-1 homologue that we could identify. This homologue has the WD40 domain necessary for cytochrome C binding and the CARD domain for apoptosome formation (Fig. 6). This further suggests that Polypodium has functional apoptosis, while Myxosporea and Malacosporea do not.
IAPs are inhibitors of apoptosis. Among studied parasitic Cnidaria only Polypodium has an IAP. However, this predicted protein lacks the XIAP/BIRC8_UBA domain found in its human homolog and has a shorter BIR_rpt domain which binds Zn2+ and then binds the active site of caspases inhibiting their catalytic activity (Fig. 6). p53 is a transcription factor that is activated in response to a number of stressors including DNA damage and increases the expression of a number of pro-apoptotic genes (including APAF-1, Bak, Bax)10. The only parasitic Cnidaria in our analysis with p53 is Polypodium. However, the Polypodium p53 homolog is truncated and lacks some of the domains present in the human p53 (Fig. 6). It lacks the p53_TAD2 and p53_tetr domains which are involved in p53 transactivation and tetramerization respectively. The p53 DNA binding domain is present in Polypodium.
Limitations
Our analysis relies on both gene presence and gene absence. The near-completeness of malacosporean transcriptomes is supported by the observation that almost all Pfam domains found in myxosporean genomes are also present in malacosporean transcriptomes (see Supplementary tables, Supplementary table S3).
Some genes can still be absent due to data incompleteness: when a gene is missing from the data, it might be due to insufficient read coverage. However, when this is the case, we expect different genes to be lost in independently sequenced related animals. For each species of Myxosporea we observe roughly the same set of lost apoptosis-related genes. A similar picture was observed for cancer-related gene loss of Myxosporean species as reported previously13.
Conclusions
We evaluated the presence of core apoptotic proteins in free-living and parasitic Cnidaria. We observe a gradual loss of apoptosis-related proteins, starting with Polypodium who lost the major components of the extrinsic apoptotic pathway, followed by Myxozoa who lost the intrinsic apoptotic pathway as well. Compared to Myxosporea, Malacosporea retained one potentially functional caspase homolog (in Buddenbrockia). While both studied malacosporeans retained cytochrome C, the latter is missing in myxosporeans Myxobolus squamalis, Henneguya salminicola and its gene contains multiple stop-codons in Kudoa iwatai, M. honghuensis, and Sphaeromyxa zaharoni. However, the presence of other respiratory chain proteins suggests that the genes might be producing truncated, yet possibly functional cytochrome C proteins. Thus, Cnidaria present us with a unique opportunity to observe different stages of gradual loss of apoptosis. The observation that Malacosporea lost many genes involved in apoptosis regulation, weakens the previously proposed SCANDAL hypothesis that suggested a transmissible cancer origin for Myxosporea catastrophic simplification13.
Previously Panchin et al.13, did not observe an obvious trend for the loss of apoptosis and cancer-related genes in a subsample of parasitic Metazoa. A recent study of apoptosis in parasitic Schistosoma41 revealed the presence of most proteins typically involved in apoptosis. This makes the case of extreme loss of apoptosis-related proteins in Myxosporea and Malacosporea rather unique and not a necessary consequence of their parasitic life-style.
Materials and methods
Cnidaria genomes and transcriptomes
We used genomic data for Myxobolus squamalis GCA_010108815.1, Henneguya salminicola GCA_009887335.1, Enteromyxum leei GCA_001455295.2, Sphaeromyxa zaharoni GCA_001455285.1, Kudoa iwatai GCA_001407235.2, Thelohanellus kitauei GCA_000827895.1, Myxobolus honghuensis GWHBFXL00000000, Hydra vulgaris GCA_000004095.1, Clytia hemisphaerica GCA_902728285.1, Nematostella vectensis GCA_000209225, Dendronephthya gigantea GCA_004324835.1, Acropora digitifera GCF_000222465.1, and Aurelia aurita GCA_004194415.1. We additionally sequenced and assembled the genomes of Polypodium hydriforme and Myxobolus pronini (see Materials and methods, Sequence assembly). Assembly quality reports are provided in supplementary materials (Supplementary table S4). Transcriptomic data were available for Tetracapsuloides bryosalmonae, Buddenbrockia plumatellae, and Polypodium hydriforme. For Tetracapsuloides bryosalmonae we used a set of transcriptomic data, namely ready-made T. bryosalmonae assemblies from Kumar et al., 2021 (BioProject: PRJNA680464) and Faber et al.22 (BioProject: PRJEB19471), and T. bryosalmonae paired-end reads from Faber et al.22, Kumar et al.23, and Ahmad et al.21 (BioProject: PRJNA668017) which we assembled using SPAdes42 with k-mer lengths of 33, 49, and 93 (Table 2).
One of the approaches to filter contaminations of transcriptomic data of a parasitic animal is to compare its sequences from two hosts (as was done in Faber et al.22) or compare sequences from an infected and non-infected host of the same species. We created three filtered datasets using these approaches (Table 2). In two cases we used the intersection of RNA reads of T. bryosalmonae from a fish host and T. bryosalmonae assembly from a bryozoan host. In one case we used RNA reads from infected by T. bryosalmonae kidney of Salmo trutta and filtered out reads mapping to S. trutta reference genome. Reads were mapped on assemblies using Bwa mem algorithm43. After filtering, predicted T. bryosalmonae reads were assembled using SPAdes. SPAdes tool was used with k-mer lengths of 33, 49, and 93. All .sam and .bam files were processed with samtools package44.
Exclusion of host contaminations in Myxozoan genomes and transcriptomes
In the eight available genomes of myxosporean species (Myxobolus squamalis GCA_010108815.1, Henneguya salminicola GCA_009887335.1, Enteromyxum leei GCA_001455295.2, Sphaeromyxa zaharoni GCA_001455285.1, Kudoa iwatai GCA_001407235.2, Thelohanellus kitauei GCA_000827895.1, Myxobolus pronini (see Materials and methods, Sequence assembly), Myxobolus honghuensis GWHBFXL00000000) we searched for possible host contaminations using an original Python script (see supplementary script genome_filter.py). Each contig from the genomes was used as a query for blastn-searches26 against a BLAST database built from the host genome. In the case of Myxosporea we used the publicly available genome of either the exact fish host or its closest available relative from Genbank (see Supplementary materials, List S1). If the best hit demonstrated > 85% identity and e-value < 1-e75, the query contig was excluded from the final assembly.
Another original python script (see supplementary script comparison.py) was used to remove host-derived contaminating sequences from the transcriptomes of malacosporeans B. plumatellae and T. bryosalmonae. First, we created two BLAST databases. The first consisted of several cnidarian transcriptomes, namely of Aurelia aurita, Polypodium hydriforme, Hydra vulgaris, Pocillopora acuta, Myxobolus squamalis, Myxidium lieberkuehni, and malacosporean transcriptomes that were obtained from hosts other than those which are being filtered. The second database consisted of a wide range of non-cnidarian organisms that may be regarded as additional probable contaminants. It was shown that possible contaminants include algae and cyanobacteria23, so we included Tetraselmis striata and Nodularia spumigena as representatives of these taxa. We also included the genome of bryozoan Cristatella mucedo, the transciptome of bryozoan Fredericella sultana, a transcriptome of another lophophorate—Lingula anatina, and also transcriptomes of Drosophila melanogaster, Ruditapes philippinarium, Trichinella pseudospiralis, Salmo trutta, Danio rerio, Cricetulus griseus. We found contaminations likely derived from these taxa when we manually checked BLAST search results of Malacosporean contigs. Such contaminations may sometimes occur when the same lab equipment is used for various sequencing projects or because bryozoans are filtrators and are susceptible to environmental contaminations.
Databases were created using translated ORFs obtained from the assemblies with the TransDecoder tool45, using a minimal ORF length of 70 amino acids. Each contig from malacosporean assemblies was searched against each of the two BLAST databases using blastp from the BLAST +26 tool package. Contigs were divided into three groups (“cnidaria”, “contamination” and “not clear”) based on e-value and bit score differences between the best hits against cnidarian and contaminant blast databases. If the e-value of the best cnidarian hit was at least 100 × less and its bit score at least 1.5 × more than that of the best hit from the contaminant dataset, this contig was considered as “cnidaria”. If the opposite was true, the contig was flagged as “contamination”. Otherwise, it was placed in the “not clear” group and subjected to further GC-content analysis (see below).
It was reported that Myxozoa have relatively lower GC-content compared to other Metazoa46. To visualize GC-content of each group of contigs we built a scatter plot with the length of contig and the GC-content on the Y- and X-axis respectively for “cnidaria”, “contamination”, and “not clear” groups of contigs. In the “cnidaria” group of contigs we obtained a set bounded by a bell curve with the average GC-content at 27–30% depending on the particular organism. The “contamination” group of contigs were placed as a flat bell-bounded set with average GC-content of 55%. In the “not clear” group we observed a distribution of GC-content indicating the presence of both “cnidaria” and “contamination” contigs (see Supplementary materials, Fig. S2) We used the following GC-content cutoffs: 39.03% for B. plumatellae, 40.80% for T. bryoslamonae assembly from fish kidney, 36.92% for T. bryosalmonae from bryozoan host to include the “not clear” contigs into the final “cnidaria” set of contigs.
Manual checks of potential contamination sources were conducted using BLAST searches and phylogenetic tree construction using IQ-TREE47,48 with further visualization in iTOL49.
Identification of PFAM domains
We used the hmmscan tool from HMMER27 version 3.3.2 and the PFAM-A database (version 34.0)50 to identify PFAM domains in TransDecoder-predicted ORFs within the filtered contigs. This was done for all filtered Myxozoan genomes and transcriptomes, the Polypodium hydriforme genome, and several genomes of free-living Cnidaria genomes from the NCBI Genbank, namely Acropora digitifera GCF_000222465.1, Aurelia aurita GCA_004194415.1, Clytia hemisphaerica GCA_902728285.1, Dendronephthya gigantea GCA_004324835.1, Hydra vulgaris GCA_000004095.1, Nematostella vectensis GCA_000209225.
Search for apoptosis-related domains and proteins
To elucidate the presence of core apoptotic proteins as functional multidomain units we conducted additional BLAST searches. Similar to Lasi et al.7, we adopted the list of proteins that are most important for apoptosis, namely caspase homologs, Bcl-2-family proteins, Apoptotic protease activating factor 1 (APAF-1), tumor necrosis factor receptor (TNF-R), inhibitor of apoptosis proteins (IAPs), Fas-associated protein with death domain (FADD), calpains, p53, and cytochrome C. For each protein we collected several reference cnidarian protein sequences from UniProt and NCBI Genbank, Hydra sequences from Lasi et al.7, were also included (see Supplementary materials, List S2, for list of accessions).
We performed blastp searches for each of the reference proteins against malacosporean (filtered and unfiltered B. plumatellae, T. bryosalmonae from invertebrate host, T. bryosalmonae from fish host with further manual analysis), Polypodium hydriforme genome and transcriptome, and also against filtered myxosporean genomes. Domain structures of identified hits were analyzed using the InterProScan tool34 with InterPro database51. Multiple alignments were built with the Muscle tool52 and visualized in the Jalview application53.
Sample collection, DNA isolation
Specimens of Carassius auratus gibelio were caught in a pond near Barguzin River (53°69'N, 109°80'E) (Lake Baikal basin, Russia) in summer 2013. Round cysts of Myxobolus pronini were dissected from the body of fish and preserved in RNAlater for further analysis.
Total DNA was isolated from tissue samples by Diatom DNA Prep (IsoGene). One paired-end and two mate-pair libraries were prepared for Myxobolus pronini using the Nextera library preparation protocol (Illumina). The mate-pair libraries were prepared with the estimated mean insert lengths of 3500 bp and 7000 bp. The libraries were sequenced using an Illumina HiSeq 2000 instrument, generating 163 M 100 bp read pairs for the paired-end library, 31 M read pairs for the 3500 bp mate-pair library and 38 M read pairs for the 7000 bp mate-pair library.
Specimens of Acipenser gueldenstaedtii were caught by A.A. Gerasimov in the Volga River delta near the fall of the river into the Caspian Sea (region of city of Astrakhan) (46°16′N, 47°57′E) in 2013. This was done by A.A. Gerasimov as part of an official fish conservation practice. This practice involves catching a small number of fish, extraction of their oocytes for further breeding, followed by the release of fry back into their wild habitat. Stolons of Polypodium hydriforme were surgically isolated from infected oocytes of Acipenser gueldenstaedtii host, washed, and preserved in RNAlater for further analysis.
Total DNA was isolated from stolon samples by Diatom DNA Prep (IsoGene). One paired-end library was prepared for using the TruSeq library preparation protocol (Illumina). The Polypodium library was sequenced using an Illumina HiSeq 2500 instrument, generating 260 M read pairs with the read length of 250 bp.
Sequence assembly
The sequencing reads for Myxobolus pronini and Polypodium hydriforme were trimmed using Trimmomatic54 to remove the adapters. Prior to assembly, the Myxobolus pronini reads were processed using the error correction utility of the ALLPATHS-LG program55. The assembly for Myxobolus pronini data was performed using Velvet56 with a k-mer size of 55. Scaffolds obtained with Velvet were then processed using the Redundans pipeline57 with an identity threshold of 0.85 to obtain the final assembly draft. The assembly of Polypodium data was performed using SPAdes42 with three k-mer sizes: 77, 99, 127. The assemblies were evaluated using QUAST58.
Data availability
The datasets supporting the conclusions of this article are included within the article (and its additional files). The sequenced genomic data are available under the following accession numbers: Myxobolus pronini: NCBI BioProject: PRJNA957449, NCBI SRA: SRR24216800, SRR24216801, SRR24216802, SRR24216803, SRR24216804; Polypodium hydriforme: NCBI BioProject: PRJNA957841, NCBI SRA: SRR24234439. The sequences derived from sequencing data are available in the NCBI GenBank repository under the following accession numbers: WEC89223, WEC89224, WEC89225. The fasta-file with all sequences predicted and described in this article (including sequences derived from publicly available sequencing data) is accessible in the Figshare repository under the following link: https://figshare.com/articles/dataset/Apoptotic_gene_loss_in_Cnidaria_is_associated_with_transition_to_parasitism/21719930.
References
Elmore, S. Apoptosis: A review of programmed cell death. Toxicol. Pathol. 35, 495–516 (2007).
Shi, Y. Caspase activation, inhibition, and reactivation: A mechanistic view. Protein Sci. 13, 1979–1987 (2004).
Alberts, B., Johnson, A. & Lewis, J. Molecular Biology of the Cell (Garland Science, 2002).
Kim, R. Unknotting the roles of Bcl-2 and Bcl-xL in cell death. Biochem. Biophys. Res. Commun. 333, 336–343 (2005).
Li, Y. et al. Conservation and divergence of mitochondrial apoptosis pathway in the Pacific oyster, Crassostrea gigas. Cell Death Dis. 8, e2915–e2915 (2017).
Arvanitis, M., Li, D.-D., Lee, K. & Mylonakis, E. Apoptosis in C. elegans: Lessons for cancer and immunity. Front. Cell. Infect. Microbiol. 3, 67 (2013).
Lasi, M., David, C. N. & Böttger, A. Apoptosis in pre-Bilaterians: Hydra as a model. Apoptosis 15, 269–278 (2010).
Domazet-Lošo, T. et al. Naturally occurring tumours in the basal metazoan Hydra. Nat. Commun. 5, 4222 (2014).
Krasovec, G., Quéinnec, É. & Chambon, J.-P. Intrinsic apoptosis is evolutionarily divergent among metazoans. bioRxiv https://doi.org/10.1101/2021.12.21.473695 (2021).
Aubrey, B. J., Kelly, G. L., Janic, A., Herold, M. J. & Strasser, A. How does p53 induce apoptosis and how does this relate to p53-mediated tumour suppression?. Cell Death Differ. 25, 104–113 (2018).
Logue, S. E. & Martin, S. J. Caspase activation cascades in apoptosis. Biochem. Soc. Trans. 36, 1–9 (2008).
Silke, J. & Meier, P. Inhibitor of apoptosis (IAP) proteins-modulators of cell death and inflammation. Cold Spring Harb. Perspect. Biol. 5, a008730–a008730 (2013).
Panchin, A. Y., Aleoshin, V. V. & Panchin, Y. V. From tumors to species: A SCANDAL hypothesis. Biol. Direct 14, 3 (2019).
Canning, E. U., Curry, A., Hill, S. L. L. & Okamura, B. Ultrastructure of Buddenbrockia allmani n. sp. (Myxozoa, Malacosporea), a parasite of Lophopus crystallinus (Bryozoa, Phylactolaemata). J. Eukaryot. Microbiol. 54, 247–262 (2007).
Sudhagar, A., Kumar, G. & El-Matbouli, M. The malacosporean myxozoan parasite Tetracapsuloides bryosalmonae: A threat to wild salmonids. Pathogens 9, 16 (2019).
Gruhl, A. & Okamura, B. Development and myogenesis of the vermiform Buddenbrockia (Myxozoa) and implications for cnidarian body plan evolution. EvoDevo 3, 10 (2012).
Naldoni, J., Adriano, E. A., Hartigan, A., Sayer, C. & Okamura, B. Malacosporean myxozoans exploit a diversity of fish hosts. Parasitology 146, 968–978 (2019).
Okamura, B., Hartikainen, H., Schmidt-Posthaus, H. & Wahli, T. Life cycle complexity, environmental change and the emerging status of salmonid proliferative kidney disease. Freshw. Biol. 56, 735–753 (2011).
Patra, S., Hartigan, A., Morris, D. J., Kodádková, A. & Holzer, A. S. Description and experimental transmission of Tetracapsuloides vermiformis n. sp. (Cnidaria: Myxozoa) and guidelines for describing malacosporean species including reinstatement of Buddenbrockia bryozoides n. comb. (syn. Tetracapsula bryozoides). Parasitology 144, 497–511 (2017).
Hartigan, A. et al. Recruitment of toxin-like proteins with ancestral venom function supports endoparasitic lifestyles of Myxozoa. PeerJ 9, e11208 (2021).
Ahmad, F. et al. Know your enemy - transcriptome of myxozoan Tetracapsuloides bryosalmonae reveals potential drug targets against proliferative kidney disease in salmonids. Parasitology 148, 726–739 (2021).
Faber, M. et al. Comparative transcriptomics and host-specific parasite gene expression profiles inform on drivers of proliferative kidney disease. Sci. Rep. 11, 2149 (2021).
Kumar, G., Ertl, R., Nilsen, F., Bartholomew, J. L. & El-Matbouli, M. Data of de novo transcriptome assembly of the myxozoan parasite Tetracapsuloides bryosalmonae. Data Brief 35, 106831 (2021).
Yahalomi, D. et al. A cnidarian parasite of salmon (Myxozoa: Henneguya) lacks a mitochondrial genome. Proc. Natl. Acad. Sci. U. S. A. 117, 5358–5363 (2020).
Guo, Q. et al. A myxozoan genome reveals mosaic evolution in a parasitic cnidarian. BMC Biol. 20, 51 (2022).
Camacho, C. et al. BLAST+: Architecture and applications. BMC Bioinf. 10, 421 (2009).
Eddy, S. R. Accelerated profile HMM searches. PLoS Comput. Biol. 7, e1002195 (2011).
Igaki, T. & Miura, M. Role of Bcl-2 family members in invertebrates. Biochim. Biophys. Acta 1644, 73–81 (2004).
Tan, F. J., Fire, A. Z. & Hill, R. B. Regulation of apoptosis by C. elegans CED-9 in the absence of the C-terminal transmembrane domain. Cell Death Differ. 14, 1925–1935 (2007).
Denning, D. P., Hatch, V. & Horvitz, H. R. Both the caspase CSP-1 and a caspase-independent pathway promote programmed cell death in parallel to the canonical pathway for apoptosis in Caenorhabditis elegans. PLoS Genet. 9, e1003341 (2013).
Chen, Y.-Z., Mapes, J., Lee, E.-S., Skeen-Gaar, R. R. & Xue, D. Caspase-mediated activation of Caenorhabditis elegans CED-8 promotes apoptosis and phosphatidylserine externalization. Nat. Commun. 4, 2726 (2013).
Strasser, A. & Vaux, D. L. Viewing BCL2 and cell death control from an evolutionary perspective. Cell Death Differ. 25, 13–20 (2018).
Kale, J., Osterlund, E. J. & Andrews, D. W. BCL-2 family proteins: Changing partners in the dance towards death. Cell Death Differ. 25, 65–80 (2018).
Jones, P. et al. InterProScan 5: Genome-scale protein function classification. Bioinformatics 30, 1236–1240 (2014).
Yahalomi, D. et al. The multipartite mitochondrial genome of Enteromyxum leei (Myxozoa): Eight fast-evolving megacircles. Mol. Biol. Evol. 34, 1551–1556 (2017).
Takeuchi, F. et al. The mitochondrial genomes of a myxozoan genus Kudoa are extremely divergent in Metazoa. PLoS ONE 10, e0132030 (2015).
Momeni, H. R. Role of calpain in apoptosis. Cell J. 13, 65–72 (2011).
Harwood, S. M., Yaqoob, M. M. & Allen, D. A. Caspase and calpain function in cell death: Bridging the gap between apoptosis and necrosis. Ann. Clin. Biochem. 42, 415–431 (2005).
Croall, D. E. & Ersfeld, K. The calpains: Modular designs and functional diversity. Genome Biol. 8, 218 (2007).
Luo, Y., Sellitti, D. F. & Suzuki, K. The Calpain Proteolytic System. in Encyclopedia of Cell Biology 670–680 (Elsevier, 2016).
Han, H. et al. Apoptosis phenomenon in the schistosomulum and adult worm life cycle stages of Schistosoma japonicum. Parasitol. Int. 62, 100–108 (2013).
Prjibelski, A., Antipov, D., Meleshko, D., Lapidus, A. & Korobeynikov, A. Using SPAdes DE novo assembler. Curr. Protoc. Bioinformatics 70, e102 (2020).
Li, H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. (2013) https://doi.org/10.48550/ARXIV.1303.3997.
Danecek, P. et al. Twelve years of SAMtools and BCFtools. Gigascience 10, giab008 (2021).
Haas, B. J. et al. De novo transcript sequence reconstruction from RNA-seq using the Trinity platform for reference generation and analysis. Nat. Protoc. 8, 1494–1512 (2013).
Alama-Bermejo, G. & Holzer, A. S. Advances and discoveries in myxozoan genomics. Trends Parasitol. 37, 552–568 (2021).
Kalyaanamoorthy, S., Minh, B. Q., Wong, T. K. F., von Haeseler, A. & Jermiin, L. S. ModelFinder: Fast model selection for accurate phylogenetic estimates. Nat. Methods 14, 587–589 (2017).
Nguyen, B. T. et al. Respond. Am. J. Public Health 105, e2 (2015).
Letunic, I. & Bork, P. Interactive Tree Of Life (iTOL) v5: An online tool for phylogenetic tree display and annotation. Nucleic Acids Res. 49, W293–W296 (2021).
Mistry, J. et al. Pfam: The protein families database in 2021. Nucleic Acids Res. 49, D412–D419 (2021).
Blum, M. et al. The InterPro protein families and domains database: 20 years on. Nucleic Acids Res. 49, D344–D354 (2021).
Edgar, R. C. MUSCLE: Multiple sequence alignment with high accuracy and high throughput. Nucleic Acids Res. 32, 1792–1797 (2004).
Waterhouse, A. M., Procter, J. B., Martin, D. M. A., Clamp, M. & Barton, G. J. Jalview Version 2–a multiple sequence alignment editor and analysis workbench. Bioinformatics 25, 1189–1191 (2009).
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Gnerre, S. et al. High-quality draft assemblies of mammalian genomes from massively parallel sequence data. Proc. Natl. Acad. Sci. U. S. A. 108, 1513–1518 (2011).
Zerbino, D. R. & Birney, E. Velvet: Algorithms for de novo short read assembly using de Bruijn graphs. Genome Res. 18, 821–829 (2008).
Pryszcz, L. P. & Gabaldón, T. Redundans: An assembly pipeline for highly heterozygous genomes. Nucleic Acids Res. 44, e113 (2016).
Gurevich, A., Saveliev, V., Vyahhi, N. & Tesler, G. QUAST: quality assessment tool for genome assemblies. Bioinformatics 29, 1072–1075 (2013).
Acknowledgements
We thank A.A. Gerasimov for providing the Polypodium samples.
Funding
This work was supported by Russian Science Foundation Grant [19–74-20147] and project of the Russian Academy of Science (No. AAAA-A17-117011810039–4).
Author information
Authors and Affiliations
Contributions
A.M.N., A.Y.P., and Y.V.P. conceived the study. M.D.B., V.V.A., Y.V.P. performed sample collection. K.V.M. performed sequencing and sequence assembly. A.M.N. and A.Y.P. prepared the draft manuscript. A.M.N., A.Y.P., Y.V.P, and K.V.M. analyzed the data. Y.V.P. and V.V.A. supervised the project. All authors read, edited, and approved the final manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interest.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Neverov, A.M., Panchin, A.Y., Mikhailov, K.V. et al. Apoptotic gene loss in Cnidaria is associated with transition to parasitism. Sci Rep 13, 8015 (2023). https://doi.org/10.1038/s41598-023-34248-y
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-34248-y
- Springer Nature Limited