Introduction

Animal coloration has long been a focus of research in genetics, animal behavior, and evolutionary biology because it is a highly variable trait that is often shaped by natural and sexual selection (Endler and Mappes 2017). Colors are frequently used in communication (e.g. as sexual ornaments; (Hill et al. 2006; Caro and Allen 2017), predator deterrence (e.g. in aposematic signals; Ruxton et al. 2004), thermal regulation (Trullas et al. 2007), and various other ecological functions. In some species, two or more distinct, genetically-determined phenotypes exist in different geographic localities (i.e. color polytypism; (Huxley 1955; Endler 1980). Such polytypic species have the potential to shed light on fundamental processes including local adaptation, the maintenance of genetic variation, and speciation (Gray and McKinnon 2007; Bolton et al. 2015). To understand the role of animal coloration in organismal biology, we need to examine not only the selective forces acting on color traits, but also their genetic and developmental underpinnings. Unfortunately, our understanding of the genetics responsible for the diversity in animal coloration in non-model organisms is still in its infancy (Shawkey and D’Alba 2017). However, this is rapidly changing with the advent of newer sequencing technologies and computational approaches that make elucidating underlying genetic mechanisms much more tractable in non-model organisms (Price-Waldman and Stoddard 2021; Funk et al. 2018).

The poison frogs (family Dendrobatidae) comprise a taxon with high inter- and intra-specific variation in coloration (Brown et al. 2011). Adult coloration in poison frogs is shaped by several different evolutionary forces, including natural selection, sexual selection, and stochastic processes. Natural selection has produced several genera that display bright aposematic coloration that is used to warn predators of the presence of toxic alkaloids in their skin (Darst et al. 2006; Saporito et al. 2007; Noonan and Comeault 2009; Stuckert et al. 2014a, b). Sexual selection has also contributed to shaping coloration and the development of color polymorphism in some poison frogs. For example in O. pumilio, a species with several color morphs, females show color-based mate preferences for the most common local color morph (Reynolds and Fitzpatrick 2007; Maan and Cummings 2008, 2012; Yusan Yang et al. 2016) and males are also more aggressive in their territorial interactions toward other males bearing that color (Yang et al. 2018, 2019). Genetic drift has also likely played a role in shaping the color diversity of poison frogs. For example, evidence indicates drift is a primary driver of divergent color morphs of O. pumilio seen on different islands (Tazzyman and Iwasa 2010; Gehara et al. 2013). Additionally, work examining selection based on color between morphs of Dendrobates tinctorius has also revealed a pattern of neutral evolution consistent with drift (Wollenberg et al. 2008). Similarly, evidence indicates that variation in warning signals can persist even when warning signals are weak when there is limited gene flow (Lawrence et al. 2019). Thus, the evolutionary forces shaping poison frog coloration appear to be varied and complex, and we are just now beginning to understand the details of how color variation evolved in many species.

Recent genomic and transcriptomic studies have provided insight into the genetic mechanisms underlying color variation. Twomey et al. (2020a; b) examined a red/yellow polymorphism in the Sira poison frog, Ranitomeya sirensis, using pigment analysis and RNA sequencing of adult individuals. The authors attribute color differences between the two morphs to varying expression levels of carotenoid-modifying enzymes as well as a loss of function mutation in the gene Beta-Carotene Oxygenase 2 (bco2). Rodriguez et al. (2020) examined the genetic basis of coloration in O. pumilio by performing a differential gene expression analysis of red, green, and blue adult phenotypes. In that study, the authors observe differential expression across numerous functional gene categories, including carotenoid synthesis, melanin synthesis, pteridine synthesis, structural coloration, and pigment cell development. Most recently, several causative genes underlying color and pattern elements between color morphs were identified in the mimic poison frog Ranitomeya imitator. In this species, the color morphs appear to have evolved via selective sweeps on polygenic variation (Linderoth et al. 2023). Their findings highlight the complexity of the genetic mechanisms producing bright, aposematic coloration in amphibians and underscore the need for further coloration research in poison frogs.

A particularly understudied area of animal coloration is how integument color develops in animals with larval stages. There is a growing body of literature on the genetics of pattern formation in Danio fishes (Huang et al. 2021), made possible by decades of work and excellent genomic resources. However, while the morphogenesis, migration, and interactions among pigment cells during amphibian larval stages are described for a handful of anurans (Goldberg et al. 2019; Thibaudeau and Altig 2012), the genetics of these processes are almost entirely unknown. Stuckert et al. (2019) examined expression differences among color morphs in Dendrobates auratus using skin samples from final-stage larva (Gosner stages 41–43), providing candidate genes involved in melanin and iridophore pathways. Work examining gene expression across the development of mimetic Ranitomeya suggested similar pathways at play, but intriguingly suggested that even congeneric species which have evolved mimetic phenotypes may use different gene expression patterns throughout development to produce these convergent phenotypes (Stuckert et al. 2021). However, a limitation of each of these studies is that they used skin samples that included multiple colors which can muddle gene expression signals. Thus our expectations for how gene expression of color and pattern genes occur during development remains an unknown but important part of understanding color and pattern formation in amphibians.

Here we aim to fill a gap in our understanding of the development of coloration in amphibians by studying the development of coloration in the polytypic poison frog O. pumilio. Using whole-transcriptome sequences from pre- and post-metamorphic captive-bred individuals, we examine gene expression profiles across ontogenetic stages and compare green dorsal skin to yellow ventral skin. Frogs in this population exhibit a dark, forest green dorsum and bright yellow venter (Fig. 1C). In amphibians, yellow skin is likely produced by different pigments (e.g., pteridines, carotenoids) that are deposited within the xanthophores (Obika and Bagnara 1964). Green skin is primarily produced by (1) the orientation of structural elements in the iridophores, melanin deposited within melanophores absorbing some light, and possibly pigments in the xanthophores or (2) via solely the iridophores themselves (Bagnara et al. 1968). Green pigments have been identified from frogs (Taboada et al. 2020), but this is not a likely mechanism in poison frogs. Notably, in addition to possessing a green dorsum, frogs in this population display black spots on the dorsum. The size of these and how densely they are spread on the skin are both variable within the population. These are produced by the presence of melanophores and melanin deposited within them (Freeborn 2021).

Fig. 1
figure 1

Visual representation of the Isla Popa phenotype. A Representative images of sampling timepoints throughout development (referred to by Gosner stages, Gosner 1960). B Spectral reflectance measurements. C Images of the dorsum and venter of an adult

In this study, we identify putative color genes differentially expressed between all sampled timepoints and other genes expressed during some, but not all, developmental stages. We also conduct differential expression analyses between dorsal and ventral skin throughout the developmental series, followed by network and GO enrichment analyses. We identify a priori coloration genes differentially expressed between the dorsum and venter at any age and reveal novel insights about the production of coloration across development.

Methods

IACUC statements

This work was done in compliance with Institutional Animal Care and Use Committee protocols at University of Pittsburgh (No. 18103943).

Data and code availability

All read data are available in the European Nucleotide Archive under accession number PRJEB41509. All code for analyses are available at https://github.com/AdamStuckert/Oophaga_pumilio and Stuckert 2023.

Husbandry and sample collection

Animals used in this experiment were sampled from a pure-breeding line established from wild-caught individuals collected from Isla Popa in the Bocas del Toro province of Panama. Within this population, adult males and females are qualitatively similar in coloration: green with black spots dorsally and bright yellow ventrally (Summers et al. 2003; Maan and Cummings 2009); Fig. 1).

All animals were housed in the same walk-in environmental chamber (Darwin Chambers Company, St. Louis, MO) at the University of Pittsburgh per Dugas et al. (Dugas et al. 2013) throughout development. Adult frogs were fed with springtails (Collembola) and fruit flies (wingless Drosophila melanogaster) dusted with powdered supplement (Repashy calcium supplement Plus ICB, Repashy Ventures Inc., San Marcos, CA). The fruit flies were raised on a growth medium supplemented with carotenoids (see Dugas et al. 2013 for details). Females regularly provide tadpoles with unfertilized eggs until close to metamorphosis (~ 45 days) and each tadpole is fed approximately five times (typically one trophic egg per feed) before metamorphosis (Pröhl and Hödl 1999).

We collected dorsal and ventral skin tissue from adults and tadpoles at four developmental stages (hereafter Gosner stages, or GS; (Gosner 1960)) that were easily distinguishable: GS26 (shortly after hatching), GS37 (hindlimb mostly developed), GS42 (forelimbs emerged), and GS46 (tail fully resorbed). Tadpoles of O. pumilio appear uniformly brown/black in color until late in development (~ GS42, Fig. 2) when they begin to take on their adult coloration. We collected samples from five tadpoles per Gosner stage (GS26, GS37, GS42, GS46), and 15 adults for this study.

Fig. 2
figure 2

Principal components analysis of gene expression across samples. Age represents either adult frogs or is presented as the Gosner stage at which a tadpole was sampled. Location represents whether RNA was extracted from a small patch of skin on the dorsum (green on the frog) or venter (yellow on the frog). Axes are labeled with the proportion of the data that is explained by each principal component

Lab work

Adult frogs and tadpoles were euthanized by decapitation followed by double-pithing. The dorsal and ventral skins were dissected from the body and submerged immediately in vials containing ~ 1.5 ml of RNAlater (Ambion, Austin, Texas). The vials were then flash frozen in liquid nitrogen and stored at − 80 °C until shipment to the University of New Hampshire for RNA extraction. To extract RNA, skin patches were lysed in Trizol using a tissue homogenizer, treated with DNAse to remove any remaining genomic DNA, and total RNA was then extracted using a standardized Trizol protocol. mRNA libraries were prepared using standard poly-A tail purification, prepared using Illumina primers, and individually dual-barcoded using a New England Biolabs Ultra II Directional kit (E7760). Individually barcoded samples were pooled and sequenced on one lane of an Illumina HiSeq X at the Vincent J. Coates Genomics Sequencing Laboratory and Functional Genomics Laboratory at the University of California at Berkeley. Reads were paired-end and 150 base pairs in length.

Transcriptome assembly

We merged reads from dorsal and ventral samples from a single random individual per Gosner Stage to create a single paired-end dataset for transcriptome assembly. We then used seqtk’s “sample” function to subsample each species’ readset to 50 million read pairs (https://github.com/lh3/seqtk). We used this subsampled dataset to build a de novo transcriptome using the Oyster River Protocol version 2.2.7 (MacManes 2018) which automates the following processes. Initial error correction was done using RCorrector 1.01 (Song and Florea 2015), followed by quality trimming using Trimmomatic version 0.36 at a Phred score of ≤ 3 (Bolger et al. 2014) as aggressive quality trimming decreases assembly completeness (MacManes 2014). The Oyster River Protocol assembles a transcriptome by using a series of different transcriptome assemblers and also multiple kmer lengths, merging them into a single transcriptome. Assemblies were conducted using Trinity version 2.7.0 (Grabherr et al. 2011), Trans-ABySS (Robertson et al. 2010), and SPAdes assembler version 3.12.0 with a kmer length of 55 and a second time with a kmer length of 75 (Bankevich et al. 2012; Prjibelski et al. 2020). These individually built transcriptomes were then merged together using OrthoFuser (MacManes 2018). Unique contigs which were dropped in Orthofuser were recovered using a reciprocal blast search of the final assembly against the individual assemblies for unique contigs. We removed all contigs with expression lower than one Transcript Per Million (TPM) using the TPM = 1 flag in the ORP. Contigs that were dropped due to low expression but which likely represent expressed genes were recovered by blasting these against the Uniprot database (Bairoch and Apweiler 2000). Finally, transcriptome quality was assessed using BUSCO version 4 and the tetrapoda database “tetrapoda_odb10” (Simão et al. 2015) and TransRate 1.0.3 (Smith-Unna et al. 2016).

Gene expression analyses

We quantified sample libraries using salmon version 0.14.1 (Patro et al. 2017) to map reads back to our reference transcriptome. We then annotated our transcriptome using the SwissProt database (UniProt Consortium 2019) using Diamond version 0.9.10 (Buchfink et al. 2015) at an e-value cutoff of 0.001 and without the -max_target_seqs 1 flag. We imported quantification results into R version 4.0.2 (Team 2017) using the package “tximport” (Soneson, Love, and Robinson 2015), and conducted a gene level analysis using DESeq2 version 1.28.1 (Love et al. 2014). We shrunk log fold change estimates using the function “lfcShrink” in DESeq2, using the “normal” shrinkage estimate, which allows targeted comparisons. Since we were particularly interested in running comparisons between skin color patches (i.e., dorsum vs. venter), we then ran a Wald test comparing these two color patches at each time point from Gosner stage 26 through Gosner stage 46, as well as adult frogs. Additionally, we were interested in gene expression changes over time, so we ran a Likelihood Ratio Test to examine what genes change significantly throughout development (Gosner stages 26–46). We used a Benjamini-Hochberg (Benjamini and Hochberg 1995) correction for every analysis of gene expression, and considered a q-value of < 0.05 to be statistically significant. We then extracted data from our models for particular a priori color genes that play a role in color or pattern production in other taxa. This a priori list of candidate color genes is from (Stuckert et al. 2021) but was updated by searching for genes that have been implicated in coloration in genomics studies in the period 2019–2021 and includes 561 genes in total). Plots were produced using ggplot2 version 3.3.2 (Wickham 2011).

In this studywe use two approaches to our gene expression analysis. The first uses our de novo assembled transcriptome to analyze readsets. The second uses an O. pumilio assembly (Rogers et al. 2018) that was scaffolded with RNA seq data by Rodriguez et al. (Rodríguez et al. 2020) to analyze our data. Both approaches use the same overall methodological approach following this initial step, and our results are largely congruous between methods.

Weighted gene correlation network analysis

To identify networks of genes that interact in response to differences in tissue location or developmental stage, we used weighted gene co-expression network analysis (WGCNA) using the package WGCNA (Langfelder and Horvath 2008). For the WGCNA analysis, we used the variance stabilizing transformed data produced by DESeq2. WGCNA requires filtering out genes with low expression, which we did prior to differential expression analyses. We estimated a soft threshold power (β) that fits our data, by plotting this value against Mean Connectivity to determine the minimum value at which Mean Connectivity asymptotes, which represents scale free topology. For our data, we used β = 4, the recommended minimum module size of 30, and we merged modules with a dissimilarity threshold below 0.25. After module formation, we tested whether the eigen modules (conceptually equivalent to a first principal component of the modules) were correlated with sampling location (e.g., dorsum vs venter) at p < 0.05. To examine gene ontology of these modules we then took module membership for each gene within these modules and ranked them in decreasing order of module membership. We then supplied this single ranked list of module membership to the Gene Ontology enRIchment anaLysis and visuaLizAtion tool (GOrilla), and ran it in fast mode using Homo sapiens set as the organism (Eden et al. 2009) to examine enrichment of Function. Gene Ontology (GO) analyses were conducted on 10 December, 2020. Finally, we extracted hub genes for WGCNA modules of interest, and plotted the interactions between these hub genes within a module using Cytoscape version 3.8.0, running on Java 11.0.6 (Su et al. 2014).

Results

Transcriptome assembly, annotation, and read mapping

We produced a de novo transcriptome assembly for the Isla Popa morph of O. pumilio. This transcriptome had a TransRate score of 0.371, and an optimal TransRate score of 0.447, indicating high read support for the assembled transcripts. Our BUSCO analysis indicated that this transcriptome had 70.7% of the complete tetrapod orthologs (62.5% complete, single copies; 8.2% duplicated copies), 7.1% of copies were present but fragmented, and 22.2% were missing. We annotated 37,164 assembled transcripts (24.3%) using the Swiss-Prot database. We mapped reads to our reference transcriptome using salmon, successfully mapping 65.5% (± 7.1% SD) of reads.

We also utilized an alternative reference transcriptome produced by Rodriguez et al. (Rodríguez et al. 2020). This transcriptome was built by scaffolding an existing O. pumilio reference genome using RNA seq data (Rogers et al. 2018). We annotated this transcriptome in the same manner with which we produced our de novo assembly and were able to annotate 40,383 contigs out of 55,640 total contigs (72.57%). When we mapped our reads to this assembly using Salmon, we were able to map 67.33% (± 6.42% SD).

Transcriptome guided differential expression analyses

We conducted six individual comparisons. One comparison was a likelihood ratio test between all tadpole timepoints to examine any change in expression patterns. This comparison yielded a total of 6460 differentially expressed genes after Benjamini–Hochberg correction (all numbers below represent significant genes after the same correction), 160 of which are putatively color genes as identified via our a priori list.

Five of the comparisons we ran were a Wald test of differential expression between the dorsal and ventral skin at each time point (i.e., tadpoles from Gosner stage 26 through Gosner stage 46, as well as adult frogs). Comparatively few genes were differentially expressed at each larval developmental stage relative to adulthood (Table 1). In total, there were 1,176 unique differentially expressed genes between the dorsum and venter at any age, 54 of which were in our a priori candidate list. These 1,176 unique genes that were differentially expressed at any age contained six genes which were differentially expressed between the dorsum and venter in every age class (casq1, hand2, myh8, prva, tbx3, and zic1). Of these, hand2, myh8, tbx3, and zic1 have been identified as being related to the development of coloration in other taxa and were in our a priori candidate list (Fig. 3). There were a number of additional genes that were differentially expressed between dorsum and venter at multiple time points, including gata3 (4 age classes), edn3 (3 age classes), and asip and oca2 (2 age classes). Of the 54 genes that we identified a priori which were differentially expressed, the vast majority of these were differentially expressed exclusively in adults (38 out of 43). Many of these genes are related to melanophore production or melanin synthesis, including asip, oca2, mc1r, kit, lef1, pmel, sox10, tyr and tyrp1 (Fig. 4). Fewer were related to pteridine synthesis (xdh, only at Gosner stage 26) or carotenoid metabolism (adh1, adh2, aldh1a2, rdh10; Fig. 5).

Table 1 Specific comparisons conducted in this study
Fig. 3
figure 3

Genes that are differentially expressed between the dorsum (green) and venter (yellow) in every age class

Fig. 4
figure 4

Melanin genes that are differentially expressed between the dorsum (green) and venter (yellow)

Fig. 5
figure 5

Carotenoid (adh1, aldh1a2, rdh10) and pteridine (xdh) genes that are differentially expressed between the dorsum (green) and venter (yellow)

Genome guided differential expression analyses

Using the Rogers et al. (2018) genome that had been scaffolded by Rodriguez et al. (2020), we again conducted the same six individual comparisons. Results are qualitatively quite similar to our de novo transcriptome based approach. One comparison was a likelihood ratio test between all tadpole timepoints to examine any change in expression patterns. This comparison yielded a total of 5684 differentially expressed genes, 153 of which are putatively color genes as identified via our a priori list.

Again, comparatively few genes were differentially expressed at each developmental stage relative to adulthood (Table 1). In total, there were 1374 unique differentially expressed genes between the dorsum and venter at any age, 51 of which were in our a priori candidate list. These 1374 unique genes that were differentially expressed at any age contained nine genes that were differentially expressed between the dorsum and venter at each age (casq1, eno3, hand2, myom2, myoz1, pvalb, srl, tbx3, and zic1). Of these, eno3, hand2, myom2, myoz1, pvalb, tbx3, and zic1 have been identified as coloration in other taxa and found in our a priori candidate list. There were a number of additional genes that were differentially expressed between dorsum and venter at multiple time points, including myh8 (3 age classes, and differentially expressed at every time point in the transcriptome analysis) and asip, gata3, and oca2 (2 age classes). Of the 51 genes that we identified a priori which were differentially expressed, the vast majority of these were differentially expressed exclusively in adults (38 out of 43). Many of these genes are related to melanophore production of melanin synthesis, including asip, oca2, mc1r, kit, lef1, pmel, sox10, tyr and tyrp1. Fewer were related to pteridine synthesis (xdh, only at Gosner stage 26) or carotenoid metabolism (adh1, adh2, aldh1a2, rdh10).

Weighted gene correlation network analysis

A total of eight modules that were identified by WGCNA were significantly correlated with sample location (dorsum vs. venter) in adult samples (Table 2). Parsing the results of network analyses can be difficult, given the abundance of modules (identified herein by number, which is without biological significance) and GO terms that are identified as statistically significant, but may be unrelated to the phenotype of interest. The second module contained numerous GO terms related to actin binding or cytoskeletal binding, including actin binding (GO:0003779), actin filament binding (GO:0051015), and cytoskeletal protein binding (GO:0008092). These GO terms involve many myosin genes, including a number that were differentially expressed between the dorsum and venter in numerous age classes (e.g., myh8, myoz1). Further, four of the hub genes in the black module are those that were differentially expressed between the dorsum in every age class (casq1, myh8, myoz1, and prva; Fig. 6).

Table 2 WGCNA module results for tissue location (dorsal vs. ventral) in adults
Fig. 6
figure 6

Hub genes for the second module associated with sampling location, which WGCNA identified as correlated with coloration in adult frogs. The four genes highlighted in red (casq1, myh8, myoz1, and prva) are those that were differentially expressed between the dorsum and venter in every age class (tadpoles of Gosner stage 26 through adults)

Additionally, a total of eight gene modules were identified by WGCNA to be significantly associated with developmental stage (Table 3). As mentioned above, parsing these results and their significance can be difficult. Many of the enriched gene ontology terms in these modules relate to the production of the extracellular matrix. For example, the five most enriched GO terms in the first module are protein-lysine 6-oxidase activity (GO:0004720), chondroitin sulfate binding (GO:0035374), extracellular matrix structural constituent conferring compression resistance (GO:0030021), extracellular matrix constituent conferring elasticity (GO:0030023), and procollagen-lysine 5-dioxygenase activity (GO:0097493). Of these five most enriched gene ontology terms, all are related to collagen or the extracellular matrix. Further, a number of the hub genes in this module were collagen type 1–6 genes (10 out of 36 hub genes) and two were in the lysyl oxidase gene family (LOXL1, LOXL2; Fig. 7). Similarly, the fourth module has a variety of enriched GO terms related to keratinization (GO:0031424), cornification (GO:0070268), or intermediate filament organization (GO:0045109). Most of these modules are likely to be strictly related to development, and not coloration related per se. However, certain modules (e.g., the second module) are associated with an abundance of GO terms related to G proteins: GDP binding (GO:0019003), GTPase activity (GO:0003924), GTP binding (GO:0005525), purine ribonucleoside binding (GO:0032550), purine nucleoside binding (GO:0001883), ribonucleoside binding (GO:0032549), guanyl ribonucleotide binding (GO:0032561), guanyl nucleotide binding (GO:0019001), and nucleoside binding (GO:0001882). These modules are driven by a number of genes in the ras oncogene family. While these ras oncogenes are best known for cancer phenotypes in mutants, many of them have been implicated in coloration in other taxa (Houben et al. 2004; McLean et al. 2017).

Table 3 WGCNA module results across developmental stages
Fig. 7
figure 7

Hub genes for the first gene module associated with development, which WGCNA identified as correlated with age class. Genes in the lysyl oxidase gene family (LOXL1, LOXL2) are highlighted in yellow, while collagen type 1–6 genes are highlighted in red

Discussion

In this paper, we examined gene expression in the dorsum and venter throughout development in the strawberry poison frog, O. pumilio, to better understand both differential color production and ontogenetic changes in the integument throughout development. These data allow us to look at the differences in color production between green skin (dorsum) and yellow skin (venter), while also examining how changes in gene expression contribute to external development prior to, and after metamorphosis. Here we highlight genes that are differentially expressed between the dorsum and venter at multiple developmental stages (e.g., casq1, hand2, myh8, prva, tbx3, and zic1), key pathways varying across developmental stages, and finally candidate color genes for producing the difference between green and yellow integument.

Genes differentially expressed between the dorsum and venter at multiple developmental stages

A total of six genes were differentially expressed between the dorsum and venter at each time point: casq1, hand2, myh8, prva, tbx3, and zic1. Their differential expression throughout the developmental stages we tracked (recently hatched tadpoles through froglets) make these excellent candidate genes for the production of different structures in the integument between the dorsum and venter. This includes both structures related to color (chromatophores) as well as structures related to toxicity (granular glands). Hand2, myh8, tbx3, and zic1 have been putatively identified as players in coloration in other taxa. Heart- and neural crest derivatives-expressed protein 2 (hand2) encodes a transcription factor that plays a role in a variety of developmental processes, including heart morphogenesis (Shen et al. 2010) and limb development (Yelon et al. 2000). In dark-eyed juncos (Junco hyemalis), expression of hand2 is correlated with feather coloration, with higher expression in gray feathers than black feathers (Abolins-Abols et al. 2018). Similarly, hand2 has higher expression in the lighter-colored egg spots in certain cichlids (Santos et al. 2016). We see expression patterns that recapitulate the findings in both of these species: higher expression in lighter colored regions of the body (the venter in our study system).

Hand2 plays a role in maintaining expression of T-Box Transcription Factor 5 (Yelon et al. 2000). While we did not see the same pattern of differential expression in hand2 and tbx5, we did see the same expression between hand2 and the T-Box Transcription Factor 3 gene (tbx3). This potentially indicates that hand2 expression is also related to tbx3 expression in a manner similar to tbx5, which is plausible given they are both in the tbx2 subfamily. Tbx3 has been shown to inhibit pigment synthesis and produce lighter colored hair in horses (Imsland et al. 2016). We see a similar pattern in our data with much higher expression in the lighter ventral integument, potentially indicating that tbx3 is inducing lighter phenotypes in O. pumilio as it does in other taxa.

While hand2 and tbx3 seem to play a role in coloration specifically, other genes differentially expressed between the dorsum and venter seem to be related to early differentiation during development and not coloration. Myosin Heavy Chain 8 (myh8) is associated with multiple human diseases associated with variety of developmental defects, as well as abnormal spotted pigmentation such as Trismus‐pseudocamptodactyly syndrome (Toydemir et al. 2006) and Cushing’s syndrome (Majumder et al. 2020). This gene is clearly important to a variety of developmental processes, and it is likely that its effect on pigmentation is due to abnormal organization in the integument. Zic Family Member 1 (zic1) is critically important to basic neural crest development and signaling (Milet et al. 2013; Plouhinec et al. 2014). Further, zic1 is involved in dorsoventral patterning in zebrafish, and is affected by both BMP and hedgehog signaling (Rohr et al. 1999). Zic1 is known to influence the location of iridophores and melanophores in medaka (Oryzias latipes) and thus impact color and pattern phenotypes. However, this seems to largely be a result of how zic1 mutants cause mirroring effects between the dorsum and venter, seemingly indicating that this gene largely drives differences in dorsoventral patterning in general, and not coloration per se(Ohtsuka et al. 2004). Despite this, zic1 is a good candidate gene for coloration in this taxon, and plausibly others.

We also discovered differential expression between the dorsum and venter at every time point in the genes Calsequestrin 1 (casq1) and Parvalbumin (pvalb). These genes buffer Ca2+ and are a Ca2+ transporter respectively. Alkaloids are known to affect voltage gated channels. Pumiliotoxins in particular appear to be fairly specific inhibitors of Ca2+-ATPase activity (Tamburini et al. 1981). O. pumilio is an obligate egg-feeding species (Weygoldt et al. 1980). In addition to providing key nutrients to their young, these mothers also provision trophic eggs with alkaloids. Tadpoles first exhibit very low levels of toxicity around Gosner stage 34, and show an increase in both toxin quantity and diversity after this developmental stage (Stynoski et al. 2014). In O. pumilio, granular glands in this species do not begin developing until around Gosner stage 32, and are visible by Gosner stage 40 (Stynoski and O’Connell 2017). This is perhaps surprising, as O. pumilio tadpoles feed on a diet of trophic eggs provisioned with alkaloids through development and before they have the physiology to sequester them in the skin. We expect a greater number and possibly overall larger size of granular glands in the dorsum, as this is where potential predators are most likely to attack and in many toxic dendrobatids, including O. pumilio, the dorsal skin is visibly more granular than the ventral skin (CR-Z and YY pers. obs.). In this study, we found differential expression with higher expression in the venter in every time point in the genes casq1 and pvalb, but that the difference between the dorsum and venter dramatically decreased around Gosner stage 37–42, which corresponds with the onset and maturation of granular gland development in O. pumilio. Although this expression pattern is in the opposite direction than what we would have predicted, these results still lead to us proposing casq1 and pvalb as candidate genes. This finding, coupled with evidence that pumiliotoxins (and other alkaloids) impact Ca2+ channels and the developmental data lead us to hypothesize that casq1 and pvalb are associated with granular glands and play a role in preventing autotoxicity in O. pumilio.

Key pathways in developmental stages

Amphibian metamorphosis involves vast reorganization of the body. The process of metamorphosis is initiated via Thyroid Hormone (TH), which causes cascading effects throughout the organism, including organogenesis, reorganization of the intestinal tract, reorganization/ossification of the skull, and the initiation of apoptosis in the tail (Mukhi et al. 2010). Most work focuses on one of those processes, with less work focusing on reorganization of the integument. Our gene network analyses indicate a number of gene pathways and key hub genes that are putatively driving reorganization of the body and integument in frogs throughout development.

Prominent among these genes are those related to the extracellular matrix, collagen, and keratin. Collagen is a large family of extracellular matrix proteins, and are extremely abundant molecules in vertebrates (Myllyharju and Kivirikko 2004). We identified many collagen type 1–6 genes as hub genes in gene networks associated with age, indicating their potential importance in the reorganization of the integument which tadpoles undergo during metamorphosis. Not only are these genes likely candidates for reorganizing the integument during metamorphosis, but collagen genes may influence coloration as well. Coloration is at least in part related to physical structures in the skin. Recent work has found that the orientation and thickness of guanine platelets in iridophores plays a key role in light reflectance and thus the color of the skin (Twomey et al. 2020a; b). Genes which play a role in integumental structure (e.g., keratin, collagen, etc.) may change the orientation of these elements in subtle ways, and are thus plausible candidates for producing different colors. Several collagen genes (col1a1, col5a1, and col14a1) have already been implicated in the production of different colors in another study of O. pumilio or in the poison frog D. auratus (Stuckert et al. 2019; Rodríguez et al. 2020).

Candidate color genes

We also examined specific a priori candidate genes for coloration within our data. Of particular note are genes influencing melanin synthesis or carotenoid metabolism within our data, as those are genes we would predict to be playing a role in the differences between green and yellow skin. In amphibians, yellow and green skin is likely to derive from either pteridines or carotenoids deposited in xanthophores, the outermost chromatophore layer (Bagnara et al. 1968). We hypothesize that there are different quantities of carotenoids between yellow and green skin, although which carotenoids are at play and in which color skin carotenoids are in a greater abundance in this morph of O. pumilio are outstanding questions.Therefore it is difficult to identify a priori expectations for carotenoid gene expression because it is unclear which skin has more carotenoids. We identified a number of carotenoid genes that are differentially expressed between green skin (dorsum) and yellow skin (venter), including adh1, adh2, aldh1a2, and rdh10. All of these are genes in the retinoic acid pathway, and together they catalyze the first two steps in the transition from cellular retinol into retinoic acid (Parés et al. 2008; Thompson et al. 2019; Sandell et al. 2007; Niederreither and Dollé 2008). Genes in this pathway have been repeatedly linked to coloration in other taxa (eg (McLean et al. 2017, 2019; Stuckert et al. 2021)). In fact, adh1, rdh10 and aldh1a1 (note we found differential expression in aldh1a2, but they are likely playing similar roles here) were differentially expressed between red, blue, and green O. pumilio in another study (Rodríguez et al. 2020). Furthermore, the retinoic acid pathway is known to activate tbx3 (Ballim et al. 2012), one of the genes differentially expressed between the dorsum and venter at every developmental stage, and one that impacts color phenotypes in other taxa. Although these genes have been implicated in coloration and are correlated with coloration in this study, whether these genes are producing color or are merely associated with color due to downstream processes related to cleaved carotenoids remains unclear and requires functional testing.

Greens are likely to be derived from a combination of certain wavelengths of light being transmitted through pigments in the xanthophores (again, either pteridines or carotenoids), reflection from iridophores, and light absorption via melanins in the melanophores (although it is possible to produce greens via other mechanisms, see Taboada et al. 2020). Darker greens are putatively derived from higher quantities of melanin (Bagnara et al. 1968). However, the dorsum of these frogs also exhibits small black spots, and these black spots are likely contributing to some expression patterns related to melanin production. We identified a number of melanin genes that are differentially expressed between the and the lighter venter in our samples. We found a number of genes in the melanin synthesis pathway which are differentially expressed between the dorsum and venter, many of which have been identified as important in frogs or other vertebrates. Prominent amongst these are agouti signaling protein (asip) and melanocortin receptor 1 (mc1r). Mc1r is found in the plasma membrane of melanocytes, and activation of this gene leads to melanogenesis. We find higher expression of mc1r in darker skin, concordant with other examples in a variety of taxa (e.g., (Mandal et al. 2020; Espinasa et al. 2018; Mundy 2005; Suzuki et al. 2001). Asip is an agonist of mc1r, and when it binds to mc1r it can cause a switch from the production of eumelanin (brown/black pigment) to pheomelanin (red to reddish-brown pigment). Note that while pheomelanin has been implicated as a primary pigment in one frog species to date (Wolnicka-Glubisz et al. 2012), its presence and role in amphibians largely remains unexplored. We found that asip has the highest expression in the light venter of adults, a result that tracks with other taxa in which asip is correlated with lighter phenotypes.

Similarly, tyrosinase related protein 1 (tyrp1) is capable of producing a lighter colored phenotype via changing the ratios of eumelanin and pheomelanin (Videira et al. 2013). While tyrp1’s role in melanin synthesis is not entirely clear, it is possible that it is important for maintaining the stability of the tyrosinase protein (Sarangarajan and Boissy 2001). We also saw differential expression of the gene tyrosinase (tyr). Tyrosinase is the rate limiting enzyme in the production of melanin, and as a result is important to the ultimate color phenotype. Tyrp1 is also differentially expressed between colors between morphs in O. pumilio and D. auratus (Rodríguez et al. 2020; Stuckert et al. 2019). Counter to predictions, we see higher expression of tyrp1 and lower expression of tyr in the dark green skin. Indeed, our results for tyrp1 expression are counter to what was seen in Dendrobates auratus, in which lighter morphs had higher expression of tyrp1 (Stuckert et al. 2019). We also found differential expression in Proto-oncogene c-KIT (kit), which exhibited a parabolic expression pattern, peaking around Gosner stage 42. Expression was consistently higher in the green dorsal skin, consistent with evidence that kit is required for migration and survival of melanocyte precursors during development (Wehrle-Haller 2003). Mutations in this gene are responsible for piebaldism in humans, in which there are unpigmented areas in the body, typically the forehead and belly (Giebel and Spritz 1991). Finally, we discovered a similar pattern of expression in Lymphoid enhancer-binding factor 1 (lef1) to kit, with a parabolic expression pattern peaking around Gosner stage 42. Lef1 mediates Wnt signaling and ultimately impacts melanogenesis. In minks and the green-and-black poison frog, lef1 expression is correlated with lighter phenotypes (Song et al. 2017; Stuckert et al. 2019). Given their roles in development, and the somewhat surprising expression patterns between the dorsum and venter in our data, it is possible that the expression patterns we observed in lef1, tyr, and tyrp1 are primarily a result of development rather than coloration itself. Despite this, we believe a number of the genes we have discussed related to melanin synthesis are playing a role in coloration differences between the dorsum and venter in O. pumilio, particularly asip and mc1r.

Conclusion

In this paper, we examined gene expression in the dorsum and venter throughout development in the strawberry poison frog, O. pumilio. We used these data to examine the differences in color production between green skin (dorsum) and yellow skin (venter), while also examining how changes in gene expression contribute to external development. We have highlighted key genes that are likely related to coloration. These include melanin synthesis genes such as asip and mc1r, genes in the retinoic acid pathway (e.g., adh1, adh2, aldh1a2, and rdh10), and candidate color genes that were differentially expressed between the dorsum and venter at every developmental stage (hand2, myh8, tbx3, and zic1). Further, our data indicate that a number of genes related to Ca2+ buffering and transport (casq1 and pvalb) may be critically important to preventing autotoxicity, and are especially good candidates for allowing the sequestration and maintenance of pumiliotoxins. Finally, we discuss a number of enriched pathways associated with development that are likely key in the transition from aquatic tadpole to terrestrial froglet, such as those associated with oxidoreductase activity, the extracellular matrix, keratinization, and cornification.