Abstract
The use of molecular markers in sweetpotato spans first, second, and the more recent NGS-based (next-generation sequencing) third-generation platforms. This attests to the long-term interest in sweetpotato as an economically important crop. The six homoeologous chromosomes of sweetpotato lead to complex inheritance patterns that require accurate estimation of allele dosage. The use of NGS for dosage-based genotyping marked a significant advancement in sweetpotato research. Analytical pipelines have emerged to handle dosage-based genotype datasets that account for complex patterns of inheritance polyploid models. Recent approaches for dosage-based variant calling leverage reference genomes of putative ancestral progenitors or haplotype-resolved reference genome. Although pseudo-diploidized genotypes from second-generation platforms remain valuable for certain applications, especially when coarse genetic differentiation suffices, NGS-based genotyping offers a cost-effective, high-throughput, and cutting-edge alternative. Studies indicate that accurate dosage-based genotype datasets significantly enhance applications in linkage analysis, genome-wide association analysis, and genomic prediction. The affordability of NGS has spurred the adoption of high-density and dosage-sensitive molecular markers. Notably, in the three decades of molecular marker utilization in sweetpotato, about half of the peer-reviewed publications have emerged within the last four years, predominantly based on third-generation marker platforms.
You have full access to this open access chapter, Download chapter PDF
Keywords
4.1 Introduction
The use of molecular markers has had a significant impact on our understanding of the genetic basis for phenotypic expression and consequently enables applications in trait discovery (i.e., identification and functional validation of marker–trait associations) and crop improvement. Landmarks in the application of molecular markers have been achieved in the past half-century and have continued to evolve (Schlotterer 2004). Applications in sweetpotato that have benefitted from the use of molecular markers include genetic diversity, DNA fingerprint, genomic prediction, genetic and physical mapping, QTL mapping, and association mapping. Like in most crops, the earliest molecular marker technologies were deployed with some limited success and resolution. Nevertheless, some of the inferences drawn from those initial efforts still hold up following the use of better technologies. In this chapter, we highlight historical perspectives on the incremental improvements of molecular markers and their applications in sweetpotato. Besides the limitations inherent in specific molecular marker platforms, we also review how the biology and genomic landscape of hexaploid sweetpotato impacts the accuracy and utility of each molecular marker system.
Sweetpotato has benefited from molecular marker technologies since the advent of the first molecular markers, i.e., protein-based Isozymes and DNA-based Restriction Fragment Length Polymorphism (RFLP) markers. Like other species where these first-generation molecular markers were used, the use of polymorphisms in DNA-based markers was rapidly favored over the protein-based Isozyme marker system. The preferences for the DNA-based markers were mostly driven by the marker density across the genome and the technical ease of developing and generating these markers. Furthermore, the relative ease of localizing DNA molecular markers to physical genomic locations enables the use of these markers for functional analysis and marker–trait validation. For example, tightly linked or functional markers can be directly used in marker-assisted selection and/or for identifying candidate genes, which is a requirement for developing genetically modified organisms. Similarly, knowledge of alleles, allele effect estimates, and allele dosage effect, particularly in polyploids, can be highly informative and useful for evaluating how much of the phenotypic variation can be explained in controlled experiments before embarking on the implementation of time-consuming selective breeding or development of transgenic lines.
The evolution of DNA molecular markers and classification into first, second, and third-generation platforms is based on a combination of marker density (i.e., low-, medium- and high-density markers, respectively) and the strategy (i.e., assay method) used for the identification of polymorphisms, i.e., DNA–DNA hybridization, PCR, and sequence-based methods, respectively. Consequently, while the properties associated with these methods are often used as the basis for classifying them into first, second, and third-generation platforms, there are exceptions where more advanced methods incorporate methods from older technologies.
First-generation molecular markers use biochemical reactions (Isozymes), hybridization of antibodies (isozymes), and DNA probes (RFLP) to detect variants of the molecule separated on a gel matrix. Second-generation platforms are based on PCR amplification with random primers (e.g., RAPD: Random Amplified Polymorphic DNA) or sequence-specific primers (e.g., SSR: Simple Sequence Repeats, SCAR: SCAR, and STS: Sequence-Tagged Sites); a combination of PCR amplification and restriction enzyme digest (e.g., AFLP: Amplified Fragment Length Polymorphism; and CAPs: Cleavage Amplified Polymorphism); and detection of allelic differences based on physical changes in the DNA conformation of amplified fragments (SSCP: Single Strand Conformation Polymorphism). Third-generation molecular marker platforms, marked by the genomic era, typically target single nucleotide polymorphisms (SNPs) by deploying assays for allele-specific hybridization (SNP chip/array and DArT: Diversity Arrays Technology) and genome-wide sequencing. The NGS-based genotyping can be untargeted (e.g., GBS: Genotyping-By-Sequencing, and ddRADseq: double digest restriction-site associated DNA sequencing) or targeted (e.g., multiplexed PCR or hybridization-based sequence capture followed by sequencing). While third-generation platforms are designed to typically target SNPs, the start-of-the-art sequencing approaches can also identify other types of polymorphisms such as insertion-deletion polymorphisms (Indels) and short sequence repeats.
Even though sweetpotato research is supported by a small community of researchers, the diversity of molecular markers that have been used span multiple methods within each of the first-, second-, and third-generation marker platforms (Figs. 4.1 and 4.2). These highlight the importance of sweetpotato as a globally and economically important crop, as well as the interest in its polyploid evolution, domestication, and relationship with other wild diploid Ipomoae spp. (i.e., morning glories). These research interests span decades and pre-date molecular marker technologies.
4.2 First-Generation Molecular Marker Platforms Deployed in Sweetpotato
Following the inability to use morphological and cytogenetic markers to establish phylogenetic relationships between cultivated sweetpotato and its crop wild relatives (CWR), the earliest uses of molecular markers in sweetpotato were reported in 1992 at the USDA Agricultural Research Station, Griffin, GA, USA (Jarret et al. 1992) and North Carolina State University, Raleigh, NC, USA (Reyes and Collins 1992). The studies used RFLP and Isozyme markers, respectively, to understand phylogenetic relationships among cultivated sweetpotato and crop wild relatives, particularly species in the batatas complex. All polyploid populations were found to share almost the same number of isozymes as the diploid I. trifida, one of the putative ancestral progenitors of cultivated sweetpotato. The RFLP markers revealed diploid I. trifida and two Mexican tetraploids, an endangered I. tabascana (Austin 1988; Austin and De La Puente 1991; McDonald and Austin 1990) and an accession K233, that were closely related to cultivated sweetpotato with strong bootstrap support. This finding is in concordance with the fact that I. trifida has been observed to have traits that are required for commercial/cultivated sweetpotato, i.e., some lines develop thick roots similar to the sweetpotato, although rare, and some lines are sexually compatible with sweetpotato (Iwanaga 1988; Orjeda et al. 1990). This was reported at the first planning conference on exploration, maintenance, and utilization of sweetpotato genetic resources, held at the International Potato Center, Lima, Peru (Austin 1988). The RFLP marker data suggested two possible rounds of polyploidization events. The study with these molecular markers provided the first evidence for the allo-autopolyploid nature of the hexaploid sweetpotato.
Besides these 2 studies that used isozyme and RFLP markers in sweetpotato (Jarret et al. 1992; Reyes and Collins 1992), another study that used chloroplast-derived RFLP markers in new world Ipomoea spp. (McDonald and Austin 1990). The transition to the use of second-generation markers was a rapid shift due to their ease of use and relatively lower cost. The Isozyme marker system provides a universal protocol that produces markers that are conserved across diverse genetic backgrounds (intra- and inter-specific). The evolutionary constraints on isozyme proteins and the polymorphisms that do not inactivate enzyme activity allow for their utility in intra- and inter-specific studies. The DNA-based RFLP marker and the inclusion of a more diverse set of old and new world Ipomoea species provided the initial insights into the origins of sweetpotato and its relationship with CWR. Nevertheless, technical issues and low-throughput assays associated with first-generation molecular markers limit their routine application. For Isozyme markers, the requirement for fresh samples (or freezing of fresh material), instability of some proteins, and a limited number of markers across the genome can limit their application. They can also suffer from bias since these proteins can be a product of direct selection that is unrelated to traits of interest or phylogenetic models of species trees (Schlotterer 2004). While RFLPs can produce a higher number of markers across the genome (i.e., a high abundance of restriction sites), in practice the tedious assay limits the number of markers that can be developed. Additionally, RFLPs and isozymes require the development of DNA probes and biochemical assays, respectively, and large amounts of starting material are required. Consequently, their applications have only been limited to diversity, phylogenetic, and low-resolution linkage mapping studies. Because they are dominant markers (2 alleles represented by the presence and absence of a band), the inability to identify heterozygotes (achievable with co-dominant markers) is a major drawback for estimating allele effects. While it is impossible to determine allele dosage with isozyme makers, allele dosage can be theoretically inferred from the intensity of bands using RFLP markers, however, this is often not feasible since there is low confidence in making such an inference.
4.3 Second-Generation Platforms Deployed in Cultivated Sweetpotato
Before the emergence of SNP arrays and NGS-based genotyping, which is central to the state-of-the-art third-generation marker platforms, second-generation molecular markers were the markers of choice since the early 1990s and remain in use. Similar to the observations in other species, only a handful of second-generation markers have been consistently used in sweetpotato (i.e., RAPD, AFLP, and SSR; Fig. 4.2). Other second-generation markers occasionally used in sweetpotato include ITS, RIP, SRAP, cpSSR, ISSR, EST-SSR, and competitive allele-specific PCR markers (e.g., KASP). While KASP markers have been sparingly used in the past, there is sustained interest to continue using this marker platform for applications that require only a few markers. Since the use and results from the low-throughput KASP marker platform have not been reported in peer-reviewed publications, its frequency of use in sweetpotato is anecdotal. Some of these second-generation markers, such as AFLPs and SSRs, can produce medium-density markers (i.e., a few thousand), which allows for their utility in genetic analyses that require genome-wide marker data. Besides their use in diversity, fingerprinting, and phylogenetic analysis, they have been used to generate the first sweetpotato linkage map and QTL analyses (Kriegner et al. 2003). The limited genome resolution in large genomes such as the approximately 3 Gb genome of sweetpotato results in major gaps in linkage groups, multiple noncontiguous linkage groups that should all map to a single chromosome, and the unknown sequence context of the markers that are rarely anchored to positions in a physical reference genome assembly. Consequently, their utility for functional validation will tend to be limited. Second-generation markers suffer from this limitation since a primer pair can produce amplification products from multiple loci. Consequently, since amplicons are not sequenced the sequence context of alleles cannot be verified or confidently assigned to a physical genomic location. For example, SSR primer pairs tend to produce multiple alleles/bands (i.e., fragments with variable sizes) depending on PCR conditions, even in diploid genomes that should only produce 2 alleles per sample.
With the availability of reference genomes for hexaploid sweetpotato, the sequence context and physical position(s) for some of these second-generation markers can be determined. Although not limited to second-generation markers, genotypes with multiple alleles per locus and individual (even in diploids), are indicative of alleles that are obtained from multiple loci. Not to be confused with multi-allelic markers in polyploids (i.e., potentially up to 6 alleles per locus in a hexaploid), multi-locus markers derived from paralogous sequences can lead to erroneous interpretation in some genetic analyses and violate the assumption that markers are derived from a single locus. For example, this is one of the reasons for segregation distortion in genetic linkage maps and possibly false negatives and false positives during marker–trait associations. In the latter, it would be more problematic for single marker–trait genome-wide association analysis and to a lesser extent in interval mapping approaches that use interval mapping approaches based on markers that have been tested to segregate in mendelian fashion, i.e., no segregation distortion (Table 4.1).
RAPD markers were mostly used for diversity studies from 1994 to 2020, while AFLP markers were for genetic diversity and linkage analysis (genetic map construction and QTL analysis) from 2000 to 2014. Consequently, both RAPD and AFLP markers have not been reported for use in sweetpotato studies. On the other hand, SSR markers have remained in use since they were first in 2008. Application of SSR for genetic analyses includes genetic diversity analysis, phylogenetic analysis, genetic linkage analysis, marker–trait association, and genomic prediction. The first effort to characterize and develop SSR loci was based on EST-derived (expressed sequence tags) SSR markers. A study that aimed to develop genome-wide SSR markers revealed that of the 181,615 ESTs, a total of 8294 SSRs were identified from 7163 unique ESTs, i.e., a total of 3.9% of ESTs evaluated (Wang et al. 2011). The di-nucleotide repeats were the predominant repeats (41.2%), with AG/CT accounting for 26.9% of repeats. Other repeats in high frequency include AAG/CTT, AT/TA, CCG/CGG, and AAT/ATT, and accounted for 13.5%, 10.6%, 5.8%, and 4.5% of SSR repeats. Consequently, only 1060 high-quality SSR primer pairs were designed. Following validation, 816 primer pairs produced reproducible and strong amplificons, while 195 and 342 SSR markers exhibited polymorphism between 2 and among 8 cultivated sweetpotato clones, respectively. The medium-density marker data derived from SSR limits their application for analysis that requires genome-wide data. The co-dominant nature and ease of use (i.e., simple PCR assay and ability to resolve some polymorphism on agarose gels) make SSR markers a popular choice.
4.3.1 Application of Second-Generation Markers for the Relationship Between Sweetpotato and Its Crop Wild Relatives, and Genetic Diversity Studies
The initial use of ITS markers derived from the internal transcribed spacer (i.e., spacer DNA situated between the subunits of ribosomal RNA genes) revealed that the ITS markers poorly resolved relationships among 13 Ipomoea spp (Huang and Sun 2000). In contrast, AFLP and SSR markers were found to be more efficient in characterizing genetic diversity and phylogenetic relationships at both intra- and interspecific levels in 36 accessions that represent 10 Ipomoea spp (Huang et al. 2002). A total of 1182 AFLP bands (loci) were identified, of which 891 were polymorphic across all accessions evaluated. The AFLP markers were generated using six primer combinations. Consistent with using first-generation markers to study the relationship between sweetpotato and its crop wild relatives, I. trifida was found to be the most closely related to hexaploidy sweetpotato (I. batatas), while I. ramosissima and I. umbraticola were the most distantly related to I. batatas (Huang et al. 2002). In a study that used ITS sequences, while the nuclear ITS suggested an autopolyploid origin for sweetpotato, two I. batatas chloroplast lineages were identified (Roullier et al. 2013). More divergence was found between the I. batatas chloroplast lineages than with the closest putative progenitor, I. trifida. While this indicated allopolyploid or all-autopolyploid origin, the study also proposed two distinct autopolyploidization events involving polymorphic wild populations of a single progenitor species. Subsequent studies with high-density third-generation molecular markers all support an allo-autopolyploid origin from previous findings.
The second-generation markers routinely used for genetic diversity studies include platforms that can produce at least medium-density markers data, i.e., hundreds to a few thousand markers (Fajardo et al. 2002; Zhang et al. 2000, 2004). A total of 210 polymorphic AFLP fragments revealed the highest genetic diversity was found in Central America and the lowest in Peru-Ecuador. These results support the hypothesis that Central America is the primary center of diversity and most likely the center of origin of sweetpotato. Furthermore, while the post-Columbus dispersal of sweetpotato to Asia and the Pacific is well documented, the hypothesis that there was a prehistoric transfer of sweetpotato by Peruvian or Polynesian voyagers from Peru to Oceania has long been a controversial issue. A set of 210 AFLP markers revealed that Mexican and Oceania cultivars grouped together, while Peru-Ecuador germplasm was genetically distant from Oceania germplasm (Zhang et al. 2004). Consequently, the study suggested that Peru-Ecuador may not be the source of the Oceania germplasm.
4.3.2 Application of Second-Generation Markers for Genetic Linkage Analysis
The medium-density marker data produced from AFLP and SSR markers have been applied to genetic linkage map construction and QTL analysis. While these molecular marker platforms cannot call dosage directly, studies have classified the pseudo-diploidized codominant SSR and dominant AFLP genotypic classes to infer dosage based on the Mendelian segregation ratio. Linkage models in these studies often tested for autopolyploidy (hexasomic) and allo-autopolyploidy (tetrasomic) without direct dosage information. Double reduction events, where sister alleles move to the same gamete during meiosis and multivalent formation, were not modeled in the era of second-generation markers. In hexasomic inheritance (autopolyploids), pairing is random with all pairs of homoeologous chromosomes. Assuming sweetpotato is an allo-autopolyploid and that preferential pairing occurs, we would expect hexasomic (if there is partial preferential pairing), tetrasomic (random pairing with pairs of 4 of 6 homologous chromosomes), and disomic (random pairing with pairs of 2 of 6 homologous chromosomes) inheritance.
Based on Jones’s cytological hypotheses in sweetpotato (Jones 1967), where the other parental genotype is nulliplex, markers were classified into four types based on their segregation ratios: (1) simplex/single-dose markers present in one parent in a single copy and with a segregation ratio of 1:1 (presence: absence); (2) duplex/double-dose markers present in one parent in two copies and with hexasomic (4:1), tetrasomic (5:1), or disomic/tetradisomic (3:1) ratios; (3) triplex/triple-dose markers present in one parent in three copies and with hexasomic (19:1), tetradisomic (11:1) or disomic (7:1) ratios; and (4) double-simplex markers present in both parents in a single copy and with a 3:1 segregation ratio (Cervantes-Flores et al. 2008; Kriegner et al. 2003). Inferring dosage or mode of inheritance in this manner is only limited to cases where the other parent is a nulliplex (or simplex in both parents), i.e., can’t be applied to multiple dose marker genotypes in both parents. Furthermore, even though second-generation genetic markers can produce bridge markers (allele segregating in both parents; simplex-by-simplex marker configuration), earlier genetic linkage maps using these markers did not always use the bridge marker information to determine the 6 sets of linkage groups that correspond to the 6 sets of homoeologs. The exception is a study that partially identified some homoeologous linkage groups (Ma et al. 2020). Furthermore, no attempt was made using these markers to create a consensus map from the two parental maps. These highlight the major limitations of second-generation molecular markers for applications in genetic linkage analysis. The limitations are probably due to the inability to directly call dosage-based genotypes and the high marker density required for this genome-wide analysis.
The first attempt to construct a genetic linkage map in sweetpotato was based on a study that used 134 polymorphic markers and 76 F1 progenies (Thompson et al. 1997). While a linkage map was not constructed, the 1:1 segregation ratio from 74 polymorphic markers (presence–absence of bands in Vardaman and Regal parents) indicated linkage map construction is possible with sufficient markers and progenies. The first genetic linkage map was constructed by (Kriegner et al. 2003) using a total of 632 (Tanzania) and 435 (Bikilamaliya) AFLP markers that were mapped to and ordered on 90 and 80 linkage groups, respectively. The map lengths covered 3655.6 cM and 3011.5 cM, respectively, with an average marker interval of 5.8 cM. In this study, to determine if sweetpotato is an autopolyploid or allopolyploid, the ratio of linkage in the coupling phase to linkage in the repulsion phase and the ratio of non-simplex to simplex markers were examined. The results support the predominance of polysomic inheritance with some degree of preferential pairing that suggests an allo-autopolyploid genome. Consequently, Cervantes-Flores et al. (2008) generated 1944 and 1751 AFLP bands in Tanzania and Beauregard, with 1511 and 1303 being single-dose markers, respectively (Cervantes-Flores et al. 2008). The framework maps consisted of 86 and 90 linkage groups for Tanzania and Beauregard, respectively. The first sweetpotato map that used SSR markers was based on ISSR (inter simple sequence repeat) markers, which produced a low-resolution map with linkage groups ranging from 10.7 to 149.1 cM (Chang et al. 2009). Only 37 and 47 markers were mapped to parental maps spanning 479.8 and 853.5 cM, respectively. Another study that used only 130 EST-SSR markers, combined it with 1824 AFLP to produce a genome-wide genetic linkage map of the sweetpotato genome (Yu et al. 2014). The only case of deploying SRAP (sequence-related amplified polymorphism) markers sweetpotato is for linkage analysis (Li et al. 2010). A total of 800 SRAP markers were used to construct 2 parental linkage maps (Luoxushu 8 × Zhengshu 20) with 473 (81 linkage groups) and 328 (66 linkage groups) spanning 5802.5 and 38967.9 cM, respectively, and with average marker interval of 10.2 and 12.0 cM, respectively.
An SSR-based genetic linkage map with high resolution that comprised only SSR was constructed by using a de novo assembly of publicly available ESTs and mRNAs in sweetpotato (Zhang et al. 2016). A total of 1824 SSR markers were obtained from 1476 primer pairs. Of these, 214 pairs of primers that identified polymorphic loci produced 1278 alleles with an average of 5.97 per locus and a major allele frequency of 0.77. Another study that mapped only 210 SSR markers produced a gene linkage map that had a low resolution and only produced small linkage groups that were limited in their representation of the sweetpotato genome (Kim et al. 2017). Similarly, a map by Ma et al. (2020) that used higher marker density (484 and 573 polymorphic SSR markers) in Jizishu 1 and Longshu 9, respectively, had a significant number of the linkage groups that mostly small and low resolution (Ma et al. 2020). Most of these efforts that use hundreds of SSR markers for linkage analysis revealed the limitation of the marker density in initial SSR markers compared to studies that show that at least a few thousand markers are required for good genome coverage.
Meng et al. (2021) were able to generate 5057 polymorphic SSR markers from 571 polymorphic genomic SSR primer pairs and 35 EST-based SSR primer pairs. They produced 90 linkage groups and covered 13,299.9Â cM with a marker density of 2.6Â cM. Using 3009 SSR markers, the Zhengshu 20 parental map spanned 11,1229Â cM, comprised 90 linkage groups, and had a marker density of 3.7Â cM. The SSR primer pairs were derived from an initial 2545 primer pairs, including 1215 genomic SSR (gSSR) primer pairs and 1330 BES-SSR (bSSR) primer pairs designed from BAC-end sequences, respectively. Using a cross between Xushu 18 and Xu 781 sweetpotato cultivars and 601 SSR primer pairs, Zheng et al. (2023) generated 5547 SSR markers and 4599 SSR markers, respectively, to produce parental maps that also spanned 18,263.5Â cM and 18,043.7Â cM, respectively.
At an average of 8.86–9.23 markers per SSR primer pair, it is expected that a significant number of the primer pairs are non-specifically amplified products from multiple loci (paralogous sequences). The maximum number of possible alleles per individual can be no more than 6 alleles in hexaploidy sweetpotato. To resolve this ambiguity, unique SSR bands (i.e., same fragment or similar length under low resolving power of agarose gel) are often scored as dominant markers that are derived from a single locus. Nevertheless, some paralogs can produce amplicons of the same fragment length. The ability to accurately score genotypes is crucial for polyploids since markers have a high chance to erroneously fit multiple segregation ratios under hexasomic, tetrasomic, and disomic inheritance (i.e., 1:1, 4:1, 5:1, 3:1, 19:1, 11:1, 7:1, 3:1).
4.3.3 Application of Second-Generation Markers for Selection of Parental Genotypes
While second-generation markers have not been used for genomic selection in sweetpotato, to prevent the narrowing of the genetic base during breeding, knowledge of genetic relationships and diversity is important. Naidoo et al. (2022) used SSR markers for the selection of parental cultivars to maintain genetic diversity during breeding (Naidoo et al. 2022). Using 31 genotypes originating from the African and American continent and eight highly polymorphic SSR primers that produced 83 alleles, it was revealed that despite the high diversity among the genotypes, genetic distances among the genotypes were relatively low. To some extent, clustering identified three groups that reflect geographic origins and pedigree. The study suggested two heterotic groups African and American origin.
4.4 Third-Generation Platforms Deployed in Cultivated Sweetpotato
The third-generation molecular markers (aka next-generation or advanced molecular markers) are the current state-of-the-art that represents a significant advancement marked by high-throughput genotyping at a significantly lower cost than any of the other marker platforms. These molecular marker platforms use technologies that include DNA arrays (on slides or beads made from plastic or glass) or NGS. They offer improved capabilities for applications that require high-density genome-wide marker data, and allele-dose information, and for studying complex traits and complex genomic features.
Diploid and a few polyploid organisms started benefiting from early third-generation marker platforms, including SNP arrays developed in 1998 (The Whitehead Institute and Affymetrix SNP array/chip; Pinkel et al. 1998) and NGS-based genotyping (Baird et al. 2008; Balagué-Dobón et al. 2022; Elshire et al. 2011; Peterson et al. 2012). However, sweetpotato, like most complex polyploids lagged due to the prohibitive cost of developing SNP arrays for a small community of use and the inability of early NGS-based genotyping platforms to accurately capture allele dosage. Initially, the ability of SNP arrays to call allele dosage was limited and in the cases where they were used in polyploids, genotype calls were often limited to 2 × pseudo-diploidized genotypes in autopolyploids (e.g., potato) or subgenome-specific diploid genotype calls in allopolyploid (e.g., wheat) (Sun et al. 2020). Since then, the development of tools such as FitTetra (Zych et al. 2019) and ClusterCall (Schmitz Carley et al. 2017) now allow for dosage calling, although the application is limited to auto-tetraploids.
For higher ploidy levels in autopolyploids, the development of superMASSA allowed for dosage calling using a graphical Bayesian model for SNP genotyping. However, its application is limited to biparental populations since it requires Mendelian segregation information to re-classify genotypes into appropriate dosage classes (Serang et al. 2012). Although it can model all dosage configurations, it is similar to earlier approaches used for second-generation markers in that dosage calls are imputed or re-assigned based on expected Mendelian segregation rather than strictly using allelic read depth information. The approach is necessary since early third-generation marker platforms are inherently limited in their ability to accurately quantify allele dose. The polyrad (Clark et al. 2019) and updog (Gerard et al. 2018) software provide similar functionality as superMASSA but extend dosage calling to diversity panels and natural populations by using multiple features to update priors. The features in polyrad include population structure, model allele frequency gradients, rate of self-fertilization range from zero to one, and linkage disequilibrium of markers that have known physical map positions in the reference genome. The features in updog include allele bias, overdispersion, and sequencing error.
While the advent of reduced representation sequencing (RRS) democratized high throughput genotyping in model, non-model, and under-studied crops, the ability to use them for dosage calling is limited in the first iteration of RRS protocols (e.g., RADseq, GBS, and ddRADseq). This is due to the allele read depth ratios that are often highly skewed and lack uniformity in read depth across the genome resulting in a significant number of loci with low read depth that is insufficient for dosage calling. These limitations necessitated the need for tools such as superMASSA, polyrad, and updog. In studies where the first iteration of RRS methods was used, sweetpotato genotype calls were often based on 2 × pseudo-diploidized genotype calls rather than 6 × dosage calls. Examples of sweetpotato studies that used pseudo-diploid genotypes from RRS data include using DArTseq for GWAS, quality assurance and control, and genomic prediction studies (Bararyenya et al. 2020; Gemenet et al. 2020a, 2020b); and using SLAF-seq for linkage/QTL analysis (Yan et al. 2022). While analysis with the pseudo-diploidized genotypes produced meaningful results, comparison with 6 × dosage genotype calls in these studies revealed using dosage calls often produced superior results (Gemenet et al. 2020b). Other studies have attempted to use this first iteration RRS method with some limited success (Table 4.2).
Dosage-sensitive genotyping-by-sequencing (qRRS-based genotyping) has emerged as an amenable and robust strategy in polyploids due to its low cost and ability to quantitatively sequence alleles for dosage estimation. While approaches implementing quantitative reduced representation sequencing, GBSpoly and OmeSeq-qRRS, have been used in sweetpotato (da Silva et al. 2020; Gemenet et al. 2020b; Mollinari et al. 2020; Oloka et al. 2021; Wadl et al. 2018; Wu et al. 2018), multiplexed-PCR based approaches are been tested in sweetpotato and other polyploid crops. The GBSpoly protocol is a modification of the ligation-based GBS protocol that uses double-digestion with methylation-sensitive (TseI; rare cutter) and methylation-insensitive (CviAII, frequent cutter) restriction enzymes. To improve the quantitative sequencing assay, a library construction method (OmeSeq-qRRS) uses isothermal amplification (instead of a ligation approach) to incorporate barcoded adapters into genomic fragments following double-digestion of the genome with methylation-insensitive restriction enzymes (NsiI and NlaII). The methylation-insensitive restriction enzymes eliminate variability in hypomethylated and hypermethylated sequences, which are also variable across tissue types. A lower number of overall NGS reads is sufficient to achieve similar marker density in OmeSeq-qRRS compared to GBSpoly-qRRS. Additionally, the ligation bias of smaller genomic fragments and chimeric ligation are mitigated and eliminated, respectively, by using isothermal amplification.
The RRS/NGS-based genotyping is more amenable to polyploids than SNP arrays, which require significant assay development costs and accurate genome sequence data and assembly. The more recent NGS-based genotyping platforms aim to quantitatively sequence loci while maintaining allelic ratio in other to estimate dosage more accurately. Consequently, tools such as GATK, Freebayes, and Freebayes directly use quantitative sequencing information (i.e., allele read depth) for dosage-based variant calling by using haplotype-based calling. Variant calling based on dosage presents greater challenges in polyploids due to many potential genotypes at each locus, which arise from various combinations of unique alleles. Sequence reads are incapable of distinguishing identical copies of alleles, particularly when not physically linked to other heterogenous alleles. Additionally, as the ploidy level increases, determining allele copy numbers through dosage calling becomes increasingly complex at a specific read depth. The adoption of a haplotype-based strategy enhances genotyping accuracy by jointly assessing the combinations of multiple nearby alleles, known as haplotypes (Cooke et al. 2021).
While variant calling tools for polyploids assume that the genome is autopolyploid, studies in sweetpotato often indicate that it is an allo-autopolyploid and that large structural variations might exist between sweetpotato accessions (Wu et al. 2018). To address the allo-autopolyploid nature, accurate variant calling in polyploids can benefit from using multiple reference genomes of putative ancestral progenitors and haplotype-resolved genome assembly (6 sweetpotato haplomes). In addition to the sequencing of putative ancestral progenitors within the I. batatas complex (I. trifida, I. triloba, and I. tabascana), haplotype-resolved genome assemblies based on multiple sweetpotato cultivars are available (http://sweetpotato.uga.edu). A variant calling pipeline, GBSapp (Bararyenya et al. 2020; Gemenet et al. 2020b; Wadl et al. 2018), was developed to address the allo-autopolyploid nature of the sweetpotato genome by resolving sequence reads that map uniquely to haplomes or subgenomes and that are conserved across all 6 sweetpotato haplomes or subgenomes (using putative ancestral progenitors). Modeling dosage based on the number of homoeologs containing a specific sequence is particularly important for allo-autopolyploids since current variant calling tools, including haplotype-based variant calling tools, will assume and erroneously coarse genotypes to the dosage specified by the user (i.e., assumes strict autopolyploidy). In the absence of high-quality and complete haplotype-resolved reference genome assembly, which is preferred for variant calling, the GBSapp pipeline can use the known progenitors. Since the identity of the other ancestral progenitor is not known with certainty, GBSapp uses the closest ancestral diploid progenitor (I. trifida) and the more distantly related diploid (I. triloba) as reference subgenomes. This ensures that despite the evolutionary divergence of the latter, sequences conserved across both genomes would likely exist hexaploid sweetpotato and other species within the batatas species complex, hence dosage would be most likely 6 × dose.
4.4.1 Application of Third-Generation Markers for the Relationship Between Sweetpotato and Its Crop Wild Relatives, and Genetic Diversity Studies
The two studies conducted to understand the origins and domestication of sweetpotato were based on shotgun whole genome sequencing (Munoz-Rodriguez et al. 2022; Yan et al. 2024). Both studies confirmed I. trifida as a putative ancestral progenitor, that hexaploid sweetpotato is an allo-autopolyploid, and aimed to identify the putative tetraploid ancestral progenitor. These are findings that significantly advance the understanding of sweetpotato domestication and that inform if variant calling, and other genetic analyses should model sweetpotato as an autopolyploid or allo-autopolyploid. Munoz-Rodriguez et al. (2022) proposed I. aequatoriensis, a species of Mexican origin, played a direct role in the origin of the hexaploidy sweetpotato. This also underscored Central America as the origin of sweetpotato. Likewise, Yan et al. (2024) also used a haplotype-based phylogenetic analysis (HPA) to confirm that sweetpotato originated from reciprocal crosses between a diploid and tetraploid progenitor.
Genetic diversity analysis performed with two sets of USDA diversity panel (417 and 604 accessions) and using the GBSpoly-qRRS marker platform revealed similar results (Slonecki et al. 2023; Wadl et al. 2018). The clusters identified from STRUCTURE and phylogenetic analysis correspond to the geographical location that the accessions were collected from. Accessions from the Pacific Islands and Caribbean/Central American cluster within close proximity, supporting initial studies that germplasm from the Pacific Islands originated from Central America. Distinct clusters comprised of North American accessions cluster within close proximity.
4.4.2 Application of Second-Generation Markers for Genetic Linkage Analysis
Genetic linkage maps constructed with third-generation markers are typically marked by of high-density marker data set (i.e., about 30,000 markers). Early NGS-based genotyping platforms (ddRADseq and SLAF-seq) are limited to using simplex and nulliplex markers for linkage analysis (Shirasawa et al. 2017), while recent quantitative NGS-based genotyping markers (qRRS) using all marker configurations include low- and high-dose markers (Mollinari et al. 2020). Several publications have used third-generation markers for both QTL and genome-wide analysis (Bararyenya et al. 2020; da Silva et al. 2020; Gemenet et al. 2020c; Haque et al. 2020a; Haque et al. 2020b; Oloka et al. 2021).
4.4.3 Application of Second-Generation Markers in Genomic Prediction
The third-generation markers deployed for genomic prediction in sweetpotato are derived from the DArTSeq and GBSpoly-qRRS platforms (Batista et al. 2022; Gemenet et al. 2020b). Using DArtSeq (pseudo-diploidized markers) and GBSpoly-qRRS, genomic predictive abilities (PA) in a biparental population (Beauregard x Tanzania, BT) across root quality and yield-related traits revealed that models that used allele dosage information and G-matrix based additive effects have the best PA for most of the traits (Gemenet et al. 2020b).
4.5 Conclusion
Until 2018, sweetpotato lagged in the use of high throughput third-generation molecular markers since existing methods predating 2018 were limited in their ability to estimate allele dosage. Consequently, the need for high throughput and inexpensive genotyping in sweetpotato has driven innovations in quantitative genotyping-by-sequencing (or qRRS). The availability of NGS-based RRS and shotgun whole genome sequencing (WGS) data in sweetpotato has prompted the need to develop cutting-edge bioinformatic and analytical tools/pipelines for polyploid genetics that are being applied to other simple and complex polyploids. These tools include GBSapp (Wadl et al. 2018), MAPpoly (Mollinari et al. 2020), QTLpoly (da Silva et al. 2020), VIEWpoly (Taniguti et al. 2022), haplotype-based phylogenetic analysis (Yan et al. 2024), Ranbow (Moeinzadeh et al. 2020), and ngsAssocPoly (Yamamoto et al. 2020). The emergence of molecular marker platforms sensitive enough for quantifying allele dosage is revitalizing various areas of research in sweet potatoes and their wild counterparts.
References
Austin DF (1988) The taxonomy, evolution and genetic diversity of sweet potatoes and related wild species. In: Gregory P (ed) Exploration, maintenance and utilization of sweet potato genetic resources. Report of the 1st sweet potato planning conference 1987, International Potato Center, Lima, Peru
Austin DF, De La Puente F (1991) Ipomoea tabascana, an endangered tropical species. Econ Bot 45:435. https://doi.org/10.1007/BF02887086
Baird NA, Etter PD, Atwood TS, Currey MC, Shiver AL, Lewis ZA, Selker EU, Cresko WA, Johnson EA (2008) Rapid SNP discovery and genetic mapping using sequenced RAD markers. PLoS ONE 3(10):e3376. https://doi.org/10.1371/journal.pone.0003376
Balagué-Dobón L, Cáceres A, González JR (2022) Fully exploiting SNP arrays: a systematic review on the tools to extract underlying genomic structure. Brief Bioinform 23(2). https://doi.org/10.1093/bib/bbac043
Bararyenya A, Olukolu BA, Tukamuhabwa P, Grüneberg WJ, Ekaya W, Low J, Ochwo-Ssemakula M, Odong TL, Talwana H, Badji A, Kyalo M, Nasser Y, Gemenet D, Kitavi M, Mwanga ROM (2020) Genome-wide association study identified candidate genes controlling continuous storage root formation and bulking in hexaploid sweetpotato. Bmc Plant Biol 20(1). https://doi.org/10.1186/s12870-019-2217-9
Batista LG, Mello VH, Souza AP, Margarido GRA (2022) Genomic prediction with allele dosage information in highly polyploid species. Theor Appl Genet 135(2):723–739. https://doi.org/10.1007/s00122-021-03994-w
Cervantes-Flores JC, Yencho GC, Kriegner A, Pecota KV, Faulk MA, Mwanga ROM, Sosinski BR (2008) Development of a genetic linkage map and identification of homologous linkage groups in sweetpotato using multiple-dose AFLP markers. Mol Breeding 21(4):511–532. https://doi.org/10.1007/s11032-007-9150-6
Chang KY, Lo HF, Lai YC, Yao PJ, Lin KH, Hwang SY (2009) Identification of quantitative trait loci associated with yield-related traits in sweet potato (Ipomoea batatas). Bot Stud 50:43–55
Clark LV, Lipka AE, Sacks EJ (2019) polyRAD: genotype calling with uncertainty from sequencing data in polyploids and diploids. G3 (Bethesda) 9(3):663–673. https://doi.org/10.1534/g3.118.200913
Connolly AG, Godwin ID, Cooper M, DeLacy IH (1994) Interpretation of randomly amplified polymorphic DNA marker data for fingerprinting sweetpotato (Ipomoea batatas L.) genotypes. Theor Appl Genet 88:332–336. https://doi.org/10.1007/BF00223641
Cooke DP, Wedge DC, Lunter G (2021) A unified haplotype-based method for accurate and comprehensive variant calling. Nat Biotechnol 39(7):885–892. https://doi.org/10.1038/s41587-021-00861-3
da Silva PG, Gemenet DC, Mollinari M, Olukolu BA, Wood JC, Diaz F, Mosquera V, Gruneberg WJ, Khan A, Buell CR, Yencho GC, Zeng ZB (2020) Multiple QTL mapping in autopolyploids: a random-effect model approach with application in a hexaploid sweetpotato full-sib population. Genetics 215(3):579–595. https://doi.org/10.1534/genetics.120.303080
Elameen A, Larsen A, Fjellheim SS, Sundheim L, Msolla S, Masumba E, Rognli OA (2011) Phenotypic diversity of plant morphological and root descriptor traits within a sweet potato, Ipomoea batatas (L.) Lam., germplasm collection from Tanzania. Genet Resour Crop Evol 58:397–407. https://doi.org/10.1007/s10722-010-9585-1
Elshire RJ, Glaubitz JC, Sun Q, Poland JA, Kawamoto K, Buckler ES, Mitchell SE (2011) A robust, simple genotyping-by-sequencing (GBS) approach for high diversity species. PLoS ONE 6(5):e19379. https://doi.org/10.1371/journal.pone.0019379
Fajardo S, La Bonte DR, Jarret RL (2002) Identifying and selecting for genetic diversity in Papua New Guinea sweetpotato (L.) Lam. germplasm collected as botanical seed. Genet Resour Crop Evol 49(5):463–470. https://doi.org/10.1023/A:1020955911675
Gemenet DC, Lindqvist-Kreuze H, De Boeck B, Pereira GD, Mollinari M, Zeng ZB, Yencho GC, Campos H (2020b) Sequencing depth and genotype quality: accuracy and breeding operation considerations for genomic selection applications in autopolyploid crops. Theor Appl Genet 133(12):3345–3363. https://doi.org/10.1007/s00122-020-03673-2
Gemenet DC, Kitavi MN, David M, Ndege D, Ssali RT, Swanckaert J (2020a) Development of diagnostic SNP markers for quality assurance and control in sweetpotato [Ipomoea batatas (L.) Lam.] breeding programs. Plos One 15(5):e0233828. https://doi.org/10.1371/journal.pone.0232173
Gemenet DC, Pereira GD, De Boeck B, Wood JC, Mollinari M, Olukolu BA, Diaz F, Mosquera V, Ssali RT, David M, Kitavi MN, Burgos G, Zum Felde T, Ghislain M, Carey E, Swanckaert J, Coin LJM, Fei Z, Hamilton JP, Yada B, Yencho GC, Zeng ZB, Mwanga ROM, Khan A, Gruneberg WJ, Buell CR (2020c) Quantitative trait loci and differential gene expression analyses reveal the genetic basis for negatively associated β-carotene and starch content in hexaploid sweetpotato [Ipomoea batatas (L.) Lam.]. Theoret Appl Genet 133(1):23–36. https://doi.org/10.1007/s00122-019-03437-7
Gerard D, Ferrao LFV, Garcia AAF, Stephens M (2018) Genotyping polyploids from messy sequencing data. Genetics 210(3):789–807. https://doi.org/10.1534/genetics.118.301468
Gichuki ST, Berenyi M, Zhang DP, Hermann M, Schmidt J, Glössl J, Burg K (2003) Genetic diversity in sweetpotato [Ipomoea batatas (L.) Lam.] in relationship to geographic sources as assessed with RAPD markers. Genet Resour Crop Evol 50(4):429–437. https://doi.org/10.1023/A:1023998522845
Grodzicker T, Williams J, Sharp P (1974) Physical mapping of temperature-sensitive mutations of adenovirus. Cold Spring Harbor Symp Quant Biol 34:439–446. https://doi.org/10.1101/sqb.1974.039.01.056
Haque E, Yamamoto E, Shirasawa K, Tabuchi H, Yoon UH, Isobe S, Tanaka M (2020b) Genetic analyses of anthocyanin content using polyploid GWAS followed by QTL detection in the sweetpotato (Ipomoea batatas L.) storage root. Plant Root 14:11–21. https://doi.org/10.3117/plantroot.14.11
Haque E, Tabuchi H, Monden Y, Suematsu K, Shirasawa K, Isobe S, Tanaka M (2020a) QTL analysis and GWAS of agronomic traits in sweetpotato (Ipomoea batatas L.) using genome wide SNPs. Breed Sci 70(3):283–291. https://doi.org/10.1270/jsbbs.19099
Huang JC, Sun M (2000) Genetic diversity and relationships of sweetpotato and its wild relatives in Ipomoea series Batatas (Convolvulaceae) as revealed by inter-simple sequence repeat (ISSR) and restriction analysis of chloroplast DNA. Theor Appl Genet 100:1050–1060. https://doi.org/10.1007/s001220051386
Huang J, Corke H, Sun M (2002) Highly polymorphic AFLP markers as a complementary tool to ITS sequences in assessing genetic diversity and phylogenetic relationships of sweetpotato (Ipomoea batatas (L.) Lam.) and its wild relatives. Genet Resour Crop Evol 49:541–550. https://doi.org/10.1023/A:1021290927362
Iwanaga M (1988) Use of wild germplasm for sweet potato breeding. In: Gregory P (ed) Exploration, maintenance, and utilization of sweet potato genetic resources. Report of the 1st sweet potato planning conference 1987, International Potato Center (CIP), Lima, Peru
Jarret RL, Gawe N, Whittemore A (1992) Phylogenetic Relationships of the Sweetpotato [Ipomoea batatas (L.) Lam.]. J Am Soc Hort Sci 117(4):633–637. https://doi.org/10.21273/JASHS.117.4.633
John H, Birnstiel M, Jones K (1969) RNA-DNA hybrids at the cytological level. Nature 223:582–587. https://doi.org/10.1038/223582a0
Jones A (1967) Should Nishiyama's K123 (Ipomoea trifida) be designated I. batatas? Econ Bot 21:163–166. http://www.jstor.org/stable/4252862
Kim JH, Chung IK, Kim KM (2017) Construction of a genetic map using EST-SSR markers and QTL analysis of major agronomic characters in hexaploid sweet potato (Ipomoea batatas (L.) Lam). Plos One 12: e0185073–9. https://doi.org/10.1371/journal.pone.0185073
Kriegner A, Cervantes JC, Burg K, Mwanga ROM, Zhang D (2003) A genetic linkage map of sweetpotato [Ipomoea batatas (L.) Lam.] based on AFLP markers. Mol Breed 11:169–185. https://doi.org/10.1023/A:1022870917230
Lee KJ, Lee GA, Lee JR, Sebastin R, Shin MJ, Cho GT, Hyun D (2019) Genetic diversity of sweet potato (L. Lam) germplasms collected worldwide using chloroplast SSR markers. Agron-Basel 9(11). https://doi.org/10.3390/agronomy9110752
Lewontin RC, Hubby JL (1966) A molecular approach to the study of genic heterozygosity in natural populations. II. Amount of variation and degree of heterozygosity in natural populations of Drosophila pseudoobscura. Genetics 54:595–609. https://doi.org/10.1093/genetics/54.2.595
Li AX, Liu Q, Wang Q, Zhang L, Zhai H, Liu S (2010) Construction of molecular linkage maps using srap markers in sweetpotato. Acta Agron Sin 36:1286–1295. https://doi.org/10.1016/S1875-2780(09)60065-1
Ma ZM, Gao WC, Liu LF, Liu MH, Zhao N, Han MK, Wang Z, Jiao WJ, Gao ZY, Hu YY, Liu QC (2020) Identification of QTL for resistance to root rot in sweetpotato ((L.) Lam) with SSR linkage maps. Bmc Genom 21(1). https://doi.org/10.1186/s12864-020-06775-9
Maquia I, Muocha I, Naico A, Martins N, Gouveia M, Andrade I, Goulao LF, Ribeiro AI (2013) Molecular, morphological and agronomic characterization of the sweetpotato (Ipomoea batatas L.) germplasm collection from Mozambique: genotypeselection for drought prone regions. South Afr J Bot 88:142–151. https://doi.org/10.1016/j.sajb.2013.07.008
McDonald JA, Austin DF (1990) Changes and additions in Ipomoea section Batatas (Convolvulaceae). Brittonia 42:116–120. https://doi.org/10.2307/2807625
Meng YS, Zheng CX, Li H, Li AX, Zhai H, Wang QM, He SZ, Zhao N, Zhang H, Gao SP, Liu QC (2021) Development of a high-density SSR genetic linkage map in sweet potato. Crop J 9(6):1367–1374. https://doi.org/10.1016/j.cj.2021.01.003
Meng YS, Zhao N, Li H, Zhai H, He SZ, Liu QC (2018) SSR fingerprinting of 203 sweetpotato (Ipomoea batatas (L.) Lam.) varieties. J Integr Agric 17(1):86–93. https://doi.org/10.1016/S2095-3119(17)61687-3
Moeinzadeh MH, Yang J, Muzychenko E, Gallone G, Heller D, Reinert K, Haas S, Vingron M (2020) Ranbow: a fast and accurate method for polyploid haplotype reconstruction. PLoS Comput Biol 16(5):e1007843. https://doi.org/10.1371/journal.pcbi.1007843
Mollinari M, Olukolu BA, Pereira GDS., Khan A, Gemenet D, Yencho GC, Zeng ZB (2020) Unraveling the hexaploid sweetpotato inheritance using ultra-dense multilocus mapping. G3 (Bethesda) 10(1):281–292. https://doi.org/10.1534/g3.119.400620
Munoz-Rodriguez P, Wells T, Wood JRI, Carruthers T, Anglin NL, Jarret RL, Scotland RW (2022) Discovery and characterization of sweetpotato’s closest tetraploid relative. New Phytol 234(4):1185–1194. https://doi.org/10.1111/nph.17991
Naidoo SIM, Laurie SM, Amelework AB, Shimelis H, Laing M (2022) Selection of sweetpotato parental genotypes using simple sequence repeat markers. Plants-Basel 11(14). https://doi.org/10.3390/plants11141802
Obata N, Tabuchi HT, Kurihara MK, Yamamoto EY, Shirasawa KS, Monden YM (2022) Mapping of nematode resistance in hexaploid sweetpotato using a next-generation sequencing-based association study. Front Plant Sci 13. https://doi.org/10.3389/fpls.2022.858747
Okada Y, Monden Y, Nokihara K, Shirasawa K, Isobe S, Tahara M (2019) Genome-wide association studies (GWAS) for yield and weevil resistance in sweet potato (Ipomoea batatas (L.) Lam). Plant Cell Rep 38(11):1383–1392. https://doi.org/10.1007/s00299-019-02445-7
Oloka BM, da Silva PG, Amankwaah VA, Mollinari M, Pecota KV, Yada B, Olukolu BA, Zeng ZB, Yencho GC (2021) Discovery of a major QTL for root-knot nematode (Meloidogyne incognita) resistance in cultivated sweetpotato (Ipomoea batatas). Theor Appl Genet 134(7):1945–1955. https://doi.org/10.1007/s00122-021-03797-z
Orjeda G, Freyre R, Iwanaga M (1990) Production of 2n pollen in diploid Ipomoea trifida, a putative wild ancestor of sweet potato. J Hered 81:462–467. https://doi.org/10.1093/oxfordjournals.jhered.a111026
Pardue ML, Gall JG (1969) Molecular hybridization of radioactive RNA to the DNA of cytological preparations. Proc Natl Acad Sci U S A 64:600–604. https://doi.org/10.1073/pnas.64.2.600
Peterson BK, Weber JN, Kay EH, Fisher HS, Hoekstra HE (2012) Double digest RADseq: an inexpensive method for de novo SNP discovery and genotyping in model and non-model species. PLoS ONE 7(5):e37135. https://doi.org/10.1371/journal.pone.0037135
Pinkel D, Segraves R, Sudar D, Clark S, Poole I, Kowbel D, Collins C, Kuo WL, Chen C, Zhai Y, Dairkee SH, Ljung BM, Gray JW, Albertson DG (1998) High resolution analysis of DNA copy number variation using comparative genomic hybridization to microarrays. Nat Genet 20:207–211
Reyes LM, Collins WW (1992) Genetic control of seven enzyme systems in Ipomoea species. J Am Soc Hort Sci 117(6):1000–1005. https://doi.org/10.21273/JASHS.117.6.1000
Ronaghi M, Karamohamed S, Pettersson B, Uhlén M, Nyrén P (1996) Real-time DNA sequencing using detection of pyrophosphate release. Anal Biochem 242(1):84–89. https://doi.org/10.1006/abio.1996.0432
Roullier C, Duputié A, Wennekes P, Benoit L, Bringas VMF., Rossel G, Tay D, McKey D, Lebot V (2013) Disentangling the Origins of Cultivated Sweet Potato ((L.) Lam.). Plos One 8(5). https://doi.org/10.1371/journal.pone.0062707
Sagredo B, Hinrichsen P, López H, Cubillos A, Muñoz C (1998) Genetic variation of sweet potatoes (Ipomoea batatas L) cultivated in Chile determined by RAPDs. Euphytica 101(2):193–198. https://doi.org/10.1023/A:1018301009296
Saiki RK, Scharf S, Faloona F, Mullis KB, Horn GT, Erlich HA, Arnheim N (1985) Enzymatic amplification of beta-globin genomic sequences and restriction site analysis for diagnosis of sickle cell anemia. Science 230(4732):1350–1354. https://doi.org/10.1126/science.2999980
Samiyarsih S, Fitrianto N, Azizah E, Herawati W, Rochmatino (2020) Anatomical profile and genetic variability of sweet potato (Ipomoea batatas) cultivars in Banyumas, Central Java, based on RAPD markers. Biodiversitas 21(4):1755–1766. https://doi.org/10.13057/biodiv/d210460
Schlotterer C (2004) The evolution of molecular markers–just a matter of fashion? Nat Rev Genet 5(1):63–69. https://doi.org/10.1038/nrg1249
Schmitz Carley CA, Coombs JJ, Douches DS, Bethke PC, Palta JP, Novy RG, Endelman JB (2017) Automated tetraploid genotype calling by hierarchical clustering. Theor Appl Genet 130(4):717–726. https://doi.org/10.1007/s00122-016-2845-5
Serang O, Mollinari M, Garcia AA (2012) Efficient exact maximum a posteriori computation for bayesian SNP genotyping in polyploids. PLoS ONE 7(2):e30906. https://doi.org/10.1371/journal.pone.0030906
Shirasawa K, Tanaka M, Takahata Y, Ma DF, Cao QH, Liu QC, Zhai H, Kwak SS, Jeong JC, Yoon UH, Lee HU, Hirakawa H, Isobe S (2017) A high-density SNP genetic map consisting of a complete set of homologous groups in autohexaploid sweetpotato. Sci Rep 7:44207. https://doi.org/10.1038/srep44207
Slonecki TJ, Rutter WB, Olukolu BA, Yencho GC, Jackson DM, Wadl PA (2023) Genetic diversity, population structure, and selection of breeder germplasm subsets from the USDA sweetpotato collection. Front Plant Sci 13:1022555. https://doi.org/10.3389/fpls.2022.1022555
Soegiant A, Ardiarini NR, Sugiharto AN (2011) Genetic Diversity of Sweet Potato (Ipomoea batatas L.) in East Java, Indonesia. J Agric Food Tech 1(9):179–183
Southern EM (1975) Detection of specific sequences among DNA fragments separated by gel electrophoresis. J Mol Biol 98(3):503–517. https://doi.org/10.1016/S0022-2836(75)80083-0
Sun X, Liu D, Zhang X, Li W, Liu H, Hong W, Jiang C, Guan N, Ma C, Zeng H, Xu C, Song J, Huang L, Wang C, Shi J, Wang R, Zheng X, Lu C, Wang X, Zheng H (2013) SLAF-seq: an efficient method of large-scale de novo SNP discovery and genotyping using high-throughput sequencing. PLoS ONE 8(3):e58700. https://doi.org/10.1371/journal.pone.0058700
Sun C, Dong Z, Zhao L, Ren Y, Zhang N, Chen F (2020) The Wheat 660K SNP array demonstrates great potential for marker-assisted selection in polyploid wheat. Plant Biotechnol J 18(6):1354–1360. https://doi.org/10.1111/pbi.13361
Taniguti CH, Gesteira GS, Lau J, Pereira GS, Zeng ZB, Byrne D, Riera-Lizarazu O, Mollinari M (2022) VIEWpoly: a visualization tool to integrate and explore results of polyploid genetic analysis. J Open Sour Softw 7(74):4242. https://doi.org/10.21105/joss.04242
Thompson PG, Hong LL, Ukoskit K, Zhu Z (1997) Genetic linkage of randomly amplified polymorphic DNA(RAPD) markers in sweetpotato. J Am Soc Hort Sci 122:79–82. https://doi.org/10.21273/JASHS.122.1.79
Tiselius AA (1937) New apparatus for electrophoretic analysis of colloidal mixtures. Trans Faraday Soc 33:524–531. https://doi.org/10.1039/tf9373300524
Veasey EA, Borges A, Rosa MS, Queiroz-Silva JR, Bressan EDA., Peroni N (2008) Genetic diversity in Brazilian sweet potato (Ipomoea batatas (L.) Lam., Solanales, Convolvulaceae) landraces assessed with microsatellite markers. Genet Molecul Biol 31(3):725–733. https://doi.org/10.1590/S1415-47572008000400020
Vos P, Hogers R, Bleeker M, Reijans M, van de Lee T, Hornes M, Frijters A, Pot J, Peleman J, Kuiper M et al (1995) AFLP: a new technique for DNA fingerprinting. Nucleic Acids Res 23(21):4407–4414. https://doi.org/10.1093/nar/23.21.4407
Wadl PA, Olukolu BA, Branham SE, Jarret RL, Yencho GC, Jackson DM (2018) Genetic diversity and population structure of the USDA sweetpotato germplasm collections using GBSpoly. Front Plant Sci 9:1166. https://doi.org/10.3389/fpls.2018.01166
Wang ZY, Fang BP, Chen JY, Zhang XJ, Luo ZX, Huang LF, Chen XL, Li YJ (2010) Assembly and characterization of root transcriptome using Illumina paired-end sequencing and development of cSSR markers in sweetpotato. BMC Genomics 11:1–14. https://doi.org/10.1186/1471-2164-11-726
Wang ZY, Li J, Luo ZX, Huang LF, Chen XL, Fang BP, Li YJ, Chen JY, Zhang XJ (2011) Characterization and development of EST-derived SSR markers in cultivated sweetpotato. BMC Plant Biol 11:1–9. https://doi.org/10.1186/1471-2229-11-139
Williams JG, Kubelik AR, Livak KJ, Rafalski JA, Tingey SV (1990) DNA polymorphisms amplified by arbitrary primers are useful as genetic markers. Nucleic Acids Res 18(22):6531–6535. https://doi.org/10.1093/nar/18.22.6531
Wu S, Lau KH, Cao Q, Hamilton JP, Sun H, Zhou C, Eserman L, Gemenet DC, Olukolu BA, Wang H, Crisovan E, Godden GT, Jiao C, Wang X, Kitavi M, Manrique-Carpintero N, Vaillancourt B, Wiegert-Rininger K, Yang X, Bao K, Schaff J, Kreuze J, Gruneberg W, Khan A, Ghislain M, Ma D, Jiang J, Mwanga ROM, Leebens-Mack J, Coin LJM, Yencho GC, Buell CR, Fei Z (2018) Genome sequences of two diploid wild relatives of cultivated sweetpotato reveal targets for genetic improvement. Nat Commun 9(1):4580. https://doi.org/10.1038/s41467-018-06983-8
Xiao S, Dai X, Zhao L, Zhou Z, Zhao L, Xu P, Gao B, Zhang A, Zhao D, Yuan R, Wang Y, Wang J, Li Q, Cao Q (2023) Resequencing of sweetpotato germplasm resources reveals key loci associated with multiple agronomic traits. Hortic Res 10(1):uhac234. https://doi.org/10.1093/hr/uhac234
Yada B, Tukamuhabwa P, Wajala B, Kim DJ, Skiton RA, Alajo BA, Mwanga ROM (2010) Characterization of Uganda sweet potato germplasm using fluorescence repeat markers. Horticult Sci 45(2):225–230. https://doi.org/10.21273/HORTSCI.45.2.225
Yamakawa H, Haque E, Tanaka M, Takagi H, Asano K, Shimosaka E, Akai K, Okamoto S, Katayama K, Tamiya S (2021) Polyploid QTL-seq towards rapid development of tightly linked DNA markers for potato and sweetpotato breeding through whole-genome resequencing. Plant Biotechnol J 19(10):2040–2051. https://doi.org/10.1111/pbi.13633
Yamamoto E, Shirasawa K, Kimura T, Monden Y, Tanaka M, Isobe S (2020) Genetic mapping in autohexaploid sweet potato with low-coverage NGS-based genotyping data. G3 10:2661–2670
Yan M, Li M, Wang Y, Wang X, Moeinzadeh MH, Quispe-Huamanquispe DG, Fan W, Fang Y, Wang Y, Nie H, Wang Z, Tanaka A, Heider B, Kreuze JF, Gheysen G, Wang H, Vingron M, Bock R, Yang J (2024) Haplotype-based phylogenetic analysis and population genomics uncover the origin and domestication of sweetpotato. Mol Plant 17(2):277–296. https://doi.org/10.1016/j.molp.2023.12.019
Yan H, Ma M, Ahmad MQ, Arisha MH, Tang W, Li C, Zhang Y, Kou M, Wang X, Gao R, Song W, Li Z, Li Q (2022) High-density single nucleotide polymorphisms genetic map construction and quantitative trait locus mapping of color-related traits of purple sweet potato [Ipomoea batatas (L.) Lam.]. Front Plant Sci 12:797041. https://doi.org/10.3389/fpls.2021.797041
Yu XX, Zhao N, Li H, Jie Q, Zhai H, He SZ, Li Q, Liu QC (2014) Identification of QTLs for starch content in sweetpotato (Ipomoea batatas (L.) Lam.). J Integr Agric 13(2):310–315. https://doi.org/10.1016/S2095-3119(13)60357-3
Zhang D, Ghislain M, Huamán Z, Golmirzaie A, Hijmans R (1998) RAPD variation in sweetpotato (Ipomoea batatas (L.) Lam) cultivars from South America and Papua New Guinea. Genet Resour Crop Evol 45:271–277. https://doi.org/10.1023/A:1008642707580
Zhang K, Wu Z, Tang D, Lv C, Luo K, Zhao Y, Liu X, Huang Y, Wang J (2016) Development and identification of SSR markers associated with starch properties and β-carotene content in the storage root of sweet potato (Ipomoea batatas L.). Front Plant Sci 7:1–21. https://doi.org/10.3389/fpls.2016.00223
Zhang DP, Cervantes J, Huamán Z, Carey E, Ghislain M (2000) Assessing genetic diversity of sweet potato (Ipomoea batatas (L.) Lam.) cultivars from tropical America using AFLP. Genet Resour Crop Evol 47(6):659–665. https://doi.org/10.1023/A:1026520507223
Zhang DP, Rossel G, Kriegner A, Hijmans R (2004) AFLP assessment of diversity in sweetpotato from Latin America and the Pacific region: Its implications on the dispersal of the crop. Genet Resour Crop Evol 51(2):115–120. https://doi.org/10.1023/B:GRES.0000020853.04508.a0
Zhao N, Yu XX, Jie Q, Li H, Li H, Hu J, Zhai H, He SZ, Liu QC (2013a) A genetic linkage map based on AFLP and SSR markers and mapping of QTL for dry-matter content in sweetpotato. Mol Breeding 32(4):807–820. https://doi.org/10.1007/s11032-013-9908-y
Zhao N, Zhai H, Yu XX, Liu ZS, He SZ, Li Q, Ma DF, Liu QC (2013b) Development of SRAP markers linked to a gene for stem nematode resistance in sweetpotato, (L.) Lam. J Integr Agric 12(3):414–419. https://doi.org/10.1016/S2095-3119(13)60241-5
Zheng CX, Jiang ZC, Meng YS, Yu J, Yang XS, Zhang H, Zhao N, He SZ, Gao SP, Zhai H, Liu QC (2023) Construction of a high-density SSR genetic linkage map and identification of QTL for storage-root yield and dry-matter content in sweetpotato. Crop J 11(3):963–967. https://doi.org/10.1016/j.cj.2022.11.003
Zietkiewicz E, Rafalski A, Labuda D (1994) Genome fingerprinting by simple sequence repeat (SSR)-anchored polymerase chain reaction amplification. Genomics 20(2):176–183. https://doi.org/10.1006/geno.1994.1151
Zych K, Gort G, Maliepaard CA, Jansen RC, Voorrips RE (2019) FitTetra 2.0—improved genotype calling for tetraploids with multiple populations and parental data support. BMC Bioinformatics 20(1):148. https://doi.org/10.1186/s12859-019-2703-y
Acknowledgements
Funding for this research was provided by the Bill and Melinda Gates Foundation (OPP1052983 and OPP1213329) through the Genomic Tools for Sweetpotato Improvement Project (GT4SP) and Sweetpotato Genetic Advances and Innovative Seed Systems (SweetGAINS) projects, as well as a Specialty Crop Research Initiative Grant No. 2021-51181-35865 from the USDA National Institute of Food and Agriculture.
Conflicts of Interest
The authors declare no conflict of interest.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Open Access This chapter is licensed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license and indicate if changes were made.
The images or other third party material in this chapter are included in the chapter's Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the chapter's Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder.
Copyright information
© 2025 The Author(s)
About this chapter
Cite this chapter
Olukolu, B.A., Craig Yencho, G. (2025). Evolution of Molecular Marker Use in Cultivated Sweetpotato. In: Yencho, G.C., Olukolu, B.A., Isobe, S. (eds) The Sweetpotato Genome. Compendium of Plant Genomes. Springer, Cham. https://doi.org/10.1007/978-3-031-65003-1_4
Download citation
DOI: https://doi.org/10.1007/978-3-031-65003-1_4
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-65002-4
Online ISBN: 978-3-031-65003-1
eBook Packages: Biomedical and Life SciencesBiomedical and Life Sciences (R0)