Abstract
The ever-growing demand for inexpensive, rapid, and accurate exploration of genomes calls for refinement of existing sequencing techniques. The development of next-generation sequencing (NGS) was a revolutionary milestone in genome analysis. While modified nucleotides already were inherent tools in sequencing and imaging, further modification of nucleotides enabled the expansion into even more diverse applications. Herein we describe the design and synthesis of oligonucleotide-tethered 2′,3′-dideoxynucleotide (ddONNTP) terminators bearing universal priming sites attached to the nucleobase, as well as their enzymatic incorporation and performance in read-through assays. In the context of NGS library preparation, the incorporation of ddONNTP fulfills two requirements at once: the fragmentation step is integrated into the workflow and the obtained fragments are readily labeled by platform-specific adapters. DNA polymerases can incorporate ddONNTP nucleotides, as shown by primer extension assays. More importantly, reading through the unnatural linkage during DNA synthesis was demonstrated, with 25-30% efficiency in single-cycle extension.
Similar content being viewed by others
Introduction
We are witnessing rapid development and fast diffusion of next-generation sequencing (NGS) technologies for almost two decades. The establishment of NGS techniques revolutionized research abilities in modern biology and biomedical sciences. All current sequencing platforms require nucleic acid pre-processing to generate library suitable for sequencing. Generally this includes DNA or RNA fragmentation to a platform-specific size range, followed by end polishing and specific adapter ligation to the 3′ and 5′ termini1,2. Enzymatic ligation of adapters is notorious for low efficiency3 leading to decreased complexity of original library and impoverishment of sequencing results. Consequently, novel library preparation techniques need to be developed to improve the conversion efficiency and simplify the workflow. Some alternative chemical ligation techniques based on the formation of ribose-to-ribose connection were reported (Fig. 1a)4,5. Applying 3′-azido-2′,3′-dideoxynucleotides as terminators for alternative fragment generation the azido group-bearing DNA fragments have been obtained. Followed by the addition of 5′-alkynyl oligonucleotides via Cu(I)-catalyzed Huisgen’s [3 + 2] azide-alkyne cycloaddition, known as click chemistry, chemical ligation was used to form triazole-linkage derivatives and introduce sequencing adapters into DNA fragments (Fig. 1a)6,7. However, this strategy exhibited several drawbacks: (1) after fragmentation, additional purification prior to chemical ligation was necessary to remove the residual free 3′-azido-ddNTP; (2) for efficient click reaction a complementary template to 3′-azido oligonucleotide was necessary, therefore it was impossible to implement for unknown sequences; (3) non-templated chemical ligation required high DNA amount which is far from practical for many NGS applications; (4) to avoid Cu-induced inhibition of polymerases, additional purification had to be performed after chemical ligation; (5) more than half of the obtained reads were shorter than expected, caused by Cu-mediated DNA degradation;7 and (6) the read-through efficiency was low, as single primer extension cycle generated undetectable amount of product (less than 4%) (Fig. 1a, b). Previous studies revealed the importance of DNA backbone structure for oligonucleotide recognition by polymerase, due to hydrogen bonding8,9,10. Triazole is considered a phosphate-mimicking group given its similar size and the ability to form hydrogen bonds8. These studies have shown the possibilities to design and successfully use nucleic acids with artificial backbones in molecular biology applications.
Base-modified nucleotides are widely used as carriers of tags and labels for diagnostics, chemical biology and bioprocess monitoring11,12,13. Apart from relatively small molecular labels, such as redox14,15,16,17, spin18,19 labels and reactive groups (for post-enzymatic modification)20,21,22,23, synthesis of biomolecule-modified (oligonucleotides24,25,26 and proteins27) nucleotides have been also reported. Crystallographic structure analysis reveals that certain modifications partially enter the polymerase active site11, thus suitable nucleobase and label-linker design plays a pivotal role. Usually, the main challenge is to avoid steric hindrance caused by the label during the enzymatic nucleotide incorporation. However, the linker is usually not adapted for bridging function in enzymatic read-through. We hypothesized that the ability to read through the linker would enable the use of oligonucleotide modification as a priming site for subsequent synthesis of a complementary strand. This, in turn, would make templates containing oligo-tethered terminators biocompatible, e.g. serve as a template in PCR. In this study, we present oligonucleotide-tethered 2′,3′-dideoxynucleotide (ddONNTP) analogues for labeling of any DNA strand with a universal priming site (Fig. 1c, d).
Results and discussion
Chemical synthesis of ddONNTPs
Several reported articles describe chemical ligation approach for the oligonucleotide conjugation via ribose-to-ribose connection7,8,10. In the attempts to design chemical linkage which orthogonally resembles natural DNA several triazole-based phosphodiester mimic groups were applied (Fig. 2a). The use of triazole as a phosphodiester analogue has been well established in antisense oligonucleotides technology, since it stabilizes nucleic acid and increases the interaction with the complementary strand28,29,30. Even though the majority of these linkages were tolerated in the read-through assays, in many cases they caused insertion/deletion events, as well as poor read-through efficiency6,7,10. Besides indicating the importance of the proximity between 5′ and 3′ sugar rings, authors also point out the lack of linker’s flexibility. During enzymatic synthesis flexibility is critical, as the polymerase has to twist the template backbone by ~90°31,32 and the rigidness of the linker may cause deletions10. We propose a different approach to bridge oligonucleotides - via nucleobase-to-ribose connection (Fig. 2b) that might provide more flexibility during read-through and allows a seamless synthesis of the complementary strand.
Several important aspects of linker design were considered: whilst the linker had to be long enough to avoid label-induced steric hindrance during enzymatic incorporation, it also had to be length-compatible for polymerase to read through it. Considering all that, we designed functionalized azido-ddNTPs (ddN3NTP) with desired linker properties, as substrates for Cu (I)-catalyzed alkyne-azide cycloaddition (CuAAC) with alkyne-oligonucleotides. An efficient and straightforward synthesis strategy of the ddN3NTPs has been developed (Fig. 2c, for more details see Supplementary Fig. S1). Our scheme consists of triphosphorylation, aqueous phase Sonogashira coupling33,34, and acylation by the azidoacetic acid NHS ester cascade. This chemistry allowed to shorten the synthesis of modified ddNTPs to three steps, whereas the conventional path is at least five steps-long12,34,35,36. Propargylamine was chosen for linker formation for several reasons: it can be easily applied for C-C bond formation via the Pd-catalyzed coupling, more importantly, the reactive amine group may serve for further modification, while the triple bond next to the heterocyclic base unit is not reactive in the following click chemistry37. Finally, the desired ddONNTPs were synthesized via the CuAAC reaction between ddN3NTP and 5′-hexynyl-oligonucleotide (Fig. 2d, for exact structure of 5′-hexynyl-oligonucleotide see Supplementary Fig. S2). This selective and potent method has quite a few advantages: the precursors of the azide and alkyne moieties can be either synthesized or purchased; CuAAC is compatible with aqueous media and results in the formation of highly stable triazole-linkage entity38. Notably, the most critical part of the ddONNTP preparation was the purification steps. Due to highly sensitive enzymatic reactions, even the trace amount of additives used in chemical synthesis, such as Cu, would lead to inhibition39,40. Moreover, we observed that ddN3NTPs that bear sterically smaller group are significantly better substrates for DNA polymerases (Klenow exo-, Thermo sequenase, Phusion exo-, TdT, Superscript IV) than the ddONNTPs with their bulky ON-substituents. During enzymatic termination the ddN3NTPs would be preferred over the ddONNTPs reducing the efficiency of the labeling by oligonucleotide. Therefore, efficient purification was required to avoid contamination of polymerase reaction by the precursors (ddN3NTPs) used in click chemistry. Reverse-phase chromatography was applied and HPLC methods were developed to purify purine and pyrimidine derivatives (for more synthesis and purification details see Supplementary Methods).
Enzymatic incorporation of ddONNTPs
After ddN3NTPs and ddONNTPs were synthesized and purified primer extension experiments were run (Fig. 2e) using polymerases of families A, B, X and RT (Fig. 3). In general, DNA polymerases are classified into seven families (A, B, C, D, X, Y and RT) according to their crystallographic structure and sequence homology41, which could lead to their different performance with our substrates. Oligonucleotide duplex which served as a template contained 10 nt overhang. Hence, during primer extension reaction (PEX) in the presence of dNTPs up to 10 nucleotides could be added, while the incorporation of ddONNTP would allow to elongate the primer by 23 nt (see Methods section). All tested polymerases were 3′-5′ exonuclease-deficient and accepted ddN3NTPs and ddONNTPs as substrates. The efficiency of ddONNTP incorporation varied between polymerases. In addition, a slight preference for terminator’s nucleobase was observed. Klenow exo- exhibited better incorporation efficiency with oligo-modified 2′,3′-dideoxypurines rather than pyrimidines, and similar trend was observed with ddN3NTP derivatives. Thermo Sequenase, another DNA polymerase from family A, was able to incorporate all types of ddN3NTPs and ddONNTPs with superior efficiency compared to other tested enzymes, since it does not discriminate between dNTPs and ddNTPs42.
Phusion™ exo- DNA polymerase from family B catalyzed the addition of ddONNTPs with lower efficiency compared to Thermo Sequenase, especially in the case of ddONUTP. Terminal deoxynucleotidyl transferase (TdT) performed with the lowest efficiency, although TdT is known to tolerate bulky base modifications43. This result could be attributed to suboptimal reaction conditions, such as the ddONNTP-to-template ratio. The ddONNTP addition by SuperScript™ IV reverse transcriptase (RT) had similar efficiency as the Thermo Sequenase (Fig. 3). In practice an ddONNTP-compatible RT enzyme could enable cDNA labeling, while a thermostable DNA polymerase could be used in nucleic acid analysis workflows with ssDNA or dsDNA templates.
Semi-targeted sequencing
In most applications ddONNTP-labeled DNA fragments would be subjected to PCR amplification, thus we looked for thermostable DNA polymerase that would synthesize through an unnatural linkage. We’ve tested Phusion exo- DNA polymerase and found that it was able to build complementary DNA strand using oligonucleotide modification as a priming site, with a single-cycle read-through efficiency of 25–30% (Supplementary Fig. S3). The observed polymerase stalling near the linker position in a single-cycle reaction and somewhat better read-through efficiency with a prolonged extension time suggested that the read-through phase might be a rate-limiting step. When 15 linear cycles of the extension were performed allowing the polymerase multiple attempts to bypass the linker, the amount of full-length product increased substantially.
The approach proposed herein offers a significantly simplified NGS library preparation protocol with nucleic acid fragmentation and labeling integrated into a single enzymatic step (Fig. 4a). To demonstrate the applicability of our technology, we applied ddONUTPs and identified incorporating and replicating polymerases for the generation of fragment libraries compatible with Illumina next-generation sequencing. We designed primers specific to two loci within M13mp18 genome and performed a linear primer extension in the presence of Thermo Sequenase enzyme and a mixture of dNTPs and ddONUTP (see Methods section). Upon incorporation, ddONUTP terminated DNA synthesis and simultaneously labeled the nascent DNA strand with a partial Illumina adapter sequence attached to its nucleobase. This step generated fragments of suitable length for short read sequencing flanked by both adapters. The obtained sequencing data indeed indicated capture of two genomic loci with a characteristic insert structure: fixed terminus corresponded to the specific priming site, while randomly distributed terminus illustrated the stochastic nature of ddONUTP incorporation (Fig. 4b). The inspection of sequencing reads has not revealed any systematic errors or artefacts which could be associated with the presence of unnatural backbone within the fragment library. The analysis of base distribution per sequenced position showed clear dominance (>90%) of A base at the first position of reverse reads (Fig. 4c), which corresponds to ddUTP incorporation site. This indicates that the vast majority of sequenced molecules derived from a seamless read-through, i.e. the copying polymerase incorporated a complementary nucleotide immediately downstream of the ddONUTP linker. To compare, 69% of the reads indicated correct read-through in library preparation method based on chemical ligation. The authors reasoned that the presence of an incorrect base at the position downstream of the chemical linkage was a result of substitution events7. As the termini of reverse reads in our sequencing experiment aligned to the reference genome even in the case when the first base is not A, imperfect read-through likely originated from base skipping leading to deletions. Importantly, this rare effect does not impair the analysis of sequencing data and does not require additional trimming of sequencing reads.
We termed the developed library preparation technique semi-targeted sequencing emphasizing that only a single primer is needed to capture a desired locus, while ddONNTPs can label primer extension products irrespective of the sequence context of the template downstream of the priming site. Somewhat similar targeted library preparation approaches involving single primer extension and open-ended amplification were previously reported in the field, including multiplexed primer extension sequencing44 and anchored multiplex PCR45. Both methods proved to be feasible, however lengthy multistep workflows with intermediate purification steps that inevitably lead to the loss of material are not attractive for adoption in settings where sample preparation speed is important. The addition of sequencing adapters to DNA strands of unknown sequence is achieved via separate fragmentation, end repair and enzymatic ligation of synthetic oligonucleotides. ddONNTPs open the doors to obtain equivalent results with a simple and rapid workflow that integrates fragment generation and labeling. In addition, efficient replication through the ddONNTP linker preserves library complexity and implies the compatibility of our approach with the analysis of complex genomes or transcriptomes.
Conclusions
In conclusion, we have designed and synthesized oligonucleotide-modified 2′,3′-dideoxynucleotide terminators, with properties that go beyond commonly labeled nucleotides. Represented efficient ddONNTPs synthesis and purification strategy enables to obtain these complex molecules with necessary qualities for enzymatic processes. We showed that, even though, their structure possesses bulky oligonucleotide label, the linker design empowers both the enzymatic incorporation and read-through. We demonstrated that modified template replication not necessarily requires mimicking of the phosphodiester group of oligonucleotides. Nucleobase-to-ribose conjugation performed with accuracy and satisfying efficiency opened new possibilities for further linker development. This enabled us to integrate the fragmentation and adapter addition into a single enzymatic step, thus substantially simplifying sample preparation workflow for high-throughput sequencing. The biocompatibility of 2-(4-butyl-1H-1,2,3-triazol-1-yl)-N-(prop-2-ynyl)acetamide artificial backbone paves the way for numerous applications, such as whole-genome and whole-transcriptome fragment library preparation for NGS analysis, semi-targeted library preparation with great potential to investigate gene fusion events or characterize microbial communities46, nucleic acid labeling for subsequent detection employing universal priming site, and others that benefit from the addition of a known artificial sequence to DNA or cDNA. Notably, oligonucleotide modification can contain further modifications, such as biotin or phosphate to enable specific capture of ddONNTP-containing molecules or to block 3′ terminus, respectively.
Methods
General
All chemicals and solvents were obtained commercially and used without further purification. Iodinated nucleosides ddIU, ddIC, ddIA, ddIG were obtained from AEchem Scientific. Moisture sensitive reactions were performed under dry argon atmosphere using oven-dried glassware. Reactions, fractions and final purities were monitored by VanquishTM UHPLC system using TSKgelTM ODS- 100V 3 µm column (mobile phases: A – 100 mM TEAAc, B – 40% 100 mM TEAAc + 60% ACN). Modified nucleotides were purified by BiotageTM IsoleraTM One or Biotage™ IsoleraTM Prime liquid chromatography systems with UV detectors using Q Sepharose™ Fast Flow, Biotage SNAP ULTRA C18 columns. Purification of oligonucleotide-tethered 2′,3′-dideoxynucleotides was held with Waters 2555 Quaternary gradient system using YMC-Actus Triart C18 100 × 20 mm S–5 µm column, while product for read-through efficiency measurements was purified by Vanquish UHPLC system using YMC-Triart C18 150 × 4.6 mm S–3 µm column. Conductivity was measured with Mettler-Toledo™ FiveGoTM F3 conductometer (prior to ion exchange chromatography reaction mixtures were diluted to 3.5–4 mS/m2 conductivity). Absorbance of products were measured by Thermo Fisher EvolutionTM 201 UV-Visible spectrophotometer. Oligonucleotide-tethered nucleotides amount was determined using NanoDropTM 2000 spectrophotometer. Monoisotopic masses measurements were performed using triple quadrupole mass spectrometer TSQ EnduraTM with ESI ion source, oligonucleotides molecular masses were determined using hybrid Quadrupole-OrbitrapTM mass spectrometer Q Exactive plusTM (Thermo Fisher Scientific). NMR spectra were recorded using Bruker Avance III 400 MHz (9.0 T). Chemical shifts are given in parts per million relative to tetramethylsilane using residual solvents signals as internal standards. Signal patterns are indicated as br, broad; s, singlet; d, doublet; t, triplet; q, quartet; m, multiplet. The complete assignment of 1H and 13C signals was performed by an analysis of the correlated homonuclear H,H-COSY, and heteronuclear H,C-HSQC and H,C-HMBC spectra.
All enzymes and buffers were manufactured by Thermo Fisher Scientific unless specified otherwise. Oligonucleotides were synthesized by Metabion GmbH requesting HPLC purification.
Chemical synthesis
General procedure for CuAAc click reaction
All reaction components were applied to the reaction mixture as solutions in water unless specified differently. Corresponding ddN3NTP (3 eq., 2–4 mM) solution was added to alxyl-oligonucleotide (5′-hexynyl-AGATCGGAAGAGCACACGTCTG-3′-biotin) (100–210 nmol) solution in sodium phosphate buffer (1 ml, 100 mM, pH 7). In a separate vial CuSO4 (100 mM, 12 eq.) and THPTA (250 mM, 5 eq. to CuSO4) were premixed and added to the reaction mixture, followed by addition of sodium ascorbate (1 M, 50 eq. to CuSO4). Reaction mixture was stirred for 20 min at 42 °C, quenched with 0.5 M EDTA-Na2 solution (0.5 ml, pH 8). The products were purified by semi-preparative reverse phase HPLC using a linear gradient of 100 mM TEAAc/ACN (11–18% ACN for ddONCTP and ddONUTP, 10–20% ACN for ddONATP and ddONGTP). ddONNTPs were desalted using water/ACN (5–100%) gradient. The characteristics of ddONNTP are given in Table 1 (for MS and HPLC chromatograms see Supplementary Figs. S4–13).
ddONNTP incorporation assays
A selection of various DNA polymerases, including representatives from family A, family B, family X, and reverse transcriptase (RT) family were tested for capability of incorporating ddONNTP. The experimental system was based on the extension of 3′-recessed ends of the oligonucleotide duplexes.
Templates for incorporation testing
To prepare oligonucleotide duplexes with different protruding ends, oligonucleotides listed in Table 2 were annealed in 1× annealing buffer (10 mM Tris-HCl pH 8.1 mM EDTA, 50 mM NaCl) to obtain 2 µM final solution. Primers were labeled with a fluorescent dye for subsequent detection of PEX products on a gel.
PEX reaction conditions for incorporation testing
Primer extension reactions were performed in optimal buffers and optimal or near-optimal temperatures for each tested polymerase. Control reactions with native dNTPs and ddN3NTPs were conducted to ensure that polymerase of interest is (i) capable to perform conventional primer extension at given conditions and (ii) is able to incorporate ddNTPs with small base modification. Tested polymerases and specific reaction conditions are listed in Table 3. In all cases, 2 pmol of oligonucleotide duplex (or single-stranded primer for TdT) and 20 pmol of ddONNTP or corresponding native dNTP, or ddN3NTP were used per reaction.
Reaction products were resolved on 15% TBE-Urea PAGE. Prior to loading on a gel, samples were mixed in a 1:1 ratio with 2× DNA loading buffer (98% formamide, 10 mg/mL blue dextran, 10 mM EDTA), heated at 95 °C for 5 min and then immediately cooled on ice. Electrophoresis was carried out in 1× TBE buffer at 400 V for 1 h at 55 °C. Gels were imaged with Typhoon™ FLA 9500 system (GE Healthcare).
Read-through assay
For successful application of ddONNTP for nucleic acid detection and analysis, it is highly desirable to identify a thermostable DNA polymerase able to read through the linker – this would make ddONNTP-containing nucleic acids compatible with PCR amplification.
To prepare ddONUTP-containing template (ON-ddONU) for read-through assay, ddONUTP was PEX-incorporated by SuperScript IV RT into the 3′ end of oligonucleotide 5′-TGCAGACATGGGTAGGCATCCTTGGCGTA-3′ annealed to the RNA oligonucleotide 5′-auacgccaaggaugccuacccaugucugca-3′. RNA strand was then digested by RNase H treatment, DNA was purified and desired single-stranded ON-ddONU was purified by HPLC.
Primer of sequence 5′-CAGACGTGTGCTCTTCC-3′ was 5′-radio-labeled by T4 polynucleotide kinase in the presence of [γ-33P]-ATP. The 5′-labeled primer was desalted using Zeba™ Spin desalting columns (7K MWCO). In all, 50 fmol of purified ON-ddONU annealed with primer (1.1:1 molar ratio respectively) were mixed with 0.2 mM dNTP mix and 20 U Phusion exo- in 1× of the following buffers or master mixes: (i) Phusion™ HF buffer, (ii) Phusion™ GC buffer, (iii) Phire™ reaction buffer, (iv) SuperFi™ buffer, (v) Invitrogen™ Collibri™ Library Amplification Master Mix, (vi) Invitrogen™ Collibri™ Library Amplification Master Mix treated with thermolabile proteinase K (NEB) to remove SuperFi polymerase. Read-through PEX reactions were executed as follows: denaturation at 95 °C for 1 min, annealing at 60 °C for 1 min and extension at 72 °C for either 1 min or 15 min. In addition, 15 cycles of linear amplification were tested with 1 min extension. Reaction products were resolved on 15% TBE-Urea PAGE. Visualization was carried out by phosphorimaging using the Typhoon™ FLA 9500 imaging system (GE Healthcare).
Semi-targeted NGS library preparation and sequencing
For proof-of-principle demonstration, M13mp18 bacteriophage single-stranded DNA (Thermo Fisher Scientific) was used as a sample input. Target-specific primers were designed to contain partial Illumina i5 adapter sequence (underlined) at their 5′ termini:
M13-1: 5′-TACACGACGCTCTTCCGATCTAACGGTACGCCAGAATCTTG-3′
M13-2: 5′-TACACGACGCTCTTCCGATCTAGAGCCACCACCGGAAC-3′
0.125 pmol of each primer were mixed with 200 ng of M13mp18 DNA in a 20 µl reaction mixture containing 2 pmol of ddONUTP, 20 pmol of dNTP mix and 40 U of Thermo Sequenase with thermostable pyrophosphatase in 1× Thermo Sequenase reaction buffer. Primer extension was executed as follows: denaturation at 95 °C for 30 s, followed by 15 cycles of denaturation at 95 °C for 30 s, annealing/extension at 60 °C for 2 min and final extension at 60 °C for 30 min
Half of the primer extension reaction was used directly for indexing PCR. 10 µl of primer extension reaction mixture was combined with 25 µl of Invitrogen Collibri Library Amplification Master Mix (Thermo Fisher Scientific), 20 U of Phusion exo- (1 µl), 5 µl of indexing primers i5 and i7 (50 pmol each) of the sequences given below and 9 µl of nuclease-free water.
i5 primer: 5′-AATGATACGGCGACCACCGAGATCTACACTCTTTCCCTACACGACGCTCTTCCGATCT-3′
i7 primer: 5′-CAAGCAGAAGACGGCATACGAGAT[8nt index]GTGACTGGAGTTCAGACGTGTGCTCTTCCGATCT-3′
Cycling was performed as follows: denaturation at 98 °C for 30 s, followed by 20 cycles of denaturation at 98 °C for 10 s, annealing at 60 °C for 30 s, extension at 72 °C for 1 min, and final extension at 72 °C for 1 min. PCR products were then purified using Dynabeads™ Cleanup Beads (Thermo Fisher Scientific). DNA binding to the beads was performed by mixing 45 µl of bead suspension with 50 µl of sample and subsequent incubation at room temperature for 5 min. Sample was then placed on magnet, supernatant was removed and beads were resuspended in 50 µl of elution buffer containing 10 mM Tris-HCl (pH 8). 50 µl of fresh beads were added again to the sample and binding was repeated. After room temperature incubation, sample was placed on magnet, supernatant was removed, and beads were washed twice with 85% ethanol. To elute libraries, beads were resuspended in 22 µl of elution buffer and incubated for 1 min at room temperature. Fragment size distribution was then assessed by Agilent™ 2100 Bioanalyzer™ system (Agilent Technologies) with High Sensitivity DNA Kit. Quantification of sequenceable molecules was performed with Invitrogen Collibri Library Quantification Kit (Thermo Fisher Scientific).
Libraries were sequenced on the Illumina™ MiSeq™ instrument using the MiSeq Reagent Kit v2 (300-cycle) at 2 × 150 bp paired-end mode.
Reporting summary
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Data availability
All data that support the findings of this study have been provided in the manuscript and Supplementary Information file or could be obtained from the corresponding author on reasonable request. Supplementary Information provides general experimental procedure for synthesis of ddINTPs, ddPANTPs, ddN3NTPs, read-through product – ON-ddONU (for MS and HPLC chromatograms see Supplementary Figs. S14–15), NMR (Supplementary Figs. S16–98) and HRMS data for selected compounds.
References
Metzker, M. Sequencing technologies – the next generation. Nat. Rev. Genet. 11, 31–46 (2010).
Shendure, J. & Ji, H. Next-generation DNA sequencing. Nat. Biotechnol. 26, 1135–1145 (2008).
Aigrain, L., Gu, Y. & Quail, M. A. Quantitation of next generation sequencing library preparation protocol efficiencies using droplet digital PCR assays - a systematic comparison of DNA library preparation kits for Illumina sequencing. BMC Genomics 17, 1–11 (2016).
El-Sagheer, A. H. & Brown, T. Click nucleic acid ligation: applications in biology and nanotechnology. Acc. Chem. Res. 45, 1258–1267 (2012).
Kumar, R. et al. Template-directed oligonucleotide strand ligation, covalent intramolecular DNA circularization and catenation using click chemistry. J. Am. Chem. Soc. 129, 6859–6864 (2007).
Routh, A., Head, S. R., Ordoukhanian, P. & Johnson, J. E. ClickSeq: fragmentation-free next-generation sequencing via click ligation of adaptors to stochastically terminated 3′-azido cDNAs. J. Mol. Biol. 427, 2610–2616 (2015).
Miura, F. et al. Triazole linking for preparation of a next-generation sequencing library from single-stranded DNA. Nucleic Acids Res. 46, e95 (2018).
El-Sagheer, A. H., Sanzone, A. P., Gao, R., Tavassoli, A. & Brown, T. Biocompatible artificial DNA linker that is read through by DNA polymerases and is functional in Escherichia coli. Proc. Natl Acad. Sci. USA 108, 11338–11343 (2011).
Li, Y., Korolev, S. & Waksman, G. Crystal structures of open and closed forms of binary and ternary complexes of the large fragment of Thermus aquaticus DNA polymerase I: structural basis for nucleotide incorporation. EMBO 17, 7514–7525 (1998).
Shivalingam, A., Tyburn, A. E. S., El-Sagheer, A. H. & Brown, T. Molecular requirements of high-fidelity replication-competent DNA backbones for orthogonal chemical ligation. J. Am. Chem. Soc. 139, 1575–1583 (2017).
Hottin, A. & Marx, A. Structural insights into the processing of nucleobase-modified nucleotides by DNA polymerases. Acc. Chem. Res. 49, 418–427 (2016).
Hocek, M. Synthesis of base-modified 2′-deoxyribonucleoside triphosphates and their use in enzymatic synthesis of modified DNA for applications in bioanalysis and chemical biology. J. Org. Chem. 79, 9914–9921 (2014).
Kuwahara, M. & Sugimoto, N. Molecular evolution of functional nucleic acids with chemical modifications. Molecules 15, 5423–5444 (2010).
Balintova, J. et al. Benzofurazane as a new redox label for electrochemical detection of DNA: towards multipotential redox coding of DNA bases. Chem. Eur. J. 19, 12720–12731 (2013).
Balintova, J. et al. Azido-phenyl as a click-transformable redox label of DNA suitable for electrochemical detection of DNA–protein interactions. Chem. Sci. 6, 575–587 (2015).
Havranova-Vidlakova, P. et al. Vicinal diol-tethered nucleobases as targets for DNA redox labeling with osmate complexes. ChemBioChem 21, 171–180 (2020).
Simonova, A. et al. Tuning of oxidation potential of ferrocene for ratiometric redox labeling and coding of nucleotides and DNA. Chem. Eur. J. 26, 1286–1291 (2020).
Obeid, S., Yulikov, M., Jeschke, G. & Marx, A. Enzymatic synthesis of multiple spin-labeled DNA. Angew. Chem. Int. Ed. 47, 6782–6785 (2008).
Cekan, P., Smith, A. L., Barhate, N., Robinson, B. H. & Sigurdsson, S. T. Rigid spin-labeled nucleoside Ç: a nonperturbing EPR probe of nucleic acid conformation. Nucleic Acids Res. 36, 5946–5954 (2008).
Dadova, J. et al. Vinyl-sulfonamide and acrylamide modification of DNA for cross-linking with proteins. Angew. Chem. Int. Ed. 52, 10515–10518 (2013).
Arndt, S. & Wagenknecht, H. A. “Photoclick” postsynthetic modification of DNA. Angew. Chem. Int. Ed. 53, 14580–14582 (2014).
Ren, X., El-Sagheer, A. H. & Brown, T. Efficient enzymatic synthesis and dual-colour fluorescent labelling of DNA probes using long chain azido-dUTP and BCN dyes. Nucleic Acids Res. 44, e79 (2016).
Matyasovsky, J., Perlikova, P., Malnuit, V., Pohl, R. & Hocek, M. 2-Substituted dATP derivatives as building blocks for polymerase-catalyzed synthesis of DNA modified in the minor groove. Angew. Chem. Int. Ed. 55, 15856–15859 (2016).
Baccaro, A., Steck, A. L. & Marx, A. Barcoded nucleotides. Angew. Chem. Int. Ed. 51, 254–257 (2012).
Verga, D., Welter, M., Steck, A. L. & Marx, A. DNA polymerase-catalyzed incorporation of nucleotides modified with a G-quadruplex-derived DNAzyme. Chem. Commun. 51, 7379–7381 (2015).
Verga, D., Welter, M. & Marx, A. Sequence selective naked-eye detection of DNA harnessing extension of oligonucleotide-modified nucleotides. Bioorg. Med. Chem. Lett. 26, 841–844 (2016).
Welter, M., Verga, D. & Marx, A. Sequence-specific incorporation of enzyme–nucleotide chimera by DNA polymerases. Angew. Chem. Int. Ed. 55, 10131–10135 (2016).
Isobe, H., Fujino, T., Yamazaki, N., Guillot-Nieckowski, M. & Nakamura, E. Triazole-linked analogue of deoxyribonucleic acid (TLDNA): design, synthesis, and double-strand formation with natural DNA. Org. Lett. 10, 3729–3732 (2008).
Varizhuk, A. M. et al. Synthesis of triazole-linked oligonucleotides with high affinity to DNA complements and an analysis of their compatibility with biosystems. J. Org. Chem. 78, 5964–5969 (2013).
Sharma, V. K. & Watts, J. K. Oligonucleotide therapeutics: chemistry, delivery and clinical progress. Future Med. Chem. 7, 2221–2242 (2015).
Doublie, S., Sawaya, M. R. & Ellenberger, T. An open and closed case for all polymerases. Structure 7, R31–R35 (1999).
Kunkel, T. A. DNA replication fidelity. J. Biol. Chem. 279, 16895–16898 (2004).
Filichev, V. V., Astakhova, I. V., Malakhov, A. D., Korshun, V. A. & Pedersen, E. B. 1-, 2-, and 4-Ethynylpyrenes in the structure of twisted intercalating nucleic acids: structure, thermal stability, and fluorescence relationship. Chem. Eur. J. 14, 9968–9980 (2008).
Hocek, M. & Fojta, M. Cross-coupling reactions of nucleoside triphosphates followed by polymerase incorporation. Construction and applications of base-functionalized nucleic acids. Org. Biomol. Chem. 6, 2233–2241 (2008).
Moriyama, K., Kimoto, M., Mitsui, T., Yokoyama, S. & Hirao, I. Site-specific biotinylation of RNA molecules by transcription using un-natural base pairs. Nucleic Acids Res. 33, e129 (2005).
Borsenberger, V., Kukwikila, M. & Howorka, S. Synthesis and enzymatic incorporation of modified deoxyuridine triphosphates. Org. Biomol. Chem. 7, 3826–3835 (2009).
Astakhova, K. & Wengel, J. Interfacing click chemistry with automated oligonucleotide synthesis for the preparation of fluorescent DNA probes containing internal xanthene and cyanine dyes. Chem. Eur. J. 19, 1112–1122 (2013).
El-Sagheerab, A. H. & Brown, T. Click chemistry with DNA. Chem. Soc. Rev. 39, 1388–1405 (2010).
Popenoe, E. A. & Schimaeler, M. A. Interaction of human DNA polymerase β with Ions of copper, lead, and cadmium. Arch. Biochem. Biophys. 196, 109–120 (1979).
Novello, F. & Stirpe, F. The effects of copper and other ions on the ribonucleic acid polymerase activity of isolated rat liver nuclei. Biochem. J. 111, 115–119 (1969).
Rothwell, P. J. & Waksman, G. Structure and mechanism of DNA polymerases. Adv. Protein Chem. 71, 401–440 (2005).
Reeve, M. A. & Fuller, C. W. A novel thermostable polymerase for DNA sequencing. Nature 376, 796–797 (1995).
Sorensen, R. S. et al. Enzymatic ligation of large biomolecules to DNA. ACS Nano 7, 8098–8104 (2013).
Xu, H., Fair, B. J., Dwyer, Z. W., Gildea, M. & Pleiss, J. A. Detection of splice isoforms and rare intermediates using multiplexed primer extension sequencing. Nat. Methods 16, 55–58 (2019).
Zheng, Z. et al. Anchored multiplex PCR for targeted next-generation sequencing. Nat. Med. 20, 1479–1484 (2014).
Kapustina, Ž. et al. High-resolution microbiome analysis enabled by linking of 16S rRNA gene sequences with adjacent genomic contexts. Microb Genom. 7, 624 (2021).
Acknowledgements
We are grateful Dr. Lukas Taujenis, Dr. Vytautas Tamošiūnas, and Martynas Melinskas for cooperation in finding suitable method for LC-MS analysis and data processing. Also, we thank Dr. Alessandro Panattoni and Prof. Dr. Michal Hocek for guidance and useful advices and Ramunė Leipuvienė for providing comments on the manuscript.
Author information
Authors and Affiliations
Contributions
J.M. designed molecules structure, synthesis strategy, synthesized target materials, and wrote the manuscript. Ž.K. designed and performed sequencing experiment, co-wrote the manuscript. S.Ž. contributed to synthesis of target compounds. J.J. and R.S. implemented PEX experiments (incorporation and read-through). I.Č. supervised the research and co-wrote the manuscript. A.L. conceived the original idea and supervised the research. All authors reviewed the paper and provided comments.
Corresponding author
Ethics declarations
Competing interests
The authors declare the following competing financial interests: J.M., Ž.K., S.Ž., J.J., R.S., I.Č., and A.L. are employees of Thermo Fisher Scientific Baltics. A patent covering oligonucleotide-tethered nucleotides and their uses is pending; patent application assigned to Thermo Fisher Scientific, J.M., Ž.K., S.Ž., I.Č., and A.L. are among the inventors.
Peer review
Peer review information
Communications Chemistry thanks the anonymous reviewers for their contribution to the peer review of this work.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Medžiūnė, J., Kapustina, Ž., Žeimytė, S. et al. Advanced preparation of fragment libraries enabled by oligonucleotide-modified 2′,3′-dideoxynucleotides. Commun Chem 5, 34 (2022). https://doi.org/10.1038/s42004-022-00649-9
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s42004-022-00649-9
- Springer Nature Limited