Abstract
The emergence of Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV-2) was met with rapid development of robust molecular-based detection assays. Many SARS-CoV-2 molecular tests target multiple genetic regions of the virus to maximize detection and protect against diagnostic escape. Despite the relatively moderate mutational rate of SARS-CoV-2, numerous mutations with known negative impact on diagnostic assays have been identified. In early 2021, we identified four samples positive for SARS-CoV-2 with a nucleocapsid (N) gene drop out on Cepheid Xpert® Xpress SARS-CoV-2 assay. Sequencing revealed a single common mutation in the N gene C29200T. Spatiotemporal analysis showed that the mutation was found in at least six different Canadian provinces from May 2020 until May 2021. Phylogenetic analysis showed that this mutation arose multiple times in Canadian samples and is present in six different variants of interest and of concern. The Cepheid testing platform is commonly used in Canada including in remote regions. As such, the existence of N gene mutation dropouts required further investigation. While commercial SARS-CoV-2 molecular detection assays have contributed immensely to the response effort, many vendors are reluctant to make primer/probe sequences publicly available. Proprietary primer/probe sequences create diagnostic ‘blind spots’ for global SARS-CoV-2 sequence monitoring and limits the ability to detect and track the presence and prevalence of diagnostic escape mutations. We hope that our industry partners will seriously consider making primer/probe sequences available, so that diagnostic escape mutants can be identified promptly and responded to appropriately to maintain diagnostic accuracy.
Similar content being viewed by others
Introduction
Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) was identified as the etiological agent of coronavirus disease (COVID-19) with initial cases documented in Wuhan, China, in December 2019. A pandemic was declared in March 2020, and more than 200 million cases have been documented worldwide as of October 20211. SARS-CoV-2 is an RNA virus with a mutation acquisition rate estimated at 1.1 × 10–3 subs/site/year2. PCR-based diagnostic assays have been at the forefront of identification of COVID-19 cases and have played an integral role in the public health efforts to control transmission. Academic and industry groups alike have designed and produced many rRT-PCR assays to detect different genes of SARS-CoV-2.
In the winter of 2021, two nasopharyngeal swab (NPS) specimens tested with the Cepheid Xpert® Xpress SARS-CoV-2 assay in our institution produced an unusual pattern of detection, with low cycle threshold (Ct) values (between 13 and 15) for the envelope (E) gene target and nucleocapsid (N) gene ‘not detected’. We performed whole genome sequencing (WGS) to study the reasons leading to the N gene drop out and identified only one common mutation (synonymous C29200T) in the N gene of those two specimens (EPI_ISL_1490673 and EPI_ISL_1490676). Analysis of whole genome sequences previously generated from SARS-CoV-2-positives identified by our laboratory found two additional specimens harbouring the C29200T mutation (EPI_ISL_1490692 and EPI_ISL_1490679). These two specimens had been originally tested with another rRT-PCR assay (AltoStar SARS-CoV-2 assay; targets the spike (S) and E genes) and resulted in similarly low Ct values for the E gene (Ct 15.1 and 17.1) with N gene not detected when a 1:10 dilution was tested on the Xpert® Xpress SARS-CoV-2 PCR assay. These four samples were diluted 1:10 due to the limited specimen volumes and subsequently tested on the BD SARS-CoV-2 Reagents for BD MAX™ System; both N1 and N2 targets were detected with Ct values ranging from 11.4–16.7 and 12.4–17.0, respectively. Two of these four SARS-CoV-2-positive samples (EPI_ISL_1490673 and EPI_ISL_1490679) were also tested on the Xpert® Xpress SARS-CoV-2/Flu/RSV Assay, which combines E and N2 targets in a single channel and, expectedly, tested positive with Ct values of 12 and 15. However, we could not determine if the C29200T mutation caused an N gene drop out in the Xpert® Xpress SARS-CoV-2/Flu/RSV assay.
We then evaluated the distribution of the C29200T mutation in Canada. We analysed all 41,712 SARS-CoV-2 genomes from Canada available on GISAID as of May 22nd, 2021, and found 266 sequences with the C29200T mutation (0.6%), 7 sequences with Y ambiguities (meaning C or T), and 794 sequences with indeterminate base pair (N) at that position (Fig. 1). SARS-CoV-2 sequences from eight provinces were downloaded from GISAID with six provinces having at least one sequence with the C29200T mutation. Of those provinces, Ontario had the highest number of genomes with the C29200T mutation in 241 of 16,772 sequences (1.4%). Alberta, British Columbia, Nova Scotia, Manitoba, and Quebec had a lower number of sequences with the C29200T mutation ranging from 1 to 15 (0.1–0.2%) (Table 1). By the end of April 2021, approximately 1.2 million infections had been documented in Canada and we analysed sequences for 3.5% of those cases3. Therefore, the frequency presented above could be inaccurate. We then analysed the spatiotemporal distribution of the SARS-CoV-2 C29200T mutation and found the first sequences in Canada in the spring of 2020, and an increase in the number and proportion of sequences with this mutation in the fall and winter of 2020/2021, particularly in Ontario (Fig. 1). Because specimens sequenced and uploaded to GISAID are not known to be representative, we did not further analyze the mutation frequency by geography or time period (Table 1).
Finally, we performed phylogenetic analyses to study genetic relatedness in Canada and identified 16 different clades and two singletons with the C29200T mutant strains including a clade with 230 sequences (Fig. 2). The main clade (#18) included 224 sequences from Ontario, five from Alberta and one strain from Nova Scotia from November 2020 to January 2021. The sequence from Nova Scotia (EPI_ISL_1055724) and five sequences from Alberta are closely related to the strains from Ontario. There are two standalone Alberta clades with two and eight sequences (#5 and #13). There were nine other smaller clades in Ontario (clades #1, 6, 8–9, 12, 14–17) each including one to five sequences with C29200T mutation with other closely related sequences without that mutation. Of the four sequences from British Columbia, three cluster together (July 2020) and one is different (October 2020). The sequences from Quebec are located on three different branches and cluster mainly with sequences from Quebec without the C29200T mutation. The single Manitoba sequence is included in a clade of closely related sequences without the mutation C29200T from British Columbia and Alberta. Singletons #4 and #7 include one taxon from Quebec and British Columbia, respectively. Furthermore, we found 19 sequences with the C29200T mutation in six different lineages that have been identified as of variants of interest (VOI; B.1.429 and B.1.526) or variants of concern (VOC; B.1.1.7, B.1.351 and B.1.617.2) (Table 2)4. Additionally, we downloaded sequences from GISAID with submission dates between (i) May 25, 2021–September 3, 2021, that are of the ‘B.1.617.2 + AY.*’ (Delta) lineage, and (ii) up to Apr 13, 2022, that are of the ‘B.1.1.529 + BA.*’ (Omicron) lineage, as those lineages predominated in Canada during those times, to examine the presence of the N gene C29200T mutation. In total, 11,092 Delta and 68,535 Omicron sequences were reviewed and the C29200T mutation was found in 4 and 23 sequences, respectively. These analyses showed that this synonymous mutation can, and continues to, arise as independent events and then can be spread through transmission, even interprovincially.
Previous studies also identified some samples with an N gene escape on the Cepheid Xpert® Xpress SARS-CoV-2 assay5,6,7,8. Sequencing and RT-PCR on amplicons allowed Ziegler et al. to identify the mutation C29200T resulting in N gene detection escape. That mutation anneals in the centre of the CDC-N2 probe leading them to hypothesize that the Xpert® Xpress SARS-CoV-2 assay used that design6. Miller et al. sequenced the Xpert SARS-CoV-2 amplification products (positions 29164 to 29230) and identified two N gene mutations (C29197T and C29200T) that caused a Ct shift and/or loss of N gene amplification8. Leelawong et al. found five strains displaying an N gene drop out on the Xpert® Xpress SARS-CoV-2 assay with the common mutation C29200T5. However, when they used another CDC N2 gene-based assay they found that both the E and N gene-targets were well amplified. This led Leelawong et al. to question whether a molecular beacon (instead of a TaqMan probe) could result in the N gene drop out in the Xpert® Xpress SARS-CoV-2 assay. That our analysis also found strong amplification of the N1 and N2 genes using the BD SARS-CoV-2 Reagents for BD MAX™ System further supports the hypothesis proposed by Leelawong et al. Despite these investigations, a definitive understanding of the question at hand will ultimately require knowledge of the primer and probe sequences utilised in the Xpert® Xpress SARS-CoV-2 assay. In our experience, circumstance (i.e. identification of 2 disparate strains with a single common N gene mutation with 2 additional samples discovered subsequently by sequencing) aided in the identification of the C29200T mutation as the cause of the N gene drop out. While critical in these findings, WGS would have been insufficient on its own to definitively identify a causative mutation had circumstances been different and viral target drop-out had been observed in a single SARS-CoV-2 positive sample. Notably, the Food and Drug Administration has added the Xpert® Xpress SARS-CoV-2 assay to their list of molecular tests impacted by SARS-CoV-2 mutations9. C29200T is a synonymous mutation that arose multiple times in evolution and was previously found in 0.2% of the sequences on GISAID from at least five countries5. Our study supports and extends the findings of others as we show that this mutation arose multiple times in SARS-CoV-2 evolution, can be spread by transmission, and is found in different lineages including those designated as VOIs and VOCs. Moreover, this is the first report detailing the distribution and emergence of this mutation within Canada.
The existence of recurring mutations that affect PCR efficiency highlights the need for increased transparency around commercial primer/probe sequences. Viral target redundancy has been a bedrock principle of molecular-based diagnostic assays for SARS-CoV-2 in an effort to minimize the potential for false negative results. The C29200T mutation found worldwide results in escape of N gene detection by the Cepheid Xpert® Xpress SARS-CoV-2 assay. This assay has been employed in many settings including tertiary and community hospitals, as well as remote community non-health care settings, owing to the minimal hands-on time requirement, quick turn around and high performance. Co-emergence of another mutation impacting the amplification of the E-gene could fully jeopardize the assay. Importantly however, loss of diagnostic performance is not limited to instances of complete diagnostic escape. Often, analytical sensitivity is not equivalent amongst the viral genes being targeted by a given assay and can differ by orders of magnitude10. As a result, loss of even a single gene target can significantly impact assay performance and increase the potential for false negatives. As such, the identification of SARS-CoV-2 strains carrying N gene escape mutations represents a major concern for a diagnostic platform that is highly relied upon in some of the most vulnerable of settings. Even more concerning is the emergence of such mutations amongst SARS-CoV-2 strains designated as VOI’s and VOC’s. Emergence of partial diagnostic escape mutants is not unique to the Xpert® Xpress SARS-CoV-2 PCR assay as other commercial assays have also been affected11,12,13,14,15. Inevitably, diagnostic escape mutations will continue to emerge as SARS-CoV-2 evolves. Of immense importance is the ability to identify and track the emergence of such mutations to mitigate diagnostic vulnerabilities in real time and not in retrospect. While commercial assays have been an invaluable pillar in response to the pandemic, many, but not all, vendors have obstructed line of sight towards diagnostic escape under the veil of proprietary knowledge. For this type of information to be withheld amid a pandemic that has claimed the lives of millions and infected hundreds of millions more is a major disappointment, at minimum. In this day and age, the rationale for withholding primer/probe sequences seems limited. In an era where molecular cloning and sequencing are routine methodologies, reverse engineering primer and probe sequences is possible for those who wish to do so. Moreover, primer/probe sequences are not the sole determinant of assay performance. Furthermore, for many clinical laboratories the decision to employ a (particular) commercial assay is complex, driven by numerous considerations and rarely influenced by a lack of knowledge of high-performing primer/probe sequences. Major efforts around the globe have been made to routinely sequence the SARS-CoV-2 genome, and for these sequences to be shared world-wide. In addition, various open-source initiatives have been launched to automate the identification of SARS-CoV-2 mutations that could impact test performance. However, primer/probe sequences that are proprietary create a diagnostic ‘blind spot’ for such initiatives. We call upon our industry partners, and our regulatory authorities, to work towards uncovering these blind spots.
Materials and methods
Sequence data, demographics, maps and statistics
A total of 41,712 genomes of SARS-CoV-2 (excluding non-human sequences) and demographics when available were obtained from GISAID (downloaded on May 22, 2021 http://gisaid.org/) to perform the statistical analyses of the C29200T mutation. The dataset included sequences representing eight Canadian provinces. The geographical maps of Canada were built with the geographic information system QGIS (v2.18.21, https://qgis.org).
PCR and sequencing
Cepheid Xpert® Xpress SARS-CoV-2 Assay (E gene and N gene; Cepheid, Sunnyvale, CA) and BD SARS-CoV-2 Reagents for BD MAX™ System (N(1) and N(2) gene; BD, Franklin Lakes, NJ) testing was performed on nasopharyngeal swabs according to the manufacturer’s instructions using samples EPI_ISL_1490673, EPI_ISL_1490676, EPI_ISL_1490692, and EPI_ISL_1490679. Due to limited sample volumes after clinical PCR testing, samples EPI_ISL_1490692 and EPI_ISL_1490679 were diluted 1:10 in COBAS PCR media prior to Xpert® Xpress SARS-CoV-2 PCR testing. All samples were diluted 1:10 in COBAS PCR media prior to BD Max rRT-PCR testing. No samples were diluted prior to extraction for sequencing. NEBNext ARTIC SARS-CoV-2 Companion Kit (Oxford Nanopore Technologies®) (New England Biolabs, Ipswich, MA) in conjunction with the Native Barcoding Expansion 96 kit, Sequencing Auxilliary Vials (EXP-AUX001), Flow Cell Priming Kit and MinION instrument with R9.4.1 flow cell (Oxford Nanopore Technologies®, Oxford, United Kingdom) were used for sample/library preparation and sequencing as previously described16,17.
Bioinformatic analyses were performed using the Artic Network Pipeline v1.1.3 using Nanopolish for variant calling18. QC was subsequently performed using Ncov-tools v1.5 with default config.yaml settings except for genome completeness which was set at 0.9019.
Molecular evolution
A total of 41,712 SARS-CoV-2 genomes were obtained from GISAID (downloaded on May 22, 2021). Multiple alignment was performed with MAFFT (v.7.309) and sequences with C29200T mutation were identified. A maximum likelihood tree with IQ tree (v.1.6.2, http://www.iqtree.org/) using HKY substitution model and 1,000 ultrafast bootstraps was created and edited on Itol (v.6.3, https://itol.embl.de/). All Canadian sequences with the C29200T mutation were included as well as 70 closely related sequences without the C29200T mutation as controls. Positions analysed were 345 to 29,661 based on reference sequence used NC_045512 (GenBank). All sequences used in our analyses can be found in GISAID and accession numbers can be found in Fig. 2 and Supplemental Table 1. Additionally, we downloaded all sequences uploaded to GISAID with submission dates of (i) May 25–September 3, 2021, of the Delta (GISAID variant field = ‘B.1.617.2 + AY.*’) lineage and (ii) all up to April 13, 2022, that are of the Omicron (GISAID variant field = ‘B.1.1.529 + BA.*’) lineage. We created an alignment using MAFFT and searched for the C29200T mutation.
Ethics statement
Specimens were collected from patients by healthcare workers of Unity Health Toronto for testing as part of routine clinical service and informed consent was obtained from all participants and/or their legal guardians. Whole genome sequencing of SARS-CoV-2 positive samples was approved by the St-Michael’s Hospital Research Ethics Board (REB# 94-103) and did not require additional patient’s consent. All methods were carried out in accordance with relevant guidelines and regulations. To protect patient privacy and confidentiality, data are aggregated and reported in an anonymized format.
Change history
19 August 2022
A Correction to this paper has been published: https://doi.org/10.1038/s41598-022-17590-5
References
Center for Systems Science and Engineering at Johns Hopkins University. COVID-19 Dashboard. https://coronavirus.jhu.edu/map.html. Accessed 14 Aug 2021 (2021).
Duchene, S. et al. Temporal signal and the phylodynamic threshold of SARS-CoV-2. Virus Evol. 6, 061 (2020).
Canada COVID-19 situation update. https://www.canada.ca/en/public-health/services/diseases/2019-novel-coronavirus-infection.html?utm_campaign=hc-sc-phm-21-22&utm_medium=sem&utm_source=ggl&utm_content=ad-text-en&utm_term=covid19cases&adv=2122-0008&id_campaign=396443591&id_source=1251244778444200&id_content=&gclsrc=aw.ds&&gclid=076d528fb1d21d9e97a0338fd35d09c8&gclsrc=3p.ds. Accessed 12 May 2021 (2021).
World Health Organization. Tracking SARS-CoV-2 variants. https://www.who.int/activities/tracking-SARS-CoV-2-variants. Accessed 9 May 2022 (2022).
Leelawong, M. et al. SARS-CoV-2 N gene mutations impact detection by clinical molecular diagnostics: Reports in two cities in the United States. Diagn. Microbiol. Infect. Dis. 101, 115468 (2021).
Ziegler, K. et al. SARS-CoV-2 samples may escape detection because of a single point mutation in the N gene. Eurosurveillance 25, 2001650 (2020).
Rhoads, D. D. et al. Endemic sars-cov-2 polymorphisms can cause a higher diagnostic target failure rate than estimated by aggregate global sequencing data. J. Clin. Microbiol. 59, 1–2 (2021).
Miller, S., Lee, T. & Merritt, A. Single-point mutations in the N gene of SARS-CoV-2 adversely assay. Microbiol. Spectrum 9, 1–7 (2021).
SARS-CoV-2 Viral Mutations: Impact on COVID-19 Tests. https://www.fda.gov/medical-devices/coronavirus-covid-19-and-medical-devices/sars-cov-2-viral-mutations-impact-covid-19-tests. Accessed 3 June 2021 (2021).
LeBlanc, J. J. et al. Real-time PCR-based SARS-CoV-2 detection in Canadian laboratories. J. Clin. Virol. 128, 104433 (2020).
Brown, K. A. et al. S-Gene target failure as a marker of variant B.1.1.7 among SARS-CoV-2 isolates in the Greater Toronto area, December 2020 to March 2021. JAMA J. Am. Med. Assoc. 325, 2115–2116 (2021).
Artesi, M. et al. CoArt 02.Artesi a recurrent mutation at position 26340 of SARS-CoV-2 is associated with failure of the e gene quantitative reverse. J. Clin. Microbiol. 58, 1–8 (2020).
Wang, R., Hozumi, Y. & Yin, C. Mutations on COVID-19 diagnostic targets. Genomics 112, 5204 (2020).
Sánchez-Calvo, J. M., AladosArboledas, J. C., Ros Vidal, L., de Francisco, J. L. & López Prieto, M. D. Diagnostic pre-screening method based on N-gene dropout or delay to increase feasibility of SARS-CoV-2 VOC B.1.1.7 detection. Diagn. Microbiol. Infect. Dis. 101, 115491 (2021).
Vanaerschot, M. et al. Identification of a polymorphism in the N gene of SARS-CoV-2 that adversely impacts detection by reverse transcription-PCR. J. Clin. Microbiol. 59, 1–4 (2021).
Quick, J. & Barnes, K. nCoV-2019 sequencing protocol v3 (LoCost). https://www.protocols.io/view/ncov-2019-sequencing-protocol-v3-locost-bp2l6n26rgqe/v3 (2020).
Tyson, J. R. et al. Improvements to the ARTIC multiplex PCR method for SARS-CoV-2 genome sequencing using nanopore. bioRxiv Preprint Serv. Biol. https://doi.org/10.1101/2020.09.04.283077 (2020).
Quick, J. et al. Real-time, portable genome sequencing for Ebola surveillance. Nature 530, 228–232 (2016).
Simpson J. ncov-tools. http://github.com/jts/ncov-tools. Accessed 30 Sept 2021 (2021).
Acknowledgements
We are grateful to the clinical laboratory team members who have been involved in SARS-CoV-2 testing throughout the pandemic at Unity Health Toronto. We would like to thank Manon Tetreault for data collection. We also thank Ilinca Lungu, Bernard Lam, Jeremy Johns, Paul Krzyzanowski, Michael Laszloffy, Lawrence E. Heisler, and Felicia Vincelli for data produced at OICR and released to GISAID. We thank Patryk Aftanas for his contributions (Sunnybrook Research Institute). We are grateful to the numerous scientists who have shared their genome data with the world on GISAID (Supplementary data Table 1, EPI_SET_20220511hw, EPI_SET_20220511gd and EPI_SET_20220511bs). We acknowledge the contributions of Ontario’s COVID-19 Genomics Rapid Response Coalition (oncov.ca) and the Ontario COVID-19 Genomics Network, and CanCOGGeN VirusSeq (cancogen.ca) for coordinating, supporting, and contributing to funding for SARS-CoV-2 sequencing in Ontario. The results here are in whole or part based upon data hosted at the Canadian VirusSeq Data Portal: https://virusseq-dataportal.ca/. We wish to acknowledge the following organisations/laboratories for contributing data to the Portal: Canadian Public Health Laboratory Network (CPHLN), CanCOGGeN VirusSeq, Saskatchewan—Roy Romanow Provincial Laboratory (RRPL), Alberta ProvLab North (APLN), National Microbiology Laboratory (NML), BCCDC Public Health Laboratory, and Manitoba Cadham Provincial Laboratory.
Funding
Funding was provided by the University of Toronto (Toronto COVID-19 Action Fund) and the University of Toronto McLaughlin Centre. S.M. is supported through funds provided by the University of Toronto (Toronto COVID-19 Action Fund) and the University of Toronto McLaughlin Centre. J.T.S. is supported by the Ontario Institute for Cancer Research through funds provided by the Government of Ontario and the Government of Canada through Genome Canada and Ontario Genomics (OGI-136 and OGI-201).
Author information
Authors and Affiliations
Contributions
S.I., K.B., R.F.: study design, data analyses, manuscript redaction. M.A.: study design, phylogenetic analyses, created Fig. 2, manuscript redaction. M.R.I.: geographic data analysis, created Fig. 1, manuscript redaction. R.B., P.C.Z., C.P.S., K.R.B., P.M.S., L.M.M., J.B.G., A.J.M., S.M., J.T.S.: study design, sequencing, manuscript redaction. All authors reviewed the manuscript.
Corresponding author
Ethics declarations
Competing interests
J.T.S. receives research funding from Oxford Nanopore Technologies (ONT) and has received travel support to attend and speak at meetings organised by ONT. All other authors declare no potential conflict of interest.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
The original online version of this Article was revised: The original version of this Article contained an error in the spelling of the author Allison J. McGeer which was incorrectly given as Alisson J. McGeer. In addition, the author Marc R. Isabel was incorrectly affiliated with “Department of Laboratory Medicine and Pathobiology, Temerty Faculty of Medicine, University of Toronto, Toronto, ON, Canada”. The correct affiliations are listed here: “Marc R. Isabel is an independent scientist” and “Matane, Quebec, Canada”.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Isabel, S., Abdulnoor, M., Boissinot, K. et al. Emergence of a mutation in the nucleocapsid gene of SARS-CoV-2 interferes with PCR detection in Canada. Sci Rep 12, 10867 (2022). https://doi.org/10.1038/s41598-022-13995-4
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-022-13995-4
- Springer Nature Limited
This article is cited by
-
SARS-CoV-2 viral variants can rapidly be identified for clinical decision making and population surveillance using a high-throughput digital droplet PCR assay
Scientific Reports (2023)
-
A mutation responsible for impaired detection by the Xpert SARS-CoV-2 assay independently emerged in different lineages during the SARS-CoV-2 pandemic
BMC Microbiology (2023)