Abstract
The COVID-19 pandemic has profoundly affected all aspects of our lives. Through real-time monitoring and rapid vaccine implementation, we succeeded in suppressing the spread of the disease and mitigating its consequences. Finally, conclusions can be summarized and drawn. Here, we use the example of Poland, which was seriously affected by the pandemic. Compared to other countries, Poland has not achieved impressive results in either testing or vaccination, which may explain its high mortality (case fatality rate, CFR 1.94%). Through retrospective analysis of data collected by the COVID-19 Data Portal Poland, we found significant regional differences in the number of tests performed, number of cases detected, number of COVID-19-related deaths, and vaccination rates. The Masovian, Greater Poland, and Pomeranian voivodeships, the country’s leaders in vaccination, reported high case numbers but low death rates. In contrast, the voivodeships in the eastern and southern parts of Poland (Subcarpathian, Podlaskie, Lublin, Opole), which documented low vaccination levels and low case numbers, had higher COVID-19-related mortality rates. The strong negative correlation between the CFR and the percentage of the population that was vaccinated in Poland supports the validity of vaccination. To gain insight into virus evolution, we sequenced more than 500 genomes and analyzed nearly 80 thousand SARS-CoV-2 genome sequences deposited in GISAID by Polish diagnostic centers. We showed that the SARS-CoV-2 variant distribution over time in Poland reflected that in Europe. Haplotype network analysis allowed us to follow the virus transmission routes and identify potential superspreaders in each pandemic wave.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Introduction
Coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), emerged in November 2019 in Wuhan, China, and quickly spread worldwide. Between December 2019 and December 2022, the novel coronavirus infected more than 660 million people and caused nearly 6.69 million deaths worldwide1,2,3. Approximately 36% of the world’s cases and 29% of the total deaths occurred in Europe2. Poland's first SARS-CoV-2 infection case was confirmed and reported on March 4, 2020. As of January 1, 2023, with 6.3 million cases of COVID-19, Poland was ranked 8th in Europe and 21st in the world2,3. Additionally, with 118.5 thousand reported deaths, Poland was ranked 7th and 16th in Europe and the world, respectively. Therefore, Poland may be considered one of the countries most seriously affected by the COVID-19 pandemic.
The appearance of COVID-19 in Poland caused the implementation of restrictions according to WHO recommendations, including lockdowns during the first wave of the pandemic. From March 20, 2020, to May 15, 2022, the epidemic was in force in Poland, and restrictions were temporarily suspended or restored depending on the number of active cases4. Once vaccines were developed and approved, the Polish government announced the National COVID-19 Vaccination Program and recommended vaccination as an effective method for preventing SARS-CoV-2 infection5. Mathematical modeling confirmed the validity of this strategy6. However, despite nationwide campaigns and educational programs promoting vaccination, the general attitude toward vaccination in Poland was cautious7,8. As a result, on January 1, 2023, only 56.76% of the Polish population was fully vaccinated3,9. For comparison, at the same time, the world average was 63.39%, while for Europe and the European Union, these values were even greater (66.90% and 72.82%, respectively)3,9.
The development of tests and vaccines would not be possible without knowledge about the SARS-CoV-2 sequence and structure. Chinese scientists published the first whole-genome sequence of the novel zoonotic betacoronavirus in January 202010. To follow the evolution of the virus, mass sequencing of samples from infected patients began. At the beginning of 2023, almost 15 million SARS-CoV-2 whole-genome sequences were collected by the Global Initiative on Sharing All Influenza Data (GISAID, www.gisaid.org)11. Corresponding repositories were established locally, e.g., the European COVID-19 Data Platform (www.covid19dataportal.org), operated by the European Commission and EMBL's European Bioinformatics Institute (EMBL-EBI)12. As a part of the EMBL-EBI initiative, we (the Institute of Bioorganic Chemistry Polish Academy of Sciences (IBCH PAS) and the affiliated Poznan Supercomputing and Networking Center (PSNC)) launched the COVID-19 Data Portal Poland (https://dataportal.covidhub.psnc.pl)13. This platform includes databases and tools supporting researchers in sharing and visualizing data associated with SARS-CoV-2 and COVID-19 in Poland.
Here, for the first time, we present a comprehensive analysis of the situation in Poland over two years of the pandemic across the country and against a worldwide background. Taking advantage of the COVID-19 Data Portal Poland, we retrospectively traced the development of the pandemic in our country. In addition, we analyzed nearly 80 thousand SARS-CoV-2 genome sequences available in the GISAID database, including more than 500 genomes sequenced in our laboratory.
Methods
Sample collection
Postdiagnostic nasopharyngeal samples (1027) and previously extracted RNA samples (81) were obtained from the Provincial Sanitary-Epidemiological Station and Centers for Medical Genetics GENESIS in Poznan, Baptism of Poland Memorial Hospital in Gniezno, the Regional Science and Technology Center in Podzamcze and the National Institute of Geriatrics, Rheumatology and Rehabilitation in Warsaw. The study conformed to the ethical guidelines of the World Medical Association Declaration of Helsinki. Because the biological material came from postdiagnostic waste and the samples were anonymized, it was not possible to obtain consent from individual people. However, appropriate approval was obtained from the Bioethics Committee at the Poznan University of Medical Sciences to use archival material (No. 457/20 from 17 June 2020). The data deposited in the databases did not contain any information based on which sample donors could be identified.
Epidemiological data analysis
Epidemiological data and worldwide statistics at the country level were retrieved from public repositories, i.e., Our World In Data3 and Worldometer2. Regional data for Poland were downloaded from the COVID-19 Data Portal Poland13.
Viral RNA isolation. Total RNA from nasopharyngeal swabs was extracted with a CoV RNA kit (A&A Biotechnology, Poland) in the BSL-2 chambers of the Laboratory of Cell and Tissue Culture, IBCH PAS. Before library preparation, the samples were additionally checked for viral load via RT‒qPCR of the ORF1a/b and S genes via the MediPAN-2G COVID test (IBCH PAS/Medicofarma, Poland).
Next-generation sequencing. RNA was reverse transcribed to complementary DNA (cDNA) with SuperScript IV Reverse Transcriptase (Thermo Fisher Scientific, MA, USA). Second strand synthesis was performed with the NEBNext Ultra II Non-Directional RNA Second Strand Synthesis Module (New England Biolabs). One nanogram of dsDNA was used for library preparation with the TruePrep DNA Library Prep Kit V2 for Illumina (1 ng) (Vazyme). Indexed DNA libraries were pooled (4–6 samples) and subjected to probe-based enrichment with a myBaits Expert SARS-CoV-2 Panel (Arbor Biosciences). Library quantitation was performed with a Qubit DNA HS Assay (Thermo Fisher Scientific). For the library quality check, the Bioanalyzer 2100 and DNA 1000 Kit or DNA High Sensitivity Kit (Agilent Technologies, CA, USA) were used according to the manufacturer’s instructions. Libraries were sequenced on an Illumina NextSeq 550 platform using paired-end reads and a NextSeq 500/550 High Output Kit v2.5 (300 cycles) (Illumina, San Diego, CA, USA). Up to 96 samples were sequenced per run, resulting in an average of 10.5 million (median 3.15 million) raw reads and an average coverage of 3233x (median 370x) per sample. The raw fastQ files were aligned to the SARS-CoV-2 reference sequence (NC_045512.2)14 with BBMap15. BAM files were further processed with BBTools, SAMtools and BCFtools to create final consensus sequences16.
Analysis of SARS-CoV-2 genome sequencing data. SARS-CoV-2 sequences dated up to May 23, 2022, were downloaded from GISAID (11) (EPI_SET_231114ua). First, the search criteria were limited according to host (human), location (Poland), and complete genome sequence (> 2900 nt). Next, we excluded low-coverage entries with more than 5% unknown nucleotides (N). We used the same inclusion criteria for our sequencing data. Then, we performed redundant metadata cleanup and excluded repeated data records, nonhuman origin records, and records without a complete subset of metadata for a final number of 79,194 (78,674 from GISAID and 520 from our laboratory).
Haplotype network. A total of 79,194 SARS-CoV-2 sequences were divided into five pandemic waves. Due to the large number of sequences representing the 3rd, 4th and 5th waves, we subsampled the raw data for those waves with the Genome-Sampler tool17. The first filtering step involved selecting genomes across time, with the default parameters of seven genomes per seven-day interval. The second filter was diversity-based; the sequences were clustered at the 99.00% identity threshold, and the centroid sequence from each cluster was selected as a representative. Finally, a minor allele frequency (MAF) filter of 5% was applied to exclude rare variants. Then, the consensus sequences were created and submitted for haplotype network analysis. Rapid calculation of the full-length MSAs of closely related viral genomes was performed with MAFT-7.490 using the default parameters –6merpair, –keeplength, and –addfragments18. The haplotype network was created in the R package (R 4.2.0) as follows19. The sequences were trimmed with the ips package. For haplotype extraction and network construction, the pegas package was applied20. Finally, the network was visualized using igraph (layout_with_graphopt-force-directed layout algorithm used for scaling the graph)21,22.
Statistical data visualization. Maps and selected plots presenting the epidemiology data were retrieved from the COVID-19 Data Portal Poland13 and Our World In Data Portal3. Other graphs for data visualization were generated using the ggplot2 package23 in R 4.2.0 and OriginPro 2021 9.8.0.200. The phylogenetic tree and variant frequencies of SARS-CoV-2 worldwide and in Europe were visualized via Nextstrain (https://gisaid.org/phylodynamics/global/nextstrain/) used under the CC-BY-4.0 license24.
Ethics approval and consent to participate
Patient consent was not applicable because only postdiagnostic archival and anonymized material was used in the study. Appropriate approval was obtained from the Bioethics Committee at the Poznan University of Medical Sciences (No. 457/20 from 17 June 2020).
Results
To compare the pandemic situation in Poland with those of other countries, we analyzed epidemiological data from Worldometer2, Our World In Data Portal3 and the COVID-19 Data Portal Poland13, as summarized in Supplementary Table 1.
COVID-19 cases and tests in Poland compared to other countries
Poland had almost 14 times fewer COVID-19 cases than the US, nearly four times fewer than Germany and the UK, and 2.8 times fewer than Italy; Poland also had 1.5 times more cases than Israel and Czechia and approximately 2.4 times more than Slovakia and Sweden (Fig. 1A)3. Because these data are not conclusive without adjustment for the country's population, we compared the number of COVID-19 cases per million inhabitants in the same countries (Fig. 1B). In Poland, this number exceeded 150 thousand. In the US and Sweden, it was 1.6 times greater, whereas in Germany and Italy, it was two times greater. Interestingly, very high numbers were recorded in Slovakia (> 460 thousand), Israel (> 440 thousand), and Lithuania (> 410 thousand).
As the observed differences might be influenced by the number of tests carried out in individual countries, we calculated the ratio of the number of tests to the number of COVID-19 cases per million people (Fig. 1B, Supplementary Table 1). Poland, with a ratio of 6.2, Germany (4.9), Sweden (7.5), and Lithuania (7.8) were among the countries with the lowest ratios. Notably, in the countries that implemented massive testing strategies, this ratio exceeded 20 (the UK and Slovakia).
Mortality due to COVID-19 in Poland compared to other countries
Nearly one-sixth of total COVID-19-related deaths were reported in the US (> 1 million deaths; February 2023)2. Over the same period, in Poland, nearly 8.6 times fewer (< 120 thousand) deaths due to COVID-19 were registered13. Poland reported more than 2.9 times more deaths than Czechia, 5.9 times more deaths than Slovakia, 12.5 times more deaths than Lithuania, and 16.7 times more deaths than Belarus (Fig. 1C). Germany was the only neighboring country of Poland with a greater total number of deaths. Unsurprisingly, there was a very strong correlation between the total number of cases and number of deaths reported in each country (Pearson’s r = 0.98, p = 1.66E−8; Supplementary Fig. 1A).
Unexpectedly, a comparison of the number of COVID-19-related deaths normalized per million people indicated that the mortality rate in Poland was similar to that in the US, UK, and Italy and was 1.75 times greater than that in Germany (Fig. 1D). Even higher mortality rates were noted in Czechia, Slovakia, and Lithuania. We found a moderate correlation between the normalized values of COVID-19 cases and deaths in each country (Pearson’s r = 0.64, p = 0.024; Supplementary Fig. 1B).
To objectively compare mortality rates, we calculated the case fatality rate (CFR) for each country (Supplementary Table 2). The CFR indicates the relationship between the total number of deaths due to COVID-19 and the number of diagnosed cases and is expressed as a percentage. Surprisingly, among the analyzed countries, the highest CFR was observed in Poland (1.94%). This result was significantly greater than that of the global CFR (1.20%), twice as high as that of the European CFR (0.95%), and almost three times greater than that of the average EU CFR (0.78%). A CFR equal to the global average was noted in the US (1.20%), whereas the lowest was in Israel (0.26%).
To estimate the impact of COVID-19 on overall mortality in each country, we also analyzed the excess mortality value, calculated as the difference between the cumulative number of deaths (per million) and the number of deaths predicted based on previous years. The greatest number of excess deaths were observed in Lithuania, Slovakia, Poland, and Czechia (Fig. 1E). The shapes of the curves plotted in Fig. 1E indicate the times of rapid increase in excess mortality, reflecting the pandemic waves (spring 2020, autumn 2020, and winter 2021/2022).
Vaccination against COVID-19 in Poland compared to other countries
As of May 30, 2022, on average, 59.2% of the world’s population has been fully vaccinated against SARS-CoV-23,9. At the same time, the European average was 65.9%, and the EU average was 72.6%. In Europe, Malta (88%), Portugal (86%), and Spain (85%) were the leaders in vaccination. The vaccination rate achieved by Poland (56.5%) was below the global average (Fig. 1F)5. Among the neighboring countries of Poland, a lower percentage was noted only in Slovakia (45.7%). In Czechia, Belarus, Lithuania, and Germany, vaccination rates were much greater (66.5%, 65.4%, 68.2%, and 76%, respectively) (Fig. 1F). Notably, very high vaccination rates were observed in China (88.2%) and Italy (81.2%), which were severely affected by COVID-19 at the beginning of the pandemic (Fig. 1F).
By analyzing the vaccination timeline, we concluded that most populations were vaccinated shortly after the vaccination programs started, i.e., between April and August 2021 (Fig. 1G). Initially, countries such as Israel, the US, and the UK were leaders in vaccination but were eventually surpassed by other countries (e.g., China, Italy, and Germany). In Poland, before the massive vaccination program was launched, particularly vulnerable groups were vaccinated (> 65 years old, teachers, and health care workers). The percentage of people who completed the initial vaccination protocol increased rapidly between May and July 2021, exceeding 40%. After that, the rate of vaccination slowed. We found a moderate negative correlation between the numbers of COVID-19 cases and deaths (normalized per million people) and the percentage of the population vaccinated with at least one dose of COVID-19 vaccine, but the results were not statistically significant (case-vaccination Pearson’s r = − 0.39, p value = 0.22; death-vaccination Pearson’s r = − 0.45, p value = 0.13; Supplementary Fig. 1C and D, respectively).
Regional distribution of SARS-CoV-2 infections in Poland
To gain deeper insight into the situation in Poland, we analyzed data collected by the COVID-19 Data Portal Poland13. On May 30, 2022, the highest number of cases was reported in the Masovian voivodeship (15.31% of all cases), followed by Silesian (11.94%) and Greater Poland (9.70%), whereas the lowest number was reported in Świętokrzyskie (2.44%), Opole (2.46%), Podlaskie (2.50%), and Lubusz (2.56%) (Fig. 2B,C, Supplementary Table 3A).
Normalization of the case numbers per 100,000 people reduced the differences between voivodeships (Fig. 2C,D), but the differences remained statistically significant (chi-square test value = 3.49E+3, two-tailed p value < 0.0001). More SARS-CoV-2 infections were detected in northern and central Poland, particularly in the Kuyavian-Pomeranian (7.19%), West Pomeranian (7.05%), Masovian (7.03%), and Pomeranian (7.01%) voivodeships. The lowest percentages were recorded in Subcarpathian (4.71%) and Świętokrzyskie (4.87%), which are located in southeastern Poland, and Podlaskie (5.25%), which is located in northeastern Poland (Fig. 2C,D, Supplementary Table 3B). A moderate positive correlation was found between population density and the total number of cases in each voivodeship (Pearson’s r = 0.64, p = 0.00743; Supplementary Fig. 2A).
The increase in infection rate over time showed a similar trend in all the voivodeships (Fig. 2E, F). The plot indicates several time points when cases increased faster: November 2020-February 2021, February-April 2021, and November 2021-February 2022.
Regional distribution of COVID-19-related deaths in Poland
The highest numbers of COVID-19-related deaths were reported in the Masovian (12.41% of all cases in Poland), Silesian (12.12%), and Greater Poland (9.01%) voivodeships (Fig. 3A, C, Supplementary Table 4A). This finding is consistent with the total number of cases reported in these regions (Fig. 2B). The lowest percentages of deaths were recorded in the Lubusz, Opole, Podlaskie, and Świętokrzyskie voivodeships (2.72%, 2.87%, 3.16%, and 3.55%, respectively). Notably, these regions also had the lowest percentage of SARS-CoV-2 infections (Fig. 3C). A very strong correlation was found between the number of cases and number of deaths in each voivodeship (Pearson's r = 0.96, p = 8.36E−9; Supplementary Fig. 2B).
The highest CFR values were observed for Subcarpathian (2.92%), Świętokrzyskie (2.81%), and Lublin (2.50%), while the lowest CFR values were observed for Masovian (1.57%), Lower Silesian (1.65%), and Pomeranian (1.67%) (Supplementary Table 5). We found a strong correlation between the number of deaths and population density for each voivodeship (Pearson’s r = 0.73, p = 0.00109; Supplementary Fig. 2C) but no correlation between the CFR and population density (Pearson's r = − 0.18, p = 0.5; Supplementary Fig. 2D).
Normalization per 100,000 people reduced the differences in the number of COVID-19-related deaths among voivodeships, but the difference remained statistically significant (chi-square test value = 5.02E+01, two-tailed p value < 0.0001). However, the ranking changed completely (Fig. 3B,C, Supplementary Table 4B). The highest percentages of population-normalized deaths were recorded in the Lublin (7.45%), Kuyavian-Pomeranian (7.01%), Opole (6.84%) and Subcarpathian (6.74%) voivodeships. In contrast, the lowest percentages were observed for the Lower Silesian (5.16%), Masovian (5.39%), and West Pomeranian (5.51%) voivodeships. The high mortality in the Kuyavian-Pomeranian voivodeship was consistent with the high population-normalized number of cases in this region (Fig. 2C,D), but this relationship was not observed for other voivodeships. For example, Lublin and Opole were in the middle of the ranking of the normalized number of cases, while Subcarpathian was the last in the country (Fig. 2C,D). In contrast, in terms of the normalized number of deaths, these regions were ranked 1st, 3rd, and 4th in the country, respectively (Fig. 3B,C).
Analysis of the increase in the number of deaths over time indicated a similar pattern for all regions (Fig. 3D,E). In general, this increase was consistent with the increase in SARS-CoV-2 infections, reflecting the dynamics of the COVID-19 pandemic in Poland.
Vaccination in Poland: regional distribution
In Poland, the highest total numbers of fully vaccinated people were recorded in three voivodeships, namely, Masovian, Silesian, and Greater Poland, while the lowest were recorded in Opole, Podlaskie, Lubusz, and Świętokrzyskie (Fig. 4A). For the numbers of fully vaccinated people normalized per 100,000 people, the leaders were Masovian, Greater Poland, and Pomeranian (Fig. 4B). These three voivodeships had a remarkably high percentage of reported cases (Fig. 2C) but a relatively low death rate (Fig. 3C).
The lowest level of vaccination was recorded in the voivodeships located in the eastern part of the country, namely, Subcarpathian (43%), Podlaskie (48%), and Lublin (50.6%), followed by Opole voivodeship in the south (52.6%) (Fig. 4C). Notably, Subcarpathian and Podlaskie were among the regions with the lowest percentage of cases in the country, while in Lublin and Opole, a moderate percentage of cases was observed (Fig. 2C). However, Lublin, Opole, and Subcarpathian had the highest mortality rates in the country (Fig. 3C). We found a strong negative correlation between the CFR and the percentage of the population that was vaccinated in each voivodeship (Pearson’s r = − 0.83, p = 7.04E−5; Supplementary Fig. 2E, Supplementary Table 5).
Next-generation sequencing of SARS-CoV-2 genomes
In cooperation with six health care and diagnostic institutions in Poland, we collected postdiagnostic samples from SARS-CoV-2-infected people from different regions. In total, over a thousand samples were collected between May 2020 and August 2021, encompassing the 1st and 2nd waves of coronavirus in Poland and the time between the waves. Due to the poor quality and/or quantity of some RNA samples, only 729 were used for library preparation. After the quality assessment, we chose 640 libraries for sequencing (NextSeq550, Illumina). As a result, we obtained 522 sequences with very good quality (< 5% of unread [N] nucleotides), which were subsequently submitted to the COVID-19 Data Portal Poland database (Supplementary Table 6). In addition, all sequences were deposited in the European Nucleotide Archive (ENA).
Analysis of SARS-CoV-2 genetic variability in Poland
To analyze the genetic variability of the virus, we used the data generated in our laboratory, as well as 78,657 good-quality SARS-CoV-2 genome sequences from Poland downloaded from the GISAID database (as of May 30, 2022). Most sequences were submitted during the 3rd, 4th, and 5th waves of the COVID-19 pandemic (Fig. 5A, B) and were isolated from middle-aged patients (Fig. 5C). The largest collection of sequencing data was from the Pomeranian (11.49%), Masovian (10.5%), Greater Poland (8.11%), and Silesian (7.63%) voivodeships (Fig. 5D). Generally, we found a strong correlation between the number of submitted sequences and the number of SARS-CoV-2 infections detected in each region (Spearman’s r = 0.76, p < 0.05).
The Delta 21J, Omicron 21K, Alpha 20I, and Omicron 21L variants were the most frequent in Poland (Fig. 6A). In each voivodeship, those 4 variants accounted for more than 90% of cases, but their proportions differed (Fig. 6B). The Alpha 20I variant, which circulated between December 2020 and June 2021, was observed in greater proportions in the Świętokrzyskie, Lower Silesian, Warmian-Masurian, and Silesian voivodeships. Nearly 55% of all sequences from the Podlaskie voivodeship were derived from the Delta 21J variant. The Omicron 21K variant accounted for more than 30% of the sequences from Lesser Poland, Opole, Greater Poland, and Subcarpathian. Approximately 20% of sequences from the Kuyavian-Pomeranian voivodeship were identified as Omicron 21L. Both Omicron variants, 21L and 21K, accounted for almost 55% of all variants identified in this region. Interestingly, Lublin was the only voivodeship where a noticeable proportion of the Omicron 21M variant was found.
By analyzing the variant distribution in Poland over time (Fig. 6C), we observed a founding 19A variant in less than 20% of the samples only at the beginning of the pandemic (March–May 2020). The samples sequenced until the end of 2020 mainly contained the 20A and 20B variants and a small proportion of the 20C, 20D, 20E, and 20G variants. Between January and June 2021, almost all the sequences were determined to be the Alpha 20I variant, which was displaced mostly by Delta 21J and small proportions of Delta 21I and Delta 21A. Delta variants circulated slightly longer than did Alpha variants from May 2021 until the end of 2021. Then, the viral population was dominated by Omicron variants. Interestingly, almost 31% of the sequences analyzed in this study were the Omicron 20K variant, which circulated between December 2021 and March 2022. From the end of January 2022, it was slowly displaced by Omicron 21L and disappeared by the end of 2022. Generally, the variant distribution over time in Poland reflected that in Europe (Supplementary Fig. 3B).
Haplotype network and superspreaders
To identify the virus transmission routes and potential superspreaders, we analyzed haplotype networks built based on the sequence similarity of SARS-CoV-2 variants circulating in Poland during two years of the COVID-19 pandemic. The term 'superspreaders’ usually refers to individuals who become a source of numerous infections and less often to settings, e.g., places and events responsible for extremely effective pathogen dissemination25. Due to the high number of sequences available, we generated separate haplotype networks for each pandemic wave (Supplementary Figs. 4–6). Moreover, for waves 3–5, subsampling was performed to reduce the number of sequences (Supplementary Table 7). Otherwise, the networks would have been too dense and impossible to interpret. The subsampling criteria are described in the Methods section.
The 1st wave network was simple and consisted of 58 haplotypes, but only eight nodes were represented by more than 10 cases (Fig. 7A, Supplementary Table 8). Of these, the H56 node contained the earliest SARS-CoV-2 cases (March–April 2020) and can therefore be treated as a symbolic starting point of the pandemic in Poland (Supplementary Fig. 5A). A total of 56% of H56 haplotypes were classified as 20A, 34% as 20B and 9% as 19A variants. Three voivodeships provided the greatest contribution to this node: Masovian, Lesser Poland and Pomeranian (Supplementary Fig. 4A). The largest node, H50, linked closely to H56 (through a smaller H52 node), consisted of 191 haplotypes, most of which belonged to 20A (48%), 20B (41%), and 19A (8%), which was the first SARS-CoV-2 clade (19A). These haplotypes were detected in the Łódź, Pomeranian, Masovian and Lesser Poland voivodeships, but unfortunately, for more than half of the H50 haplotypes, information about the location was not provided. All other nodes, except H57, were composed of 20B variants localized mainly in Greater Poland. This result is likely because IBCH, which was involved in COVID-19 diagnostics in Greater Poland, supplied almost half of the data for the 1st wave in Poland (Supplementary Table 7). Interestingly, in the H57 node, which was assigned mainly to the Pomeranian and Silesian voivodeships, one-third of the variants belonged to the 20D clade.
In the 2nd wave network, built from 1,460 sequences, 185 haplotypes were identified, including 21 represented by more than 10 cases (Fig. 7B, Supplementary Table 9). The largest node, H1, localized in the central part of the plot, reflects the beginning of this wave and corresponds mainly to the Greater Poland voivodeship (Supplementary Figs. 4B and 5B). The H1 node consists of 20A and 20B variants in comparable proportions, similar to its derived nodes H2, H4 and H41. A significant contribution of the 20I (Alpha) variant was observed in nodes H5, H75, H78, H6 and H61. Among the studied locations, potential superspreaders may be found in Greater Poland (H1, H6, H14, H12), Masovian (H53, H57, H61, H4, H75), West Pomeranian (H48, H41, H38), and Świętorzyskie (H81, H2, H4). The highest proportion of the 20I (Alpha) variant (> 25%) was observed in H5 (corresponding mainly to Warmian-Masurian) and H75 (shared by Masovian and Podlaskie).
The haplotype network generated for wave 3 (first half of 2021) consisted of 187 nodes, but only 13 contained more than 10 cases (Fig. 8A, Supplementary Table 10). The whole network is entirely dominated by the 20I (Alpha) variant (989 cases out of 1050 subsampled), which is consistent with the plot in Fig. 6C. The earliest cases were part of node H6, assigned to the Lower Silesian, Łódź, Świętokrzyskie, Greater Poland, Silesian and Warmian-Masurian voivodeships (Supplementary Figs. 4C and 5C). Similarly, two larger nodes (H3 and H1) were balanced mixtures of different voivodeships, demonstrating that the pandemic spread equally throughout the entire country. However, in some hubs, particular voivodeships predominated, which indicates the presence of potential superspreaders: Pomeranian (H13, H22, H60, H14), Lesser Poland (H18, H68), Lower Silesian (H4, H6, H27), Masovian (H17, H35, H15), Świętkorzyskie (H12) and Łódź (H2).
The wave 4 network was quite dense (362 nodes, including 39 with at least 10 cases) and dominated by the 21J (Delta) variant (1903 out of 2107, Fig. 8B, Supplementary Table 11). Another variant that contributed to this wave, although to a much lesser extent, was 21K (BA.1). This variant constituted more than half of the haplotypes in the H184 node, assigned mainly to Masovian and Lesser Poland (Supplementary Fig. 4D). A small number of cases of 21I (Delta) was also detected, particularly at node H14 (Silesian, Lesser Poland, Opole). Large nodes with significant contributions from particular voivodeships may be considered potential superspreaders, e.g., H16 (100% Silesian), H122 (100% Podlaskie), H7 (shared equally by Łódź and Greater Poland), H3 and H13 (Warmian-Masurian, 52% and 44%, respectively).
During wave 5, two variants, previously called Omicron, i.e., 21K (BA.1, 63%) and 21L (BA.2, 31%), predominated, with small contributions of 21J (Delta, < 5%) and 21M (Omicron, 1%). The whole network consisted of 445 nodes, 36 of which were represented by more than 10 cases (Fig. 9, Supplementary Table 12). Among them, H7, H61, H9, H14, H71 and H13, dated to March–April 2022 (Supplementary Fig. 5E), were dominated by the 21L (BA.2) clade. Most nodes were divided between different voivodeships (Supplementary Fig. 4E). However, in some nodes, specific voivodeships predominated, e.g., Silesian (100% of H39, 76% of H26, 70% of H71, 64% of H45), Greater Poland (68% of H5, 63% of H7, 61% of H3, 50% of H6, 48% of H2), Kuyavian-Pomeranian (65% of H13, 64% of H130, 51% of H83, 49% of H12, 19% of H1), Lublin (64% of H9, 58% of H11, 48% of H8), Warmian-Masurian (56% of the largest node, H1, and 26% of H12), Subcarpathian (74% of H10, 42% of H11) and Opole (61% of H34).
Discussion
The outbreak of the COVID-19 pandemic has changed the world irreversibly because it occurred in the era of the internet and global information exchange. On the one hand, high population density and commonly available long-distance travel enabled the spread of the virus. On the other hand, the development of science and technology allowed for an immediate response through massive testing and vaccinations. After the pandemic passed, it was determined that many factors influenced the spread of the virus and the course of the pandemic, including local restriction policies, testing and vaccination strategies, demographic features, sociobehavioral aspects, and the level of health care in each country26.
The response to the COVID-19 pandemic varied across countries. For example, in the beginning of the pandemic, the governments of the UK, the US, and Sweden decided to pursue the herd natural immunity strategy, which was criticized worldwide27. A high number of reinfections revealed that this strategy was not optimal in the case of SARS-CoV-228,29. Analysis of the first year of the pandemic in Nordic countries (Denmark, Norway, Iceland, Sweden, and Finland) showed that the recommendation-based model in Sweden rather than legislation-based restriction models led to increased SARS-CoV-2 infections30. Long-term observation in the UK also confirmed that the most efficient strategy was based on restrictions31. Travel restrictions and border migration control in the EU prevented the virus from spreading32. In Poland, soon after the first SARS-CoV-2 infection was reported, the government started to impose restrictions, including social distancing, travel limitations, and massive testing33. The restrictions in Poland were relaxed for the first time in June-October 2020. At the same time, restrictions were relaxed in Czechia, Lithuania, and Slovakia. When the second coronavirus wave hit approximately November 2020, a drastic increase in the number of excess deaths occurred in those four countries (Fig. 1E). Therefore, it can be assumed that restriction relaxation during the summer holidays contributed to the sudden increase in fatal infections in Poland and neighboring countries.
Despite relatively low numbers of infections per million people, Poland had a high death rate, comparable to that of the US, UK, and Italy, countries that reported more COVID-19 cases. The simplest explanation is that the actual number of cases in Poland was underestimated due to an insufficient number of tests performed. However, a low ratio of the number of tests to the number of detected cases was also observed in Germany, Sweden, and Lithuania. These countries had much lower CFRs than Poland. The observed contrast might result from differences in health care efficiency in particular countries. A report published in 202134 (based on Organization for Economic Cooperation and Development (OECD) data (stats.oecd.org)) showed that Poland ranked third from the bottom in terms of spending on health care and expenditures per capita. In 2019, expenditures on health care in Poland amounted to 4.3% of gross domestic product (GDP), which was much lower than the European average (6.53%). The highest ranking country was Germany (9.9% GDP), followed by France (9.4% GDP) and Sweden (9.3% GDP). It is worrying that health—related expenditures in Poland as a portion of GDP, after an increase in 2017, have been decreasing since 2018. The consequences might be severe: limited access to medical care, delayed medical intervention, and untreated chronic diseases. In our opinion, comorbidities could have contributed to the higher COVID-19-related mortality in Poland.
Mass vaccinations of the population helped to address the pandemic. Multiple studies have shown that even if vaccination does not prevent the possibility of viral infection, it affected the course of COVID-19 and contributed to decreased mortality35,36,37,38,39. Among the European countries, Poland, Hungary, Romania, and Bulgaria made no progress in reducing the number of COVID-19-related deaths40. The study showed a strong reverse correlation between mortality and the primary vaccination rate. Consistently, our data revealed a strong negative correlation between the CFR and the percentage of the vaccinated population in Poland. Notably, Bulgaria, Romania, and Slovakia had lower vaccination rates and more excess deaths per 100,000 people than Poland3. Israel, the first country worldwide to launch a massive vaccination strategy, exhibited the opposite results. In March 2021, 50% of its population was vaccinated (Fig. 1G)41. Despite the relatively high number of cases, a low number of COVID-19-related deaths and excess deaths were noted in Israel (Fig. 1A–E). Nevertheless, Israel had one of the lowest CFRs worldwide3. This finding indicates that the faster a vaccination is implemented, the better the prevention.
Interestingly, Kim et al. analyzed the incidence and mortality of COVID-19 in 107 countries worldwide42. They considered many factors, including socioeconomic and demographic features, geography, health care capacity, and health behaviors. They found that COVID-19 incidence and the CFR were influenced mainly by ethnicity and that East Asia was the least influenced by the pandemic. The authors concluded that Asian populations might be more resistant to SARS-CoV-2, as they have likely encountered similar pathogens in the past.
Regarding regional differences in Poland, the greatest total number of cases and deaths were observed in the Masovian and Silesian voivodeships. Warsaw, the capital of Poland and the largest and most populated city in the country, is located in the Masovian voivodeship. Warsaw is also a large labor market and international transfer hub and a potential source of new infections. Silesian is one of the smallest voivodeships in Poland but is the most urbanized region in the country, with the highest population density43 and severe air pollution. Some research has noted the influence of air pollution on susceptibility to viral infections, including SARS-CoV-244,45,46. An association between air pollution and COVID-19 mortality and morbidity was also observed in Poland47. Our data (Supplementary Table 5) showed that the highest CFR was found in Subcarpathian, Świętokrzyskie, and Lublin, regions that are less polluted and sparsely populated. In this case, insufficient vaccination seems to be the main reason for the high CFR, as Subcarpathian and Lublin are among the three voivodeships with the lowest vaccination rates.
Since the discovery of the first SARS-CoV-2 genome sequence, many subsequent variants with different transmission potentials and virulence levels have been described48,49,50,51,52. The worldwide variant evolution over time24,53 shows that new SARS-CoV-2 variants occasionally appeared and spread more successfully (Supplementary Fig. 3). For example, in Europe, the Alpha variant circulating between January and July 2021 displaced earlier variants (19A, 20A, 20B, 20E) and was further displaced by Delta (circulating June-December 2021) (Fig. 5B). In turn, the Delta variant was effectively replaced by Omicron 21K and its variants (21L, 21M, 22A, 22B, and 22C). The distribution of variants over time was generally similar in Europe and worldwide (Supplementary Fig. 3C). The differences occurred in autumn and winter 2020, when the 20E (EU1) variant was spreading mainly in Europe, and in spring and summer 2021, when a greater proportion of the Gamma variant (20J) (Supplementary Fig. 3C) was observed in the world than in Europe. At the same time, a lower percentage of the 20I Alpha VI variant and a slightly greater percentage of beta (20H) and lambda (21G) variants were observed worldwide than in Europe. Approximately November 2021, the Omicron variant (21K) efficiently displaced other variants, and since then, Omicron subvariants (21L, 22A, 22B and 22C) have circulated throughout the world (Supplementary Fig. 3C).
Our analysis of the viral genome sequencing data showed that the overall variant distribution in Poland over time did not differ from that in the rest of Europe. However, we found some regional differences in Poland. For example, the Delta variant 21J was responsible for ~ 55% of infections in Podlaskie and was also very frequent in the Subcarpathian, Świętokrzyskie, Lubusz, Masovian, and Warmian-Masurian voivodeships. All of them except Masovian had very high CFRs and low vaccination rates. Therefore, it cannot be excluded that the molecular features of Delta 21J, together with the low vaccination rate, contributed to higher mortality in these regions. Indeed, the Delta variant possesses multiple mutations in the Spike gene, making the virus 60% more transmissible than the Alpha variant and causing a more severe course of COVID-1954,55.
The haplotype network analysis of sequences obtained at the beginning of the pandemic (waves 1 and 2) may be biased by the small number of reported cases and unequal representation of sequenced samples for all regions of Poland. The haplotype networks for waves 4 and 5 are denser and more uniformly dispersed, which showed that in the second half of 2021, the pathogen was gradually, widely and evenly transmitted throughout the country. Nevertheless, some regional and temporal trends indicated the presence of potential superspreaders, particularly in Greater Poland (waves 1, 2, 4 and 5), Silesian and Warmian-Masurian (waves 4 and 5), Pomeranian and Lesser Poland (wave 3), Masovian and West Pomeranian (wave 2), and Kuyavian-Pomeranian and Lublin (wave 5) voivodeships. These results reflect the dynamics of the COVID-19 pandemic.
In conclusion, our research enabled us to demonstrate the relationships between various factors that could have been responsible for the course of the COVID-19 pandemic in Poland and other countries. Overall, the COVID-19 experience has become an invaluable resource for understanding infectious diseases and viral evolution. Learning from multiple studies on COVID-19 should allow us to better prepare for future pandemics.
Data availability
The data sequenced in our laboratory are available here: https://www.ebi.ac.uk/ena/browser/view/PRJEB39761 (sample IDs used in the analysis are listed in the first column of Supplementary Table 6). The data downloaded from GISAID, with the identifier EPI_SET_231114ua, are available at https://doi.org/10.55876/gis8.231114ua. All genome sequences and associated metadata in this dataset are published in GISAID’s EpiCoV database (https://www.epicov.org/epi3/frontend#256835). EPI_SET_231114ua is composed of 78,674 individual genome sequences. The collection dates ranged from 2020-03-03 to 2022-05-23. The data were collected in Poland. All sequences in this dataset were compared to those of hCoV-19/Wuhan/WIV04/2019 (WIV04), the official reference sequence employed by GISAID (EPI_ISL_402124, https://gisaid.org/WIV04).
References
World Health Organization (WHO). WHO Coronavirus (COVID-19) Dashboard. Available from: https://covid19.who.int
Worldometers.info. Worldometer-COVID-19 Coronavirus Pandemic. Dover Delaware U.S.A. Available from: https://www.worldometers.info/coronavirus/
Our World In Data https://ourworldindata.org/coronavirus. Available from: https://ourworldindata.org/coronavirus
Narodowy Program Szczepień przeciw COVID-19 [National Vaccination Programme against COVID-19]. 1–34 (2020). Available from: https://www.gov.pl/web/szczepimysie/narodowy-program-szczepien-przeciw-covid-19
Watson, O. J. et al. Global impact of the first year of COVID-19 vaccination: A mathematical modeling study. Lancet Infect Dis. 3099(22), 1–10. https://doi.org/10.1016/S1473-3099(22)00320-6 (2022).
Babicki, M. & Mastalerz-Migas, A. Attitudes toward vaccination against COVID-19 in Poland. A longitudinal study was performed before and two months after the commencement of the population vaccination program in Poland. Vaccines 9(5), 503 (2021).
Raciborski, F., Jankowski, M., Gujski, M., Pinkas, J. & Samel-Kowalik, P. Changes in attitudes towards the COVID-19 vaccine and the willingness to get vaccinated among adults in Poland: Analysis of serial, cross-sectional, representative surveys, January–April 2021. Vaccines 9(8), 832 (2021).
Mathieu, E. et al. A global database of COVID-19 vaccinations. Nat. Hum. Behav. 5(7), 947–953. https://doi.org/10.1038/s41562-021-01122-8 (2021).
Chan, J. F. et al. Genomic characterization of the 2019 novel human-pathogenic coronavirus isolated from a patient with atypical pneumonia after visiting Wuhan. Emerg. Microb. Infect. 9(1), 221–236 (2020).
Khare, S. et al. GISAID’s role in pandemic response. China CDC Wkly. 3(49), 1049–1051. https://doi.org/10.46234/ccdcw2021.255 (2021).
Harrison, P. W. et al. The COVID-19 data portal: Accelerating SARS-CoV-2 and COVID-19 research through rapid open access data sharing. Nucleic Acids Res. 49(W1), W619–W623 (2021).
COVID-19 Data Portal Poland. Available from: https://dataportal.covidhub.psnc.pl
Wu, F. et al. A new coronavirus associated with human respiratory disease in China. Nature 579(7798), 265–269 (2020).
Bushnell, B. BBMap: A Fast, Accurate, Splice-Aware Aligner. Lawrence Berkeley National Laboratory (2014). https://escholarship.org/uc/item/1h3515gn
Danecek, P. et al. Twelve years of SAMtools and BCFtools. Gigascience 10(2), giab008 (2021).
Bolyen, E. et al. Reproducibly sampling SARS-CoV-2 genomes across time, geography, and viral diversity. F1000Research 9, 657 (2020).
Katoh, K. & Standley, D. M. MAFFT multiple sequence alignment software version 7: Improvements in performance and usability. Mol. Biol. Evol. 30(4), 772–780 (2013).
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria (2021).
Paradis, E. pegas: An R package for population genetics with an integrated–modular approach. Bioinformatics 26(3), 419–420 (2010).
Gábor, C. & Tamás, N. The igraph software package for complex network research. Int. Complex Syst. 1695, 1–9 (2006).
Csárdi, G., Nepusz, T., Traag, V., Horvát, S., Zanin, F., Noom, D., et al. igraph: Network Analysis and Visualization in R. R Package version 1501 (2023). https//CRANR-project.org/package=igraph.
Wickham, H. ggplot2 Elegant Graphics for Data Analysis (Springer, 2016). https://doi.org/10.1007/978-3-319-24277-4.
Hadfield, J. et al. Nextstrain: Real-time tracking of pathogen evolution. Bioinformatics 34(23), 4121–4123. https://doi.org/10.1093/bioinformatics/bty407 (2018).
Cave, E. COVID-19 super-spreaders: Definitional quandaries and implications. Asian Bioeth. Rev. 12(2), 235–242. https://doi.org/10.1007/s41649-020-00118-2 (2020).
Cao, Y., Hiyoshi, A. & Montgomery, S. COVID-19 case-fatality rate and demographic and socioeconomic influencers: Worldwide spatial regression analysis based on country-level data. BMJ Open 10(11), e043560 (2020).
Vogel, G. Sweden’s gamble. Science 370(6513), 159–163 (2020).
Morens, D. M., Folkers, G. K. & Fauci, A. S. The concept of classical herd immunity may not apply to COVID-19. J. Infect. Dis. 226(2), 195–198. https://doi.org/10.1093/infdis/jiac109 (2022).
Aschwanden, C. The false promise of herd immunity for COVID-19. Nature 587(7832), 26–28 (2020).
Duchene, S. et al. The impact of public health interventions in the Nordic countries during the first year of SARS-CoV-2 transmission and evolution. Eurosurveillance 26, 44. https://doi.org/10.2807/1560-7917.ES.2021.26.44.2001996 (2021).
Keeling, M. J. et al. Predictions of COVID-19 dynamics in the UK: Short-term forecasting and analysis of potential exit strategies. PLOS Comput Biol. 17(1), e1008619. https://doi.org/10.1371/journal.pcbi.1008619 (2021).
Linka, K., Peirlinck, M., Sahli Costabal, F. & Kuhl, E. Outbreak dynamics of COVID-19 in Europe and the effect of travel restrictions. Comput. Methods Biomech. Biomed. Eng. 23(11), 710–717 (2020).
Website of the Republic of Poland www.gov.pl. Coronavirus: Information and recommendations. Temporary limitations. Available from: https://www.gov.pl/web/coronavirus/temporary-limitations
Skóbel, B., Kocemba, E. & Rudka, R. Nakłady na ochronę zdrowia w Polsce na tle innych państw OECD. (2021). ISSN 2657-8638. Available from: https://linkinghub.elsevier.com/retrieve/pii/S1097276520305189
Rahmani, K. et al. The effectiveness of COVID-19 vaccines in reducing the incidence, hospitalization, and mortality from COVID-19: A systematic review and meta-analysis. Front. Public Heal. 10, 873596. https://doi.org/10.3389/fpubh.2022.873596/full (2022).
Villanueva, G. et al. Efficacy and safety of COVID-19 vaccines. Chochrane Database Syst. Rev. 12, 221 (2022).
Ranzani, O. T. et al. Effectiveness of the CoronaVac vaccine in older adults during a gamma variant associated epidemic of COVID-19 in Brazil: Test negative case-control study. BMJ 374, n2015. https://doi.org/10.1136/bmj.n2015 (2021).
Lopez Bernal, J. et al. Effectiveness of the Pfizer-BioNTech and Oxford-AstraZeneca vaccines on COVID-19 related symptoms, hospital admissions, and mortality in older adults in England: Test negative case-control study. BMJ. 373, n1088. https://doi.org/10.1136/bmj.n1088 (2021).
Hyams, C. et al. Effectiveness of BNT162b2 and ChAdOx1 nCoV-19 COVID-19 vaccination at preventing hospitalizations in people aged at least 80 years: A test-negative, case-control study. Lancet Infect. Dis. 21(11), 1539–1548 (2021).
Vicente, P. & Suleman, A. COVID-19 in Europe: From outbreak to vaccination. BMC Public Health 22(1), 2245. https://doi.org/10.1186/s12889-022-14454-5 (2022).
Rosen, B., Waitzberg, R. & Israeli, A. Israel’s rapid rollout of vaccinations for COVID-19. Isr. J. Health Policy Res. 10(1), 6. https://doi.org/10.1186/s13584-021-00440-6 (2021).
Kim, J. et al. Factors associated with the difference between the incidence and case-fatality ratio of coronavirus disease 2019 by country. Sci. Rep. 11(1), 18938 (2021).
Kacperczyk, E., Ciecielska, K., Korzeniewski, K., Macioch, B., Marczak, M. & Budzyński, I. Area and population in the territorial profile in 2021. Stat Poland, Stand Regist Dep [Internet]. 2021. Available from: https://stat.gov.pl/en/topics/population/population/area-and-population-in-the-territorial-profile-in-2020,4,14.html
Woodby, B., Arnold, M. M. & Valacchi, G. SARS-CoV-2 infection, COVID-19 pathogenesis, and exposure to air pollution: What is the connection?. Ann. N. Y. Acad. Sci. 1486(1), 15–38. https://doi.org/10.1111/nyas.14512 (2021).
Travaglio, M. et al. Links between air pollution and COVID-19 in England. Environ. Pollut. 268(Pt A), 115859 (2021).
Wu, X., Nethery, R. C., Sabath, M. B., Braun, D. & Dominici, F. Air pollution and COVID-19 mortality in the United States: Strengths and limitations of an ecological regression analysis. Sci. Adv. https://doi.org/10.1126/sciadv.abd4049 (2020).
Semczuk-Kaczmarek, K. et al. Association between air pollution and COVID-19 mortality and morbidity. Intern. Emerg. Med. 17(2), 467–473. https://doi.org/10.1007/s11739-021-02834-5 (2022).
Fernandes, Q. et al. Emerging COVID-19 variants and their impact on SARS-CoV-2 diagnosis, therapeutics and vaccines. Ann. Med. 54(1), 524–540. https://doi.org/10.1080/07853890.2022.2031274 (2022).
Choi, J. Y. & Smith, D. M. SARS-CoV-2 variants of concern. Yonsei Med. J. 62(11), 961–968 (2021).
Hirabara, S. M. et al. SARS-COV-2 variants: Differences and potential of immune evasion. Front. Cell Infect. Microbiol. 11, 781429 (2021).
Telenti, A., Hodcroft, E. B. & Robertson, D. L. The evolution and biology of SARS-CoV-2 variants. Cold Spring Harb. Perspect Med. 12(5), 041390 (2022).
Koelle, K., Martin, M. A., Antia, R., Lopman, B. & Dean, N. E. The changing epidemiology of SARS-CoV-2. Science 375(6585), 1116–1121 (2022).
McLean, G. et al. The impact of evolving SARS-CoV-2 mutations and variants on COVID-19 vaccines. MBio https://doi.org/10.1128/mbio.02979-21 (2022).
Dhawan, M., Sharma, A., Priyanka, T. N., Rajkhowa, T. K. & Choudhary, O. P. Delta variant (B.1.617.2) of SARS-CoV-2: Mutations, impact, challenges and possible solutions. Hum. Vaccin. Immunother. 18(5), 2068883 (2022).
Acknowledgements
We are grateful to all the managers of the external units that provided samples for analysis; the team of the Cell and Tissue Culture Laboratory, IBCH PAS, for preparing the laboratories to work with the virus; and all the members of the Viral Support Group, a group of volunteers from IBCH PAS who supported COVID-19 diagnostics in Poland at the beginning of the pandemic. We also gratefully acknowledge all the data contributors, i.e., the authors and their originating laboratories responsible for obtaining the specimens and their submitting laboratories for generating the genetic sequence and metadata and sharing via the GISAID Initiative, on which this research is based.
Funding
This research was funded by the Ministry of Science and Higher Education in Poland under the dedicated subsidy and SPUB (5536/E-63/SPUB/2017/1), by the Marshal of the Wielkopolska under Grant Regional COVID-hub (RPWP.02.01.01-30-0001/20) from the Regional Operational Program 2014-2020, and by the internal resources of the IBCH PAS.
Author information
Authors and Affiliations
Contributions
BM analyzed the epidemiological data, extracted the viral RNA, prepared the NGS libraries, wrote the first draft of the manuscript, and prepared Figs. 1–6, Supplementary Figs. 1–3, and Supplementary Tables 1–5; MZ enriched the libraries in the SARS-CoV-2 sequences, processed the raw data sequenced in our laboratory, analyzed the data, performed subsampling and rare variant removal, created consensus sequences, generated plots for Fig. 5 and Supplementary Table 6; KN downloaded sequences from the GISAID database, cleaned redundant and messy data, performed haplotype network analysis and generated plots for Supplementary Figs. 4–6 and data for Supplementary Tables 8–12; IS and NK supported the functioning of the COVID-19 Data Portal Poland, which was a source of data analyzed in this paper; IS and KN deposited viral sequences generated in our laboratory into the COVID-19 Data Portal; JP, MR, and MMS helped with library preparation and performed sequencing; PZ consulted epidemiological data and participated in manuscript preparation; ELO and MO extracted viral RNA; KŁ, MJ, ŁM, MC and AJ collected and provided samples; KK was the founder of the COVID-19 Data Portal Poland; MF supervised the study and provided funding; LH supervised the study; coordinated sample collection, experimental work and bioinformatic analyses; prepared Figs. 7–9, Supplementary Figs. 4–6, Supplementary Tables 7–12; and wrote the final version of the manuscript. All the authors read and accepted the content of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Mirska, B., Zenczak, M., Nowis, K. et al. The landscape of the COVID-19 pandemic in Poland emerging from epidemiological and genomic data. Sci Rep 14, 14416 (2024). https://doi.org/10.1038/s41598-024-65468-5
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-024-65468-5
- Springer Nature Limited