Abstract
Escherichia coli ST1485 strains belong to the clinically important phylogroup F and have disseminated worldwide in humans, animals, and the environment. Here, we elucidated the pathogenome of a global collection of E. coli ST1485 isolates from diverse sources retrieved from public databases and a high-quality sequenced complete genome of colistin-resistant E. coli strain CFSAN061771 isolated from raw milk cheese which designated as a reference strain. CFSAN061771 belongs to O83:H42-ST1485 pathotype and carries a conjugative ColV plasmid, pCFSAN061771_01, combining extraintestinal virulence genes (ompt, sitA, iroN, etsC, traT, cvaC, hylF, iss, tsh, mchf, iucC, iutA) with a multidrug resistance island (blaTEM-1, aph(6)-Id, aph(3″)-Ib, sul2, dfrA14). Comparative genomic analysis revealed a high frequency of pCFSAN061771_01-like plasmids in E. coli ST1485. A notable evolutionary genetic event in E. coli ST1485 strains is the acquisition of a pCFSAN061771_02-like plasmid, which confers resistance to several antimicrobials, tellurium, and quaternary ammonium compounds. The identical virulence and antibiotic resistance profiles identified in some human and animal strains are worrisome. This is the first study to emphasize the significance of E. coli ST1485 as a global high-risk virulent and multidrug-resistant clone with zoonotic potential.
Similar content being viewed by others
Introduction
Escherichia coli is a commensal bacterium inhabiting the gastrointestinal tract of healthy humans and animals. Nevertheless, various strains have developed pathogenetic pathways capable of producing a variety of diseases. The pathogenic E. coli that cause diarrhea are referred to as diarrheagenic E. coli (DEC). They are classified into pathotypes based on their virulence characteristics and the symptoms they produce1,2. Escherichia coli strains termed extraintestinal pathogenic E. coli (ExPEC) exhibit a broad range of genes that encode virulence factors involved in colonization, adherence, invasion, persistence, and/or toxin production in the host, causing illnesses such as urinary tract infections, neonatal meningitis, pneumonia, and septicemia3,4. It is still unclear what differentiates ExPEC from commensal E. coli1,2,5. ExPEC strains mainly include uropathogenic E. col (UPEC), neonatal meningitis-causing E. coli (NMEC), avian pathogenic E. coli (APEC), sepsis-associated E. coli (SEPEC) and mammary pathogenic E. coli (MPEC)4,6,7.
Many ExPEC-associated virulence factors have been identified in the literature, either on conserved pathogenicity-associated islands (PAIs) or virulence plasmids6. Even though the plasmid composition of ExPEC is quite varied, virulence factors tend to be restricted to a narrow fraction of plasmid types, like ColV plasmids, which supply both virulence and fitness-associated features8. Such virulence ColV plasmids harbor a PAI that encodes several virulence components, including multiple iron acquisition and utilization systems, serum survival proteins, hemolysins, adhesins, outer membrane proteins, and autotransporters9. These virulence factors promote host colonization and invasion and dodge or impair host defensive systems causing life-threatening illnesses such as hemolytic uremic syndrome and newborn meningitis10,11. They are also quite prevalent among avian pathogenic E. coli (APEC) that cause severe respiratory and systemic disease in poultry12. Remarkably, ColV plasmids often have been discovered in specific ExPEC clonal groups from humans and food-producing animals, including ST95, ST131, ST117, and ST5813,14,15,16.
Since 2005, pan-genome analysis has been used as an efficient tool to explain the variance in genetic content among bacterial species17. The term “pan-genome” represents the total number of non-redundant genes in a particular dataset18.On the other hand, multisequence typing (MLST) has gained widespread popularity as a routine typing tool for epidemiological investigations at local and global scales19. It is noteworthy that a large proportion of antibiotic-resistant infections are caused by a small set of sequence types1,20. Indeed, the mechanism behind the remarkable worldwide spread of particular E. coli lineages is not fully understood, a situation that warrants regular screening of the genomic plasticity of E. coli isolates from different sources, particularly those emerging in animal and food of animal origin and have zoonotic importance.
Escherichia coli ST1485 strains belong to the clinically important phylogroup F and have disseminated worldwide in humans, animals, and the environment21,22,23,24. Yet to date, no genomic comparisons of E. coli ST1485 strains nor pan-genomic epidemiological analysis have been published. In this study, we elucidate a high-quality sequenced complete genome of E. coli ST1485 strain CFSAN061771, isolated from raw milk cheese in Egypt. Further, we performed comparative genomic analyses using CFSAN061771 as a reference with a global collection of E. coli ST1485 strains retrieved from public databases25,26 and clustered them based on their virulence and antibiotic resistance gene content.
Results
Basic genomic features
In silico genotyping revealed that E. coli strain CFSAN061771 belongs to ST1485, O83:H42 serogroup, and phylogroup F. The complete genome of E. coli strain CFSAN061771 comprises a chromosome of 4,908,204 bp with a G + C content of 50.6%. Further, two plasmids named pCFSAN061771_01 and pCFSAN061771_02 of 167,754 bp and 215,531 bp were detected, respectively. Plasmid replicons IncFIA, IncFIB (AP001918), and IncFIC (FII) were detected in pCFSAN061771_01, whereas IncHI2 and IncHI2A were detected in pCFSAN061771_02.
Closest relative genomes to E. coli CFSAN061771
As illustrated in Fig. 1, grapeTree was used to produce and visualize a minimum spanning tree (MST) based on comparing cgMLST allelic profiles. Escherichia coli CFSAN061771 was clustered with two unpublished strains, E. coli 119 (accession number: JABADS01) and E. coli 120 (accession numbers: JABADT01), isolated from ditch water in the Netherland, with only 32 allelic differences. Noteworthy, CFSAN061771 was also similar to two Chinese strains isolated from chicken, YH17143 (accession number: PTNO01) and YH17174 (accession number: PTMO01), with 39 and 43 allelic differences, respectively. The remaining two strains that were a part of the same grape are DF376 and M160133, that showed 44 and 90 allelic differences, respectively.
On the other hand, the closest reference and representative genomes to our strain identified by Mash/MinHash27 were YH17143, YH17174, and M160133 (accession number: CP022164) with distances of 0.00105797, 0.00115999, and 0.00216198, respectively.
Pan-genome analysis
We conducted the pan-genome analysis to investigate possible differences in gene repertoires among the 85 E. coli ST1485 used in this study. The overall pangenome consisted of 13,376 genes represented by 3998 core genes (3527 hard + 471 soft core genes) and 9378 accessory genes (1324 shell + 8054 cloud genes). Strikingly, only 12 strains carried pCFSAN061771_02 like plasmid genes, as illustrated in Fig. 2 (outlined in red). The closest match to CFSAN061771 is M160133, a strain isolated from a human patient in the United States23. Pan-genome analysis revealed that they share a core genome of 4751 genes accounting for 91.1% (4751/5215) of their pan-genome. A more extensive comparison of CFSAN061771with the M160133 strain indicated that they shared 94.03% (4367/4644) of their chromosomal genes. In terms of plasmids, pCFSAN061771_01 and pM160133 p2 shared 166 core plasmid genes, accounting for 86.91% (166/191) of their pan plasmid genes, whereas pCFSAN061771_02 and pM160133_p1 share 221 core plasmid genes accounting for 78.36% (221/282) of their pan plasmid genes. Notably, the M160133 strain has an additional plasmid, pM160133_p3, consisting of 113,428 bp, not present in CFSAN061771.
Virulome analysis
Chromosomal virulence genes
The chromosome of CFSAN061771 carries different virulence genes that are commonly involved in urinary tract infections, including yfcV (encodes the major subunit of a putative chaperone-usher fimbria), chuA (encodes a heme binding protein)28, heat-resistant agglutinin (hra) gene29, outer membrane protein (ompT) gene30, and capsular genes (kpsE, kpsMII)31. Interestingly, it also carries enteroaggregative E. coli (EAEC) virulence gene regulator (eilA), air gene that encodes enteroaggregative immunoglobulin repeat protein32, and long polar fimbriae (lpfA) gene that was detected in EAEC and other pathogenic E. coli strains33. Furthermore, it carries ompA, ibeB, ibeC, and aslA genes that encode structures critical to neonatal meningitis E. coli (NMEC) for crossing of the blood–brain barrier and subsequently invasion of brain endothelial cells34,35. It also harbors two bacteriocins, mchF and mcmA, which encode for the ABC transporter MchF and the microcin M, respectively, and are associated with antibacterial activity against closely related species36.
Plasmid virulence genes
The genetic organization of the virulence genes in pCFSAN061771_01 is presented in Fig. 3. The plasmid carries the genes encoding: colicin V (ColV operon) (cvaABC, cvi); a core region of ColV plasmids which includes three different iron uptake and utilization systems (ferric aerobactin system (iutA/iucABCD), iron and manganese ABC transport system (sitABCD), and salmochelin siderophore system (iroBCDEN)); an outer membrane protein T-encoding gene ompT; ABC transport system etsABC; the increased serum survival gene involved in complement resistance iss; and a hemolysin-encoding gene hlyF. The putative virulence region of pCFSAN061771_01 also harbors tsh gene, which encodes temperature-sensitive hemagglutinin that was confirmed to be associated with the virulence of APEC37. Additionally, different maintenance systems associated with virulence plasmids were identified, including plasmid partitioning system (parABS) and toxin-antitoxin-based addiction systems. The toxin-antitoxin-based addiction systems comprised: postsegregational killing (PSK) system, ccdA/ccdB; virulence-associated genes, vagC and vagD; and host killing gene, hok38. Further, the whole transfer (tra) region (traN, traF, traQ, traH, traG, traT, traD, traI, traX, traJ, traY, traA, traL, traE, traK, traB, traP, traR, traC, traW, and traU) that encodes for the transfer components of plasmids were detected. On the other hand, pCFSAN061771_02 did not carry any virulence genes.
The global spread of pCFSAN061771_01 and pCFSAN061771_02 like plasmids in E. coli ST1485
The alignment of contigs of 84 genomes retrieved from public databases on the pCFSAN061771_01 sequence further confirmed the presence of pCFSAN061771_01–like plasmids in most E. coli ST1485 strains, regardless of country and source of isolation (Fig. 3). On the other hand, when we used pCFSAN061771_02 as a BLAST query sequence against 84 E. coli ST1485 genome sequences, only 11 genomes were returned. These genomes were identified from: animal, LD67-1 (pLD67-1-157 kb); food (NIFDS_EC2017_2); human (M160133 (pM160133_p1), MER_90, 3468); and poultry (PP743, LREC_201, ampC_0104, 20,151,021, YH17174, YH17143) (Fig. 4).
Clustering of globally disseminated E. coli ST1485 strains based on their virulence profiles
Figure 5 shows that 85 E. coli ST1485 strains, including CFSAN061771, harbored seven or more virulence genes from a panel of 43 genes and displayed 56 virulence patterns. It is worth noting that the majority of strains from diverse sources had eight or more genes of a panel consisting of 12 virulence genes (ompt, sitA, iroN, etsC, traT, cvaC, hylF, iss, tsh, mchf, iucC, iutA) grouped in cluster 2, shown in red, which is likewise present on pCFSAN061771_01. All strains harbored at least six genes from another panel comprised of eight genes grouped in cluster 3, which was shown in green (hra, eilA, kpsE, air, yfcv, terC, chuA, IpfA). Our food strain (marked as a yellow box on the left of Fig. 5) clustered with a poultry strain (YH17143) with the same virulence pattern.
Resistome analysis
Chromosomal antibiotic resistance genes
The chromosome of CFSAN061771 possesses mutations in the DNA gyrase (gyrA, S83L) and parC (S80I), which are associated with fluoroquinolone resistance.
Plasmids' antibiotic resistance genes
pCFSAN061771_01 harbors an MDR-encoding region integrated with several mobile elements (Fig. 3). It comprises a cluster of genes encoding resistance to sulfonamide (sul2), β-lactam (blaTEM-IB), kanamycin (aph(3″)-Ib), and streptomycin (aph(6)-Id) flanked by tnpR and IS110 as well as a class 1 integron (Int191) harboring a sole gene cassette, dfrA14, encoding resistance to trimethoprim. The genetic organization of this region in our strain and pM160133_p2, a plasmid of closely related strain M160133, were compared in Fig. 6. Notably, CFSAN061771 lacks the tetracycline tetA. On the other hand, as shown in Fig. 4, pCFSAN061771_02 harbored mcr-1 gene, encoding colistin resistance with an upstream copy of ISApl1 as well as class 1 integron, In641, with gene cassettes encoding resistance to aminoglycosides (estX, aadA2, aadA2), chloramphenicol (cmlA1) and quaternary ammonium compounds (qacL). The sulfonamide resistance gene, sul3, which was found to be associated with In6439, was also detected. The genetic organization of antibiotic resistance genes and alignment of genome sequences on the pCFSAN061771_02 sequence is illustrated in Fig. 4.
Clustering of globally disseminated E. coli ST1485 strains based on their antibiotic resistance profiles
Figure 7 shows that 85 E. coli ST1485 strains harbored one or more antibiotic resistance genes from a panel of 57 genes and displayed 58 resistance patterns. Antibiotic resistance genes were clustered into 30 clusters, and those in cluster 7 (blaTEM-1, aph(6)-Id, aph(3″)-Ib, sul2, dfrA14) were found to compromise a multidrug resistance region (MDR) in pCFSAN061771_01 (Fig. 6). Our strain formed a separate cluster (cluster 26) and was shown as a yellow box on the left of Fig. 7.
Discussion
The availability of high-quality annotated complete genome sequences of potentially pathogenic E. coli from non-clinical sources provides a crucial resource for understanding the pathogenome evolution among different E. coli pathotypes. In this study, we elucidated the pathogenome of a global collection of E. coli ST1485 isolates from diverse sources retrieved from public databases and a high-quality sequenced complete genome of colistin-resistant E. coli strain CFSAN061771 isolated from raw milk cheese in Egypt which designated as a reference strain. Notably, few dominant STs are thought to make up the population of phylogroup F isolates which are characterized by a high prevalence of virulence and antibiotic resistance genes (such as ST59, ST354, ST405, and ST648)40. Here, the importance of evolving E. coli ST1485 belonging to phylogroup F as a worldwide high-risk virulent and multidrug-resistant clone is highlighted.
Pan-genome analysis with virulence and antibiotic resistance profiling revealed that CFSAN061771 was related to the urinary E. coli strain isolated from a New York patient, suggesting its zoonotic potential23. It is essential to point out that the cgMLST demonstrated a lesser resolving ability than the pan-genome analysis did when it came to grouping closely related strains that had comparable or identical virulence and antibiotic resistance characteristics. This is because the cgMLST tree was constructed using predefined core genes of the individual strains without their accessory genes41. It is worth noting that epidemiological studies that rely on publicly available genetic data are sometimes uncertain as to whether the whole dataset accurately represents the genomic and source variation of the entire population. Even though poultry made up most of our source population, we think we have observed most of the genomic diversity in ST1485 because of its large pangenome, and wide range of antibiotic resistance and virulence genes.
The identification of ColV plasmids in literature is based on Liu’s criteria which consider a strain to be ColV-positive if it possesses at least one or more genes from four or more of the six gene sets (i) cvaABC and cvi, (ii) iroBCDEN, (iii) iucABCD and iutA, (iv) etsABC, (v) ompT and hlyF, and (vi) sitABCD13. But these criteria are limited because they only take into consideration specific genes and don't consider the architecture of the plasmid backbone. Therefore, in order to test the theory that ColV plasmids were responsible for the evolution of ST1485 as a pathogen, we analyzed a global collection of 84 draft genomes for the presence of closed ColV plasmid sequence pCFSAN061771_01. ST1485 strains have acquired a diversity of ColV plasmids, and most of the ColV plasmids studied contain the complete repertoire of archetypal ColV genetic backbone. Figure 3 displays the evolutionary changes in ColV-like plasmid sequences in E. coli ST1485 strains recovered from diverse sources. Collectively, poultry strains show the highest striking differences in gene repertoire of ColV pCFSAN061771_01 like plasmids. The extraordinary consistency of ColV plasmid carriage across most of the E. coli ST1485 strains from diverse origins and geographical regions may be attributable to the presence of multiple plasmid maintenance systems that allow sustained plasmid inheritance, even in the absence of selection pressure38. The evidence points to the possibility that E. coli ST1485 might serve as a potential reservoir for disseminating this virulence and multidrug resistance plasmid.
A notable evolutionary genetic event in E. coli ST1485 is the acquisition of pCFSAN061771_02-like plasmid, which confers selective advantages for several antibiotics (aminoglycosides, chloramphenicol, sulfonamides, colistin), tellurium, and quaternary ammonium compounds. Colistin has become the last-resort antibiotic for treating lethal infections caused by multidrug-resistant Enterobacteriaceae in human medicine. The World Organisation also classified it as a Veterinary Highly Important Antimicrobial Agent (VHIA) for Animal Health42. Considering the relevance of human movement and migration in the rapid transcontinental transfer of antibiotic-resistant bacteria, the continuous availability of colistin over-the-counter in certain nations remains a public health concern.
Interestingly, the constructed phylogenetic tree based on pan-genome analysis demonstrated that 10 out of 12 strains carried a pCFSAN061771_02-like plasmid clustered in one clade, including a food strain (CFSAN061771), an animal strain (LD67-1), human strains (M160133, MER_90, 3468) and poultry strains (LREC_201, ampC_0104, PP743, YH17174, YH17143) (Fig. 2). This finding reflects a significant genetic overlap among human, animal, and poultry E. coli ST1485 strains from many regions throughout the globe, raising concerns about the zoonotic potential of E. coli ST1485. Furthermore, using pCFSAN061771_02 as a reference, a comparison of draft genomes harboring the pCFSAN061771_02-like plasmid as well as two fully sequenced plasmids (pLD67-1-157 kb from LD67-1 and pM160133_p1 from M160133)23,24, indicated backbone conservation (Fig. 5).
It is surprising how identical the virulence and antibiotic resistance profiles are in some humans (H), livestock (L), food (F), poultry (P), companions (C), and wild animals (W) (denoted as red boxes on the left annotation of Figs. 5 and 7). The insertion of MDR-encoding island in pCFSAN061771_01-like plasmids carrying extraintestinal virulence genes is especially concerning. This MDR island is rich in mobile elements and operates as a hotspot for sequential gene uptake through multiple mechanisms (Fig. 6). It seems to be an essential feature of E. coli ST1485's adaptive machinery for colonizing various niches. Our findings point toward that E. coli ST1485 can adapt to several hosts armed with either pCFSAN061771_01 or pCFSAN061771_02- like plasmids or both, which confers selective advantages for the host strain, contributing to the recent emergence of E. coli ST1485. We also detected E. coli ST1485 strains devoid of pCFSAN061771_01 and pCFSAN061771_02's virulence and antibiotic resistance genes, which might be the ancestor of E. coli ST1485 from which these hybrid genotype strains were formed.
UPEC has a variety of structural and secreted virulence elements that contribute to their ability to cause illness, including flagella, fimbriae, pili, toxins, and iron-acquisition systems. They were defined as strains positive to two or more chuA loci (heme-binding proteins), fyuA (encodes the yersiniabactin receptor), vat (encodes an autotransporter serine protease toxin), and yfcV (encodes the major subunit of a putative chaperone-usher fimbria)28. According to these criteria, all the E. coli ST1485 strains studied, including CFSAN061771, have likely the potential to be uropathogens as they carried both chuA and yfcV (Fig. 5). The emergence of the virulence gene ferric yersiniabactin uptake receptor (fyuA) in an environmental strain (20-MO00076-0) and a poultry strain (FSIS12209249) implies the propensity of E. coli ST1485 strains to acquire more virulence genes. On the other hand, the carriage of iron acquisition systems on the ColV plasmids identified in this study are crucial for UPEC survival in an iron-limited environment such as the urinary tract, providing additional virulence potential for E. coli ST1485 strains to cause urinary tract infection. Worryingly, traT gene, which was reported to be harbored exclusively on IncF plasmids and involved in increasing resistance to serum killing, was found in tra operon of pCFSAN061771_01 and pCFSAN061771_01 like plasmids. This gene was frequently detected in UPEC43 and NMEC strain from France44. Interestingly, a recent study revealed that it was also a prevalent gene in E. coli strains isolated from dairy cows with clinical mastitis45, raising the concern about the possible dissemination of this gene through consumption of contaminated dairy products. This work demonstrates that the foodborne E. coli ST1485 strain has the potential to function as a uropathogen, stressing the urgent need for more research to identify the percentage of extraintestinal illness in humans caused by E. coli ST1485 acquired through food.
In conclusion, the deciphering of the pathogenome of the emerging hybrid E. coli ST1485 strain, CFSAN061771, and screening of the virulence profiles of globally disseminated E. coli ST1485 strains from various sources, reveals the outstanding ability of this sequence type belonging to the phylogroup F to acquire virulence genes implicated in both intestinal and extraintestinal pathogenicity. Escherichia coli ST1485 is a potential reservoir of ColV plasmids, and the carriage of ColV plasmids is not generally present nor exclusive to APEC strains; rather, they have emerged in strains from various sources, including environment, animal, and human isolates. Some strains of this sequence type, including CFSAN061771, have exceptional versatility in integrating newly discovered genes such as mcr-1 and genes conferring broad antibiotic resistance. The notion that the mcr-1 gene has made its way into E. coli ST1485- a sequence type capable of colonizing different hosts- could make it hard to prevent further transmission of this element to humans. The identical virulence and antibiotic resistance profiles identified in some human and animal E. coli ST1485 strains are worrisome. They suggest that this sequence type may have zoonotic potential, posing a serious threat to public health, and should be closely monitored across the world. Finally, this study provides new insights into the extent and possibly human health implications of consuming raw milk cheese which may be a vehicle for disseminating cattle-associated hypervirulent and multidrug-resistant E. coli ST1485 to humans, causing multiple diseases, particularly urinary tract infections.
Methods
Bacterial strain
In a study conducted from December 2016 to February 2017 aimed to determine the incidence and molecular characteristics of colistin-resistant E. coli in karish cheese, a famous Egyptian raw milk cheese, we cultivated colistin-resistant E. coli strains from two hundred samples (100 g each of karish cheese). At the Department of Food Hygiene and Control, Faculty of Veterinary Medicine, University of Sadat City, Egypt, 225 ml of buffered peptone water (Oxoid, Basingstoke, England) were added to cheese samples (25 g) in a sterile plastic package. The samples were subsequently homogenized in a stomacher, and spread onto plates of eosin methylene blue agar (Oxoid, Basingstoke, England) containing 2 mg/l colistin. Then the plates were incubated for 24 h at 37 °C. Colonies grown on eosin methylene were tested by traditional biochemical tests. Strains showed typical E. coli phenotypes by the API 20E system (bioMe'rieux, Marcy l'Etoile, France) and were shipped to the Center for Food Safety and Applied Nutrition, U.S. Food and Drug Administration, College Park, Maryland, USA. Upon arriving at the FDA laboratory, the CFSAN061771 E. coli strain was chosen for sequencing.
DNA extraction and genome sequencing of CFSAN061771
The genomic DNA of strain CFSAN061771 was isolated from overnight culture using the DNeasy Blood and Tissue kit (Qiagen Inc., Valencia, CA) and sequenced on the Pacific Biosciences (PacBio) RS II sequencing platform as previously described46. The sequences were annotated using the NCBI Prokaryotic Genomes Annotation Pipeline (PGAP)47.
Publicly available sequences
Enterobase (http://enterobase.warwick.ac.uk/species/index/ecoli)25 and PATRIC (https://www.patricbrc.org/)26databases were queried for released ST1485 whole-genome sequences with available metadata for source, collection year, continent, and country. Associated metadata for these sequences is available in Supplementary Table 1.
Genomic analyses
CFSAN061771’s genome was in silico typed with regard to O:H serotypes and sequence type by the database of the Center for Genomic Epidemiology (https://cge.cbs.dtu.dk/services/) using SerotypeFinder48, and MLSTtyper49. BacWGSTdb (http://bacdb.cn/BacWGSTdb/50was used to determine the closest relative strains with the function “Single genome analysis” and a core genome MLST (cgMLST) allele threshold based on 500 alleles. The “Similar genome finder” service in PATRIC (https://www.patricbrc.org/)26 was used to find the closest reference and representative genomes to our strain based on Mash distance27.
Pan-genome analysis
The genome of CFSAN061771 and a global collection of 84 genomes were annotated using Prokka51, and the generated gff3 files were used as input for pangenome identification with Roary52, which groups full-length genes into core (hard and soft core) and accessory (shell and cloud) genomes. Hard-core genes are present in > 99 percent of genomes, whereas soft-core genes are found in 95–99 percent of genomes. Shell and cloud genes exit in 15–95% and less than 15% of genomes, respectively. This analysis yielded a matrix of core and accessory gene presence/absence. The phylogenetic tree was generated using FastTree 253 based on the presence or absence of accessory genes. The matrix of presence/absence genes was then compared to the phylogenetic tree and visualized using Phandango54.
Screening for the presence of similar sequences to CFSAN061771’s plasmids
E. coli ST1485 genomes retrieved from public databases25,26 were investigated for the presence of similar sequences to pCFSAN061771_01 and pCFSAN061771_02 using nucleotide-nucleotide BLAST55. Similar sequences were visualized with Blast Ring Image Generator (BRIG)56.
Analysis of plasmid replicons, virulome, resistome, and mobilome
The presence of antibiotic resistance and virulence determinants and plasmid replicons were screened using ABRicate v0.8.1057 (https://github.com/tseemann/ABRicate) with the ResFinder database58, Virulence Factor Database (VFDB)59, and PlasmidFinder database60 (cutoffs, identity, and coverage of > 90%). Chromosomal mutations associated with antimicrobial resistance genes were studied using PointFinder available at https://cge.cbs.dtu.dk/services/ResFinder/58. ISfinder61 was used to identify insertion sequences and transposons bracketing antibiotic resistance and virulence genes. Additionally, we screened 84 strains of a global collection of E. coli ST1485 for their virulence and antibiotic resistance gene content by using BLASTN against previously described reference genes in VirulenceFinder and ResFinder databases6,58. ComplexHeatmap (v2.6.2) R package62 was used to plot a summary heatmap for the presence or absence of virulence and antibiotic resistance genes.
Data availability
All data generated or analyzed during this study are included in this published article and in the Supplementary Table 1. The complete nucleotide sequences of E. coli strain CFSAN061771 chromosome pCFSAN061771_01 and pCFSAN061771_02 have been deposited into GenBank under the accession numbers of NZ_CP042896, NZ_CP042897, and NZ_CP042898, respectively.
References
Manges, A. R. et al. Global extraintestinal pathogenic Escherichia coli (ExPEC) lineages. Clin. Microbiol. Rev. https://doi.org/10.1128/CMR.00135-18 (2019).
Riley, L. W. Distinguishing pathovars from nonpathovars: Escherichia coli. Microbiol. Spectr. https://doi.org/10.1128/microbiolspec.AME-0014-2020 (2020).
Biran, D. & Ron, E. Z. Extraintestinal pathogenic Escherichia coli. In Escherichia coli, a Versatile Pathogen (eds Frankel, G. & Ron, E. Z.) 149–161 (Springer International Publishing, 2018).
Sora, V. M. et al. Extraintestinal pathogenic Escherichia coli: Virulence factors and antibiotic resistance. Pathogens https://doi.org/10.3390/pathogens10111355 (2021).
Mitchell, N. M., Johnson, J. R., Johnston, B., Curtiss, R. & Mellataa, M. Zoonotic potential of Escherichia coli isolates from retail chicken meat products and eggs. Appl. Environ. Microb. 81, 1177–1187. https://doi.org/10.1128/Aem.03524-14 (2015).
Malberg Tetzschner, A. M., Johnson, J. R., Johnston, B. D., Lund, O. & Scheutz, F. In silico genotyping of Escherichia coli Isolates for extraintestinal virulence genes by use of whole-genome sequencing data. J. Clin. Microbiol. https://doi.org/10.1128/JCM.01269-20 (2020).
Jung, D. Y. et al. Comparative genomic analysis of Escherichia coli isolates from cases of bovine clinical mastitis identifies nine specific pathotype marker genes. Microb. Genom. 7, 000597. https://doi.org/10.1099/mgen.0.000597 (2021).
Johnson, T. J. Role of plasmids in the ecology and evolution of High-Risk extraintestinal pathogenic Escherichia coli clones. EcoSal Plus https://doi.org/10.1128/ecosalplus.ESP-0013-2020 (2021).
Johnson, T. J., Siek, K. E., Johnson, S. J. & Nolan, L. K. DNA sequence of a ColV plasmid and prevalence of selected plasmid-encoded virulence genes among avian Escherichia coli strains. J. Bacteriol. 188, 745–758. https://doi.org/10.1128/Jb.188.2.745-758.2006 (2006).
Lemaitre, C. et al. A conserved virulence plasmidic region contributes to the virulence of the multiresistant Escherichia coli meningitis strain S286 belonging to phylogenetic group C. Plos One 8, e74423. https://doi.org/10.1371/journal.pone.0074423 (2013).
Aguero, M. E., Delafuente, G., Vivaldi, E. & Cabello, F. ColV increases the virulence of Escherichia-Coli K1 strains in animal-models of neonatal meningitis and urinary-infection. Med. Microbiol. Immun. 178, 211–216. https://doi.org/10.1007/Bf00202554 (1989).
Apostolakos, I., Laconi, A., Mughini-Gras, L., Yapicier, O. S. & Piccirillo, A. Occurrence of colibacillosis in broilers and its relationship with avian pathogenic Escherichia coli (APEC) population structure and molecular characteristics. Front. Vet. Sci. 8, 737720. https://doi.org/10.3389/fvets.2021.737720 (2021).
Liu, C. M. et al. Escherichia coli ST131-H22 as a foodborne uropathogen. Mbio 9, e00470-18. https://doi.org/10.1128/mBio.00470-18 (2018).
Cummins, M. L., Reid, C. J. & Djordjevic, S. P. F plasmid lineages in Escherichia coli ST95: Implications for host range, antibiotic resistance, and zoonoses (vol 7, e01212-21, 2022). Msystems 7, e00210-22. https://doi.org/10.1128/msystems.00210-22 (2022).
Mehat, J. W., van Vliet, A. H. M. & La Ragione, R. M. The avian pathogenic Escherichia coli (APEC) pathotype is comprised of multiple distinct, independent genotypes. Avian Pathol. 50, 402–416. https://doi.org/10.1080/03079457.2021.1915960 (2021).
Reid, C. J. et al. A role for ColV plasmids in the evolution of pathogenic Escherichia coli ST58. Nat. Commun. 13, 1–15. https://doi.org/10.1038/s41467-022-28342-4 (2022).
Tettelin, H. et al. Genome analysis of multiple pathogenic isolates of Streptococcus agalactiae: Implications for the microbial “pan-genome”. Proc. Natl. Acad. Sci. U S A 102, 13950–13955. https://doi.org/10.1073/pnas.0506758102 (2005).
Guimaraes, L. C. et al. Inside the pan-genome: Methods and software overview. Curr. Genom. 16, 245–252. https://doi.org/10.2174/1389202916666150423002311 (2015).
Perez-Losada, M., Arenas, M. & Castro-Nallar, E. Multilocus sequence typing of pathogens: methods, analyses, and applications. In Genetics and Evolution of Infectious Diseases (Second Edition) (ed. Tibayrenc, M.) 383–404 (Elsevier, 2017).
Duan, Y. et al. Antibiotic resistance and virulence of extraintestinal pathogenic Escherichia coli (ExPEC) vary according to molecular types. Front. Microbiol. 11, 598305. https://doi.org/10.3389/fmicb.2020.598305 (2020).
Deshpande, L. M., Hubler, C., Davis, A. P. & Castanheira, M. Updated prevalence of mcr-like genes among Escherichia coli and Klebsiella pneumoniae in the SENTRY program and characterization of mcr-1.11 variant. Antimicrob. Agents Chemother. https://doi.org/10.1128/AAC.02450-18 (2019).
Lu, Q. et al. Genetic diversity and multidrug resistance of phylogenic groups B2 and D in InPEC and ExPEC isolated from chickens in central China. BMC Microbiol. 22, 60. https://doi.org/10.1186/s12866-022-02469-2 (2022).
Gilrane, V. L. et al. Complete genome sequence of a colistin-resistant Escherichia coli strain harboring mcr-1 on an IncHI2 plasmid in the United States. Genome Announc. https://doi.org/10.1128/genomeA.01095-17 (2017).
Lu, X. et al. Widespread prevalence of plasmid-mediated colistin resistance gene mcr-1 in Escherichia coli from Pere David’s deer in China. mSphere https://doi.org/10.1128/mSphere.01221-20 (2020).
Zhou, Z. M. et al. The EnteroBase user’s guide, with case studies on Salmonella transmissions, Yersinia pestis phylogeny, and Escherichia core genomic diversity. Genome Res. 30, 138–152. https://doi.org/10.1101/gr.251678.119 (2020).
Wattam, A. R. et al. Improvements to PATRIC, the all-bacterial bioinformatics database and analysis resource center. Nucleic Acids Res. 45, D535–D542. https://doi.org/10.1093/nar/gkw1017 (2017).
Ondov, B. D. et al. Mash: Fast genome and metagenome distance estimation using MinHash. Genome Biol. 17, 132. https://doi.org/10.1186/s13059-016-0997-x (2016).
Spurbeck, R. R. et al. Escherichia coli isolates that carry vat, fyuA, chuA, and yfcV efficiently colonize the urinary tract. Infect. Immun. 80, 4115–4122. https://doi.org/10.1128/IAI.00752-12 (2012).
Srinivasan, U., Foxman, B. & Marrs, C. F. Identification of a gene encoding heat-resistant agglutinin in Escherichia coli as a putative virulence factor in urinary tract infection. J. Clin. Microbiol. 41, 285–289. https://doi.org/10.1128/JCM.41.1.285-289.2003 (2003).
He, X. L. et al. Role of uropathogenic Escherichia coli outer membrane protein T in pathogenesis of urinary tract infection. Pathog. Dis. https://doi.org/10.1093/femspd/ftv006 (2015).
Buckles, E. L. et al. Role of the K2 capsule in Escherichia coli urinary tract infection and serum resistance. J. Infect. Dis. 199, 1689–1697. https://doi.org/10.1086/598524 (2009).
Sheikh, J. et al. EilA, a HilA-like regulator in enteroaggregative Escherichia coli. Mol. Microbiol. 61, 338–350. https://doi.org/10.1111/j.1365-2958.2006.05234.x (2006).
Toma, C., Higa, N., Iyoda, S., Rivas, M. & Iwanaga, M. The long polar fimbriae genes identified in Shiga toxin-producing Escherichia coli are present in other diarrheagenic E. coli and in the standard E. coli collection of reference (ECOR) strains. Res. Microbiol. 157, 153–161. https://doi.org/10.1016/j.resmic.2005.06.009 (2006).
Wang, S. et al. IbeB is involved in the invasion and pathogenicity of avian pathogenic Escherichia coli. Vet. Microbiol. 159, 411–419. https://doi.org/10.1016/j.vetmic.2012.04.015 (2012).
Kim, K. S. Escherichia coli translocation at the blood-brain barrier. Infect. Immun. 69, 5217–5222. https://doi.org/10.1128/IAI.69.9.5217-5222.2001 (2001).
Massip, C. & Oswald, E. Siderophore-microcins in Escherichia coli: Determinants of digestive colonization, the first step toward virulence. Front. Cell Infect. Microbiol. 10, 381. https://doi.org/10.3389/fcimb.2020.00381 (2020).
Dozois, C. M. et al. Relationship between the Tsh autotransporter and pathogenicity of avian Escherichia coli and localization and analysis of the Tsh genetic region. Infect. Immun. 68, 4145–4154. https://doi.org/10.1128/IAI.68.7.4145-4154.2000 (2000).
Sengupta, M. & Austin, S. Prevalence and significance of plasmid maintenance functions in the virulence plasmids of pathogenic bacteria. Infect. Immun. 79, 2502–2509. https://doi.org/10.1128/IAI.00127-11 (2011).
Zelendova, M. et al. Characterization of the complete nucleotide sequences of mcr-1-encoding plasmids from enterobacterales isolates in retailed raw meat products from the Czech Republic. Front. Microbiol. 11, 604067. https://doi.org/10.3389/fmicb.2020.604067 (2020).
Zhuge, X. K. et al. Chicken-source Escherichia coli within phylogroup F shares virulence genotypes and is closely related to extraintestinal pathogenic E. coli causing human infections. Transbound. Emerg. Dis. 68, 880–895. https://doi.org/10.1111/tbed.13755 (2021).
Ruan, Z. & Feng, Y. BacWGSTdb, a database for genotyping and source tracking bacterial pathogens. Nucleic Acids Res. 44, D682-687. https://doi.org/10.1093/nar/gkv1004 (2016).
OIE. List of antimicrobial agents of veterinary importance. Retrieved from https://www.oie.int/fileadmin/Home/eng/Our_scientific_expertise/docs/pdf/AMR/A_OIE_List_antimicrobials_July2019.pdf (2019).
Rezatofighi, S. E., Mirzarazi, M. & Salehi, M. Virulence genes and phylogenetic groups of uropathogenic Escherichia coli isolates from patients with urinary tract infection and uninfected control subjects: a case-control study. BMC Infect. Dis. 21, 361. https://doi.org/10.1186/s12879-021-06036-4 (2021).
Peigne, C. et al. The plasmid of Escherichia coli strain S88 (O45:K1:H7) that causes neonatal meningitis is closely related to avian pathogenic E. coli plasmids and is associated with high-level bacteremia in a neonatal rat meningitis model. Infect. Immun. 77, 2272–2284. https://doi.org/10.1128/IAI.01333-08 (2009).
Guerra, S. T. et al. Investigation of extra-intestinal pathogenic Escherichia coli virulence genes, bacterial motility, and multidrug resistance pattern of strains isolated from dairy cows with different severity scores of clinical mastitis. J. Dairy Sci. 103, 3606–3614. https://doi.org/10.3168/jds.2019-17477 (2020).
Yao, K. et al. Complete genome and methylome sequences of Salmonella enterica subsp. enterica Serovar Panama (ATCC 7378) and Salmonella enterica subsp. enterica Serovar Sloterdijk (ATCC 15791). Genome Announc. https://doi.org/10.1128/genomeA.00133-16 (2016).
Tatusova, T. et al. NCBI prokaryotic genome annotation pipeline. Nucleic Acids Res. 44, 6614–6624. https://doi.org/10.1093/nar/gkw569 (2016).
Joensen, K. G., Tetzschner, A. M., Iguchi, A., Aarestrup, F. M. & Scheutz, F. Rapid and easy in silico serotyping of Escherichia coli iIsolates by use of whole-genome sequencing data. J. Clin. Microbiol. 53, 2410–2426. https://doi.org/10.1128/JCM.00008-15 (2015).
Larsen, M. V. et al. Multilocus sequence typing of total-genome-sequenced bacteria. J. Clin. Microbiol. 50, 1355–1361. https://doi.org/10.1128/JCM.06094-11 (2012).
Feng, Y., Zou, S., Chen, H., Yu, Y. & Ruan, Z. BacWGSTdb 2.0: A one-stop repository for bacterial whole-genome sequence typing and source tracking. Nucleic Acids Res. 49, D644–D650. https://doi.org/10.1093/nar/gkaa821 (2021).
Seemann, T. Prokka: Rapid prokaryotic genome annotation. Bioinformatics 30, 2068–2069. https://doi.org/10.1093/bioinformatics/btu153 (2014).
Page, A. J. et al. Roary: Rapid large-scale prokaryote pan genome analysis. Bioinformatics 31, 3691–3693. https://doi.org/10.1093/bioinformatics/btv421 (2015).
Price, M. N., Dehal, P. S. & Arkin, A. P. FastTree 2-approximately maximum-likelihood trees for large alignments. Plos One 5, e9490. https://doi.org/10.1371/journal.pone.0009490 (2010).
Hadfield, J. et al. Phandango: An interactive viewer for bacterial population genomics. Bioinformatics 34, 292–293. https://doi.org/10.1093/bioinformatics/btx610 (2018).
Camacho, C. et al. BLAST+: Architecture and applications. BMC Bioinf. 10, 421. https://doi.org/10.1186/1471-2105-10-421 (2009).
Alikhan, N. F., Petty, N. K., Ben Zakour, N. L. & Beatson, S. A. BLAST ring Image generator (BRIG): Simple prokaryote genome comparisons. BMC Genom. 12, 402. https://doi.org/10.1186/1471-2164-12-402 (2011).
Seemann, T. ABRicate, https://github.com/tseemann/abricate. 2016).
Zankari, E. et al. Identification of acquired antimicrobial resistance genes. J. Antimicrob. Chemother. 67, 2640–2644. https://doi.org/10.1093/jac/dks261 (2012).
Chen, L. et al. VFDB: A reference database for bacterial virulence factors. Nucleic Acids Res. 33, D325-328. https://doi.org/10.1093/nar/gki008 (2005).
Carattoli, A. et al. In silico detection and typing of plasmids using PlasmidFinder and plasmid multilocus sequence typing. Antimicrob. Agents Chemother. 58, 3895–3903. https://doi.org/10.1128/AAC.02412-14 (2014).
Siguier, P., Perochon, J., Lestrade, L., Mahillon, J. & Chandler, M. ISfinder: The reference centre for bacterial insertion sequences. Nucleic Acids Res. 34, D32-36. https://doi.org/10.1093/nar/gkj014 (2006).
Gu, Z., Eils, R. & Schlesner, M. Complex heatmaps reveal patterns and correlations in multidimensional genomic data. Bioinformatics 32, 2847–2849. https://doi.org/10.1093/bioinformatics/btw313 (2016).
Funding
The contributions of SSKK, AAG and ME to this research were supported by the National Institute of General Medical Sciences of the National Institutes of Health under Award Numbers [SC1GM135110] and the US Department of Homeland Security [2014-ST-062-000058] to ME.
Author information
Authors and Affiliations
Contributions
Conceptualization, M.H., A.M.H; Methodology, M.H., A.M.H., and N.G.E.; Bioinformatic analysis, A.M.H., N.G.E., A.E., N.H.A, and A.A.G.; Wet lab experiment, A.M.H., A.E., and N.H.A.; Resources, M.H., N.G.E., and M.E.; Data curation, A.M.H., A.E., and N.H.A.; Writing—original draft preparation, A.M.H.; Writing—review and editing, M.H., M.E., N.H.A., and S.S.K.K.; Project administration, M.H.; Funding acquisition, M.E. All authors have read and agreed to the published version of the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Hammad, A.M., Gonzalez-Escalona, N., El Tahan, A. et al. Pathogenome comparison and global phylogeny of Escherichia coli ST1485 strains. Sci Rep 12, 18495 (2022). https://doi.org/10.1038/s41598-022-20342-0
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-022-20342-0
- Springer Nature Limited